BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254780231|ref|YP_003064644.1| hypothetical protein
CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62]
         (98 letters)

Database: nr 
           13,984,884 sequences; 4,792,584,752 total letters

Searching..................................................done



>gi|159185312|ref|NP_530540.1| hypothetical protein Atu2660 [Agrobacterium tumefaciens str. C58]
 gi|17741163|gb|AAL43641.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58]
          Length = 183

 Score =  114 bits (287), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 35/93 (37%), Positives = 55/93 (59%), Gaps = 2/93 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL PN  +  I  +E   D     H+K +V+A P+ GKANKA++ +LAKKL L K
Sbjct: 92  VRLSVRLTPNGGRDAIDGVEQDADG--NAHLKARVSAVPEGGKANKALIVLLAKKLGLPK 149

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           SS+  +S +++  KI+ ID D ++  +L +   
Sbjct: 150 SSITFISGETARKKILRIDTDPEDFEKLFKKLA 182


>gi|116254232|ref|YP_770070.1| hypothetical protein RL4503 [Rhizobium leguminosarum bv. viciae
          3841]
 gi|166227262|sp|Q1MAP9|Y4503_RHIL3 RecName: Full=UPF0235 protein RL4503
 gi|115258880|emb|CAK09988.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
          viciae 3841]
          Length = 103

 Score =  109 bits (272), Expect = 2e-22,   Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 56/89 (62%), Gaps = 2/89 (2%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + VRL PN  +  +  +E   D      +K +VTA P+KGKANKA++ ++AK L + KS
Sbjct: 13 RLAVRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALMLLIAKSLRIPKS 70

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          S+ ++S +++  KI+ ID D +++ + L+
Sbjct: 71 SVSLVSGETARKKILRIDGDPEDLVKKLE 99


>gi|190893761|ref|YP_001980303.1| hypothetical protein RHECIAT_CH0004196 [Rhizobium etli CIAT 652]
 gi|226706142|sp|B3PQB3|Y4196_RHIE6 RecName: Full=UPF0235 protein RHECIAT_CH0004196
 gi|190699040|gb|ACE93125.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
          Length = 103

 Score =  108 bits (270), Expect = 3e-22,   Method: Composition-based stats.
 Identities = 30/92 (32%), Positives = 58/92 (63%), Gaps = 2/92 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + VRL PN  +     +E   +     ++K +VTA P+KGKANKA++A+++K L ++KS
Sbjct: 13  RLTVRLTPNGGRDAFDGIETGSEGET--YLKARVTAIPEKGKANKALIALVSKSLGVAKS 70

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           S+ ++S +++  KI+ I+ D +++ + L+   
Sbjct: 71  SITLVSGETARKKILRIEGDPEDLAKKLETLS 102


>gi|29839726|sp|Q8UC38|Y2660_AGRT5 RecName: Full=UPF0235 protein Atu2660
          Length = 112

 Score =  107 bits (267), Expect = 7e-22,   Method: Composition-based stats.
 Identities = 35/93 (37%), Positives = 55/93 (59%), Gaps = 2/93 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL PN  +  I  +E   D     H+K +V+A P+ GKANKA++ +LAKKL L K
Sbjct: 21  VRLSVRLTPNGGRDAIDGVEQDADG--NAHLKARVSAVPEGGKANKALIVLLAKKLGLPK 78

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           SS+  +S +++  KI+ ID D ++  +L +   
Sbjct: 79  SSITFISGETARKKILRIDTDPEDFEKLFKKLA 111


>gi|241206712|ref|YP_002977808.1| hypothetical protein Rleg_4028 [Rhizobium leguminosarum bv.
          trifolii WSM1325]
 gi|240860602|gb|ACS58269.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv.
          trifolii WSM1325]
          Length = 103

 Score =  106 bits (266), Expect = 8e-22,   Method: Composition-based stats.
 Identities = 31/89 (34%), Positives = 55/89 (61%), Gaps = 2/89 (2%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + VRL PN  +  +  +E   D      +K +VTA P+KGKANKA++ ++A+ L + KS
Sbjct: 13 RLAVRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALILLIAQSLRIPKS 70

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          S+ ++S  ++  KI+ ID D +++ + L+
Sbjct: 71 SVSLISGDTARKKILRIDGDPEDLVKKLE 99


>gi|327188872|gb|EGE56064.1| hypothetical protein RHECNPAF_750023 [Rhizobium etli CNPAF512]
          Length = 103

 Score =  106 bits (266), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 30/92 (32%), Positives = 59/92 (64%), Gaps = 2/92 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + VRL PN  +     +E   D+    ++K +VTA P+KGKANKA++A+++K + ++KS
Sbjct: 13  RLTVRLTPNGGRDAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSKSVGVAKS 70

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           S+ ++S +++  KI+ I+ D +++ + L+   
Sbjct: 71  SITLVSGETARKKILRIEGDPEDLAKKLETLS 102


>gi|218680333|ref|ZP_03528230.1| hypothetical protein RetlC8_16135 [Rhizobium etli CIAT 894]
          Length = 103

 Score =  106 bits (266), Expect = 9e-22,   Method: Composition-based stats.
 Identities = 30/91 (32%), Positives = 56/91 (61%), Gaps = 2/91 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + VRL PN  +  I  +E   D      ++ +VT+ P+KGKANKA++ ++A+ L + KS
Sbjct: 13  RLAVRLTPNGGRDAIDGIE--ADGEGETFLRARVTSVPEKGKANKALILLVAQSLRIPKS 70

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           S+ ++S +++  KI+ ID D +++ + L+  
Sbjct: 71  SISLVSGETARKKILRIDGDPEDLAKKLETL 101


>gi|237807736|ref|YP_002892176.1| hypothetical protein Tola_0962 [Tolumonas auensis DSM 9187]
 gi|259710178|sp|C4LCM6|Y962_TOLAT RecName: Full=UPF0235 protein Tola_0962
 gi|237499997|gb|ACQ92590.1| protein of unknown function DUF167 [Tolumonas auensis DSM 9187]
          Length = 96

 Score =  106 bits (265), Expect = 1e-21,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 8/91 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + P A +  I              +KI +TA P  G+AN  ++  LAK+  ++KS 
Sbjct: 13 LDVYIQPKASRDQIQGWH-------GEELKIAITAPPVDGQANAHLIKFLAKQFKVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
          + +   +    K + I    +++  +L  + 
Sbjct: 66 IVIHKGELGRHKTVRIT-SPQQLPAILDQSA 95


>gi|325294029|ref|YP_004279893.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3]
 gi|325061882|gb|ADY65573.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3]
          Length = 112

 Score =  105 bits (263), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 2/93 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL PN  +  I  +E   D     H+K +V+A P+ GKANKA++ +LAKK  L K
Sbjct: 21  IRLSVRLTPNGGRDAIDGVEQDADG--NAHLKARVSAVPEGGKANKALVILLAKKFGLPK 78

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           S +  +S +++  KI+ ID D ++   L +  +
Sbjct: 79  SPITFISGETARKKILRIDTDPEDFETLFRKLE 111


>gi|222150063|ref|YP_002551020.1| hypothetical protein Avi_4153 [Agrobacterium vitis S4]
 gi|221737045|gb|ACM38008.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 104

 Score =  105 bits (263), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 31/94 (32%), Positives = 61/94 (64%), Gaps = 2/94 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL PN  + GI  +++  +     H+K++V+  P+KG+ANKA++A+LAK+L ++K
Sbjct: 12  IRLAVRLTPNGGRDGIDGVDVNANGE--AHLKVRVSDVPEKGRANKALIALLAKRLGVAK 69

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
           S++ ++S  ++  KI+ ID D +++   L+   +
Sbjct: 70  SAVSLISGDAARQKILRIDGDPEDLIGRLETITA 103


>gi|209551279|ref|YP_002283196.1| hypothetical protein Rleg2_3707 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|226705832|sp|B5ZTD4|Y3707_RHILW RecName: Full=UPF0235 protein Rleg2_3707
 gi|209537035|gb|ACI56970.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
          Length = 103

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 31/92 (33%), Positives = 55/92 (59%), Gaps = 2/92 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +RL PN  +  I   E   D     ++K +VT  P+KGKANKA++ ++AK L ++K
Sbjct: 12  VRLAIRLTPNGGRDAIDGAET--DGEGEAYLKTRVTTVPEKGKANKALILLIAKSLGIAK 69

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           SS+ ++S  ++  KI+ ID D +++ + L+  
Sbjct: 70  SSVSLVSGDTARKKILRIDGDPEDLGKKLETL 101


>gi|222087485|ref|YP_002546022.1| hypothetical protein Arad_4365 [Agrobacterium radiobacter K84]
 gi|221724933|gb|ACM28089.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
          Length = 104

 Score =  105 bits (262), Expect = 3e-21,   Method: Composition-based stats.
 Identities = 34/92 (36%), Positives = 58/92 (63%), Gaps = 2/92 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL PN  +  +  +E   D     ++K +V+A P+KGKANKA++A+LAK+L++ K
Sbjct: 12  VRLSVRLTPNGGRDAVDGIETGADGE--AYLKARVSAVPEKGKANKALIALLAKRLSIPK 69

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           SSL ++S  ++  KI+ ID D +++   L+  
Sbjct: 70  SSLSLISGDTARKKILRIDGDPEDLIGRLKAI 101


>gi|86359493|ref|YP_471385.1| hypothetical protein RHE_CH03912 [Rhizobium etli CFN 42]
 gi|123510540|sp|Q2K3C8|Y3912_RHIEC RecName: Full=UPF0235 protein RHE_CH03912
 gi|86283595|gb|ABC92658.1| hypothetical conserved protein [Rhizobium etli CFN 42]
          Length = 112

 Score =  104 bits (261), Expect = 4e-21,   Method: Composition-based stats.
 Identities = 28/89 (31%), Positives = 57/89 (64%), Gaps = 2/89 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + VRL PN  +     ++   D+    ++  +VTA P+KGKANKA++A+++K + ++KS
Sbjct: 22  RLTVRLTPNGGRDAFDGIDT--DSEGETYLGARVTAVPEKGKANKALIALVSKSVGVAKS 79

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           S+ ++S +++  KI+ I+ D +++   L+
Sbjct: 80  SVSVISGETARKKILRIEGDPEDLARKLE 108


>gi|227823716|ref|YP_002827689.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234]
 gi|227342718|gb|ACP26936.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234]
          Length = 105

 Score =  104 bits (260), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 2/91 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + VRL PN  +  I   EI  D     H+K++V A P+KGKAN A++ +LAK   L+K+
Sbjct: 14  RLTVRLTPNGGRDAIDGFEIAADGE--EHLKVRVRAVPEKGKANDALIGLLAKAFGLAKN 71

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            + ++S  +   KI+ I+ D + I + L   
Sbjct: 72  RIALVSGDTQRKKILRIEADPEAIQKRLTEI 102


>gi|116751493|ref|YP_848180.1| hypothetical protein Sfum_4080 [Syntrophobacter fumaroxidans
          MPOB]
 gi|116700557|gb|ABK19745.1| protein of unknown function DUF167 [Syntrophobacter fumaroxidans
          MPOB]
          Length = 105

 Score =  104 bits (260), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 7/87 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V L P A K+  A +           +KI++TA P +G+ANK  +  LA    +S+S 
Sbjct: 19 LDVYLQPRASKNEWAGMHQ-------GCLKIRLTAPPVEGEANKECVKFLAGAFGVSRSD 71

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + ++    S  K I I     EI   +
Sbjct: 72 VEIIRGHKSRRKTILIRNSTPEILRAV 98


>gi|304313340|ref|YP_003812938.1| hypothetical protein HDN1F_37330 [gamma proteobacterium HdN1]
 gi|301799073|emb|CBL47316.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
          Length = 108

 Score =  104 bits (259), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 20/88 (22%), Positives = 38/88 (43%), Gaps = 8/88 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P A   G               +KI++TA P  G+AN  ++  LAK   + +  
Sbjct: 18 LHCYLQPRAANDGFVG-------EHGGRLKIRITAPPVDGQANAHLIRFLAKAFGVPQQQ 70

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          +++   ++   K I I +   +I + L+
Sbjct: 71 VQIEQGETGRSKRIRI-RTPSKIPQELR 97


>gi|218461695|ref|ZP_03501786.1| hypothetical protein RetlK5_20376 [Rhizobium etli Kim 5]
          Length = 103

 Score =  103 bits (258), Expect = 7e-21,   Method: Composition-based stats.
 Identities = 29/92 (31%), Positives = 59/92 (64%), Gaps = 2/92 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + VRL PN  +     +E   D+    ++K +VTA P+KGKANKA++A++++ + ++KS
Sbjct: 13  RLTVRLTPNGGRGAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSQSVGVAKS 70

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           S+ ++S +++  KI+ I+ D +++ + L+   
Sbjct: 71  SVSLVSGETARKKILRIEGDPEDLAQKLEKLS 102


>gi|292493759|ref|YP_003529198.1| hypothetical protein Nhal_3796 [Nitrosococcus halophilus Nc4]
 gi|291582354|gb|ADE16811.1| protein of unknown function DUF167 [Nitrosococcus halophilus Nc4]
          Length = 102

 Score =  103 bits (258), Expect = 8e-21,   Method: Composition-based stats.
 Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 7/84 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +RL P A    I              +K+++TA P +GKAN  ++  LAK   +SKS 
Sbjct: 15 IQIRLQPRASCDEIIG-------PHGDRLKVRITAPPVEGKANADLIRFLAKTFRVSKSQ 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87
          +R+LS  +   K + I+K  K + 
Sbjct: 68 VRLLSGATGRDKRVCIEKPAKLLP 91


>gi|189423484|ref|YP_001950661.1| hypothetical protein Glov_0413 [Geobacter lovleyi SZ]
 gi|189419743|gb|ACD94141.1| protein of unknown function DUF167 [Geobacter lovleyi SZ]
          Length = 101

 Score =  103 bits (257), Expect = 9e-21,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 7/87 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + P A ++    +           +K+++T+ P  G AN+     LAK+L + KS+
Sbjct: 18 LRVFVQPRASRNQFCGIH-------EGELKLRLTSPPVDGAANECCREFLAKQLKVPKSA 70

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + ++S  SS  K + I     +  E L
Sbjct: 71 VTLISGDSSRHKRLRIAGATTQQIEQL 97


>gi|188581339|ref|YP_001924784.1| hypothetical protein Mpop_2087 [Methylobacterium populi BJ001]
 gi|259646581|sp|B1ZLG8|Y2087_METPB RecName: Full=UPF0235 protein Mpop_2087
 gi|179344837|gb|ACB80249.1| protein of unknown function DUF167 [Methylobacterium populi
          BJ001]
          Length = 110

 Score =  103 bits (257), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 24/86 (27%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRL P A ++G+  +    D      + ++V A P +G AN A+ A +AK L L KS 
Sbjct: 15 LAVRLTPRAGRTGLDGVRTEPDGRP--ILCLRVAAPPVEGAANAALTAFVAKSLGLRKSE 72

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + ++S +++  K +++  D + +   
Sbjct: 73 VTLVSGETARTKRLHLSGDPQALAAR 98


>gi|146309167|ref|YP_001189632.1| hypothetical protein Pmen_4153 [Pseudomonas mendocina ymp]
 gi|205829317|sp|A4XZY4|Y4153_PSEMY RecName: Full=UPF0235 protein Pmen_4153
 gi|145577368|gb|ABP86900.1| protein of unknown function DUF167 [Pseudomonas mendocina ymp]
          Length = 98

 Score =  103 bits (257), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P A K   A L           +KI++TA P +GKAN  +LA LAK   ++K+ 
Sbjct: 13 LDCHLQPKASKDEFAGLH-------GERLKIRLTAPPVEGKANAHLLAFLAKAFGVAKAQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + + S + +  K + I    
Sbjct: 66 VSLESGELNRHKRLRIHAPQ 85


>gi|113969534|ref|YP_733327.1| hypothetical protein Shewmr4_1190 [Shewanella sp. MR-4]
 gi|123130613|sp|Q0HKZ7|Y1190_SHESM RecName: Full=UPF0235 protein Shewmr4_1190
 gi|113884218|gb|ABI38270.1| conserved hypothetical protein [Shewanella sp. MR-4]
          Length = 96

 Score =  103 bits (257), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +L  +    K + I        EI  LL+
Sbjct: 66 VHILKGELGRHKQVRISAPKNVPAEIATLLE 96


>gi|167630164|ref|YP_001680663.1| conserved hypothetical protein, uncharacterized acr, yggu family
          [Heliobacterium modesticaldum Ice1]
 gi|259646567|sp|B0TGP1|Y2027_HELMI RecName: Full=UPF0235 protein Helmi_20270
 gi|167592904|gb|ABZ84652.1| conserved hypothetical protein, uncharacterized acr, yggu family
          [Heliobacterium modesticaldum Ice1]
          Length = 96

 Score =  103 bits (257), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 48/90 (53%), Gaps = 8/90 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
              +R+ P A K+ +  L           +K+++TA P  G+AN A L  +AK L LS+
Sbjct: 12 IRFRIRVQPRASKNEVCGL-------LDDALKVRLTAPPVDGEANAACLQFIAKTLGLSR 64

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          S +R+++ ++S LK + ++    +++ +  
Sbjct: 65 SQVRLVAGETSRLKTLEVEGVSAEDLRKRF 94


>gi|226942467|ref|YP_002797540.1| hypothetical protein Avin_03050 [Azotobacter vinelandii DJ]
 gi|259646933|sp|C1DI68|Y305_AZOVD RecName: Full=UPF0235 protein Avin_03050
 gi|226717394|gb|ACO76565.1| conserved hypothetical protein [Azotobacter vinelandii DJ]
          Length = 99

 Score =  102 bits (256), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 25/79 (31%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P A K   A L           +KI++TA P +GKAN  +LA LA    + KS 
Sbjct: 13 LACHLQPKASKDEFAGLH-------GERLKIRLTAPPVEGKANAHLLAFLAGVFGVPKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + + S +S+  K + I + 
Sbjct: 66 VSLESGESNRQKRVRIRRP 84


>gi|117926928|ref|YP_867545.1| hypothetical protein Mmc1_3654 [Magnetococcus sp. MC-1]
 gi|166990883|sp|A0LDU6|Y3654_MAGSM RecName: Full=UPF0235 protein Mmc1_3654
 gi|117610684|gb|ABK46139.1| protein of unknown function DUF167 [Magnetococcus sp. MC-1]
          Length = 98

 Score =  102 bits (256), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 7/88 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
          ++ +R+ P A +  +   +          +K+ + A P  G ANKA+   LAK+L ++K 
Sbjct: 12 HLTIRVQPKAAQERVMGWQ-------GEQLKVALNAPPVDGAANKALCHFLAKQLGIAKG 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90
           + ++  + S  K + I      I +  
Sbjct: 65 QVTLVRGEKSREKQLVIQGISPSIWQQF 92


>gi|117919640|ref|YP_868832.1| hypothetical protein Shewana3_1191 [Shewanella sp. ANA-3]
 gi|166228912|sp|A0KUF7|Y1191_SHESA RecName: Full=UPF0235 protein Shewana3_1191
 gi|117611972|gb|ABK47426.1| conserved hypothetical protein [Shewanella sp. ANA-3]
          Length = 96

 Score =  102 bits (256), Expect = 1e-20,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQ 91
          + +L  +    K + I        EI  LL+
Sbjct: 66 VHILKGELGRHKQVRISPPKNVPAEIATLLE 96


>gi|24374867|ref|NP_718910.1| hypothetical protein SO_3356 [Shewanella oneidensis MR-1]
 gi|29839709|sp|Q8EBY9|Y3356_SHEON RecName: Full=UPF0235 protein SO_3356
 gi|24349562|gb|AAN56354.1|AE015772_14 conserved hypothetical protein TIGR00251 [Shewanella oneidensis
          MR-1]
          Length = 96

 Score =  102 bits (255), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +L  +    K + I+       EI+ LL+
Sbjct: 66 VHILKGELGRHKQVRINAPKSVPAEISALLE 96


>gi|146293784|ref|YP_001184208.1| hypothetical protein Sputcn32_2690 [Shewanella putrefaciens
          CN-32]
 gi|166228432|sp|A4Y8X6|Y2690_SHEPC RecName: Full=UPF0235 protein Sputcn32_2690
 gi|145565474|gb|ABP76409.1| protein of unknown function DUF167 [Shewanella putrefaciens
          CN-32]
 gi|319427156|gb|ADV55230.1| protein of unknown function DUF167 [Shewanella putrefaciens 200]
          Length = 96

 Score =  102 bits (254), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNVYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + ++  +    K I I        E++ELL+
Sbjct: 66 VYIIKGELGRHKQIRIVTPKLIPPEVSELLE 96


>gi|70733126|ref|YP_262899.1| hypothetical protein PFL_5841 [Pseudomonas fluorescens Pf-5]
 gi|123652292|sp|Q4K4D4|Y5841_PSEF5 RecName: Full=UPF0235 protein PFL_5841
 gi|68347425|gb|AAY95031.1| conserved hypothetical protein TIGR00251 [Pseudomonas fluorescens
          Pf-5]
          Length = 97

 Score =  102 bits (254), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P A     A L           +KI++TA P +GKAN  ++A LAK   + KS+
Sbjct: 13 LDCHLQPKASSDEFAGLH-------GERLKIRLTAPPVEGKANAHLMAFLAKAFGIPKSN 65

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + ++S + +  K + +    K
Sbjct: 66 VSLVSGELNRQKRVRLQAPKK 86


>gi|114046767|ref|YP_737317.1| hypothetical protein Shewmr7_1261 [Shewanella sp. MR-7]
 gi|123131606|sp|Q0HX95|Y1261_SHESR RecName: Full=UPF0235 protein Shewmr7_1261
 gi|113888209|gb|ABI42260.1| conserved hypothetical protein [Shewanella sp. MR-7]
          Length = 96

 Score =  102 bits (254), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +L  +    K++ I        EI  LL+
Sbjct: 66 VHILKGELGRHKLVRISAPKNVPAEIATLLE 96


>gi|330505393|ref|YP_004382262.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01]
 gi|328919679|gb|AEB60510.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01]
          Length = 96

 Score =  101 bits (253), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 36/80 (45%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P A K   A L           +KI++TA P  GKAN  + A LAK   ++KS 
Sbjct: 13 LDCHLQPKASKDEFAGLH-------GERLKIRLTAPPVDGKANAHLQAFLAKAFGVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + + S + +  K + I    
Sbjct: 66 VILESGELNRQKRLRIRAPQ 85


>gi|114564027|ref|YP_751541.1| hypothetical protein Sfri_2863 [Shewanella frigidimarina NCIMB 400]
 gi|122299086|sp|Q07Z62|Y2863_SHEFN RecName: Full=UPF0235 protein Sfri_2863
 gi|114335320|gb|ABI72702.1| conserved hypothetical protein [Shewanella frigidimarina NCIMB 400]
          Length = 104

 Score =  101 bits (253), Expect = 3e-20,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V + P A +  I  L           +KI +TA P  GKAN  +   LAK   ++KS 
Sbjct: 17  LFVYVQPKASRDQIVGL-------YGNELKIAITAPPIDGKANAYLSKYLAKACKVAKSQ 69

Query: 64  LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
           + ++  +    K I I +      EI  LL 
Sbjct: 70  VHIIKGEQGRHKQIRISQPQVIPPEIAALLS 100


>gi|120598144|ref|YP_962718.1| hypothetical protein Sputw3181_1321 [Shewanella sp. W3-18-1]
 gi|166200340|sp|A1RHL9|Y1321_SHESW RecName: Full=UPF0235 protein Sputw3181_1321
 gi|120558237|gb|ABM24164.1| protein of unknown function DUF167 [Shewanella sp. W3-18-1]
          Length = 96

 Score =  101 bits (252), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + ++  +    K I I        E++ELL+
Sbjct: 66 VYIIKGELGRHKQIRIVTPKLIPPEVSELLE 96


>gi|116623504|ref|YP_825660.1| hypothetical protein Acid_4414 [Candidatus Solibacter usitatus
          Ellin6076]
 gi|116226666|gb|ABJ85375.1| protein of unknown function DUF167 [Candidatus Solibacter
          usitatus Ellin6076]
          Length = 88

 Score =  101 bits (252), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + VR+ P A++S I               K+ + A P  GKAN   +  LA    + 
Sbjct: 1  MARLTVRVHPRARRSEITG-------RLGDAWKLALAAPPVDGKANDECVRFLAGWAGVP 53

Query: 61 KSSLRMLSKQSSPLKIIYIDKDC-KEITELLQN 92
          +S +R+++  +S +K++ I+    +++   L+ 
Sbjct: 54 RSRVRIVTGLTSRIKVVEIEGVPQEDLERRLKA 86


>gi|91975173|ref|YP_567832.1| hypothetical protein RPD_0693 [Rhodopseudomonas palustris BisB5]
 gi|91681629|gb|ABE37931.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
           BisB5]
          Length = 107

 Score =  100 bits (251), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 31/92 (33%), Positives = 50/92 (54%), Gaps = 2/92 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR+ P   +  I  LE   D      +K++V A    G+AN+A++ +LAK L + K +
Sbjct: 13  VAVRVTPRGGRDEIDGLETLSDGRP--VVKVRVRAIADGGEANRAVIELLAKSLGVPKRN 70

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
           +R+LS  +S  K I ID D  ++ E L+   +
Sbjct: 71  VRLLSGATSRQKQIAIDGDPTKLGEALRRLTA 102


>gi|77166447|ref|YP_344972.1| hypothetical protein Noc_3000 [Nitrosococcus oceani ATCC 19707]
 gi|254435182|ref|ZP_05048689.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani
          AFC27]
 gi|123593242|sp|Q3J6V4|Y3000_NITOC RecName: Full=UPF0235 protein Noc_3000
 gi|76884761|gb|ABA59442.1| Conserved hypothetical protein 251 [Nitrosococcus oceani ATCC
          19707]
 gi|207088293|gb|EDZ65565.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani
          AFC27]
          Length = 102

 Score =  100 bits (251), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 7/84 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +RL P AK   +              +KI++TA P +GKAN  +L  LAK   +S++ 
Sbjct: 15 IQIRLQPRAKGDEVIG-------PHGDRLKIRITAPPVEGKANTHLLRFLAKTFQVSRNQ 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87
          + +LS  +S  K + I+K  K + 
Sbjct: 68 VYLLSGATSRDKRVRIEKPTKLLP 91


>gi|126175230|ref|YP_001051379.1| hypothetical protein Sbal_3028 [Shewanella baltica OS155]
 gi|153001556|ref|YP_001367237.1| hypothetical protein Shew185_3043 [Shewanella baltica OS185]
 gi|160876292|ref|YP_001555608.1| hypothetical protein Sbal195_3186 [Shewanella baltica OS195]
 gi|304410074|ref|ZP_07391693.1| protein of unknown function DUF167 [Shewanella baltica OS183]
 gi|307302214|ref|ZP_07581972.1| protein of unknown function DUF167 [Shewanella baltica BA175]
 gi|166229359|sp|A3D6Z7|Y3028_SHEB5 RecName: Full=UPF0235 protein Sbal_3028
 gi|166229364|sp|A6WQT5|Y3043_SHEB8 RecName: Full=UPF0235 protein Shew185_3043
 gi|189039841|sp|A9KXP6|Y3186_SHEB9 RecName: Full=UPF0235 protein Sbal195_3186
 gi|125998435|gb|ABN62510.1| protein of unknown function DUF167 [Shewanella baltica OS155]
 gi|151366174|gb|ABS09174.1| protein of unknown function DUF167 [Shewanella baltica OS185]
 gi|160861814|gb|ABX50348.1| protein of unknown function DUF167 [Shewanella baltica OS195]
 gi|304351483|gb|EFM15882.1| protein of unknown function DUF167 [Shewanella baltica OS183]
 gi|306914252|gb|EFN44673.1| protein of unknown function DUF167 [Shewanella baltica BA175]
 gi|315268481|gb|ADT95334.1| protein of unknown function DUF167 [Shewanella baltica OS678]
          Length = 99

 Score =  100 bits (251), Expect = 5e-20,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + ++  +    K I +  D K I  ++
Sbjct: 66 IHIMKGELGRHKQIRVI-DPKIIPSII 91


>gi|329849894|ref|ZP_08264740.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19]
 gi|328841805|gb|EGF91375.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19]
          Length = 86

 Score =  100 bits (250), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 2/83 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VRL P +    I      +D      +K++V A P +G+AN+A++  LAK L + K
Sbjct: 1  MRLAVRLTPRSSADAIDGW--GEDEQGRRFLKVRVRAAPIEGRANEALIVFLAKTLGVPK 58

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84
          S L +++  +S LK I ID D  
Sbjct: 59 SRLSLVAGDTSRLKQIEIDGDVD 81


>gi|121602415|ref|YP_988648.1| hypothetical protein BARBAKC583_0328 [Bartonella bacilliformis
           KC583]
 gi|120614592|gb|ABM45193.1| conserved hypothetical protein TIGR00251 [Bartonella bacilliformis
           KC583]
          Length = 107

 Score =  100 bits (250), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 26/94 (27%), Positives = 48/94 (51%), Gaps = 2/94 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P A  + I  +E   D     ++ I++ A P+ GKANKA++  LAK+  +  S 
Sbjct: 12  LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 69

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97
           + + +   S  K +      ++I ++L +  + T
Sbjct: 70  ISLENGAISRYKQLRFSGGVEKIEKILHSLGNYT 103


>gi|86747236|ref|YP_483732.1| hypothetical protein RPB_0109 [Rhodopseudomonas palustris HaA2]
 gi|123293376|sp|Q2J3Y9|Y109_RHOP2 RecName: Full=UPF0235 protein RPB_0109
 gi|86570264|gb|ABD04821.1| Protein of unknown function DUF167 [Rhodopseudomonas palustris
           HaA2]
          Length = 108

 Score =  100 bits (250), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 32/92 (34%), Positives = 50/92 (54%), Gaps = 2/92 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR+ P   +  I  LE   D      +K++V A    G+AN+A++ +LAK L + K +
Sbjct: 14  VAVRVTPRGDRDEIDGLETLSDGRP--VVKLRVRAIADGGEANRAVIELLAKALGVPKRN 71

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
           +R+LS  +S  K I ID D K + E L+   +
Sbjct: 72  VRLLSGATSRQKQIAIDGDPKSLGETLRQLTA 103


>gi|1388023|gb|AAB88056.1| putative [Bartonella bacilliformis]
          Length = 109

 Score =  100 bits (250), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 26/94 (27%), Positives = 48/94 (51%), Gaps = 2/94 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P A  + I  +E   D     ++ I++ A P+ GKANKA++  LAK+  +  S 
Sbjct: 14  LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 71

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97
           + + +   S  K +      ++I ++L +  + T
Sbjct: 72  ISLENGAISRYKQLRFSGGVEKIEKILHSLGNYT 105


>gi|217972515|ref|YP_002357266.1| hypothetical protein Sbal223_1335 [Shewanella baltica OS223]
 gi|254800042|sp|B8E927|Y1335_SHEB2 RecName: Full=UPF0235 protein Sbal223_1335
 gi|217497650|gb|ACK45843.1| protein of unknown function DUF167 [Shewanella baltica OS223]
          Length = 99

 Score =  100 bits (250), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   + KS 
Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSD 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + ++  +    K I +  D K I  ++
Sbjct: 66 IHIMKGELGRHKQIRVI-DPKIIPSVI 91


>gi|157960952|ref|YP_001500986.1| hypothetical protein Spea_1124 [Shewanella pealeana ATCC 700345]
 gi|157845952|gb|ABV86451.1| protein of unknown function DUF167 [Shewanella pealeana ATCC
          700345]
          Length = 90

 Score =  100 bits (250), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   L+K   + K  
Sbjct: 8  LQLYIQPKASRDQIVGLH-------GEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGD 60

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + ++  Q    K + I        E+  LL
Sbjct: 61 IDIIKGQMGRHKQVRITAPKLIPAEVQNLL 90


>gi|193213756|ref|YP_001994955.1| hypothetical protein Ctha_0036 [Chloroherpeton thalassium ATCC
          35110]
 gi|193087233|gb|ACF12508.1| protein of unknown function DUF167 [Chloroherpeton thalassium
          ATCC 35110]
          Length = 97

 Score =  100 bits (250), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 7/83 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           +  VRL P A K+ I              +KI++ A P +  ANKA +  LAK   ++K
Sbjct: 11 VDFSVRLQPRASKNEIVG-------EYDGALKIRIAAPPVENAANKACIEFLAKTFGIAK 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84
          S + +LS  +S  K+I I    K
Sbjct: 64 SQVEILSGDTSRNKLIRIYGIDK 86


>gi|212636425|ref|YP_002312950.1| cytosolic protein [Shewanella piezotolerans WP3]
 gi|212557909|gb|ACJ30363.1| Cytosolic protein, putative [Shewanella piezotolerans WP3]
          Length = 90

 Score =  100 bits (249), Expect = 7e-20,   Method: Composition-based stats.
 Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   L+K   + K  
Sbjct: 8  LQLYVQPKASRDQIVGLH-------GNELKIAITAPPIDGKANIHLAKYLSKAFKVPKGD 60

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +L   +   K + I    K I   +QN 
Sbjct: 61 IDILKGLTGRHKQVLIS-SPKVIPPEIQNL 89


>gi|111075036|gb|ABH04883.1| uncharacterized conserved protein yggY [Heliobacillus mobilis]
          Length = 101

 Score =  100 bits (249), Expect = 8e-20,   Method: Composition-based stats.
 Identities = 25/98 (25%), Positives = 50/98 (51%), Gaps = 8/98 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
               +R+ P A K+ +  L           +K+++TA P  G+AN A  A  AK L+L K
Sbjct: 11  VRFKIRVQPRASKNEVCGL-------LDDALKVRLTAPPVDGEANGACQAFFAKTLSLPK 63

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSLTL 98
           S +R+++ ++S  K + +     ++I +L  +  + ++
Sbjct: 64  SQVRLVAGETSRTKTVEVIGVSKEQILKLFDSKQTSSV 101


>gi|39995970|ref|NP_951921.1| hypothetical protein GSU0864 [Geobacter sulfurreducens PCA]
 gi|39982735|gb|AAR34194.1| conserved hypothetical protein TIGR00251 [Geobacter sulfurreducens
           PCA]
 gi|307634758|gb|ADI83708.2| protein of unknown function DUF167 [Geobacter sulfurreducens KN400]
          Length = 107

 Score =  100 bits (249), Expect = 8e-20,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 8/89 (8%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            V + P A ++ I  ++          +K+++T+ P +G+AN+  +  LAK+L + KS +
Sbjct: 23  SVHVQPRASRNEICGVQ-------GEAIKLRLTSPPVEGEANRLCVEFLAKRLGVPKSCV 75

Query: 65  RMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            +++ + S  K I +   D   +  LL+N
Sbjct: 76  AIIAGEKSRHKTIRVSGSDAAAVLALLEN 104


>gi|77461543|ref|YP_351050.1| hypothetical protein Pfl01_5322 [Pseudomonas fluorescens Pf0-1]
 gi|123602932|sp|Q3K595|Y5322_PSEPF RecName: Full=UPF0235 protein Pfl01_5322
 gi|77385546|gb|ABA77059.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1]
          Length = 96

 Score =  100 bits (249), Expect = 9e-20,   Method: Composition-based stats.
 Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 8/86 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P A+      L           +KI++TA P +GKAN  ++  LAK   +SKS 
Sbjct: 13 LECHLQPAARSDDFCGLH-------GDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +LS + +  K + I    K++ +L
Sbjct: 66 VSLLSGELNRQKRVRI-GAPKKLPDL 90


>gi|182413313|ref|YP_001818379.1| hypothetical protein Oter_1495 [Opitutus terrae PB90-1]
 gi|177840527|gb|ACB74779.1| protein of unknown function DUF167 [Opitutus terrae PB90-1]
          Length = 90

 Score = 99.8 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 8/94 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
          C + ++ IPNA ++ I              +K+KV A P +G+AN+ +   LA +L L +
Sbjct: 4  CTIAIKAIPNAPRNQIVGW-------LGDALKVKVHAPPLEGRANEELCEFLADELGLPR 56

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94
           ++ +L   +S  K++ I+  D  ++   L + D
Sbjct: 57 RAVSVLRGDTSRQKLVQIEGLDLAQLKAKLSSLD 90


>gi|330812367|ref|YP_004356829.1| hypothetical protein PSEBR_a5324 [Pseudomonas brassicacearum
          subsp. brassicacearum NFM421]
 gi|327380475|gb|AEA71825.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp.
          brassicacearum NFM421]
          Length = 97

 Score = 99.8 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 8/86 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P A+    A L           +KI++TA P +GKAN  ++A LAK   +SKS 
Sbjct: 13 LECHLQPAARSDDFAGLH-------GDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + ++S + +  K + I    K++ EL
Sbjct: 66 VSLVSGELNRQKRVRIH-SPKKLPEL 90


>gi|294139811|ref|YP_003555789.1| hypothetical protein SVI_1040 [Shewanella violacea DSS12]
 gi|293326280|dbj|BAJ01011.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 100

 Score = 99.8 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  +           +KI +TA P  GKAN  +   L+K   + K  
Sbjct: 13 LNLYIQPKASRDKIIGVH-------GNELKIAITAPPVDGKANAHLTKFLSKAFKVPKGD 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K + I       ++I +LL
Sbjct: 66 IIIHKGELGRHKQVEILTPRVIPEQIADLL 95


>gi|297620780|ref|YP_003708917.1| yggU family protein [Waddlia chondrophila WSU 86-1044]
 gi|297376081|gb|ADI37911.1| yggU family protein [Waddlia chondrophila WSU 86-1044]
          Length = 94

 Score = 99.8 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 29/90 (32%), Positives = 48/90 (53%), Gaps = 7/90 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V+LIPNA K  I+  E          +K+++TA P+KGKAN  ++  LA +L +SK
Sbjct: 9  VVLPVKLIPNAGKDEISGWE-------NGILKVRITAVPEKGKANAHLIKFLASQLKVSK 61

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          S + ++  + +  K + I  D   + E L 
Sbjct: 62 SDITLIKGEKNRHKTLLIKGDADVVGERLS 91


>gi|119775613|ref|YP_928353.1| hypothetical protein Sama_2480 [Shewanella amazonensis SB2B]
 gi|189039696|sp|A1S8H7|Y2480_SHEAM RecName: Full=UPF0235 protein Sama_2480
 gi|119768113|gb|ABM00684.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
          Length = 95

 Score = 99.8 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  +  L           +K+ +TA P  GKAN  +  +LAK   + K  
Sbjct: 13 LALYVQPKASRDELVGLH-------GEELKLAITAPPVDGKANAHICKLLAKAFKVPKGK 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K++ I        +  + L
Sbjct: 66 VSIERGELGRHKLVRIQAPEIIPDDFAQFL 95


>gi|83313474|ref|YP_423738.1| hypothetical protein amb4375 [Magnetospirillum magneticum AMB-1]
 gi|82948315|dbj|BAE53179.1| Uncharacterized conserved protein [Magnetospirillum magneticum
          AMB-1]
          Length = 108

 Score = 99.8 bits (248), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 2/88 (2%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V VRL P A +  I       D +  + +K +VTA P+ GKAN A+L +L+K   + KS
Sbjct: 14 KVAVRLTPKASRDRINGPAAEADGA--VVLKAQVTAVPEDGKANAALLKLLSKAWKIPKS 71

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90
           + ++   +   K+I I  + +++   L
Sbjct: 72 DMDIVLGATDRRKVILISGETEDLRHRL 99


>gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254039908|gb|ACT56704.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter
          asiaticus str. psy62]
          Length = 98

 Score = 99.4 bits (247), Expect = 1e-19,   Method: Composition-based stats.
 Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS
Sbjct: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60

Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98
          KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL
Sbjct: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98


>gi|119899753|ref|YP_934966.1| hypothetical protein azo3464 [Azoarcus sp. BH72]
 gi|166232594|sp|A1KB74|Y3464_AZOSB RecName: Full=UPF0235 protein azo3464
 gi|119672166|emb|CAL96080.1| conserved hypothetical protein [Azoarcus sp. BH72]
          Length = 98

 Score = 99.4 bits (247), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 7/90 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A+++G A L           MKI++ A P  GKAN A+ A LA    + KS+
Sbjct: 15 LTLHIQPGARQTGFAGLH-------GEAMKIRLAAPPVDGKANAALCAFLADFCEVPKSA 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + ++S ++S  K + ++     +   L+  
Sbjct: 68 VTLVSGETSRAKRVRVETKTPGLAGRLRAL 97


>gi|254561320|ref|YP_003068415.1| hypothetical protein METDI2902 [Methylobacterium extorquens DM4]
 gi|254268598|emb|CAX24557.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium
           extorquens DM4]
          Length = 105

 Score = 99.0 bits (246), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P A ++G+  +           + ++V A P +G AN A+ A +AK L L K+ 
Sbjct: 15  LAVRLTPRASRTGLDGVRTEASGRP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
           + ++S ++S  K +++  D + +   ++ 
Sbjct: 73  VTLVSGETSRTKRLHLSGDPQMLAARVEA 101


>gi|167623103|ref|YP_001673397.1| hypothetical protein Shal_1169 [Shewanella halifaxensis HAW-EB4]
 gi|167353125|gb|ABZ75738.1| protein of unknown function DUF167 [Shewanella halifaxensis
          HAW-EB4]
          Length = 90

 Score = 98.7 bits (245), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 35/90 (38%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   L+K   + K  
Sbjct: 8  LQLYIQPKASRDQIVGLH-------GEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGD 60

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   Q    K + I        E+  LL
Sbjct: 61 IEITKGQMGRHKQVKITAPKLIPAEVQNLL 90


>gi|163851553|ref|YP_001639596.1| hypothetical protein Mext_2130 [Methylobacterium extorquens PA1]
 gi|259646591|sp|A9W4L9|Y2130_METEP RecName: Full=UPF0235 protein Mext_2130
 gi|163663158|gb|ABY30525.1| protein of unknown function DUF167 [Methylobacterium extorquens
           PA1]
          Length = 105

 Score = 98.7 bits (245), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P A ++G+  +           + ++V A P +G AN A+ A +AK L L K+ 
Sbjct: 15  LAVRLTPRASRTGLDGVRTEASGRP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
           + +LS ++S  K + +  D + +   ++ 
Sbjct: 73  VTLLSGETSRTKRLLLSGDPQTLAARVEA 101


>gi|240138720|ref|YP_002963192.1| hypothetical protein MexAM1_META1p2120 [Methylobacterium extorquens
           AM1]
 gi|240008689|gb|ACS39915.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium
           extorquens AM1]
          Length = 105

 Score = 98.7 bits (245), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P A ++G+  +           + ++V A P +G AN A+ A +AK L L K+ 
Sbjct: 15  LAVRLTPRASRTGLDGVRTEASGQP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
           + +LS ++S  K +++  D + +   ++ 
Sbjct: 73  VTLLSGETSRTKRLHLSGDPQTLAARVEA 101


>gi|71909498|ref|YP_287085.1| hypothetical protein Daro_3887 [Dechloromonas aromatica RCB]
 gi|123626353|sp|Q478W6|Y3887_DECAR RecName: Full=UPF0235 protein Daro_3887
 gi|71849119|gb|AAZ48615.1| Conserved hypothetical protein 251 [Dechloromonas aromatica RCB]
          Length = 97

 Score = 98.7 bits (245), Expect = 2e-19,   Method: Composition-based stats.
 Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 7/89 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P AKKS  A L           +KI++ A P  GKAN+A++  +A  L L+KS+
Sbjct: 15 LTLHIQPGAKKSEFAGLH-------GDALKIRLAAPPVDGKANEALIRFIADALGLAKSA 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + + S Q+S  K++ I          L +
Sbjct: 68 VHLKSGQTSRRKVLEILGTSTTTIAGLAD 96


>gi|144900433|emb|CAM77297.1| protein containing DUF167 [Magnetospirillum gryphiswaldense
          MSR-1]
          Length = 107

 Score = 98.7 bits (245), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 2/88 (2%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V +RL P   ++ I  L    D    + +K  VTA P+ GKAN A++ MLAK+  ++KS
Sbjct: 14 RVFIRLTPKGSRNKIDGLAAEADG--GMVLKASVTAVPEDGKANAALIKMLAKEWRVAKS 71

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90
             +++  +   K + I  D  E+   L
Sbjct: 72 DFEIVAGATDRRKTVLISGDGAEMAARL 99


>gi|262273743|ref|ZP_06051556.1| hypothetical protein VHA_000718 [Grimontia hollisae CIP 101886]
 gi|262222158|gb|EEY73470.1| hypothetical protein VHA_000718 [Grimontia hollisae CIP 101886]
          Length = 101

 Score = 98.7 bits (245), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 19/92 (20%), Positives = 38/92 (41%), Gaps = 10/92 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P A +                 +K+ +TA P  GKAN  +   L+K+  ++KS 
Sbjct: 16  LRLYIQPKASRDQWMGQH-------GEEIKLAITAPPVDGKANAHLTKFLSKQFKVAKSQ 68

Query: 64  LRMLSKQSSPLKIIYIDKD---CKEITELLQN 92
           + +   +    K + I +       I++LL  
Sbjct: 69  IDIEKGELGRHKQVRIHRPVLCPDAISDLLPA 100


>gi|134298585|ref|YP_001112081.1| hypothetical protein Dred_0717 [Desulfotomaculum reducens MI-1]
 gi|189040591|sp|A4J2F3|Y717_DESRM RecName: Full=UPF0235 protein Dred_0717
 gi|134051285|gb|ABO49256.1| protein of unknown function DUF167 [Desulfotomaculum reducens
          MI-1]
          Length = 94

 Score = 98.7 bits (245), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 19/89 (21%), Positives = 41/89 (46%), Gaps = 7/89 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V VR+ P A K+ +A             +K+++TA P  G AN+A      +   ++K
Sbjct: 11 VVVKVRVQPRASKNSLAG-------EMEGALKVRLTAPPVDGAANEACCKFFGELFGVAK 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S + +++  +   K+++I    ++    +
Sbjct: 64 SKVEIIAGHTGRNKLVHIQGVTEKQARFI 92


>gi|145300496|ref|YP_001143337.1| hypothetical protein ASA_3628 [Aeromonas salmonicida subsp.
          salmonicida A449]
 gi|166232617|sp|A4SRR7|Y3628_AERS4 RecName: Full=UPF0235 protein ASA_3628
 gi|142853268|gb|ABO91589.1| conserved hypothetical protein [Aeromonas salmonicida subsp.
          salmonicida A449]
          Length = 99

 Score = 98.7 bits (245), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 10/93 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  ++K  
Sbjct: 13 LHLMIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQNN 93
          +R++  +    K + I+       EI  LL+  
Sbjct: 66 VRIVRGELGRHKTVAIESPRQIPAEIHALLETQ 98


>gi|315497739|ref|YP_004086543.1| hypothetical protein Astex_0706 [Asticcacaulis excentricus CB 48]
 gi|315415751|gb|ADU12392.1| protein of unknown function DUF167 [Asticcacaulis excentricus CB
          48]
          Length = 89

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 31/89 (34%), Positives = 52/89 (58%), Gaps = 2/89 (2%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  ++VRL P A    +   ++  D     ++K++VTA P +G+AN+A++A LAK+L L 
Sbjct: 1  MARLVVRLTPKAAADRVDGWDM--DEQGRPYLKVRVTAPPIEGRANEALIAFLAKRLKLP 58

Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITEL 89
          KS L +L+  SS LK I ++   +   + 
Sbjct: 59 KSRLSLLAGDSSRLKQIEVEGFDEAALKA 87


>gi|218530362|ref|YP_002421178.1| hypothetical protein Mchl_2407 [Methylobacterium chloromethanicum
           CM4]
 gi|259646623|sp|B7KZL8|Y2407_METC4 RecName: Full=UPF0235 protein Mchl_2407
 gi|218522665|gb|ACK83250.1| protein of unknown function DUF167 [Methylobacterium
           chloromethanicum CM4]
          Length = 105

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P A ++G+  +           + ++V A P +G AN A+ A +AK L L K+ 
Sbjct: 15  LAVRLTPRASRTGLDGVRTEASGRP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
           + +LS ++S  K +++  D + +   ++ 
Sbjct: 73  VTLLSGETSRTKRLHLSGDPQMLAARVEA 101


>gi|153874132|ref|ZP_02002465.1| conserved hypothetical protein [Beggiatoa sp. PS]
 gi|152069404|gb|EDN67535.1| conserved hypothetical protein [Beggiatoa sp. PS]
          Length = 91

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 18/81 (22%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + P + ++ +  +           +K+K+ A P  GKAN  +  + +K+  ++KS 
Sbjct: 13 LSVHVQPRSSQTSVVGVH-------GDRLKVKIMAAPVDGKANAEVCKLFSKQFGVAKSK 65

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + + +  +S  K I I    K
Sbjct: 66 IIIENGHTSRDKRICIKSPQK 86


>gi|49474981|ref|YP_033022.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1]
 gi|49237786|emb|CAF26979.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1]
          Length = 111

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 2/91 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V +IP A    I  +E   D     H+ I++ A P+ GKANKA++  LAK+  +  S 
Sbjct: 12  LFVYIIPKASGDKIMGIECKNDGKR--HLVIRLRAIPENGKANKALIKFLAKQWKIPSSY 69

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           + + S  +S  K +Y     +++ E+L+  +
Sbjct: 70  ISLKSGGTSRYKQLYFSGYLEKLKEILRALE 100


>gi|260773605|ref|ZP_05882521.1| hypothetical protein VIB_002079 [Vibrio metschnikovii CIP 69.14]
 gi|260612744|gb|EEX37947.1| hypothetical protein VIB_002079 [Vibrio metschnikovii CIP 69.14]
          Length = 97

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  +              +KI +TA P  GKAN  +   LAK   +SKS+
Sbjct: 13 LRIYVQPKASRDSLVGQH-------GDELKIAITAPPVDGKANAHLSRYLAKLCKVSKSA 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   Q    K + I      I   +Q  
Sbjct: 66 VEIEKGQLGRHKQVRIL-SPTVIPAEIQAL 94


>gi|49473825|ref|YP_031867.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse]
 gi|49239328|emb|CAF25660.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse]
          Length = 114

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 29/94 (30%), Positives = 49/94 (52%), Gaps = 2/94 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V LIP A    I  +E   D      + I++   P+ GKANKA++  LAK+  +  S 
Sbjct: 12  LFVYLIPKASVDKIIGVECRDDGKQ--RLVIRLRTLPENGKANKALIKFLAKQWKIPSSY 69

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97
           + + S ++S  K +Y     +E+ E+LQ+  + T
Sbjct: 70  ISLKSGETSRYKQLYFSGYLQEVGEILQSLYTST 103


>gi|316931646|ref|YP_004106628.1| hypothetical protein Rpdx1_0252 [Rhodopseudomonas palustris DX-1]
 gi|315599360|gb|ADU41895.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
          DX-1]
          Length = 108

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 31/88 (35%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P   +  I  LE   D      +K++V A    G+AN+A+  +LAK + + K +
Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRP--VLKVRVRAIADGGEANRAVTELLAKAVGVPKRN 71

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          +R+LS  +S  K I ID D K++ E+L+
Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEVLR 99


>gi|90421586|ref|YP_529956.1| hypothetical protein RPC_0058 [Rhodopseudomonas palustris BisB18]
 gi|122477773|sp|Q21D99|Y058_RHOPB RecName: Full=UPF0235 protein RPC_0058
 gi|90103600|gb|ABD85637.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
           BisB18]
          Length = 107

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 2/94 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +V +R+ P   +  I  LE   +      +K++V A  + G+AN+A+  +LAK L + K
Sbjct: 12  ISVALRVTPRGGRDDIDGLETLANG--RTVVKVRVRAIAEGGEANRAVTELLAKALGVPK 69

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
            ++R+LS  +S LK + +D D  E+ E L+   +
Sbjct: 70  RAVRVLSGTTSRLKQVAVDGDPNELGEALRKLTA 103


>gi|114566491|ref|YP_753645.1| hypothetical protein Swol_0959 [Syntrophomonas wolfei subsp.
          wolfei str. Goettingen]
 gi|122318444|sp|Q0AYD0|Y959_SYNWW RecName: Full=UPF0235 protein Swol_0959
 gi|114337426|gb|ABI68274.1| conserved hypothetical protein [Syntrophomonas wolfei subsp.
          wolfei str. Goettingen]
          Length = 102

 Score = 98.3 bits (244), Expect = 3e-19,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          V++ P + ++ I              +KIKV A P +G AN+A+   LA+   L K  +R
Sbjct: 19 VKVQPRSSRNQIVG-------EHEGDLKIKVMAPPVEGAANQALQKFLAELFKLPKKDIR 71

Query: 66 MLSKQSSPLKIIYIDKDCKE 85
          ++  ++S  KI+ I     E
Sbjct: 72 IVRGETSRHKIVEIRGIEAE 91


>gi|330831217|ref|YP_004394169.1| hypothetical protein B565_3517 [Aeromonas veronii B565]
 gi|328806353|gb|AEB51552.1| hypothetical protein B565_3517 [Aeromonas veronii B565]
          Length = 100

 Score = 97.9 bits (243), Expect = 4e-19,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  ++K  
Sbjct: 13 LHLVIQPKASRDQIIGLH-------GEELKVAITAPPVDGQANSHLIKFLAKQFKVAKGQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + ++  +    K + ID   K++ + +   
Sbjct: 66 ITIVRGELGRHKTVAID-SPKQLPQEVSAL 94


>gi|323493577|ref|ZP_08098698.1| hypothetical protein VIBR0546_04984 [Vibrio brasiliensis LMG
          20546]
 gi|323312100|gb|EGA65243.1| hypothetical protein VIBR0546_04984 [Vibrio brasiliensis LMG
          20546]
          Length = 96

 Score = 97.5 bits (242), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 36/90 (40%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I              +K+ +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LRLYIQPKASRDKIVGQH-------GEELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K + I     +I   +Q  
Sbjct: 67 ITIEKGELGRHKQVRIS-SPTQIPNEIQAI 95


>gi|39933491|ref|NP_945767.1| hypothetical protein RPA0414 [Rhodopseudomonas palustris CGA009]
 gi|39647337|emb|CAE25858.1| DUF167 [Rhodopseudomonas palustris CGA009]
          Length = 112

 Score = 97.5 bits (242), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR+ P   +  I  LE   D      +K++V A    G+AN+A+  +LAK + + K +
Sbjct: 18  VAVRVTPRGGRDDIDGLETLSDGRP--VVKVRVRAIADGGEANRAVTELLAKAVGVPKRN 75

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           +R+LS  +S  K I ID D K++ E L+
Sbjct: 76  VRLLSGATSRQKQIAIDGDPKQLGEALR 103


>gi|59711034|ref|YP_203810.1| hypothetical protein VF_0427 [Vibrio fischeri ES114]
 gi|59479135|gb|AAW84922.1| conserved protein [Vibrio fischeri ES114]
          Length = 95

 Score = 97.5 bits (242), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + L P A +  I  +           +KI +TA P  GKAN  ++   +K   ++K  
Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGK 64

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   + +  K + I    + I   +Q  
Sbjct: 65 ITVEKGELNRHKQVRIH-SPELIPNEIQQL 93


>gi|257095142|ref|YP_003168783.1| hypothetical protein CAP2UW1_3597 [Candidatus Accumulibacter
          phosphatis clade IIA str. UW-1]
 gi|257047666|gb|ACV36854.1| protein of unknown function DUF167 [Candidatus Accumulibacter
          phosphatis clade IIA str. UW-1]
          Length = 96

 Score = 97.5 bits (242), Expect = 5e-19,   Method: Composition-based stats.
 Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 7/90 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V + L P AK + IA             +K+++TA P  G+AN A++  LA++L LS+S+
Sbjct: 14 VTIHLQPGAKANEIAG-------RHGDALKVRITAPPVDGRANAALVDFLAQRLGLSRSA 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + + S  +S  K++ I     E    L   
Sbjct: 67 VELKSGLTSRRKVLRISGASAEAVLCLLAE 96


>gi|209966140|ref|YP_002299055.1| hypothetical protein RC1_2875 [Rhodospirillum centenum SW]
 gi|209959606|gb|ACJ00243.1| conserved hypothetical protein [Rhodospirillum centenum SW]
          Length = 117

 Score = 97.5 bits (242), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             V +R+ P A ++ +              +K+ VTA P+ GKAN A++A+LAK   L K
Sbjct: 23  VRVALRVTPKASRTAVQG--PMDGPEGRTLLKLAVTAVPEDGKANAAVIALLAKHWRLPK 80

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
           SS+ ++S  +   K+++I  D  ++   +
Sbjct: 81  SSMSIVSGGTDRTKVLFIAGDAADLLARI 109


>gi|323702139|ref|ZP_08113806.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans
          DSM 574]
 gi|323532826|gb|EGB22698.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans
          DSM 574]
          Length = 95

 Score = 97.1 bits (241), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 20/89 (22%), Positives = 42/89 (47%), Gaps = 7/89 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A K+ +A             +K+++TA P  G AN+A    LA+   ++K
Sbjct: 11 VVLRVRVQPRAAKNSLAG-------EMEGALKVRLTAPPVDGAANEACCRFLAEVFGVAK 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S++ ++S  +   K++ +    +     +
Sbjct: 64 SNVEIISGHTGRNKVVRVAGIDEARARRV 92


>gi|78188297|ref|YP_378635.1| hypothetical protein Cag_0319 [Chlorobium chlorochromatii CaD3]
 gi|123580414|sp|Q3ATT3|Y319_CHLCH RecName: Full=UPF0235 protein Cag_0319
 gi|78170496|gb|ABB27592.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3]
          Length = 99

 Score = 97.1 bits (241), Expect = 6e-19,   Method: Composition-based stats.
 Identities = 16/87 (18%), Positives = 36/87 (41%), Gaps = 7/87 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR  P + KS ++             +K+ + + P    AN+    +LA+   +  S 
Sbjct: 13 IAVRAQPRSSKSMVSG-------EWNGALKVHLQSPPVDDAANEECCRLLARLFQVPPSR 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + +++  SS  K + ++     +   L
Sbjct: 66 VHLVAGHSSRNKRVMVEGVSAAMATEL 92


>gi|163803734|ref|ZP_02197592.1| hypothetical protein 1103602000580_AND4_13653 [Vibrio sp. AND4]
 gi|159172453|gb|EDP57321.1| hypothetical protein AND4_13653 [Vibrio sp. AND4]
          Length = 96

 Score = 97.1 bits (241), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 8/93 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K
Sbjct: 12 VVLRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKVAK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
            + +   +    K I I    ++I   ++   
Sbjct: 65 GLVHIEKGELGRHKQIRIK-SPEQIPTEIKAIT 96


>gi|319408144|emb|CBI81797.1| conserved hypothetical protein [Bartonella schoenbuchensis R1]
          Length = 108

 Score = 97.1 bits (241), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRLIP A    I  +E     +   H+ I++   P+ GKANKA++  L ++  +  S 
Sbjct: 12 LFVRLIPKASMDSIVGVESRDGETQ--HLVIRLRTVPEDGKANKALIKFLGRQWKIPPSY 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          + + S  +S  K +      +E+ + LQ
Sbjct: 70 ISLKSGMTSRYKQLRFSGYVEELEQKLQ 97


>gi|197335333|ref|YP_002155183.1| hypothetical protein VFMJ11_0427 [Vibrio fischeri MJ11]
 gi|197316823|gb|ACH66270.1| conserved hypothetical protein [Vibrio fischeri MJ11]
          Length = 95

 Score = 97.1 bits (241), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + L P A +  I  +           +KI +TA P  GKAN  ++   +K   ++K  
Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGK 64

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   + +  K + I    + I   +Q  
Sbjct: 65 ITVEKGELNRHKQVRIH-SPEIIPNEIQQL 93


>gi|157374368|ref|YP_001472968.1| hypothetical protein Ssed_1229 [Shewanella sediminis HAW-EB3]
 gi|189038638|sp|A8FSL7|Y1229_SHESH RecName: Full=UPF0235 protein Ssed_1229
 gi|157316742|gb|ABV35840.1| protein of unknown function DUF167 [Shewanella sediminis HAW-EB3]
          Length = 95

 Score = 97.1 bits (241), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  +           +KI +TA P  GKAN  ++  L+K   + K  
Sbjct: 13 LNLYIQPKASRDQIVGVH-------GEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGD 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +L  Q    K I I    + I E++   
Sbjct: 66 IVILKGQLGRHKQIKIL-SPRLIPEIINAL 94


>gi|78222380|ref|YP_384127.1| hypothetical protein Gmet_1164 [Geobacter metallireducens GS-15]
 gi|78193635|gb|ABB31402.1| protein of unknown function DUF167 [Geobacter metallireducens
           GS-15]
          Length = 102

 Score = 97.1 bits (241), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 22/86 (25%), Positives = 42/86 (48%), Gaps = 7/86 (8%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            V + P A K+GI  ++          +K+++T+ P +G+AN+     LAK L + KS++
Sbjct: 22  SVHVQPRASKNGICGIQ-------GDAIKLRLTSPPVEGEANRLCTEYLAKLLKVPKSAV 74

Query: 65  RMLSKQSSPLKIIYIDKDCKEITELL 90
            +++   S  K I +     +    L
Sbjct: 75  TIIAGDKSRHKTIRVSGATAQAVHNL 100


>gi|192362044|ref|YP_001980615.1| hypothetical protein CJA_0091 [Cellvibrio japonicus Ueda107]
 gi|226734040|sp|B3PFH5|Y091_CELJU RecName: Full=UPF0235 protein CJA_0091
 gi|190688209|gb|ACE85887.1| conserved hypothetical protein TIGR00251 [Cellvibrio japonicus
          Ueda107]
          Length = 101

 Score = 97.1 bits (241), Expect = 7e-19,   Method: Composition-based stats.
 Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +  +L P A    I  +           +KI++TA P  GKAN+ ++  L+K+  + K +
Sbjct: 17 LHCQLQPKASGDDIVGVH-------GDRLKIRITAPPVDGKANEYLIKWLSKQFRVPKGN 69

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          +++L  +    K + I   
Sbjct: 70 IKILQGELGRHKTLGIHAP 88


>gi|258514375|ref|YP_003190597.1| hypothetical protein Dtox_1088 [Desulfotomaculum acetoxidans DSM
          771]
 gi|257778080|gb|ACV61974.1| protein of unknown function DUF167 [Desulfotomaculum acetoxidans
          DSM 771]
          Length = 98

 Score = 97.1 bits (241), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 23/87 (26%), Positives = 47/87 (54%), Gaps = 8/87 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P A K  +A L           +KI++TA P +G+AN+A+   LAK L ++++ +
Sbjct: 14 KVRVQPRASKDQVAGL-------WEDAVKIRLTAPPVEGEANRALCDFLAKHLGVTRAQV 66

Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELL 90
           +++ Q+   K++ +     + + + L
Sbjct: 67 DLVTGQTGRNKLVRVSGITAESVLQRL 93


>gi|254172671|ref|ZP_04879346.1| conserved hypothetical protein TIGR00251 [Thermococcus sp. AM4]
 gi|214033600|gb|EEB74427.1| conserved hypothetical protein TIGR00251 [Thermococcus sp. AM4]
          Length = 113

 Score = 97.1 bits (241), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 43/87 (49%), Gaps = 8/87 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V + P AKK+ I  ++          +K++V A P  GKANK ++  L+K L    + 
Sbjct: 33  LFVYVQPKAKKNEIEGID-----EWRGRLKVRVKAPPVGGKANKELVKFLSKLLG---AE 84

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELL 90
           + ++  ++S  K + +    +E+   L
Sbjct: 85  VELVRGETSREKDLLVRLSAEEVRRKL 111


>gi|192288849|ref|YP_001989454.1| hypothetical protein Rpal_0418 [Rhodopseudomonas palustris TIE-1]
 gi|226706140|sp|B3QA92|Y418_RHOPT RecName: Full=UPF0235 protein Rpal_0418
 gi|192282598|gb|ACE98978.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
          TIE-1]
          Length = 108

 Score = 97.1 bits (241), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P   +  I  LE   D      +K++V A    G+AN+A+  +LAK + + K +
Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRP--VVKVRVRAIADGGEANRAVTELLAKAVGVPKRN 71

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          +R+LS  +S  K I ID D K++ E L+
Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEALR 99


>gi|170725690|ref|YP_001759716.1| hypothetical protein Swoo_1329 [Shewanella woodyi ATCC 51908]
 gi|226695928|sp|B1KIX3|Y1329_SHEWM RecName: Full=UPF0235 protein Swoo_1329
 gi|169811037|gb|ACA85621.1| protein of unknown function DUF167 [Shewanella woodyi ATCC 51908]
          Length = 95

 Score = 97.1 bits (241), Expect = 8e-19,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  +           +KI +TA P  GKAN  ++  L+K   + K  
Sbjct: 13 LNLYIQPKASRDQIVGVH-------GEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGD 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +L  +    K + +    + I E + + 
Sbjct: 66 INILKGEQGRHKQVKVI-SPRVIPENISSQ 94


>gi|50122551|ref|YP_051718.1| hypothetical protein ECA3630 [Pectobacterium atrosepticum
          SCRI1043]
 gi|81644033|sp|Q6D118|Y3630_ERWCT RecName: Full=UPF0235 protein ECA3630
 gi|49613077|emb|CAG76528.1| conserved hypothetical protein [Pectobacterium atrosepticum
          SCRI1043]
          Length = 96

 Score = 96.7 bits (240), Expect = 9e-19,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        ++ E ++
Sbjct: 66 VVIEKGELGRHKQIRITHPQHIPADVAEFIE 96


>gi|85060008|ref|YP_455710.1| hypothetical protein SG2030 [Sodalis glossinidius str.
          'morsitans']
 gi|123518874|sp|Q2NRC0|Y2030_SODGM RecName: Full=UPF0235 protein SG2030
 gi|84780528|dbj|BAE75305.1| conserved hypothetical protein [Sodalis glossinidius str.
          'morsitans']
          Length = 101

 Score = 96.7 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  IA             +K+ +TA P  G+AN  ++  LAK+  ++KS 
Sbjct: 17 LRLYIQPRASRDHIAGAH-------GDEIKVAITAPPVDGQANSHLIRFLAKEFGVAKSR 69

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K + ID+     + I  LL
Sbjct: 70 VILEKGELGRHKQLRIDQPRQLPEVIARLL 99


>gi|240103723|ref|YP_002960032.1| hypothetical protein TGAM_1666 [Thermococcus gammatolerans EJ3]
 gi|239911277|gb|ACS34168.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
          Length = 124

 Score = 96.7 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 45/87 (51%), Gaps = 8/87 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           ++V + P AKK+ I  ++          +K+KV A P  GKANK ++  L+K L    + 
Sbjct: 44  LLVYVQPKAKKNEIEGID-----EWRGRLKVKVKAPPVGGKANKELVKFLSKVLG---AE 95

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELL 90
           + ++  ++S  K + +    +++ + L
Sbjct: 96  VELVRGETSREKDLLVRMSAEDVKKRL 122


>gi|281355950|ref|ZP_06242443.1| protein of unknown function DUF167 [Victivallis vadensis ATCC
          BAA-548]
 gi|281317319|gb|EFB01340.1| protein of unknown function DUF167 [Victivallis vadensis ATCC
          BAA-548]
          Length = 100

 Score = 96.7 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 45/91 (49%), Gaps = 8/91 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
          C V +R+ P A ++ +  +           +KI + A P  GKAN+ +  + A+   L K
Sbjct: 16 CLVSLRVQPGASRNAVVGM-------YGDAVKIALQAPPVDGKANQLLCRLFAEWSGLPK 68

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
          S++ + S Q+   K++ +     +++  +L+
Sbjct: 69 SAVELRSGQTGRSKVLELSGITAEQLKAILE 99


>gi|163867463|ref|YP_001608662.1| hypothetical protein Btr_0184 [Bartonella tribocorum CIP 105476]
 gi|161017109|emb|CAK00667.1| conserved hypothetical protein [Bartonella tribocorum CIP 105476]
          Length = 108

 Score = 96.7 bits (240), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V LIP +    I  +E         ++ I++ A P+ GKANKA++   AK+  +  SS
Sbjct: 12 LFVYLIPKSSVDKIIGIECRDGEKQ--YLVIRLRAVPEDGKANKALIKFFAKQWKIPSSS 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + + S  +S  K ++     +E+ ++ Q+
Sbjct: 70 ISLKSGATSRYKQLHFSTHLEELKQIWQS 98


>gi|157372267|ref|YP_001480256.1| hypothetical protein Spro_4033 [Serratia proteamaculans 568]
 gi|166979954|sp|A8GJ38|Y4033_SERP5 RecName: Full=UPF0235 protein Spro_4033
 gi|157324031|gb|ABV43128.1| protein of unknown function DUF167 [Serratia proteamaculans 568]
          Length = 96

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 18/89 (20%), Positives = 39/89 (43%), Gaps = 9/89 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  ++K +
Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFKVAKGN 65

Query: 64 LRMLSKQSSPLKIIYIDKDCK--EITELL 90
          + +   +    K + I    +  ++   L
Sbjct: 66 VTIEKGELGRHKQLRIVNPQQIPDVVAAL 94


>gi|282891527|ref|ZP_06300019.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae
          str. Hall's coccus]
 gi|281498618|gb|EFB40945.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae
          str. Hall's coccus]
          Length = 93

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +++IPNA ++ I   E          +K+ + + P+KGKAN+A++  LAK L L K  
Sbjct: 8  LAIKVIPNASRNAILGWE-------NDELKMYIASVPEKGKANEAVIKFLAKFLGLRKQQ 60

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          ++++  +++  KI+ I+   K
Sbjct: 61 IQIIRGETNRHKILQIEGIDK 81


>gi|117621083|ref|YP_858117.1| hypothetical protein AHA_3661 [Aeromonas hydrophila subsp.
          hydrophila ATCC 7966]
 gi|166232625|sp|A0KPB6|Y3661_AERHH RecName: Full=UPF0235 protein AHA_3661
 gi|117562490|gb|ABK39438.1| conserved hypothetical protein [Aeromonas hydrophila subsp.
          hydrophila ATCC 7966]
          Length = 99

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 22/89 (24%), Positives = 41/89 (46%), Gaps = 10/89 (11%)

Query: 8  LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
          + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  ++K  +R++
Sbjct: 17 IQPKASRDQIVGLH-------GEELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQVRIV 69

Query: 68 SKQSSPLKIIYIDKD---CKEITELLQNN 93
            +    K + I+       E++ LL N 
Sbjct: 70 RGELGRHKTVAIEAPRQIPAEVSALLDNQ 98


>gi|323498669|ref|ZP_08103660.1| hypothetical protein VISI1226_18936 [Vibrio sinaloensis DSM
          21326]
 gi|323316269|gb|EGA69289.1| hypothetical protein VISI1226_18936 [Vibrio sinaloensis DSM
          21326]
          Length = 96

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 IRLYIQPKASRDKIVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K ++I     +I   ++  
Sbjct: 67 IDIEKGELGRHKQLWI-CSPAQIPTEIEAI 95


>gi|37523404|ref|NP_926781.1| hypothetical protein glr3835 [Gloeobacter violaceus PCC 7421]
 gi|47117439|sp|Q7NEP3|Y3835_GLOVI RecName: Full=UPF0235 protein glr3835
 gi|35214408|dbj|BAC91776.1| glr3835 [Gloeobacter violaceus PCC 7421]
          Length = 111

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 36/91 (39%), Gaps = 7/91 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V   P A  S +   +           K+++ A P +GKAN   +A++A    + +  
Sbjct: 28  LTVWAQPRASCSEVVGWQQ-------NAFKVRLAAPPVEGKANAECVALIAAFFGVPRRQ 80

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           + ++  Q    K I I+     +   LQ   
Sbjct: 81  VSLVQGQQGRHKKIRIEAPADLLLVALQKLS 111


>gi|271501907|ref|YP_003334933.1| hypothetical protein Dd586_3394 [Dickeya dadantii Ech586]
 gi|270345462|gb|ACZ78227.1| protein of unknown function DUF167 [Dickeya dadantii Ech586]
          Length = 96

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  ++K  
Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGM 65

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + +   +    K I I    
Sbjct: 66 VTIEKGELGRHKQIRIVNPQ 85


>gi|261822847|ref|YP_003260953.1| hypothetical protein Pecwa_3610 [Pectobacterium wasabiae WPP163]
 gi|261606860|gb|ACX89346.1| protein of unknown function DUF167 [Pectobacterium wasabiae
          WPP163]
          Length = 96

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        +I + ++
Sbjct: 66 VVIEKGELGRHKQIRITHPQHIPADIADFIE 96


>gi|115522078|ref|YP_778989.1| hypothetical protein RPE_0048 [Rhodopseudomonas palustris BisA53]
 gi|115516025|gb|ABJ04009.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
           BisA53]
          Length = 107

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +V +R+ P   +  I  LE   +      +K++V A  + G+AN+A+  +LAK L + K
Sbjct: 11  ISVALRVTPRGGRDAIDGLETLANG--RTVVKVRVRAIAEGGEANRAVTELLAKALGVPK 68

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
            ++R+LS  +S LK I +D + + +   L+   +
Sbjct: 69  RAVRVLSGTTSRLKQIAVDGNPELLGAALRKLTA 102


>gi|218710632|ref|YP_002418253.1| hypothetical protein VS_2686 [Vibrio splendidus LGP32]
 gi|218323651|emb|CAV19947.1| Hypothetical protein VS_2686 [Vibrio splendidus LGP32]
          Length = 96

 Score = 96.3 bits (239), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K + I     ++   ++  
Sbjct: 67 ITIEKGELGRHKQVRI-CSPSQLPTEVKAI 95


>gi|293394481|ref|ZP_06638777.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
 gi|291422946|gb|EFE96179.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
          Length = 97

 Score = 96.0 bits (238), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 19/76 (25%), Positives = 36/76 (47%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V + + P A +  I  L           +K+ +TA P  G+AN  +L  +AK+  ++KS+
Sbjct: 13 VRLYIQPKASRDQIIGLH-------GDEIKVAITAPPVDGQANAHLLKFIAKQFKVAKSN 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 VTIEKGELGRHKQLRI 81


>gi|13473522|ref|NP_105090.1| hypothetical protein msl4154 [Mesorhizobium loti MAFF303099]
 gi|29839606|sp|Q98EP2|Y4154_RHILO RecName: Full=UPF0235 protein msl4154
 gi|14024272|dbj|BAB50876.1| msl4154 [Mesorhizobium loti MAFF303099]
          Length = 102

 Score = 96.0 bits (238), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 2/92 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++ VRL P +    +  +E   D     H+K +V A P+ G AN+A+  ++AK L +  
Sbjct: 12  IDLFVRLTPKSSLDRLEGVETSADG--RSHLKARVRAVPENGAANQALERLVAKTLGVPA 69

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           SS+ +++  +S LK + I  D + + + ++  
Sbjct: 70  SSVSVVAGGTSRLKTVRIVGDPEALAQRVEAL 101


>gi|23015143|ref|ZP_00054928.1| COG1872: Uncharacterized conserved protein [Magnetospirillum
          magnetotacticum MS-1]
          Length = 94

 Score = 96.0 bits (238), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 2/86 (2%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VRL P A +  I       D S  + +K +VT  P+ GKAN A+L +L+K   + KS +
Sbjct: 2  AVRLTPKASRDRIMGAAPEADGS--VVLKAQVTTVPEDGKANAALLKLLSKAWKIPKSDM 59

Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90
           ++   +   K+I I  D + + + L
Sbjct: 60 DIVLGATDRRKVILISGDSEVLRKRL 85


>gi|95930347|ref|ZP_01313084.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684]
 gi|95133599|gb|EAT15261.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684]
          Length = 103

 Score = 96.0 bits (238), Expect = 1e-18,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 8/92 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + + + P A K+ +  L+          +K+++T+ P +G ANK      AK L +SK
Sbjct: 17  VVIALFVQPRASKNSLCGLQ-------GEELKVRLTSPPVEGAANKLCCTFFAKLLGVSK 69

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
           SS+ ++    S  K I ++     E+ + L  
Sbjct: 70  SSVTLIRGDKSRHKQIVVEGVSLDEVKQRLAK 101


>gi|242238224|ref|YP_002986405.1| hypothetical protein Dd703_0772 [Dickeya dadantii Ech703]
 gi|242130281|gb|ACS84583.1| protein of unknown function DUF167 [Dickeya dadantii Ech703]
          Length = 99

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 17/80 (21%), Positives = 35/80 (43%), Gaps = 6/80 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 13 IRLYIQPKASRDQIVGLH------GNDEVKVAITAPPVDGQANAHLIQFMAKQFRVAKSR 66

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + +   +    K + I    
Sbjct: 67 VTIEKGELGRHKQLRIHSPQ 86


>gi|153835271|ref|ZP_01987938.1| conserved hypothetical protein [Vibrio harveyi HY01]
 gi|156975836|ref|YP_001446743.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116]
 gi|269960447|ref|ZP_06174820.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
 gi|166232611|sp|A7MZ92|Y3581_VIBHB RecName: Full=UPF0235 protein VIBHAR_03581
 gi|148868246|gb|EDL67386.1| conserved hypothetical protein [Vibrio harveyi HY01]
 gi|156527430|gb|ABU72516.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116]
 gi|269834874|gb|EEZ88960.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
          Length = 96

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 21/92 (22%), Positives = 38/92 (41%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K
Sbjct: 12 VVLRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKVAK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            + +   +    K I I+    +I   ++  
Sbjct: 65 GLVHIEKGELGRHKQIRIE-SPVQIPTEIKAI 95


>gi|332172191|gb|AEE21445.1| protein of unknown function DUF167 [Glaciecola agarilytica
          4H-3-7+YE-5]
          Length = 98

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 10/93 (10%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + + P A +  I  +           +KI +TA P  GKAN  +   LAK+  ++KS
Sbjct: 13 QLRIYIQPKAARDEIVGMH-------GDALKIAITAPPVDGKANAHLCKYLAKQCGVAKS 65

Query: 63 SLRMLSKQSSPLKIIYIDKD---CKEITELLQN 92
           + +   Q +  K + +       + I  LL  
Sbjct: 66 KVAITKGQLNRHKTVVVCAPGVIPEAIQALLNE 98


>gi|227112386|ref|ZP_03826042.1| hypothetical protein PcarbP_05447 [Pectobacterium carotovorum
          subsp. brasiliensis PBR1692]
 gi|253689815|ref|YP_003019005.1| hypothetical protein PC1_3453 [Pectobacterium carotovorum subsp.
          carotovorum PC1]
 gi|259646966|sp|C6DE33|Y3453_PECCP RecName: Full=UPF0235 protein PC1_3453
 gi|251756393|gb|ACT14469.1| protein of unknown function DUF167 [Pectobacterium carotovorum
          subsp. carotovorum PC1]
          Length = 96

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65

Query: 64 LRMLSKQSSPLKIIYID 80
          + +   +    K I I 
Sbjct: 66 VVIEKGELGRHKQIRIT 82


>gi|109896736|ref|YP_659991.1| hypothetical protein Patl_0407 [Pseudoalteromonas atlantica T6c]
 gi|109699017|gb|ABG38937.1| conserved hypothetical protein [Pseudoalteromonas atlantica T6c]
          Length = 90

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 7/81 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           ++ +   P A +  +  L           +K+ +TA P  GKAN  ++  LAK+  ++K
Sbjct: 1  MHLRIYTQPKASRDEVVGLH-------GDELKVAITAPPVDGKANTHLIKYLAKQCGVAK 53

Query: 62 SSLRMLSKQSSPLKIIYIDKD 82
          S + +   Q +  K + I K 
Sbjct: 54 SKVVITKGQLNRHKTVLISKP 74


>gi|311278130|ref|YP_003940361.1| hypothetical protein Entcl_0802 [Enterobacter cloacae SCF1]
 gi|308747325|gb|ADO47077.1| protein of unknown function DUF167 [Enterobacter cloacae SCF1]
          Length = 96

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  +  L           +K+ +TA P  G+AN  ++  LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDSLVGLH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 VIIEKGELGRHKQVRI 81


>gi|295691445|ref|YP_003595138.1| hypothetical protein Cseg_4109 [Caulobacter segnis ATCC 21756]
 gi|295433348|gb|ADG12520.1| protein of unknown function DUF167 [Caulobacter segnis ATCC
          21756]
          Length = 94

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 46/88 (52%), Gaps = 3/88 (3%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +RL P   +  +       D     ++K++V + P  G AN A++A LAK L + +S+
Sbjct: 5  LAIRLTPRGGRDAVEGW--ALDPEGRPYLKVRVASPPVDGAANAALIAFLAKSLKIPRSA 62

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90
          +R+ S +++ +K + ID  D  + T   
Sbjct: 63 VRLASGETARIKRLEIDDVDQADFTRAF 90


>gi|218672707|ref|ZP_03522376.1| hypothetical protein RetlG_14247 [Rhizobium etli GR56]
          Length = 89

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 28/79 (35%), Positives = 50/79 (63%), Gaps = 2/79 (2%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + VRL PN  +     +E         ++K +VTA P+KGKANKA++A+++K L ++KS
Sbjct: 13 RLTVRLTPNGGRDAFDGIETDSQGET--YLKARVTAVPEKGKANKALIALVSKSLGVAKS 70

Query: 63 SLRMLSKQSSPLKIIYIDK 81
          S+ ++S +++  KI+ I+ 
Sbjct: 71 SVSLVSGETARKKILRIEG 89


>gi|118578470|ref|YP_899720.1| hypothetical protein Ppro_0022 [Pelobacter propionicus DSM 2379]
 gi|118501180|gb|ABK97662.1| protein of unknown function DUF167 [Pelobacter propionicus DSM
          2379]
          Length = 103

 Score = 96.0 bits (238), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 8/91 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A K+ I  L           +K+++T+ P  G ANK     LA  L + K
Sbjct: 16 VVLNLYIQPRASKNEICGLV-------DNSLKLRLTSPPVDGAANKLCREFLADLLHVPK 68

Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQ 91
          S++ ++S ++S  K + I   D   I+  +Q
Sbjct: 69 SAVEIISGETSRHKRVRIATADSDLISSRIQ 99


>gi|262393226|ref|YP_003285080.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25]
 gi|262336820|gb|ACY50615.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K I I+    +I   ++  
Sbjct: 67 VHIEKGELGRHKQIRIE-SPTQIPTEIKAI 95


>gi|257465218|ref|ZP_05629589.1| hypothetical protein AM202_01815 [Actinobacillus minor 202]
 gi|257450878|gb|EEV24921.1| hypothetical protein AM202_01815 [Actinobacillus minor 202]
          Length = 100

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +TA P  G AN  +L  L+K   + K
Sbjct: 14 IRLRIFLQPKASRDQIVGLH-------DEELKIAITAPPVDGAANAHLLKFLSKLFKVPK 66

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          SS+ +   +    K I+I  + K+I + ++N 
Sbjct: 67 SSIALEKGELQRHKQIFIP-EPKQIPQEIENL 97


>gi|291286027|ref|YP_003502843.1| hypothetical protein Dacet_0081 [Denitrovibrio acetiphilus DSM
          12809]
 gi|290883187|gb|ADD66887.1| protein of unknown function DUF167 [Denitrovibrio acetiphilus DSM
          12809]
          Length = 84

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 50/91 (54%), Gaps = 7/91 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + P AKK+ ++ +           +KI+V A P +G AN+ ++  L+K+L +SK
Sbjct: 1  MKLSVYVQPGAKKTELSGMH-------DGKIKIRVCAPPVEGAANEVLVKFLSKQLKISK 53

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          S ++++S + S  KI+ I+ D  ++   L  
Sbjct: 54 SGIKIISGEKSRHKIVEINMDTLDVMNCLSK 84


>gi|269965742|ref|ZP_06179839.1| conserved hypothetical protein [Vibrio alginolyticus 40B]
 gi|269829610|gb|EEZ83847.1| conserved hypothetical protein [Vibrio alginolyticus 40B]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K + I+    +I   ++  
Sbjct: 67 VHIEKGELGRHKQVRIE-SPTQIPTEIKAI 95


>gi|27364893|ref|NP_760421.1| hypothetical protein VV1_1522 [Vibrio vulnificus CMCP6]
 gi|29839706|sp|Q8DCB7|Y1522_VIBVU RecName: Full=UPF0235 protein VV1_1522
 gi|27361038|gb|AAO09948.1| UPF0235 protein [Vibrio vulnificus CMCP6]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A +  I  L           +KI +TA P  GKAN  +  +L K   ++K
Sbjct: 12 VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90
          S + +   +    K + +        E+  +L
Sbjct: 65 SLVTIEKGELGRHKQVRVHTPQQIPDEVKAIL 96


>gi|56476391|ref|YP_157980.1| hypothetical protein ebA1762 [Aromatoleum aromaticum EbN1]
 gi|81358142|sp|Q5P6I2|Y954_AZOSE RecName: Full=UPF0235 protein AZOSEA09540
 gi|56312434|emb|CAI07079.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1]
          Length = 97

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 23/90 (25%), Positives = 41/90 (45%), Gaps = 7/90 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P AKK+                MK+++ A P  GKAN A+   LA    + +S+
Sbjct: 14 LSLHVQPGAKKTEFVG-------PHGEAMKLRLAAPPVDGKANAALTVFLAAFCGVGRSA 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +LS ++S  K + I+    E    L+  
Sbjct: 67 VSLLSGETSRAKRVRIEGAGSEALARLRAL 96


>gi|294634379|ref|ZP_06712916.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685]
 gi|291092187|gb|EFE24748.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 13 LRLYIQPKASRDQIVGLH-------GEELKVAITAPPVDGQANAHLIKFIAKQFRVAKSL 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K + I +       +   L+
Sbjct: 66 ITIEKGELGRHKQLRIHQPQQIPDVVAAALE 96


>gi|227325979|ref|ZP_03830003.1| hypothetical protein PcarcW_01116 [Pectobacterium carotovorum
          subsp. carotovorum WPP14]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65

Query: 64 LRMLSKQSSPLKIIYID 80
          + +   +    K I I 
Sbjct: 66 VVIEKGELGRHKQIRIT 82


>gi|226942094|ref|YP_002797168.1| hypothetical protein LHK_03181 [Laribacter hongkongensis HLHK9]
 gi|254801627|sp|C1D6C4|Y3181_LARHH RecName: Full=UPF0235 protein LHK_03181
 gi|226717021|gb|ACO76159.1| Uncharacterized conserved protein [Laribacter hongkongensis
          HLHK9]
          Length = 97

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 7/89 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A+++ +A L           +KI++ A P  GKAN  +LA LA+ L +S+
Sbjct: 11 VRLTLHVQPGARRTEVAGLH-------GDALKIRLAAPPVDGKANACLLAFLARGLGVSR 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S++ + S   S  K++ I     E    L
Sbjct: 64 SAVTLKSGDCSRHKVVDIRGITPEAAAGL 92


>gi|260767468|ref|ZP_05876405.1| hypothetical protein VFA_000519 [Vibrio furnissii CIP 102972]
 gi|260617580|gb|EEX42762.1| hypothetical protein VFA_000519 [Vibrio furnissii CIP 102972]
 gi|315181248|gb|ADT88162.1| hypothetical protein vfu_A03054 [Vibrio furnissii NCTC 11218]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK+  ++K  
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQCKVAKGL 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K I I     +I   +Q  
Sbjct: 66 IDIEKGELGRHKQIRIH-TPAQIPPEVQAI 94


>gi|329890913|ref|ZP_08269256.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC
          11568]
 gi|328846214|gb|EGF95778.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC
          11568]
          Length = 93

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 27/92 (29%), Positives = 43/92 (46%), Gaps = 3/92 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V+L P A    I   ++  D      +K++V A P +G+AN A+ A LAK L + K
Sbjct: 3  ARIPVKLTPRASADRIDGWDVDPDGRP--VLKVRVRAQPVEGEANAALTAFLAKALGVPK 60

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            + +     S LK+I +D     E+   L  
Sbjct: 61 RDVALARGGQSRLKMIEVDGLTDAEVRARLPA 92


>gi|240949737|ref|ZP_04754069.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305]
 gi|240295769|gb|EER46456.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305]
          Length = 100

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +TA P  G AN  +L  L+K   + K
Sbjct: 14 IRLRIFLQPKASRDQIVGLH-------DEELKIAITAPPVDGAANAHLLKFLSKLFKVPK 66

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          SS+ +   +    K I+I  + K+I + ++N 
Sbjct: 67 SSIALEKGELQRHKQIFIP-EPKQIPQEIENL 97


>gi|299139441|ref|ZP_07032616.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8]
 gi|298598710|gb|EFI54873.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8]
          Length = 99

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 23/90 (25%), Positives = 45/90 (50%), Gaps = 8/90 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
          C++ VR+ P AK++ +              +KI +T  P  G+AN A++A L+ +L + +
Sbjct: 12 CSLPVRVHPGAKQNAVTGTH-------DGSLKISLTTPPTDGRANTALIAFLSDRLNIPR 64

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          + + +L+  +S  K + I      E+   L
Sbjct: 65 AHIELLTGATSRSKTLRIAGLTSAEVEARL 94


>gi|123443630|ref|YP_001007602.1| hypothetical protein YE3436 [Yersinia enterocolitica subsp.
          enterocolitica 8081]
 gi|166232591|sp|A1JPU6|Y3436_YERE8 RecName: Full=UPF0235 protein YE3436
 gi|122090591|emb|CAL13460.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
          enterocolitica 8081]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 13 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        E+  LL+
Sbjct: 66 VIIEKGELGRHKQIKIVNPQQIPPEVATLLE 96


>gi|52424377|ref|YP_087514.1| hypothetical protein MS0322 [Mannheimia succiniciproducens
          MBEL55E]
 gi|81387574|sp|Q65VT1|Y322_MANSM RecName: Full=UPF0235 protein MS0322
 gi|52306429|gb|AAU36929.1| unknown [Mannheimia succiniciproducens MBEL55E]
          Length = 95

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 41/89 (46%), Gaps = 8/89 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A +  I  +           +KI +TA P  G AN  +L  L+K   + KS
Sbjct: 12 RLRIFLQPKASRDKIIGIH-------DDELKIAITAPPVDGAANAHLLKYLSKAFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          ++ +   + +  K ++I  + K I E LQ
Sbjct: 65 AIILEKGELNRHKQLFIP-EPKLIPEELQ 92


>gi|152979630|ref|YP_001345259.1| hypothetical protein Asuc_1977 [Actinobacillus succinogenes 130Z]
 gi|171704390|sp|A6VQS7|Y1977_ACTSZ RecName: Full=UPF0235 protein Asuc_1977
 gi|150841353|gb|ABR75324.1| protein of unknown function DUF167 [Actinobacillus succinogenes
          130Z]
          Length = 98

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 10/94 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A K  I  L           +KI +TA P  G AN  ++  L+K   + K
Sbjct: 11 IRLRIMLQPKASKDAIIGLH-------DEELKISITAPPVDGAANAHLIKYLSKAFKVPK 63

Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELLQN 92
          S++++   + +  K ++I       + + +LL N
Sbjct: 64 SAVQLEKGELNRHKQVFIPAPKIIPEAVRQLLDN 97


>gi|261250161|ref|ZP_05942737.1| hypothetical protein VIA_000181 [Vibrio orientalis CIP 102891]
 gi|260939277|gb|EEX95263.1| hypothetical protein VIA_000181 [Vibrio orientalis CIP 102891]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I              +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LRLYIQPKASRDKIVGQH-------GEELKIAITAPPVDGKANAHLSKYLAKQFKVAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K + I     +I + ++  
Sbjct: 67 ITIEKGELGRHKQVRIQ-SPVQIPQEIKAI 95


>gi|77918231|ref|YP_356046.1| hypothetical protein Pcar_0617 [Pelobacter carbinolicus DSM 2380]
 gi|123574815|sp|Q3A6Y1|Y617_PELCD RecName: Full=UPF0235 protein Pcar_0617
 gi|77544314|gb|ABA87876.1| conserved hypothetical protein TIGR00251 [Pelobacter carbinolicus
          DSM 2380]
          Length = 95

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 8/90 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + P A ++ +A L+          +KI++T+ P +G ANK     LAK L ++K
Sbjct: 12 VVLSVHVQPRASRNELAGLQ-------GESLKIRLTSPPVEGAANKLCREFLAKLLGVAK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELL 90
          S + ++S   S  K + I+     E+   L
Sbjct: 65 SRVTLVSGDKSRHKRLLIEGVTLDEVRNKL 94


>gi|161486600|ref|NP_935670.2| hypothetical protein VV2877 [Vibrio vulnificus YJ016]
 gi|47117406|sp|Q7MHJ2|Y2877_VIBVY RecName: Full=UPF0235 protein VV2877
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A +  I  L           +KI +TA P  GKAN  +  +L K   ++K
Sbjct: 12 VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90
          S + +   +    K + +        E+  +L
Sbjct: 65 SLVTIEKGELGRHKQVRVHTPQQIPDEVKAIL 96


>gi|91228690|ref|ZP_01262604.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01]
 gi|91187761|gb|EAS74079.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01]
          Length = 96

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K I I+    +I   ++  
Sbjct: 67 VHIEKGELGRHKQIRIE-SPTQIPTEIKAI 95


>gi|330444266|ref|YP_004377252.1| hypothetical protein G5S_0576 [Chlamydophila pecorum E58]
 gi|328807376|gb|AEB41549.1| conserved hypothetical protein [Chlamydophila pecorum E58]
          Length = 100

 Score = 95.6 bits (237), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 29/90 (32%), Positives = 53/90 (58%), Gaps = 10/90 (11%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          +C + V++ P AK++ I   +          +K++VT  P+KGKAN+A++++LAK L + 
Sbjct: 16 LCILEVQVTPKAKENKIVGFQ-------GEVLKVRVTEPPEKGKANEAVVSLLAKALGIP 68

Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          K  + ++S +SS  K + I    K++ E L
Sbjct: 69 KRDVTLVSGESSRKKKLMI---PKKVQEKL 95


>gi|296133363|ref|YP_003640610.1| protein of unknown function DUF167 [Thermincola sp. JR]
 gi|296031941|gb|ADG82709.1| protein of unknown function DUF167 [Thermincola potens JR]
          Length = 106

 Score = 95.2 bits (236), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 20/85 (23%), Positives = 45/85 (52%), Gaps = 7/85 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           +++ P A K+ +  ++          +K+K+TA P +G AN+A +   A+  +++KS +
Sbjct: 14 KIKVQPKASKNELKGVQ-------GDSLKVKLTAPPVEGAANEACIRFFAELFSVAKSQV 66

Query: 65 RMLSKQSSPLKIIYIDKDCKEITEL 89
           +++  +S  K++ +    KE  E 
Sbjct: 67 EIITGHTSRTKLLKVKGLTKEEAEK 91


>gi|193212041|ref|YP_001997994.1| hypothetical protein Cpar_0370 [Chlorobaculum parvum NCIB 8327]
 gi|226705834|sp|B3QL06|Y370_CHLP8 RecName: Full=UPF0235 protein Cpar_0370
 gi|193085518|gb|ACF10794.1| protein of unknown function DUF167 [Chlorobaculum parvum NCIB
          8327]
          Length = 105

 Score = 95.2 bits (236), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 45/89 (50%), Gaps = 8/89 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P + K+GIA             +KI + + P    ANK    +LAK L + +S+
Sbjct: 13 LSVRVQPRSSKTGIAG-------RYGDQVKICLKSAPVDNAANKECCQLLAKTLGVPRSN 65

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
          + +++ Q+S  K++ ++     E+ + L 
Sbjct: 66 VSVMNGQTSRSKVLKVEGMTPSELRKALA 94


>gi|332162814|ref|YP_004299391.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp.
          palearctica 105.5R(r)]
 gi|318604345|emb|CBY25843.1| upf0235 protein VC0458 [Yersinia enterocolitica subsp.
          palearctica Y11]
 gi|325667044|gb|ADZ43688.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp.
          palearctica 105.5R(r)]
 gi|330859004|emb|CBX69362.1| UPF0235 protein YE3436 [Yersinia enterocolitica W22703]
          Length = 96

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 13 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        E+  LL+
Sbjct: 66 VIIEKGELGRHKQIKIVNPQQIPPEVAALLE 96


>gi|251788394|ref|YP_003003115.1| hypothetical protein Dd1591_0757 [Dickeya zeae Ech1591]
 gi|247537015|gb|ACT05636.1| protein of unknown function DUF167 [Dickeya zeae Ech1591]
          Length = 100

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  ++K  
Sbjct: 17 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGM 69

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + +   +    K I I    
Sbjct: 70 VTIEKGELGRHKQIRIVNPQ 89


>gi|320155276|ref|YP_004187655.1| osmotic shock response integral membrane protein YggT [Vibrio
          vulnificus MO6-24/O]
 gi|319930588|gb|ADV85452.1| integral membrane protein YggT, involved in response to
          extracytoplasmic stress (osmotic shock) [Vibrio
          vulnificus MO6-24/O]
          Length = 96

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A +  I  L           +KI +TA P  GKAN  +  +L K   ++K
Sbjct: 12 VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90
          S + +   +    K + +        E+  +L
Sbjct: 65 SLVTIEKGELGRHKQVRVHAPQQIPDEVKAIL 96


>gi|238752326|ref|ZP_04613805.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380]
 gi|238709487|gb|EEQ01726.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380]
          Length = 90

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 7  LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFKVAKSQ 59

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K + I        E+  LL+
Sbjct: 60 VIIEKGELGRHKQLKIVNPQQIPPEVAALLE 90


>gi|270263061|ref|ZP_06191331.1| threonine dehydratase [Serratia odorifera 4Rx13]
 gi|270042749|gb|EFA15843.1| threonine dehydratase [Serratia odorifera 4Rx13]
          Length = 96

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 18/90 (20%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS+
Sbjct: 13 IRLYIQPKASRDQIIGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFKVAKSN 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K + I    ++I  ++   
Sbjct: 66 VTIEKGELGRHKQLRIVN-PQQIPAVVAAL 94


>gi|254784473|ref|YP_003071901.1| hypothetical protein TERTU_0220 [Teredinibacter turnerae T7901]
 gi|237683524|gb|ACR10788.1| conserved hypothetical protein [Teredinibacter turnerae T7901]
          Length = 99

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 8/81 (9%)

Query: 8  LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
          L P A     A L+          +KI++TA P  GKAN  ++  LA++  ++KS + ++
Sbjct: 18 LQPKASSDAFAGLQA-------DRLKIRITAPPTDGKANAHLVKYLARQFGVAKSDIEIV 70

Query: 68 SKQSSPLKIIYIDKDCKEITE 88
            Q S  K + I    K I  
Sbjct: 71 RGQLSRQKTLQI-NHPKHIPA 90


>gi|238795055|ref|ZP_04638648.1| hypothetical protein yinte0001_4160 [Yersinia intermedia ATCC
          29909]
 gi|238725603|gb|EEQ17164.1| hypothetical protein yinte0001_4160 [Yersinia intermedia ATCC
          29909]
          Length = 90

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 7  LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLIKFIAKQFRVAKSQ 59

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        E+  LL+
Sbjct: 60 VVIEKGELGRHKQIKIVNPQQIPPEVAALLE 90


>gi|295106861|emb|CBL04404.1| Uncharacterized conserved protein [Gordonibacter pamelaeae
          7-10-1-b]
          Length = 96

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 46/92 (50%), Gaps = 2/92 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + P + +  ++ +      +    + ++VTA P  GKANKA+  ++A+ L + K
Sbjct: 4  AIIAVHVTPRSGRDEVSGVRADAAGA--DEVCVRVTAPPDGGKANKAVCKLVAEALGVPK 61

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          S + + S  ++  K + ++ D  ++   L + 
Sbjct: 62 SRVGVASGHTARRKRLSVEADQAQVDAWLASL 93


>gi|308050656|ref|YP_003914222.1| hypothetical protein Fbal_2946 [Ferrimonas balearica DSM 9799]
 gi|307632846|gb|ADN77148.1| protein of unknown function DUF167 [Ferrimonas balearica DSM
          9799]
          Length = 96

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  +  L            K+ +TA P  GKAN  ++  LAK+  ++K  
Sbjct: 14 LKLYIQPKASRDQLVGLH-------GEEFKVAITAPPVDGKANAHLVKFLAKQFKVAKGQ 66

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + ++  +    K + I         + +LL
Sbjct: 67 ISIVKGELGRHKQLKIQSPTVIPDPLAQLL 96


>gi|269137631|ref|YP_003294331.1| hypothetical protein ETAE_0273 [Edwardsiella tarda EIB202]
 gi|267983291|gb|ACY83120.1| hypothetical protein ETAE_0273 [Edwardsiella tarda EIB202]
 gi|304557696|gb|ADM40360.1| hypothetical protein ETAF_0236 [Edwardsiella tarda FL6-60]
          Length = 96

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +L  +AK+  ++KS 
Sbjct: 13 LRLYIQPKASRDLIIGLH-------GDELKVAITAPPVDGQANAHLLKFIAKQFRVAKSR 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K + I +       +   L+
Sbjct: 66 ITLEKGELGRHKQLRISQPQQIPDAVAAALE 96


>gi|307132433|ref|YP_003884449.1| hypothetical protein Dda3937_02598 [Dickeya dadantii 3937]
 gi|306529962|gb|ADM99892.1| conserved protein [Dickeya dadantii 3937]
          Length = 100

 Score = 95.2 bits (236), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P A +  I  L           +K+ +TA P  G+AN  ++  LA++  ++K  
Sbjct: 17  IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLARQFRVAKGM 69

Query: 64  LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
           + +   +    K I I        ++ EL+ 
Sbjct: 70  VTIEKGELGRHKQIRIVNPQAIPADVAELIS 100


>gi|288959535|ref|YP_003449876.1| hypothetical protein AZL_026940 [Azospirillum sp. B510]
 gi|288911843|dbj|BAI73332.1| hypothetical protein AZL_026940 [Azospirillum sp. B510]
          Length = 113

 Score = 94.8 bits (235), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 2/89 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            V +R+ P A ++ +  +           +K+ VTA P+ GKAN+A++ +L+K   + K+
Sbjct: 15  RVALRVTPKASRNAVTGMADTAAG--GRVLKLAVTAVPENGKANEAVIKLLSKAWKVPKT 72

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           SL +++  +   KI+++  D   +   L 
Sbjct: 73  SLTVVAGATDRNKILHVAGDPAALLARLS 101


>gi|322418850|ref|YP_004198073.1| hypothetical protein GM18_1329 [Geobacter sp. M18]
 gi|320125237|gb|ADW12797.1| protein of unknown function DUF167 [Geobacter sp. M18]
          Length = 104

 Score = 94.8 bits (235), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 22/88 (25%), Positives = 42/88 (47%), Gaps = 8/88 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V + P A +S I              ++I++T+ P    ANK  + ++AK L L+KS +
Sbjct: 18 TVHVQPRASRSEICG-------PKDGELRIRLTSPPVDDAANKQCVELIAKSLGLAKSKV 70

Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91
           + S   S  K++ ++  D  ++  L +
Sbjct: 71 SIKSGAKSRHKVVRVEGVDQDDLLRLFK 98


>gi|284046655|ref|YP_003396995.1| hypothetical protein Cwoe_5214 [Conexibacter woesei DSM 14684]
 gi|283950876|gb|ADB53620.1| protein of unknown function DUF167 [Conexibacter woesei DSM
          14684]
          Length = 88

 Score = 94.8 bits (235), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 7/81 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M ++ VRL P AK++ I              + ++VTA P  GKAN A+  +LAK L ++
Sbjct: 1  MGDLRVRLQPRAKRNEIVG-------ERDGALVVRVTAPPVDGKANAALCRLLAKALGVA 53

Query: 61 KSSLRMLSKQSSPLKIIYIDK 81
           S++ ++  QS+  K++++D 
Sbjct: 54 PSTVTVVRGQSARDKVVHVDA 74


>gi|238763264|ref|ZP_04624229.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC
          33638]
 gi|238698537|gb|EEP91289.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC
          33638]
          Length = 90

 Score = 94.8 bits (235), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 7  LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 59

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        E+  LL+
Sbjct: 60 VIIEKGELGRHKQIKIVNPQQIPPEVAALLE 90


>gi|328474077|gb|EGF44882.1| hypothetical protein VP10329_15255 [Vibrio parahaemolyticus
          10329]
          Length = 96

 Score = 94.8 bits (235), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K I I+       E+  +L
Sbjct: 67 VHIEKGELGRHKQIRIESPVQIPAEVKAIL 96


>gi|317493545|ref|ZP_07951966.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae
          bacterium 9_2_54FAA]
 gi|316918488|gb|EFV39826.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae
          bacterium 9_2_54FAA]
          Length = 99

 Score = 94.8 bits (235), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 8/92 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   +AK+  ++KS 
Sbjct: 15 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLQKFIAKQFRVAKSQ 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
          + +   +    K + I    ++I E++    S
Sbjct: 68 VVIEKGELGRHKQVRIS-QPQQIPEVVSALRS 98


>gi|148980491|ref|ZP_01816088.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3]
 gi|145961216|gb|EDK26530.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3]
          Length = 96

 Score = 94.8 bits (235), Expect = 3e-18,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          +++   +    K + I     +I   ++  
Sbjct: 67 IKIEKGELGRHKQVRI-CSPSQIPTEVKAI 95


>gi|42522071|ref|NP_967451.1| hypothetical protein Bd0463 [Bdellovibrio bacteriovorus HD100]
 gi|39574602|emb|CAE78444.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100]
          Length = 94

 Score = 94.8 bits (235), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 8/91 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P + K+ +              +KIK+TA P  GKAN+ ++  L+    + K
Sbjct: 9  VRLHLFIQPKSSKNEVVG-------PHNGEIKIKLTAPPVDGKANECLIEFLSDLFDIPK 61

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
            + ++  ++   K++ +   D ++  E L+
Sbjct: 62 RDVHLIKGETGRHKVVELAGLDVEKTREALR 92


>gi|90412037|ref|ZP_01220044.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK]
 gi|90327015|gb|EAS43394.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK]
          Length = 97

 Score = 94.8 bits (235), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++KS 
Sbjct: 15 IRLYIQPKASRDQIVGLH-------GEELKIAITAPPVDGKANAHLSKFLAKQFRVAKSQ 67

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +        K + I+        I ELL
Sbjct: 68 VLIEKGMQGRHKQVRIESPREIPPVIAELL 97


>gi|261342370|ref|ZP_05970228.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC
          35316]
 gi|288315005|gb|EFC53943.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC
          35316]
          Length = 98

 Score = 94.4 bits (234), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + +   +    K + I    
Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85


>gi|37199811|dbj|BAC95641.1| conserved hypothetical protein [Vibrio vulnificus YJ016]
          Length = 101

 Score = 94.4 bits (234), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + + + P A +  I  L           +KI +TA P  GKAN  +  +L K   ++K
Sbjct: 17  VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 69

Query: 62  SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90
           S + +   +    K + +        E+  +L
Sbjct: 70  SLVTIEKGELGRHKQVRVHTPQQIPDEVKAIL 101


>gi|86146416|ref|ZP_01064740.1| hypothetical protein MED222_22586 [Vibrio sp. MED222]
 gi|85835895|gb|EAQ54029.1| hypothetical protein MED222_22586 [Vibrio sp. MED222]
          Length = 96

 Score = 94.4 bits (234), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          +++   +    K + I     +I   ++  
Sbjct: 67 IKIEKGELGRHKQVRI-CSPSQIPTEVKAI 95


>gi|295097492|emb|CBK86582.1| conserved hypothetical protein TIGR00251 [Enterobacter cloacae
          subsp. cloacae NCTC 9394]
          Length = 98

 Score = 94.4 bits (234), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + +   +    K + I    
Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85


>gi|238754614|ref|ZP_04615968.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473]
 gi|238707245|gb|EEP99608.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473]
          Length = 93

 Score = 94.4 bits (234), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 10 IRLYIQPKASRDQIIGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSH 62

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +        K I I        ++  LL+
Sbjct: 63 VIIEKGDLGRHKQIKIINPQQIPPQVAALLE 93


>gi|238918244|ref|YP_002931758.1| hypothetical protein NT01EI_0281 [Edwardsiella ictaluri 93-146]
 gi|259646912|sp|C5BCR7|Y281_EDWI9 RecName: Full=UPF0235 protein NT01EI_0281
 gi|238867812|gb|ACR67523.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146]
          Length = 96

 Score = 94.4 bits (234), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 19/89 (21%), Positives = 40/89 (44%), Gaps = 8/89 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +L  +AK+  ++KS 
Sbjct: 13 LRLYIQPKASRDLIIGLH-------GDELKVAITAPPVDGQANAHLLKFIAKQFRVAKSR 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + +   +    K + I    ++I + +  
Sbjct: 66 ITLEKGELGRHKQLRIS-QPQQIPDAVAA 93


>gi|289208238|ref|YP_003460304.1| hypothetical protein TK90_1056 [Thioalkalivibrio sp. K90mix]
 gi|288943869|gb|ADC71568.1| protein of unknown function DUF167 [Thioalkalivibrio sp. K90mix]
          Length = 85

 Score = 94.4 bits (234), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 46/92 (50%), Gaps = 8/92 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + VR+ P +K+  I              +K++V A P+KG+AN+A+ A+LAK L   
Sbjct: 1  MARLRVRVAPGSKRDAIGPW-------MGDILKLRVQAPPEKGRANEAVCALLAKALGCP 53

Query: 61 KSSLRMLSKQSSPLKIIYIDKDCK-EITELLQ 91
             + +++  ++  K + I+   + ++   L 
Sbjct: 54 ARDVSVVAGATARDKTVAIEGYSEADLRRALS 85


>gi|54310243|ref|YP_131263.1| hypothetical protein PBPRA3146 [Photobacterium profundum SS9]
 gi|46914684|emb|CAG21461.1| conserved hypothetical protein [Photobacterium profundum SS9]
          Length = 101

 Score = 94.4 bits (234), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 10/90 (11%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 19  IRLYIQPKASRDQIVGLH-------GEELKIAITAPPVDGKANAHLSKFLAKQFRVAKGQ 71

Query: 64  LRMLSKQSSPLKIIYIDKD---CKEITELL 90
           + +        K + I+        I ELL
Sbjct: 72  VLIEKGMQGRHKQVRIESPREIPPVIAELL 101


>gi|269837719|ref|YP_003319947.1| hypothetical protein Sthe_1691 [Sphaerobacter thermophilus DSM
          20745]
 gi|269786982|gb|ACZ39125.1| protein of unknown function DUF167 [Sphaerobacter thermophilus
          DSM 20745]
          Length = 102

 Score = 94.4 bits (234), Expect = 5e-18,   Method: Composition-based stats.
 Identities = 19/92 (20%), Positives = 41/92 (44%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V VR+ P A ++ +  +           +++++ A P +G AN+A+   LA  L L K
Sbjct: 14 TQVTVRVTPRASRTQVDGV-------ADGALRVRLAAPPVEGAANRALTEFLANLLRLPK 66

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            + +++      K + +      +++E L  
Sbjct: 67 RDVELVAGARGRQKTVLLRGLTPADVSERLTA 98


>gi|325577506|ref|ZP_08147868.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae
          ATCC 33392]
 gi|325160610|gb|EGC72734.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae
          ATCC 33392]
          Length = 95

 Score = 94.0 bits (233), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  I  L           +KI +TA P  G+AN  +L  L+K   + KS
Sbjct: 12 RLKIILQPKASKDQIVGLH-------DDELKITITAPPVDGQANAHLLKFLSKTFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          S+ +   + +  K ++I    K I   +QN 
Sbjct: 65 SIVLEKGELNRHKQVWIP-SPKLIPSEIQNL 94


>gi|322834241|ref|YP_004214268.1| hypothetical protein Rahaq_3549 [Rahnella sp. Y9602]
 gi|321169442|gb|ADW75141.1| protein of unknown function DUF167 [Rahnella sp. Y9602]
          Length = 101

 Score = 94.0 bits (233), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 10/92 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P A +  +  L           +K+ +TA P  G+AN  ++  LAK+  ++KS 
Sbjct: 17  LRLVIQPKASRDSLVGLH-------GDELKVAITAPPVDGQANTHLVKFLAKQFKVAKSQ 69

Query: 64  LRMLSKQSSPLKIIYID---KDCKEITELLQN 92
           + +   +    K + I        E+  LL  
Sbjct: 70  VSIEKGELGRHKQVRITHPQNIPTEVAVLLAE 101


>gi|33151941|ref|NP_873294.1| hypothetical protein HD0778 [Haemophilus ducreyi 35000HP]
 gi|47117473|sp|Q7VN15|Y778_HAEDU RecName: Full=UPF0235 protein HD_0778
 gi|33148163|gb|AAP95683.1| conserved hypothetical protein [Haemophilus ducreyi 35000HP]
          Length = 97

 Score = 94.0 bits (233), Expect = 6e-18,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 39/92 (42%), Gaps = 10/92 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +K+ +TA P  G AN  +L  L+K   + K
Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKVAITAPPVDGAANAYLLKYLSKLFKVPK 65

Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90
          SS+ +   +    K +++       KEI + L
Sbjct: 66 SSIVLEKGELQRHKQLFVPAPKLLPKEIEQWL 97


>gi|146312998|ref|YP_001178072.1| hypothetical protein Ent638_3359 [Enterobacter sp. 638]
 gi|166990826|sp|A4WE91|Y3359_ENT38 RecName: Full=UPF0235 protein Ent638_3359
 gi|145319874|gb|ABP62021.1| protein of unknown function DUF167 [Enterobacter sp. 638]
          Length = 96

 Score = 94.0 bits (233), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + +   +    K + I    
Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85


>gi|28899393|ref|NP_798998.1| hypothetical protein VP2619 [Vibrio parahaemolyticus RIMD
          2210633]
 gi|260366264|ref|ZP_05778723.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030]
 gi|260878919|ref|ZP_05891274.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034]
 gi|260898280|ref|ZP_05906776.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466]
 gi|33301894|sp|Q87LJ3|Y2619_VIBPA RecName: Full=UPF0235 protein VP2619
 gi|28807629|dbj|BAC60882.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD
          2210633]
 gi|308085856|gb|EFO35551.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466]
 gi|308090459|gb|EFO40154.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034]
 gi|308113515|gb|EFO51055.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030]
          Length = 96

 Score = 94.0 bits (233), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K I I+       E+  +L
Sbjct: 67 VHIEKGELGRHKQIRIESPVQIPAEVKAIL 96


>gi|84394062|ref|ZP_00992798.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01]
 gi|84375304|gb|EAP92215.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01]
          Length = 96

 Score = 94.0 bits (233), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          +++   +    K + I      I   ++  
Sbjct: 67 IKIEKGELGRHKQVRI-CSPSHIPTEVKAI 95


>gi|320539543|ref|ZP_08039210.1| putative conserved protein [Serratia symbiotica str. Tucson]
 gi|320030396|gb|EFW12408.1| putative conserved protein [Serratia symbiotica str. Tucson]
          Length = 97

 Score = 94.0 bits (233), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 18/90 (20%), Positives = 42/90 (46%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  +++S+
Sbjct: 13 IRLYIQPKASRDKIIGLH-------GDEVKVAITAPPVDGQANAHLIKFLAKQFKVARSN 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +   +    K ++I    ++I  ++   
Sbjct: 66 VTIEKGELGRHKQLHII-HPQQIPAVVAAL 94


>gi|238787374|ref|ZP_04631173.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC
          33641]
 gi|238724636|gb|EEQ16277.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC
          33641]
          Length = 90

 Score = 93.6 bits (232), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 7  LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 59

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K + I        E+  LL+
Sbjct: 60 VIIEKGELGRHKQLKIVNPQQIPPEVAALLK 90


>gi|238786243|ref|ZP_04630189.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC
          43970]
 gi|238798802|ref|ZP_04642272.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC
          43969]
 gi|238712858|gb|EEQ04924.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC
          43970]
 gi|238717373|gb|EEQ09219.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC
          43969]
          Length = 90

 Score = 93.6 bits (232), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 7  LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 59

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        E+  LL+
Sbjct: 60 VIIEKGELGRHKQIKIVNPQQIPPEVAVLLE 90


>gi|167854882|ref|ZP_02477658.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755]
 gi|219871635|ref|YP_002476010.1| hypothetical protein HAPS_1504 [Haemophilus parasuis SH0165]
 gi|254800538|sp|B8F6W0|Y1504_HAEPS RecName: Full=UPF0235 protein HAPS_1504
 gi|167853949|gb|EDS25187.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755]
 gi|219691839|gb|ACL33062.1| conserved hypothetical protein [Haemophilus parasuis SH0165]
          Length = 97

 Score = 93.6 bits (232), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 23/90 (25%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +TA P  G+AN  +L  L+K   + K
Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITAPPIDGQANAHLLKYLSKLFKVPK 65

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          SS+ +   +    K I++  + K I + ++
Sbjct: 66 SSIVLEKGELQRHKQIFVP-EPKLIPKEIE 94


>gi|254508600|ref|ZP_05120716.1| conserved hypothetical protein TIGR00251 [Vibrio parahaemolyticus
          16]
 gi|219548451|gb|EED25460.1| conserved hypothetical protein TIGR00251 [Vibrio parahaemolyticus
          16]
          Length = 96

 Score = 93.6 bits (232), Expect = 7e-18,   Method: Composition-based stats.
 Identities = 17/90 (18%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  +  L           +K+ +TA P  GKAN  +   LAK+  ++K  
Sbjct: 14 LRLYIQPKASRDKLVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          +++   +    K + +     +I   ++  
Sbjct: 67 IKIEKGELGRHKQLRV-NSPTQIPAEIKAI 95


>gi|296533899|ref|ZP_06896426.1| protein of hypothetical function DUF167 [Roseomonas cervicalis
          ATCC 49957]
 gi|296265774|gb|EFH11872.1| protein of hypothetical function DUF167 [Roseomonas cervicalis
          ATCC 49957]
          Length = 101

 Score = 93.6 bits (232), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 3/90 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V+  P A+++G+       D      +K+ V   P+ G+ANKA+ A+LA  L +  
Sbjct: 12 VELRVKAQPKARRAGLQGWIAAPDGP---RLKLAVHEAPEDGRANKAICALLAGALHVPP 68

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          S++ ++   +S  K   I  D   + E L+
Sbjct: 69 SAITVVQGATSREKTCRILGDSARLHETLE 98


>gi|89074110|ref|ZP_01160609.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34]
 gi|89050046|gb|EAR55572.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34]
          Length = 97

 Score = 93.6 bits (232), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  ++  L+K+  ++K  
Sbjct: 15 IRLYIQPKASRDQIVGLH-------GNEVKIAITAPPVDGKANAHLVKYLSKQFKVAKGL 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +        K I I+   K I   +   
Sbjct: 68 IHLEKGLQGRHKQIRIE-TPKVIPSEIATI 96


>gi|82701713|ref|YP_411279.1| hypothetical protein Nmul_A0579 [Nitrosospira multiformis ATCC
           25196]
 gi|82409778|gb|ABB73887.1| Protein of unknown function DUF167 [Nitrosospira multiformis ATCC
           25196]
          Length = 105

 Score = 93.6 bits (232), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 7/92 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P A+++ +              +KIK+ A P +G AN A+LA LA    + +  
Sbjct: 21  LTLHIQPGARRTEVVGSH-------GDALKIKLAAPPVEGAANVALLAFLAGVFGVPQRQ 73

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
           + +     S  KI+ ID        LL+   S
Sbjct: 74  VILRQGARSRRKIVEIDGTACGADTLLKQTAS 105


>gi|330431471|gb|AEC16530.1| hypothetical protein UMN179_00494 [Gallibacterium anatis UMN179]
          Length = 95

 Score = 93.6 bits (232), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 8/85 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + L P A K  I  L           +KI +TA P  GKAN  +L  L+K+  ++K+ 
Sbjct: 12 LNIILQPKAGKDQIVGL-------YGDELKITITAPPIDGKANAHLLKFLSKQFKVAKTQ 64

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88
          + +   + S  K ++I    ++I +
Sbjct: 65 IELRKGELSRHKQVFIP-SPEQIPQ 88


>gi|301154876|emb|CBW14339.1| conserved protein [Haemophilus parainfluenzae T3T1]
          Length = 95

 Score = 93.6 bits (232), Expect = 8e-18,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  I  L           +KI +TA P  G+AN  +L  L+K   + KS
Sbjct: 12 RLKIILQPKASKDQIVGLH-------DDELKITITAPPVDGQANAHLLKFLSKAFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          S+ +   + +  K ++I    K I   +QN 
Sbjct: 65 SIVLEKGELNRHKQVWIP-SPKLIPSEIQNL 94


>gi|16127852|ref|NP_422416.1| hypothetical protein CC_3622 [Caulobacter crescentus CB15]
 gi|221236673|ref|YP_002519110.1| YggU superfamily protein [Caulobacter crescentus NA1000]
 gi|47117618|sp|Q9A2E3|Y3622_CAUCR RecName: Full=UPF0235 protein CC_3622
 gi|254803798|sp|B8H6E1|Y3737_CAUCN RecName: Full=UPF0235 protein CCNA_03737
 gi|13425372|gb|AAK25584.1| conserved hypothetical protein [Caulobacter crescentus CB15]
 gi|220965846|gb|ACL97202.1| YggU superfamily protein [Caulobacter crescentus NA1000]
          Length = 98

 Score = 93.6 bits (232), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 48/90 (53%), Gaps = 3/90 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++VRL P   +       +  D     ++K++V + P +G AN A++A LAK L + +
Sbjct: 7  VTLVVRLTPRGGRDAAEGWALDADGRL--YLKVRVASPPVEGAANAALIAFLAKTLKIPR 64

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          S++R+ + +++ LK + ++  D  ++    
Sbjct: 65 SAVRLAAGETARLKRLELEGVDPADVARAF 94


>gi|330446896|ref|ZP_08310547.1| conserved hypothetical protein [Photobacterium leiognathi subsp.
          mandapamensis svers.1.1.]
 gi|328491087|dbj|GAA05044.1| conserved hypothetical protein [Photobacterium leiognathi subsp.
          mandapamensis svers.1.1.]
          Length = 97

 Score = 93.6 bits (232), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 8/88 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V + + P A +  I  L           +KI +TA P  GKAN  ++  L+K+  ++K  
Sbjct: 15 VRLYIQPKASRDQIVGLH-------GDEIKIAITAPPVDGKANAHLVKYLSKQFKVAKGL 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          + +        K + I+   K I   ++
Sbjct: 68 IHVEKGLQGRHKQVRIEA-PKAIPNEIE 94


>gi|260913359|ref|ZP_05919840.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
 gi|260632590|gb|EEX50760.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
          Length = 100

 Score = 93.3 bits (231), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 8/89 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  I  L           +KI +TA P  G+AN  +L  L+K   + KS
Sbjct: 12 RLRIFLQPKASKDQIVGLH-------DDELKITITAPPIDGQANAHLLKFLSKTFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          S+ +   + +  K I +    K I E++ 
Sbjct: 65 SIVLEKGELNRHKQILVPN-PKIIPEIVS 92


>gi|152971905|ref|YP_001337014.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp.
          pneumoniae MGH 78578]
 gi|206579395|ref|YP_002236595.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae
          342]
 gi|238896484|ref|YP_002921222.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae NTUH-K2044]
 gi|262042605|ref|ZP_06015761.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
          rhinoscleromatis ATCC 13884]
 gi|329998593|ref|ZP_08303177.1| TIGR00251 family protein [Klebsiella sp. MS 92-3]
 gi|166990767|sp|A6TDW3|Y3323_KLEP7 RecName: Full=UPF0235 protein KPN78578_33230
 gi|226708043|sp|B5XU96|Y722_KLEP3 RecName: Full=UPF0235 protein KPK_0722
 gi|150956754|gb|ABR78784.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp.
          pneumoniae MGH 78578]
 gi|206568453|gb|ACI10229.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae
          342]
 gi|238548804|dbj|BAH65155.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae subsp.
          pneumoniae NTUH-K2044]
 gi|259040039|gb|EEW41154.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
          rhinoscleromatis ATCC 13884]
 gi|328538612|gb|EGF64712.1| TIGR00251 family protein [Klebsiella sp. MS 92-3]
          Length = 96

 Score = 93.3 bits (231), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 18/92 (19%), Positives = 39/92 (42%), Gaps = 8/92 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  +           +K+ +TA P  G+AN  ++  LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGVH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
          + +   +    K + I    ++I   +     
Sbjct: 66 VLIEKGELGRHKQVKIIA-PQQIPTAVAALTE 96


>gi|121997739|ref|YP_001002526.1| hypothetical protein Hhal_0948 [Halorhodospira halophila SL1]
 gi|121589144|gb|ABM61724.1| protein of unknown function DUF167 [Halorhodospira halophila SL1]
          Length = 101

 Score = 93.3 bits (231), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 20/79 (25%), Positives = 40/79 (50%), Gaps = 8/79 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P AK+  +          +   +++++   P  GKAN A+  +LA++  ++KS+
Sbjct: 15 VHVRVTPRAKRESLD--------VEGERLRVRLNTPPVDGKANTALRKLLARQFGVAKSA 66

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + +L  + S  K + I   
Sbjct: 67 VSLLRGERSRDKTVRITAP 85


>gi|258620721|ref|ZP_05715756.1| conserved hypothetical protein [Vibrio mimicus VM573]
 gi|258625606|ref|ZP_05720488.1| conserved hypothetical protein [Vibrio mimicus VM603]
 gi|258582108|gb|EEW06975.1| conserved hypothetical protein [Vibrio mimicus VM603]
 gi|258586919|gb|EEW11633.1| conserved hypothetical protein [Vibrio mimicus VM573]
          Length = 97

 Score = 93.3 bits (231), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 41/92 (44%), Gaps = 10/92 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK+  ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKFLAKQCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQN 92
          + +   +    K + I +      EI  LL++
Sbjct: 66 VVIEKGELGRHKQVRIQQPSQIPPEIAALLES 97


>gi|283788525|ref|YP_003368390.1| hypothetical protein ROD_50331 [Citrobacter rodentium ICC168]
 gi|282951979|emb|CBG91706.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
          Length = 96

 Score = 93.3 bits (231), Expect = 9e-18,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 VAIEKGELGRHKQVKI 81


>gi|92116228|ref|YP_575957.1| hypothetical protein Nham_0608 [Nitrobacter hamburgensis X14]
 gi|91799122|gb|ABE61497.1| protein of unknown function DUF167 [Nitrobacter hamburgensis X14]
          Length = 106

 Score = 93.3 bits (231), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 26/93 (27%), Positives = 54/93 (58%), Gaps = 2/93 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           ++ +R+ P   + GI  +E+  D      +K++V A  + G+AN+A++A+LAK L + K 
Sbjct: 12  SIALRVTPRGGRDGIDGIEMLADGRP--VVKVRVRAIAEGGEANRAVMAVLAKALGVRKI 69

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
            +R+L+  +S LK + +  D  ++ + L+   +
Sbjct: 70  DVRILAGATSRLKQVAVGGDPVKLGDALRALTA 102


>gi|238759331|ref|ZP_04620497.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236]
 gi|238702492|gb|EEP95043.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236]
          Length = 90

 Score = 93.3 bits (231), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 7  LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 59

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I I        E+  L++
Sbjct: 60 VILEKGELGRHKQIKIINPQQVPPEVAALIE 90


>gi|127512067|ref|YP_001093264.1| hypothetical protein Shew_1134 [Shewanella loihica PV-4]
 gi|166228889|sp|A3QC07|Y1134_SHELP RecName: Full=UPF0235 protein Shew_1134
 gi|126637362|gb|ABO23005.1| protein of unknown function DUF167 [Shewanella loihica PV-4]
          Length = 96

 Score = 93.3 bits (231), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 8/88 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I              +K+ +TA P  GKAN  +   LAK+   +K +
Sbjct: 13 IRLYIQPKASRDQIVG-------PHGDELKVAITAPPIDGKANAHLCKFLAKQFKTAKGN 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          + +   +    K I +    KEI E + 
Sbjct: 66 ILIEKGELGRHKQIRVVM-PKEIPEAIS 92


>gi|315633577|ref|ZP_07888867.1| conserved hypothetical protein [Aggregatibacter segnis ATCC
          33393]
 gi|315477619|gb|EFU68361.1| conserved hypothetical protein [Aggregatibacter segnis ATCC
          33393]
          Length = 97

 Score = 93.3 bits (231), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 8/90 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  I  L           +KI +TA P  G+AN  +L  L+K   + KS
Sbjct: 12 RLRIFLQPKAAKDQIVGLH-------DDELKISITAPPVDGQANAHLLKFLSKLFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          S+ +   + +  K + I    K I   ++ 
Sbjct: 65 SIVLEKGELNRHKQVLIPC-PKAIPPQVEA 93


>gi|148263238|ref|YP_001229944.1| hypothetical protein Gura_1167 [Geobacter uraniireducens Rf4]
 gi|146396738|gb|ABQ25371.1| protein of unknown function DUF167 [Geobacter uraniireducens Rf4]
          Length = 102

 Score = 93.3 bits (231), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            V + P A ++ I  ++          +K+++TA P +  ANK  + +LAK L ++KS +
Sbjct: 22  TVHVQPRASRNEICGVQ-------GDELKLRLTAPPVEDAANKLCVELLAKALKVAKSRV 74

Query: 65  RMLSKQSSPLKIIYIDK-DCKEITELLQ 91
            + +   S  K + ++    + +  LL+
Sbjct: 75  TITAGAKSRHKTVKVEGITTEPVLSLLK 102


>gi|147678156|ref|YP_001212371.1| hypothetical protein PTH_1821 [Pelotomaculum thermopropionicum
          SI]
 gi|189039033|sp|A5D180|Y1821_PELTS RecName: Full=UPF0235 protein PTH_1821
 gi|146274253|dbj|BAF60002.1| uncharacterized conserved protein [Pelotomaculum
          thermopropionicum SI]
          Length = 95

 Score = 93.3 bits (231), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 7/88 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A ++ +A L           +KI++TA P  G+AN+A  A LA  L+L  
Sbjct: 11 VLLKVRVQPRAARNQVAGL-------YEDALKIRLTAPPVDGEANEACRAFLADSLSLPP 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89
          S + ++S  +S  K++ I     E    
Sbjct: 64 SKVEIVSGHASRTKVVKIAGVGAEKVRR 91


>gi|29840014|ref|NP_829120.1| hypothetical protein CCA00247 [Chlamydophila caviae GPIC]
 gi|33301875|sp|Q824A6|Y247_CHLCV RecName: Full=UPF0235 protein CCA_00247
 gi|29834361|gb|AAP04998.1| conserved hypothetical protein [Chlamydophila caviae GPIC]
          Length = 92

 Score = 93.3 bits (231), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 27/85 (31%), Positives = 50/85 (58%), Gaps = 7/85 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++ P ++++ I   E          +KI+VT  P+KGKAN+A++A+LAK L+L K  
Sbjct: 8  LEVKVTPKSRENKIVGFE-------GEVLKIRVTEAPEKGKANEAVIALLAKTLSLPKRD 60

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88
          + ++S ++S  K + + K  + I  
Sbjct: 61 VTLISGETSRKKRLLLPKSTESIIS 85


>gi|288933577|ref|YP_003437636.1| hypothetical protein Kvar_0694 [Klebsiella variicola At-22]
 gi|290511356|ref|ZP_06550725.1| conserved hypothetical protein [Klebsiella sp. 1_1_55]
 gi|288888306|gb|ADC56624.1| protein of unknown function DUF167 [Klebsiella variicola At-22]
 gi|289776349|gb|EFD84348.1| conserved hypothetical protein [Klebsiella sp. 1_1_55]
          Length = 96

 Score = 92.9 bits (230), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 18/92 (19%), Positives = 39/92 (42%), Gaps = 8/92 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  +           +K+ +TA P  G+AN  ++  LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGVH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
          + +   +    K + I    ++I   +     
Sbjct: 66 VLIEKGELGRHKQVKIIA-PQQIPTAVAALTE 96


>gi|302879874|ref|YP_003848438.1| hypothetical protein Galf_2678 [Gallionella capsiferriformans
          ES-2]
 gi|302582663|gb|ADL56674.1| protein of unknown function DUF167 [Gallionella capsiferriformans
          ES-2]
          Length = 96

 Score = 92.9 bits (230), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 7/90 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P AK+S I  L           +K+K+ A P  G+AN+A+L  +A+   +    
Sbjct: 14 LTLHVQPGAKRSEICGLH-------GEALKLKLAAPPIDGRANEALLKYIAELFRVPVRQ 66

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +     S  K++ +     +   LL + 
Sbjct: 67 VELRQGAQSRHKVVAVTDSAIQPESLLTDQ 96


>gi|62184885|ref|YP_219670.1| hypothetical protein CAB243 [Chlamydophila abortus S26/3]
 gi|81312941|sp|Q5L6M2|Y243_CHLAB RecName: Full=UPF0235 protein CAB243
 gi|62147952|emb|CAH63699.1| conserved hypothetical protein [Chlamydophila abortus S26/3]
          Length = 96

 Score = 92.9 bits (230), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 28/85 (32%), Positives = 48/85 (56%), Gaps = 7/85 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++ P +K++ I   E          +KI+VT  P+KGKAN+A++A+LAK L+L K  
Sbjct: 8  LEVKVTPKSKQNTIVGFE-------GEVLKIRVTEVPEKGKANEAVIALLAKALSLPKRD 60

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88
          + ++   +S  K I + K  + I  
Sbjct: 61 ITLIPGDTSRKKRILLPKSTESIVS 85


>gi|261867124|ref|YP_003255046.1| hypothetical protein D11S_0417 [Aggregatibacter
          actinomycetemcomitans D11S-1]
 gi|261412456|gb|ACX81827.1| hypothetical protein D11S_0417 [Aggregatibacter
          actinomycetemcomitans D11S-1]
          Length = 97

 Score = 92.9 bits (230), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 8/89 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  I  L           +KI +TA P  G+AN  +L  L+K   + KS
Sbjct: 12 RLRIFLQPKAAKDQIVGLH-------DDELKISITAPPVDGQANAHLLKFLSKLFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          S+ +   + +  K + I    K I   ++
Sbjct: 65 SIVLEKGELNRHKQVLIP-SPKVIPPQIE 92


>gi|262172428|ref|ZP_06040106.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451]
 gi|261893504|gb|EEY39490.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451]
          Length = 97

 Score = 92.9 bits (230), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 10/92 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQN 92
          + +   +    K + I +      EI  LL++
Sbjct: 66 VVIEKGELGRHKQVRIQQPNQIPPEIAALLES 97


>gi|257056705|ref|YP_003134537.1| hypothetical protein Svir_27290 [Saccharomonospora viridis DSM
           43017]
 gi|256586577|gb|ACU97710.1| uncharacterized conserved protein [Saccharomonospora viridis DSM
           43017]
          Length = 118

 Score = 92.9 bits (230), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 19/89 (21%), Positives = 39/89 (43%), Gaps = 3/89 (3%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            +R+ P AK+  +  +    D +    + + V A    GKAN+A+  +LA+ L++    L
Sbjct: 22  AIRVKPGAKRDAVGGI---WDGALGEALVVSVRAPAVDGKANEAVCRVLAEALSVRARDL 78

Query: 65  RMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            ++    +  K++ +        E L   
Sbjct: 79  TVVKGHRARDKLVELRDPPPGCAERLAEL 107


>gi|15603178|ref|NP_246251.1| hypothetical protein PM1313 [Pasteurella multocida subsp.
          multocida str. Pm70]
 gi|29839744|sp|Q9CLC6|Y1313_PASMU RecName: Full=UPF0235 protein PM1313
 gi|12721676|gb|AAK03397.1| unknown [Pasteurella multocida subsp. multocida str. Pm70]
          Length = 99

 Score = 92.9 bits (230), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 10/92 (10%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  I  L           +KI +TA P  G+AN  +L  L+K   + KS
Sbjct: 15 RLRIFLQPKASKDQIVGLH-------DNELKITITAPPIDGQANAHLLKFLSKTFKVPKS 67

Query: 63 SLRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          S+ +   + +  K I I        E+  LL+
Sbjct: 68 SIVLEKGELNRHKQILIPNPKVIPTEVNVLLK 99


>gi|293390704|ref|ZP_06635038.1| hypothetical protein D7S_0840 [Aggregatibacter
          actinomycetemcomitans D7S-1]
 gi|290951238|gb|EFE01357.1| hypothetical protein D7S_0840 [Aggregatibacter
          actinomycetemcomitans D7S-1]
          Length = 97

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 8/89 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K+ I  L           +KI +TA P  G+AN  +L  L+K   + KS
Sbjct: 12 RLRIFLQPKAAKNQIVGLH-------DDELKISITAPPVDGQANAHLLKFLSKLFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          S+ +   + +  K + I    K I   ++
Sbjct: 65 SIVLEKGELNRHKQVLIP-SPKVIPPQIE 92


>gi|260775569|ref|ZP_05884466.1| hypothetical protein VIC_000947 [Vibrio coralliilyticus ATCC
          BAA-450]
 gi|260608750|gb|EEX34915.1| hypothetical protein VIC_000947 [Vibrio coralliilyticus ATCC
          BAA-450]
          Length = 96

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A +  +  L           +KI +TA P  GKAN  +   L+K+  ++K
Sbjct: 12 VLLRLYIQPKASRDKLIGLH-------GDEIKIAITAPPVDGKANAHLSKYLSKQFKVAK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            + +   +    K + I      I E ++  
Sbjct: 65 GLITIEKGELGRHKQVRIQ-SPAHIPETIKAI 95


>gi|296104617|ref|YP_003614763.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp.
          cloacae ATCC 13047]
 gi|295059076|gb|ADF63814.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp.
          cloacae ATCC 13047]
          Length = 95

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDNIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + +   +    K + I    
Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85


>gi|90580282|ref|ZP_01236089.1| hypothetical protein VAS14_20161 [Vibrio angustum S14]
 gi|90438584|gb|EAS63768.1| hypothetical protein VAS14_20161 [Vibrio angustum S14]
          Length = 97

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  GKAN  ++  LAK+  ++K  
Sbjct: 15 IRLYIQPKASRDQIVGLH-------GNEVKIAITAPPVDGKANAHLVKYLAKQFKVAKGL 67

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + +        K I I+  
Sbjct: 68 IHVEKGLQGRHKQIRIEAP 86


>gi|170023077|ref|YP_001719582.1| hypothetical protein YPK_0828 [Yersinia pseudotuberculosis YPIII]
 gi|226708089|sp|B1JNN7|Y828_YERPY RecName: Full=UPF0235 protein YPK_0828
 gi|169749611|gb|ACA67129.1| protein of unknown function DUF167 [Yersinia pseudotuberculosis
          YPIII]
          Length = 96

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I +        E+T LL+
Sbjct: 66 VIIEKGELGRHKQIKVINPQQIPPEVTILLE 96


>gi|162418314|ref|YP_001604771.1| hypothetical protein YpAngola_A0141 [Yersinia pestis Angola]
 gi|165924856|ref|ZP_02220688.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Orientalis str. F1991016]
 gi|165937260|ref|ZP_02225824.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Orientalis str. IP275]
 gi|166010278|ref|ZP_02231176.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Antiqua str. E1979001]
 gi|166212808|ref|ZP_02238843.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Antiqua str. B42003004]
 gi|167422007|ref|ZP_02313760.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Orientalis str. MG05-1020]
 gi|167426697|ref|ZP_02318450.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Mediaevalis str. K1973002]
 gi|186896650|ref|YP_001873762.1| hypothetical protein YPTS_3350 [Yersinia pseudotuberculosis PB1/+]
 gi|294502893|ref|YP_003566955.1| hypothetical protein YPZ3_0783 [Yersinia pestis Z176003]
 gi|21960273|gb|AAM86880.1|AE013934_3 hypothetical protein y3330 [Yersinia pestis KIM 10]
 gi|45438104|gb|AAS63652.1| conserved hypothetical protein [Yersinia pestis biovar Microtus
           str. 91001]
 gi|162351129|gb|ABX85077.1| conserved hypothetical protein TIGR00251 [Yersinia pestis Angola]
 gi|165914734|gb|EDR33347.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Orientalis str. IP275]
 gi|165923056|gb|EDR40207.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Orientalis str. F1991016]
 gi|165990764|gb|EDR43065.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Antiqua str. E1979001]
 gi|166206100|gb|EDR50580.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Antiqua str. B42003004]
 gi|166960144|gb|EDR56165.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Orientalis str. MG05-1020]
 gi|167054300|gb|EDR64119.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
           Mediaevalis str. K1973002]
 gi|186699676|gb|ACC90305.1| protein of unknown function DUF167 [Yersinia pseudotuberculosis
           PB1/+]
 gi|262360928|gb|ACY57649.1| hypothetical protein YPD4_0740 [Yersinia pestis D106004]
 gi|294353352|gb|ADE63693.1| hypothetical protein YPZ3_0783 [Yersinia pestis Z176003]
          Length = 100

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 17  LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLVKFIAKQFRVAKSQ 69

Query: 64  LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
           + +   +    K I +        E+T LL+
Sbjct: 70  VIIEKGELGRHKQIKVINPQQIPPEVTILLE 100


>gi|121535253|ref|ZP_01667067.1| protein of unknown function DUF167 [Thermosinus carboxydivorans
          Nor1]
 gi|121306138|gb|EAX47066.1| protein of unknown function DUF167 [Thermosinus carboxydivorans
          Nor1]
          Length = 100

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 8/87 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           +++ P A ++ +  L           +K+ V + P +G+AN+A LA  A    ++K+ +
Sbjct: 18 KIKVQPRASRNAVIGL-------AGDSLKVCVASPPVEGEANQACLAFFAALFGVAKTRI 70

Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELL 90
           ++S Q S  K+I I   D ++   +L
Sbjct: 71 VLVSGQKSRSKVIKIMGIDMEQFKTVL 97


>gi|62129245|gb|AAX66948.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
          serovar Choleraesuis str. SC-B67]
          Length = 100

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 70 IVIEKGELGRHKQVKI 85


>gi|300024715|ref|YP_003757326.1| hypothetical protein Hden_3210 [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299526536|gb|ADJ25005.1| protein of unknown function DUF167 [Hyphomicrobium denitrificans
           ATCC 51888]
          Length = 116

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 3/88 (3%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V  RL P + K  I  +    D       + +V A P+ G AN A+  ++A+ L L K S
Sbjct: 24  VHFRLTPKSSKDAIEGVTSTSDGP---AFQARVRAVPEHGAANAALEQLVARWLDLPKRS 80

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           + +     S LK + ID + +E+  LL+
Sbjct: 81  VSLAKGGKSRLKALQIDGEPEELDRLLE 108


>gi|224584895|ref|YP_002638694.1| hypothetical protein SPC_3167 [Salmonella enterica subsp.
          enterica serovar Paratyphi C strain RKS4594]
 gi|224469423|gb|ACN47253.1| hypothetical protein SPC_3167 [Salmonella enterica subsp.
          enterica serovar Paratyphi C strain RKS4594]
          Length = 93

 Score = 92.5 bits (229), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 10 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 62

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 63 IVIEKGELGRHKQVKI 78


>gi|254443993|ref|ZP_05057469.1| conserved hypothetical protein [Verrucomicrobiae bacterium
          DG1235]
 gi|198258301|gb|EDY82609.1| conserved hypothetical protein [Verrucomicrobiae bacterium
          DG1235]
          Length = 94

 Score = 92.1 bits (228), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 27/87 (31%), Positives = 50/87 (57%), Gaps = 6/87 (6%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V+++PNA +S IA         +   +KI++ + PQ GKANKA++A LAK+  +SK+ 
Sbjct: 11 LSVKVLPNASRSEIAGW------LEDGSLKIRIQSPPQDGKANKALIAFLAKETGVSKNQ 64

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + +   ++S  K+I  ++      + L
Sbjct: 65 ISIARGETSRQKLIAFERLSSSQWQRL 91


>gi|187735949|ref|YP_001878061.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC
          BAA-835]
 gi|187426001|gb|ACD05280.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC
          BAA-835]
          Length = 96

 Score = 92.1 bits (228), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 31/94 (32%), Positives = 53/94 (56%), Gaps = 3/94 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++IPNAKKS     E  +D      +K+++ A P +GKANKA++  L+  L + +
Sbjct: 1  MKLALKVIPNAKKSEAVGWE--EDPRAGRALKLRIAAPPVEGKANKAVVLFLSAWLDIPR 58

Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNND 94
          SS+  L  +SS LK++ + D    ++  LL   D
Sbjct: 59 SSISFLRGESSRLKVVELPDGCEGKLARLLSAED 92


>gi|108806318|ref|YP_650234.1| hypothetical protein YPA_0321 [Yersinia pestis Antiqua]
 gi|108813301|ref|YP_649068.1| hypothetical protein YPN_3141 [Yersinia pestis Nepal516]
 gi|145597878|ref|YP_001161954.1| hypothetical protein YPDSF_0571 [Yersinia pestis Pestoides F]
 gi|149367047|ref|ZP_01889080.1| hypothetical protein YPE_2325 [Yersinia pestis CA88-4125]
 gi|161484758|ref|NP_670629.2| hypothetical protein y3330 [Yersinia pestis KIM 10]
 gi|161511310|ref|NP_994775.2| hypothetical protein YP_3498 [Yersinia pestis biovar Microtus
          str. 91001]
 gi|167399786|ref|ZP_02305304.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
          Antiqua str. UG05-0454]
 gi|167468163|ref|ZP_02332867.1| hypothetical protein YpesF_09749 [Yersinia pestis FV-1]
 gi|218928116|ref|YP_002345991.1| hypothetical protein YPO0944 [Yersinia pestis CO92]
 gi|229837637|ref|ZP_04457799.1| conserved protein [Yersinia pestis Pestoides A]
 gi|229840863|ref|ZP_04461022.1| conserved protein [Yersinia pestis biovar Orientalis str. PEXU2]
 gi|229842576|ref|ZP_04462731.1| conserved protein [Yersinia pestis biovar Orientalis str. India
          195]
 gi|229903764|ref|ZP_04518877.1| conserved protein [Yersinia pestis Nepal516]
 gi|270487542|ref|ZP_06204616.1| conserved hypothetical protein TIGR00251 [Yersinia pestis KIM
          D27]
 gi|29839591|sp|Q8ZHF5|Y944_YERPE RecName: Full=UPF0235 protein YPO0944/y3330/YP_3498
 gi|122383790|sp|Q1CB83|Y321_YERPA RecName: Full=UPF0235 protein YPA_0321
 gi|122384257|sp|Q1CEW2|Y3141_YERPN RecName: Full=UPF0235 protein YPN_3141
 gi|166229066|sp|A4TI69|Y571_YERPP RecName: Full=UPF0235 protein YPDSF_0571
 gi|108776949|gb|ABG19468.1| hypothetical protein YPN_3141 [Yersinia pestis Nepal516]
 gi|108778231|gb|ABG12289.1| hypothetical protein YPA_0321 [Yersinia pestis Antiqua]
 gi|115346727|emb|CAL19610.1| conserved hypothetical protein [Yersinia pestis CO92]
 gi|145209574|gb|ABP38981.1| hypothetical protein YPDSF_0571 [Yersinia pestis Pestoides F]
 gi|149290661|gb|EDM40737.1| hypothetical protein YPE_2325 [Yersinia pestis CA88-4125]
 gi|167050494|gb|EDR61902.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar
          Antiqua str. UG05-0454]
 gi|229679534|gb|EEO75637.1| conserved protein [Yersinia pestis Nepal516]
 gi|229690886|gb|EEO82940.1| conserved protein [Yersinia pestis biovar Orientalis str. India
          195]
 gi|229697229|gb|EEO87276.1| conserved protein [Yersinia pestis biovar Orientalis str. PEXU2]
 gi|229704325|gb|EEO91336.1| conserved protein [Yersinia pestis Pestoides A]
 gi|270336046|gb|EFA46823.1| conserved hypothetical protein TIGR00251 [Yersinia pestis KIM
          D27]
 gi|320013974|gb|ADV97545.1| conserved protein [Yersinia pestis biovar Medievalis str. Harbin
          35]
          Length = 96

 Score = 92.1 bits (228), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLVKFIAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I +        E+T LL+
Sbjct: 66 VIIEKGELGRHKQIKVINPQQIPPEVTILLE 96


>gi|319780857|ref|YP_004140333.1| hypothetical protein Mesci_1119 [Mesorhizobium ciceri biovar
           biserrulae WSM1271]
 gi|317166745|gb|ADV10283.1| protein of unknown function DUF167 [Mesorhizobium ciceri biovar
           biserrulae WSM1271]
          Length = 105

 Score = 92.1 bits (228), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 49/92 (53%), Gaps = 2/92 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL P A    +  +E   D  +  H+K +V A P+ G AN A+  ++AK + +  
Sbjct: 12  VELFVRLTPKAALDRLEGIETTAD--ERSHLKARVRAVPENGAANHALEKLIAKAIGVPG 69

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           S++ +++  ++ LK + I+ D + + + ++  
Sbjct: 70  SAVSVVAGGTARLKTVRIEGDPETLAKSIEAL 101


>gi|237729884|ref|ZP_04560365.1| conserved hypothetical protein [Citrobacter sp. 30_2]
 gi|283835286|ref|ZP_06355027.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
 gi|226908490|gb|EEH94408.1| conserved hypothetical protein [Citrobacter sp. 30_2]
 gi|291068444|gb|EFE06553.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
          Length = 96

 Score = 92.1 bits (228), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 VVIEKGELGRHKQVKI 81


>gi|30248412|ref|NP_840482.1| hypothetical protein NE0395 [Nitrosomonas europaea ATCC 19718]
 gi|47117515|sp|Q82X93|Y395_NITEU RecName: Full=UPF0235 protein NE0395
 gi|30138298|emb|CAD84306.1| DUF167 [Nitrosomonas europaea ATCC 19718]
          Length = 100

 Score = 92.1 bits (228), Expect = 2e-17,   Method: Composition-based stats.
 Identities = 17/96 (17%), Positives = 43/96 (44%), Gaps = 10/96 (10%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  + + + P A+++    +           +KIK+ A P  GKAN+A+   LAK+  + 
Sbjct: 12  LLILKLYVQPGARQTEAVGI-------CGEELKIKLAALPVDGKANRALTEFLAKRFNVP 64

Query: 61  KSSLRMLSKQSSPLKIIYI---DKDCKEITELLQNN 93
           + ++ +   + S  K++ +       + +   ++  
Sbjct: 65  RKNITLKRGEQSRHKVVEVCQSSNGPEVLFSEMRAE 100


>gi|300115523|ref|YP_003762098.1| hypothetical protein Nwat_3056 [Nitrosococcus watsonii C-113]
 gi|299541460|gb|ADJ29777.1| protein of unknown function DUF167 [Nitrosococcus watsonii C-113]
          Length = 102

 Score = 92.1 bits (228), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 7/84 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V +RL P A+   +              +KI++TA P +GKAN  +L  L K   +S++ 
Sbjct: 15 VQIRLQPRARGDEVIG-------PHGNRLKIRITAPPVEGKANTQLLRFLVKTFQVSRNQ 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87
          + +LS  +S  K + I+K  K + 
Sbjct: 68 VYLLSGTASRDKRVRIEKPAKLLP 91


>gi|215488251|ref|YP_002330682.1| hypothetical protein E2348C_3206 [Escherichia coli O127:H6 str.
          E2348/69]
 gi|312964784|ref|ZP_07779024.1| conserved hypothetical protein [Escherichia coli 2362-75]
 gi|331684580|ref|ZP_08385172.1| conserved hypothetical protein [Escherichia coli H299]
 gi|254814154|sp|B7UI00|YGGU_ECO27 RecName: Full=UPF0235 protein yggU
 gi|215266323|emb|CAS10754.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
 gi|312290340|gb|EFR18220.1| conserved hypothetical protein [Escherichia coli 2362-75]
 gi|331078195|gb|EGI49401.1| conserved hypothetical protein [Escherichia coli H299]
          Length = 96

 Score = 92.1 bits (228), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K I I        EI  LL
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEIAALL 95


>gi|170766160|ref|ZP_02900971.1| conserved hypothetical protein [Escherichia albertii TW07627]
 gi|170125306|gb|EDS94237.1| conserved hypothetical protein [Escherichia albertii TW07627]
          Length = 96

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIRIINPQQIPPEI 91


>gi|110635561|ref|YP_675769.1| hypothetical protein Meso_3232 [Mesorhizobium sp. BNC1]
 gi|110286545|gb|ABG64604.1| protein of unknown function DUF167 [Chelativorans sp. BNC1]
          Length = 116

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 24/92 (26%), Positives = 40/92 (43%), Gaps = 2/92 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P +    I  +    D      +K +V A P+ GKAN+A+  +LA  L + +  
Sbjct: 16  IFVRLTPKSSSDAIEGVTEGPDGQ--AFLKARVRAIPEAGKANEALERLLASALGVPRRD 73

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
           + + S  +S  K + I  D   +   L     
Sbjct: 74  VAVSSGAASRRKTVSITGDAAPLIASLDTIAE 105


>gi|190150700|ref|YP_001969225.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae
          serovar 7 str. AP76]
 gi|307264052|ref|ZP_07545650.1| hypothetical protein appser13_14550 [Actinobacillus
          pleuropneumoniae serovar 13 str. N273]
 gi|226734158|sp|B3GYF9|Y1431_ACTP7 RecName: Full=UPF0235 protein APP7_1431
 gi|189915831|gb|ACE62083.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae
          serovar 7 str. AP76]
 gi|306870598|gb|EFN02344.1| hypothetical protein appser13_14550 [Actinobacillus
          pleuropneumoniae serovar 13 str. N273]
          Length = 97

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +TA P  G AN  +L  L+K   + K
Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DSELKIAITAPPVDGAANAHLLKYLSKLFKVPK 65

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          SS+ +   +    K +++  + K I + ++  
Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLIPKEIEAL 96


>gi|297580597|ref|ZP_06942523.1| conserved hypothetical protein [Vibrio cholerae RC385]
 gi|297535013|gb|EFH73848.1| conserved hypothetical protein [Vibrio cholerae RC385]
          Length = 96

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EIT L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96


>gi|328952006|ref|YP_004369340.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109]
 gi|328452330|gb|AEB08159.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109]
          Length = 101

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 7/90 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + ++P A  + I              +KI++ A P+KG ANK +L  LAK L L K+ 
Sbjct: 13 LRIHVVPGAASNQIMG-------PHGDRLKIRIAAAPEKGAANKELLNYLAKCLGLPKNR 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          L + S     +K++ +     E+ E LQ  
Sbjct: 66 LHLKSGAQDRVKVVEVVGLAPEVQERLQAL 95


>gi|296445622|ref|ZP_06887577.1| protein of unknown function DUF167 [Methylosinus trichosporium
           OB3b]
 gi|296256867|gb|EFH03939.1| protein of unknown function DUF167 [Methylosinus trichosporium
           OB3b]
          Length = 109

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 49/91 (53%), Gaps = 2/91 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL P   +  ++ +E   D      +K +V A P+ G+AN+A++A++A+ L   K
Sbjct: 14  VVLWVRLTPKGGRDALSGVETLADG--RAVLKARVRAAPEDGRANEALVALIAQALGAPK 71

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92
            S+++ +  ++ LK ++I  D   +   L+ 
Sbjct: 72  RSVQIAAGHTARLKKLFIAGDPASLVAALEK 102


>gi|51597526|ref|YP_071717.1| hypothetical protein YPTB3216 [Yersinia pseudotuberculosis IP
          32953]
 gi|153948675|ref|YP_001399811.1| hypothetical protein YpsIP31758_0827 [Yersinia pseudotuberculosis
          IP 31758]
 gi|81638596|sp|Q666N2|Y3216_YERPS RecName: Full=UPF0235 protein YPTB3216
 gi|167016819|sp|A7FEY3|Y827_YERP3 RecName: Full=UPF0235 protein YpsIP31758_0827
 gi|51590808|emb|CAH22454.1| Conserved hypothetical protein [Yersinia pseudotuberculosis IP
          32953]
 gi|152960170|gb|ABS47631.1| conserved hypothetical protein TIGR00251 [Yersinia
          pseudotuberculosis IP 31758]
          Length = 96

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  +AK+  ++KS 
Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K I +        E+T LL+
Sbjct: 66 VIIEKGELGRHKQIKVINPQQIPPEVTILLK 96


>gi|83594855|ref|YP_428607.1| hypothetical protein Rru_A3526 [Rhodospirillum rubrum ATCC 11170]
 gi|83577769|gb|ABC24320.1| Protein of unknown function DUF167 [Rhodospirillum rubrum ATCC
           11170]
          Length = 113

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 29/88 (32%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + +RL P A + G++ +    D S    +K  VTA P+ GKAN A+L +L+++  L +S
Sbjct: 18  RLALRLTPKAGRDGVSGVVAEADGSL--VVKASVTAVPEDGKANAALLKLLSRQWKLPRS 75

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELL 90
           SL ++  Q+   K+I I  +   +T  L
Sbjct: 76  SLAVVHGQTDRRKVIEISGEPALLTPRL 103


>gi|331648708|ref|ZP_08349796.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331042455|gb|EGI14597.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 100

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95


>gi|162139546|ref|YP_218029.2| hypothetical protein SC3042 [Salmonella enterica subsp. enterica
          serovar Choleraesuis str. SC-B67]
 gi|168242905|ref|ZP_02667837.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Heidelberg str. SL486]
 gi|194448304|ref|YP_002047090.1| hypothetical protein SeHA_C3341 [Salmonella enterica subsp.
          enterica serovar Heidelberg str. SL476]
 gi|197249095|ref|YP_002148016.1| hypothetical protein SeAg_B3265 [Salmonella enterica subsp.
          enterica serovar Agona str. SL483]
 gi|226730824|sp|B5F5M7|YGGU_SALA4 RecName: Full=UPF0235 protein yggU
 gi|226730828|sp|B4THI7|YGGU_SALHS RecName: Full=UPF0235 protein yggU
 gi|194406608|gb|ACF66827.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Heidelberg str. SL476]
 gi|197212798|gb|ACH50195.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Agona str. SL483]
 gi|205338138|gb|EDZ24902.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Heidelberg str. SL486]
 gi|322716095|gb|EFZ07666.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
          Choleraesuis str. A50]
          Length = 96

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 IVIEKGELGRHKQVKI 81


>gi|156932605|ref|YP_001436521.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC
          BAA-894]
 gi|260599282|ref|YP_003211853.1| hypothetical protein CTU_34900 [Cronobacter turicensis z3032]
 gi|166229055|sp|A7MP89|Y387_ENTS8 RecName: Full=UPF0235 protein ESA_00387
 gi|156530859|gb|ABU75685.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC
          BAA-894]
 gi|260218459|emb|CBA33595.1| UPF0235 protein ESA_00387 [Cronobacter turicensis z3032]
          Length = 96

 Score = 91.7 bits (227), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 40/91 (43%), Gaps = 8/91 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  LAK+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIIGLH-------GDELKVAITAPPVDGQANAHLVKYLAKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
          + +   +    K + I  + ++I   +    
Sbjct: 66 VVIEKGELGRHKQVKII-EPQQIPTEVAAVT 95


>gi|187732088|ref|YP_001881726.1| hypothetical protein SbBS512_E3385 [Shigella boydii CDC 3083-94]
 gi|188496122|ref|ZP_03003392.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638]
 gi|331643646|ref|ZP_08344777.1| conserved hypothetical protein [Escherichia coli H736]
 gi|882482|gb|AAA69120.1| ORF_o100 [Escherichia coli str. K-12 substr. MG1655]
 gi|81246835|gb|ABB67543.1| conserved hypothetical protein [Shigella boydii Sb227]
 gi|187429080|gb|ACD08354.1| conserved hypothetical protein [Shigella boydii CDC 3083-94]
 gi|188491321|gb|EDU66424.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638]
 gi|331037117|gb|EGI09341.1| conserved hypothetical protein [Escherichia coli H736]
          Length = 100

 Score = 91.3 bits (226), Expect = 3e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95


>gi|217979671|ref|YP_002363818.1| protein of unknown function DUF167 [Methylocella silvestris BL2]
 gi|217505047|gb|ACK52456.1| protein of unknown function DUF167 [Methylocella silvestris BL2]
          Length = 118

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 25/97 (25%), Positives = 49/97 (50%), Gaps = 2/97 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL P + +  I       D      +K +V A PQ G+AN A++ ++AK L L+ 
Sbjct: 22  VVLTVRLTPKSARDEIEGASQLSDG--RAVLKARVRAAPQDGEANAALIRLVAKALRLAP 79

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98
           S++R+ +  ++ LK + +  D + + + L    +  +
Sbjct: 80  SAVRVEAGATARLKTLCLTGDPETLQQSLAELAARAV 116


>gi|168703344|ref|ZP_02735621.1| hypothetical protein GobsU_27681 [Gemmata obscuriglobus UQM 2246]
          Length = 101

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 28/94 (29%), Positives = 51/94 (54%), Gaps = 8/94 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
          C + VR+ P AKK+ +              +++ VTA P+ G+AN A+LA+L     L +
Sbjct: 12 CTLAVRVQPKAKKNAVLG-------ERASALRVSVTAPPEDGRANDAVLALLCDHFKLQR 64

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94
          S L +LS Q++  K+I +     +++ +L+  +D
Sbjct: 65 SQLALLSGQTNRNKVILVRGVTPQQLADLIPASD 98


>gi|289164111|ref|YP_003454249.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150]
 gi|288857284|emb|CBJ11111.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150]
          Length = 91

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P AKKS I  +           +KI++ A P +G+ANK +L  +A+   +  S 
Sbjct: 12 LYLYVQPGAKKSEIVGMH-------EGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQ 64

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + +     S  K++ +   
Sbjct: 65 VVLKRGDKSRHKVLLVKNS 83


>gi|171914386|ref|ZP_02929856.1| hypothetical protein VspiD_24445 [Verrucomicrobium spinosum DSM
          4136]
          Length = 92

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 2/84 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           N+  ++ PNA++S I       D      + +K+ A   +GKANK ++  LA++L  +K
Sbjct: 6  VNLACKVTPNARRSEIVGW--GADEQGRGVLLVKLAAPALEGKANKELVRFLAEQLGCAK 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
            + +L   +S  K++ +     E
Sbjct: 64 GEVSLLRGDASRTKLLRVPGKAYE 87


>gi|157148503|ref|YP_001455822.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895]
 gi|166227247|sp|A8APH3|Y4329_CITK8 RecName: Full=UPF0235 protein CKO_04329
 gi|157085708|gb|ABV15386.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895]
          Length = 96

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 VVIEKGELGRHKQVKI 81


>gi|83648992|ref|YP_437427.1| hypothetical protein HCH_06357 [Hahella chejuensis KCTC 2396]
 gi|83637035|gb|ABC33002.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
          Length = 102

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   L P AKK  I              +KIK++A P  G+AN+ ++  LAK   + +  
Sbjct: 20 LQCHLQPGAKKDEIVGTH-------GDALKIKISAPPIDGRANQQLVRFLAKLCRVKQQD 72

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          +++L+ +SS  K I +     +I +LL+
Sbjct: 73 VQILAGESSRQKRIRVQNLT-DIPKLLK 99


>gi|307293981|ref|ZP_07573825.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum
          L-1]
 gi|306880132|gb|EFN11349.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum
          L-1]
          Length = 109

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 24/84 (28%), Positives = 47/84 (55%), Gaps = 2/84 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRL P A +  I  +   +  +    +  +V A P+KG+AN A++A+LAK+L   +S+
Sbjct: 13 LAVRLTPGAAREDIGGVWTDEKGAQ--WLGARVRAVPEKGRANTALIALLAKRLDWPRSA 70

Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87
          + + S  ++ LK + I+   + + 
Sbjct: 71 ISLESGDTNRLKRLRIEGGGEPLP 94


>gi|51244641|ref|YP_064525.1| hypothetical protein DP0789 [Desulfotalea psychrophila LSv54]
 gi|50875678|emb|CAG35518.1| hypothetical protein DP0789 [Desulfotalea psychrophila LSv54]
          Length = 95

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 7/89 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++V   P A K+ +  L           +KI   + P  GKANK ++  L++ L   K
Sbjct: 12 VILLVYTQPRASKTKVVGLH-------DGMLKIACCSPPVDGKANKELIVFLSRLLDCRK 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
            + +L  QSS  K   +     E+ + L
Sbjct: 65 CDIELLRGQSSRRKQFVLTGVDAELLDKL 93


>gi|91212335|ref|YP_542321.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89]
 gi|110806861|ref|YP_690381.1| hypothetical protein SFV_3007 [Shigella flexneri 5 str. 8401]
 gi|157155173|ref|YP_001464306.1| hypothetical protein EcE24377A_3297 [Escherichia coli E24377A]
 gi|157162414|ref|YP_001459732.1| hypothetical protein EcHS_A3113 [Escherichia coli HS]
 gi|237706394|ref|ZP_04536875.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
 gi|254038003|ref|ZP_04872061.1| conserved hypothetical protein [Escherichia sp. 1_1_43]
 gi|331654465|ref|ZP_08355465.1| conserved hypothetical protein [Escherichia coli M718]
 gi|331674433|ref|ZP_08375193.1| conserved hypothetical protein [Escherichia coli TA280]
 gi|331678947|ref|ZP_08379621.1| conserved hypothetical protein [Escherichia coli H591]
 gi|26109782|gb|AAN81987.1|AE016766_75 Hypothetical protein yggU [Escherichia coli CFT073]
 gi|24053355|gb|AAN44425.1| orf, conserved hypothetical protein [Shigella flexneri 2a str.
          301]
 gi|30042526|gb|AAP18250.1| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T]
 gi|91073909|gb|ABE08790.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89]
 gi|110616409|gb|ABF05076.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
 gi|157068094|gb|ABV07349.1| conserved hypothetical protein TIGR00251 [Escherichia coli HS]
 gi|157077203|gb|ABV16911.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          E24377A]
 gi|195183146|dbj|BAG66691.1| predicted protein [Escherichia coli O111:H-]
 gi|226839627|gb|EEH71648.1| conserved hypothetical protein [Escherichia sp. 1_1_43]
 gi|226899434|gb|EEH85693.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
 gi|294491613|gb|ADE90369.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          IHE3034]
 gi|331047847|gb|EGI19924.1| conserved hypothetical protein [Escherichia coli M718]
 gi|331068527|gb|EGI39922.1| conserved hypothetical protein [Escherichia coli TA280]
 gi|331073777|gb|EGI45098.1| conserved hypothetical protein [Escherichia coli H591]
 gi|332102705|gb|EGJ06051.1| conserved hypothetical protein [Shigella sp. D9]
          Length = 100

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95


>gi|209694133|ref|YP_002262061.1| hypothetical protein VSAL_I0540 [Aliivibrio salmonicida LFI1238]
 gi|208008084|emb|CAQ78225.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238]
          Length = 83

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 15/73 (20%), Positives = 31/73 (42%), Gaps = 7/73 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + L P A +  I  +           +K+ +TA P  GKAN  ++   +K   ++K  
Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKVAITAPPVDGKANAHLIKYFSKLFKVAKGK 64

Query: 64 LRMLSKQSSPLKI 76
          + +   + +  K 
Sbjct: 65 ITVEKGELNRHKQ 77


>gi|28373966|pdb|1N91|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia
          Coli. Northeast Structural Genomics Consortium Target
          Er14.
 gi|60594359|pdb|1YH5|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia
          Coli. Northeast Structural Genomics Consortium Target
          Er14
          Length = 108

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + +   +    K I I    +
Sbjct: 70 VVIEKGELGRHKQIKIINPQQ 90


>gi|240849835|ref|YP_002971223.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup]
 gi|240266958|gb|ACS50546.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup]
          Length = 104

 Score = 91.3 bits (226), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V LIP +    I  +E         ++ I++ A P+ GKANKA++  LAK+  +  SS
Sbjct: 12 LFVYLIPKSSVDKIIGVECRDGEKQ--YLVIRLRAVPEDGKANKALIKFLAKQWKIPSSS 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + + +   S  K +Y     +E+ ++ Q+
Sbjct: 70 ISLKNGAISRYKQLYFSTHLEELKQIWQS 98


>gi|149192342|ref|ZP_01870547.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1]
 gi|148833820|gb|EDL50852.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1]
          Length = 95

 Score = 90.9 bits (225), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 35/90 (38%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  +               K+ +TA P  GKAN  +   LAK+  ++K  
Sbjct: 13 LRIYVQPKASRDKLVG-------EHGEEFKVAITAPPVDGKANAHLSKYLAKQFKVAKGQ 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K + I       KE   LL
Sbjct: 66 VLIEKGELGRHKQLRIVSPALIPKEFMALL 95


>gi|119946664|ref|YP_944344.1| hypothetical protein Ping_3043 [Psychromonas ingrahamii 37]
 gi|166229363|sp|A1SZ30|Y3043_PSYIN RecName: Full=UPF0235 protein Ping_3043
 gi|119865268|gb|ABM04745.1| hypothetical protein DUF167 [Psychromonas ingrahamii 37]
          Length = 98

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 19/92 (20%), Positives = 40/92 (43%), Gaps = 8/92 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + L P + +     L           +KI +TA P  GKAN  ++  L+K+  ++K +
Sbjct: 15 LRLVLQPKSSRDQFIGL-------LGDELKIAITAPPVDGKANAHLIKFLSKQFKVAKGA 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
          + +     S  K + +    K++ E   + + 
Sbjct: 68 IIIEKGLLSRHKRVRV-CAPKKMPEFFNSLNE 98


>gi|23011359|ref|ZP_00051742.1| COG1872: Uncharacterized conserved protein [Magnetospirillum
          magnetotacticum MS-1]
          Length = 99

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 26/89 (29%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRL P A ++G+  + +  D      + ++V A P +G AN A+ A +AK L L K+ 
Sbjct: 8  LSVRLTPRASRTGLDGVRVDADGRP--VLGLRVAAPPVEGAANAALTAFVAKSLKLRKAE 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + ++S ++S  K +++  D KE+   ++ 
Sbjct: 66 VVLVSGEASRTKRLHLTGDAKELAARVET 94


>gi|161506346|ref|YP_001573458.1| hypothetical protein SARI_04543 [Salmonella enterica subsp.
          arizonae serovar 62:z4,z23:-- str. RSK2980]
 gi|189030103|sp|A9MQS2|YGGU_SALAR RecName: Full=UPF0235 protein yggU
 gi|160867693|gb|ABX24316.1| hypothetical protein SARI_04543 [Salmonella enterica subsp.
          arizonae serovar 62:z4,z23:--]
          Length = 96

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDCIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 VAIEKGELGRHKQVKI 81


>gi|329942568|ref|ZP_08291378.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10]
 gi|313847795|emb|CBY16785.1| conserved hypothetical protein [Chlamydophila psittaci RD1]
 gi|325506510|gb|ADZ18148.1| conserved hypothetical protein [Chlamydophila psittaci 6BC]
 gi|328815478|gb|EGF85466.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10]
 gi|328914447|gb|AEB55280.1| conserved hypothetical protein [Chlamydophila psittaci 6BC]
          Length = 92

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 29/85 (34%), Positives = 50/85 (58%), Gaps = 7/85 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++ P +K++ I   E          +KI+VT  P+KGKAN+A++A+LAK L+L K  
Sbjct: 8  LEVKVTPKSKENKIVGFE-------GEVLKIRVTEVPEKGKANEAVIALLAKTLSLPKRD 60

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88
          + ++S ++S  K I + K  + I  
Sbjct: 61 VTLISGETSKNKRILLPKATESIVS 85


>gi|12517499|gb|AAG58084.1|AE005525_10 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
 gi|13363301|dbj|BAB37252.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
 gi|209760028|gb|ACI78326.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760030|gb|ACI78327.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760032|gb|ACI78328.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760034|gb|ACI78329.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760036|gb|ACI78330.1| hypothetical protein ECs3829 [Escherichia coli]
          Length = 100

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + +   +    K I I    +
Sbjct: 70 VVIEKGELGRHKQIKIINPQQ 90


>gi|56415040|ref|YP_152115.1| hypothetical protein SPA2965 [Salmonella enterica subsp. enterica
          serovar Paratyphi A str. ATCC 9150]
 gi|168819868|ref|ZP_02831868.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|194446740|ref|YP_002042361.1| hypothetical protein SNSL254_A3349 [Salmonella enterica subsp.
          enterica serovar Newport str. SL254]
 gi|197363969|ref|YP_002143606.1| hypothetical protein SSPA2764 [Salmonella enterica subsp.
          enterica serovar Paratyphi A str. AKU_12601]
 gi|204928324|ref|ZP_03219524.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|238909900|ref|ZP_04653737.1| hypothetical protein SentesTe_02030 [Salmonella enterica subsp.
          enterica serovar Tennessee str. CDC07-0191]
 gi|81361856|sp|Q5PML0|YGGU_SALPA RecName: Full=UPF0235 protein yggU
 gi|226730829|sp|B4T5K9|YGGU_SALNS RecName: Full=UPF0235 protein yggU
 gi|226730830|sp|B5BFQ9|YGGU_SALPK RecName: Full=UPF0235 protein yggU
 gi|56129297|gb|AAV78803.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Paratyphi A str. ATCC 9150]
 gi|194405403|gb|ACF65625.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Newport str. SL254]
 gi|197095446|emb|CAR61005.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Paratyphi A str. AKU_12601]
 gi|204322646|gb|EDZ07843.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205343351|gb|EDZ30115.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|320087534|emb|CBY97299.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
          Weltevreden str. 2007-60-3289-1]
 gi|322613499|gb|EFY10440.1| hypothetical protein SEEM315_07205 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 315996572]
 gi|322621091|gb|EFY17949.1| hypothetical protein SEEM971_19929 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 495297-1]
 gi|322624155|gb|EFY20989.1| hypothetical protein SEEM973_20050 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 495297-3]
 gi|322628106|gb|EFY24895.1| hypothetical protein SEEM974_21245 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 495297-4]
 gi|322633225|gb|EFY29967.1| hypothetical protein SEEM201_12335 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 515920-1]
 gi|322636197|gb|EFY32905.1| hypothetical protein SEEM202_13733 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 515920-2]
 gi|322639535|gb|EFY36223.1| hypothetical protein SEEM954_11862 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 531954]
 gi|322647532|gb|EFY44021.1| hypothetical protein SEEM054_10407 [Salmonella enterica subsp.
          enterica serovar Montevideo str. NC_MB110209-0054]
 gi|322648716|gb|EFY45163.1| hypothetical protein SEEM675_04396 [Salmonella enterica subsp.
          enterica serovar Montevideo str. OH_2009072675]
 gi|322653771|gb|EFY50097.1| hypothetical protein SEEM965_22046 [Salmonella enterica subsp.
          enterica serovar Montevideo str. CASC_09SCPH15965]
 gi|322657877|gb|EFY54145.1| hypothetical protein SEEM19N_18161 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 19N]
 gi|322663980|gb|EFY60179.1| hypothetical protein SEEM801_04461 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 81038-01]
 gi|322669009|gb|EFY65160.1| hypothetical protein SEEM507_10116 [Salmonella enterica subsp.
          enterica serovar Montevideo str. MD_MDA09249507]
 gi|322672997|gb|EFY69104.1| hypothetical protein SEEM877_00665 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 414877]
 gi|322678012|gb|EFY74075.1| hypothetical protein SEEM867_02147 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 366867]
 gi|322681188|gb|EFY77221.1| hypothetical protein SEEM180_20639 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 413180]
 gi|322687882|gb|EFY83849.1| hypothetical protein SEEM600_15746 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 446600]
 gi|323194922|gb|EFZ80109.1| hypothetical protein SEEM581_15857 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 609458-1]
 gi|323199626|gb|EFZ84716.1| hypothetical protein SEEM501_16285 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 556150-1]
 gi|323202627|gb|EFZ87667.1| hypothetical protein SEEM460_18160 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 609460]
 gi|323207886|gb|EFZ92832.1| hypothetical protein SEEM020_18480 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 507440-20]
 gi|323212562|gb|EFZ97379.1| hypothetical protein SEEM6152_10068 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 556152]
 gi|323214955|gb|EFZ99703.1| hypothetical protein SEEM0077_03334 [Salmonella enterica subsp.
          enterica serovar Montevideo str. MB101509-0077]
 gi|323222685|gb|EGA07050.1| hypothetical protein SEEM0047_09590 [Salmonella enterica subsp.
          enterica serovar Montevideo str. MB102109-0047]
 gi|323225428|gb|EGA09660.1| hypothetical protein SEEM0055_15033 [Salmonella enterica subsp.
          enterica serovar Montevideo str. MB110209-0055]
 gi|323230557|gb|EGA14675.1| hypothetical protein SEEM0052_19874 [Salmonella enterica subsp.
          enterica serovar Montevideo str. MB111609-0052]
 gi|323235092|gb|EGA19178.1| hypothetical protein SEEM3312_06118 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 2009083312]
 gi|323239131|gb|EGA23181.1| hypothetical protein SEEM5258_05995 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 2009085258]
 gi|323244511|gb|EGA28517.1| hypothetical protein SEEM1156_01212 [Salmonella enterica subsp.
          enterica serovar Montevideo str. 315731156]
 gi|323247126|gb|EGA31092.1| hypothetical protein SEEM9199_21270 [Salmonella enterica subsp.
          enterica serovar Montevideo str. IA_2009159199]
 gi|323253391|gb|EGA37220.1| hypothetical protein SEEM8282_09961 [Salmonella enterica subsp.
          enterica serovar Montevideo str. IA_2010008282]
 gi|323256302|gb|EGA40038.1| hypothetical protein SEEM8283_11916 [Salmonella enterica subsp.
          enterica serovar Montevideo str. IA_2010008283]
 gi|323262522|gb|EGA46078.1| hypothetical protein SEEM8284_02121 [Salmonella enterica subsp.
          enterica serovar Montevideo str. IA_2010008284]
 gi|323267382|gb|EGA50866.1| hypothetical protein SEEM8285_21407 [Salmonella enterica subsp.
          enterica serovar Montevideo str. IA_2010008285]
 gi|323269214|gb|EGA52669.1| hypothetical protein SEEM8287_10997 [Salmonella enterica subsp.
          enterica serovar Montevideo str. IA_2010008287]
 gi|323669738|emb|CBJ94862.1| conserved hypothetical protein [Salmonella bongori]
 gi|327412908|emb|CAX67922.1| conserved hypothetical protein [Salmonella bongori]
          Length = 96

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 IVIEKGELGRHKQVKI 81


>gi|222054893|ref|YP_002537255.1| protein of unknown function DUF167 [Geobacter sp. FRC-32]
 gi|221564182|gb|ACM20154.1| protein of unknown function DUF167 [Geobacter sp. FRC-32]
          Length = 102

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 7/80 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V + P A ++ I  ++          +++++TA P    ANK  + +LAK L ++KS L
Sbjct: 22 TVHVQPRASRNEICGVQ-------GDELRLRLTAPPVDDAANKLCIELLAKALGVAKSHL 74

Query: 65 RMLSKQSSPLKIIYIDKDCK 84
           + S   S  K I  +   K
Sbjct: 75 ALTSGAKSRHKTIRAEGVSK 94


>gi|81242423|gb|ABB63133.1| conserved hypothetical protein [Shigella dysenteriae Sd197]
          Length = 100

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANGHLVKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95


>gi|15618408|ref|NP_224693.1| hypothetical protein CPn0497 [Chlamydophila pneumoniae CWL029]
 gi|15836028|ref|NP_300552.1| hypothetical protein CPj0497 [Chlamydophila pneumoniae J138]
 gi|16752546|ref|NP_444808.1| hypothetical protein CP0257 [Chlamydophila pneumoniae AR39]
 gi|33241848|ref|NP_876789.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183]
 gi|29839694|sp|Q9Z854|Y497_CHLPN RecName: Full=UPF0235 protein CPn_0497/CP_0257/CPj0497/CpB0517
 gi|4376783|gb|AAD18637.1| CT388 hypothetical protein [Chlamydophila pneumoniae CWL029]
 gi|7189184|gb|AAF38120.1| conserved hypothetical protein [Chlamydophila pneumoniae AR39]
 gi|8978867|dbj|BAA98703.1| CT388 hypothetical protein [Chlamydophila pneumoniae J138]
 gi|33236357|gb|AAP98446.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183]
 gi|269303374|gb|ACZ33474.1| conserved hypothetical protein TIGR00251 [Chlamydophila
          pneumoniae LPCoLN]
          Length = 90

 Score = 90.9 bits (225), Expect = 5e-17,   Method: Composition-based stats.
 Identities = 25/87 (28%), Positives = 49/87 (56%), Gaps = 7/87 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++ P AK++ I   +          +K++VT  P+KGKAN A++++LAK L+L K  
Sbjct: 7  LEVKVTPKAKENKIVGFD-------GQALKVRVTEPPEKGKANDAVISLLAKALSLPKRD 59

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + +++ ++S  K   +    ++I   L
Sbjct: 60 VTLIAGETSRKKKFLLPNRVQDIIFSL 86


>gi|229530300|ref|ZP_04419688.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)]
 gi|229332073|gb|EEN97561.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)]
 gi|327483312|gb|AEA77719.1| UPF0235 protein [Vibrio cholerae LMA3894-4]
          Length = 96

 Score = 90.9 bits (225), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EIT L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96


>gi|147674564|ref|YP_001215984.1| hypothetical protein VC0395_A0010 [Vibrio cholerae O395]
 gi|153216287|ref|ZP_01950380.1| conserved hypothetical protein [Vibrio cholerae 1587]
 gi|262167148|ref|ZP_06034862.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27]
 gi|172047756|sp|A5F9H9|Y1210_VIBC3 RecName: Full=UPF0235 protein VC0395_A0010/VC395_0502
 gi|124114376|gb|EAY33196.1| conserved hypothetical protein [Vibrio cholerae 1587]
 gi|146316447|gb|ABQ20986.1| conserved hypothetical protein [Vibrio cholerae O395]
 gi|227012311|gb|ACP08521.1| conserved hypothetical protein [Vibrio cholerae O395]
 gi|262024448|gb|EEY43135.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27]
          Length = 96

 Score = 90.6 bits (224), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EIT L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96


>gi|281179962|dbj|BAI56292.1| conserved hypothetical protein [Escherichia coli SE15]
 gi|330908988|gb|EGH37502.1| hypothetical protein ECAA86_03160 [Escherichia coli AA86]
          Length = 96

 Score = 90.6 bits (224), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|323136497|ref|ZP_08071579.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242]
 gi|322398571|gb|EFY01091.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242]
          Length = 110

 Score = 90.6 bits (224), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 2/91 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V +RL P   +  I  +E   D      +K +V A P+ G+AN A++ ++AK L   K++
Sbjct: 18  VWLRLTPKGGRDAIEGVETLSDG--RAVLKARVRAAPEDGRANAALIELIAKALRAPKNA 75

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           + + S ++S +K I+I  D     + L    
Sbjct: 76  VSIRSGETSRVKKIFIAGDSATYLDALAKLA 106


>gi|291615210|ref|YP_003525367.1| hypothetical protein Slit_2755 [Sideroxydans lithotrophicus ES-1]
 gi|291585322|gb|ADE12980.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus
          ES-1]
          Length = 94

 Score = 90.6 bits (224), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 42/90 (46%), Gaps = 7/90 (7%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          +  + + + P AK++ +A L           +KI++ A P +G+AN+A+L  +A+   + 
Sbjct: 11 ILTLTLHIQPGAKRTEVAGLH-------GAALKIRLAAPPIEGRANEALLKFIAESFGVP 63

Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELL 90
             + +     S  K++ +     E   LL
Sbjct: 64 LRQVELKQGGQSRHKVVAVTASKIEPESLL 93


>gi|89109730|ref|AP_003510.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110]
 gi|90111518|ref|NP_417428.2| conserved protein, UPF0235 family [Escherichia coli str. K-12
          substr. MG1655]
 gi|161984863|ref|YP_409371.2| hypothetical protein SBO_3037 [Shigella boydii Sb227]
 gi|170082505|ref|YP_001731825.1| hypothetical protein ECDH10B_3128 [Escherichia coli str. K-12
          substr. DH10B]
 gi|238902075|ref|YP_002927871.1| hypothetical protein BWG_2675 [Escherichia coli BW2952]
 gi|253772209|ref|YP_003035040.1| hypothetical protein ECBD_0787 [Escherichia coli
          'BL21-Gold(DE3)pLysS AG']
 gi|300947684|ref|ZP_07161853.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          116-1]
 gi|300954200|ref|ZP_07166665.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1]
 gi|301643693|ref|ZP_07243732.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          146-1]
 gi|307139638|ref|ZP_07498994.1| hypothetical protein EcolH7_16123 [Escherichia coli H736]
 gi|6920084|sp|P52060|YGGU_ECOLI RecName: Full=UPF0235 protein yggU
 gi|226730819|sp|B1XFB3|YGGU_ECODH RecName: Full=UPF0235 protein yggU
 gi|259710253|sp|C5A0M2|YGGU_ECOBW RecName: Full=UPF0235 protein yggU
 gi|85675763|dbj|BAE77016.1| conserved hypothetical protein [Escherichia coli str. K12 substr.
          W3110]
 gi|87082189|gb|AAC75990.2| conserved protein, UPF0235 family [Escherichia coli str. K-12
          substr. MG1655]
 gi|169890340|gb|ACB04047.1| conserved protein [Escherichia coli str. K-12 substr. DH10B]
 gi|238860521|gb|ACR62519.1| conserved protein [Escherichia coli BW2952]
 gi|253323253|gb|ACT27855.1| protein of unknown function DUF167 [Escherichia coli
          'BL21-Gold(DE3)pLysS AG']
 gi|260448004|gb|ACX38426.1| protein of unknown function DUF167 [Escherichia coli DH1]
 gi|284922896|emb|CBG35985.1| conserved hypothetical protein [Escherichia coli 042]
 gi|300318784|gb|EFJ68568.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1]
 gi|300452730|gb|EFK16350.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          116-1]
 gi|301077895|gb|EFK92701.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          146-1]
 gi|309703308|emb|CBJ02644.1| conserved hypothetical protein [Escherichia coli ETEC H10407]
 gi|315137550|dbj|BAJ44709.1| hypothetical protein ECDH1ME8569_2853 [Escherichia coli DH1]
 gi|315614885|gb|EFU95523.1| conserved hypothetical protein [Escherichia coli 3431]
 gi|320174051|gb|EFW49221.1| hypothetical protein SDB_03448 [Shigella dysenteriae CDC 74-1112]
 gi|320184300|gb|EFW59112.1| hypothetical protein SGF_03464 [Shigella flexneri CDC 796-83]
 gi|323173833|gb|EFZ59462.1| hypothetical protein ECLT68_2152 [Escherichia coli LT-68]
 gi|323936044|gb|EGB32339.1| yggU [Escherichia coli E1520]
 gi|332091357|gb|EGI96445.1| hypothetical protein SB359474_3531 [Shigella boydii 3594-74]
          Length = 96

 Score = 90.6 bits (224), Expect = 6e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|254497376|ref|ZP_05110179.1| conserved hypothetical protein [Legionella drancourtii LLAP12]
 gi|254353429|gb|EET12161.1| conserved hypothetical protein [Legionella drancourtii LLAP12]
          Length = 91

 Score = 90.6 bits (224), Expect = 7e-17,   Method: Composition-based stats.
 Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P AK + IA             +KI++ A P +G+AN+A+L  +A+  A+    
Sbjct: 12 INLYIQPGAKHTEIAGFH-------GEALKIRLHAPPIEGRANEALLKFIAQIFAVPTRQ 64

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + +     S LK + I   
Sbjct: 65 VVLKRGDKSRLKTLIITGS 83


>gi|319899260|ref|YP_004159353.1| hypothetical protein BARCL_1102 [Bartonella clarridgeiae 73]
 gi|319403224|emb|CBI76783.1| conserved protein of unknown function [Bartonella clarridgeiae
          73]
          Length = 110

 Score = 90.6 bits (224), Expect = 7e-17,   Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRLIP A    I  +E   D     H+ I++ A P+ GKAN+A++  LAK+  +  S 
Sbjct: 12 LFVRLIPKASVDSIIKVEDRDDGKQ--HLIIRLRAIPENGKANRALIKFLAKQWKIPSSC 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + + S  +S  K +Y  K  KEI ++LQ+
Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEIEQILQS 98


>gi|262403916|ref|ZP_06080473.1| hypothetical protein VOA_001904 [Vibrio sp. RC586]
 gi|262349878|gb|EEY99014.1| hypothetical protein VOA_001904 [Vibrio sp. RC586]
          Length = 96

 Score = 90.2 bits (223), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK+  ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKQCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EI  L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPPEIAALIE 96


>gi|110643102|ref|YP_670832.1| hypothetical protein ECP_2947 [Escherichia coli 536]
 gi|117625180|ref|YP_854168.1| hypothetical protein APECO1_3568 [Escherichia coli APEC O1]
 gi|161485880|ref|NP_708718.2| hypothetical protein SF2944 [Shigella flexneri 2a str. 301]
 gi|161486124|ref|NP_755414.2| hypothetical protein c3539 [Escherichia coli CFT073]
 gi|161486442|ref|NP_838440.2| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T]
 gi|170018806|ref|YP_001723760.1| hypothetical protein EcolC_0761 [Escherichia coli ATCC 8739]
 gi|170681809|ref|YP_001745114.1| hypothetical protein EcSMS35_3095 [Escherichia coli SMS-3-5]
 gi|191167930|ref|ZP_03029733.1| conserved hypothetical protein [Escherichia coli B7A]
 gi|191171828|ref|ZP_03033374.1| conserved hypothetical protein [Escherichia coli F11]
 gi|193063515|ref|ZP_03044604.1| conserved hypothetical protein [Escherichia coli E22]
 gi|193067470|ref|ZP_03048438.1| conserved hypothetical protein [Escherichia coli E110019]
 gi|194426264|ref|ZP_03058819.1| conserved hypothetical protein [Escherichia coli B171]
 gi|194431742|ref|ZP_03064033.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|194436768|ref|ZP_03068868.1| conserved hypothetical protein [Escherichia coli 101-1]
 gi|209920412|ref|YP_002294496.1| hypothetical protein ECSE_3221 [Escherichia coli SE11]
 gi|218550200|ref|YP_002383991.1| hypothetical protein EFER_2892 [Escherichia fergusonii ATCC
          35469]
 gi|218555512|ref|YP_002388425.1| hypothetical protein ECIAI1_3086 [Escherichia coli IAI1]
 gi|218559944|ref|YP_002392857.1| hypothetical protein ECS88_3235 [Escherichia coli S88]
 gi|218691077|ref|YP_002399289.1| hypothetical protein ECED1_3416 [Escherichia coli ED1a]
 gi|218696551|ref|YP_002404218.1| hypothetical protein EC55989_3246 [Escherichia coli 55989]
 gi|218701663|ref|YP_002409292.1| hypothetical protein ECIAI39_3371 [Escherichia coli IAI39]
 gi|218706468|ref|YP_002413987.1| hypothetical protein ECUMN_3305 [Escherichia coli UMN026]
 gi|227888508|ref|ZP_04006313.1| protein of hypothetical function DUF167 [Escherichia coli 83972]
 gi|254162863|ref|YP_003045971.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606]
 gi|256019245|ref|ZP_05433110.1| hypothetical protein ShiD9_10034 [Shigella sp. D9]
 gi|260845623|ref|YP_003223401.1| hypothetical protein ECO103_3533 [Escherichia coli O103:H2 str.
          12009]
 gi|260857086|ref|YP_003230977.1| hypothetical protein ECO26_4052 [Escherichia coli O26:H11 str.
          11368]
 gi|260869640|ref|YP_003236042.1| hypothetical protein ECO111_3701 [Escherichia coli O111:H- str.
          11128]
 gi|293406460|ref|ZP_06650386.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412]
 gi|293416214|ref|ZP_06658854.1| hypothetical protein ECDG_03817 [Escherichia coli B185]
 gi|293449283|ref|ZP_06663704.1| hypothetical protein ECCG_02314 [Escherichia coli B088]
 gi|297520255|ref|ZP_06938641.1| hypothetical protein EcolOP_21662 [Escherichia coli OP50]
 gi|298382197|ref|ZP_06991794.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302]
 gi|300815577|ref|ZP_07095801.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          107-1]
 gi|300824812|ref|ZP_07104916.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          119-7]
 gi|300900230|ref|ZP_07118414.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          198-1]
 gi|300906483|ref|ZP_07124178.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1]
 gi|300921295|ref|ZP_07137664.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          115-1]
 gi|300928104|ref|ZP_07143649.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          187-1]
 gi|300940765|ref|ZP_07155311.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          21-1]
 gi|300980105|ref|ZP_07174848.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          45-1]
 gi|300995466|ref|ZP_07181114.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1]
 gi|301027296|ref|ZP_07190642.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          69-1]
 gi|301027722|ref|ZP_07191032.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          196-1]
 gi|301049252|ref|ZP_07196225.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          185-1]
 gi|301306562|ref|ZP_07212624.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1]
 gi|301328107|ref|ZP_07221248.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          78-1]
 gi|306812143|ref|ZP_07446341.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101]
 gi|309794042|ref|ZP_07688467.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          145-7]
 gi|312972805|ref|ZP_07786978.1| conserved hypothetical protein [Escherichia coli 1827-70]
 gi|331659088|ref|ZP_08360030.1| conserved hypothetical protein [Escherichia coli TA206]
 gi|331669699|ref|ZP_08370545.1| conserved hypothetical protein [Escherichia coli TA271]
 gi|29839713|sp|Q8FE28|YGGU_ECOL6 RecName: Full=UPF0235 protein yggU
 gi|47117526|sp|Q83JS1|YGGU_SHIFL RecName: Full=UPF0235 protein yggU
 gi|123343643|sp|Q0TDP8|YGGU_ECOL5 RecName: Full=UPF0235 protein yggU
 gi|166227348|sp|A1AFD9|YGGU_ECOK1 RecName: Full=UPF0235 protein yggU
 gi|189030102|sp|B1IT54|YGGU_ECOLC RecName: Full=UPF0235 protein yggU
 gi|226730815|sp|B7MME1|YGGU_ECO45 RecName: Full=UPF0235 protein yggU
 gi|226730817|sp|B7NI16|YGGU_ECO7I RecName: Full=UPF0235 protein yggU
 gi|226730818|sp|B7LYY3|YGGU_ECO8A RecName: Full=UPF0235 protein yggU
 gi|226730820|sp|B7N7K6|YGGU_ECOLU RecName: Full=UPF0235 protein yggU
 gi|226730821|sp|B6I789|YGGU_ECOSE RecName: Full=UPF0235 protein yggU
 gi|226730822|sp|B1LDG2|YGGU_ECOSM RecName: Full=UPF0235 protein yggU
 gi|226730823|sp|B7LPS0|YGGU_ESCF3 RecName: Full=UPF0235 protein yggU
 gi|254814155|sp|B7LFL6|YGGU_ECO55 RecName: Full=UPF0235 protein yggU
 gi|254814156|sp|B7MZQ2|YGGU_ECO81 RecName: Full=UPF0235 protein yggU
 gi|110344694|gb|ABG70931.1| hypothetical protein YggU [Escherichia coli 536]
 gi|115514304|gb|ABJ02379.1| conserved hypothetical protein [Escherichia coli APEC O1]
 gi|169753734|gb|ACA76433.1| protein of unknown function DUF167 [Escherichia coli ATCC 8739]
 gi|170519527|gb|ACB17705.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
 gi|190902015|gb|EDV61761.1| conserved hypothetical protein [Escherichia coli B7A]
 gi|190907863|gb|EDV67456.1| conserved hypothetical protein [Escherichia coli F11]
 gi|192930792|gb|EDV83397.1| conserved hypothetical protein [Escherichia coli E22]
 gi|192959427|gb|EDV89862.1| conserved hypothetical protein [Escherichia coli E110019]
 gi|194415572|gb|EDX31839.1| conserved hypothetical protein [Escherichia coli B171]
 gi|194420098|gb|EDX36176.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|194424250|gb|EDX40237.1| conserved hypothetical protein [Escherichia coli 101-1]
 gi|209913671|dbj|BAG78745.1| conserved hypothetical protein [Escherichia coli SE11]
 gi|218353283|emb|CAU99245.1| conserved hypothetical protein [Escherichia coli 55989]
 gi|218357741|emb|CAQ90385.1| conserved hypothetical protein [Escherichia fergusonii ATCC
          35469]
 gi|218362280|emb|CAQ99901.1| conserved hypothetical protein [Escherichia coli IAI1]
 gi|218366713|emb|CAR04470.1| conserved hypothetical protein [Escherichia coli S88]
 gi|218371649|emb|CAR19488.1| conserved hypothetical protein [Escherichia coli IAI39]
 gi|218428641|emb|CAR09570.2| conserved hypothetical protein [Escherichia coli ED1a]
 gi|218433565|emb|CAR14468.1| conserved hypothetical protein [Escherichia coli UMN026]
 gi|222034648|emb|CAP77390.1| UPF0235 protein yggU [Escherichia coli LF82]
 gi|227834777|gb|EEJ45243.1| protein of hypothetical function DUF167 [Escherichia coli 83972]
 gi|242378479|emb|CAQ33263.1| conserved protein [Escherichia coli BL21(DE3)]
 gi|253974764|gb|ACT40435.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606]
 gi|253978930|gb|ACT44600.1| hypothetical protein ECD_02783 [Escherichia coli BL21(DE3)]
 gi|257755735|dbj|BAI27237.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
 gi|257760770|dbj|BAI32267.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009]
 gi|257765996|dbj|BAI37491.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
 gi|291322373|gb|EFE61802.1| hypothetical protein ECCG_02314 [Escherichia coli B088]
 gi|291426466|gb|EFE99498.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412]
 gi|291432403|gb|EFF05385.1| hypothetical protein ECDG_03817 [Escherichia coli B185]
 gi|298277337|gb|EFI18853.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302]
 gi|299879158|gb|EFI87369.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          196-1]
 gi|300298948|gb|EFJ55333.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          185-1]
 gi|300304828|gb|EFJ59348.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1]
 gi|300356246|gb|EFJ72116.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          198-1]
 gi|300395102|gb|EFJ78640.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          69-1]
 gi|300401726|gb|EFJ85264.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1]
 gi|300409362|gb|EFJ92900.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          45-1]
 gi|300411757|gb|EFJ95067.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          115-1]
 gi|300454465|gb|EFK17958.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          21-1]
 gi|300463870|gb|EFK27363.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          187-1]
 gi|300522719|gb|EFK43788.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          119-7]
 gi|300531506|gb|EFK52568.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          107-1]
 gi|300838180|gb|EFK65940.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1]
 gi|300845411|gb|EFK73171.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          78-1]
 gi|305854181|gb|EFM54619.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101]
 gi|307554935|gb|ADN47710.1| conserved hypothetical protein [Escherichia coli ABU 83972]
 gi|307625473|gb|ADN69777.1| hypothetical protein UM146_01750 [Escherichia coli UM146]
 gi|308122449|gb|EFO59711.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          145-7]
 gi|310332747|gb|EFP99960.1| conserved hypothetical protein [Escherichia coli 1827-70]
 gi|312947484|gb|ADR28311.1| hypothetical protein NRG857_14505 [Escherichia coli O83:H1 str.
          NRG 857C]
 gi|313648003|gb|EFS12449.1| hypothetical protein SF2457T_3601 [Shigella flexneri 2a str.
          2457T]
 gi|315256845|gb|EFU36813.1| putative cytoplasmic protein [Escherichia coli MS 85-1]
 gi|315289499|gb|EFU48894.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          110-3]
 gi|315293933|gb|EFU53285.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          153-1]
 gi|315295642|gb|EFU54965.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          16-3]
 gi|320181042|gb|EFW55963.1| hypothetical protein SGB_01866 [Shigella boydii ATCC 9905]
 gi|320195072|gb|EFW69701.1| hypothetical protein EcoM_02835 [Escherichia coli WV_060327]
 gi|320202617|gb|EFW77187.1| hypothetical protein ECoL_00280 [Escherichia coli EC4100B]
 gi|323154662|gb|EFZ40861.1| hypothetical protein ECEPECA14_3462 [Escherichia coli EPECa14]
 gi|323162601|gb|EFZ48448.1| hypothetical protein ECE128010_1205 [Escherichia coli E128010]
 gi|323180413|gb|EFZ65965.1| hypothetical protein ECOK1180_1095 [Escherichia coli 1180]
 gi|323183524|gb|EFZ68921.1| hypothetical protein ECOK1357_3303 [Escherichia coli 1357]
 gi|323188653|gb|EFZ73938.1| hypothetical protein ECRN5871_3073 [Escherichia coli RN587/1]
 gi|323941961|gb|EGB38140.1| hypothetical protein ERDG_01736 [Escherichia coli E482]
 gi|323946549|gb|EGB42572.1| yggU [Escherichia coli H120]
 gi|323951609|gb|EGB47484.1| hypothetical protein ERKG_02252 [Escherichia coli H252]
 gi|323957323|gb|EGB53045.1| hypothetical protein ERLG_01390 [Escherichia coli H263]
 gi|323960754|gb|EGB56375.1| hypothetical protein ERGG_02689 [Escherichia coli H489]
 gi|323966470|gb|EGB61903.1| hypothetical protein ERJG_02059 [Escherichia coli M863]
 gi|323971760|gb|EGB66987.1| hypothetical protein ERHG_02238 [Escherichia coli TA007]
 gi|323978752|gb|EGB73833.1| hypothetical protein ERFG_00343 [Escherichia coli TW10509]
 gi|324005509|gb|EGB74728.1| hypothetical protein HMPREF9532_04861 [Escherichia coli MS 57-2]
 gi|324011796|gb|EGB81015.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
          60-1]
 gi|324017226|gb|EGB86445.1| hypothetical protein HMPREF9542_04137 [Escherichia coli MS 117-3]
 gi|324115032|gb|EGC08997.1| hypothetical protein ERIG_00360 [Escherichia fergusonii B253]
 gi|324119748|gb|EGC13628.1| hypothetical protein ERBG_00336 [Escherichia coli E1167]
 gi|325498510|gb|EGC96369.1| hypothetical protein ECD227_2607 [Escherichia fergusonii ECD227]
 gi|327251721|gb|EGE63407.1| hypothetical protein ECSTEC7V_3572 [Escherichia coli STEC_7v]
 gi|331053670|gb|EGI25699.1| conserved hypothetical protein [Escherichia coli TA206]
 gi|331063367|gb|EGI35280.1| conserved hypothetical protein [Escherichia coli TA271]
 gi|332086882|gb|EGI92018.1| hypothetical protein SB521682_3508 [Shigella boydii 5216-82]
 gi|332087620|gb|EGI92747.1| hypothetical protein SD15574_3393 [Shigella dysenteriae 155-74]
          Length = 96

 Score = 90.2 bits (223), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|257062952|ref|YP_003142624.1| hypothetical protein Shel_02050 [Slackia heliotrinireducens DSM
          20476]
 gi|256790605|gb|ACV21275.1| uncharacterized conserved protein [Slackia heliotrinireducens DSM
          20476]
          Length = 106

 Score = 90.2 bits (223), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 22/87 (25%), Positives = 48/87 (55%), Gaps = 2/87 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +   P A+++ +A +    D +  + ++++VT  P+ GKANKA+   LAK + +SK
Sbjct: 14 TQIPIHATPKAQRNAVAGV--KADDTGRLEVQVRVTVAPEGGKANKAVCETLAKAIGVSK 71

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITE 88
          S + ++  ++S  K+  ++    +I  
Sbjct: 72 SKVSIVRGETSRHKMAQVEAPSADIEA 98


>gi|302038855|ref|YP_003799177.1| hypothetical protein NIDE3568 [Candidatus Nitrospira defluvii]
 gi|300606919|emb|CBK43252.1| conserved protein of unknown function DUF167 [Candidatus Nitrospira
           defluvii]
          Length = 105

 Score = 90.2 bits (223), Expect = 8e-17,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V + P A +S  A L           +KI++ A P  G AN  +   LA+   +  
Sbjct: 20  VTISVHVQPKASRSECAGLH-------GHAVKIRIAAPPADGAANAELCRFLARCCEVPL 72

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
           S++ +LS   S  K + +     +++   L
Sbjct: 73  SAVHILSGAGSRQKRVLVKGRTAEQVRAQL 102


>gi|161367529|ref|NP_289525.2| hypothetical protein Z4298 [Escherichia coli O157:H7 str. EDL933]
 gi|162139760|ref|NP_311856.2| hypothetical protein ECs3829 [Escherichia coli O157:H7 str.
          Sakai]
 gi|168747555|ref|ZP_02772577.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4113]
 gi|168753905|ref|ZP_02778912.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4401]
 gi|168760095|ref|ZP_02785102.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4501]
 gi|168766960|ref|ZP_02791967.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4486]
 gi|168773408|ref|ZP_02798415.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4196]
 gi|168781812|ref|ZP_02806819.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4076]
 gi|168785811|ref|ZP_02810818.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC869]
 gi|168797528|ref|ZP_02822535.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC508]
 gi|195937091|ref|ZP_03082473.1| hypothetical protein EscherichcoliO157_11661 [Escherichia coli
          O157:H7 str. EC4024]
 gi|208807147|ref|ZP_03249484.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4206]
 gi|208813482|ref|ZP_03254811.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4045]
 gi|208818265|ref|ZP_03258585.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4042]
 gi|209399373|ref|YP_002272433.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4115]
 gi|217327282|ref|ZP_03443365.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. TW14588]
 gi|254794905|ref|YP_003079742.1| hypothetical protein ECSP_3924 [Escherichia coli O157:H7 str.
          TW14359]
 gi|261226265|ref|ZP_05940546.1| hypothetical protein EscherichiacoliO157_16963 [Escherichia coli
          O157:H7 str. FRIK2000]
 gi|261256477|ref|ZP_05949010.1| hypothetical protein EscherichiacoliO157EcO_11656 [Escherichia
          coli O157:H7 str. FRIK966]
 gi|291284274|ref|YP_003501092.1| hypothetical protein G2583_3612 [Escherichia coli O55:H7 str.
          CB9615]
 gi|29839727|sp|Q8XCU6|YGGU_ECO57 RecName: Full=UPF0235 protein yggU
 gi|226730816|sp|B5YQF1|YGGU_ECO5E RecName: Full=UPF0235 protein yggU
 gi|187770933|gb|EDU34777.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4196]
 gi|188017898|gb|EDU56020.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4113]
 gi|189000572|gb|EDU69558.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4076]
 gi|189358605|gb|EDU77024.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4401]
 gi|189363683|gb|EDU82102.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4486]
 gi|189369349|gb|EDU87765.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC4501]
 gi|189374116|gb|EDU92532.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC869]
 gi|189379790|gb|EDU98206.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
          EC508]
 gi|208726948|gb|EDZ76549.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4206]
 gi|208734759|gb|EDZ83446.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4045]
 gi|208738388|gb|EDZ86070.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4042]
 gi|209160773|gb|ACI38206.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. EC4115]
 gi|217319649|gb|EEC28074.1| conserved hypothetical protein TIGR00251 [Escherichia coli
          O157:H7 str. TW14588]
 gi|254594305|gb|ACT73666.1| conserved protein [Escherichia coli O157:H7 str. TW14359]
 gi|290764147|gb|ADD58108.1| conserved hypothetical protein [Escherichia coli O55:H7 str.
          CB9615]
 gi|320189302|gb|EFW63961.1| hypothetical protein ECoD_04306 [Escherichia coli O157:H7 str.
          EC1212]
 gi|320640599|gb|EFX10138.1| hypothetical protein ECO5101_04264 [Escherichia coli O157:H7 str.
          G5101]
 gi|320645846|gb|EFX14831.1| hypothetical protein ECO9389_23751 [Escherichia coli O157:H- str.
          493-89]
 gi|320651146|gb|EFX19586.1| hypothetical protein ECO2687_11738 [Escherichia coli O157:H- str.
          H 2687]
 gi|320656642|gb|EFX24538.1| hypothetical protein ECO7815_01895 [Escherichia coli O55:H7 str.
          3256-97 TW 07815]
 gi|320662161|gb|EFX29562.1| hypothetical protein ECO5905_09833 [Escherichia coli O55:H7 str.
          USDA 5905]
 gi|320667236|gb|EFX34199.1| hypothetical protein ECOSU61_08919 [Escherichia coli O157:H7 str.
          LSU-61]
 gi|326338959|gb|EGD62774.1| hypothetical protein ECoA_04497 [Escherichia coli O157:H7 str.
          1044]
 gi|326343159|gb|EGD66927.1| hypothetical protein ECF_02672 [Escherichia coli O157:H7 str.
          1125]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + +   +    K I I    +
Sbjct: 66 VVIEKGELGRHKQIKIINPQQ 86


>gi|293412313|ref|ZP_06655036.1| hypothetical protein ECEG_02320 [Escherichia coli B354]
 gi|291469084|gb|EFF11575.1| hypothetical protein ECEG_02320 [Escherichia coli B354]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|153828377|ref|ZP_01981044.1| conserved hypothetical protein [Vibrio cholerae 623-39]
 gi|229519844|ref|ZP_04409278.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80]
 gi|148876086|gb|EDL74221.1| conserved hypothetical protein [Vibrio cholerae 623-39]
 gi|229343132|gb|EEO08116.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EI  L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96


>gi|261210035|ref|ZP_05924333.1| hypothetical protein VCJ_000277 [Vibrio sp. RC341]
 gi|260840800|gb|EEX67342.1| hypothetical protein VCJ_000277 [Vibrio sp. RC341]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK+  ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKQCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EI  L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96


>gi|161950048|ref|YP_404624.2| hypothetical protein SDY_3119 [Shigella dysenteriae Sd197]
 gi|309785219|ref|ZP_07679850.1| conserved hypothetical protein [Shigella dysenteriae 1617]
 gi|308926339|gb|EFP71815.1| conserved hypothetical protein [Shigella dysenteriae 1617]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANGHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|149057384|gb|EDM08707.1| similar to RIKEN cDNA 3110040N11, isoform CRA_d [Rattus norvegicus]
          Length = 181

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L             + + A P +G+AN  +   L+K L L K
Sbjct: 91  VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 143

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 144 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 179


>gi|256024538|ref|ZP_05438403.1| hypothetical protein E4_14269 [Escherichia sp. 4_1_40B]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K   ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKLFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|121728574|ref|ZP_01681595.1| conserved hypothetical protein [Vibrio cholerae V52]
 gi|153802603|ref|ZP_01957189.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
 gi|229512520|ref|ZP_04401991.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21]
 gi|254291169|ref|ZP_04961965.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
 gi|121629130|gb|EAX61573.1| conserved hypothetical protein [Vibrio cholerae V52]
 gi|124121866|gb|EAY40609.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
 gi|150422863|gb|EDN14814.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
 gi|229350413|gb|EEO15362.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EI  L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96


>gi|16761877|ref|NP_457494.1| hypothetical protein STY3255 [Salmonella enterica subsp. enterica
          serovar Typhi str. CT18]
 gi|29143364|ref|NP_806706.1| hypothetical protein t3014 [Salmonella enterica subsp. enterica
          serovar Typhi str. Ty2]
 gi|161616066|ref|YP_001590031.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp.
          enterica serovar Paratyphi B str. SPB7]
 gi|167552006|ref|ZP_02345759.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Saintpaul str. SARA29]
 gi|168234347|ref|ZP_02659405.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Kentucky str. CDC 191]
 gi|168236170|ref|ZP_02661228.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Schwarzengrund str. SL480]
 gi|168264452|ref|ZP_02686425.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Hadar str. RI_05P066]
 gi|168463711|ref|ZP_02697628.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Newport str. SL317]
 gi|194470418|ref|ZP_03076402.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Kentucky str. CVM29188]
 gi|194735973|ref|YP_002116050.1| hypothetical protein SeSA_A3276 [Salmonella enterica subsp.
          enterica serovar Schwarzengrund str. CVM19633]
 gi|198244387|ref|YP_002217077.1| hypothetical protein SeD_A3445 [Salmonella enterica subsp.
          enterica serovar Dublin str. CT_02021853]
 gi|200388001|ref|ZP_03214613.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Virchow str. SL491]
 gi|205354025|ref|YP_002227826.1| hypothetical protein SG2996 [Salmonella enterica subsp. enterica
          serovar Gallinarum str. 287/91]
 gi|207858363|ref|YP_002245014.1| hypothetical protein SEN2945 [Salmonella enterica subsp. enterica
          serovar Enteritidis str. P125109]
 gi|213027996|ref|ZP_03342443.1| hypothetical protein Salmonelentericaenterica_39070 [Salmonella
          enterica subsp. enterica serovar Typhi str. 404ty]
 gi|213051778|ref|ZP_03344656.1| hypothetical protein Salmoneentericaenterica_01893 [Salmonella
          enterica subsp. enterica serovar Typhi str. E00-7866]
 gi|213422843|ref|ZP_03355881.1| hypothetical protein Salmonentericaenterica_35767 [Salmonella
          enterica subsp. enterica serovar Typhi str. E01-6750]
 gi|213424105|ref|ZP_03356998.1| hypothetical protein SentesTyphi_00030 [Salmonella enterica
          subsp. enterica serovar Typhi str. E02-1180]
 gi|213580619|ref|ZP_03362445.1| hypothetical protein SentesTyph_05132 [Salmonella enterica subsp.
          enterica serovar Typhi str. E98-0664]
 gi|213609099|ref|ZP_03368925.1| hypothetical protein SentesTyp_00567 [Salmonella enterica subsp.
          enterica serovar Typhi str. E98-2068]
 gi|213648208|ref|ZP_03378261.1| hypothetical protein SentesTy_13504 [Salmonella enterica subsp.
          enterica serovar Typhi str. J185]
 gi|213850158|ref|ZP_03381056.1| hypothetical protein SentesT_00598 [Salmonella enterica subsp.
          enterica serovar Typhi str. M223]
 gi|289825938|ref|ZP_06545097.1| hypothetical protein Salmonellentericaenterica_11175 [Salmonella
          enterica subsp. enterica serovar Typhi str. E98-3139]
 gi|29839732|sp|Q8Z3U7|YGGU_SALTI RecName: Full=UPF0235 protein yggU
 gi|189030121|sp|A9N4P8|YGGU_SALPB RecName: Full=UPF0235 protein yggU
 gi|226730825|sp|B5FUW5|YGGU_SALDC RecName: Full=UPF0235 protein yggU
 gi|226730826|sp|B5QY78|YGGU_SALEP RecName: Full=UPF0235 protein yggU
 gi|226730827|sp|B5RE62|YGGU_SALG2 RecName: Full=UPF0235 protein yggU
 gi|226730831|sp|B4TV71|YGGU_SALSV RecName: Full=UPF0235 protein yggU
 gi|25370114|pir||AF0878 conserved hypothetical protein STY3255 [imported] - Salmonella
          enterica subsp. enterica serovar Typhi (strain CT18)
 gi|16504179|emb|CAD02926.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Typhi]
 gi|29138998|gb|AAO70566.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Typhi str. Ty2]
 gi|161365430|gb|ABX69198.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp.
          enterica serovar Paratyphi B str. SPB7]
 gi|194456782|gb|EDX45621.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Kentucky str. CVM29188]
 gi|194711475|gb|ACF90696.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Schwarzengrund str. CVM19633]
 gi|195633251|gb|EDX51665.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Newport str. SL317]
 gi|197290920|gb|EDY30274.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Schwarzengrund str. SL480]
 gi|197938903|gb|ACH76236.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Dublin str. CT_02021853]
 gi|199605099|gb|EDZ03644.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Virchow str. SL491]
 gi|205273806|emb|CAR38801.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Gallinarum str. 287/91]
 gi|205323332|gb|EDZ11171.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Saintpaul str. SARA29]
 gi|205331696|gb|EDZ18460.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Kentucky str. CDC 191]
 gi|205347059|gb|EDZ33690.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar Hadar str. RI_05P066]
 gi|206710166|emb|CAR34522.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Enteritidis str. P125109]
 gi|326624849|gb|EGE31194.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
          Dublin str. 3246]
 gi|326629138|gb|EGE35481.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
          Gallinarum str. 9]
          Length = 96

 Score = 90.2 bits (223), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 IVIEKGELGRHKQVKI 81


>gi|16766403|ref|NP_462018.1| hypothetical protein STM3102 [Salmonella enterica subsp. enterica
          serovar Typhimurium str. LT2]
 gi|167990371|ref|ZP_02571471.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar 4,[5],12:i:- str. CVM23701]
 gi|197262086|ref|ZP_03162160.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Saintpaul str. SARA23]
 gi|29839738|sp|Q8ZM46|YGGU_SALTY RecName: Full=UPF0235 protein yggU
 gi|16421655|gb|AAL21977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
          serovar Typhimurium str. LT2]
 gi|197240341|gb|EDY22961.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Saintpaul str. SARA23]
 gi|205331149|gb|EDZ17913.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
          subsp. enterica serovar 4,[5],12:i:- str. CVM23701]
 gi|261248233|emb|CBG26070.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Typhimurium str. D23580]
 gi|267995267|gb|ACY90152.1| hypothetical protein STM14_3746 [Salmonella enterica subsp.
          enterica serovar Typhimurium str. 14028S]
 gi|301159657|emb|CBW19176.1| conserved hypothetical protein [Salmonella enterica subsp.
          enterica serovar Typhimurium str. SL1344]
 gi|312914124|dbj|BAJ38098.1| hypothetical protein STMDT12_C31550 [Salmonella enterica subsp.
          enterica serovar Typhimurium str. T000240]
 gi|321225775|gb|EFX50829.1| UPF0235 protein VC [Salmonella enterica subsp. enterica serovar
          Typhimurium str. TN061786]
 gi|323131458|gb|ADX18888.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
          serovar Typhimurium str. 4/74]
          Length = 96

 Score = 90.2 bits (223), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +KI +TA P  G+AN  +   L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKIAITAPPVDGQANSHLTKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 66 IVIEKGELGRHKQVKI 81


>gi|254295256|ref|YP_003061279.1| hypothetical protein Hbal_2912 [Hirschia baltica ATCC 49814]
 gi|254043787|gb|ACT60582.1| protein of unknown function DUF167 [Hirschia baltica ATCC 49814]
          Length = 109

 Score = 89.8 bits (222), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 3/91 (3%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           +++ R+ PNA K  + +    +D +   ++K++V A P KGKANKA+  +LA    L KS
Sbjct: 13  DIVARVTPNASKDAVEA--PEQDAAGRTYLKLRVRAIPDKGKANKAVEKLLASHFNLPKS 70

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            + ++   +  LK I I  D  +++  L   
Sbjct: 71  KVAVVKGSTDRLKTIRIS-DGADLSSQLAEK 100


>gi|114326941|ref|YP_744098.1| putative cytoplasmic protein [Granulibacter bethesdensis CGDNIH1]
 gi|122328075|sp|Q0BVH7|Y277_GRABC RecName: Full=UPF0235 protein GbCGDNIH1_0277
 gi|114315115|gb|ABI61175.1| hypothetical cytosolic protein [Granulibacter bethesdensis CGDNIH1]
          Length = 107

 Score = 89.8 bits (222), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 2/92 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R++P A+K G+       D      +KI V+A   KG+AN+A+  MLAK L +  S 
Sbjct: 13  LALRVVPKARKIGLGGTVPGADGKP--RLKISVSAPADKGQANEAVRDMLAKALRVPASR 70

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
           + +L   ++  K++ ++ D + +   ++   S
Sbjct: 71  ITLLQGLTARDKLVRVEGDPETLGSTVETLAS 102


>gi|262191069|ref|ZP_06049276.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93]
 gi|262033045|gb|EEY51576.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93]
          Length = 96

 Score = 89.8 bits (222), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIIGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EI  L++
Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96


>gi|322513549|ref|ZP_08066649.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC
          25976]
 gi|322120620|gb|EFX92514.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC
          25976]
          Length = 99

 Score = 89.8 bits (222), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 39/90 (43%), Gaps = 8/90 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +T  P  G AN  +L  L+K   + K
Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITTPPVDGAANAHLLKYLSKLFKVPK 65

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          SS+ +   +    K +++  + K + + ++
Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLLPKEIE 94


>gi|27375661|ref|NP_767190.1| hypothetical protein bsl0550 [Bradyrhizobium japonicum USDA 110]
 gi|27348798|dbj|BAC45815.1| bsl0550 [Bradyrhizobium japonicum USDA 110]
          Length = 94

 Score = 89.8 bits (222), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 2/87 (2%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           P   +  I  +E   D      +K++V A    G+ANKA+L +LAK L + K+S+R+LS
Sbjct: 2  TPRGGRDDIDGIEQLADG--RSVLKVRVRAIADGGEANKAVLVLLAKSLGVPKASVRLLS 59

Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDS 95
            +S LK I +D D   + E L+   S
Sbjct: 60 GATSRLKQIAVDGDPARLGETLRQLAS 86


>gi|241762300|ref|ZP_04760381.1| protein of unknown function DUF167 [Zymomonas mobilis subsp.
           mobilis ATCC 10988]
 gi|241373203|gb|EER62833.1| protein of unknown function DUF167 [Zymomonas mobilis subsp.
           mobilis ATCC 10988]
          Length = 113

 Score = 89.8 bits (222), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 28/91 (30%), Positives = 54/91 (59%), Gaps = 2/91 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +R+   A K+GI   +   DT+     +I+V A P +G +NK ++A L+K  ++ K
Sbjct: 19  IRLALRVTARASKTGITMFDK--DTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPK 76

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92
            ++R+ S + S +KI++I  D K +TE+ ++
Sbjct: 77  GAVRIESGEHSKIKILHIAGDVKRLTEIAED 107


>gi|320161245|ref|YP_004174469.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1]
 gi|319995098|dbj|BAJ63869.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1]
          Length = 105

 Score = 89.4 bits (221), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 23/76 (30%), Positives = 46/76 (60%), Gaps = 6/76 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P A K+ I       +  D   +KI++TA P +GKAN+A++  L++ L + ++S
Sbjct: 19 ITVRVTPRASKNEI------YEILDDGTVKIRLTAPPVEGKANEALIDFLSEVLDVPRTS 72

Query: 64 LRMLSKQSSPLKIIYI 79
          L +++ ++   KI+ +
Sbjct: 73 LEIVAGETGRDKIVTV 88


>gi|289811236|ref|ZP_06541865.1| hypothetical protein Salmonellaentericaenterica_45672 [Salmonella
          enterica subsp. enterica serovar Typhi str. AG3]
          Length = 94

 Score = 89.4 bits (221), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  +   L K+  ++KS 
Sbjct: 11 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQ 63

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +    K + I
Sbjct: 64 IVIEKGELGRHKQVKI 79


>gi|90407095|ref|ZP_01215284.1| hypothetical protein PCNPT3_02605 [Psychromonas sp. CNPT3]
 gi|90311817|gb|EAS39913.1| hypothetical protein PCNPT3_02605 [Psychromonas sp. CNPT3]
          Length = 96

 Score = 89.4 bits (221), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 21/87 (24%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + L P A +     L           +KI +TA P  G+ANK ++  L+K+  + K  
Sbjct: 12 LRLVLQPKASRDAFIGL-------LGDELKITITAPPVDGQANKHLIKFLSKQFKVPKRD 64

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + +     +  K+I I    K+I +  
Sbjct: 65 ITVEKGLLNRHKLIRIK-SPKKIPDFF 90


>gi|224826207|ref|ZP_03699310.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002]
 gi|224601844|gb|EEG08024.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002]
          Length = 108

 Score = 89.4 bits (221), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P AKK+ +A             +K+++ A P +GKAN  +LA LA++  + K
Sbjct: 23 IRLTLHVQPGAKKTDLAG-------EHGGALKLRLAAPPVEGKANAMLLAWLAERFEVPK 75

Query: 62 SSLRMLSKQSSPLKIIYID 80
            + +LS   S  KI+ I 
Sbjct: 76 RDVVLLSGDKSRHKIVEIK 94


>gi|225164758|ref|ZP_03726990.1| conserved hypothetical protein [Opitutaceae bacterium TAV2]
 gi|224800632|gb|EEG18996.1| conserved hypothetical protein [Opitutaceae bacterium TAV2]
          Length = 111

 Score = 89.4 bits (221), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 8/92 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           C + ++ IPNA ++ IA             +K+KV+A   +G+AN+ +   LA+ L + +
Sbjct: 20  CILSIKAIPNASRNAIAGW-------LGDALKVKVSAPALEGRANEQLCDFLAETLGIPR 72

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            ++ +   + S  K I I   D   +   L  
Sbjct: 73  RAVTVAGGEKSRQKRIQIIGLDLPAVRARLNA 104


>gi|223937408|ref|ZP_03629313.1| protein of unknown function DUF167 [bacterium Ellin514]
 gi|223893959|gb|EEF60415.1| protein of unknown function DUF167 [bacterium Ellin514]
          Length = 100

 Score = 89.4 bits (221), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 24/94 (25%), Positives = 41/94 (43%), Gaps = 15/94 (15%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + ++L P A  + I              ++IKVTA P    AN+A+L +LA  L   +  
Sbjct: 16 LSIKLQPRASANQIG-------EPLGNELRIKVTAPPVDAAANEALLRLLADILKCPRGK 68

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97
          + ++   +S  K+I +          L+ N  LT
Sbjct: 69 VELVRGHTSRHKVIKLHG--------LEANAVLT 94


>gi|319406122|emb|CBI79752.1| conserved hypothetical protein [Bartonella sp. AR 15-3]
          Length = 107

 Score = 89.4 bits (221), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRLIP A    I  +E   D     H+ I++ A P+ GKANKA++  LAK+  +  S 
Sbjct: 12 LFVRLIPKASVDSIIKVENRGDGKQ--HLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          + + S  +S  K +Y  K  KEI ++LQ
Sbjct: 70 ISLGSGATSHYKQLYFSKYIKEIEQILQ 97


>gi|149200997|ref|ZP_01877972.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035]
 gi|149145330|gb|EDM33356.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035]
          Length = 84

 Score = 89.4 bits (221), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 8/77 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +R+ P A ++ I         ++   +++ VT  P+ GKAN A+  +LAK L + KS 
Sbjct: 16 LTLRVTPKAARNRIV--------AEDDVLRVYVTTVPEDGKANAAVQKLLAKALGVPKSR 67

Query: 64 LRMLSKQSSPLKIIYID 80
          L +L   +S  K+  +D
Sbjct: 68 LTLLRGHTSRDKVFQVD 84


>gi|270157502|ref|ZP_06186159.1| conserved hypothetical protein [Legionella longbeachae D-4968]
 gi|269989527|gb|EEZ95781.1| conserved hypothetical protein [Legionella longbeachae D-4968]
          Length = 80

 Score = 89.4 bits (221), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 7/77 (9%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          + + P AKKS I  +           +KI++ A P +G+ANK +L  +A+   +  S + 
Sbjct: 3  LYVQPGAKKSEIVGMH-------EGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQVV 55

Query: 66 MLSKQSSPLKIIYIDKD 82
          +     S  K++ +   
Sbjct: 56 LKRGDKSRHKVLLVKNS 72


>gi|293360116|ref|XP_002729708.1| PREDICTED: hypothetical protein [Rattus norvegicus]
          Length = 304

 Score = 89.4 bits (221), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L             + + A P +G+AN  +   L+K L L K
Sbjct: 214 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 266

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 267 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 302


>gi|56551705|ref|YP_162544.1| hypothetical protein ZMO0809 [Zymomonas mobilis subsp. mobilis ZM4]
 gi|260752718|ref|YP_003225611.1| hypothetical protein Za10_0477 [Zymomonas mobilis subsp. mobilis
           NCIMB 11163]
 gi|56543279|gb|AAV89433.1| protein of unknown function DUF167 [Zymomonas mobilis subsp.
           mobilis ZM4]
 gi|258552081|gb|ACV75027.1| protein of unknown function DUF167 [Zymomonas mobilis subsp.
           mobilis NCIMB 11163]
          Length = 113

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 27/91 (29%), Positives = 54/91 (59%), Gaps = 2/91 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +R+   A K+GI   +   DT+     +I+V A P +G +NK ++A L+K  ++ K
Sbjct: 19  IRLALRVTARASKTGITMFDK--DTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPK 76

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92
            ++++ S + S +KI++I  D K +TE+ ++
Sbjct: 77  GAVKIESGEHSKIKILHIAGDVKRLTEIAED 107


>gi|148251886|ref|YP_001236471.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1]
 gi|146404059|gb|ABQ32565.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1]
          Length = 111

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 49/90 (54%), Gaps = 2/90 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V +R+ P   +  I  +E   D      +K++V A    G+AN+A+  +LAK + ++K++
Sbjct: 15  VALRVTPRGGRDAIDGIETLSDG--RSVLKLRVRAVADGGEANRAVTELLAKAIGVTKAA 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           +R+ S  ++ LK + I  D   + + L++ 
Sbjct: 73  VRITSGATARLKQVTITGDASRLDQALRDL 102


>gi|297697323|ref|XP_002825813.1| PREDICTED: UPF0235 protein C15orf40-like [Pongo abelii]
          Length = 242

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 9/98 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 152 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 204

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97
           S + +     S  K++ +      +EI E L+   + T
Sbjct: 205 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKEATKT 242


>gi|298293297|ref|YP_003695236.1| hypothetical protein Snov_3343 [Starkeya novella DSM 506]
 gi|296929808|gb|ADH90617.1| protein of unknown function DUF167 [Starkeya novella DSM 506]
          Length = 106

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 2/90 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR  P   +  I       D      +K +V+   + GKAN A+  +LAK   ++ S 
Sbjct: 16  VTVRATPRGGRDAIDGFVELGDG--RTALKARVSVAAEDGKANAALGKLLAKAAGIAPSR 73

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           + ++S  +   K   ++ D  EI   LQ  
Sbjct: 74  VDLVSGATGRTKAFKLNGDAAEIAARLQAL 103


>gi|302342363|ref|YP_003806892.1| hypothetical protein Deba_0928 [Desulfarculus baarsii DSM 2075]
 gi|301638976|gb|ADK84298.1| protein of unknown function DUF167 [Desulfarculus baarsii DSM
          2075]
          Length = 95

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 21/87 (24%), Positives = 43/87 (49%), Gaps = 8/87 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P A +  +A         +   +K+++ A P  G+AN+A+L ++AK L+L +  +
Sbjct: 14 AVRVSPRASRDQLAG-------EEGGALKVRLCAPPVDGQANEALLRLVAKALSLPRRDV 66

Query: 65 RMLSKQSSPLKIIYIDKDC-KEITELL 90
           + S   S  K + +     +++   L
Sbjct: 67 SLASGPRSRQKRLLVKGLGREQLLARL 93


>gi|298528115|ref|ZP_07015519.1| protein of unknown function DUF167 [Desulfonatronospira
           thiodismutans ASO3-1]
 gi|298511767|gb|EFI35669.1| protein of unknown function DUF167 [Desulfonatronospira
           thiodismutans ASO3-1]
          Length = 119

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 36/88 (40%), Gaps = 7/88 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V L P A +  +  +           +KI V A P  GKANKA+   L++ L + K  
Sbjct: 36  LRVVLKPGADRDEVLGIHA-------GRLKISVKAPPVDGKANKALCIFLSRSLGIRKKQ 88

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           + +     S  K + +      +   L+
Sbjct: 89  VWIQRGLQSRNKDLIVSGVAGTVFNALK 116


>gi|300925051|ref|ZP_07140969.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1]
 gi|300418797|gb|EFK02108.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1]
          Length = 96

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLN-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|21674646|ref|NP_662711.1| hypothetical protein CT1832 [Chlorobium tepidum TLS]
 gi|29839718|sp|Q8KBF5|Y1832_CHLTE RecName: Full=UPF0235 protein CT1832
 gi|21647849|gb|AAM73053.1| conserved hypothetical protein [Chlorobium tepidum TLS]
          Length = 105

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 43/88 (48%), Gaps = 8/88 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P + KSG+A +           +KI + + P    ANK    +LAK L + +SS
Sbjct: 13 LSVRVQPRSSKSGVAGM-------YGEQLKICLKSAPVDNAANKECCELLAKALGVPRSS 65

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90
          + ++   SS  K++ ++      + E L
Sbjct: 66 VSVMKGASSRSKVLKVEGVTPAAVREAL 93


>gi|331664536|ref|ZP_08365442.1| conserved hypothetical protein [Escherichia coli TA143]
 gi|331058467|gb|EGI30448.1| conserved hypothetical protein [Escherichia coli TA143]
          Length = 96

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKHIKIINPQQIPPEI 91


>gi|85704715|ref|ZP_01035816.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217]
 gi|85670533|gb|EAQ25393.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217]
          Length = 84

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 8/77 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +R+ P A ++ I         ++   +++ VT  P+ GKAN A+  +LAK L + KS 
Sbjct: 16 IALRVTPKAARNRIV--------AEDGALRVYVTTVPEDGKANAAVQKLLAKALGVPKSR 67

Query: 64 LRMLSKQSSPLKIIYID 80
          L +L   +S  K+  +D
Sbjct: 68 LSLLRGHTSRDKVFQVD 84


>gi|88859056|ref|ZP_01133697.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2]
 gi|88819282|gb|EAR29096.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2]
          Length = 101

 Score = 89.0 bits (220), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 7/82 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          +  + + + P A +     L           +K+ +TA P  G+AN  ++  LAK+  ++
Sbjct: 10 ILTLRLYVQPKASQDKFIGLH-------GNELKVAITAPPVDGQANSHLIKFLAKQCKVA 62

Query: 61 KSSLRMLSKQSSPLKIIYIDKD 82
          K+ + +        K + I K 
Sbjct: 63 KNQVCIKKGLQGRHKEVQISKP 84


>gi|307310426|ref|ZP_07590074.1| protein of unknown function DUF167 [Escherichia coli W]
 gi|306909321|gb|EFN39816.1| protein of unknown function DUF167 [Escherichia coli W]
 gi|315062259|gb|ADT76586.1| conserved hypothetical protein [Escherichia coli W]
 gi|323377157|gb|ADX49425.1| protein of unknown function DUF167 [Escherichia coli KO11]
          Length = 96

 Score = 88.6 bits (219), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+AN  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSYLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|15640485|ref|NP_230112.1| hypothetical protein VC0458 [Vibrio cholerae O1 biovar El Tor
          str. N16961]
 gi|153823175|ref|ZP_01975842.1| conserved hypothetical protein [Vibrio cholerae B33]
 gi|229509068|ref|ZP_04398556.1| hypothetical protein VCE_000471 [Vibrio cholerae B33]
 gi|229519736|ref|ZP_04409179.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9]
 gi|229606248|ref|YP_002876896.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236]
 gi|254850689|ref|ZP_05240039.1| conserved hypothetical protein [Vibrio cholerae MO10]
 gi|255744295|ref|ZP_05418248.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101]
 gi|262147280|ref|ZP_06028079.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1]
 gi|29839647|sp|Q9KUQ7|Y458_VIBCH RecName: Full=UPF0235 protein VC_0458
 gi|9654883|gb|AAF93631.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor
          str. N16961]
 gi|126519301|gb|EAZ76524.1| conserved hypothetical protein [Vibrio cholerae B33]
 gi|229344425|gb|EEO09400.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9]
 gi|229353993|gb|EEO18927.1| hypothetical protein VCE_000471 [Vibrio cholerae B33]
 gi|229368903|gb|ACQ59326.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236]
 gi|254846394|gb|EET24808.1| conserved hypothetical protein [Vibrio cholerae MO10]
 gi|255738235|gb|EET93627.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101]
 gi|262031274|gb|EEY49889.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1]
          Length = 96

 Score = 88.6 bits (219), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EI  L++
Sbjct: 66 VVVEKGELGRHKQVRILQPSQIPAEIAALIE 96


>gi|121590704|ref|ZP_01678036.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
 gi|153819156|ref|ZP_01971823.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
 gi|153826562|ref|ZP_01979229.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
 gi|227080668|ref|YP_002809219.1| hypothetical protein VCM66_0443 [Vibrio cholerae M66-2]
 gi|229507096|ref|ZP_04396602.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286]
 gi|298501011|ref|ZP_07010812.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
 gi|254803911|sp|C3LRX5|Y443_VIBCM RecName: Full=UPF0235 protein VCM66_0443
 gi|121547435|gb|EAX57544.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
 gi|126510301|gb|EAZ72895.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
 gi|149739654|gb|EDM53868.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
 gi|227008556|gb|ACP04768.1| conserved hypothetical protein [Vibrio cholerae M66-2]
 gi|229355841|gb|EEO20761.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286]
 gi|297540259|gb|EFH76319.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
          Length = 96

 Score = 88.6 bits (219), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
          + +   +    K + I    +   EI  L++
Sbjct: 66 VVVEKGELGRHKQVRILQPSQIPAEIAALIE 96


>gi|312116202|ref|YP_004013798.1| hypothetical protein Rvan_3519 [Rhodomicrobium vannielii ATCC
           17100]
 gi|311221331|gb|ADP72699.1| protein of unknown function DUF167 [Rhodomicrobium vannielii ATCC
           17100]
          Length = 123

 Score = 88.6 bits (219), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 3/90 (3%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VRL P A  + +A +E          +K  VT  P+ GKAN A++ ++A  L + K
Sbjct: 22  VLLHVRLTPKASSARVAGVEAFDGKP---VLKAYVTTPPEDGKANAALVVLVASWLGVPK 78

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           SS+ M + Q S LK + +     ++   + 
Sbjct: 79  SSVSMAAGQKSRLKTVAVAGKADDLLAKIA 108


>gi|307261870|ref|ZP_07543532.1| hypothetical protein appser12_14270 [Actinobacillus
          pleuropneumoniae serovar 12 str. 1096]
 gi|306868417|gb|EFN00232.1| hypothetical protein appser12_14270 [Actinobacillus
          pleuropneumoniae serovar 12 str. 1096]
          Length = 97

 Score = 88.6 bits (219), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +TA P  G AN  +L  L+K   + K
Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITALPVDGAANAHLLKYLSKLFKVPK 65

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          SS+ +   +    K +++  + K I + ++  
Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLIPKEIEAL 96


>gi|73856987|gb|AAZ89694.1| conserved hypothetical protein [Shigella sonnei Ss046]
          Length = 100

 Score = 88.6 bits (219), Expect = 2e-16,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 10/90 (11%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+ N  ++  L K+  ++KS 
Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQGNSHLVKFLGKQFRVAKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90
          + +   +    K I I        EI  L+
Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEIAALI 99


>gi|114332111|ref|YP_748333.1| hypothetical protein Neut_2146 [Nitrosomonas eutropha C91]
 gi|122313207|sp|Q0AE64|Y2146_NITEC RecName: Full=UPF0235 protein Neut_2146
 gi|114309125|gb|ABI60368.1| protein of unknown function DUF167 [Nitrosomonas eutropha C91]
          Length = 99

 Score = 88.6 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 18/93 (19%), Positives = 39/93 (41%), Gaps = 10/93 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A+++    +           +KIK+ A P  GKAN+A+   LAK+  +    
Sbjct: 14 LKLYIQPGARQTEAIGVH-------GEELKIKLAAPPMDGKANRALAVFLAKRFNVPLKH 66

Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQNN 93
          + +     S  K++ I       + +   ++  
Sbjct: 67 ITLKWGAQSRHKVVEIYQPVNGPEVLFNEIRAE 99


>gi|307824763|ref|ZP_07654986.1| protein of unknown function DUF167 [Methylobacter tundripaludum
          SV96]
 gi|307734121|gb|EFO04975.1| protein of unknown function DUF167 [Methylobacter tundripaludum
          SV96]
          Length = 88

 Score = 88.6 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 8/85 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           + + P A K   A L           +K++V A P  GKAN+ ++A +A +  +SKS+ 
Sbjct: 2  NLHVQPKASKDEWAGLH-------GERLKLRVKAAPVDGKANQHLIAFIADEFGVSKSAC 54

Query: 65 RMLSKQSSPLKIIYIDKDCKEITEL 89
          ++++ +S   K I I    K++  L
Sbjct: 55 KLITGESGREKRIAI-NSPKKLPSL 78


>gi|320352454|ref|YP_004193793.1| hypothetical protein Despr_0318 [Desulfobulbus propionicus DSM
           2032]
 gi|320120956|gb|ADW16502.1| protein of unknown function DUF167 [Desulfobulbus propionicus DSM
           2032]
          Length = 110

 Score = 88.6 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 27/90 (30%), Positives = 46/90 (51%), Gaps = 8/90 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +++ P A  + +A L+          +K++VT  P  GKAN+A++A LAK   L KSS
Sbjct: 21  LRLQVQPRAAANHLAGLQ-------GDMLKLRVTTPPVDGKANQAVVAYLAKLFHLPKSS 73

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92
           + + S   S  K + I     +E+  +L  
Sbjct: 74  VVLKSGHQSRGKTVVIASGHEQEVRAVLAA 103


>gi|53728760|ref|ZP_00135410.2| COG1872: Uncharacterized conserved protein [Actinobacillus
          pleuropneumoniae serovar 1 str. 4074]
 gi|126208844|ref|YP_001054069.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae
          L20]
 gi|303250781|ref|ZP_07336976.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae
          serovar 6 str. Femo]
 gi|303252447|ref|ZP_07338612.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae
          serovar 2 str. 4226]
 gi|307246303|ref|ZP_07528382.1| hypothetical protein appser1_15050 [Actinobacillus
          pleuropneumoniae serovar 1 str. 4074]
 gi|307248416|ref|ZP_07530437.1| hypothetical protein appser2_13900 [Actinobacillus
          pleuropneumoniae serovar 2 str. S1536]
 gi|307250644|ref|ZP_07532582.1| hypothetical protein appser4_14180 [Actinobacillus
          pleuropneumoniae serovar 4 str. M62]
 gi|307253024|ref|ZP_07534909.1| hypothetical protein appser6_15320 [Actinobacillus
          pleuropneumoniae serovar 6 str. Femo]
 gi|307255287|ref|ZP_07537100.1| hypothetical protein appser9_15200 [Actinobacillus
          pleuropneumoniae serovar 9 str. CVJ13261]
 gi|307257450|ref|ZP_07539217.1| hypothetical protein appser10_14450 [Actinobacillus
          pleuropneumoniae serovar 10 str. D13039]
 gi|307259722|ref|ZP_07541443.1| hypothetical protein appser11_15170 [Actinobacillus
          pleuropneumoniae serovar 11 str. 56153]
 gi|166200384|sp|A3N228|Y1380_ACTP2 RecName: Full=UPF0235 protein APL_1380
 gi|126097636|gb|ABN74464.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae
          serovar 5b str. L20]
 gi|302648720|gb|EFL78911.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae
          serovar 2 str. 4226]
 gi|302650386|gb|EFL80547.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae
          serovar 6 str. Femo]
 gi|306852773|gb|EFM84999.1| hypothetical protein appser1_15050 [Actinobacillus
          pleuropneumoniae serovar 1 str. 4074]
 gi|306855058|gb|EFM87240.1| hypothetical protein appser2_13900 [Actinobacillus
          pleuropneumoniae serovar 2 str. S1536]
 gi|306857316|gb|EFM89434.1| hypothetical protein appser4_14180 [Actinobacillus
          pleuropneumoniae serovar 4 str. M62]
 gi|306859482|gb|EFM91510.1| hypothetical protein appser6_15320 [Actinobacillus
          pleuropneumoniae serovar 6 str. Femo]
 gi|306861736|gb|EFM93717.1| hypothetical protein appser9_15200 [Actinobacillus
          pleuropneumoniae serovar 9 str. CVJ13261]
 gi|306864030|gb|EFM95946.1| hypothetical protein appser10_14450 [Actinobacillus
          pleuropneumoniae serovar 10 str. D13039]
 gi|306866190|gb|EFM98057.1| hypothetical protein appser11_15170 [Actinobacillus
          pleuropneumoniae serovar 11 str. 56153]
          Length = 97

 Score = 88.6 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +TA P  G AN  +L  L+K   + K
Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITALPVDGAANAHLLKYLSKLFKVPK 65

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          SS+ +   +    K +++  + K I + ++  
Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLIPKEIEAL 96


>gi|113461793|ref|YP_719862.1| hypothetical protein HS_1657 [Haemophilus somnus 129PT]
 gi|170718106|ref|YP_001785139.1| hypothetical protein HSM_1819 [Haemophilus somnus 2336]
 gi|123132241|sp|Q0I525|Y1657_HAES1 RecName: Full=UPF0235 protein HS_1657
 gi|226696075|sp|B0UWD6|Y1819_HAES2 RecName: Full=UPF0235 protein HSM_1819
 gi|112823836|gb|ABI25925.1| conserved hypothetical protein [Haemophilus somnus 129PT]
 gi|168826235|gb|ACA31606.1| protein of unknown function DUF167 [Haemophilus somnus 2336]
          Length = 99

 Score = 88.6 bits (219), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  +  L           +KI +TA P  G+AN  +L  L+K   ++KS
Sbjct: 12 RLRIFLQPKASKDHLIGL-------YDNALKISITAPPIDGQANAHLLKFLSKTFKVAKS 64

Query: 63 SLRMLSKQSSPLKIIYI 79
           + +   + S  K I I
Sbjct: 65 QIILEKGELSRHKQILI 81


>gi|332039865|gb|EGI76260.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624]
          Length = 115

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 21/83 (25%), Positives = 41/83 (49%), Gaps = 7/83 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V + ++PNA ++ I  L           +K+++ A P  GKAN+A+   LAK L++  
Sbjct: 23 VLVDLHVMPNASRTQIQGL-------FDGALKVRLQAPPVDGKANEALRVWLAKTLSIPN 75

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84
          SS+ +    ++  K +++     
Sbjct: 76 SSVTLQHGATARRKQLHVAAHSA 98


>gi|253701327|ref|YP_003022516.1| hypothetical protein GM21_2722 [Geobacter sp. M21]
 gi|251776177|gb|ACT18758.1| protein of unknown function DUF167 [Geobacter sp. M21]
          Length = 99

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V + P A +S I         +    +++++T+ P +  ANK  + ++AK L ++KS +
Sbjct: 17 TVHVQPRASRSEICG-------AKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKV 69

Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91
           + S   S  K++ ++  D   +  LL+
Sbjct: 70 SIRSGAKSRHKVVKVEGVDHDALLSLLK 97


>gi|94266431|ref|ZP_01290126.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1]
 gi|93452973|gb|EAT03472.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1]
          Length = 121

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 22/88 (25%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VR  P A ++ +A        +    ++I+V A P  GKAN+A+L  LA +  L +++
Sbjct: 38  LRVRAQPGAARTEVAG-------TYGARLRIRVAAPPVDGKANRALLTFLASRCGLVRNA 90

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELL 90
           + ++  Q    K+  ++    +++T  L
Sbjct: 91  VTLVGGQRGRDKLFRLEGIGPEQLTTCL 118


>gi|78186226|ref|YP_374269.1| hypothetical protein Plut_0338 [Chlorobium luteolum DSM 273]
 gi|123583481|sp|Q3B605|Y338_PELLD RecName: Full=UPF0235 protein Plut_0338
 gi|78166128|gb|ABB23226.1| conserved hypothetical protein [Chlorobium luteolum DSM 273]
          Length = 101

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          VR+ P + KS ++             +K+ + A P    AN+    + +K  ++  S + 
Sbjct: 15 VRVQPRSSKSAVSG-------PYGNALKVTLKAAPVDDAANRECCRLFSKLFSIPDSRVS 67

Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96
          ++S  +S  K + ++    E    +  N S+
Sbjct: 68 IVSGAASRTKSVMLEGLSAEEARSILRNSSI 98


>gi|197117881|ref|YP_002138308.1| hypothetical protein Gbem_1494 [Geobacter bemidjiensis Bem]
 gi|197087241|gb|ACH38512.1| protein of unknown function DUF167 [Geobacter bemidjiensis Bem]
          Length = 99

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V + P A +S I         +    +++++T+ P +  ANK  + ++AK L ++KS +
Sbjct: 17 TVHVQPRASRSEICG-------AKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKV 69

Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91
           + S   S  K++ ++  D   +  LL+
Sbjct: 70 SIKSGAKSRHKVVKVEGVDHDALLSLLK 97


>gi|157743130|gb|AAI49509.1| C21H15orf40 protein [Bos taurus]
          Length = 132

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 9/94 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++ +   P +K++ +  +           + + + A P +G+AN  +   L+K L L K
Sbjct: 42  VSIAIHAKPGSKQNAVTDVTT-------EAVSVAIAAPPTEGEANAELCRYLSKVLELRK 94

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93
           S + +     S  K++ +      +EI E L+  
Sbjct: 95  SDVVLDKGGKSREKVVKLLASTPPEEILEKLKKQ 128


>gi|229525149|ref|ZP_04414554.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis
          VL426]
 gi|229338730|gb|EEO03747.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis
          VL426]
          Length = 96

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  GKAN  +   LAK   ++K S
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65

Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91
          + +   +    K + I +      EI+ L++
Sbjct: 66 VVVEKGELGRHKQVRILQPSLIPAEISALIE 96


>gi|259089201|ref|NP_001158638.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss]
 gi|225705490|gb|ACO08591.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss]
          Length = 182

 Score = 88.3 bits (218), Expect = 3e-16,   Method: Composition-based stats.
 Identities = 21/96 (21%), Positives = 41/96 (42%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V   P +K++ I  + I           + + A P  G+AN  ++  L+K L L +
Sbjct: 93  VTISVHAKPGSKQNAITDVSIEAVG-------VAIAAPPTGGEANAELVRYLSKVLELKR 145

Query: 62  SSLRMLSKQSSPLKIIYIDKD--CKEITELLQNNDS 95
           S + +     S  KII +      +++ + L+   S
Sbjct: 146 SEVVLDKGSRSREKIIKVTGSLTPEQVLDRLKQEAS 181


>gi|293348273|ref|XP_002726816.1| PREDICTED: hypothetical protein [Rattus norvegicus]
          Length = 178

 Score = 87.9 bits (217), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L             + + A P +G+AN  +   L+K L L K
Sbjct: 88  VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 140

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 141 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 176


>gi|256379751|ref|YP_003103411.1| hypothetical protein Amir_5752 [Actinosynnema mirum DSM 43827]
 gi|255924054|gb|ACU39565.1| protein of unknown function DUF167 [Actinosynnema mirum DSM
          43827]
          Length = 90

 Score = 87.9 bits (217), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 6/84 (7%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M    VR+ P +K+  +          D   + + V A   +GKAN+A+   LAK   + 
Sbjct: 1  MLKFAVRVKPGSKRDAVGG------RWDERALVVSVAAPAVEGKANEAVRRALAKAFGVR 54

Query: 61 KSSLRMLSKQSSPLKIIYIDKDCK 84
          +  + ++S +    K++ ID    
Sbjct: 55 RQDVEIVSGERGRDKVVVIDPAPD 78


>gi|319407608|emb|CBI81258.1| conserved hypothetical protein [Bartonella sp. 1-1C]
          Length = 101

 Score = 87.9 bits (217), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRLIP A    I  +E   D     ++ I++ A P+ GKANKA++  LAK+  +  S 
Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + + S  +S  K +Y  K  KE+ ++LQ+
Sbjct: 70 ISLGSGTTSHYKQLYFSKYLKEVEQILQS 98


>gi|288942590|ref|YP_003444830.1| hypothetical protein Alvin_2893 [Allochromatium vinosum DSM 180]
 gi|288897962|gb|ADC63798.1| protein of unknown function DUF167 [Allochromatium vinosum DSM
          180]
          Length = 100

 Score = 87.9 bits (217), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 18/77 (23%), Positives = 38/77 (49%), Gaps = 5/77 (6%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +R+ P A K   A     +      + ++++ A P +GKAN+A+   +A    ++ S 
Sbjct: 13 LRLRVQPRAPKDAFA-----EPDPSGDYYRVRLKAPPVEGKANQALRRFVADAFEVTLSQ 67

Query: 64 LRMLSKQSSPLKIIYID 80
          + +LS + +  K + I 
Sbjct: 68 VEILSGEQARYKRLRIR 84


>gi|319404616|emb|CBI78222.1| conserved hypothetical protein [Bartonella rochalimae ATCC
          BAA-1498]
          Length = 101

 Score = 87.9 bits (217), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRLIP A    I  +E   D     ++ I++ A P+ GKANKA++  LAK+  +  S 
Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          + + S  +S  K +Y  K  KE+ ++LQ+
Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEVEKILQS 98


>gi|254472462|ref|ZP_05085862.1| conserved hypothetical protein [Pseudovibrio sp. JE062]
 gi|211958745|gb|EEA93945.1| conserved hypothetical protein [Pseudovibrio sp. JE062]
          Length = 105

 Score = 87.9 bits (217), Expect = 4e-16,   Method: Composition-based stats.
 Identities = 25/89 (28%), Positives = 51/89 (57%), Gaps = 2/89 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VRL P + K  I  +    D    +  K++  A P+KG ANKA+ A+ AK L++ KSS
Sbjct: 16  ITVRLTPKSSKDQIEKIGAQSDGRPLVLAKVR--AVPEKGAANKAVAALFAKALSVPKSS 73

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
             +++  ++ +K + +  + +++ + L++
Sbjct: 74  AELIAGSTARIKTLRVLGEPQDLAKRLED 102


>gi|114778113|ref|ZP_01453000.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1]
 gi|114551531|gb|EAU54085.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1]
          Length = 101

 Score = 87.9 bits (217), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 23/87 (26%), Positives = 41/87 (47%), Gaps = 7/87 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V +   P A+K  +  +           +KI V    Q GKAN+A++  +A  L LS++ 
Sbjct: 16 VNIHAQPGARKPALRGMH-------GDALKIAVAEAAQDGKANEAIVRFIADALNLSRAD 68

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + + S  +S  K +++  D  E+   L
Sbjct: 69 VDVASGHTSRRKRLFLHGDGSELRARL 95


>gi|161986454|ref|YP_311929.2| hypothetical protein SSON_3107 [Shigella sonnei Ss046]
 gi|323167995|gb|EFZ53684.1| hypothetical protein SS53G_1711 [Shigella sonnei 53G]
          Length = 96

 Score = 87.9 bits (217), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 7/86 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P A +  I  L           +K+ +TA P  G+ N  ++  L K+  ++KS 
Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQGNSHLVKFLGKQFRVAKSQ 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89
          + +   +    K I I    +   E+
Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91


>gi|301789543|ref|XP_002930186.1| PREDICTED: UPF0235 protein C15orf40 homolog [Ailuropoda
           melanoleuca]
          Length = 214

 Score = 87.5 bits (216), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 19/94 (20%), Positives = 40/94 (42%), Gaps = 9/94 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  +           + + + A P +G+AN  +   L+K L L K
Sbjct: 124 VTIAIHAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 176

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93
           S + +     S  K++ +      +EI E L+  
Sbjct: 177 SDVVLDKGGKSREKVVKLLASTTTEEILEKLKQQ 210


>gi|120602738|ref|YP_967138.1| hypothetical protein Dvul_1694 [Desulfovibrio vulgaris DP4]
 gi|120562967|gb|ABM28711.1| protein of unknown function DUF167 [Desulfovibrio vulgaris DP4]
          Length = 108

 Score = 87.5 bits (216), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          ++V + P AKK G+A +           ++++++A     KAN+ +   +A  L +  + 
Sbjct: 20 LLVWVQPGAKKDGLAGV-------ADGRLRVRLSAPAVDNKANRGLERYMASLLGVRPAR 72

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + + S Q+S  K + I+ D +
Sbjct: 73 VSVASGQTSRRKRVVIESDAE 93


>gi|145590207|ref|YP_001156804.1| hypothetical protein Pnuc_2029 [Polynucleobacter necessarius
          subsp. asymbioticus QLW-P1DMWA-1]
 gi|145048613|gb|ABP35240.1| protein of unknown function DUF167 [Polynucleobacter necessarius
          subsp. asymbioticus QLW-P1DMWA-1]
          Length = 98

 Score = 87.5 bits (216), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 9/90 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +   P AK + +  L           +KI + A   + KAN+ +L  L+K+L + +  
Sbjct: 15 LNLHCQPGAKVTKVVGLH-------DGCLKISLQAPAIENKANELLLGWLSKQLKIPQKQ 67

Query: 64 LRMLSKQSSPLKIIYIDKD--CKEITELLQ 91
          ++ +S Q+S +K + I      ++IT++LQ
Sbjct: 68 IQFISGQNSRIKRVEIWGSITPEQITQILQ 97


>gi|301064653|ref|ZP_07205047.1| conserved hypothetical protein TIGR00251 [delta proteobacterium
          NaphS2]
 gi|300441273|gb|EFK05644.1| conserved hypothetical protein TIGR00251 [delta proteobacterium
          NaphS2]
          Length = 95

 Score = 87.5 bits (216), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 26/93 (27%), Positives = 45/93 (48%), Gaps = 8/93 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V+++P + ++ I               +IK+TA P +GKANKA++  LAKK    K
Sbjct: 10 VVIRVKVLPRSSRTEIVG-------KTDGIYRIKLTAPPVEGKANKALINFLAKKTGSPK 62

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
            +R++  + S  K I I+     +I + L   
Sbjct: 63 QKIRIVKGEQSRNKTIRIENLSSDDILKYLNEE 95


>gi|46579785|ref|YP_010593.1| hypothetical protein DVU1374 [Desulfovibrio vulgaris str.
          Hildenborough]
 gi|46449200|gb|AAS95852.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
          Hildenborough]
 gi|311233576|gb|ADP86430.1| protein of unknown function DUF167 [Desulfovibrio vulgaris RCH1]
          Length = 106

 Score = 87.5 bits (216), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          ++V + P AKK G+A +           ++++++A     KAN+ +   +A  L +  + 
Sbjct: 18 LLVWVQPGAKKDGLAGV-------ADGRLRVRLSAPAVDNKANRGLERYMASLLGVRPAR 70

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + + S Q+S  K + I+ D +
Sbjct: 71 VSVASGQTSRRKRVVIESDAE 91


>gi|114763599|ref|ZP_01443004.1| hypothetical protein 1100011001330_R2601_13754 [Pelagibaca
          bermudensis HTCC2601]
 gi|114543879|gb|EAU46891.1| hypothetical protein R2601_13754 [Roseovarius sp. HTCC2601]
          Length = 85

 Score = 87.5 bits (216), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 8/80 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +R+ P A ++ I          +   +++ VT  P+ GKAN A+  +LAK L L K
Sbjct: 14 ATLELRVTPKASRNEI--------REEGGTLRVYVTTVPEDGKANAAVQKLLAKALGLPK 65

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
          S L ++   +S  K   I+ 
Sbjct: 66 SRLVLVRGATSRDKAFRIEA 85


>gi|165976805|ref|YP_001652398.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae
          serovar 3 str. JL03]
 gi|226734144|sp|B0BQW9|Y1398_ACTPJ RecName: Full=UPF0235 protein APJL_1398
 gi|165876906|gb|ABY69954.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae
          serovar 3 str. JL03]
          Length = 98

 Score = 87.5 bits (216), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 9/93 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + L P A +  I  L           +KI +TA P  G AN  +L  L+K   + K
Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DSELKIAITAPPVDGAANAHLLKYLSKLFKVPK 65

Query: 62 SSLRMLSKQSSPL-KIIYIDKDCKEITELLQNN 93
          SS+ +   +     K +++  + K I + ++  
Sbjct: 66 SSIVLEKGELQRHNKQLFVP-EPKLIPKEIEAL 97


>gi|167386103|ref|XP_001737619.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165899553|gb|EDR26129.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 118

 Score = 87.5 bits (216), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 21/93 (22%), Positives = 47/93 (50%), Gaps = 8/93 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + + + PNAK S +  +E          +K+ + A P  GKAN  ++A +A    + K
Sbjct: 33  VIIEIEIKPNAKTSELQGVE-------DGILKVAIDAPPIDGKANTEVIAFMASTFGIKK 85

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           S++ ++  Q+S  K +  +    +++ +++Q+ 
Sbjct: 86  SNVSLIKGQTSHHKTLQFENWTREKVLQIIQSK 118


>gi|225873890|ref|YP_002755349.1| hypothetical protein ACP_2305 [Acidobacterium capsulatum ATCC
          51196]
 gi|225793057|gb|ACO33147.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
          51196]
          Length = 112

 Score = 87.5 bits (216), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 20/80 (25%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A +S    +    +      +++ + A P  G+AN+ +L  LA++L L  
Sbjct: 14 AVLAVRVTPRASRSSFQGV---LEKEGQTMLRVALHAPPIDGRANEELLDFLARQLDLPG 70

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
          SSL ++    S  K++ +  
Sbjct: 71 SSLEIIRGLQSREKLVRMTG 90


>gi|57640703|ref|YP_183181.1| hypothetical protein TK0768 [Thermococcus kodakarensis KOD1]
 gi|73921163|sp|Q5JHB2|Y768_PYRKO RecName: Full=UPF0235 protein TK0768
 gi|57159027|dbj|BAD84957.1| hypothetical protein, conserved, YggU family [Thermococcus
          kodakarensis KOD1]
          Length = 94

 Score = 87.5 bits (216), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +++ + P AKK+ I  ++          +K+++ A P +GKANK ++   +K L    
Sbjct: 11 VLIMIYVQPKAKKNAIEGVDG-----WRGRLKVRIAAPPVEGKANKEVVKFFSKLLG--- 62

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          + + ++  ++S  K + +     +E+ + L
Sbjct: 63 AEVNIVRGETSREKDLLVKGLSVEEVRKKL 92


>gi|212223789|ref|YP_002307025.1| hypothetical protein TON_0641 [Thermococcus onnurineus NA1]
 gi|226707988|sp|B6YUU2|Y641_THEON RecName: Full=UPF0235 protein TON_0641
 gi|212008746|gb|ACJ16128.1| hypothetical protein, conserved [Thermococcus onnurineus NA1]
          Length = 94

 Score = 87.1 bits (215), Expect = 6e-16,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 44/90 (48%), Gaps = 9/90 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +++ + P AKK+ I  ++          +K+K+ A P +GKANK ++   +K L    
Sbjct: 11 AVILLYVQPKAKKNEIEGVD-----EWRGRLKVKIKAPPVEGKANKEVVRFFSKMLG--- 62

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          + + ++   +S  K + +     KE+ + L
Sbjct: 63 TEVEIIRGGTSREKDLLVKGFSSKEVLKKL 92


>gi|332107965|gb|EGJ09189.1| hypothetical protein RBXJA2T_02622 [Rubrivivax benzoatilyticus
          JA2]
          Length = 103

 Score = 87.1 bits (215), Expect = 7e-16,   Method: Composition-based stats.
 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 7/89 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + V ++PNA+++G   L           +++++ A P  GKAN+ +LA LA +L L K 
Sbjct: 17 RLRVAVVPNARRTGADGLH-------DGALRVRLNAPPVDGKANETLLAWLADELDLPKR 69

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          ++R+   Q+   K I +D   + +   L 
Sbjct: 70 AVRLTHGQTGRRKTIELDAAPEAVAAWLA 98


>gi|328780498|ref|XP_392249.2| PREDICTED: UPF0235 protein C15orf40 homolog [Apis mellifera]
          Length = 144

 Score = 87.1 bits (215), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 8/93 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + ++  P AK + I  +             + ++A P +G+AN  ++  LA  L + K
Sbjct: 56  VTIKIQAKPGAKHNNITDISEDAVG-------VAISAPPVEGEANTELVKYLASVLGMRK 108

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           S + +     S  KI+ +     +++ E L+  
Sbjct: 109 SDVTLDRGSKSRQKIVVVSGISVEKVLEKLKGE 141


>gi|84685306|ref|ZP_01013204.1| hypothetical protein 1099457000258_RB2654_10573 [Maritimibacter
          alkaliphilus HTCC2654]
 gi|84666463|gb|EAQ12935.1| hypothetical protein RB2654_10573 [Rhodobacterales bacterium
          HTCC2654]
          Length = 85

 Score = 86.7 bits (214), Expect = 9e-16,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 7/77 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P A ++ ++         +  H+K+ VT  P+ GKA  A++ +LA  L ++KS 
Sbjct: 16 LTVRVTPKASRNAVS-------VDEDGHLKVSVTTVPEDGKATAAVVKLLAHALGVAKSD 68

Query: 64 LRMLSKQSSPLKIIYID 80
          L ++   +S  K+  ID
Sbjct: 69 LTLVRGATSRDKVFRID 85


>gi|170742084|ref|YP_001770739.1| hypothetical protein M446_3939 [Methylobacterium sp. 4-46]
 gi|226706076|sp|B0UH52|Y3939_METS4 RecName: Full=UPF0235 protein M446_3939
 gi|168196358|gb|ACA18305.1| protein of unknown function DUF167 [Methylobacterium sp. 4-46]
          Length = 104

 Score = 86.7 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 2/87 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR  P   +  +  +E   D      +K++V A P+ G AN A+ A+LA+ L     +
Sbjct: 14 VRVRATPRGGRDAVEGIETRADGLP--VLKVRVRAAPEDGAANAAIRAVLAEALGCPARA 71

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + + +  ++ +K+  +  D + +   +
Sbjct: 72 VTLAAGATARVKLFRVAGDGQALAARI 98


>gi|157123809|ref|XP_001653923.1| hypothetical protein AaeL_AAEL009675 [Aedes aegypti]
 gi|108874207|gb|EAT38432.1| conserved hypothetical protein [Aedes aegypti]
          Length = 149

 Score = 86.7 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 10/96 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + ++  P AK +GI  +             +++ A P  G+AN  ++  L+K L L K
Sbjct: 58  VLIKIQAKPGAKTNGITDIGEEGVG-------VQIAAPPVDGEANTELVKYLSKLLELRK 110

Query: 62  SSLRMLSKQSSPLKIIYIDKD---CKEITELLQNND 94
           S + +     S  K I ++K     +++ ++ +   
Sbjct: 111 SDVSLDRGSKSRQKTIVLEKGCRMPEQVLDVFRKEA 146


>gi|171057075|ref|YP_001789424.1| hypothetical protein Lcho_0384 [Leptothrix cholodnii SP-6]
 gi|170774520|gb|ACB32659.1| protein of unknown function DUF167 [Leptothrix cholodnii SP-6]
          Length = 125

 Score = 86.7 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 7/83 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V ++PNA+++ +  L           +++++ A P  G AN A+   LA +L +SK  
Sbjct: 35  IDVAVVPNARRTEVVGLHDQA-------LRLRLAAPPVDGAANDALQRWLADELGVSKQQ 87

Query: 64  LRMLSKQSSPLKIIYIDKDCKEI 86
           + +L   S   K + +     ++
Sbjct: 88  VSLLRGASGRRKRLRVQVPPAQM 110


>gi|67482694|ref|XP_656664.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56473879|gb|EAL51278.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 118

 Score = 86.7 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 8/93 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V + PNAK S I  +E          +K+ + + P  GKAN  ++A +A    + K
Sbjct: 33  VIIEVEIKPNAKTSEIQGVE-------DGLLKVSINSPPVDGKANTEVIAFMASTFGIKK 85

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           S+++++  Q+S  K +  +    +++ +++Q  
Sbjct: 86  SNVKLIKGQTSHHKTLQFENWTREKVLQIIQAK 118


>gi|310822209|ref|YP_003954567.1| hypothetical protein STAUR_4962 [Stigmatella aurantiaca DW4/3-1]
 gi|309395281|gb|ADO72740.1| conserved uncharacterized protein [Stigmatella aurantiaca
          DW4/3-1]
          Length = 98

 Score = 86.7 bits (214), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 18/94 (19%), Positives = 43/94 (45%), Gaps = 7/94 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + P A ++ +              +K+++ A P  G+AN A++  LAK+L L +
Sbjct: 12 VELAVLVQPRASRTRVVG-------EHDGMLKLQLAAPPVDGEANAALVEFLAKRLGLPR 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
            + +++  ++  K +++        E + +  S
Sbjct: 65 RQVTLVAGDAARRKRVFLAGVDAARVEAVMSQAS 98


>gi|149456878|ref|XP_001519576.1| PREDICTED: similar to chromosome 15 open reading frame 40, partial
           [Ornithorhynchus anatinus]
          Length = 119

 Score = 86.3 bits (213), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 24/98 (24%), Positives = 39/98 (39%), Gaps = 9/98 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V   P AK++ +  + +           + + A P +G+AN  +   LAK L L K
Sbjct: 29  VTIAVHAKPGAKQNAVTDVSVEAVG-------VAIAAPPSEGEANAELCRYLAKILELRK 81

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97
           S + +     S  K+I I       EI   L+     T
Sbjct: 82  SDVVLDRGGKSREKVIKILSSTTPDEILAKLKKQTETT 119


>gi|154705806|ref|YP_001424486.1| hypothetical cytosolic protein [Coxiella burnetii Dugway
          5J108-111]
 gi|154355092|gb|ABS76554.1| hypothetical cytosolic protein [Coxiella burnetii Dugway
          5J108-111]
          Length = 92

 Score = 86.3 bits (213), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 46/79 (58%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + P AK++ I+            H+KI++ A P +GKANKA++  LA++L L+ SS
Sbjct: 7  LTVYIQPGAKQTQISGKH-------GEHIKIRLQAPPTEGKANKALIDFLAQRLKLNPSS 59

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + ++  + + LK I I+  
Sbjct: 60 ITIIRGEKARLKTIAIESS 78


>gi|85859583|ref|YP_461785.1| putative cytoplasmic protein [Syntrophus aciditrophicus SB]
 gi|85722674|gb|ABC77617.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
          Length = 118

 Score = 86.3 bits (213), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 7/83 (8%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          V ++P + K  +A        +    ++IK+TA P  GKAN   L  LA  L + K  + 
Sbjct: 17 VHVLPRSAKCALAG-------AQEGALRIKLTAPPVDGKANDECLEFLAGILGVKKGQMD 69

Query: 66 MLSKQSSPLKIIYIDKDCKEITE 88
          ++S  +S  KI+ I    +E  E
Sbjct: 70 IISGHTSRRKIVQIMNVPREPLE 92


>gi|194334667|ref|YP_002016527.1| hypothetical protein Paes_1868 [Prosthecochloris aestuarii DSM
          271]
 gi|226696156|sp|B4S4I4|Y1868_PROA2 RecName: Full=UPF0235 protein Paes_1868
 gi|194312485|gb|ACF46880.1| protein of unknown function DUF167 [Prosthecochloris aestuarii
          DSM 271]
          Length = 104

 Score = 86.3 bits (213), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 7/86 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V+  P + +S I              +KI + A P    AN      LAK L ++ S +
Sbjct: 14 FVKAQPRSSRSAIIG-------EYDGKIKISLKAAPVDDAANVECCRFLAKSLGVASSRV 66

Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90
          R+LS  SS +K + ID         L
Sbjct: 67 RILSGHSSRIKRLTIDGMGAAEAATL 92


>gi|189347519|ref|YP_001944048.1| hypothetical protein Clim_2040 [Chlorobium limicola DSM 245]
 gi|189341666|gb|ACD91069.1| protein of unknown function DUF167 [Chlorobium limicola DSM 245]
          Length = 101

 Score = 85.9 bits (212), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 8/88 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V+  P + KS I         +    +K+ + A P    AN+   A+ AK      S L
Sbjct: 13 QVKAQPRSSKSRITG-------AYDRGVKVTLKAAPVDDAANEECCALFAKVFGFPVSRL 65

Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91
           ++S +SS  K + ++    +E++ LL+
Sbjct: 66 CIVSGRSSRNKTLRVEGTSAEEVSRLLR 93


>gi|251793930|ref|YP_003008662.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus
          NJ8700]
 gi|247535329|gb|ACS98575.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus
          NJ8700]
          Length = 97

 Score = 85.9 bits (212), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 7/76 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + L P A K  I  L           +KI++TA P  G+AN  +L  L+K   + KS
Sbjct: 12 RLRIFLQPKAAKDHIVGLH-------DDELKIRITAPPIDGQANAHLLKFLSKLFKVPKS 64

Query: 63 SLRMLSKQSSPLKIIY 78
          S+ +   + +  K + 
Sbjct: 65 SIVLEKGELNCHKQVL 80


>gi|221632637|ref|YP_002521858.1| hypothetical protein trd_0618 [Thermomicrobium roseum DSM 5159]
 gi|221157137|gb|ACM06264.1| DUF167 [Thermomicrobium roseum DSM 5159]
          Length = 102

 Score = 85.6 bits (211), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 28/87 (32%), Positives = 49/87 (56%), Gaps = 8/87 (9%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          V++ P A ++ +A        S    + ++VTA P+ G+AN+A+L +LA+ L L + S+R
Sbjct: 17 VQVQPRAPRAEVAG-------SRRDALLVRVTAPPRDGEANEAVLRLLAETLHLPRGSIR 69

Query: 66 MLSKQSSPLKIIYIDK-DCKEITELLQ 91
          +++  +   K I ID    KE+ E L 
Sbjct: 70 IIAGTAQRRKRIRIDGLTSKELLERLA 96


>gi|158424942|ref|YP_001526234.1| hypothetical protein AZC_3318 [Azorhizobium caulinodans ORS 571]
 gi|158331831|dbj|BAF89316.1| protein of unknown function [Azorhizobium caulinodans ORS 571]
          Length = 104

 Score = 85.6 bits (211), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 2/89 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR  P   +  +  +    D      +KI+V A P+ G A  A+  +LA    ++ S+
Sbjct: 13 VSVRATPKGGRDALDGVSQLSDG--RDVLKIRVRAAPEDGAATAAVAKVLAGAAGVAPSA 70

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          +R+ S  ++ LK+  I  D   +   L+ 
Sbjct: 71 VRLASGATARLKVFRISGDAARLRATLEA 99


>gi|170029973|ref|XP_001842865.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167865325|gb|EDS28708.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 157

 Score = 85.6 bits (211), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 21/97 (21%), Positives = 42/97 (43%), Gaps = 10/97 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K +GI  +E            +++ A P  G+AN  ++  LAK L L K
Sbjct: 65  VLIKILAKPGSKFNGITGIEDEGVG-------VQIAAPPIDGEANTELVKYLAKLLDLRK 117

Query: 62  SSLRMLSKQSSPLKIIYID---KDCKEITELLQNNDS 95
           S + +     S  K I ++   +   ++ E+ +   +
Sbjct: 118 SDVSLDRGSKSRQKTIVLEKGCRTPDQVLEIFRREAT 154


>gi|83945079|ref|ZP_00957445.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii
          HTCC2633]
 gi|83851861|gb|EAP89716.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii
          HTCC2633]
          Length = 101

 Score = 85.6 bits (211), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 4/94 (4%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + VR+ P A ++G        D      +  +V A P KG AN  + A+ AK L + KS
Sbjct: 6  RLFVRVQPRASRAGFDGARAGTDGRI--RLAARVRAAPDKGAANTELCALTAKTLGVPKS 63

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96
          ++ +++  +   K + +    +E  +LL    +L
Sbjct: 64 TVSVIAGATQREKTLLVR--SEESIQLLDAVTAL 95


>gi|303245495|ref|ZP_07331779.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans
           JJ]
 gi|302493344|gb|EFL53206.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans
           JJ]
          Length = 113

 Score = 85.6 bits (211), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 18/81 (22%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V + P   +  +A L           +++++ A   +G+AN A+ A LA+   +    
Sbjct: 28  LRVAVAPGGSRDALAGL-------AEDRLRVRLRAKAVEGQANAALTAFLAECFGVRPRQ 80

Query: 64  LRMLSKQSSPLKIIYIDKDCK 84
           +R++S + S  KI+ I+ + +
Sbjct: 81  VRIVSGEKSRKKIVRINAESE 101


>gi|220923538|ref|YP_002498840.1| hypothetical protein Mnod_3628 [Methylobacterium nodulans ORS
          2060]
 gi|219948145|gb|ACL58537.1| protein of unknown function DUF167 [Methylobacterium nodulans ORS
          2060]
          Length = 102

 Score = 85.6 bits (211), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 2/81 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR  P   +  I  +E   D      +K++V A P+ G AN A+  +L   L     +
Sbjct: 12 VRVRATPRGGRDAIDGIETRADGLS--VLKVRVRAAPEDGAANTAIRDLLKTALGCPARA 69

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          +R+ +  ++ +KI  I+ D +
Sbjct: 70 VRLTAGATARVKIFRIEGDGE 90


>gi|157826313|ref|YP_001494033.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford]
 gi|166228807|sp|A8GQ50|Y6510_RICAH RecName: Full=UPF0235 protein A1C_06510
 gi|157800271|gb|ABV75525.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford]
          Length = 105

 Score = 85.6 bits (211), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 23/84 (27%), Positives = 50/84 (59%), Gaps = 3/84 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++ PN+K++ I++  I  +     ++K+ + ATP+KGKAN+ ++  LAK   LS+
Sbjct: 14 VLLNLKVKPNSKQNLISNFVIINNIP---YLKLSIKATPEKGKANEEIINYLAKAWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
          S++ ++   +  +K I I    ++
Sbjct: 71 SNIEIIKGHTHSVKTILIKNINED 94


>gi|296109159|ref|YP_003616108.1| protein of unknown function DUF167 [Methanocaldococcus infernus
          ME]
 gi|295433973|gb|ADG13144.1| protein of unknown function DUF167 [Methanocaldococcus infernus
          ME]
          Length = 92

 Score = 85.2 bits (210), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + PNAKK+ I   +          +++KV A P +GKANK ++   +K        
Sbjct: 11 LDIIVTPNAKKTEIVGRD-----EWRNRLEVKVKAPPVEGKANKEIIKFFSKLFG----D 61

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + +++ + S  K I I K  KE+ E+L + 
Sbjct: 62 VEIVAGEKSSKKTILIRKPLKEVEEILNSL 91


>gi|311260654|ref|XP_001929217.2| PREDICTED: UPF0235 protein C15orf40 homolog [Sus scrofa]
          Length = 154

 Score = 85.2 bits (210), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 9/94 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K   L K
Sbjct: 64  VTIAIHAKPGSKQNAVTDLTT-------EAVSVAIAAPPSEGEANAELCRYLSKVFELRK 116

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93
           S + +     S  K++ +      +EI E L+  
Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKQ 150


>gi|238650830|ref|YP_002916685.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic]
 gi|259647069|sp|C4K236|Y4990_RICPU RecName: Full=UPF0235 protein RPR_04990
 gi|238624928|gb|ACR47634.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic]
          Length = 105

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 22/84 (26%), Positives = 49/84 (58%), Gaps = 3/84 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +  ++ PN+K++ I++  I  +     ++K+ + A P++GKAN+ ++  LAK+  LS+
Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIP---YLKLSIKAIPEQGKANEEIINYLAKEWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
          S++ ++   +  LK I I    ++
Sbjct: 71 SNIEIIKGHTHSLKTILIKNINED 94


>gi|109082186|ref|XP_001111768.1| PREDICTED: UPF0235 protein C15orf40-like isoform 1 [Macaca mulatta]
          Length = 154

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 21/98 (21%), Positives = 41/98 (41%), Gaps = 9/98 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 64  VTITIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 116

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97
           S + +     S  K++ +      +EI E L+     T
Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKEARKT 154


>gi|152991249|ref|YP_001356971.1| hypothetical protein NIS_1507 [Nitratiruptor sp. SB155-2]
 gi|151423110|dbj|BAF70614.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2]
          Length = 95

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 7/77 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           ++ ++  PNA K+ IA +           +KI + A   +G ANK ++  L+K   ++K
Sbjct: 10 VHMFIKAQPNASKNKIAGI-------LGDSLKIAIKAPAVEGAANKELVKFLSKTFKVAK 62

Query: 62 SSLRMLSKQSSPLKIIY 78
          S +   S ++S  K I 
Sbjct: 63 SDIVFASGETSKRKHIV 79


>gi|126733001|ref|ZP_01748760.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37]
 gi|126706530|gb|EBA05608.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37]
          Length = 84

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A ++ +              +++ VT  P+ GKA  A++ +LAK L + K
Sbjct: 14 ATLAVRVTPKASRNAVE--------RTDDALRVYVTTVPEGGKATAAVVKLLAKALGVPK 65

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S L ++  ++S  K+  +
Sbjct: 66 SRLELVRGETSRDKVFRV 83


>gi|88812461|ref|ZP_01127710.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231]
 gi|88790247|gb|EAR21365.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231]
          Length = 99

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 7/78 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P A +  +              +++++TA P +GKAN+ +   L   L +++S 
Sbjct: 13 LTVRVQPRAARDEL-------KIDADGRLRLRITAPPVEGKANEHLRHFLGHALGVARSQ 65

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + + +  +S  K I +  
Sbjct: 66 VSVATGATSRNKRIVVQN 83


>gi|126306449|ref|XP_001373757.1| PREDICTED: similar to chromosome 15 open reading frame 40,
           [Monodelphis domestica]
          Length = 146

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 9/94 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K++ I  +           + + + A P +G+AN  +   L+K L L KS 
Sbjct: 58  IAIHAKPGSKQNAITDVTTEN-------VSVAIAAPPSEGEANTELCRYLSKVLELRKSD 110

Query: 64  LRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           + +     S  K++ I      +EI E L+    
Sbjct: 111 VILDKGGKSREKVVKILASTTPEEILEKLKRQAE 144


>gi|32475275|ref|NP_868269.1| hypothetical protein RB8260 [Rhodopirellula baltica SH 1]
 gi|47117454|sp|Q7UFY2|Y8260_RHOBA RecName: Full=UPF0235 protein RB8260
 gi|32445816|emb|CAD78547.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 108

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 7/76 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P AKK+ +  L           +K+ V   P+ GKANKA++A LAK L +SK  +
Sbjct: 21 RVRVTPKAKKASVGGLH-------DGALKVSVHTVPEDGKANKAVIASLAKWLRVSKGRV 73

Query: 65 RMLSKQSSPLKIIYID 80
           +++ ++S LK I ++
Sbjct: 74 AIVAGETSRLKTIVVE 89


>gi|189500987|ref|YP_001960457.1| hypothetical protein Cphamn1_2066 [Chlorobium phaeobacteroides
          BS1]
 gi|226701149|sp|B3EMY7|Y2066_CHLPB RecName: Full=UPF0235 protein Cphamn1_2066
 gi|189496428|gb|ACE04976.1| protein of unknown function DUF167 [Chlorobium phaeobacteroides
          BS1]
          Length = 101

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 20/87 (22%), Positives = 39/87 (44%), Gaps = 8/87 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           ++  P + KS I              +K+ + A P  G+AN     +LA+ L +++SS+
Sbjct: 14 SIKAQPRSSKSMITG-------EYDGSIKVNLKAPPVDGEANLECCRLLARTLGVARSSV 66

Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELL 90
           ++S     +K + +      E TE +
Sbjct: 67 EIVSGTRGKMKRVKVFGLSAVEFTEKI 93


>gi|237749382|ref|ZP_04579862.1| predicted protein [Oxalobacter formigenes OXCC13]
 gi|229380744|gb|EEO30835.1| predicted protein [Oxalobacter formigenes OXCC13]
          Length = 100

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 28/91 (30%), Positives = 50/91 (54%), Gaps = 10/91 (10%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + V++ PNAKK+ I        +SD   ++I++ A P  GKAN+A++  +AKKL   K 
Sbjct: 14 RIAVQVSPNAKKTEIV-------SSDGEALRIRLQAPPVDGKANEALVQFIAKKLRTPKR 66

Query: 63 SLRMLSKQSSPLKIIYI---DKDCKEITELL 90
          ++ +    S+  K++ I   D   +E+ + L
Sbjct: 67 NVSITHGLSAKHKLLEIGLPDIPEEELEKQL 97


>gi|50753027|ref|XP_413838.1| PREDICTED: similar to Chromosome 15 open reading frame 40 [Gallus
           gallus]
          Length = 158

 Score = 85.2 bits (210), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 22/97 (22%), Positives = 39/97 (40%), Gaps = 9/97 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             V VR  P ++ S +  +             + + A P +G+AN  +   L+K L + K
Sbjct: 69  VRVSVRAKPGSRCSAVTDVTAEAVG-------VAIAAPPSEGEANAELCRYLSKVLGVKK 121

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSL 96
           S + +     S  K++ I       E+ E L+   S 
Sbjct: 122 SDVILEKGGKSRDKVVKILVSVTPDEVLEKLKKEAST 158


>gi|67005257|gb|AAY62183.1| Conserved hypothetical protein [Rickettsia felis URRWXCal2]
          Length = 110

 Score = 84.8 bits (209), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 51/89 (57%), Gaps = 3/89 (3%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +++ PN+K++ I+   I  +     ++K+ + ATP++GKAN+ ++  LAK+  LS+
Sbjct: 19  VLLNLKVKPNSKQNLISDFVIINNIP---YLKLSIKATPEQGKANEEIINYLAKEWKLSR 75

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
             + ++   ++ LK I I    ++   L+
Sbjct: 76  KDIEIIKGHTNSLKTILIKNIDEDYLNLI 104


>gi|163795292|ref|ZP_02189259.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199]
 gi|159179278|gb|EDP63809.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199]
          Length = 112

 Score = 84.8 bits (209), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 3/89 (3%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            V++ P A    I  +    D +    +++ VTA P+ G+ANKA+ A+LAK+  + KSS+
Sbjct: 17  AVKVTPKAAADRIRGVVQ--DEAGVAWLQVSVTAVPEDGRANKAVTALLAKRWRVPKSSI 74

Query: 65  RMLSKQSSPLKIIYI-DKDCKEITELLQN 92
            ++   +   K++ +   D   +T  LQ 
Sbjct: 75  EIVQGTTERRKVLLVRSDDTAALTARLQT 103


>gi|78357213|ref|YP_388662.1| hypothetical protein Dde_2170 [Desulfovibrio desulfuricans subsp.
          desulfuricans str. G20]
 gi|78219618|gb|ABB38967.1| conserved hypothetical protein [Desulfovibrio desulfuricans
          subsp. desulfuricans str. G20]
          Length = 118

 Score = 84.8 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 7/81 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + V   P AK SGIA L           ++I+++A     KANK ++  +A+   + ++
Sbjct: 16 RLKVWAQPGAKHSGIAGL-------YDGRVRIRLSAPAVDNKANKELIRFVAQLCGVKQN 68

Query: 63 SLRMLSKQSSPLKIIYIDKDC 83
           +R+ S  SS  K++ I++D 
Sbjct: 69 RVRLESGVSSRKKVLLIERDT 89


>gi|162022116|ref|YP_247348.2| hypothetical protein RF_1332 [Rickettsia felis URRWXCal2]
 gi|126253831|sp|Q4UJV6|Y1332_RICFE RecName: Full=UPF0235 protein RF_1332
          Length = 105

 Score = 84.8 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 51/89 (57%), Gaps = 3/89 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++ PN+K++ I+   I  +     ++K+ + ATP++GKAN+ ++  LAK+  LS+
Sbjct: 14 VLLNLKVKPNSKQNLISDFVIINNIP---YLKLSIKATPEQGKANEEIINYLAKEWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
            + ++   ++ LK I I    ++   L+
Sbjct: 71 KDIEIIKGHTNSLKTILIKNIDEDYLNLI 99


>gi|258593710|emb|CBE70051.1| conserved hypothetical protein [NC10 bacterium 'Dutch sediment']
          Length = 102

 Score = 84.8 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 7/87 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           +  VRL P A +  I       D      ++++V A P +G+AN A L +LAK L L  
Sbjct: 11 ASFRVRLQPKASREAI-------DGEVDGVLRLRVNAPPVEGQANDACLRLLAKTLDLPI 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITE 88
          S L +++ Q + +K I +     ++  
Sbjct: 64 SRLGIVAGQQARVKTIRVTDASADLLR 90


>gi|182680478|ref|YP_001834624.1| hypothetical protein Bind_3579 [Beijerinckia indica subsp. indica
          ATCC 9039]
 gi|182636361|gb|ACB97135.1| protein of unknown function DUF167 [Beijerinckia indica subsp.
          indica ATCC 9039]
          Length = 87

 Score = 84.8 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 3/81 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +R+ P A  + I  +E   D S    ++I VT  P+ GKAN+ +L +LAK L +  
Sbjct: 10 VEIAIRVTPKASANRIV-VETAPDGS--ERLRIYVTTVPENGKANRDVLRLLAKHLDIPP 66

Query: 62 SSLRMLSKQSSPLKIIYIDKD 82
          SSL ++   +   KI+   +D
Sbjct: 67 SSLEIIRGSTGRDKIVRFSRD 87


>gi|313673537|ref|YP_004051648.1| hypothetical protein Calni_1577 [Calditerrivibrio nitroreducens
          DSM 19672]
 gi|312940293|gb|ADR19485.1| protein of unknown function DUF167 [Calditerrivibrio
          nitroreducens DSM 19672]
          Length = 86

 Score = 84.8 bits (209), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 26/93 (27%), Positives = 49/93 (52%), Gaps = 9/93 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P AKK+         D      +KIK+ + P  GKAN+ +++ +++ L LSK
Sbjct: 1  MRIKIYVQPGAKKTA-------YDGEFNGCIKIKIKSPPTDGKANEELISFISQSLNLSK 53

Query: 62 SSLRMLSKQSSPLKIIYIDK--DCKEITELLQN 92
            + ++S + S  KII + +  D + I E L++
Sbjct: 54 KEVGIISGEKSRYKIIEVPENYDMEFIKEKLKD 86


>gi|291278644|ref|YP_003495479.1| hypothetical protein DEFDS_0212 [Deferribacter desulfuricans
          SSM1]
 gi|290753346|dbj|BAI79723.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1]
          Length = 82

 Score = 84.4 bits (208), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 7/77 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +   + P AKK+ +A              KIKV + P  G ANK ++  LAKKL +SK
Sbjct: 3  VRITFYIQPGAKKTEVAG-------EFNNMTKIKVASPPVDGAANKELIKFLAKKLGVSK 55

Query: 62 SSLRMLSKQSSPLKIIY 78
          SS++++S + S +K + 
Sbjct: 56 SSVKIVSGEKSRIKTVE 72


>gi|189485621|ref|YP_001956562.1| hypothetical protein TGRD_618 [uncultured Termite group 1
          bacterium phylotype Rs-D17]
 gi|254806541|sp|B1GYK8|Y618_UNCTG RecName: Full=UPF0235 protein TGRD_618
 gi|170287580|dbj|BAG14101.1| conserved hypothetical protein [uncultured Termite group 1
          bacterium phylotype Rs-D17]
          Length = 87

 Score = 84.4 bits (208), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+IPN+K++ +        +     +++K+TA   +G+AN+ +   L+    + +
Sbjct: 1  MIIKVRVIPNSKRNEVV-------SRVGSILRVKITAPAIEGRANEELCDFLSDFFDVKR 53

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
          S + +   +    K I +     +E+ E+L   
Sbjct: 54 SMIFLRKGERGREKTIEVLGRLEEELNEVLDTI 86


>gi|126273662|ref|XP_001365632.1| PREDICTED: similar to chromosome 15 open reading frame 40,
           [Monodelphis domestica]
          Length = 146

 Score = 84.4 bits (208), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 9/94 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K++ I  +           + + + A P +G+AN  +   L+K L L KS 
Sbjct: 58  IAIHAKPGSKQNAITDVTTEN-------VSVAIAAPPSEGEANTELCRYLSKVLELRKSD 110

Query: 64  LRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           + +     S  K++ I      +EI E L+    
Sbjct: 111 VILDKGGKSREKVVKILASTTPEEILEKLKRQAE 144


>gi|239906817|ref|YP_002953558.1| hypothetical protein DMR_21810 [Desulfovibrio magneticus RS-1]
 gi|239796683|dbj|BAH75672.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 122

 Score = 84.4 bits (208), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 17/81 (20%), Positives = 39/81 (48%), Gaps = 7/81 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V + P   K  +A L           +++++ A   +G+AN A+   +A+ L +    
Sbjct: 37  LRVAVTPGGAKDALAGL-------AEDRLRVRLRAKAVEGQANAALTDFVARCLGVKPRQ 89

Query: 64  LRMLSKQSSPLKIIYIDKDCK 84
           +R++S + S  K + I+ + +
Sbjct: 90  VRIISGEKSRKKTLRIETESE 110


>gi|317050344|ref|YP_004111460.1| hypothetical protein Selin_0146 [Desulfurispirillum indicum S5]
 gi|316945428|gb|ADU64904.1| protein of unknown function DUF167 [Desulfurispirillum indicum
          S5]
          Length = 92

 Score = 84.4 bits (208), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 16/87 (18%), Positives = 41/87 (47%), Gaps = 7/87 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +++ P + ++ +              +K+++ A P  G AN+ ++   ++ L++ KS 
Sbjct: 11 LQLKVQPRSSRTELL-------READGQLKLRLNAPPVDGAANQQVIEFFSRLLSIPKSR 63

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + ++  Q S  K+I +      I E +
Sbjct: 64 ISIVQGQQSRRKVIALLGVEPAILEKV 90


>gi|94971231|ref|YP_593279.1| hypothetical protein Acid345_4205 [Candidatus Koribacter
          versatilis Ellin345]
 gi|166227235|sp|Q1IIU5|Y4205_ACIBL RecName: Full=UPF0235 protein Acid345_4205
 gi|94553281|gb|ABF43205.1| protein of unknown function DUF167 [Candidatus Koribacter
          versatilis Ellin345]
          Length = 96

 Score = 84.4 bits (208), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 8/91 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           +  VRL P AKK+ I              +K+ VT  P  G+AN+A++  +A  L +++
Sbjct: 11 VSFAVRLQPKAKKTAIIG-------ELNGALKLGVTDPPIDGRANEALIRFVAGLLKVTR 63

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
          SS+ + + +SS  K+I I+    +++   L+
Sbjct: 64 SSVTIAAGESSRNKVIRIEGVTAEQVRFRLK 94


>gi|220916321|ref|YP_002491625.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans
          2CP-1]
 gi|254799985|sp|B8JFX1|Y1215_ANAD2 RecName: Full=UPF0235 protein A2cp1_1215
 gi|219954175|gb|ACL64559.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans
          2CP-1]
          Length = 95

 Score = 84.4 bits (208), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A ++                +KI++ A P  G AN A++  LA  L + +
Sbjct: 11 AVLEILVQPRASRTRAVG-------EHDGRLKIQLAAPPVDGAANAALVEFLAAALGVRR 63

Query: 62 SSLRMLSKQSSPLKIIYI 79
          + + +L  ++   K + +
Sbjct: 64 ADVELLRGETGRRKTVRV 81


>gi|15893224|ref|NP_360938.1| hypothetical protein RC1301 [Rickettsia conorii str. Malish 7]
 gi|34581107|ref|ZP_00142587.1| hypothetical protein [Rickettsia sibirica 246]
 gi|229587206|ref|YP_002845707.1| hypothetical protein RAF_ORF1191 [Rickettsia africae ESF-5]
 gi|29839741|sp|Q92G24|Y1301_RICCN RecName: Full=UPF0235 protein RC1301
 gi|259645748|sp|C3PLX4|Y1191_RICAE RecName: Full=UPF0235 protein RAF_ORF1191
 gi|15620440|gb|AAL03839.1| unknown [Rickettsia conorii str. Malish 7]
 gi|28262492|gb|EAA25996.1| unknown [Rickettsia sibirica 246]
 gi|228022256|gb|ACP53964.1| Unknown [Rickettsia africae ESF-5]
          Length = 105

 Score = 84.4 bits (208), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 51/89 (57%), Gaps = 3/89 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +  ++ PN+K++ I++  I  +     ++K+ + A P++GKAN+ ++  LAK+  LS+
Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIP---YLKLSIKAIPEQGKANEEIINYLAKEWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S++ ++   +  LK I I    ++   L+
Sbjct: 71 SNIEIIKGHTHSLKTILIKNINEDYLNLI 99


>gi|115495117|ref|NP_001068854.1| hypothetical protein LOC509050 [Bos taurus]
 gi|122140809|sp|Q3ZBP8|CO040_BOVIN RecName: Full=UPF0235 protein C15orf40 homolog
 gi|73587092|gb|AAI03180.1| Chromosome 15 open reading frame 40 ortholog [Bos taurus]
 gi|296475539|gb|DAA17654.1| hypothetical protein LOC509050 [Bos taurus]
          Length = 126

 Score = 84.4 bits (208), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 9/94 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++ +   P +K++ +  +           + + + A P +G+AN  +   L+K L L K
Sbjct: 36  VSIAIHAKPGSKQNAVTDVTT-------EAVSVAIAAPPTEGEANAELCRYLSKVLELRK 88

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93
           S + +     S  K++ +      +EI E L+  
Sbjct: 89  SDVVLDKGGKSREKVVKLLASTPPEEILEKLKKQ 122


>gi|146337438|ref|YP_001202486.1| hypothetical protein BRADO0278 [Bradyrhizobium sp. ORS278]
 gi|146190244|emb|CAL74236.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278]
          Length = 109

 Score = 84.4 bits (208), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V +R+ P   +  I  +E   D      +K++V A    G+AN+A+  +LAK + ++K +
Sbjct: 13 VALRVTPRGGRDAIDGIETLSDG--RSVLKVRVRAIADGGEANRAVTELLAKAIGVTKKA 70

Query: 64 LRMLSKQSSPLKIIYIDK 81
          +R+ S  +S LK + ID 
Sbjct: 71 VRITSGTTSRLKQVAIDG 88


>gi|299133137|ref|ZP_07026332.1| protein of unknown function DUF167 [Afipia sp. 1NLS2]
 gi|298593274|gb|EFI53474.1| protein of unknown function DUF167 [Afipia sp. 1NLS2]
          Length = 108

 Score = 84.4 bits (208), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 25/94 (26%), Positives = 52/94 (55%), Gaps = 2/94 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR+ P   +  +  +E   +    + ++++V A  + G+AN+A+  + A+ L + KS 
Sbjct: 14  VAVRVTPRGGRDAVDGIEELANGKSVVKVRVRVAA--EGGEANRAVTELFAELLRVPKSK 71

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97
           +R+ S  +S +K + I+ D K++ E L+   + T
Sbjct: 72  VRVASGVTSRVKQLTIEGDPKQLGEALKAATAAT 105


>gi|218782636|ref|YP_002433954.1| hypothetical protein Dalk_4809 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218764020|gb|ACL06486.1| protein of unknown function DUF167 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 106

 Score = 84.0 bits (207), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 11/97 (11%)

Query: 6   VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
           ++++P +  + +  L+          +KIK+ A P  G ANK  +  LAK L L KSS++
Sbjct: 17  IKVLPRSSVNAVVGLQ-------DGALKIKLKAPPVGGAANKMCIQFLAKTLKLPKSSIK 69

Query: 66  MLSKQSSPLKIIYI----DKDCKEITELLQNNDSLTL 98
           +LS ++   K I +    +   KE+  L +  + L +
Sbjct: 70  ILSGETGRSKQIMVRPREEGSKKELARLRKIIEELAV 106


>gi|296269396|ref|YP_003652028.1| hypothetical protein Tbis_1417 [Thermobispora bispora DSM 43833]
 gi|296092183|gb|ADG88135.1| protein of unknown function DUF167 [Thermobispora bispora DSM
          43833]
          Length = 89

 Score = 84.0 bits (207), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 18/88 (20%), Positives = 39/88 (44%), Gaps = 6/88 (6%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V +R+ P A +  +        T     + ++V A    G+A +A L  +A    + +
Sbjct: 1  MRVSIRVRPGASREYVGG------TYGDGAIVVRVCAPAVDGRATEAALKAVASAFGVRR 54

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89
            +R++S  ++  K++ I  D +E+   
Sbjct: 55 GDVRLVSGATARDKVVEIAGDEEELARR 82


>gi|194337426|ref|YP_002019220.1| protein of unknown function DUF167 [Pelodictyon
          phaeoclathratiforme BU-1]
 gi|226701435|sp|B4SES7|Y2415_PELPB RecName: Full=UPF0235 protein Ppha_2415
 gi|194309903|gb|ACF44603.1| protein of unknown function DUF167 [Pelodictyon
          phaeoclathratiforme BU-1]
          Length = 97

 Score = 84.0 bits (207), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 19/85 (22%), Positives = 36/85 (42%), Gaps = 7/85 (8%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          V+  P + KS +  L           +K+ + A P    AN+    + +K   +  S + 
Sbjct: 15 VKAQPRSSKSRVCGL-------YNGGLKVNLKAAPVDDAANRECCELFSKLFRIPPSRVH 67

Query: 66 MLSKQSSPLKIIYIDKDCKEITELL 90
          +LS QSS  K + ++    +   L+
Sbjct: 68 ILSGQSSRTKTVMVEGISSKAAALV 92


>gi|260800225|ref|XP_002595035.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae]
 gi|229280275|gb|EEN51046.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae]
          Length = 98

 Score = 84.0 bits (207), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 22/92 (23%), Positives = 39/92 (42%), Gaps = 7/92 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V +   P AK + I  +             +++TA P +G+AN  +   LA  L + KS+
Sbjct: 12 VAIHAKPGAKANAITDVTTETVG-------VQITAPPMEGEANAELCRYLAGVLEVKKSA 64

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
          + +     S  K + +D     I  +LQ   +
Sbjct: 65 VSLERGAKSREKTVRVDTPGTTIDAVLQRIKA 96


>gi|156548436|ref|XP_001604898.1| PREDICTED: similar to conserved hypothetical protein [Nasonia
           vitripennis]
          Length = 122

 Score = 84.0 bits (207), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 8/95 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V+  P AK++ I                I ++A PQ+G+AN  ++  LA  L + K
Sbjct: 33  VTIKVQAKPGAKQNNITDFSEETVG-------IAISAPPQEGEANAELVKYLASILNVRK 85

Query: 62  SSLRMLSKQSSPLKIIYIDKDC-KEITELLQNNDS 95
           S + +     S  K + +     +++TE L+   +
Sbjct: 86  SDVTLDRGSRSRQKKVIVTGSSVEKVTEKLKAEAA 120


>gi|114658629|ref|XP_001148978.1| PREDICTED: similar to Chromosome 15 open reading frame 40 [Pan
           troglodytes]
          Length = 283

 Score = 84.0 bits (207), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 193 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 245

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEI 86
           S + +     S  K++ +      +EI
Sbjct: 246 SDVVLDKGGKSREKVVKLLASTTPEEI 272


>gi|258545474|ref|ZP_05705708.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826]
 gi|258519174|gb|EEV88033.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826]
          Length = 97

 Score = 84.0 bits (207), Expect = 6e-15,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++   A +     +            KI +TA P+ GKANK + A LAK  A++K +
Sbjct: 13 LAVKITARASRDQCQGIH-------DERYKIAITAPPEDGKANKHLTAWLAKTFAVAKKN 65

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + + S   SPLK + I   
Sbjct: 66 VALQSGAFSPLKTLRITAP 84


>gi|297617010|ref|YP_003702169.1| hypothetical protein Slip_0824 [Syntrophothermus lipocalidus DSM
          12680]
 gi|297144847|gb|ADI01604.1| protein of unknown function DUF167 [Syntrophothermus lipocalidus
          DSM 12680]
          Length = 96

 Score = 84.0 bits (207), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 22/83 (26%), Positives = 45/83 (54%), Gaps = 7/83 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
              VR++P A K+ +              +KIK+TA P +G+AN+A+++ LAK   +++
Sbjct: 10 IRFEVRVLPRASKNEVIG-------EVEGAVKIKLTAAPLEGEANQALISFLAKISGVAR 62

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84
           ++ ++  ++S  K++ I    K
Sbjct: 63 KNVTIIKGETSRHKLVEITGIDK 85


>gi|85713678|ref|ZP_01044668.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A]
 gi|85699582|gb|EAQ37449.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A]
          Length = 106

 Score = 83.6 bits (206), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 24/93 (25%), Positives = 54/93 (58%), Gaps = 2/93 (2%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            + +R+ P   + GI  +E+  D    + ++++  A  + G+AN+A++A+LAK L + K 
Sbjct: 12  RIALRVTPRGGRDGIDGIEMLADGRPVVKVRVRAVA--EGGEANRAVMAVLAKALGVRKV 69

Query: 63  SLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
            +R+L+  +S LK + ++ D  ++ + L+   +
Sbjct: 70  DVRILAGATSRLKQVAVEGDPVQLGDALRALTA 102


>gi|325108674|ref|YP_004269742.1| hypothetical protein Plabr_2117 [Planctomyces brasiliensis DSM
           5305]
 gi|324968942|gb|ADY59720.1| UPF0235 protein yggU [Planctomyces brasiliensis DSM 5305]
          Length = 109

 Score = 83.6 bits (206), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 27/93 (29%), Positives = 48/93 (51%), Gaps = 8/93 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +R+ P AK++ +  +           +K+ VT   ++GKAN+ +L +LAK L L K
Sbjct: 20  VLLPLRVTPGAKRNAVGGVH-------DGALKVAVTQIAERGKANQQVLKILAKALGLKK 72

Query: 62  SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
           S L ++S ++S  K I   D    E+ + + N 
Sbjct: 73  SQLTLVSGETSRNKRIACRDVSAAELLQRISNL 105


>gi|300866080|ref|ZP_07110810.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335941|emb|CBN55968.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 78

 Score = 83.6 bits (206), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 7/79 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + +++ PN+K+  I               KI + + P  GKAN+ ++ +LAKK  + 
Sbjct: 1  MAILTIKVKPNSKQQNI-------QQEPDGSFKISLKSPPIDGKANEELIKLLAKKFGIP 53

Query: 61 KSSLRMLSKQSSPLKIIYI 79
          KS + + S  SS  K++ +
Sbjct: 54 KSQITIKSGLSSKNKLVEL 72


>gi|114798070|ref|YP_761927.1| hypothetical protein HNE_3254 [Hyphomonas neptunium ATCC 15444]
 gi|114738244|gb|ABI76369.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444]
          Length = 99

 Score = 83.6 bits (206), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 2/82 (2%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           +  R+ P A    +       D +    +K++V A P +G AN A+ A++AK L + KS
Sbjct: 4  RLTARVQPKAASDRLDGW--AADEAGRPFLKLRVRALPAEGAANAAVEALVAKALGVPKS 61

Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84
          ++R+++   + LK + I+    
Sbjct: 62 AVRVVTGGKNRLKSLEIEGPPD 83


>gi|323697733|ref|ZP_08109645.1| protein of unknown function DUF167 [Desulfovibrio sp. ND132]
 gi|323457665|gb|EGB13530.1| protein of unknown function DUF167 [Desulfovibrio desulfuricans
          ND132]
          Length = 98

 Score = 83.6 bits (206), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V   P A+K  +A             +K+++ A     KANK ++A +A+ L L KS 
Sbjct: 12 LDVWAQPGARKDEVAG-------EYQGCLKVRLRAPAVDNKANKGLVAYIARLLQLKKSQ 64

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          + ++S  +S  K + ++   +
Sbjct: 65 VEIVSGHASRRKHLALNTAGE 85


>gi|209883467|ref|YP_002287324.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5]
 gi|226706162|sp|B6JAU6|Y4310_OLICO RecName: Full=UPF0235 protein OCAR_4310
 gi|209871663|gb|ACI91459.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5]
          Length = 106

 Score = 83.6 bits (206), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 2/93 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR+ P   +  +  +E+  +    + ++++V A  + G+AN+A+  + A  L + KS 
Sbjct: 14  VAVRVTPRGGRDAVDGIEMLANGKSVVKVRVRVAA--EGGEANRAVTELFAGLLRVPKSK 71

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96
           +++ S  +S +K I I+ D K++ E L+   S+
Sbjct: 72  VKVASGVTSRIKQIAIEGDPKQLGEALKAATSI 104


>gi|317153487|ref|YP_004121535.1| hypothetical protein Daes_1777 [Desulfovibrio aespoeensis Aspo-2]
 gi|316943738|gb|ADU62789.1| protein of unknown function DUF167 [Desulfovibrio aespoeensis
          Aspo-2]
          Length = 102

 Score = 83.6 bits (206), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 22/82 (26%), Positives = 43/82 (52%), Gaps = 7/82 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + V + P A+KS +A +           +KI++ A     KANKA++A +A  L + KS
Sbjct: 15 RIAVWVQPGARKSEVAGV-------YQQCVKIRLCAPAVDNKANKALVAFVASVLNVKKS 67

Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84
           + + S Q++  K++ ++   +
Sbjct: 68 QVVIESGQTTRKKLLALNTVAE 89


>gi|238899239|ref|YP_002924922.1| hypothetical protein HDEF_2231 [Candidatus Hamiltonella defensa 5AT
           (Acyrthosiphon pisum)]
 gi|229467000|gb|ACQ68774.1| conserved hypothetical protein [Candidatus Hamiltonella defensa 5AT
           (Acyrthosiphon pisum)]
          Length = 103

 Score = 83.6 bits (206), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 25/96 (26%), Positives = 44/96 (45%), Gaps = 10/96 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V + P ++   I  +           +KI +TA P   KANK ++  L+KK  ++KS 
Sbjct: 15  LNVYIQPQSRFDNIVGIHH-------NEIKINLTALPVDNKANKHLIHFLSKKCKVAKSH 67

Query: 64  LRMLSKQSSPLKIIYI---DKDCKEITELLQNNDSL 96
           + +   Q S  K + I        EI ++L + + L
Sbjct: 68  IMIEKGQLSRHKQVRIVEPKVIPVEIKQILADRNKL 103


>gi|258405784|ref|YP_003198526.1| hypothetical protein Dret_1664 [Desulfohalobium retbaense DSM
          5692]
 gi|257798011|gb|ACV68948.1| protein of unknown function DUF167 [Desulfohalobium retbaense DSM
          5692]
          Length = 105

 Score = 83.6 bits (206), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 7/82 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + + P AKK+ +  +           +KIK+ A P   KAN+A+   +A +L L  +
Sbjct: 23 RLRLWVQPKAKKTAVVGV-------YQDCLKIKLQAPPVDNKANQAVCRFVAARLGLRPA 75

Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84
           + + S Q+S  K I I    +
Sbjct: 76 DVDLGSGQASRKKTIVIASRDE 97


>gi|153003993|ref|YP_001378318.1| hypothetical protein Anae109_1126 [Anaeromyxobacter sp. Fw109-5]
 gi|166977708|sp|A7H9D8|Y1126_ANADF RecName: Full=UPF0235 protein Anae109_1126
 gi|152027566|gb|ABS25334.1| protein of unknown function DUF167 [Anaeromyxobacter sp. Fw109-5]
          Length = 95

 Score = 83.6 bits (206), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 15/79 (18%), Positives = 35/79 (44%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A ++ +              +KI++ A P  G AN A++  LA+ L + K
Sbjct: 11 VVLELLVQPRASRTRVLG-------EHGGRLKIQLAAPPVDGAANAALVEFLAEALEVRK 63

Query: 62 SSLRMLSKQSSPLKIIYID 80
            + ++  ++   K + + 
Sbjct: 64 QDVVLVRGETGRRKAVRVT 82


>gi|170290086|ref|YP_001736902.1| hypothetical protein Kcr_0466 [Candidatus Korarchaeum cryptofilum
          OPF8]
 gi|170174166|gb|ACB07219.1| protein of unknown function DUF167 [Candidatus Korarchaeum
          cryptofilum OPF8]
          Length = 72

 Score = 83.6 bits (206), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 25/79 (31%), Positives = 46/79 (58%), Gaps = 8/79 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V V ++PNA ++G+          +  H+++ V A P KGKAN+A++ +LA+   + K
Sbjct: 1  MRVSVLVVPNAGRNGVV--------EEGDHLRVYVRAPPVKGKANEAVIEVLAEFFGVKK 52

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S +R++S + S  K++ I 
Sbjct: 53 SDIRIISGERSREKVVEIR 71


>gi|165933862|ref|YP_001650651.1| hypothetical protein RrIowa_1526 [Rickettsia rickettsii str.
          Iowa]
 gi|189038764|sp|B0BVI5|Y1526_RICRO RecName: Full=UPF0235 protein RrIowa_1526
 gi|165908949|gb|ABY73245.1| hypothetical cytosolic protein [Rickettsia rickettsii str. Iowa]
          Length = 105

 Score = 83.6 bits (206), Expect = 9e-15,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +  ++ PN+K++ I++  I  +     ++K+ + A P++GKAN  ++  LAK+  LS+
Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIQ---YLKLSIKAIPEQGKANAEIINYLAKEWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S++ ++   +  LK I I    ++   L+
Sbjct: 71 SNIEIIKGHTHSLKTILIKNINEDYLNLI 99


>gi|284039659|ref|YP_003389589.1| hypothetical protein Slin_4812 [Spirosoma linguale DSM 74]
 gi|283818952|gb|ADB40790.1| protein of unknown function DUF167 [Spirosoma linguale DSM 74]
          Length = 86

 Score = 83.2 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 7/89 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + ++  P +K   +              +  K+ A  Q GKAN  ++  LAK+L + K
Sbjct: 1  MTLHLKAKPGSKIDQL-------FYDAAGQLNAKIRAPAQDGKANAYLIEFLAKQLGIPK 53

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S + +++  ++P K I +D   + +T+ L
Sbjct: 54 SGVSIVAGFTNPHKRIEVDVPEEVLTDFL 82


>gi|13385576|ref|NP_080353.1| hypothetical protein LOC67290 [Mus musculus]
 gi|29839616|sp|Q9CRC3|CO040_MOUSE RecName: Full=UPF0235 protein C15orf40 homolog
 gi|12851848|dbj|BAB29184.1| unnamed protein product [Mus musculus]
 gi|12857526|dbj|BAB31031.1| unnamed protein product [Mus musculus]
 gi|12859380|dbj|BAB31634.1| unnamed protein product [Mus musculus]
          Length = 126

 Score = 83.2 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P ++++ +  L             + + A P +G+AN  +   L+K L L K
Sbjct: 36  VTIAIHAKPGSRQNAVTDLSTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 89  SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTEAE 124


>gi|189219304|ref|YP_001939945.1| hypothetical protein Minf_1293 [Methylacidiphilum infernorum V4]
 gi|189186162|gb|ACD83347.1| Uncharacterized conserved protein [Methylacidiphilum infernorum
          V4]
          Length = 97

 Score = 83.2 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 7/80 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++  NAKK+ I         S    +KI+++A P +GKAN A+L+ L+ +L + K
Sbjct: 4  ARLFVKVQANAKKTEICG-------SYADALKIRLSAPPVEGKANDALLSFLSLRLCVPK 56

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
            +R+   + +  K + I+ 
Sbjct: 57 RLIRIEKGEKNSKKTVVIEG 76


>gi|91787002|ref|YP_547954.1| hypothetical protein Bpro_1104 [Polaromonas sp. JS666]
 gi|91696227|gb|ABE43056.1| protein of unknown function DUF167 [Polaromonas sp. JS666]
          Length = 113

 Score = 83.2 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 18/76 (23%), Positives = 39/76 (51%), Gaps = 3/76 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V V ++PNA K+    L         + +++++   P  GKAN+A++  LA  L + +
Sbjct: 19 VLVDVHVVPNAAKTQPVGLHGE---PGQLALRLRLQVPPVDGKANQALIRWLAHSLGVPQ 75

Query: 62 SSLRMLSKQSSPLKII 77
          +++  +  ++S  K +
Sbjct: 76 NAITPVRGETSRRKQL 91


>gi|170749303|ref|YP_001755563.1| hypothetical protein Mrad2831_2896 [Methylobacterium
          radiotolerans JCM 2831]
 gi|170655825|gb|ACB24880.1| protein of unknown function DUF167 [Methylobacterium
          radiotolerans JCM 2831]
          Length = 93

 Score = 83.2 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 2/83 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VRL P   +         +      ++K +V A P +G AN A++ ++AK L + +
Sbjct: 3  IRLAVRLTPRGGRDAAEGWARDEKGQP--YLKARVAAPPVEGAANAALVVLIAKALKVGR 60

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84
           S+R+++   S LKI+ ID   +
Sbjct: 61 GSVRIVTGDQSRLKILEIDGVAQ 83


>gi|241835850|ref|XP_002415074.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215509286|gb|EEC18739.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 104

 Score = 83.2 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 25/92 (27%), Positives = 38/92 (41%), Gaps = 9/92 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V   P A +S I  +             +++ A P  G+AN  ++  LAK L L KS 
Sbjct: 17  IRVHAKPGASESRITDIGTDGVG-------VQIAAPPMDGEANAELVRFLAKVLNLRKSD 69

Query: 64  LRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93
           + +     S  K++ I       EI  LLQ  
Sbjct: 70  VSLEKGSRSKDKVVMIASPATAAEILSLLQQK 101


>gi|157964988|ref|YP_001499812.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5]
 gi|157844764|gb|ABV85265.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5]
          Length = 107

 Score = 83.2 bits (205), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             +  ++ PN+K++ I++  I  +     ++K+ + A P +GKAN+ ++  LAK+  LS+
Sbjct: 16  ALLSFKVKPNSKQNLISNFVIINNIP---YLKLSIKAIPAQGKANEEIINYLAKEWKLSR 72

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
           S++ ++   +  LK I I    ++   L+
Sbjct: 73  SNIEIIKGHTHSLKTILIKNINEDYLNLI 101


>gi|67078514|ref|NP_001019920.1| hypothetical protein LOC293059 [Rattus norvegicus]
 gi|81908918|sp|Q505I4|CO040_RAT RecName: Full=UPF0235 protein C15orf40 homolog
 gi|63100368|gb|AAH94529.1| Similar to RIKEN cDNA 3110040N11 [Rattus norvegicus]
 gi|149044058|gb|EDL97440.1| rCG63322 [Rattus norvegicus]
 gi|149057383|gb|EDM08706.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus]
 gi|149057387|gb|EDM08710.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus]
          Length = 126

 Score = 82.9 bits (204), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L             + + A P +G+AN  +   L+K L L K
Sbjct: 36  VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 89  SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 124


>gi|256828220|ref|YP_003156948.1| protein of unknown function DUF167 [Desulfomicrobium baculatum
          DSM 4028]
 gi|256577396|gb|ACU88532.1| protein of unknown function DUF167 [Desulfomicrobium baculatum
          DSM 4028]
          Length = 106

 Score = 82.9 bits (204), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 7/82 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + + + P A+K+ +A L           +KI+V A     KAN A+   ++K L +  S
Sbjct: 18 RLGIWVQPGARKTEVAGLH-------GDFLKIRVQARAVDNKANSALTVFVSKILGIKAS 70

Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84
           + + S  +S  K + +D + +
Sbjct: 71 QVVIESGHASRQKNLLLDVEEE 92


>gi|242279866|ref|YP_002991995.1| hypothetical protein Desal_2400 [Desulfovibrio salexigens DSM
          2638]
 gi|242122760|gb|ACS80456.1| protein of unknown function DUF167 [Desulfovibrio salexigens DSM
          2638]
          Length = 106

 Score = 82.9 bits (204), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 7/82 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V V + P AK  GI              +++++ A     KANKA+ A +A +L L K 
Sbjct: 21 RVSVWVQPGAKNEGITG-------EYQDSVRVRINAPAVDNKANKALAAFVATRLGLKKR 73

Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84
          ++ + S  S+  K++ ++ D +
Sbjct: 74 NISIASGHSNRKKVLLVESDVE 95


>gi|157829135|ref|YP_001495377.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str.
          'Sheila Smith']
 gi|166228842|sp|A8GTZ4|Y7140_RICRS RecName: Full=UPF0235 protein A1G_07140
 gi|157801616|gb|ABV76869.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str.
          'Sheila Smith']
          Length = 105

 Score = 82.9 bits (204), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +  ++ PN+K++ I++  I  +     ++K+ + A P++GKAN  ++  LAK+  LS+
Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIQ---YLKLSIKAIPEQGKANSEIINYLAKEWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S++ ++   +  LK I I    ++   L+
Sbjct: 71 SNIEIIKGHTHSLKTILIKNINEDYLNLI 99


>gi|220909519|ref|YP_002484830.1| hypothetical protein Cyan7425_4156 [Cyanothece sp. PCC 7425]
 gi|219866130|gb|ACL46469.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7425]
          Length = 89

 Score = 82.9 bits (204), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 8/95 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + V+++P++ +  +              +K+KV A P+KGKAN A++A+LA  L + 
Sbjct: 1  MAKLKVKVVPSSSRDLVVGW-------LGEALKVKVKAPPEKGKANAAVIALLATHLGID 53

Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94
          ++ + +LS  +S  K++ I+  D  +I   L +  
Sbjct: 54 QTCIEVLSGHTSAAKVLSIEGLDQTQIRAALSDVS 88


>gi|126657163|ref|ZP_01728329.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110]
 gi|126621434|gb|EAZ92145.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110]
          Length = 75

 Score = 82.9 bits (204), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 7/80 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          +  + V++ PNAK+  I       +     H KI V + P  GKAN+ ++ +LAK   + 
Sbjct: 2  LLKLQVKVKPNAKQQKI-------EEMADNHFKIAVKSPPTDGKANQELITLLAKHFNVP 54

Query: 61 KSSLRMLSKQSSPLKIIYID 80
          KS + + S  SS  K++ ID
Sbjct: 55 KSHILIKSGVSSRNKLVEID 74


>gi|239948125|ref|ZP_04699878.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
          scapularis]
 gi|239922401|gb|EER22425.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes
          scapularis]
          Length = 92

 Score = 82.9 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 20/80 (25%), Positives = 45/80 (56%), Gaps = 3/80 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++ PN+ ++ I+      +     ++K+ + A P++GKAN+ ++  LAK+  LS+
Sbjct: 14 ALLSLKVKPNSNRNLISDFITINNIP---YLKLSIKAVPEQGKANEEIINYLAKEWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
          S++ ++   +  LK I I  
Sbjct: 71 SNIEIIKGHTHSLKTILIKN 90


>gi|146276431|ref|YP_001166590.1| hypothetical protein Rsph17025_0379 [Rhodobacter sphaeroides ATCC
          17025]
 gi|145554672|gb|ABP69285.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC
          17025]
          Length = 84

 Score = 82.9 bits (204), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 8/76 (10%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P A +  I              +++ VT  P+ GKAN+A+  +LAK L ++KS L
Sbjct: 17 AVRVTPRASRERIE--------VQEGTVRVHVTCVPEDGKANRAVTEVLAKALGVAKSRL 68

Query: 65 RMLSKQSSPLKIIYID 80
           ++   +   K   +D
Sbjct: 69 TLVRGATGRDKTFRLD 84


>gi|115380361|ref|ZP_01467361.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
 gi|115362627|gb|EAU61862.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
          Length = 83

 Score = 82.5 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 17/88 (19%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 8  LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
          + P A ++ +              +K+++ A P  G+AN A++  LAK+L L +  + ++
Sbjct: 3  VQPRASRTRVVG-------EHDGMLKLQLAAPPVDGEANAALVEFLAKRLGLPRRQVTLV 55

Query: 68 SKQSSPLKIIYIDKDCKEITELLQNNDS 95
          +  ++  K +++        E + +  S
Sbjct: 56 AGDAARRKRVFLAGVDAARVEAVMSQAS 83


>gi|186683679|ref|YP_001866875.1| hypothetical protein Npun_R3528 [Nostoc punctiforme PCC 73102]
 gi|226703856|sp|B2J1T0|Y3528_NOSP7 RecName: Full=UPF0235 protein Npun_R3528
 gi|186466131|gb|ACC81932.1| protein of unknown function DUF167 [Nostoc punctiforme PCC 73102]
          Length = 75

 Score = 82.5 bits (203), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 7/78 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V++ PN+K+  I       +      + + + + P  GKAN+ ++ +LAKK  ++KS +
Sbjct: 4  KVKVKPNSKQQKI-------EEQPDGSLTVYLKSPPVDGKANEELIKLLAKKFDVAKSDI 56

Query: 65 RMLSKQSSPLKIIYIDKD 82
          R+ S  SS  K+I ID+D
Sbjct: 57 RIKSGLSSRQKLIEIDRD 74


>gi|332025796|gb|EGI65953.1| UPF0235 protein C15orf40-like protein [Acromyrmex echinatior]
          Length = 139

 Score = 82.1 bits (202), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 16/91 (17%), Positives = 37/91 (40%), Gaps = 8/91 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + ++  P AK + +  +             I ++A P +G+AN  ++  LA    + KS 
Sbjct: 53  IKIQAKPGAKCNNVTDISDEAIG-------IAISAPPTEGEANAELVKYLASIFGVRKSD 105

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           + +     S  K++ +      ++   L+  
Sbjct: 106 VSLDRGSRSRQKVVIVSGISTDQVLTKLKGE 136


>gi|325294222|ref|YP_004280736.1| yggU [Desulfurobacterium thermolithotrophum DSM 11699]
 gi|325064670|gb|ADY72677.1| UPF0235 protein yggU [Desulfurobacterium thermolithotrophum DSM
          11699]
          Length = 101

 Score = 82.1 bits (202), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 45/78 (57%), Gaps = 7/78 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++ P A ++ I  +E          +KIKVT  P+ GKAN+ ++ +L+K L + K  
Sbjct: 13 IEVKVQPKASRNKIEKVE-------EGRLKIKVTVPPEGGKANQKIIELLSKALKVPKRD 65

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + ++  ++S +K++ I+ 
Sbjct: 66 IDIVKGETSRIKVVRIEG 83


>gi|20381446|gb|AAH27500.1| RIKEN cDNA 3110040N11 gene [Mus musculus]
 gi|74206730|dbj|BAE41614.1| unnamed protein product [Mus musculus]
 gi|148674975|gb|EDL06922.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus]
 gi|148674977|gb|EDL06924.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus]
          Length = 126

 Score = 82.1 bits (202), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P ++++ +  L             + + A P +G+AN  +   L+K L L K
Sbjct: 36  VTIAIHAKPGSRQNAVTDLSTEAVG-------VAIAAPPSQGEANAELCRYLSKVLDLRK 88

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 89  SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTEAE 124


>gi|322792209|gb|EFZ16226.1| hypothetical protein SINV_80163 [Solenopsis invicta]
          Length = 122

 Score = 82.1 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 37/91 (40%), Gaps = 8/91 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + ++  P AK + I  +             I ++A P +G+AN  ++  LA    + KS 
Sbjct: 36  IKIQAKPGAKCNNITDISDEAVG-------IAISAPPTEGEANAELVKYLASTFGVRKSD 88

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           + +     S  K++ +      ++   L+  
Sbjct: 89  VTLDRGSRSRQKVVVVSGITTDQVLTKLKGE 119


>gi|218885259|ref|YP_002434580.1| hypothetical protein DvMF_0151 [Desulfovibrio vulgaris str.
          'Miyazaki F']
 gi|218756213|gb|ACL07112.1| protein of unknown function DUF167 [Desulfovibrio vulgaris str.
          'Miyazaki F']
          Length = 106

 Score = 82.1 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 22/81 (27%), Positives = 43/81 (53%), Gaps = 7/81 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V+VR +P A+KS           +    +K+++ A     KANKA+   +A  L + ++ 
Sbjct: 22 VLVRAVPGARKSACEG-------TADGRLKVRLAAPAVDNKANKALEEFVASALGMRRNR 74

Query: 64 LRMLSKQSSPLKIIYIDKDCK 84
          +R++S  +S LK + ++ D +
Sbjct: 75 VRLVSGHTSRLKKLIVESDVE 95


>gi|288931799|ref|YP_003435859.1| hypothetical protein Ferp_1433 [Ferroglobus placidus DSM 10642]
 gi|288894047|gb|ADC65584.1| protein of unknown function DUF167 [Ferroglobus placidus DSM
          10642]
          Length = 94

 Score = 82.1 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 8/88 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + P++KK+ IA  +          + +KV A P +GKAN+ +   L +      
Sbjct: 12 VIITVHVTPSSKKNEIAGYD-----PWKKALSVKVKAPPVEGKANRELEKFLKEYFG--- 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89
           +++++S + S +K + I    +E  E 
Sbjct: 64 KNVKLVSGEKSRVKKVLIVGCSREEVEK 91


>gi|119384906|ref|YP_915962.1| hypothetical protein Pden_2174 [Paracoccus denitrificans PD1222]
 gi|189039514|sp|A1B422|Y2174_PARDP RecName: Full=UPF0235 protein Pden_2174
 gi|119374673|gb|ABL70266.1| protein of unknown function DUF167 [Paracoccus denitrificans
          PD1222]
          Length = 82

 Score = 82.1 bits (202), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 8/79 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A ++ +          D   +++ VT  P+ GKAN A++ +LAK L ++K
Sbjct: 12 AEIAVRVTPRASRNAVI--------LDGEAIRVTVTTVPEDGKANAAVVKLLAKALGVAK 63

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S L ++   ++  K+  ID
Sbjct: 64 SRLVLVRGATARDKLFRID 82


>gi|47213227|emb|CAF89748.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 120

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 20/93 (21%), Positives = 39/93 (41%), Gaps = 9/93 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V   P +K S + ++           +++ + A P  G+AN  ++  LA+ L L K
Sbjct: 32  VTITVHAKPGSKHSRVTAVST-------EAVEVAIAAPPVDGEANVELVRFLAEVLELKK 84

Query: 62  SSLRMLSKQSSPLKIIYIDK--DCKEITELLQN 92
             L +     S  K + +D     +E+   L+ 
Sbjct: 85  GHLHLDKGSRSRDKQVRVDSPLSPEEVLRRLRQ 117


>gi|260467362|ref|ZP_05813535.1| protein of unknown function DUF167 [Mesorhizobium opportunistum
           WSM2075]
 gi|259028889|gb|EEW30192.1| protein of unknown function DUF167 [Mesorhizobium opportunistum
           WSM2075]
          Length = 103

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 22/92 (23%), Positives = 49/92 (53%), Gaps = 2/92 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++ VRL P +    +  +E   D     H+K++V A P+ G AN+A+  + AK L +  
Sbjct: 12  IDLFVRLTPKSSVDRLEGVETAADG--RSHLKVRVRAVPENGAANQALERLAAKTLGVPV 69

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           S++ +++  ++ LK + +  D + ++  ++  
Sbjct: 70  SAVSVVAGGTARLKTLRVAGDPEALSRSIEAL 101


>gi|157804228|ref|YP_001492777.1| hypothetical protein A1E_05380 [Rickettsia canadensis str.
          McKiel]
 gi|166227320|sp|A8F059|Y5380_RICCK RecName: Full=UPF0235 protein A1E_05380
 gi|157785491|gb|ABV73992.1| hypothetical protein A1E_05380 [Rickettsia canadensis str.
          McKiel]
          Length = 105

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 20/84 (23%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++ P++K++ I+   I  +     ++K+ +   P++GKAN+ ++  LAK   LS+
Sbjct: 14 ALLNLKVKPDSKQNLISDFVIINNLP---YLKLFIKTAPEQGKANEEIINYLAKAWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
          S++ ++   +  LK I I    ++
Sbjct: 71 SNIEIIKGHTHSLKTILIKNIDED 94


>gi|308270456|emb|CBX27068.1| UPF0235 protein PTH_1821 [uncultured Desulfobacterium sp.]
          Length = 106

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 7/91 (7%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V ++P + K+ IA L           +KIK+TA P  G AN   +  LAK L++S S++
Sbjct: 14 KVYILPRSSKNMIAGL-------FGDALKIKLTAAPVDGSANNMCIKYLAKILSVSASNI 66

Query: 65 RMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95
           ++S  +   K I +  + K ++   +   S
Sbjct: 67 EIVSGHTGKTKYILLKNNEKTLSSSTEALIS 97


>gi|110596999|ref|ZP_01385289.1| Protein of unknown function DUF167 [Chlorobium ferrooxidans DSM
          13031]
 gi|110341686|gb|EAT60146.1| Protein of unknown function DUF167 [Chlorobium ferrooxidans DSM
          13031]
          Length = 97

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 18/84 (21%), Positives = 35/84 (41%), Gaps = 7/84 (8%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          V+  P + KS +  L           +K+ + A P    AN+    + +K   +  S + 
Sbjct: 15 VKAQPRSSKSRVCGL-------YNGGLKVSLKAAPVDDAANRECCDLFSKVFHIPPSRVH 67

Query: 66 MLSKQSSPLKIIYIDKDCKEITEL 89
          +++ +SS  K + +D    E   L
Sbjct: 68 IIAGKSSRTKTVMLDGVTVEAAAL 91


>gi|145224153|ref|YP_001134831.1| hypothetical protein Mflv_3569 [Mycobacterium gilvum PYR-GCK]
 gi|315444488|ref|YP_004077367.1| hypothetical protein Mspyr1_29120 [Mycobacterium sp. Spyr1]
 gi|189040164|sp|A4T9S3|Y3569_MYCGI RecName: Full=UPF0235 protein Mflv_3569
 gi|145216639|gb|ABP46043.1| protein of unknown function DUF167 [Mycobacterium gilvum PYR-GCK]
 gi|315262791|gb|ADT99532.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1]
          Length = 75

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 6/76 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P ++K  +       +  +   + + V      GKAN+A+  +LA+ L + +S 
Sbjct: 5  ISVRVKPGSRKGPLV------EAGEDGALTLYVQERAVDGKANEAVTKLLAEHLGVPRSR 58

Query: 64 LRMLSKQSSPLKIIYI 79
          + ++S  +S  K   I
Sbjct: 59 IELVSGATSRHKRFRI 74


>gi|75910098|ref|YP_324394.1| hypothetical protein Ava_3894 [Anabaena variabilis ATCC 29413]
 gi|123608489|sp|Q3M687|Y3894_ANAVT RecName: Full=UPF0235 protein Ava_3894
 gi|75703823|gb|ABA23499.1| Protein of unknown function DUF167 [Anabaena variabilis ATCC
          29413]
          Length = 75

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V++ PN+K+  IA         D   + + + + P  GKAN+ ++ +LA+K A+ KS +
Sbjct: 4  KVKVKPNSKQQKIA-------EQDDGSLTVHLKSPPVDGKANEELIKLLAEKFAVPKSHI 56

Query: 65 RMLSKQSSPLKIIYIDKD 82
           + S  SS  K+I ID D
Sbjct: 57 TIKSGLSSRQKLIEIDTD 74


>gi|54295945|ref|YP_122257.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris]
 gi|53755777|emb|CAH17279.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris]
          Length = 98

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 7/78 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P AK + +         +    +KIK+ A   + KAN  ++  L+    + KS 
Sbjct: 17 LSLLIQPGAKCNQVVG-------AVGEELKIKIAAPSIEVKANMELVRYLSVLFKVPKSQ 69

Query: 64 LRMLSKQSSPLKIIYIDK 81
          +++     S  KII +  
Sbjct: 70 IKIKRGLKSRHKIIEVIG 87


>gi|291413562|ref|XP_002723040.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 154

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 20/96 (20%), Positives = 38/96 (39%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P AK++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 64  VTIAIHAKPGAKQNAVTDLTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 116

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +       EI   L+    
Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPDEILGKLKREAE 152


>gi|197121557|ref|YP_002133508.1| hypothetical protein AnaeK_1146 [Anaeromyxobacter sp. K]
 gi|226696231|sp|B4UGV4|Y1146_ANASK RecName: Full=UPF0235 protein AnaeK_1146
 gi|196171406|gb|ACG72379.1| protein of unknown function DUF167 [Anaeromyxobacter sp. K]
          Length = 95

 Score = 81.7 bits (201), Expect = 3e-14,   Method: Composition-based stats.
 Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A ++                +KI++ A P  G AN A++  LA  L + +
Sbjct: 11 AVLELLVQPRASRTRAVG-------EHDGRLKIQLAAPPVDGAANAALVEFLAVALGVRR 63

Query: 62 SSLRMLSKQSSPLKIIYI 79
          + + +L  ++   K + +
Sbjct: 64 ADVALLRGETGRRKTVRV 81


>gi|75674677|ref|YP_317098.1| hypothetical protein Nwi_0479 [Nitrobacter winogradskyi Nb-255]
 gi|74419547|gb|ABA03746.1| Protein of unknown function DUF167 [Nitrobacter winogradskyi
           Nb-255]
          Length = 106

 Score = 81.3 bits (200), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 23/93 (24%), Positives = 51/93 (54%), Gaps = 2/93 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ P   +  I  +E+  D    + ++++  A    G+AN+A+ A+LAK L + K  
Sbjct: 13  IALRVTPRGGRDAIDGIEMLADGRPVVKVRVRAVA--DGGEANRAVTAVLAKALGVRKID 70

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96
           +R+L+  +S LK + ++ D  ++   L+   ++
Sbjct: 71  VRILAGATSRLKQVAVEGDPVQLGNALRALTAV 103


>gi|260892905|ref|YP_003239002.1| protein of unknown function DUF167 [Ammonifex degensii KC4]
 gi|260865046|gb|ACX52152.1| protein of unknown function DUF167 [Ammonifex degensii KC4]
          Length = 103

 Score = 81.3 bits (200), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 16/89 (17%), Positives = 41/89 (46%), Gaps = 8/89 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ P +              +    +++++TA P +GKAN+ +L  L++ L +    
Sbjct: 20  IHLRVTPRSS--------TLALEAGEGFLRVRLTAPPVEGKANELLLEFLSRVLDIPARR 71

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
           L+++       K++ +D     + E ++ 
Sbjct: 72  LQLVKGLKGREKVVLVDMPLPLVAEKIEK 100


>gi|152991960|ref|YP_001357681.1| hypothetical protein SUN_0364 [Sulfurovum sp. NBC37-1]
 gi|151423821|dbj|BAF71324.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
          Length = 95

 Score = 81.3 bits (200), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 11/91 (12%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           ++ ++  P A ++    +           +KI++ A   +G ANK ++  LAK   + K
Sbjct: 10 VSMRIKAQPAASRNEFCDIY------GEDAIKIRIKAPAVEGAANKELMKFLAKSFKVPK 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          S +   S Q+S +KI+        +TE  QN
Sbjct: 64 SDIIFKSGQNSKIKIVEFP-----LTEKFQN 89


>gi|149690913|ref|XP_001498126.1| PREDICTED: similar to chromosome 15 open reading frame 40 [Equus
           caballus]
          Length = 167

 Score = 81.3 bits (200), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 15/82 (18%), Positives = 34/82 (41%), Gaps = 7/82 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  +           + + + A P +G+AN  +   L+K L L K
Sbjct: 77  VTIAIHAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLDLRK 129

Query: 62  SSLRMLSKQSSPLKIIYIDKDC 83
           S + +     S  K++ +    
Sbjct: 130 SDVVLDKGGKSREKVVKLLAST 151


>gi|148236990|ref|NP_001089221.1| hypothetical protein LOC734268 [Xenopus laevis]
 gi|57920974|gb|AAH89152.1| MGC85153 protein [Xenopus laevis]
          Length = 121

 Score = 81.3 bits (200), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 20/93 (21%), Positives = 38/93 (40%), Gaps = 9/93 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P AK++ I  +             + + A P +G+AN  +   L+K L L K
Sbjct: 31  VTISIHAKPGAKQNAITDVTADAVG-------VAIAAPPTEGEANAELCRYLSKVLVLKK 83

Query: 62  SSLRMLSKQSSPLKIIYIDKD--CKEITELLQN 92
           S + +     S  K++ I      + + E L+ 
Sbjct: 84  SEVSLDKGGKSREKVVKISASITPEVVLERLKE 116


>gi|194748771|ref|XP_001956818.1| GF24383 [Drosophila ananassae]
 gi|190624100|gb|EDV39624.1| GF24383 [Drosophila ananassae]
          Length = 126

 Score = 81.3 bits (200), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 10/95 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  +             +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 39  IQILAKPGAKQNGITGISTEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 91

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNNDS 95
           + +     S  KII I K     +   +LL+    
Sbjct: 92  VSLDKGSRSRNKIILITKGAITTEAAEQLLRKESE 126


>gi|242398935|ref|YP_002994359.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739]
 gi|242265328|gb|ACS90010.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739]
          Length = 94

 Score = 80.9 bits (199), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A  + I  ++          +K+K+ A P +GKANK ++   +K L    
Sbjct: 9  VILQIYVQPKANTNEIEGVD-----EWRGRLKVKIKAPPVEGKANKEVVKFFSKLLG--- 60

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
          +   ++  ++S  K + I     +E+ + L+
Sbjct: 61 AEASLIKGETSREKDLLIRGISIEEVKKKLK 91


>gi|224373471|ref|YP_002607843.1| hypothetical protein NAMH_1451 [Nautilia profundicola AmH]
 gi|223589921|gb|ACM93657.1| conserved hypothetical protein [Nautilia profundicola AmH]
          Length = 94

 Score = 80.9 bits (199), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 18/91 (19%), Positives = 43/91 (47%), Gaps = 9/91 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + ++  PN+ K+ IA             +K+ + A   +G ANK ++  + K+  + KS 
Sbjct: 11 IKIKAQPNSSKNKIAG-------KYGESLKVNIKAPAVEGAANKELIKFIGKEFKIPKSE 63

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
          + +   ++S  K++ +     +I + L+  +
Sbjct: 64 IAI-KGETSKQKVLIVP-FRDDIIKKLEEIN 92


>gi|261855187|ref|YP_003262470.1| hypothetical protein Hneap_0569 [Halothiobacillus neapolitanus c2]
 gi|261835656|gb|ACX95423.1| protein of unknown function DUF167 [Halothiobacillus neapolitanus
           c2]
          Length = 122

 Score = 80.9 bits (199), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 20/102 (19%), Positives = 45/102 (44%), Gaps = 14/102 (13%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +++ P A ++    +           +K+ +T  P  GKAN+A++  +AK  +++K+ 
Sbjct: 19  LKLKVQPRASRTAWGEV-------IGGRIKLYLTTPPIDGKANQAVIDFIAKTFSVAKNR 71

Query: 64  LRMLSKQSSPLKIIYI-------DKDCKEITELLQNNDSLTL 98
           + +   ++S  K I +           +    LLQ   S  +
Sbjct: 72  VEIRRGETSRSKDIALAWLTVPKASSSEAERLLLQALSSKAI 113


>gi|17230688|ref|NP_487236.1| hypothetical protein asl3196 [Nostoc sp. PCC 7120]
 gi|29839730|sp|Q8YS95|Y3196_ANASP RecName: Full=UPF0235 protein asl3196
 gi|17132291|dbj|BAB74895.1| asl3196 [Nostoc sp. PCC 7120]
          Length = 75

 Score = 80.9 bits (199), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 7/78 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V++ PN+K+  IA         D   + + + + P  GKAN+ ++ +LA+K  + KS +
Sbjct: 4  RVKVKPNSKQQKIA-------EQDDGSLTVHLKSPPVDGKANEELIKLLAEKFDVPKSHI 56

Query: 65 RMLSKQSSPLKIIYIDKD 82
           + S  SS  K+I I+ D
Sbjct: 57 TIKSGLSSKQKLIEIETD 74


>gi|320108335|ref|YP_004183925.1| hypothetical protein AciPR4_3175 [Terriglobus saanensis SP1PR4]
 gi|319926856|gb|ADV83931.1| protein of unknown function DUF167 [Terriglobus saanensis SP1PR4]
          Length = 104

 Score = 80.9 bits (199), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 7/77 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P AK+SG+  +           +KI + A    GKAN+A++  +A  L + + S+
Sbjct: 19 AVRVQPGAKRSGVVGI-------YGEAVKIALVAPAVDGKANEALVRFVATLLDVPRMSV 71

Query: 65 RMLSKQSSPLKIIYIDK 81
           +LS  SS  K++ +  
Sbjct: 72 EILSGVSSRSKVVKVLG 88


>gi|24656569|ref|NP_647784.1| CG14966 [Drosophila melanogaster]
 gi|7292328|gb|AAF47735.1| CG14966 [Drosophila melanogaster]
 gi|289526411|gb|ADD01328.1| RE68649p [Drosophila melanogaster]
          Length = 140

 Score = 80.9 bits (199), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 10/94 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  +             +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 52  IQILAKPGAKQNGITGIGFEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 104

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94
           + +     S  KII I K     + I +LL+   
Sbjct: 105 VSLDKGSRSRNKIIMITKGVSTVEAIEQLLRKES 138


>gi|73951609|ref|XP_545872.2| PREDICTED: similar to CG14966-PA [Canis familiaris]
          Length = 152

 Score = 80.9 bits (199), Expect = 5e-14,   Method: Composition-based stats.
 Identities = 20/94 (21%), Positives = 41/94 (43%), Gaps = 9/94 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +R  P +K++ +  +           + + + A P +G+AN  +   L+K L L K
Sbjct: 62  VTIAIRAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 114

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93
           S + +     S  K++ +      +EI E L+  
Sbjct: 115 SDVVLDKGGKSREKVVKLLASTTAEEILEKLKQQ 148


>gi|237747224|ref|ZP_04577704.1| predicted protein [Oxalobacter formigenes HOxBLS]
 gi|229378575|gb|EEO28666.1| predicted protein [Oxalobacter formigenes HOxBLS]
          Length = 100

 Score = 80.9 bits (199), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 27/92 (29%), Positives = 54/92 (58%), Gaps = 10/92 (10%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + V++IPNA+KS I        +S+   ++I++ A P  GKAN+A++ +LAKKL + + 
Sbjct: 14 RIAVQVIPNARKSEIV-------SSEGETLRIRLQAQPVDGKANEALVQLLAKKLRVPRK 66

Query: 63 SLRMLSKQSSPLKIIYI---DKDCKEITELLQ 91
           + +    ++  K++ +   D+  ++I + LQ
Sbjct: 67 QVSITHGLANKRKLLEVIVSDRSQEDIVKQLQ 98


>gi|54298708|ref|YP_125077.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris]
 gi|53752493|emb|CAH13925.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris]
          Length = 95

 Score = 80.9 bits (199), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 18/78 (23%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +   PNAKK+ + ++           + I + A PQ+G+AN  +L  +++   + K
Sbjct: 10 VEIAIYAKPNAKKTKLMAI-------SDDRLHIALHAKPQEGEANNELLFFISQFFKIPK 62

Query: 62 SSLRMLSKQSSPLKIIYI 79
          + + ++  +SS  K+I +
Sbjct: 63 TQIELIKGKSSRHKLIRL 80


>gi|283850632|ref|ZP_06367919.1| protein of unknown function DUF167 [Desulfovibrio sp. FW1012B]
 gi|283573875|gb|EFC21848.1| protein of unknown function DUF167 [Desulfovibrio sp. FW1012B]
          Length = 125

 Score = 80.5 bits (198), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 16/81 (19%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V + P   +  +A L           +++++ A   +G+AN  +   LA  L L    
Sbjct: 40  LRVVVTPGGSRDTLAGL-------AEGRLRVRLRAKAVEGQANAGLTVFLAGCLGLRPRQ 92

Query: 64  LRMLSKQSSPLKIIYIDKDCK 84
           + ++S + S  K + I  + +
Sbjct: 93  VAIVSGEKSRKKTLRISAESE 113


>gi|84996333|ref|XP_952888.1| proton translocating inorganic pyrophosphatase [Theileria annulata
            strain Ankara]
 gi|65303885|emb|CAI76264.1| proton translocating inorganic pyrophosphatase, putative [Theileria
            annulata]
          Length = 1204

 Score = 80.5 bits (198), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 16/96 (16%), Positives = 39/96 (40%), Gaps = 25/96 (26%)

Query: 4    VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK------- 56
            + V + P ++++ I              + +++ A P++G+ NKA++  ++K        
Sbjct: 1098 LKVNVKPGSRQTQIIG-------ESEGRLSVQIAAPPREGECNKALIEFISKTRNFYYFL 1150

Query: 57   -----------LALSKSSLRMLSKQSSPLKIIYIDK 81
                       + + K ++ +L    S  KI+ I  
Sbjct: 1151 AFNTFNVLIFLVGVKKGNVTLLHGHKSRDKILSITG 1186


>gi|15604669|ref|NP_221187.1| hypothetical protein RP839 [Rickettsia prowazekii str. Madrid E]
 gi|6686136|sp|Q9ZCC0|Y839_RICPR RecName: Full=UPF0235 protein RP839
 gi|3861364|emb|CAA15263.1| unknown [Rickettsia prowazekii]
 gi|292572500|gb|ADE30415.1| hypothetical protein rpr22_CDS819 [Rickettsia prowazekii Rp22]
          Length = 105

 Score = 80.5 bits (198), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 26/89 (29%), Positives = 51/89 (57%), Gaps = 3/89 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ P AK++ I +  I  +     ++K+ + ATP++GKAN+ ++  LAK+  LS+
Sbjct: 14 ALINVKVKPYAKQNLIGNFVIINNIP---YIKLAIKATPEQGKANEGIIHYLAKEWELSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          SS+ ++   +  LK I I    ++   L+
Sbjct: 71 SSIEIIKGHTHSLKTILIKNINEDYLNLI 99


>gi|46446134|ref|YP_007499.1| hypothetical protein pc0500 [Candidatus Protochlamydia
          amoebophila UWE25]
 gi|46399775|emb|CAF23224.1| conserved hypothetical protein [Candidatus Protochlamydia
          amoebophila UWE25]
          Length = 92

 Score = 80.5 bits (198), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 24/89 (26%), Positives = 43/89 (48%), Gaps = 7/89 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +++IP A  S     E          +KI++ A P KG+AN  ++  L+    + KSS
Sbjct: 11 IKIKVIPLASFSEKVGWE-------GDELKIRLAAIPDKGQANTELIRFLSSLFKIRKSS 63

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          ++++  Q+S  K I I     E  ++L  
Sbjct: 64 IQLIQGQTSRHKKICIQDISLERLQVLLA 92


>gi|52842921|ref|YP_096720.1| hypothetical protein lpg2716 [Legionella pneumophila subsp.
          pneumophila str. Philadelphia 1]
 gi|52630032|gb|AAU28773.1| hypothetical protein lpg2716 [Legionella pneumophila subsp.
          pneumophila str. Philadelphia 1]
          Length = 95

 Score = 80.5 bits (198), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +   PNAKKS + ++           + I + A PQ+G+AN  +L  +++   + K
Sbjct: 10 VEIAIYAKPNAKKSKLMAI-------SDDRLHIALHAKPQEGEANNELLFFISQFFKIPK 62

Query: 62 SSLRMLSKQSSPLKIIYI 79
          + + ++  +SS  K+I +
Sbjct: 63 TQIELIKGKSSRHKLIRL 80


>gi|330795787|ref|XP_003285952.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum]
 gi|325084041|gb|EGC37478.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum]
          Length = 110

 Score = 80.5 bits (198), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 48/88 (54%), Gaps = 7/88 (7%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  + + + PN+K++ I S E          + ++++  P  G+ANK ++  L+K+L L 
Sbjct: 23  ILKLNINVHPNSKENQIISFE-------NEILSLRISEPPIDGQANKGVVEFLSKELGLR 75

Query: 61  KSSLRMLSKQSSPLKIIYIDKDCKEITE 88
           KS++++     S  K I ID + + IT+
Sbjct: 76  KSNIQVSKGSKSRNKSIEIDLESESITK 103


>gi|52345734|ref|NP_001004913.1| chromosome 15 open reading frame 40 [Xenopus (Silurana) tropicalis]
 gi|49523245|gb|AAH75357.1| MGC89060 protein [Xenopus (Silurana) tropicalis]
          Length = 120

 Score = 80.2 bits (197), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 20/93 (21%), Positives = 38/93 (40%), Gaps = 9/93 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P AK++ I  +             + + A P +G+AN  +   L+K L L K
Sbjct: 30  VIISIHAKPGAKQNAITDVTADAVG-------VAIAAPPTEGEANAELCRYLSKVLVLKK 82

Query: 62  SSLRMLSKQSSPLKIIYIDKD--CKEITELLQN 92
           S + +     S  K++ I      + + E L+ 
Sbjct: 83  SEVSLDKGGKSREKVVKISASITPEVVLEKLKE 115


>gi|224062575|ref|XP_002197110.1| PREDICTED: hypothetical protein, partial [Taeniopygia guttata]
          Length = 138

 Score = 80.2 bits (197), Expect = 8e-14,   Method: Composition-based stats.
 Identities = 21/89 (23%), Positives = 36/89 (40%), Gaps = 9/89 (10%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P A+ S +  +             + + A P +G+AN  +   L+K L + KS + +  
Sbjct: 56  KPGARCSAVTDVTAEAVG-------VAIAAPPSEGEANAELCRYLSKVLEVKKSDVILEK 108

Query: 69  KQSSPLKIIYI--DKDCKEITELLQNNDS 95
              S  K++ I       EI E L+   S
Sbjct: 109 GGKSRDKVVKISVSATPDEILEKLKKEAS 137


>gi|294676098|ref|YP_003576713.1| hypothetical protein RCAP_rcc00541 [Rhodobacter capsulatus SB
          1003]
 gi|294474918|gb|ADE84306.1| protein of unknown function DUF167 [Rhodobacter capsulatus SB
          1003]
          Length = 83

 Score = 80.2 bits (197), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 21/79 (26%), Positives = 41/79 (51%), Gaps = 8/79 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +R+ P A ++ I              ++  VT  P+ GKAN A++ +LAK L ++K
Sbjct: 13 AEIALRVTPKASRNEIV--------VAEDGLRAYVTVVPEGGKANAAVVKLLAKSLGVAK 64

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S L ++  +++  K+  +D
Sbjct: 65 SRLTLIRGETARDKVFRLD 83


>gi|163737739|ref|ZP_02145156.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis
          BS107]
 gi|163744042|ref|ZP_02151409.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10]
 gi|161382658|gb|EDQ07060.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10]
 gi|161389265|gb|EDQ13617.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis
          BS107]
          Length = 98

 Score = 80.2 bits (197), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 1/76 (1%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A ++ I  +   +     + +KI VTA P+ GKA  A+ A+LA+ + ++ 
Sbjct: 20 AEIPVRVTPKASRNAILPIPQAESGQ-GVSLKITVTAAPENGKATAAVQALLARAMRIAP 78

Query: 62 SSLRMLSKQSSPLKII 77
          S L +L   +S  K+ 
Sbjct: 79 SDLELLRGATSRDKVF 94


>gi|86157514|ref|YP_464299.1| hypothetical protein Adeh_1087 [Anaeromyxobacter dehalogenans
          2CP-C]
 gi|123499918|sp|Q2IPY3|Y1087_ANADE RecName: Full=UPF0235 protein Adeh_1087
 gi|85774025|gb|ABC80862.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans
          2CP-C]
          Length = 95

 Score = 80.2 bits (197), Expect = 9e-14,   Method: Composition-based stats.
 Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A ++                +KI++ A P  G AN A++  LA  L + +
Sbjct: 11 AVLEILVQPRASRTRAVG-------EHDGRLKIQLAAPPVDGAANAALVEFLAVALGVRR 63

Query: 62 SSLRMLSKQSSPLKIIYI 79
          + + +L  ++   K + +
Sbjct: 64 ADVALLRGEAGRRKTVRV 81


>gi|195428527|ref|XP_002062324.1| GK17477 [Drosophila willistoni]
 gi|194158409|gb|EDW73310.1| GK17477 [Drosophila willistoni]
          Length = 127

 Score = 80.2 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 39  IKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKYLSKVLGLRKSD 91

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           + +     S  KII + K    +   L+
Sbjct: 92  VSLDKGSRSRNKIILVSKGVSTVEACLE 119


>gi|195160833|ref|XP_002021278.1| GL25246 [Drosophila persimilis]
 gi|198465041|ref|XP_001353470.2| GA13391 [Drosophila pseudoobscura pseudoobscura]
 gi|194118391|gb|EDW40434.1| GL25246 [Drosophila persimilis]
 gi|198149990|gb|EAL30981.2| GA13391 [Drosophila pseudoobscura pseudoobscura]
          Length = 125

 Score = 80.2 bits (197), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 25/93 (26%), Positives = 45/93 (48%), Gaps = 10/93 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK +GI ++++           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 37  IKILAKPGAKHNGITNIDLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 89

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93
           + +     S  KII I K     + I +LL+  
Sbjct: 90  VSLDKGSRSRNKIILISKGASTVESIQQLLRKE 122


>gi|329893887|ref|ZP_08269938.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium
          IMCC3088]
 gi|328923406|gb|EGG30722.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium
          IMCC3088]
          Length = 83

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 4/79 (5%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P AK++ I      +D      +++ VT  P+ GKAN+A++  LA    + ++ 
Sbjct: 6  VSVRVTPKAKQAQI----KVEDIEGQSVLRVYVTVAPEDGKANRAVMRALADYFDVPRTR 61

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          +++LS      K   ID D
Sbjct: 62 IKLLSGAKQRDKRFSIDAD 80


>gi|161723222|ref|NP_219898.2| hypothetical protein CT388 [Chlamydia trachomatis D/UW-3/CX]
 gi|162019865|ref|YP_328205.2| hypothetical protein CTA_0423 [Chlamydia trachomatis A/HAR-13]
          Length = 100

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 8/95 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+   A+++ +  LE          ++++VT  P+KGKAN A++A+LA  L++ KS 
Sbjct: 8  LEVRVTTKARENRVVCLE-------DGILRVRVTEVPEKGKANDAVVALLANFLSIPKSD 60

Query: 64 LRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSLT 97
          + +++ ++S  K + + +  K  + E   +  S T
Sbjct: 61 VTLIAGEASRRKKVLLPRSIKAFLLEQFPSESSST 95


>gi|221640262|ref|YP_002526524.1| hypothetical protein RSKD131_2163 [Rhodobacter sphaeroides KD131]
 gi|221161043|gb|ACM02023.1| Hypothetical Protein RSKD131_2163 [Rhodobacter sphaeroides KD131]
          Length = 85

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 8/76 (10%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P A ++ +              +++ VT  P+ GKAN+A+   LAK L ++KS L
Sbjct: 18 AVRVTPRAARAKVD--------LQEGVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69

Query: 65 RMLSKQSSPLKIIYID 80
           ++   +S  K   +D
Sbjct: 70 TLVRGATSRDKTFRLD 85


>gi|118353243|ref|XP_001009893.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila]
 gi|89291659|gb|EAR89647.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila
           SB210]
          Length = 127

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 45/93 (48%), Gaps = 11/93 (11%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   PN+K S I+ +           + I + A P+ G+AN  ++  +++ L + KSS
Sbjct: 39  ISIHAKPNSKISQISGI-------SDEGVDINIAAPPKDGEANAELIDYISQVLGVKKSS 91

Query: 64  LRMLSKQSSPLKIIYID----KDCKEITELLQN 92
           L +     S  K++ I      D +E+ + L++
Sbjct: 92  LSLDKGGKSRNKLMEISDSGYADVEELYQALKD 124


>gi|307176568|gb|EFN66055.1| UPF0235 protein C15orf40-like protein [Camponotus floridanus]
          Length = 122

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 8/93 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + ++  P AK + I  +             I ++A P +G+AN  ++  LA    L K
Sbjct: 34  VVIKIQAKPGAKCNNITDISDEAVG-------IAISAPPMEGEANAELVKYLASIFELRK 86

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           S++ +     S  K + +      ++   L+  
Sbjct: 87  SNVSLDRGSRSRQKTVTVSGITTDQVLAKLKGE 119


>gi|328866439|gb|EGG14823.1| hypothetical protein DFA_10696 [Dictyostelium fasciculatum]
          Length = 152

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 6/88 (6%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  + V + PNAK+S I              + ++++  P  GKAN  ++  L+ +L L 
Sbjct: 56  IVRLNVNVHPNAKQSTIVQFTDD------GCLDLRISQPPIDGKANDEVIDFLSDELKLK 109

Query: 61  KSSLRMLSKQSSPLKIIYIDKDCKEITE 88
           K  + +     S  K+I ID     +T+
Sbjct: 110 KRFITVDKGLKSRNKVIAIDLSESSLTK 137


>gi|300087416|ref|YP_003757938.1| hypothetical protein Dehly_0296 [Dehalogenimonas
          lykanthroporepellens BL-DC-9]
 gi|299527149|gb|ADJ25617.1| protein of unknown function DUF167 [Dehalogenimonas
          lykanthroporepellens BL-DC-9]
          Length = 96

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 8/94 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++ P + ++ I              ++++VTA P+ GKAN+A+  +LA++L L K
Sbjct: 10 AVIALKVQPGSGRNEITDTAA-------EIIRVRVTAAPEHGKANRAVAELLAERLGLPK 62

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK-EITELLQNND 94
          S + ++   +S  K+  +    + E+ E L   D
Sbjct: 63 SRVTIVRGLTSRRKVAAVAGLSEAEVREKLGKTD 96


>gi|54295557|ref|YP_127972.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens]
 gi|53755389|emb|CAH16885.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens]
          Length = 95

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +   PNAKKS + ++           + I + A PQ+G+AN  +L  +++   + K
Sbjct: 10 VEIAIYAKPNAKKSKLMAI-------SDDSLHIALHAKPQEGEANNELLFFISQFFKIPK 62

Query: 62 SSLRMLSKQSSPLKIIYI 79
          + + ++  +SS  K+I +
Sbjct: 63 TQIELIKGKSSRHKLIRL 80


>gi|162456439|ref|YP_001618806.1| hypothetical protein sce8156 [Sorangium cellulosum 'So ce 56']
 gi|161167021|emb|CAN98326.1| hypothetical protein sce8156 [Sorangium cellulosum 'So ce 56']
          Length = 139

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 16/88 (18%), Positives = 39/88 (44%), Gaps = 7/88 (7%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V++ P + +S I  +           + + +TA P +G AN  ++ +L++ L + K
Sbjct: 52  VRISVQVRPKSSRSAIVGVR-------EGALDVSLTAPPVEGAANAELVKLLSRALDVRK 104

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITEL 89
           S +++    S   K++ +    +     
Sbjct: 105 SDVQIALGASGRSKVVAVRGLKEAEARK 132


>gi|307102424|gb|EFN50700.1| hypothetical protein CHLNCDRAFT_142618 [Chlorella variabilis]
          Length = 127

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 9/92 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           C + V   P +K   +              +++ V A P  G+AN A++  +A+ L L +
Sbjct: 43  CRIGVHAKPGSKVCSVT--------LGPDALEVAVDAKPVDGEANAALIEFVAEVLGLKR 94

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
             + + S  +S  K++ +   D     + L+ 
Sbjct: 95  RDVTLASGTTSRHKVLAVAGIDAHAALQRLRQ 126


>gi|325982093|ref|YP_004294495.1| hypothetical protein NAL212_1446 [Nitrosomonas sp. AL212]
 gi|325531612|gb|ADZ26333.1| protein of unknown function DUF167 [Nitrosomonas sp. AL212]
          Length = 113

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 41/93 (44%), Gaps = 8/93 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + +   AK +  A L           +KI++ A P +GKAN  ++  LA +  +    
Sbjct: 17  LTLHIQTGAKITKAAGL-------LGGALKIRLAAAPVEGKANSTLIKFLAAQFDVPIGQ 69

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96
           +R+     S  K+I I +   +   +L N + L
Sbjct: 70  VRLKQGGKSRHKVIVIHRSVHD-PRVLFNIEGL 101


>gi|269127125|ref|YP_003300495.1| hypothetical protein Tcur_2915 [Thermomonospora curvata DSM
          43183]
 gi|268312083|gb|ACY98457.1| protein of unknown function DUF167 [Thermomonospora curvata DSM
          43183]
          Length = 91

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V +R+ P A ++ +              + +KV A   +GKA +A L  +A  L + +
Sbjct: 4  VRVAIRVGPGASRTKVGGAH-------GEALVVKVAARAVEGKATEAALRAVADALGVRR 56

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            +R++S  +S  K+I +D D + +TE +   
Sbjct: 57 RDVRLISGATSREKLIEVDGDERALTEKINAL 88


>gi|194866006|ref|XP_001971712.1| GG14280 [Drosophila erecta]
 gi|190653495|gb|EDV50738.1| GG14280 [Drosophila erecta]
          Length = 127

 Score = 79.8 bits (196), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 26/94 (27%), Positives = 44/94 (46%), Gaps = 10/94 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 40  IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94
           + +     S  KII I K     + I ELL+   
Sbjct: 93  VSLDKGSRSRNKIIMITKGVSTVEAIEELLRKES 126


>gi|126463218|ref|YP_001044332.1| hypothetical protein Rsph17029_2458 [Rhodobacter sphaeroides ATCC
          17029]
 gi|126104882|gb|ABN77560.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC
          17029]
          Length = 85

 Score = 79.4 bits (195), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 8/76 (10%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P A ++ +              +++ VT  P+ GKAN+A+   LAK L ++KS L
Sbjct: 18 AVRVTPRAARAKVD--------LQEGVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69

Query: 65 RMLSKQSSPLKIIYID 80
           ++   +S  K   +D
Sbjct: 70 TLVRGATSRDKTFRLD 85


>gi|315230060|ref|YP_004070496.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP]
 gi|315183088|gb|ADT83273.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP]
          Length = 92

 Score = 79.4 bits (195), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 9/90 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++V + PN K++ I  ++          +K+KV+A P  GKANK +   L+K L    
Sbjct: 9  VILLVHVQPNTKRNSIEGVD-----KWKGRIKVKVSAPPVGGKANKELTKFLSKLLG--- 60

Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELL 90
            + +L  ++S  K + I     +E+ E L
Sbjct: 61 KEVVILRGETSREKDLLIKGATIEEVKEKL 90


>gi|118794573|ref|XP_321597.3| AGAP001528-PA [Anopheles gambiae str. PEST]
 gi|116116359|gb|EAA01322.3| AGAP001528-PA [Anopheles gambiae str. PEST]
          Length = 147

 Score = 79.4 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 12/97 (12%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V +   P AK SGI  +              ++ A P  G+AN  ++  L+K L L KS 
Sbjct: 55  VKILAKPGAKTSGITDVSEEGIG-------CQIAAPPIDGEANTELIKYLSKLLDLRKSD 107

Query: 64  LRMLSKQSSPLKIIYIDK-----DCKEITELLQNNDS 95
           + +     S  K I +DK       +++  + +N  S
Sbjct: 108 ISLDRGSKSRQKTIVLDKAGCRHSPEQLLVIFRNEAS 144


>gi|126178968|ref|YP_001046933.1| hypothetical protein Memar_1018 [Methanoculleus marisnigri JR1]
 gi|125861762|gb|ABN56951.1| protein of unknown function DUF167 [Methanoculleus marisnigri
          JR1]
          Length = 105

 Score = 79.4 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 16/83 (19%), Positives = 36/83 (43%), Gaps = 4/83 (4%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + +   AK+S                +K ++ A    GKAN+A+  +LA+   + +
Sbjct: 15 VTITLDVTAGAKRSSF----PAGYNEWRKSIKCQIAAPAVGGKANRAITDLLAETFGVPR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84
          + + +++  +S  K + I    K
Sbjct: 71 ADVSIITGHTSSSKTVAIAGVSK 93


>gi|166154599|ref|YP_001654717.1| hypothetical protein CTL0644 [Chlamydia trachomatis 434/Bu]
 gi|166155474|ref|YP_001653729.1| hypothetical protein CTLon_0641 [Chlamydia trachomatis
           L2b/UCH-1/proctitis]
 gi|237802813|ref|YP_002888007.1| hypothetical protein JALI_3871 [Chlamydia trachomatis B/Jali20/OT]
 gi|237804735|ref|YP_002888889.1| hypothetical protein CTB_3871 [Chlamydia trachomatis B/TZ1A828/OT]
 gi|255311194|ref|ZP_05353764.1| hypothetical protein Ctra62_02010 [Chlamydia trachomatis 6276]
 gi|255317495|ref|ZP_05358741.1| hypothetical protein Ctra6_02000 [Chlamydia trachomatis 6276s]
 gi|255348753|ref|ZP_05380760.1| hypothetical protein Ctra70_02050 [Chlamydia trachomatis 70]
 gi|255503293|ref|ZP_05381683.1| hypothetical protein Ctra7_02060 [Chlamydia trachomatis 70s]
 gi|255506972|ref|ZP_05382611.1| hypothetical protein CtraD_02040 [Chlamydia trachomatis D(s)2923]
 gi|301335866|ref|ZP_07224110.1| hypothetical protein CtraL_03535 [Chlamydia trachomatis L2tet1]
 gi|29839454|sp|O84393|Y388_CHLTR RecName: Full=UPF0235 protein CT_388
 gi|123606912|sp|Q3KLW5|Y423_CHLTA RecName: Full=UPF0235 protein CTA_0423
 gi|226707987|sp|B0BC23|Y641_CHLTB RecName: Full=UPF0235 protein CTLon_0641
 gi|226707989|sp|B0B7V8|Y644_CHLT2 RecName: Full=UPF0235 protein CTL0644
 gi|3328814|gb|AAC67985.1| hypothetical protein CT_388 [Chlamydia trachomatis D/UW-3/CX]
 gi|76167649|gb|AAX50657.1| hypothetical cytosolic protein [Chlamydia trachomatis A/HAR-13]
 gi|165930587|emb|CAP04084.1| conserved hypothetical protein [Chlamydia trachomatis 434/Bu]
 gi|165931462|emb|CAP07038.1| conserved hypothetical protein [Chlamydia trachomatis
           L2b/UCH-1/proctitis]
 gi|231273035|emb|CAX09948.1| conserved hypothetical protein [Chlamydia trachomatis B/TZ1A828/OT]
 gi|231274047|emb|CAX10841.1| conserved hypothetical protein [Chlamydia trachomatis B/Jali20/OT]
 gi|289525430|emb|CBJ14907.1| conserved hypothetical protein [Chlamydia trachomatis Sweden2]
 gi|296434982|gb|ADH17160.1| hypothetical protein E150_02035 [Chlamydia trachomatis E/150]
 gi|296435909|gb|ADH18083.1| hypothetical protein G9768_02005 [Chlamydia trachomatis G/9768]
 gi|296436835|gb|ADH19005.1| hypothetical protein G11222_02005 [Chlamydia trachomatis G/11222]
 gi|296437769|gb|ADH19930.1| hypothetical protein G11074_02005 [Chlamydia trachomatis G/11074]
 gi|296438702|gb|ADH20855.1| hypothetical protein E11023_02020 [Chlamydia trachomatis E/11023]
 gi|297140269|gb|ADH97027.1| hypothetical protein CTG9301_02010 [Chlamydia trachomatis G/9301]
 gi|297748518|gb|ADI51064.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-EC]
 gi|297749398|gb|ADI52076.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-LC]
          Length = 115

 Score = 79.4 bits (195), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 8/95 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + VR+   A+++ +  LE          ++++VT  P+KGKAN A++A+LA  L++ KS 
Sbjct: 23  LEVRVTTKARENRVVCLE-------DGILRVRVTEVPEKGKANDAVVALLANFLSIPKSD 75

Query: 64  LRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSLT 97
           + +++ ++S  K + + +  K  + E   +  S T
Sbjct: 76  VTLIAGEASRRKKVLLPRSIKAFLLEQFPSESSST 110


>gi|237858662|ref|NP_653198.2| hypothetical protein LOC123207 isoform a [Homo sapiens]
 gi|119582835|gb|EAW62431.1| chromosome 15 open reading frame 40, isoform CRA_a [Homo sapiens]
          Length = 153

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 63  VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEI 86
           S + +     S  K++ +      +EI
Sbjct: 116 SDVVLDKGGKSREKVVKLLASTTPEEI 142


>gi|119493536|ref|ZP_01624202.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106]
 gi|119452653|gb|EAW33834.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106]
          Length = 81

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 7/82 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + V++ PN+K+  +   E            I + + P  GKAN+ ++ +LAK+  + 
Sbjct: 1  MVVLHVKVKPNSKQQSMVKNE-------EGTYIIHLKSPPIDGKANQELIKILAKQFNIP 53

Query: 61 KSSLRMLSKQSSPLKIIYIDKD 82
          KS + + S  SS  K+I +   
Sbjct: 54 KSQVSIKSGLSSKNKLIELPDS 75


>gi|218246502|ref|YP_002371873.1| hypothetical protein PCC8801_1666 [Cyanothece sp. PCC 8801]
 gi|257059535|ref|YP_003137423.1| hypothetical protein Cyan8802_1684 [Cyanothece sp. PCC 8802]
 gi|218166980|gb|ACK65717.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8801]
 gi|256589701|gb|ACV00588.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8802]
          Length = 72

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 7/76 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V++ PNAKK  I          +   + + + + P +GKAN+ ++ +LAKK  +S+S +
Sbjct: 4  QVKVKPNAKKQKI-------QEEEDGSLVVYLKSPPIEGKANQELIKLLAKKFGVSQSQI 56

Query: 65 RMLSKQSSPLKIIYID 80
           + S  SS  K + I+
Sbjct: 57 SIKSGLSSRNKWVEIE 72


>gi|332187012|ref|ZP_08388753.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17]
 gi|332013022|gb|EGI55086.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17]
          Length = 84

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 15/85 (17%), Positives = 39/85 (45%), Gaps = 7/85 (8%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           P   +  +             H   +++A P  G AN A++ ++AK   ++K ++ +++
Sbjct: 2  TPRGGRDMLT-------AGTEDHFAARLSAPPVDGAANAALVPLVAKHFGVAKRAVTIVA 54

Query: 69 KQSSPLKIIYIDKDCKEITELLQNN 93
           +++ LK ++I  D   +  + +  
Sbjct: 55 GETARLKRLHIAGDPHILARMAEAL 79


>gi|56695839|ref|YP_166190.1| hypothetical protein SPO0937 [Ruegeria pomeroyi DSS-3]
 gi|56677576|gb|AAV94242.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3]
          Length = 92

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 19/78 (24%), Positives = 37/78 (47%), Gaps = 8/78 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +R+ P A +  +              ++I VTA P+ GKAN+A+  +LA+ + ++ S 
Sbjct: 22 IALRVTPKAARDSVT--------LAGEGLRITVTAPPEDGKANEAVRKLLARAMGVAPSR 73

Query: 64 LRMLSKQSSPLKIIYIDK 81
          L +   Q++  K      
Sbjct: 74 LTLRRGQTARDKTFVYLG 91


>gi|195491333|ref|XP_002093518.1| GE20708 [Drosophila yakuba]
 gi|194179619|gb|EDW93230.1| GE20708 [Drosophila yakuba]
          Length = 128

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 25/94 (26%), Positives = 44/94 (46%), Gaps = 10/94 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 40  IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94
           + +     S  KII I K     + I +LL+   
Sbjct: 93  VSLDKGSRSRNKIIMITKGVSTVEAIEQLLRKES 126


>gi|149177442|ref|ZP_01856046.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797]
 gi|148843775|gb|EDL58134.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797]
          Length = 104

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR  P + K+GI  +           +K+ VT  P+KGKANKA+L +L   L L +S 
Sbjct: 16 LPVRAQPRSSKNGIEGVH-------DGRLKVCVTQVPEKGKANKALLKVLQTALKLKRSQ 68

Query: 64 LRMLSKQSSPLKIIYI 79
          + +   +++ LKI  I
Sbjct: 69 IELYKGETAALKIFRI 84


>gi|159038970|ref|YP_001538223.1| hypothetical protein Sare_3430 [Salinispora arenicola CNS-205]
 gi|157917805|gb|ABV99232.1| protein of unknown function DUF167 [Salinispora arenicola CNS-205]
          Length = 107

 Score = 79.0 bits (194), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 3/87 (3%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR+ P + +  +              + + VTA P  G+AN+A    LA  L +  ++
Sbjct: 18  VAVRVKPGSSRIRVGG---RYVGPHGPALIVAVTAPPVDGRANEAARRALADALGVRSAA 74

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELL 90
           + + +  +   K   +D    +IT  L
Sbjct: 75  VSLEAGATGRNKTFRVDGVAADITRTL 101


>gi|77464376|ref|YP_353880.1| hypothetical protein RSP_0800 [Rhodobacter sphaeroides 2.4.1]
 gi|77388794|gb|ABA79979.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1]
          Length = 85

 Score = 78.6 bits (193), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 8/76 (10%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P A ++ +              +++ VT  P+ GKAN+A+   LAK L ++KS L
Sbjct: 18 AVRVTPRAARAKVE--------LQEGVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69

Query: 65 RMLSKQSSPLKIIYID 80
           ++   +S  K   +D
Sbjct: 70 TLVRGATSRDKTFRLD 85


>gi|296204187|ref|XP_002749223.1| PREDICTED: UPF0235 protein C15orf40-like [Callithrix jacchus]
          Length = 154

 Score = 78.6 bits (193), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 64  VTIAIHAKPGSKQNAVTDLTA-------EAINVAIAAPPSEGEANAELCRYLSKVLELRK 116

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEI 86
           S + +     S  K++ +      +EI
Sbjct: 117 SDVVLDKGCKSREKVVKLLASTTPEEI 143


>gi|195587403|ref|XP_002083454.1| GD13347 [Drosophila simulans]
 gi|194195463|gb|EDX09039.1| GD13347 [Drosophila simulans]
          Length = 128

 Score = 78.6 bits (193), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 10/94 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 40  IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94
           + +     S  KII I K     + I ++L+   
Sbjct: 93  VSLDKGSRSRNKIIMITKGASTVEAIEQMLRKES 126


>gi|209525443|ref|ZP_03273983.1| protein of unknown function DUF167 [Arthrospira maxima CS-328]
 gi|209494123|gb|EDZ94438.1| protein of unknown function DUF167 [Arthrospira maxima CS-328]
          Length = 73

 Score = 78.6 bits (193), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 7/80 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M    +++ PN+ +  I              + + + + P  GKAN+ ++ +LAKKL + 
Sbjct: 1  MAFFNIQVKPNSPQQLI-------RKEADGSLTVYLKSPPVDGKANQELIKLLAKKLDVP 53

Query: 61 KSSLRMLSKQSSPLKIIYID 80
          KS++++ S  SS  K++ I 
Sbjct: 54 KSNIKIKSGLSSRRKLVEIS 73


>gi|87311293|ref|ZP_01093415.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM
          3645]
 gi|87286033|gb|EAQ77945.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM
          3645]
          Length = 100

 Score = 78.6 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR +P +KK+ I              +K+ VTA P+ GKANKA++ +LAKKL L K
Sbjct: 11 VILPVRALPGSKKNEIRG-------EQQGALKVSVTAAPEDGKANKAIVELLAKKLVLRK 63

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S L +++  +   K + +
Sbjct: 64 SQLEIIAGHTHRQKRVLV 81


>gi|294916777|ref|XP_002778391.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886749|gb|EER10186.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 132

 Score = 78.6 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 20/95 (21%), Positives = 46/95 (48%), Gaps = 9/95 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAK-KLALS 60
             + +R  P AK S +  ++          + +++ A  + G+AN+ +L+ L+K  L + 
Sbjct: 41  ARIAIRAKPGAKVSCLTGIDA------EGALGVQLNAPARDGEANEELLSFLSKEVLGVK 94

Query: 61  KSSLRMLSKQSSPLKIIYIDK--DCKEITELLQNN 93
           K  + ++    S  K++ I      ++++ LL+N 
Sbjct: 95  KKDVALVQGSKSREKVVEIADVLTVEDVSRLLRNE 129


>gi|328717531|ref|XP_003246233.1| PREDICTED: UPF0235 protein C15orf40 homolog [Acyrthosiphon pisum]
          Length = 136

 Score = 78.6 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 22/95 (23%), Positives = 39/95 (41%), Gaps = 10/95 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P AK + I  +             +++ A P  G+AN  ++  L+K L L K
Sbjct: 45  VVIKINAKPGAKNNNITDISSDGIG-------VQINAPPTDGEANAELIKYLSKVLGLRK 97

Query: 62  SSLRMLSKQSSPLKIIYIDKDC---KEITELLQNN 93
           S L +     S  KI+ +       + ITE ++  
Sbjct: 98  SDLSLDRGSRSRNKILIVHNTSLGIEGITEKIKEE 132


>gi|270308462|ref|YP_003330520.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS]
 gi|270154354|gb|ACZ62192.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS]
          Length = 97

 Score = 78.6 bits (193), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 16/76 (21%), Positives = 41/76 (53%), Gaps = 7/76 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V +++ P+++++ ++  E          +K+++ A P+KGKANK ++  L++ L   K+
Sbjct: 9  RVNLKIFPSSQRNELSGYE-------NGLLKLRIAAQPEKGKANKELIDYLSELLDTPKA 61

Query: 63 SLRMLSKQSSPLKIIY 78
           + +    +   K++ 
Sbjct: 62 EIEICRGHTGRNKVLV 77


>gi|163783198|ref|ZP_02178192.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1]
 gi|159881532|gb|EDP75042.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1]
          Length = 73

 Score = 78.2 bits (192), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 21/79 (26%), Positives = 48/79 (60%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ P++K+ G+  +           ++++V+A P++GKAN+ ++ +LAK   + K
Sbjct: 1  MKLKVKVKPSSKREGVREVSP-------GELEVRVSAPPERGKANERLIELLAKHYGVRK 53

Query: 62 SSLRMLSKQSSPLKIIYID 80
           ++R+L  ++S  K++ ID
Sbjct: 54 GAVRILRGETSREKVVEID 72


>gi|328541881|ref|YP_004301990.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1]
 gi|326411632|gb|ADZ68695.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1]
          Length = 106

 Score = 78.2 bits (192), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VRL P A K  +       D      +  +V A P+KG AN A+ A++AK L + KS+
Sbjct: 17 VDVRLTPRAGKDAVEGCSELSDGRP--VVLARVRAIPEKGAANAALEALIAKALGVPKSA 74

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + + +   + LK + +  D
Sbjct: 75 VAIDAGAGARLKTLKVSGD 93


>gi|255022260|ref|ZP_05294254.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC
          51756]
 gi|254968316|gb|EET25884.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC
          51756]
          Length = 115

 Score = 78.2 bits (192), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 17/77 (22%), Positives = 34/77 (44%), Gaps = 7/77 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++ P A+   +              +KI++ A    G AN A+L+ LA++L L    
Sbjct: 15 LTVQVQPGARDDCVVGYH-------GDALKIRLRARAVDGAANAALLSFLARRLDLGPGQ 67

Query: 64 LRMLSKQSSPLKIIYID 80
          + +     S  K++ + 
Sbjct: 68 VVLRHGTHSRRKVLVLS 84


>gi|307152217|ref|YP_003887601.1| hypothetical protein Cyan7822_2348 [Cyanothece sp. PCC 7822]
 gi|306982445|gb|ADN14326.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7822]
          Length = 73

 Score = 78.2 bits (192), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ PNAK+  I  LE          + I + + P  GKAN+ ++ +LAKK  +SK
Sbjct: 1  MKIQVKVKPNAKQQKIEELE-------DGSLVISLKSPPVDGKANEELIKLLAKKYQVSK 53

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S + + S  SS  K+I I
Sbjct: 54 SQISIQSGLSSRNKLIEI 71


>gi|196231015|ref|ZP_03129875.1| protein of unknown function DUF167 [Chthoniobacter flavus
          Ellin428]
 gi|196224845|gb|EDY19355.1| protein of unknown function DUF167 [Chthoniobacter flavus
          Ellin428]
          Length = 93

 Score = 78.2 bits (192), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 20/88 (22%), Positives = 43/88 (48%), Gaps = 7/88 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +R++PNA++S +  +           +K+KV A    GKAN+A+   LA+ L +  
Sbjct: 5  AILRLRIVPNARRSEVVGVH-------GDAVKVKVQAPAMDGKANEALRDFLAEVLTVPA 57

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89
           ++ +++ + S  K++ I     +    
Sbjct: 58 RAVEIVAGEKSRDKVVAIADLETDEARR 85


>gi|145538297|ref|XP_001454854.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422631|emb|CAK87457.1| unnamed protein product [Paramecium tetraurelia]
          Length = 106

 Score = 78.2 bits (192), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 18/90 (20%), Positives = 42/90 (46%), Gaps = 7/90 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +++   PN+K S I  +           + I + A P+ G+AN  +   +A+ L + K++
Sbjct: 20  LVINAKPNSKVSQITGI-------SDEAVDINIAAPPKDGEANAELCDFVAQTLGVKKTA 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           +++        K++ I+   K+I +  +  
Sbjct: 73  IQVNKGGKGRNKLVSIESKFKDINDFFEKL 102


>gi|29839587|sp|Q8WUR7|CO040_HUMAN RecName: Full=UPF0235 protein C15orf40
 gi|18043732|gb|AAH19820.1| Chromosome 15 open reading frame 40 [Homo sapiens]
 gi|189053288|dbj|BAG35094.1| unnamed protein product [Homo sapiens]
          Length = 126

 Score = 77.9 bits (191), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 36  VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 88

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEI 86
           S + +     S  K++ +      +EI
Sbjct: 89  SDVVLDKGGKSREKVVKLLASTTPEEI 115


>gi|258597870|ref|XP_001348716.2| conserved protein, unknown function [Plasmodium falciparum 3D7]
 gi|255528895|gb|AAN37155.2| conserved protein, unknown function [Plasmodium falciparum 3D7]
          Length = 147

 Score = 77.9 bits (191), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 40/91 (43%), Gaps = 7/91 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ PNAKK+ I          D   + I +   P   ++N A++   +  L L K  
Sbjct: 61  ITLRVKPNAKKTSI------YFNQDKEVLNINIQEQPVNNQSNVAIIGYFSDILNLKKRD 114

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           + ++S   S  K++ +      ++   ++ N
Sbjct: 115 ISIVSGLKSRDKVLMVSNISLDDLNNKIEEN 145


>gi|225849498|ref|YP_002729663.1| hypothetical protein SULAZ_1705 [Sulfurihydrogenibium azorense
          Az-Fu1]
 gi|225644124|gb|ACN99174.1| conserved domain protein [Sulfurihydrogenibium azorense Az-Fu1]
          Length = 72

 Score = 77.9 bits (191), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 19/79 (24%), Positives = 42/79 (53%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ P + K+ +  +E           +++ T  P+KGKAN  ++ ++++ L + K
Sbjct: 1  MRIKVKVKPGSSKNEVKKIE-------ENFYEVRCTTIPEKGKANDKVIELMSEYLDVPK 53

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S ++++   SS  K I I+
Sbjct: 54 SRIKIIKGHSSREKEIEIE 72


>gi|15678665|ref|NP_275780.1| hypothetical protein MTH637 [Methanothermobacter
          thermautotrophicus str. Delta H]
 gi|29839449|sp|O26734|Y637_METTH RecName: Full=UPF0235 protein MTH_637
 gi|2621719|gb|AAB85143.1| conserved protein [Methanothermobacter thermautotrophicus str.
          Delta H]
          Length = 104

 Score = 77.9 bits (191), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V + + P + K GI     P        +++K+ + PQKGKAN+ ++   ++        
Sbjct: 16 VNIEVSPASGKFGI-----PSYNEWRKRIEVKIHSPPQKGKANREIIKEFSETFG---RD 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + ++S Q S  K I I    +++   L + 
Sbjct: 68 VEIVSGQKSRQKTIRIQGMGRDLFLKLVSE 97


>gi|149195332|ref|ZP_01872419.1| hypothetical protein CMTB2_08545 [Caminibacter mediatlanticus
          TB-2]
 gi|149134524|gb|EDM23013.1| hypothetical protein CMTB2_08545 [Caminibacter mediatlanticus
          TB-2]
          Length = 94

 Score = 77.9 bits (191), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 8/88 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V+  PN+ K+ IA L           +KI + A   +G ANK ++  L+K   +SK+ 
Sbjct: 12 LNVKAQPNSSKNKIAGLY------GEDAIKINIKAPAVEGAANKELIKFLSKMFKVSKND 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          + ++  ++S  K I I    +++ E ++
Sbjct: 66 I-IIKGETSKKKQI-IMPINEKVKEFIE 91


>gi|195337079|ref|XP_002035160.1| GM14073 [Drosophila sechellia]
 gi|194128253|gb|EDW50296.1| GM14073 [Drosophila sechellia]
          Length = 128

 Score = 77.9 bits (191), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 10/94 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 40  IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94
           + +     S  K I I K     + I ++L+   
Sbjct: 93  VSLDKGSRSRNKKIMITKGVSTVEAIEQMLRKES 126


>gi|242021057|ref|XP_002430963.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212516183|gb|EEB18225.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 117

 Score = 77.5 bits (190), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 41/93 (44%), Gaps = 10/93 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK + I  ++            +++ A P  G+AN  ++  ++  L L K+ 
Sbjct: 30  LQIFAKPGAKTNAITGIDEEGIG-------VQINARPVDGEANSELVNYMSCLLGLRKTE 82

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93
           + +     S  KI+ I K     +EI E L+N 
Sbjct: 83  ISLEKGSKSRQKILLISKKDLSTEEIIEKLKNE 115


>gi|20150521|pdb|1JRM|A Chain A, Nmr Structure Of Mth0637. Ontario Centre For Structural
          Proteomics Target Mth0637_1_104; Northeast Structural
          Genomics Target Tt135
          Length = 104

 Score = 77.5 bits (190), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V + + P + K GI     P        +++K+ + PQKGKAN+ ++   ++        
Sbjct: 16 VNIEVSPASGKFGI-----PSYNEWRKRIEVKIHSPPQKGKANREIIKEFSETFG---RD 67

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
          + ++S Q S  K I I    +++   L + 
Sbjct: 68 VEIVSGQKSRQKTIRIQGMGRDLFLKLVSE 97


>gi|66813694|ref|XP_641026.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4]
 gi|74855711|sp|Q54UW1|U235_DICDI RecName: Full=UPF0235 protein
 gi|60469052|gb|EAL67049.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4]
          Length = 124

 Score = 77.5 bits (190), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 7/88 (7%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  + V + PN+K+S I S E          + ++++  P  GKAN  ++  L+K+L + 
Sbjct: 31  IIKINVNVHPNSKESSIVSFEDQ-------ILSLRISEPPIDGKANIGVIEFLSKELNIR 83

Query: 61  KSSLRMLSKQSSPLKIIYIDKDCKEITE 88
           KS++ +     S  K + ID   + IT+
Sbjct: 84  KSNIEVGKGSKSRNKSVEIDISSENITK 111


>gi|296165029|ref|ZP_06847584.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
          ATCC BAA-614]
 gi|295899677|gb|EFG79128.1| conserved hypothetical protein [Mycobacterium parascrofulaceum
          ATCC BAA-614]
          Length = 77

 Score = 77.5 bits (190), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 21/78 (26%), Positives = 36/78 (46%), Gaps = 6/78 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P ++K  +       +      + I V      GKAN A+  +LA  L L +S 
Sbjct: 6  VAVRVKPGSRKGPLV------EEGPNGELTIYVRERAVDGKANDAVTGLLAAHLELPRSR 59

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + ++S ++S LK   +  
Sbjct: 60 VELISGRTSRLKRFRVSG 77


>gi|268567307|ref|XP_002639944.1| Hypothetical protein CBG10764 [Caenorhabditis briggsae]
          Length = 258

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 7/77 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AKKSG+ ++           + + + A P++G AN+ +++ L   L L K+ 
Sbjct: 35  LRIHAKPGAKKSGVVAIN-------ESEIDVAIGAAPREGAANEELVSYLMSALGLRKNE 87

Query: 64  LRMLSKQSSPLKIIYID 80
           L+      S  K++ I+
Sbjct: 88  LQFDKGAKSRSKVVLIE 104


>gi|158520306|ref|YP_001528176.1| hypothetical protein Dole_0289 [Desulfococcus oleovorans Hxd3]
 gi|226701656|sp|A8ZS81|Y289_DESOH RecName: Full=UPF0235 protein Dole_0289
 gi|158509132|gb|ABW66099.1| protein of unknown function DUF167 [Desulfococcus oleovorans
          Hxd3]
          Length = 103

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 7/76 (9%)

Query: 8  LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
          + P + K+ +              +KIK+TA P  G+AN+  +A L++ L + K+S+ + 
Sbjct: 17 VAPRSSKNMVVGAH-------DNALKIKITAPPVDGRANEMCVAFLSRLLGIPKTSITIA 69

Query: 68 SKQSSPLKIIYIDKDC 83
          +  +S  K + +    
Sbjct: 70 AGAASKRKEVCLALSP 85


>gi|22299537|ref|NP_682784.1| hypothetical protein tsr1994 [Thermosynechococcus elongatus BP-1]
 gi|29839707|sp|Q8DHG5|Y1994_THEEB RecName: Full=UPF0235 protein tsr1994
 gi|22295720|dbj|BAC09546.1| tsr1994 [Thermosynechococcus elongatus BP-1]
          Length = 74

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 7/79 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M    V + PNA+++ ++        +    + + V A    G+AN+ ++A+LA    + 
Sbjct: 1  MGKKHVIVKPNARQASVS-------ITPAGQLLVTVRAPASDGQANQELIALLAAYFGVP 53

Query: 61 KSSLRMLSKQSSPLKIIYI 79
          KS ++++   +S  K+I +
Sbjct: 54 KSRIQLVKGHTSRHKVIEL 72


>gi|309361190|emb|CAP30075.2| hypothetical protein CBG_10764 [Caenorhabditis briggsae AF16]
          Length = 266

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 7/77 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AKKSG+ ++           + + + A P++G AN+ +++ L   L L K+ 
Sbjct: 42  LRIHAKPGAKKSGVVAIN-------ESEIDVAIGAAPREGAANEELVSYLMSALGLRKNE 94

Query: 64  LRMLSKQSSPLKIIYID 80
           L+      S  K++ I+
Sbjct: 95  LQFDKGAKSRSKVVLIE 111


>gi|327540387|gb|EGF26973.1| protein containing DUF167 [Rhodopirellula baltica WH47]
          Length = 87

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 7/74 (9%)

Query: 7  RLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRM 66
          R+ P AKK+ +  L           +K+ V   P+ GKANKA++A LAK L +SK  + +
Sbjct: 2  RVTPKAKKASVGGLH-------DGALKVSVHMVPEDGKANKAVIASLAKWLRVSKGRVAI 54

Query: 67 LSKQSSPLKIIYID 80
          ++ ++S LK I ++
Sbjct: 55 VAGETSRLKTIVVE 68


>gi|73748976|ref|YP_308215.1| hypothetical protein cbdb_A1230 [Dehalococcoides sp. CBDB1]
 gi|147669743|ref|YP_001214561.1| hypothetical protein DehaBAV1_1103 [Dehalococcoides sp. BAV1]
 gi|289432972|ref|YP_003462845.1| hypothetical protein DehalGT_1029 [Dehalococcoides sp. GT]
 gi|123619917|sp|Q3ZYH5|Y1230_DEHSC RecName: Full=UPF0235 protein cbdbA1230
 gi|189038739|sp|A5FQ39|Y1103_DEHSB RecName: Full=UPF0235 protein DehaBAV1_1103
 gi|73660692|emb|CAI83299.1| conserved hypothetical protein [Dehalococcoides sp. CBDB1]
 gi|146270691|gb|ABQ17683.1| protein of unknown function DUF167 [Dehalococcoides sp. BAV1]
 gi|288946692|gb|ADC74389.1| protein of unknown function DUF167 [Dehalococcoides sp. GT]
          Length = 97

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 21/75 (28%), Positives = 41/75 (54%), Gaps = 7/75 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V +++IP+A+K+ +A  E          +K+K+ A P+KGKANK ++  L+  L   K+
Sbjct: 9  KVNLKIIPSARKNELAGYE-------NGLLKLKIAAQPEKGKANKELIDYLSDLLDTPKA 61

Query: 63 SLRMLSKQSSPLKII 77
           + +    +   K++
Sbjct: 62 EIEICHGHTGRNKVL 76


>gi|145543083|ref|XP_001457228.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425043|emb|CAK89831.1| unnamed protein product [Paramecium tetraurelia]
          Length = 106

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 42/90 (46%), Gaps = 7/90 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +++   PN+K S I  +           + + + A P+ G+AN  +   +A+ L + K++
Sbjct: 20  LVINAKPNSKVSQITGI-------SDEAVDVNIAAPPKDGEANAELCDFVAQTLGVKKTA 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           +++        K+I I+   K+I E  +  
Sbjct: 73  IQVQKGGKGRNKLIKIESKFKDINEFYEKL 102


>gi|145219218|ref|YP_001129927.1| hypothetical protein Cvib_0403 [Prosthecochloris vibrioformis DSM
          265]
 gi|189040268|sp|A4SD66|Y403_PROVI RecName: Full=UPF0235 protein Cvib_0403
 gi|145205382|gb|ABP36425.1| protein of unknown function DUF167 [Chlorobium phaeovibrioides
          DSM 265]
          Length = 100

 Score = 77.5 bits (190), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 7/84 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           VR+ P A K+ ++             +KI + A P    AN+    + A    ++   +
Sbjct: 16 SVRVQPRASKTAVSG-------PYAGGLKITLKAAPVDDAANRECCRLFAGMFGIADGRV 68

Query: 65 RMLSKQSSPLKIIYIDKDCKEITE 88
           ++S +SS  K + ++       E
Sbjct: 69 HVVSGRSSRSKSVMLEGVSSREAE 92


>gi|198282397|ref|YP_002218718.1| hypothetical protein Lferr_0253 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|198246918|gb|ACH82511.1| protein of unknown function DUF167 [Acidithiobacillus ferrooxidans
           ATCC 53993]
          Length = 113

 Score = 77.5 bits (190), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 8/88 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P+A +  I              +K+ VTA P+ G+A   M+  LA +  ++ S++ ++ 
Sbjct: 32  KPSAGRDAIG-------KPKGAQIKVSVTAEPRNGRATDHMVRFLAGEFGVAPSAIEVVF 84

Query: 69  KQSSPLKIIYIDKDCKEITELLQNNDSL 96
            + +  K + I      +  + Q   SL
Sbjct: 85  GRMNVNKQLRIKA-PTHLPPVFQVQASL 111


>gi|196001449|ref|XP_002110592.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens]
 gi|190586543|gb|EDV26596.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens]
          Length = 132

 Score = 77.1 bits (189), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 21/95 (22%), Positives = 45/95 (47%), Gaps = 11/95 (11%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + ++  P +K++ +  +             I++ A  ++G+AN  ++  L+  L + K
Sbjct: 41  VLITIKAKPGSKENAVTDISSDGIG-------IQIAAPAREGEANSELIKFLSSILKVKK 93

Query: 62  SSLRMLSKQSSPLKIIYIDKDC----KEITELLQN 92
           SS+ +     S  K I ++K+     K++ ELLQ 
Sbjct: 94  SSILLDKGSKSRHKTICVNKNADLTEKQVLELLQE 128


>gi|145253637|ref|XP_001398331.1| yggU family protein [Aspergillus niger CBS 513.88]
 gi|134083900|emb|CAK48804.1| unnamed protein product [Aspergillus niger]
          Length = 126

 Score = 77.1 bits (189), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 19/78 (24%), Positives = 35/78 (44%), Gaps = 5/78 (6%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           +   + PNA  +             T  + + V A P+KG+AN A+  +LA+   + KS
Sbjct: 27 QISCHVKPNASSNR-----EGITAIGTDRVDVCVAAVPRKGEANAAVSRVLAQIFQVPKS 81

Query: 63 SLRMLSKQSSPLKIIYID 80
          ++ ++    S  K + I 
Sbjct: 82 NVEVIRGLKSREKTLAIS 99


>gi|195376389|ref|XP_002046979.1| GJ12187 [Drosophila virilis]
 gi|194154137|gb|EDW69321.1| GJ12187 [Drosophila virilis]
          Length = 124

 Score = 77.1 bits (189), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 10/93 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 36  IKILAKPGAKQNGITDIGLDGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 88

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93
           + +     S  KI+ + K     + I +LL+  
Sbjct: 89  VSLDKGSRSRNKIVLVTKGASTVEAIEQLLRKE 121


>gi|156373856|ref|XP_001629526.1| predicted protein [Nematostella vectensis]
 gi|156216528|gb|EDO37463.1| predicted protein [Nematostella vectensis]
          Length = 133

 Score = 77.1 bits (189), Expect = 7e-13,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V +   P AK++ I  L             +++ A P++G+AN  ++  ++    + KSS
Sbjct: 44  VKIHAKPGAKQNRITELSPDFVG-------VQIAAQPKEGEANDELVRYMSSVFGVKKSS 96

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITEL 89
           + +     S  KII I      + + 
Sbjct: 97  VTLDKGAKSRDKIIRISSSGLTLKQA 122


>gi|318041795|ref|ZP_07973751.1| hypothetical protein SCB01_08799 [Synechococcus sp. CB0101]
          Length = 97

 Score = 77.1 bits (189), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 20/88 (22%), Positives = 41/88 (46%), Gaps = 8/88 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V +R+ P A +  +  L           + I + A P  G AN+A+L ++A++L +S ++
Sbjct: 12 VAIRVQPRASRERVLGLR-------GEAIAIALKAPPVDGAANEALLKLIARQLKVSAAA 64

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90
          + ++   S   K I +      ++   L
Sbjct: 65 VELVRGASGRSKWIRVAGWSADQVRAAL 92


>gi|296134781|ref|YP_003642023.1| protein of unknown function DUF167 [Thiomonas intermedia K12]
 gi|294338737|emb|CAZ87069.1| conserved hypothetical protein [Thiomonas sp. 3As]
 gi|295794903|gb|ADG29693.1| protein of unknown function DUF167 [Thiomonas intermedia K12]
          Length = 106

 Score = 77.1 bits (189), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 8/88 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P+A    I              +K+ VTA P+ G+A   M+  LAK+  +  S++ ++ 
Sbjct: 26  KPSAGVDAIG-------KPKGPQLKVSVTAAPRAGRATDHMVRFLAKEFGVPTSAIEVVF 78

Query: 69  KQSSPLKIIYIDKDCKEITELLQNNDSL 96
            + +  K + I    +++  + Q    L
Sbjct: 79  GRMNVNKQLRIHA-PQKLPAVFQQTSLL 105


>gi|120403830|ref|YP_953659.1| hypothetical protein Mvan_2846 [Mycobacterium vanbaalenii PYR-1]
 gi|166200356|sp|A1T903|Y2846_MYCVP RecName: Full=UPF0235 protein Mvan_2846
 gi|119956648|gb|ABM13653.1| protein of unknown function DUF167 [Mycobacterium vanbaalenii
          PYR-1]
          Length = 75

 Score = 77.1 bits (189), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 6/76 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P +KK  +       +T     + + V      GKAN A++ +LA+   + +S 
Sbjct: 5  VSVRVKPGSKKGPLV------ETGPDGELTVYVRERAVDGKANAAVIRVLAEHFGVPRSL 58

Query: 64 LRMLSKQSSPLKIIYI 79
          + +    SS +K   I
Sbjct: 59 VELTGGASSRIKRFRI 74


>gi|88602392|ref|YP_502570.1| hypothetical protein Mhun_1102 [Methanospirillum hungatei JF-1]
 gi|88187854|gb|ABD40851.1| conserved hypothetical protein [Methanospirillum hungatei JF-1]
          Length = 117

 Score = 77.1 bits (189), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 4/82 (4%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + +   +KKS    L             I V A P +GKANKA++ ++A    L   +
Sbjct: 32  ITIEVSAGSKKS----LFPDGYNPWRKAFGIAVKAPPVEGKANKAIMELIAGYFHLPVHA 87

Query: 64  LRMLSKQSSPLKIIYIDKDCKE 85
           + +LS Q+S +K + I    ++
Sbjct: 88  VTILSGQTSSVKKVRIHGISRQ 109


>gi|327289069|ref|XP_003229247.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Anolis
           carolinensis]
          Length = 119

 Score = 77.1 bits (189), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 21/94 (22%), Positives = 38/94 (40%), Gaps = 9/94 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V   P +K++ +  L             I + A P  G+AN  +   L+K L + K
Sbjct: 30  VTIAVHAKPGSKQNAVTDLSAEAVG-------IAIAAPPSDGEANAELCRYLSKVLEVRK 82

Query: 62  SSLRMLSKQSSPLKIIYIDKD--CKEITELLQNN 93
           SS  +     S  K++ I      +E+ + L+  
Sbjct: 83  SSSLLKQGGRSREKLVKILAPLTPEEVLQKLRKE 116


>gi|289547963|ref|YP_003472951.1| hypothetical protein Thal_0188 [Thermocrinis albus DSM 14484]
 gi|289181580|gb|ADC88824.1| protein of unknown function DUF167 [Thermocrinis albus DSM 14484]
          Length = 77

 Score = 77.1 bits (189), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + V+  P A K  +  L            ++ V   P+ GKAN+ +L +L+K L + 
Sbjct: 3  MIVIHVKAKPKASKEYVKELSP-------NFYEVAVKEPPEDGKANERILELLSKHLKVP 55

Query: 61 KSSLRMLSKQSSPLKIIYID 80
          KS +++L   SS +K+  I 
Sbjct: 56 KSRIKLLRGTSSRIKVFCIS 75


>gi|195127445|ref|XP_002008179.1| GI11963 [Drosophila mojavensis]
 gi|193919788|gb|EDW18655.1| GI11963 [Drosophila mojavensis]
          Length = 125

 Score = 76.7 bits (188), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 10/93 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 37  IKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 89

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93
           + +     S  KII + K     + I + L+  
Sbjct: 90  VSLDKGSRSRNKIILVCKGVTTVEAIEQSLRKE 122


>gi|51474006|ref|YP_067763.1| hypothetical protein RT0827 [Rickettsia typhi str. Wilmington]
 gi|81390320|sp|Q68Y09|Y827_RICTY RecName: Full=UPF0235 protein RT0827
 gi|51460318|gb|AAU04281.1| conserved hypothetical protein [Rickettsia typhi str. Wilmington]
          Length = 105

 Score = 76.7 bits (188), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 22/89 (24%), Positives = 49/89 (55%), Gaps = 3/89 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++ P +K+     +      ++  ++K+ +TA P++GKAN+ ++  LAK+  LS+
Sbjct: 14 ALINIKVKPYSKQ---NLINNFVIINNIPYIKLSITAAPEQGKANEGIINYLAKEWKLSR 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          SS+ ++   +  LK I I    ++   L+
Sbjct: 71 SSIEIIKGHTHSLKTILIKNINEDYLNLI 99


>gi|262197893|ref|YP_003269102.1| hypothetical protein Hoch_4719 [Haliangium ochraceum DSM 14365]
 gi|262081240|gb|ACY17209.1| protein of unknown function DUF167 [Haliangium ochraceum DSM
          14365]
          Length = 101

 Score = 76.7 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 7/80 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P A  + +  +           +K+ VTA P  G AN A+LA+LAK L   +
Sbjct: 9  VLLDILVRPRAAHARVGPVH-------GERLKVAVTAPPADGAANDAVLALLAKHLGRPR 61

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
            L ++  QSS  K +++  
Sbjct: 62 RDLCLVGGQSSRRKTVHVAG 81


>gi|171679393|ref|XP_001904643.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939322|emb|CAP64550.1| unnamed protein product [Podospora anserina S mat+]
          Length = 119

 Score = 76.7 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 5/75 (6%)

Query: 8  LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
          + P A K+          +     ++I V A  ++G+ANKA++ +L++ L L KS+L + 
Sbjct: 26 VKPGASKNR-----EGVASVGEDAVEICVAAQAREGEANKAVIKVLSEVLDLPKSNLEIT 80

Query: 68 SKQSSPLKIIYIDKD 82
              S  K + +   
Sbjct: 81 QGHKSRNKTVAVIGP 95


>gi|291412810|ref|XP_002722671.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 154

 Score = 76.7 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 34/96 (35%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P AK++ +  L                   P +G+AN  +   L+K L L K
Sbjct: 64  VTIAIHAKPGAKQNAVTDLTAEAVIVAIAA-------PPSEGEANAELCRYLSKVLELRK 116

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +       EI   L+    
Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPDEILGKLKQEAE 152


>gi|126465433|ref|YP_001040542.1| hypothetical protein Smar_0527 [Staphylothermus marinus F1]
 gi|126014256|gb|ABN69634.1| protein of unknown function DUF167 [Staphylothermus marinus F1]
          Length = 110

 Score = 76.7 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 21/95 (22%), Positives = 43/95 (45%), Gaps = 9/95 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + + + PN+ +  +          +   +    T  P+KG+AN A++  L+KK+ L  
Sbjct: 22  VIIPIYVKPNSDRDALV--------LEGDELVFYTTEIPEKGRANAALIRFLSKKVGLPH 73

Query: 62  SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNNDS 95
           S + ++    +  K I + D D +++ E L    S
Sbjct: 74  SKIDIIYGARTRSKKILVRDIDTEKLAEKLTEIIS 108


>gi|15835282|ref|NP_297041.1| hypothetical protein TC0667 [Chlamydia muridarum Nigg]
 gi|270285456|ref|ZP_06194850.1| hypothetical protein CmurN_03383 [Chlamydia muridarum Nigg]
 gi|270289467|ref|ZP_06195769.1| hypothetical protein CmurW_03473 [Chlamydia muridarum Weiss]
 gi|301336853|ref|ZP_07225055.1| hypothetical protein CmurM_03440 [Chlamydia muridarum MopnTet14]
 gi|29839670|sp|Q9PK06|Y667_CHLMU RecName: Full=UPF0235 protein TC_0667
 gi|7190702|gb|AAF39489.1| conserved hypothetical protein [Chlamydia muridarum Nigg]
          Length = 100

 Score = 76.7 bits (188), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 21/78 (26%), Positives = 49/78 (62%), Gaps = 7/78 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +R+   A+++ + SLE          ++++VT  P++GKAN A++A+LAK L++ K+ 
Sbjct: 8  LEIRVTTKARENKVVSLE-------DGILRVRVTEAPERGKANDAVVALLAKFLSIPKND 60

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + +++ ++S  K + + +
Sbjct: 61 VTLIAGEASRRKKVLLPR 78


>gi|15841462|ref|NP_336499.1| hypothetical protein MT2038 [Mycobacterium tuberculosis CDC1551]
 gi|148661795|ref|YP_001283318.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra]
 gi|167970491|ref|ZP_02552768.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra]
 gi|215404212|ref|ZP_03416393.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987]
 gi|215411676|ref|ZP_03420472.1| PE-PGRS family protein [Mycobacterium tuberculosis 94_M4241A]
 gi|215427341|ref|ZP_03425260.1| PE-PGRS family protein [Mycobacterium tuberculosis T92]
 gi|215430902|ref|ZP_03428821.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054]
 gi|215446194|ref|ZP_03432946.1| PE-PGRS family protein [Mycobacterium tuberculosis T85]
 gi|218753699|ref|ZP_03532495.1| PE-PGRS family protein [Mycobacterium tuberculosis GM 1503]
 gi|218753702|ref|ZP_03532498.1| PE-PGRS family protein [Mycobacterium tuberculosis GM 1503]
 gi|219557946|ref|ZP_03537022.1| PE-PGRS family protein [Mycobacterium tuberculosis T17]
 gi|253798966|ref|YP_003031967.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 1435]
 gi|254232156|ref|ZP_04925483.1| hypothetical protein TBCG_01934 [Mycobacterium tuberculosis C]
 gi|254364802|ref|ZP_04980848.1| hypothetical protein TBHG_01941 [Mycobacterium tuberculosis str.
          Haarlem]
 gi|254551007|ref|ZP_05141454.1| PE-PGRS family protein [Mycobacterium tuberculosis '98-R604
          INH-RIF-EM']
 gi|260186958|ref|ZP_05764432.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A]
 gi|260201085|ref|ZP_05768576.1| PE-PGRS family protein [Mycobacterium tuberculosis T46]
 gi|260205263|ref|ZP_05772754.1| PE-PGRS family protein [Mycobacterium tuberculosis K85]
 gi|289443475|ref|ZP_06433219.1| PE-PGRS family protein [Mycobacterium tuberculosis T46]
 gi|289447602|ref|ZP_06437346.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A]
 gi|289570082|ref|ZP_06450309.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
 gi|289574658|ref|ZP_06454885.1| PE-PGRS family protein [Mycobacterium tuberculosis K85]
 gi|289750564|ref|ZP_06509942.1| PE-PGRS family protein [Mycobacterium tuberculosis T92]
 gi|289754087|ref|ZP_06513465.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054]
 gi|289758097|ref|ZP_06517475.1| PE-PGRS family protein [Mycobacterium tuberculosis T85]
 gi|294996916|ref|ZP_06802607.1| PE-PGRS family protein [Mycobacterium tuberculosis 210]
 gi|297634555|ref|ZP_06952335.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 4207]
 gi|297731543|ref|ZP_06960661.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN R506]
 gi|306776218|ref|ZP_07414555.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu001]
 gi|306779999|ref|ZP_07418336.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu002]
 gi|306784749|ref|ZP_07423071.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu003]
 gi|306789106|ref|ZP_07427428.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu004]
 gi|306793440|ref|ZP_07431742.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu005]
 gi|306797824|ref|ZP_07436126.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu006]
 gi|306803704|ref|ZP_07440372.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu008]
 gi|306808278|ref|ZP_07444946.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu007]
 gi|306968102|ref|ZP_07480763.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu009]
 gi|306972327|ref|ZP_07484988.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu010]
 gi|307080037|ref|ZP_07489207.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu011]
 gi|313658876|ref|ZP_07815756.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN V2475]
 gi|166227505|sp|A5U406|Y1997_MYCTA RecName: Full=UPF0235 protein MRA_1997
 gi|13881702|gb|AAK46313.1| conserved hypothetical protein [Mycobacterium tuberculosis
          CDC1551]
 gi|124601215|gb|EAY60225.1| hypothetical protein TBCG_01934 [Mycobacterium tuberculosis C]
 gi|134150316|gb|EBA42361.1| hypothetical protein TBHG_01941 [Mycobacterium tuberculosis str.
          Haarlem]
 gi|148505947|gb|ABQ73756.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra]
 gi|253320469|gb|ACT25072.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 1435]
 gi|289416394|gb|EFD13634.1| PE-PGRS family protein [Mycobacterium tuberculosis T46]
 gi|289420560|gb|EFD17761.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A]
 gi|289539089|gb|EFD43667.1| PE-PGRS family protein [Mycobacterium tuberculosis K85]
 gi|289543836|gb|EFD47484.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
 gi|289691151|gb|EFD58580.1| PE-PGRS family protein [Mycobacterium tuberculosis T92]
 gi|289694674|gb|EFD62103.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054]
 gi|289713661|gb|EFD77673.1| PE-PGRS family protein [Mycobacterium tuberculosis T85]
 gi|308215330|gb|EFO74729.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu001]
 gi|308327103|gb|EFP15954.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu002]
 gi|308330482|gb|EFP19333.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu003]
 gi|308334316|gb|EFP23167.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu004]
 gi|308338117|gb|EFP26968.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu005]
 gi|308341810|gb|EFP30661.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu006]
 gi|308345297|gb|EFP34148.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu007]
 gi|308349599|gb|EFP38450.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu008]
 gi|308354227|gb|EFP43078.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu009]
 gi|308358205|gb|EFP47056.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu010]
 gi|308362136|gb|EFP50987.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu011]
 gi|323719473|gb|EGB28600.1| hypothetical protein TMMG_01247 [Mycobacterium tuberculosis
          CDC1551A]
 gi|326903595|gb|EGE50528.1| hypothetical protein TBPG_01471 [Mycobacterium tuberculosis
          W-148]
 gi|328458721|gb|AEB04144.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 4207]
          Length = 76

 Score = 76.3 bits (187), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 6/79 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
          +V+VR+ P + K  +  +           + I V      GKAN A+  +LA  L L KS
Sbjct: 4  SVVVRVKPGSHKGPLVEV------GPNGELIIYVREPAIDGKANDAVTRLLAAHLQLPKS 57

Query: 63 SLRMLSKQSSPLKIIYIDK 81
           ++++S  +S  K   + +
Sbjct: 58 RVKLVSGATSRFKRFRLSR 76


>gi|154244464|ref|YP_001415422.1| hypothetical protein Xaut_0507 [Xanthobacter autotrophicus Py2]
 gi|154158549|gb|ABS65765.1| protein of unknown function DUF167 [Xanthobacter autotrophicus
          Py2]
          Length = 110

 Score = 76.3 bits (187), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 2/84 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V VR  P   +  +  +    D      +KI+V   P+ G A  A+  +LA+   ++ 
Sbjct: 11 VEVTVRATPRGGRDALDGVAELSDG--RAVLKIRVKVAPEDGAATAAVARVLAQAAGVAA 68

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
          S +R+ S  ++ +K   I  D  +
Sbjct: 69 SQVRLASGATARVKTFRIAGDAAK 92


>gi|14520717|ref|NP_126192.1| hypothetical protein PAB7122 [Pyrococcus abyssi GE5]
 gi|29839688|sp|Q9V1C6|Y501_PYRAB RecName: Full=UPF0235 protein PYRAB05010
 gi|5457933|emb|CAB49423.1| Hypothetical protein [Pyrococcus abyssi GE5]
          Length = 92

 Score = 76.3 bits (187), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 44/91 (48%), Gaps = 9/91 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + PNA+++ I  ++          +K+ + A P KGKAN+ ++  L+       
Sbjct: 9  VILRVIVKPNARENSIEGID-----EWRGRIKVNIKAQPVKGKANRELIKFLSNLFG--- 60

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
          + + +L  ++S  K + +   + +E+   L+
Sbjct: 61 AEVEILKGETSREKDVLVRGVNLEEVKRRLK 91


>gi|145595719|ref|YP_001160016.1| hypothetical protein Strop_3204 [Salinispora tropica CNB-440]
 gi|145305056|gb|ABP55638.1| protein of unknown function DUF167 [Salinispora tropica CNB-440]
          Length = 106

 Score = 76.3 bits (187), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 23/87 (26%), Positives = 40/87 (45%), Gaps = 3/87 (3%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VR+ P + +S +              + I VTA P  G+A +A    LA+   + +++
Sbjct: 17  VAVRIKPGSSRSRVGG---RYMGPYGPALVIAVTAPPVDGRATEAARRALAEAFGVRRAA 73

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELL 90
           + + +  +S  KI Y+     EIT  L
Sbjct: 74  VSLGAGAASRNKIFYVGGSGVEITRTL 100


>gi|195014316|ref|XP_001984001.1| GH15254 [Drosophila grimshawi]
 gi|193897483|gb|EDV96349.1| GH15254 [Drosophila grimshawi]
          Length = 126

 Score = 76.3 bits (187), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 10/96 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++GI  + +           +++ A P +G+AN  ++  L+K L L KS 
Sbjct: 38  IKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 90

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNNDSL 96
           + +     S  K+I I K     + I +LL+    L
Sbjct: 91  VSLDKGSRSKNKLILITKGVSTVEAIEQLLRKESEL 126


>gi|222055825|ref|YP_002538187.1| protein of unknown function DUF167 [Geobacter sp. FRC-32]
 gi|221565114|gb|ACM21086.1| protein of unknown function DUF167 [Geobacter sp. FRC-32]
          Length = 99

 Score = 76.3 bits (187), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 20/86 (23%), Positives = 41/86 (47%), Gaps = 8/86 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           PNAK+  I  +           + + VTA P+ G+A   M+  LA++  +S S ++++S
Sbjct: 20 TPNAKRDAIGKV-------KGHQLCVSVTAVPRAGRATDHMVRFLAEEFGVSVSDIQVVS 72

Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94
           + +  K + I    K +  ++   +
Sbjct: 73 GRMNVNKQLRIKA-PKRLPSVIGQQE 97


>gi|312375313|gb|EFR22710.1| hypothetical protein AND_14314 [Anopheles darlingi]
          Length = 155

 Score = 75.9 bits (186), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 25/114 (21%), Positives = 42/114 (36%), Gaps = 22/114 (19%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKI-----------------KVTATPQKGKAN 46
           V +   P AK SGI  +             +                  + A P  G+AN
Sbjct: 39  VKILAKPGAKTSGITDVSEEGVGCQIGMSLVDGSTLTLFDQTIQLHLSPLAAPPIDGEAN 98

Query: 47  KAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDC-----KEITELLQNNDS 95
             ++  L+K L L KS + +     S  K I +DKD      +++  + ++  S
Sbjct: 99  TELIRYLSKLLELRKSDISLDRGSKSRQKTIVLDKDGCRHTREQLLTIFRSEAS 152


>gi|310814754|ref|YP_003962718.1| hypothetical protein EIO_0232 [Ketogulonicigenium vulgare Y25]
 gi|308753489|gb|ADO41418.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25]
          Length = 82

 Score = 75.9 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 40/76 (52%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P A ++ I          ++  +++ VT  P+ GKAN A+  +LAK L + KS 
Sbjct: 13 IAVRVTPKASRARIL-------RDESGVLRVYVTVVPEDGKANAAVTELLAKALRIPKSK 65

Query: 64 LRMLSKQSSPLKIIYI 79
          L + S  ++  K+  +
Sbjct: 66 LILKSGATARDKVFRL 81


>gi|253701118|ref|YP_003022307.1| hypothetical protein GM21_2508 [Geobacter sp. M21]
 gi|251775968|gb|ACT18549.1| protein of unknown function DUF167 [Geobacter sp. M21]
          Length = 101

 Score = 75.9 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 8/88 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            PNAK+  I              + I VTA P+ G+A   M+  LA++  +S S ++++ 
Sbjct: 21  TPNAKRDAIG-------KPKGHQLCISVTAVPRAGRATDHMVRFLAEEFEVSVSDIQVVF 73

Query: 69  KQSSPLKIIYIDKDCKEITELLQNNDSL 96
            + +  K + I    K +  ++   D L
Sbjct: 74  GRMNVNKQLRIKA-PKRLPSVIGQQDLL 100


>gi|146324075|ref|XP_001481499.1| DUF167 domain protein [Aspergillus fumigatus Af293]
 gi|129558081|gb|EBA27446.1| DUF167 domain protein [Aspergillus fumigatus Af293]
 gi|159126298|gb|EDP51414.1| YggU family protein [Aspergillus fumigatus A1163]
          Length = 133

 Score = 75.9 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 5/79 (6%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            +   + PNA  S                + + V A P+ G+AN A+  + A+   + KS
Sbjct: 33  QIACHVKPNASSSR-----EGIIAVGAEKVDVCVAAVPRNGEANTAVSRVFAQIFDVPKS 87

Query: 63  SLRMLSKQSSPLKIIYIDK 81
           ++ ++    S  K + I  
Sbjct: 88  NVEVIRGLKSRDKTLCITN 106


>gi|221059313|ref|XP_002260302.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193810375|emb|CAQ41569.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 176

 Score = 75.9 bits (186), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ PNAK + I         SD   + I +   P   ++N A++   +  L L K  
Sbjct: 90  INLRVKPNAKNTSI------YFNSDREVLNINIQEQPINNQSNIAIIGYFSDILDLKKRD 143

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           + ++S   S  K++ +      ++   +  N
Sbjct: 144 ISIVSGLKSRDKVLMVSNISLDDLNSKIAEN 174


>gi|7508740|pir||T26031 hypothetical protein W01A8.2 - Caenorhabditis elegans
          Length = 263

 Score = 75.5 bits (185), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 7/77 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AKKS + ++           + + + A P++G AN+ +++ L   L L K+ 
Sbjct: 39  LHIHAKPGAKKSCVVAI-------GDSEVDVAIGAAPREGAANEELISYLMSALGLRKNE 91

Query: 64  LRMLSKQSSPLKIIYID 80
           L+      S  K++ ID
Sbjct: 92  LQFDKGAKSRSKVVLID 108


>gi|222055811|ref|YP_002538173.1| protein of unknown function DUF167 [Geobacter sp. FRC-32]
 gi|221565100|gb|ACM21072.1| protein of unknown function DUF167 [Geobacter sp. FRC-32]
          Length = 98

 Score = 75.5 bits (185), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 8/86 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           PNAK+  I  +           + + VTA P+ G+A   M+  LA++  +S S ++++ 
Sbjct: 19 TPNAKRDAIGKV-------KGHQLCVSVTAVPRAGRATDHMVRFLAEEFGVSVSDIQVVF 71

Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94
           + +  K + I    K +  ++   +
Sbjct: 72 GRMNVNKQLRIKA-PKRLPLVIGQQE 96


>gi|119357873|ref|YP_912517.1| hypothetical protein Cpha266_2081 [Chlorobium phaeobacteroides
          DSM 266]
 gi|187479907|sp|A1BI66|Y2081_CHLPD RecName: Full=UPF0235 protein Cpha266_2081
 gi|119355222|gb|ABL66093.1| protein of unknown function DUF167 [Chlorobium phaeobacteroides
          DSM 266]
          Length = 101

 Score = 75.5 bits (185), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 7/86 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           ++  P + KS I+        +    +K+ + A P    AN+    + AK L++S S L
Sbjct: 18 RLKAQPRSSKSAISG-------AYNGGVKVNLKAAPVDDAANRECCDLFAKVLSVSSSRL 70

Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90
           +LS +SS  K I ++    E   LL
Sbjct: 71 TILSGKSSKNKTIKVEGLGAEEVALL 96


>gi|156100181|ref|XP_001615818.1| hypothetical protein [Plasmodium vivax SaI-1]
 gi|148804692|gb|EDL46091.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 178

 Score = 75.5 bits (185), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 7/91 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ PNAK + I         SD   + I +   P   ++N A++   +  L L K  
Sbjct: 92  INLRVKPNAKNTSI------YFNSDREVLNINIQEQPINNQSNVAIIGYFSDILDLKKRD 145

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           + ++S   S  K++ +      ++   +  N
Sbjct: 146 ISIVSGLKSRDKVLMVSNISLDDLNSKIAEN 176


>gi|148264159|ref|YP_001230865.1| hypothetical protein Gura_2104 [Geobacter uraniireducens Rf4]
 gi|146397659|gb|ABQ26292.1| protein of unknown function DUF167 [Geobacter uraniireducens Rf4]
          Length = 99

 Score = 75.5 bits (185), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 39/86 (45%), Gaps = 8/86 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           PNAK+  I  +           + + VTA P+ G+A   M+  LA +  +S S ++++ 
Sbjct: 20 TPNAKRDAIGKV-------KGHQLCVSVTAVPRAGRATDHMVRFLADEFGVSVSDIQVVF 72

Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94
           + +  K + I    K +  ++   +
Sbjct: 73 GRMNVNKQLRIKA-PKRLPSVIGQQE 97


>gi|300175116|emb|CBK20427.2| unnamed protein product [Blastocystis hominis]
          Length = 152

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 10/88 (11%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            PN KK+GI          +   ++I++++ P + KANK +  +++    + KS + ++ 
Sbjct: 29  KPNCKKTGIE--------WEEEQLQIRLSSPPTENKANKEVCEVVSDIADIPKSQVSLIR 80

Query: 69  KQSSPLKIIYIDKDCKE--ITELLQNND 94
              S  K + ++    E  I  LLQ   
Sbjct: 81  GGKSRDKELMLNGVSSEGLIQSLLQALQ 108


>gi|218437671|ref|YP_002376000.1| hypothetical protein PCC7424_0673 [Cyanothece sp. PCC 7424]
 gi|226708012|sp|B7KEV7|Y673_CYAP7 RecName: Full=UPF0235 protein PCC7424_0673
 gi|218170399|gb|ACK69132.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7424]
          Length = 73

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ PNAK   I       + ++   + I + + P +GKAN+ ++ +LA+K  ++K
Sbjct: 1  MKIQVKVKPNAKHQKI-------EEAEDGSLIISLKSPPVEGKANQELIKLLAQKYRVTK 53

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S + + S  SS  K+I I
Sbjct: 54 SQISIQSGLSSRNKLIEI 71


>gi|188996205|ref|YP_001930456.1| protein of unknown function DUF167 [Sulfurihydrogenibium sp.
          YO3AOP1]
 gi|259646874|sp|B2V7H9|Y257_SULSY RecName: Full=UPF0235 protein SYO3AOP1_0257
 gi|188931272|gb|ACD65902.1| protein of unknown function DUF167 [Sulfurihydrogenibium sp.
          YO3AOP1]
          Length = 73

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 16/78 (20%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ P   K+ +  ++           +++ T  P+KGKAN+ ++ +L+    + K
Sbjct: 1  MRIKVKVKPGTSKNEVKKID-------ENLYEVRTTTIPEKGKANEKVVELLSDFFDVPK 53

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S ++++  Q+S  K + +
Sbjct: 54 SKIKIVKGQTSREKEVEV 71


>gi|57235102|ref|YP_182004.1| hypothetical protein DET1292 [Dehalococcoides ethenogenes 195]
 gi|123618390|sp|Q3Z6Z5|Y1292_DEHE1 RecName: Full=UPF0235 protein DET1292
 gi|57225550|gb|AAW40607.1| conserved hypothetical protein [Dehalococcoides ethenogenes 195]
          Length = 97

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 7/75 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V ++++P+A+++ +   E          +KIK+ A P+KGKANKA++  L++ L   KS
Sbjct: 9  RVNLKILPSAQRNELTGYE-------NGLLKIKIAAQPEKGKANKALVDYLSELLDTPKS 61

Query: 63 SLRMLSKQSSPLKII 77
           + +    S   K++
Sbjct: 62 EIEICRGLSGRNKVV 76


>gi|99080537|ref|YP_612691.1| hypothetical protein TM1040_0696 [Ruegeria sp. TM1040]
 gi|99036817|gb|ABF63429.1| protein of unknown function DUF167 [Ruegeria sp. TM1040]
          Length = 92

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 21/81 (25%), Positives = 39/81 (48%), Gaps = 8/81 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A ++ +              +K+ VT  P+ GKAN  ++A+L++ L +S 
Sbjct: 20 AQIAVRVTPKAAQNAVL--------RKRDEIKVLVTTVPEGGKANADVVALLSRALGVSP 71

Query: 62 SSLRMLSKQSSPLKIIYIDKD 82
          S L +L   +S  K+  +   
Sbjct: 72 SRLTLLRGATSRDKVFLVTAP 92


>gi|307353453|ref|YP_003894504.1| hypothetical protein Mpet_1306 [Methanoplanus petrolearius DSM
          11571]
 gi|307156686|gb|ADN36066.1| protein of unknown function DUF167 [Methanoplanus petrolearius
          DSM 11571]
          Length = 105

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 5/90 (5%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P +KK+    +           ++ ++ +   +GKAN  ++  +A   ++ K
Sbjct: 14 VQISLDVSPGSKKT----VFPAGYNEWRNAIECRIKSPATEGKANAEIIKTIADYFSVKK 69

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          S + ++S   S  K I I     ++  E L
Sbjct: 70 SDVVIVSGAISGQKKIKISGVSLEDALERL 99


>gi|147921111|ref|YP_685078.1| hypothetical protein RCIX287 [uncultured methanogenic archaeon
          RC-I]
 gi|56295557|emb|CAH04800.1| conserved hypothetical protein [uncultured archaeon]
 gi|110620474|emb|CAJ35752.1| conserved hypothetical protein [uncultured methanogenic archaeon
          RC-I]
          Length = 104

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 15/86 (17%), Positives = 39/86 (45%), Gaps = 4/86 (4%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +   + P AK +    +     +     +++K+ A P++G+AN+ ++  L+    L  
Sbjct: 14 VVIDFEVTPGAKST----VVPSGYSVWRKRIEVKLKAPPERGRANEELIEALSDLFHLPA 69

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEIT 87
          SS+ + +  ++  K I +     ++ 
Sbjct: 70 SSIEITAGATNSRKSIKVHGISTDVV 95


>gi|38505546|ref|NP_942167.1| hypothetical protein ssr5011 [Synechocystis sp. PCC 6803]
 gi|38423570|dbj|BAD01781.1| ssr5011 [Synechocystis sp. PCC 6803]
          Length = 73

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 24/76 (31%), Positives = 42/76 (55%), Gaps = 7/76 (9%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V++ PNAK+S +          D   + I V + P  GKAN+ ++ +LAK+  +S+ S+
Sbjct: 4  QVKVKPNAKQSKVV-------YGDDGSLIIHVKSPPVDGKANQELIKLLAKEFNVSQQSI 56

Query: 65 RMLSKQSSPLKIIYID 80
          ++ S   S  KI+ I+
Sbjct: 57 KIKSGAGSRQKIVEIN 72


>gi|189183832|ref|YP_001937617.1| hypothetical protein OTT_0925 [Orientia tsutsugamushi str. Ikeda]
 gi|189180603|dbj|BAG40383.1| hypothetical protein OTT_0925 [Orientia tsutsugamushi str. Ikeda]
          Length = 112

 Score = 75.2 bits (184), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 47/90 (52%), Gaps = 3/90 (3%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +++   AK + I  L    + S    + I +   P+  KAN+A++  L++ L + +S+
Sbjct: 20  INLKVKAGAKINKIIGLYYINNKS---FLYISINTIPENNKANQAIIKFLSQWLEVCRSN 76

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
           ++++    S LK+I +      I+ L+++ 
Sbjct: 77  IKIVYGLHSNLKVISVINTNANISNLIRSK 106


>gi|206889548|ref|YP_002248254.1| hypothetical protein THEYE_A0407 [Thermodesulfovibrio
          yellowstonii DSM 11347]
 gi|206741486|gb|ACI20543.1| conserved hypothetical protein [Thermodesulfovibrio yellowstonii
          DSM 11347]
          Length = 87

 Score = 74.8 bits (183), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V +   AK +GI  +E          +K+++ A P  G ANK ++ ML++ L + KS 
Sbjct: 14 LKVLVKTGAKITGIGGIE-------GNTLKLRLAAQPHDGLANKELIEMLSEILNIPKSR 66

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + ++  ++S  KII +  +
Sbjct: 67 IEIIKGKTSKHKIIKLKGE 85


>gi|289746071|ref|ZP_06505449.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987]
 gi|308403079|ref|ZP_07493725.2| PE-PGRS family protein [Mycobacterium tuberculosis SUMu012]
 gi|289686599|gb|EFD54087.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987]
 gi|308365791|gb|EFP54642.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu012]
          Length = 72

 Score = 74.8 bits (183), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 6/76 (7%)

Query: 6  VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
          VR+ P + K  +  +           + I V      GKAN A+  +LA  L L KS ++
Sbjct: 3  VRVKPGSHKGPLVEV------GPNGELIIYVREPAIDGKANDAVTRLLAAHLQLPKSRVK 56

Query: 66 MLSKQSSPLKIIYIDK 81
          ++S  +S  K   + +
Sbjct: 57 LVSGATSRFKRFRLSR 72


>gi|254466397|ref|ZP_05079808.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I]
 gi|206687305|gb|EDZ47787.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I]
          Length = 105

 Score = 74.8 bits (183), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 8/82 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VR  P A ++ I         +    +KI VTA P+ GKAN+A+  +LA  +  + 
Sbjct: 32  AEIAVRATPKAARNAIV--------AAEGVLKISVTAVPENGKANEAIRRLLAAAMGTAA 83

Query: 62  SSLRMLSKQSSPLKIIYIDKDC 83
           S L +    +S  K+       
Sbjct: 84  SRLELRRGAASRDKLFVYLGPA 105


>gi|118469743|ref|YP_888136.1| hypothetical protein MSMEG_3845 [Mycobacterium smegmatis str. MC2
          155]
 gi|226706061|sp|A0QYZ8|Y3845_MYCS2 RecName: Full=UPF0235 protein MSMEG_3845
 gi|118171030|gb|ABK71926.1| conserved domain protein [Mycobacterium smegmatis str. MC2 155]
          Length = 75

 Score = 74.8 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 6/77 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V+VR+ P ++K  +       +T+D   + I V      GKAN A+  +LA  L +  S 
Sbjct: 5  VVVRVKPGSRKGPLV------ETADDGTLTIYVQERAVDGKANAAVTKLLAAHLGVPPSR 58

Query: 64 LRMLSKQSSPLKIIYID 80
          + + S  ++ LK   I 
Sbjct: 59 VELASGATARLKRFRIS 75


>gi|313223261|emb|CBY43439.1| unnamed protein product [Oikopleura dioica]
          Length = 180

 Score = 74.8 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 28/93 (30%), Positives = 49/93 (52%), Gaps = 2/93 (2%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDT-SDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
             +   + P +K   I+  +   DT S    + ++V+A P KGKANKA+L  LA KL + 
Sbjct: 79  VLLNCHVTPKSKTPSISVDDSINDTLSVVHVVNVRVSAPPDKGKANKAVLKSLADKLGVK 138

Query: 61  KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            S L + S  +S  K++ ++ D  +++ +L   
Sbjct: 139 PSKLSIQSGTTSRSKVVLLETD-ADLSTILAQI 170


>gi|296235859|ref|XP_002763077.1| PREDICTED: UPF0235 protein C15orf40-like [Callithrix jacchus]
          Length = 154

 Score = 74.8 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 17/87 (19%), Positives = 38/87 (43%), Gaps = 9/87 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P ++++ +  L +         + + + A P +G+AN  +   L+K L L K
Sbjct: 64  VTIAIHAKPGSRQNAVTDLTV-------EAINVAIAAPPSEGEANAELCRYLSKVLELRK 116

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEI 86
           S + +     S  K++ +      +EI
Sbjct: 117 SDVVLDKSGKSREKVVKLLASTTPEEI 143


>gi|329902481|ref|ZP_08273126.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327548773|gb|EGF33410.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 105

 Score = 74.8 bits (183), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 21/92 (22%), Positives = 45/92 (48%), Gaps = 10/92 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V++  NAKK+ +  +           +KIK+ A P +GKAN A++  +A +L + +
Sbjct: 16  VRLAVQVAANAKKTEVIGV-------ADDVLKIKLHAQPIEGKANDALVRFVAGQLHVPR 68

Query: 62  SSLRMLSKQSSPLKIIYIDK---DCKEITELL 90
           +++ +    +S  K++ +        E+   L
Sbjct: 69  TTVSVTHGLTSKRKLLLVRAVGLSVDEVQRAL 100


>gi|148555779|ref|YP_001263361.1| hypothetical protein Swit_2871 [Sphingomonas wittichii RW1]
 gi|148500969|gb|ABQ69223.1| protein of unknown function DUF167 [Sphingomonas wittichii RW1]
          Length = 74

 Score = 74.4 bits (182), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 20/68 (29%), Positives = 36/68 (52%)

Query: 24 KDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDC 83
           D      + IK+ A P  G AN+A++ ++AK L ++K  + + S  +S LK +++  D 
Sbjct: 3  ADADGRRWLSIKLAAAPSDGAANEALVRLVAKALGVAKRDVTLASGATSRLKRLHVSGDP 62

Query: 84 KEITELLQ 91
            +   LQ
Sbjct: 63 AALAAALQ 70


>gi|256394873|ref|YP_003116437.1| hypothetical protein Caci_5738 [Catenulispora acidiphila DSM
          44928]
 gi|256361099|gb|ACU74596.1| protein of unknown function DUF167 [Catenulispora acidiphila DSM
          44928]
          Length = 92

 Score = 74.4 bits (182), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 16/92 (17%), Positives = 44/92 (47%), Gaps = 7/92 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + +R+ P + ++ +              + + VTA    G A +A L  +A+ L + + 
Sbjct: 5  RIPIRVKPGSSRTKVGGRH------GERSLIVAVTAKAVDGAATEAALRAVAEALGMPRR 58

Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
          ++++++  +S  K++ +   D   + EL+++ 
Sbjct: 59 AVQLITGATSRDKVLGVSSEDPDTVRELVRDL 90


>gi|78221372|ref|YP_383119.1| hypothetical protein Gmet_0145 [Geobacter metallireducens GS-15]
 gi|78192627|gb|ABB30394.1| protein of unknown function DUF167 [Geobacter metallireducens
          GS-15]
          Length = 100

 Score = 74.4 bits (182), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 8/86 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           PNAK+  I  +           + + VTA P+ G+A   M+  LA +  +S   ++++ 
Sbjct: 20 TPNAKRDAIGKV-------KGHQLCVSVTAIPRAGRATDHMVRFLADEFGVSVGDIQVVF 72

Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94
           + +  K + I    K +  L+   +
Sbjct: 73 GRMNVNKQLRIKA-PKRLPPLIGQQE 97


>gi|197118100|ref|YP_002138527.1| hypothetical protein Gbem_1715 [Geobacter bemidjiensis Bem]
 gi|197087460|gb|ACH38731.1| protein of unknown function DUF167 [Geobacter bemidjiensis Bem]
          Length = 101

 Score = 74.4 bits (182), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 20/86 (23%), Positives = 39/86 (45%), Gaps = 8/86 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           PNAK+  I              + I VTA P+ G+A   M+  LA++  ++ S ++++ 
Sbjct: 21 TPNAKRDAIG-------KPKGHQLCISVTAVPRAGRATDHMVRFLAEEFEVAVSDIQVVF 73

Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94
           + +  K + I    K +  ++   D
Sbjct: 74 GRMNVNKQLRIKA-PKRLPSVIGQQD 98


>gi|38345823|emb|CAD41928.2| OSJNBa0070M12.6 [Oryza sativa Japonica Group]
          Length = 207

 Score = 74.4 bits (182), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 15/90 (16%), Positives = 41/90 (45%), Gaps = 8/90 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + ++  P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 120 ISIQAKPGSKLATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 172

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92
           + + S   S  K++ + D   + + + L+ 
Sbjct: 173 VSIGSGSKSREKVVLVQDATLQGVFDALKK 202


>gi|18978137|ref|NP_579494.1| hypothetical protein PF1765 [Pyrococcus furiosus DSM 3638]
 gi|29839724|sp|Q8U052|Y1765_PYRFU RecName: Full=UPF0235 protein PF1765
 gi|18893938|gb|AAL81889.1| hypothetical protein PF1765 [Pyrococcus furiosus DSM 3638]
          Length = 92

 Score = 74.4 bits (182), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 9/91 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + PNA+++ I  ++          +K+ V A P KGKANK ++    K      
Sbjct: 9  VILSVIVAPNARETKIVGIDGT-----RGRVKVNVAAPPVKGKANKELMKFFKKLFG--- 60

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
          + + ++  ++S  K + I     KE+ E L+
Sbjct: 61 AEVVIVRGETSREKDLLIKGITKKEVIEKLE 91


>gi|85092099|ref|XP_959226.1| hypothetical protein NCU06879 [Neurospora crassa OR74A]
 gi|28920629|gb|EAA29990.1| predicted protein [Neurospora crassa OR74A]
          Length = 130

 Score = 74.4 bits (182), Expect = 5e-12,   Method: Composition-based stats.
 Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +   + P A K+    +    D      ++I V A  ++G+ANKA++ +L++ L L KS+
Sbjct: 30  IHCHVKPGASKNR-EGVTSITD----EAVEICVAAQAKEGEANKAVVKVLSEALNLPKSN 84

Query: 64  LRMLSKQSSPLKIIYIDKD 82
           L +     S  K I +   
Sbjct: 85  LEITQGLKSRAKTIAVAAP 103


>gi|294949448|ref|XP_002786202.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239900359|gb|EER17998.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 132

 Score = 74.0 bits (181), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 19/95 (20%), Positives = 47/95 (49%), Gaps = 9/95 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK-LALS 60
             + +R  P AK S +  ++          + +++ A+ + G+AN+ +L+ L+K+ L + 
Sbjct: 41  ARIAIRAKPGAKVSCLTGIDA------EGALGVQLNASARDGEANEELLSFLSKEVLGVK 94

Query: 61  KSSLRMLSKQSSPLKIIYIDK--DCKEITELLQNN 93
           K  + ++    S  K++ I       +++ LL++ 
Sbjct: 95  KKDVALVQGSKSREKVVEIADVLTVDDVSRLLRDE 129


>gi|304314780|ref|YP_003849927.1| hypothetical protein MTBMA_c10190 [Methanothermobacter
          marburgensis str. Marburg]
 gi|302588239|gb|ADL58614.1| conserved hypothetical protein [Methanothermobacter marburgensis
          str. Marburg]
          Length = 102

 Score = 74.0 bits (181), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 9/91 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V + + P +        E+         ++IKV A P+KGKAN+ ++   +       ++
Sbjct: 13 VDIEVSPAS-----GGFEVRSYNEWRKRIEIKVRAPPEKGKANREIIEEFSAAFN---TN 64

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
            ++S   S  K + I   D +    LL+  
Sbjct: 65 ADIVSGHKSRHKTLKIYGMDAETFRTLLEEK 95


>gi|33359419|ref|NP_877861.1| hypothetical protein PH1669.1n [Pyrococcus horikoshii OT3]
          Length = 95

 Score = 74.0 bits (181), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 20/89 (22%), Positives = 46/89 (51%), Gaps = 9/89 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + PN+K++ I  ++          ++I + A P KG+ANK ++  L+K L    + 
Sbjct: 14 IQVIVRPNSKENKIEGVDN-----WKNRIRISIKAPPVKGEANKELIKFLSKILG---AK 65

Query: 64 LRMLSKQSSPLKIIYIDKDC-KEITELLQ 91
          + ++  ++S  K + +     +E+ + L+
Sbjct: 66 VEIIRGETSREKDLLVKGIKLEEVKKRLK 94


>gi|225850940|ref|YP_002731174.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1]
 gi|259646383|sp|C0QR78|Y1406_PERMH RecName: Full=UPF0235 protein PERMA_1406
 gi|225645391|gb|ACO03577.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1]
          Length = 73

 Score = 74.0 bits (181), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V V++ PNAKK  I  ++         + +I+VT  P+KGKAN  ++ +L+K L + K
Sbjct: 1  MIVKVKVKPNAKKEEIREIQK-------DYFEIRVTVPPEKGKANSRVIELLSKHLKIPK 53

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S +++   + S  KI  I
Sbjct: 54 SRIKLKKGEKSREKIFEI 71


>gi|67523595|ref|XP_659857.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4]
 gi|40744782|gb|EAA63938.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4]
 gi|259487644|tpe|CBF86471.1| TPA: DUF167 domain protein (AFU_orthologue; AFUA_5G06647)
           [Aspergillus nidulans FGSC A4]
          Length = 130

 Score = 74.0 bits (181), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           ++  R+ PNA                   + + V A P+ G+AN A+  + AK   ++KS
Sbjct: 31  HISCRVKPNAS-----GGREGITAVGNETVDVCVAAVPRDGEANLAVSQVFAKVFNVAKS 85

Query: 63  SLRMLSKQSSPLKIIYI 79
            + ++    S  K++ I
Sbjct: 86  DVGVIHGLKSRDKVLCI 102


>gi|290996548|ref|XP_002680844.1| predicted protein [Naegleria gruberi]
 gi|284094466|gb|EFC48100.1| predicted protein [Naegleria gruberi]
          Length = 73

 Score = 74.0 bits (181), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 17/79 (21%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +   PN+  S IA++   +         + + A P++G+ANK +   ++  L +SK
Sbjct: 2  IRLTILAKPNSSSSQIANINDEEIG-------VHIAAPPKEGEANKELCDYVSGVLGVSK 54

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S + +     S  K++ ++
Sbjct: 55 SRVTLDRGGKSRHKLLLVE 73


>gi|332178747|gb|AEE14436.1| UPF0235 protein yggU [Thermodesulfobium narugense DSM 14796]
          Length = 86

 Score = 74.0 bits (181), Expect = 7e-12,   Method: Composition-based stats.
 Identities = 20/89 (22%), Positives = 43/89 (48%), Gaps = 10/89 (11%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + +++ PNAKK  I              +  KV+A P+ GKAN+ ++ ++++     + 
Sbjct: 4  KIELKVTPNAKKESIE--------IKDGKIYCKVSAPPEDGKANRRVIELISEFFDCKRK 55

Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91
           + + S + S  KI+ I    + I + ++
Sbjct: 56 DVEIFSGEKSKNKILLIK--SENIFKKIK 82


>gi|6503188|gb|AAF14630.1|AF200362_6 unknown [Haemophilus ducreyi]
          Length = 62

 Score = 73.6 bits (180), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 18/62 (29%), Positives = 31/62 (50%), Gaps = 3/62 (4%)

Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD---CKEITE 88
          +K+ +TA P  G AN  +L  L+K   + KSS+ +   +    K +++       KEI +
Sbjct: 1  LKVAITAPPVDGAANAYLLKYLSKLFKVPKSSIVLEKGELQRHKQLFVPAPKLLPKEIEQ 60

Query: 89 LL 90
           L
Sbjct: 61 WL 62


>gi|327401804|ref|YP_004342643.1| hypothetical protein Arcve_1935 [Archaeoglobus veneficus SNP6]
 gi|327317312|gb|AEA47928.1| UPF0235 protein yggU [Archaeoglobus veneficus SNP6]
          Length = 101

 Score = 73.6 bits (180), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 18/88 (20%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + PN+K+S +              + ++V A P+ GKAN  ++ + +K     +  
Sbjct: 16 ISIEVTPNSKQSCVYGYN-----EWRKSIAVRVKAPPKGGKANAEIVELFSKIF---RKK 67

Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELL 90
          + ++   +S  K+I++     +E+  LL
Sbjct: 68 VEIVKGHTSSQKVIFVHSASPQEVESLL 95


>gi|261416137|ref|YP_003249820.1| protein of unknown function DUF167 [Fibrobacter succinogenes
          subsp. succinogenes S85]
 gi|261372593|gb|ACX75338.1| protein of unknown function DUF167 [Fibrobacter succinogenes
          subsp. succinogenes S85]
 gi|302328014|gb|ADL27215.1| conserved hypothetical protein [Fibrobacter succinogenes subsp.
          succinogenes S85]
          Length = 74

 Score = 73.6 bits (180), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 14/80 (17%), Positives = 33/80 (41%), Gaps = 7/80 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++   +K+  +               K++V A P  G AN A+  ++A    + K
Sbjct: 1  MRINIKVHARSKRESVT-------PQPDGSYKVEVKAPPVDGAANAAICELIADYFHVHK 53

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
            + ++   ++  K+I I  
Sbjct: 54 RDVSVVMGSTNNKKVIEILG 73


>gi|237756139|ref|ZP_04584711.1| conserved domain protein [Sulfurihydrogenibium yellowstonense
          SS-5]
 gi|237691703|gb|EEP60739.1| conserved domain protein [Sulfurihydrogenibium yellowstonense
          SS-5]
          Length = 73

 Score = 73.6 bits (180), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 16/78 (20%), Positives = 41/78 (52%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ P A ++ +  ++           +++ T  P+KGKAN+ ++ +L+    + K
Sbjct: 1  MRIKVKVKPGASENEVKKIDEYL-------YEVRTTTIPEKGKANEKVIELLSDFFDVPK 53

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S ++++  Q+S  K + +
Sbjct: 54 SKIKIIKGQASREKEVEV 71


>gi|298674092|ref|YP_003725842.1| hypothetical protein Metev_0115 [Methanohalobium evestigatum
           Z-7303]
 gi|298287080|gb|ADI73046.1| protein of unknown function DUF167 [Methanohalobium evestigatum
           Z-7303]
          Length = 109

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 16/93 (17%), Positives = 43/93 (46%), Gaps = 5/93 (5%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  + + + PN+K   +              +++K+T   + GKAN  ++   ++   ++
Sbjct: 13  LVVIDISVTPNSKTINV----PDNYNQWRNRIEVKLTQKAESGKANNQLIENFSEFFGVN 68

Query: 61  KSSLRMLSKQSSPLKIIYIDKDC-KEITELLQN 92
           KSS+++ S + S  K + +      +   +L++
Sbjct: 69  KSSIKITSGEKSSQKSVSVKGLSYDDAVSILKS 101


>gi|87198097|ref|YP_495354.1| hypothetical protein Saro_0071 [Novosphingobium aromaticivorans
          DSM 12444]
 gi|87133778|gb|ABD24520.1| protein of unknown function DUF167 [Novosphingobium
          aromaticivorans DSM 12444]
          Length = 95

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 8/81 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + VR+ P AK  GI          D   + +KV A P+ GKA  A+  +LA+ L L+ S
Sbjct: 23 RLAVRVTPGAKSEGIE--------IDGGRVLVKVRAKPEDGKATAAVQELLARALGLAPS 74

Query: 63 SLRMLSKQSSPLKIIYIDKDC 83
           + ML   +S  K+  I ++ 
Sbjct: 75 KVEMLRGATSREKLFRIPREA 95


>gi|225431557|ref|XP_002282176.1| PREDICTED: hypothetical protein [Vitis vinifera]
 gi|147866968|emb|CAN83055.1| hypothetical protein VITISV_009894 [Vitis vinifera]
          Length = 128

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 14/79 (17%), Positives = 36/79 (45%), Gaps = 7/79 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++ V   P +K S I   +          + +++ A  + G+AN A+L  ++  + + +
Sbjct: 39  VSITVHAKPGSKVSSITDFD-------DEALGVQIDAPAKDGEANAALLDYISSVVGVKR 91

Query: 62  SSLRMLSKQSSPLKIIYID 80
             + + S   S  K++ ++
Sbjct: 92  RQVSISSGSKSRDKVVIVE 110


>gi|149057381|gb|EDM08704.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus]
 gi|149057386|gb|EDM08709.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus]
          Length = 114

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 20/96 (20%), Positives = 37/96 (38%), Gaps = 21/96 (21%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++                    VTA P +G+AN  +   L+K L L K
Sbjct: 36  VTIAIHAKPGSKQNA-------------------VTAPPSEGEANAELCRYLSKVLDLRK 76

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 77  SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 112


>gi|289615268|emb|CBI58035.1| unnamed protein product [Sordaria macrospora]
          Length = 133

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 5/79 (6%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +   + P A K     +    D      ++I V A  ++G+ANK+++ +L++ L + KS+
Sbjct: 32  IHCHVKPGASKQR-EGVTCITD----EAVEICVAAQAKEGEANKSVVKVLSEALNIPKSN 86

Query: 64  LRMLSKQSSPLKIIYIDKD 82
           L +     S  K I +   
Sbjct: 87  LEITQGLKSRAKTIAVAAP 105


>gi|89899559|ref|YP_522030.1| hypothetical protein Rfer_0749 [Rhodoferax ferrireducens T118]
 gi|89344296|gb|ABD68499.1| protein of unknown function DUF167 [Rhodoferax ferrireducens T118]
          Length = 114

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 18/88 (20%), Positives = 33/88 (37%), Gaps = 8/88 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P+A K  I              +K+ V A P  G+A   M+  LA +  +S S + ++ 
Sbjct: 28  KPSASKDAIG-------KPFGKQLKVSVAAAPVAGRATDHMVRFLAVQFGVSASDIEVVF 80

Query: 69  KQSSPLKIIYIDKDCKEITELLQNNDSL 96
            + +  K + I      +  +      L
Sbjct: 81  GRMNVNKQVRIKA-PTRLPAVFAQGSLL 107


>gi|148674974|gb|EDL06921.1| RIKEN cDNA 3110040N11, isoform CRA_c [Mus musculus]
          Length = 114

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 21/96 (21%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P ++++                    VTA P +G+AN  +   L+K L L K
Sbjct: 36  VTIAIHAKPGSRQNA-------------------VTAPPSQGEANAELCRYLSKVLDLRK 76

Query: 62  SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95
           S + +     S  K++ +      +E+ E L+    
Sbjct: 77  SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTEAE 112


>gi|326504128|dbj|BAK02850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 129

 Score = 73.2 bits (179), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 17/90 (18%), Positives = 40/90 (44%), Gaps = 8/90 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 43  ISIHAKPGSKMATITEV-------GEEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 95

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92
           + + S   S  K++ + D   K + E L+ 
Sbjct: 96  VSIGSGSKSREKVVLVQDATLKGVFEALKK 125


>gi|326926706|ref|XP_003209539.1| PREDICTED: UPF0235 protein C15orf40 homolog [Meleagris gallopavo]
          Length = 85

 Score = 72.8 bits (178), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 2/69 (2%)

Query: 30 IHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEIT 87
            + + + A P +G+AN  +   L+K L + KS + +     S  K++ I       E+ 
Sbjct: 17 EAVGVAIAAPPSEGEANAELCRYLSKVLQVKKSDVILEKGGKSRDKVVKILVSLTPDEVL 76

Query: 88 ELLQNNDSL 96
          E L+   S 
Sbjct: 77 EKLKKEAST 85


>gi|78777110|ref|YP_393425.1| hypothetical protein Suden_0912 [Sulfurimonas denitrificans DSM
           1251]
 gi|78497650|gb|ABB44190.1| Protein of unknown function DUF167 [Sulfurimonas denitrificans DSM
           1251]
          Length = 101

 Score = 72.8 bits (178), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 22/85 (25%), Positives = 40/85 (47%), Gaps = 8/85 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P+AK+  I  +           +K+ V A P+ GKA   M+  LAK+  +S SS+ ++ 
Sbjct: 25  TPSAKRDVIGKVRA-------NQLKVSVRAQPEGGKATDYMVGFLAKEFGVSVSSIEVVY 77

Query: 69  KQSSPLKIIYIDKDCKEITELLQNN 93
            + S  K + I    K++   ++  
Sbjct: 78  GRESIHKQLRIKA-PKKLPNSIEFE 101


>gi|242077752|ref|XP_002448812.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor]
 gi|241939995|gb|EES13140.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor]
          Length = 131

 Score = 72.8 bits (178), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 15/90 (16%), Positives = 40/90 (44%), Gaps = 8/90 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 44  ISIHAKPGSKVATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 96

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92
           + + S   S  K++ + D   + + + L+ 
Sbjct: 97  VSIGSGSKSREKVVLVQDATLEGVYDALKK 126


>gi|320101055|ref|YP_004176647.1| hypothetical protein Desmu_0861 [Desulfurococcus mucosus DSM 2162]
 gi|319753407|gb|ADV65165.1| protein of unknown function DUF167 [Desulfurococcus mucosus DSM
           2162]
          Length = 138

 Score = 72.8 bits (178), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 16/91 (17%), Positives = 39/91 (42%), Gaps = 9/91 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ P   +           T +   +    +  P++G+AN A++   +++L +  S 
Sbjct: 52  LSIRVKPGESED--------FLTVEGDELVFYTSEPPERGRANAALVKFFSRELKIPVSR 103

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
           + ++    S LK +   D +  E+ + L   
Sbjct: 104 IDIVYGHRSTLKKLVFYDVNMDELADKLAKL 134


>gi|260429009|ref|ZP_05782986.1| conserved hypothetical protein [Citreicella sp. SE45]
 gi|260419632|gb|EEX12885.1| conserved hypothetical protein [Citreicella sp. SE45]
          Length = 85

 Score = 72.8 bits (178), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 8/79 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +R+ P A ++ I          D   +++ VT  P+ GKAN A+  +LAK L + K
Sbjct: 14 ATLALRVTPRASRNEI--------REDGDQLRVLVTTVPEDGKANAAVAKLLAKALGVPK 65

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S L ++   +S  K+  I+
Sbjct: 66 SRLTLIQGATSRDKVFRIE 84


>gi|119498429|ref|XP_001265972.1| YggU family protein [Neosartorya fischeri NRRL 181]
 gi|119414136|gb|EAW24075.1| YggU family protein [Neosartorya fischeri NRRL 181]
          Length = 133

 Score = 72.8 bits (178), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 5/78 (6%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            +   + PNA  S                + + V A P+ G+AN A+  + A+   + KS
Sbjct: 33  QIACHVKPNASSSR-----EGIIAVGAEKVDVCVAAVPRNGEANAAVSRVFAQIFDVPKS 87

Query: 63  SLRMLSKQSSPLKIIYID 80
           +  ++    S  K + I 
Sbjct: 88  NAEVIRGLKSRDKTLCIT 105


>gi|68074573|ref|XP_679202.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56499889|emb|CAH95244.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 106

 Score = 72.8 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 42/91 (46%), Gaps = 7/91 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ PN+K + I          DT  + I +   P   ++N A+++  +  L L K  
Sbjct: 20  INLRVKPNSKNTSI------YFNVDTEVLNINIQEQPVNNQSNVAIISYFSDILNLKKRD 73

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           + +++   S  K++ +     ++++  +  N
Sbjct: 74  ISIVAGLKSRDKVLMVSNISVEDLSNKINQN 104


>gi|259418717|ref|ZP_05742634.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B]
 gi|259344939|gb|EEW56793.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B]
          Length = 92

 Score = 72.8 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A  + +              +K+ VT  P+ GKA   + A+LA+ L ++ 
Sbjct: 20 AEITVRVTPKAAYNAVL--------RQGDVIKVMVTTVPEDGKATADVAALLARALGVAP 71

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S + +    +S  K   + 
Sbjct: 72 SQITLRRGATSRDKTFVLT 90


>gi|320101820|ref|YP_004177411.1| hypothetical protein Isop_0265 [Isosphaera pallida ATCC 43644]
 gi|319749102|gb|ADV60862.1| protein of unknown function DUF167 [Isosphaera pallida ATCC 43644]
          Length = 131

 Score = 72.8 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 42/90 (46%), Gaps = 8/90 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V   P +++ G+                      P+KGKAN A+L +LA  L ++KS 
Sbjct: 34  LAVMARPRSRRPGVVGTWCGAVVVAIAA-------APEKGKANAAILEILADLLGIAKSR 86

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
           L ++S  ++  K++ I   +  ++ + + +
Sbjct: 87  LDLVSGATARSKVVRIAGLEPDQVRQAISD 116


>gi|302381263|ref|YP_003817086.1| hypothetical protein Bresu_0148 [Brevundimonas subvibrioides ATCC
          15264]
 gi|302191891|gb|ADK99462.1| protein of unknown function DUF167 [Brevundimonas subvibrioides
          ATCC 15264]
          Length = 92

 Score = 72.8 bits (178), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 3/91 (3%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + VRL P A    I       D      +K++V A P +G+AN A++ +LAK L + 
Sbjct: 1  MARLPVRLTPGASTDRIDGW--DADPEGRPVLKVRVRARPVEGEANAALILLLAKALGVP 58

Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          +S++ +     S LK+I ++  D   +   L
Sbjct: 59 RSTVSLARGGQSRLKMIEVEGLDDAGLRARL 89


>gi|254409972|ref|ZP_05023752.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC
          7420]
 gi|196183008|gb|EDX77992.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC
          7420]
          Length = 74

 Score = 72.5 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           V++ PN+K   I       +      +K+ + + P  GKANK ++ +LA+K  ++KS +
Sbjct: 4  SVKVKPNSKTQSI-------EEMADGTLKVNLKSPPVDGKANKELIELLAEKFNVTKSQV 56

Query: 65 RMLSKQSSPLKIIYIDKD 82
          ++ S  SS +K+I I  D
Sbjct: 57 QIKSGLSSKIKLIEIVAD 74


>gi|326522380|dbj|BAK07652.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531514|dbj|BAJ97761.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 129

 Score = 72.5 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 17/90 (18%), Positives = 40/90 (44%), Gaps = 8/90 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 43  ISIHAKPGSKMATITEV-------GEEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 95

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92
           + + S   S  K++ + D   K + E L+ 
Sbjct: 96  VSIGSGSKSREKVVLVQDATLKGVFEALKK 125


>gi|226496211|ref|NP_001150562.1| LOC100284194 [Zea mays]
 gi|195640232|gb|ACG39584.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea
           mays]
          Length = 129

 Score = 72.5 bits (177), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 15/90 (16%), Positives = 40/90 (44%), Gaps = 8/90 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 42  ISIHAKPGSKVATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 94

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92
           + + S   S  K++ + D   + + + L+ 
Sbjct: 95  VSIGSGSKSREKVVLVQDATLEGVYDALKK 124


>gi|332157902|ref|YP_004423181.1| hypothetical protein PNA2_0260 [Pyrococcus sp. NA2]
 gi|331033365|gb|AEC51177.1| hypothetical protein PNA2_0260 [Pyrococcus sp. NA2]
          Length = 92

 Score = 72.1 bits (176), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + PNAKK+ I  ++          ++I V A P KGKAN+ ++  L   L    + 
Sbjct: 11 IYVLVKPNAKKTEIEGVDT-----WKKRIRISVKAPPVKGKANRELVNFLQGLLN---AE 62

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90
          + ++  ++S  K + I     +E+   L
Sbjct: 63 VILVRGETSREKELLIKGLKVEEVKRKL 90


>gi|297526283|ref|YP_003668307.1| protein of unknown function DUF167 [Staphylothermus hellenicus DSM
           12710]
 gi|297255199|gb|ADI31408.1| protein of unknown function DUF167 [Staphylothermus hellenicus DSM
           12710]
          Length = 110

 Score = 72.1 bits (176), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 16/95 (16%), Positives = 42/95 (44%), Gaps = 9/95 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + + + PN+ +  +          +   +    T  P+KG+AN A++  L++ + L  
Sbjct: 22  VIIPIYVKPNSDRDALV--------LEGDELVFYTTEIPEKGRANAALIRFLSRNIRLPH 73

Query: 62  SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNNDS 95
           + + ++    +  K + + D + +++ E L    S
Sbjct: 74  NKIDIIYGARTRSKKVLVRDMEAEKLAEKLAEIIS 108


>gi|30694498|ref|NP_175343.2| unknown protein [Arabidopsis thaliana]
 gi|34365605|gb|AAQ65114.1| At1g49170 [Arabidopsis thaliana]
 gi|51971533|dbj|BAD44431.1| similar to serine/threonine kinase 9 gb|AAD28798.1 [Arabidopsis
           thaliana]
 gi|332194278|gb|AEE32399.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 126

 Score = 72.1 bits (176), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 15/95 (15%), Positives = 39/95 (41%), Gaps = 8/95 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  +             +++ A  + G+AN A+L  ++  L + +  
Sbjct: 39  ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEYMSSVLGVKRRQ 91

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSLT 97
           + + S   S  K++ ++    + + + L      T
Sbjct: 92  VSLGSGSKSRDKVVIVEDMTQQSVFQALSQASKPT 126


>gi|219852331|ref|YP_002466763.1| protein of unknown function DUF167 [Methanosphaerula palustris
           E1-9c]
 gi|219546590|gb|ACL17040.1| protein of unknown function DUF167 [Methanosphaerula palustris
           E1-9c]
          Length = 106

 Score = 72.1 bits (176), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 19/90 (21%), Positives = 42/90 (46%), Gaps = 5/90 (5%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +++ +   AK                  +   +T  P +GKAN+A++A+L++ L + +S 
Sbjct: 17  LLLDVNSKAKADRF----PAGYNEWRHAIGCSITTPPVEGKANRAIVALLSRTLTIPQSG 72

Query: 64  LRMLSKQSSPLKIIYIDKDC-KEITELLQN 92
           + +LS  +S  K + I     +++   L+ 
Sbjct: 73  ISILSGATSSQKRVLIQGMTFEQLAGFLRE 102


>gi|262201697|ref|YP_003272905.1| hypothetical protein Gbro_1753 [Gordonia bronchialis DSM 43247]
 gi|262085044|gb|ACY21012.1| protein of unknown function DUF167 [Gordonia bronchialis DSM
          43247]
          Length = 75

 Score = 72.1 bits (176), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 6/77 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V+V + PN++K  +       +T     + I V     +G+ANKA+  +LAK L + KS
Sbjct: 4  QVVVTVKPNSRKGPLV------ETGPDGTVTIYVREPATEGRANKAVAELLAKHLGVPKS 57

Query: 63 SLRMLSKQSSPLKIIYI 79
           + ++   ++  K   +
Sbjct: 58 KVALVGGATARTKRFRV 74


>gi|238494338|ref|XP_002378405.1| DUF167 domain protein [Aspergillus flavus NRRL3357]
 gi|220695055|gb|EED51398.1| DUF167 domain protein [Aspergillus flavus NRRL3357]
          Length = 131

 Score = 72.1 bits (176), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 5/78 (6%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            +   + PNA  +                + + V A P+ G+AN A+  + A+ L + KS
Sbjct: 32  QISCNVKPNASANR-----EGIIAVGPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKS 86

Query: 63  SLRMLSKQSSPLKIIYID 80
           ++ ++    S  K + + 
Sbjct: 87  TVDVIRGLKSRDKTLCVS 104


>gi|282162781|ref|YP_003355166.1| hypothetical protein MCP_0111 [Methanocella paludicola SANAE]
 gi|282155095|dbj|BAI60183.1| conserved hypothetical protein [Methanocella paludicola SANAE]
          Length = 104

 Score = 72.1 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 15/90 (16%), Positives = 39/90 (43%), Gaps = 5/90 (5%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +   + P AK++ +              ++ ++ A P++G+AN+ ++  L+  L + +
Sbjct: 14 VLIDFEVSPGAKETRV----PSGYNEWRRRIEARLKAPPERGRANEELIGELSALLGIPE 69

Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELL 90
          S + + S      K + +     +E+   L
Sbjct: 70 SRIEITSGARDSRKSVKVLGASREEVLRRL 99


>gi|302868916|ref|YP_003837553.1| hypothetical protein Micau_4464 [Micromonospora aurantiaca ATCC
          27029]
 gi|315504614|ref|YP_004083501.1| hypothetical protein ML5_3839 [Micromonospora sp. L5]
 gi|302571775|gb|ADL47977.1| protein of unknown function DUF167 [Micromonospora aurantiaca
          ATCC 27029]
 gi|315411233|gb|ADU09350.1| protein of unknown function DUF167 [Micromonospora sp. L5]
          Length = 101

 Score = 72.1 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 19/88 (21%), Positives = 40/88 (45%), Gaps = 3/88 (3%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P A ++ +       D      + + V A    G+A +A    LA  L +  ++
Sbjct: 9  VAVRVKPGAARARVGG---RFDGPYGPALVVAVHAPAVDGRATEAARRALADALGIRPAT 65

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          + + S  +S  K+  +++    + E+L+
Sbjct: 66 VSLRSGAASRDKLFLVERPHDGLPEVLR 93


>gi|307824080|ref|ZP_07654307.1| protein of unknown function DUF167 [Methylobacter tundripaludum
           SV96]
 gi|307734864|gb|EFO05714.1| protein of unknown function DUF167 [Methylobacter tundripaludum
           SV96]
          Length = 108

 Score = 72.1 bits (176), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 8/84 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P+AK+  I  +           +K+ VTA P  G+A   M+  LAK+  ++   + ++ 
Sbjct: 26  QPSAKQDAIGKV-------KGNQLKVSVTAAPVAGRATDHMVRFLAKEFGVTPKDIEVVF 78

Query: 69  KQSSPLKIIYIDKDCKEITELLQN 92
            + +  K + I    K +  ++  
Sbjct: 79  GRFNVNKQLRIK-SPKNLPSVINK 101


>gi|154251824|ref|YP_001412648.1| hypothetical protein Plav_1371 [Parvibaculum lavamentivorans
          DS-1]
 gi|154155774|gb|ABS62991.1| protein of unknown function DUF167 [Parvibaculum lavamentivorans
          DS-1]
          Length = 108

 Score = 71.7 bits (175), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 44/89 (49%), Gaps = 2/89 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           ++ +R+ P      I  L           ++++V+A  +KGKAN A+L +LA+ L L +
Sbjct: 10 VSLRLRVTPRGGADRIDGLAADASGEP--FLRVRVSAVAEKGKANDAVLKLLARALRLPR 67

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S+  + S ++   K + I  D + +   L
Sbjct: 68 SAFAVASGEAGRTKSVTISGDTERLMADL 96


>gi|224131202|ref|XP_002328480.1| predicted protein [Populus trichocarpa]
 gi|222838195|gb|EEE76560.1| predicted protein [Populus trichocarpa]
          Length = 131

 Score = 71.7 bits (175), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 14/77 (18%), Positives = 34/77 (44%), Gaps = 7/77 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  L             +++ A  + G+AN A+L  ++  L + +  
Sbjct: 44  ITIHAKPGSKSASITDLSDEAVG-------VQIDAPAKDGEANAALLDYISSVLGVKRRQ 96

Query: 64  LRMLSKQSSPLKIIYID 80
           + + S   S  K++ ++
Sbjct: 97  VSIGSGSKSRDKVVIVE 113


>gi|284161412|ref|YP_003400035.1| hypothetical protein Arcpr_0292 [Archaeoglobus profundus DSM
          5631]
 gi|284011409|gb|ADB57362.1| protein of unknown function DUF167 [Archaeoglobus profundus DSM
          5631]
          Length = 97

 Score = 71.7 bits (175), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 38/91 (41%), Gaps = 9/91 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V + PNAK++ I   +          +K+ V + P+ GKAN+ +   L          
Sbjct: 10 IEVDVSPNAKRTAITGYD-----PWRKALKVSVKSPPRGGKANRELTDFLGGIFNC---K 61

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
          + ++  + S  K + +     ++   +L+  
Sbjct: 62 VEIVKGEKSTKKTVLLKGLSLEKALSILKEL 92


>gi|54022885|ref|YP_117127.1| hypothetical protein nfa9180 [Nocardia farcinica IFM 10152]
 gi|54014393|dbj|BAD55763.1| hypothetical protein [Nocardia farcinica IFM 10152]
          Length = 72

 Score = 71.7 bits (175), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 20/73 (27%), Positives = 37/73 (50%), Gaps = 6/73 (8%)

Query: 8  LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
          + PN++K  +       +T+    + + V A   +GKANKA + +LA    +  S++R+ 
Sbjct: 5  IKPNSRKGPLV------ETAADGTLTLYVRAPAVEGKANKAAIDLLAAHYGVPTSAVRLT 58

Query: 68 SKQSSPLKIIYID 80
          +  +S  K   ID
Sbjct: 59 AGATSRHKRFDID 71


>gi|148643663|ref|YP_001274176.1| hypothetical protein Msm_1603 [Methanobrevibacter smithii ATCC
           35061]
 gi|148552680|gb|ABQ87808.1| conserved hypothetical protein Msm_1603 [Methanobrevibacter smithii
           ATCC 35061]
          Length = 101

 Score = 71.3 bits (174), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 13/92 (14%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + PN+ K  I+              +I++   PQKGKANK ++  L+K        
Sbjct: 18  IDIEVSPNSNKFQISGFN-----EWRNRFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69

Query: 64  LRMLSKQSSPLKIIY-----IDKDCKEITELL 90
           + +   + S  K I      ID   ++++E+L
Sbjct: 70  VSISKGEKSSQKTIVCYNVSIDCILEKLSEIL 101


>gi|297852564|ref|XP_002894163.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297340005|gb|EFH70422.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 126

 Score = 71.3 bits (174), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 15/95 (15%), Positives = 39/95 (41%), Gaps = 8/95 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  +             +++ A  + G+AN A+L  ++  L + +  
Sbjct: 39  ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEFISSVLGVKRRQ 91

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSLT 97
           + + S   S  K++ ++    + + + L      T
Sbjct: 92  VSLGSGSKSRDKVVIVEDMTQQSVFQALSQASKPT 126


>gi|91788526|ref|YP_549478.1| hypothetical protein Bpro_2664 [Polaromonas sp. JS666]
 gi|91697751|gb|ABE44580.1| protein of unknown function DUF167 [Polaromonas sp. JS666]
          Length = 103

 Score = 71.3 bits (174), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 19/85 (22%), Positives = 35/85 (41%), Gaps = 8/85 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P A +  I              +K+ VTA P+ GKA   M+  LA    ++ + + ++ 
Sbjct: 27  KPAASRDAIG-------KPKGTQLKVSVTAAPKSGKATDHMVRFLAPLFGVAVADIEVVF 79

Query: 69  KQSSPLKIIYIDKDCKEITELLQNN 93
            Q +  K + I    K++ E+    
Sbjct: 80  GQENVNKQLRIKA-PKKLPEVFTAK 103


>gi|302916303|ref|XP_003051962.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI
          77-13-4]
 gi|256732901|gb|EEU46249.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI
          77-13-4]
          Length = 75

 Score = 71.3 bits (174), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 18/77 (23%), Positives = 38/77 (49%), Gaps = 5/77 (6%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
          ++ + + P A K+                +++ V A  ++G+ANKA++ +L+  L + KS
Sbjct: 4  HLQLHVKPGASKNR-----EGVIAVTDDAIELCVAAQAREGEANKAVVQVLSSVLGVPKS 58

Query: 63 SLRMLSKQSSPLKIIYI 79
          SL++     S  K + +
Sbjct: 59 SLQLTHGLKSRDKTVVL 75


>gi|240170218|ref|ZP_04748877.1| PE-PGRS family protein [Mycobacterium kansasii ATCC 12478]
          Length = 76

 Score = 71.3 bits (174), Expect = 4e-11,   Method: Composition-based stats.
 Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 7/82 (8%)

Query: 1  MCNVIV-RLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59
          M +++V ++ P ++           +T   + + I V      GKAN+A+  +LA    L
Sbjct: 1  MADIVVVKVKPGSRN------GPRVETVSGVELTIYVPEPAVGGKANEAVARLLAAHFHL 54

Query: 60 SKSSLRMLSKQSSPLKIIYIDK 81
           ++ + ++S   S LK   ID+
Sbjct: 55 PRTRVELVSGARSRLKRFRIDR 76


>gi|74318638|ref|YP_316378.1| hypothetical protein Tbd_2620 [Thiobacillus denitrificans ATCC
          25259]
 gi|74058133|gb|AAZ98573.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC
          25259]
          Length = 106

 Score = 70.9 bits (173), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 8/84 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           P+AK   I              +K+ VTA P+ G+A   M+  LA +  ++ S++ ++ 
Sbjct: 22 KPSAKVDAIG-------KPKGHQLKVSVTAAPRAGRATDHMVRFLADEFGVATSAIEVVF 74

Query: 69 KQSSPLKIIYIDKDCKEITELLQN 92
           + +  K + I      +  + + 
Sbjct: 75 GRMNVNKQLRIKA-PTRLPAVFKA 97


>gi|255627953|gb|ACU14321.1| unknown [Glycine max]
          Length = 126

 Score = 70.9 bits (173), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 13/82 (15%), Positives = 35/82 (42%), Gaps = 7/82 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK + I  +             +++ A  + G+AN A+L  ++  L + +  
Sbjct: 39  ITIHAKPGAKSASITDISDEAVG-------VQIDAPARDGEANAALLDYISSVLGVKRRQ 91

Query: 64  LRMLSKQSSPLKIIYIDKDCKE 85
           + + +   S  K + ++   ++
Sbjct: 92  VSLGTGSKSRDKTVIVEDVTQQ 113


>gi|169777211|ref|XP_001823071.1| yggU family protein [Aspergillus oryzae RIB40]
 gi|83771808|dbj|BAE61938.1| unnamed protein product [Aspergillus oryzae]
          Length = 131

 Score = 70.9 bits (173), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 5/78 (6%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            +   + PNA  +                + + V A P+ G+AN A+  + A+ L + KS
Sbjct: 32  QISCNVKPNASANR-----EGIIAVGPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKS 86

Query: 63  SLRMLSKQSSPLKIIYID 80
           ++ ++    S  K + + 
Sbjct: 87  TVVVIRGLKSRDKTLCVS 104


>gi|296126259|ref|YP_003633511.1| hypothetical protein Bmur_1218 [Brachyspira murdochii DSM 12563]
 gi|296018075|gb|ADG71312.1| protein of unknown function DUF167 [Brachyspira murdochii DSM
          12563]
          Length = 84

 Score = 70.9 bits (173), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           N+ V++   AK +            +     +++ A    GKANKA++  LA +L + K
Sbjct: 1  MNIEVKVTAGAKSNSF--------KFENGAYYVRIMAKAIDGKANKAIIEFLADELNIKK 52

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93
            + +L  + S  KII I+ +  ++ E    N
Sbjct: 53 KDIDILKGEKSSKKIIAINIEENKLKEYFSKN 84


>gi|168032813|ref|XP_001768912.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679824|gb|EDQ66266.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 123

 Score = 70.9 bits (173), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 17/93 (18%), Positives = 42/93 (45%), Gaps = 8/93 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V   P +K S I   +            +++ A  ++G+AN A+L  +A+ L + +  
Sbjct: 38  ITVHAKPGSKLSAITDTDDGAVG-------VQIDAPAREGEANAALLEYIAEVLGIKRRQ 90

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDS 95
           + + S   S  K++ ++     ++ E ++   +
Sbjct: 91  VSLGSGSRSREKLVTVEGLTVDKVYEAIRRAST 123


>gi|223975045|gb|ACN31710.1| unknown [Zea mays]
          Length = 95

 Score = 70.9 bits (173), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 15/90 (16%), Positives = 40/90 (44%), Gaps = 8/90 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +   P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 8  ISIHAKPGSKVATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 60

Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92
          + + S   S  K++ + D   + + + L+ 
Sbjct: 61 VSIGSGSKSREKVVLVQDATLEGVYDALKK 90


>gi|91202864|emb|CAJ72503.1| similar to hypothetical protein YggU [Candidatus Kuenenia
          stuttgartiensis]
          Length = 97

 Score = 70.9 bits (173), Expect = 5e-11,   Method: Composition-based stats.
 Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 8/91 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V V++   + K  I         +    +K+ V+A P+KGKANKA++ +LA+   ++ SS
Sbjct: 13 VFVKVQAGSGKDRIVG-------NLGGRLKLAVSAAPEKGKANKAVVELLAETFHINSSS 65

Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
          + ++S ++S  K I I+    + I  LL  N
Sbjct: 66 IHIISGKTSRDKKIMIEGVTPESINTLLNFN 96


>gi|327399372|ref|YP_004340241.1| hypothetical protein Hipma_1220 [Hippea maritima DSM 10411]
 gi|327182001|gb|AEA34182.1| UPF0235 protein yggU [Hippea maritima DSM 10411]
          Length = 84

 Score = 70.9 bits (173), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 9/90 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ PN+K        + +   D   + +K+   P +GKANKA++  LAK+L ++K
Sbjct: 1  MILEVKVKPNSK--------VEQFDFDKGVLTLKIKEKPVEGKANKAVVDKLAKRLKVAK 52

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          S + ++  + S  K++ ID  D  EI   L
Sbjct: 53 SCIEIVKGEKSRSKLVRIDCLDDDEILLRL 82


>gi|322418791|ref|YP_004198014.1| hypothetical protein GM18_1269 [Geobacter sp. M18]
 gi|320125178|gb|ADW12738.1| protein of unknown function DUF167 [Geobacter sp. M18]
          Length = 105

 Score = 70.5 bits (172), Expect = 6e-11,   Method: Composition-based stats.
 Identities = 17/84 (20%), Positives = 38/84 (45%), Gaps = 8/84 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           PNAK+  I  +           + + VT  P+ G+A   M+  LA++  +S   ++++ 
Sbjct: 21 TPNAKRDAIGKV-------KGHQLCVSVTEFPRAGRATDHMVRFLAEEFGVSTGDIQVVF 73

Query: 69 KQSSPLKIIYIDKDCKEITELLQN 92
           + +  K + I    K +  ++ +
Sbjct: 74 GRMNVNKQLRIKA-PKRLPSVIAS 96


>gi|154309869|ref|XP_001554267.1| predicted protein [Botryotinia fuckeliana B05.10]
 gi|150851643|gb|EDN26836.1| predicted protein [Botryotinia fuckeliana B05.10]
          Length = 130

 Score = 70.5 bits (172), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 24/96 (25%), Positives = 45/96 (46%), Gaps = 8/96 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++   + PN+  S +  ++   DTS    +++ V    +  +ANK ++ +L+  L   K
Sbjct: 23  IHLSCHVKPNSSASRV-GVKAFSDTSS--SIEVCVAQPARDNEANKGVVEVLSHILKCPK 79

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97
           + L ++  ++S  KII          E L  NDSL 
Sbjct: 80  TDLEVIRGKTSKNKIIAYKG-----IEDLLANDSLA 110


>gi|310801877|gb|EFQ36770.1| hypothetical protein GLRG_11914 [Glomerella graminicola M1.001]
          Length = 92

 Score = 70.5 bits (172), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 5/76 (6%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +  R+ P A +     +           +++ V A  ++G+AN+A++ +L++ L L KS 
Sbjct: 22 LQCRVKPGASR-----VREGIVAVTDGGVELCVAAQAREGEANRAVIKLLSEILGLPKSD 76

Query: 64 LRMLSKQSSPLKIIYI 79
          L +     S  K + +
Sbjct: 77 LIISQGLKSRDKTVAV 92


>gi|324527094|gb|ADY48748.1| Unknown [Ascaris suum]
          Length = 181

 Score = 70.5 bits (172), Expect = 7e-11,   Method: Composition-based stats.
 Identities = 22/94 (23%), Positives = 45/94 (47%), Gaps = 10/94 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   PNAK S +  +           +++ + A P KG+AN+A+   +A+ L L K+ 
Sbjct: 92  LKIHAKPNAKISRVTEIN-------ETEIEVAIAAPPHKGQANEALTDAIAEILGLRKND 144

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94
           +   +   S  K++ I+      +E+ E L+ + 
Sbjct: 145 VFFDTGARSRSKLLVINSQRITVEEVREKLKKSA 178


>gi|119946689|ref|YP_944369.1| hypothetical protein Ping_3071 [Psychromonas ingrahamii 37]
 gi|119865293|gb|ABM04770.1| hypothetical protein DUF167 [Psychromonas ingrahamii 37]
          Length = 111

 Score = 70.5 bits (172), Expect = 8e-11,   Method: Composition-based stats.
 Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 10/90 (11%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P AKK  +         +  I ++++VTATP  G+A   M+  LAK+ ++S   + ++ 
Sbjct: 28  TPGAKKDAVG-------KAQGIQLRVRVTATPVAGRATDHMVRFLAKEFSVSPDDITVVF 80

Query: 69  KQSSPLKIIYIDKD---CKEITELLQNNDS 95
            + +  K + I         + + L   DS
Sbjct: 81  GRLNINKQLRIKAPKKMPSVVAKALSKKDS 110


>gi|260432898|ref|ZP_05786869.1| conserved hypothetical protein [Silicibacter lacuscaerulensis
          ITI-1157]
 gi|260416726|gb|EEX09985.1| conserved hypothetical protein [Silicibacter lacuscaerulensis
          ITI-1157]
          Length = 91

 Score = 70.1 bits (171), Expect = 9e-11,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 8/78 (10%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
          ++ VR+ P A +  I         +D   + I VTA  + GKAN A+  +LAK + ++ S
Sbjct: 21 HIQVRVTPKAARDRI--------QADESSVHIAVTAPAEGGKANLAVARILAKAMGIAPS 72

Query: 63 SLRMLSKQSSPLKIIYID 80
          +L +   Q++  K+   +
Sbjct: 73 ALILKQGQTARNKLFVYE 90


>gi|91205448|ref|YP_537803.1| hypothetical protein RBE_0633 [Rickettsia bellii RML369-C]
 gi|157827193|ref|YP_001496257.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389]
 gi|122425693|sp|Q1RIV0|Y633_RICBR RecName: Full=UPF0235 protein RBE_0633
 gi|226706174|sp|A8GWJ3|Y4385_RICB8 RecName: Full=UPF0235 protein A1I_04385
 gi|91068992|gb|ABE04714.1| unknown [Rickettsia bellii RML369-C]
 gi|157802497|gb|ABV79220.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389]
          Length = 104

 Score = 70.1 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 3/89 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           ++ +++   AK + I    I  D     H+K+ + A  Q+GKAN+ ++  LAK+  L +
Sbjct: 13 ASLNIKVKAAAKSNDIKEFIIINDVL---HLKLSIKAHAQQGKANEEIINFLAKEWQLLR 69

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          S+L +    ++ LK I I    +E   L+
Sbjct: 70 SNLEITKGHTNSLKTILIKNIDEEYLNLI 98


>gi|222444857|ref|ZP_03607372.1| hypothetical protein METSMIALI_00470 [Methanobrevibacter smithii
           DSM 2375]
 gi|222434422|gb|EEE41587.1| hypothetical protein METSMIALI_00470 [Methanobrevibacter smithii
           DSM 2375]
          Length = 101

 Score = 69.8 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 13/92 (14%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V + + PN+ K  I+              +I++   PQKGKANK ++  L+K        
Sbjct: 18  VDIEVSPNSNKFQISGFN-----EWRNRFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69

Query: 64  LRMLSKQSSPLKIIY-----IDKDCKEITELL 90
           + +   + S  K I      ID    +++E+L
Sbjct: 70  VSISKGEKSSQKTIVCYNVSIDDILDKLSEIL 101


>gi|133930345|ref|NP_001076616.1| hypothetical protein W01A8.2 [Caenorhabditis elegans]
 gi|114420882|emb|CAL44973.1| C. elegans protein W01A8.2, confirmed by transcript evidence
           [Caenorhabditis elegans]
          Length = 127

 Score = 69.8 bits (170), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 10/93 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AKKS + ++           + + + A P++G AN+ +++ L   L L K+ 
Sbjct: 39  LHIHAKPGAKKSCVVAI-------GDSEVDVAIGAAPREGAANEELISYLMSALGLRKNE 91

Query: 64  LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93
           L+      S  K++ ID       E+ + LQ  
Sbjct: 92  LQFDKGAKSRSKVVLIDTKRLTIDEVRKKLQEE 124


>gi|261402905|ref|YP_003247129.1| protein of unknown function DUF167 [Methanocaldococcus vulcanius
           M7]
 gi|261369898|gb|ACX72647.1| protein of unknown function DUF167 [Methanocaldococcus vulcanius
           M7]
          Length = 102

 Score = 69.4 bits (169), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 21/95 (22%), Positives = 41/95 (43%), Gaps = 9/95 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + +  NAK++ IA +           + +K+ A   +GKANK ++          K  
Sbjct: 15  IDIDVQANAKRNEIAGIN-----EWRKRLSVKIKAPAIEGKANKEIIKFFGNLF---KKD 66

Query: 64  LRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSLT 97
           + ++  ++S  K I I    K  + E+L+     T
Sbjct: 67  VEIILGKTSSQKTILILGAKKGYVEEILKKELEKT 101


>gi|24214436|ref|NP_711917.1| hypothetical protein LA_1736 [Leptospira interrogans serovar Lai
          str. 56601]
 gi|45657916|ref|YP_002002.1| hypothetical protein LIC12068 [Leptospira interrogans serovar
          Copenhageni str. Fiocruz L1-130]
 gi|29839712|sp|Q8F5E6|Y1736_LEPIN RecName: Full=UPF0235 protein LA_1736
 gi|73921077|sp|Q72QP5|Y2068_LEPIC RecName: Full=UPF0235 protein LIC_12068
 gi|24195381|gb|AAN48935.1|AE011350_4 conserved hypothetical protein [Leptospira interrogans serovar
          Lai str. 56601]
 gi|45601157|gb|AAS70639.1| conserved hypothetical protein [Leptospira interrogans serovar
          Copenhageni str. Fiocruz L1-130]
          Length = 73

 Score = 69.4 bits (169), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
              V + PN+KK       +     +   + I V     +GKAN+A++  ++K++ + K
Sbjct: 1  MKFTVYVKPNSKK-------VFFRKEEDGVLTIAVREPALEGKANEAVIESISKEMKVPK 53

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S +R+LS Q +  KII ID
Sbjct: 54 SKIRILSGQKNKKKIIEID 72


>gi|255632017|gb|ACU16361.1| unknown [Glycine max]
          Length = 126

 Score = 69.4 bits (169), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 11/82 (13%), Positives = 35/82 (42%), Gaps = 7/82 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + +  +             +++ A  + G+AN A+L  ++  L + +  
Sbjct: 39  ITIHAKPGSKSASVTDISDEAVG-------VQIDAPARDGEANAALLDYISSVLGVKRRQ 91

Query: 64  LRMLSKQSSPLKIIYIDKDCKE 85
           + + +   S  K + ++   ++
Sbjct: 92  VSLGTGSKSRDKTVIVEDVTQQ 113


>gi|159905985|ref|YP_001549647.1| hypothetical protein MmarC6_1603 [Methanococcus maripaludis C6]
 gi|226734735|sp|A9AAP2|Y1603_METM6 RecName: Full=UPF0235 protein MmarC6_1603
 gi|159887478|gb|ABX02415.1| protein of unknown function DUF167 [Methanococcus maripaludis C6]
          Length = 101

 Score = 69.4 bits (169), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 9/91 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + +  NAKK+ I  +           ++I++   P +GKANKA++  L     + KS 
Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIKEQPIEGKANKAIIKFLK---GIFKSE 66

Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
          + + S  +S  K + I DK  +++ ++L+  
Sbjct: 67 ISINSGTTSSQKTVLIPDKTKEDVVKILKKE 97


>gi|121712776|ref|XP_001273999.1| YggU family protein [Aspergillus clavatus NRRL 1]
 gi|119402152|gb|EAW12573.1| YggU family protein [Aspergillus clavatus NRRL 1]
          Length = 133

 Score = 69.4 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 18/78 (23%), Positives = 32/78 (41%), Gaps = 5/78 (6%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
            +   + PNA  S                + + V A P+ G+AN A+  + A+   + KS
Sbjct: 33  QIACHVKPNASSSR-----EGVVAIGPEKVDVCVAAVPRNGEANIAVARVFAQIFDVPKS 87

Query: 63  SLRMLSKQSSPLKIIYID 80
           +  ++    S  KI+ I 
Sbjct: 88  NAEVIRGLKSRDKILCIT 105


>gi|94496543|ref|ZP_01303119.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58]
 gi|94423903|gb|EAT08928.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58]
          Length = 107

 Score = 69.4 bits (169), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 2/78 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +RL P + + GI  +   +D      +  +V A P+KG+AN A++A+LA+ L   KS+
Sbjct: 13 LAIRLTPGSARQGIGGV--WRDDRAAPWLTARVRAVPEKGRANTALIALLAQALDWPKSA 70

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + + S  S+ LK + I  
Sbjct: 71 IMLESGDSNRLKRLRIIG 88


>gi|301166027|emb|CBW25601.1| conserved hypothetical protein [Bacteriovorax marinus SJ]
          Length = 109

 Score = 69.0 bits (168), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 9/75 (12%)

Query: 4  VIVRLIPNAKKSG-IASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
          + +   P AK    I          D   + I +   P  G+ANKA +  LA +L+++KS
Sbjct: 17 LNIWAKPGAKVEKSIVG--------DEGEIIIYIKERPIDGQANKAFIKYLAAQLSITKS 68

Query: 63 SLRMLSKQSSPLKII 77
          S+ +     S  K  
Sbjct: 69 SVSLSRGSKSRFKRF 83


>gi|118430921|ref|NP_147028.2| hypothetical protein APE_0182.1 [Aeropyrum pernix K1]
 gi|150421715|sp|Q9YFR7|Y182_AERPE RecName: Full=UPF0235 protein APE_0182.1
 gi|116062246|dbj|BAA79094.2| conserved hypothetical protein [Aeropyrum pernix K1]
          Length = 108

 Score = 69.0 bits (168), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 16/94 (17%), Positives = 37/94 (39%), Gaps = 10/94 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V + P  +          +   +   +       P +G+AN +++  LA+ L +S 
Sbjct: 19  VRIRVYVKPEGR--------ERRLRLEEGELVFYTDEPPLEGRANASLINFLARGLKVSV 70

Query: 62  SSLRMLSKQSSPLKIIYID--KDCKEITELLQNN 93
            ++ ++    S  K++ I    D   + E L + 
Sbjct: 71  KNIEIVHGARSRSKVVEIRDVADPDALLERLASI 104


>gi|254475305|ref|ZP_05088691.1| conserved hypothetical protein [Ruegeria sp. R11]
 gi|214029548|gb|EEB70383.1| conserved hypothetical protein [Ruegeria sp. R11]
          Length = 96

 Score = 69.0 bits (168), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 3/75 (4%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
          +++VR+ P A +  +   +     S  + +KI  T  P+ GKA +A+  +LA  + ++  
Sbjct: 21 DILVRVTPKAARDSV---QRTSGDSGELVLKITTTTAPENGKATEAVRKLLATAMRVAPR 77

Query: 63 SLRMLSKQSSPLKII 77
           L +L   +S  K+ 
Sbjct: 78 DLVLLRGATSREKLF 92


>gi|150402236|ref|YP_001329530.1| hypothetical protein MmarC7_0309 [Methanococcus maripaludis C7]
 gi|166229378|sp|A6VG03|Y309_METM7 RecName: Full=UPF0235 protein MmarC7_0309
 gi|150033266|gb|ABR65379.1| protein of unknown function DUF167 [Methanococcus maripaludis C7]
          Length = 101

 Score = 69.0 bits (168), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 9/91 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + +  NAKK+ I  +           ++I++   P +GKANKA++  L     + KS 
Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIREQPIEGKANKAIVKFLK---GIFKSE 66

Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
          + + S  +S  K + I DK  +++ ++L+  
Sbjct: 67 IFINSGTTSSQKTVLIPDKTKEDVVKILKKE 97


>gi|195953751|ref|YP_002122041.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1]
 gi|226734129|sp|B4U5M3|Y1378_HYDS0 RecName: Full=UPF0235 protein HY04AAS1_1378
 gi|195933363|gb|ACG58063.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1]
          Length = 73

 Score = 69.0 bits (168), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 43/78 (55%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V++ PNAK   +  LE          +KI + + P  GKAN+ ++ +L++ L +SK
Sbjct: 1  MILRVKVKPNAKTVSVEQLE-------DKSLKISIKSPPVNGKANEELIKVLSEFLKVSK 53

Query: 62 SSLRMLSKQSSPLKIIYI 79
          S + + + +SS  K++ I
Sbjct: 54 SKINIKAGKSSREKLVEI 71


>gi|90399179|emb|CAJ86041.1| H0723C07.11 [Oryza sativa Indica Group]
 gi|125550304|gb|EAY96126.1| hypothetical protein OsI_18003 [Oryza sativa Indica Group]
          Length = 125

 Score = 69.0 bits (168), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 13/76 (17%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 38  ISIHAKPGSKLATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 90

Query: 64  LRMLSKQSSPLKIIYI 79
           + + S   S  K++ +
Sbjct: 91  VSIGSGSKSREKVVLV 106


>gi|319956454|ref|YP_004167717.1| hypothetical protein Nitsa_0701 [Nitratifractor salsuginis DSM
          16511]
 gi|319418858|gb|ADV45968.1| protein of unknown function DUF167 [Nitratifractor salsuginis DSM
          16511]
          Length = 98

 Score = 68.6 bits (167), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 23/96 (23%), Positives = 48/96 (50%), Gaps = 8/96 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          + ++ ++  P A +S  A L           +K+++ A   +G ANK ++  L+K   + 
Sbjct: 9  LASLKIKAQPGASRSEFAGLY------GDEAIKVRIAAAAVEGAANKELVKFLSKAFKVP 62

Query: 61 KSSLRMLSKQSSPLKIIYIDKDC--KEITELLQNND 94
          KSS+R  S +++ +K++        KE  E L++ +
Sbjct: 63 KSSIRFKSGETAKIKVVEFPYSEKFKEFLEKLEDRN 98


>gi|114568606|ref|YP_755286.1| hypothetical protein Mmar10_0052 [Maricaulis maris MCS10]
 gi|114339068|gb|ABI64348.1| protein of unknown function DUF167 [Maricaulis maris MCS10]
          Length = 108

 Score = 68.6 bits (167), Expect = 2e-10,   Method: Composition-based stats.
 Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V VRL P A +  +A      D ++  ++  +V A P+KGKAN A++A+LA++L + K  
Sbjct: 11  VQVRLAPGASRDCLAG--SQADAAERHYLVARVRAIPEKGKANAALIALLARQLGIPKRD 68

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97
           + ++   +S +K + I  +  E   ++   ++  
Sbjct: 69  IDVIRGATSRMKTVRISANGPEQDRIVAQLEAFA 102


>gi|89898566|ref|YP_515676.1| hypothetical protein CF0759 [Chlamydophila felis Fe/C-56]
 gi|89331938|dbj|BAE81531.1| conserved hypothetical protein [Chlamydophila felis Fe/C-56]
          Length = 65

 Score = 68.6 bits (167), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 23/57 (40%), Positives = 38/57 (66%)

Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITE 88
          +KI+VT  P+KG+AN+A++A+LAK L+L K  + ++S  SS  K + + K  + I  
Sbjct: 2  LKIRVTEAPEKGRANEAVIALLAKTLSLPKRDVTLISGDSSRKKRLLLPKAAESIIS 58


>gi|20092890|ref|NP_618965.1| hypothetical protein MA4097 [Methanosarcina acetivorans C2A]
 gi|19918198|gb|AAM07445.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 123

 Score = 68.6 bits (167), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 17/80 (21%), Positives = 35/80 (43%), Gaps = 4/80 (5%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P ++   +              + +K+T   QKGKAN+ ++  LA+   +S S 
Sbjct: 34  IEIEVTPGSRSLSV----PSGYNEWRKRIAVKLTKNAQKGKANEQLIESLAELFGISSSE 89

Query: 64  LRMLSKQSSPLKIIYIDKDC 83
           + + S  +S  K + I    
Sbjct: 90  ILINSGATSSKKSLLIKGIS 109


>gi|302877711|ref|YP_003846275.1| hypothetical protein Galf_0467 [Gallionella capsiferriformans ES-2]
 gi|302580500|gb|ADL54511.1| protein of unknown function DUF167 [Gallionella capsiferriformans
           ES-2]
          Length = 104

 Score = 68.6 bits (167), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 14/83 (16%), Positives = 38/83 (45%), Gaps = 8/83 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            PN+K+  I  +           +++   A P++G A   M+  LA   ++ + ++ ++ 
Sbjct: 26  RPNSKQDAIGRV-------IGHQLEVYAAAVPRRGGATAHMVQYLASIFSVPEHAITVVF 78

Query: 69  KQSSPLKIIYIDKDCKEITELLQ 91
            + +  K + I+    ++  ++Q
Sbjct: 79  GEKNVNKQLRIE-SPGKLPSVIQ 100


>gi|163757393|ref|ZP_02164482.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43]
 gi|162284895|gb|EDQ35177.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43]
          Length = 63

 Score = 68.6 bits (167), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 24/59 (40%), Positives = 39/59 (66%)

Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          MKI+V A  +  KAN+A+ A LAK+L L+KS +R++S  +S  K + ++ D +E+   L
Sbjct: 1  MKIRVRAVAENNKANRALEAFLAKRLKLAKSRVRVISGANSRTKTVRLEGDPQELAAKL 59


>gi|330469281|ref|YP_004407024.1| hypothetical protein VAB18032_26756 [Verrucosispora maris
          AB-18-032]
 gi|328812252|gb|AEB46424.1| hypothetical protein VAB18032_26756 [Verrucosispora maris
          AB-18-032]
          Length = 98

 Score = 68.6 bits (167), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 3/87 (3%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P A ++ +       D      + I V A    G+A +A    LA+ L +  ++
Sbjct: 8  VAVRVKPGASRARVGG---RYDGPHGPALVIAVNAPAVDGRATEAARRALAEALGVRPAT 64

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90
          + + +  SS  K+  I      +T  L
Sbjct: 65 VSLRTGASSRDKLFQITPASPTLTAHL 91


>gi|296284807|ref|ZP_06862805.1| hypothetical protein CbatJ_14361 [Citromicrobium bathyomarinum
          JL354]
          Length = 92

 Score = 68.2 bits (166), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 8/77 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P A+   +A ++          +++KV A PQ G AN A+  ++AK L + KS 
Sbjct: 23 LRVRVTPGARTESLAIVD--------GGVQVKVRAKPQDGAANVAVAELVAKALGIPKSR 74

Query: 64 LRMLSKQSSPLKIIYID 80
            +L   +S  K++ + 
Sbjct: 75 CTLLRGATSREKVLGVT 91


>gi|261350574|ref|ZP_05975991.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
           2374]
 gi|288861357|gb|EFC93655.1| conserved hypothetical protein [Methanobrevibacter smithii DSM
           2374]
          Length = 101

 Score = 68.2 bits (166), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 22/92 (23%), Positives = 39/92 (42%), Gaps = 13/92 (14%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V + + PN+ K  I+              +I++   PQKGKANK ++  L+K        
Sbjct: 18  VDIEVSPNSNKFQISGFN-----EWRNRFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69

Query: 64  LRMLSKQSSPLKIIY-----IDKDCKEITELL 90
           + +   + S  K I      I+    ++ E+L
Sbjct: 70  VSISKGEKSSQKTIVCYDVSIEYILDKLGEIL 101


>gi|323356791|ref|YP_004223187.1| hypothetical protein MTES_0343 [Microbacterium testaceum StLB037]
 gi|323273162|dbj|BAJ73307.1| uncharacterized conserved protein [Microbacterium testaceum
          StLB037]
          Length = 73

 Score = 68.2 bits (166), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 12/80 (15%), Positives = 30/80 (37%), Gaps = 7/80 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P +++  +              + + V      G AN  ++  LA    +  
Sbjct: 1  MQLTVRVKPGSRRGPLV-------EDTAEGLVVHVRERAVDGAANSGVVKALAAHFGVPA 53

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
            + +L   ++ +K + +D 
Sbjct: 54 RDVEILRGHAARVKRVEVDA 73


>gi|116754806|ref|YP_843924.1| hypothetical protein Mthe_1512 [Methanosaeta thermophila PT]
 gi|116754813|ref|YP_843931.1| hypothetical protein Mthe_1519 [Methanosaeta thermophila PT]
 gi|116754820|ref|YP_843938.1| hypothetical protein Mthe_1526 [Methanosaeta thermophila PT]
 gi|116666257|gb|ABK15284.1| protein of unknown function DUF167 [Methanosaeta thermophila PT]
 gi|116666264|gb|ABK15291.1| protein of unknown function DUF167 [Methanosaeta thermophila PT]
 gi|116666271|gb|ABK15298.1| protein of unknown function DUF167 [Methanosaeta thermophila PT]
          Length = 122

 Score = 68.2 bits (166), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 15/91 (16%), Positives = 42/91 (46%), Gaps = 5/91 (5%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + + +IP +       +      +    +++++T    KGKAN+ +L  +++   L++
Sbjct: 32  VVLHIDVIPGSS----ELVVPAGFNAWRGSIEVRLTERADKGKANRQLLQEISRAFGLTQ 87

Query: 62  SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91
             + ++S   S  K++ +   D + +   L+
Sbjct: 88  GDVEIMSGHRSQRKVVLMRGIDAESVLAALR 118


>gi|154150390|ref|YP_001404008.1| hypothetical protein Mboo_0847 [Candidatus Methanoregula boonei
          6A8]
 gi|153998942|gb|ABS55365.1| protein of unknown function DUF167 [Methanoregula boonei 6A8]
          Length = 104

 Score = 68.2 bits (166), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 18/84 (21%), Positives = 35/84 (41%), Gaps = 4/84 (4%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V + +   AK      L           +  +V A    GKANKA++ +++    +  
Sbjct: 15 VLVSIEVTAGAKSD----LFPAGYNEWRKAIGCRVAAPAVNGKANKAVIGIISAGTGVPA 70

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
          +S+ +++  +S  K + I    KE
Sbjct: 71 ASVTIVAGLTSSQKKVRIAGITKE 94


>gi|322697515|gb|EFY89294.1| DUF167 domain protein [Metarhizium acridum CQMa 102]
          Length = 122

 Score = 68.2 bits (166), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 12/104 (11%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +R+     K     +    D      +++ VTA P+ G+ANKA++  L+  + + K
Sbjct: 22  VQLQLRVKAGTSKDR-EGILAVTDR----AIELCVTAQPRHGEANKAVVQALSNAIGIPK 76

Query: 62  SSLRMLSKQSSPLKIIYI-----DKD--CKEITELLQNNDSLTL 98
           S  R +S   S  K++ I     D     + +  LL+     TL
Sbjct: 77  SRFRFVSGLKSRDKVVAIGDIQGDGPDYTETVLRLLREASYRTL 120


>gi|15668799|ref|NP_247602.1| hypothetical protein MJ_0618 [Methanocaldococcus jannaschii DSM
          2661]
 gi|2496078|sp|Q58035|Y618_METJA RecName: Full=UPF0235 protein MJ0618
 gi|1591329|gb|AAB98613.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM
          2661]
          Length = 98

 Score = 68.2 bits (166), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + +  NAKK+ I  +           + IK+ A   +GKANK ++    +     K
Sbjct: 13 VLIDIDVQANAKKNEIVGIN-----EWRKRLSIKIKAPATEGKANKEIIKFFKEIF---K 64

Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQ 91
            + ++S + +P K + I D    E+ E+L+
Sbjct: 65 KDVEIVSGKLNPQKTVLIGDIKKDEVIEILK 95


>gi|119509193|ref|ZP_01628343.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414]
 gi|119466035|gb|EAW46922.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414]
          Length = 52

 Score = 68.2 bits (166), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 19/49 (38%), Positives = 32/49 (65%)

Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD 82
          +++ + P  GKAN+ ++ +LA+K  + KS +R+ S  SS  K+I ID D
Sbjct: 3  VRLKSPPVDGKANEELIKLLAEKFHVPKSHIRIKSGVSSRQKLIEIDTD 51


>gi|116328001|ref|YP_797721.1| hypothetical protein LBL_1291 [Leptospira borgpetersenii serovar
          Hardjo-bovis L550]
 gi|116330879|ref|YP_800597.1| hypothetical protein LBJ_1240 [Leptospira borgpetersenii serovar
          Hardjo-bovis JB197]
 gi|122281367|sp|Q04TD1|Y1240_LEPBJ RecName: Full=UPF0235 protein LBJ_1240
 gi|122284207|sp|Q052F7|Y1291_LEPBL RecName: Full=UPF0235 protein LBL_1291
 gi|116120745|gb|ABJ78788.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
          Hardjo-bovis L550]
 gi|116124568|gb|ABJ75839.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
          Hardjo-bovis JB197]
          Length = 73

 Score = 68.2 bits (166), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
              VR+ PN+KK       I     +   + I V     +GKAN+A++  +++++ + K
Sbjct: 1  MKFTVRVKPNSKK-------IFFRKEEDGSVTIAVREPALEGKANEAVIETISREMKIPK 53

Query: 62 SSLRMLSKQSSPLKIIYID 80
            +R++S +    K I ID
Sbjct: 54 RKIRIVSGEKGKKKTIEID 72


>gi|254419593|ref|ZP_05033317.1| conserved hypothetical protein [Brevundimonas sp. BAL3]
 gi|196185770|gb|EDX80746.1| conserved hypothetical protein [Brevundimonas sp. BAL3]
          Length = 69

 Score = 67.8 bits (165), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 18/53 (33%), Positives = 33/53 (62%)

Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84
          +K++V A P  G+AN A++  LAK L +S+SS+ +     S LK++ ++   +
Sbjct: 2  LKVRVRARPVDGEANAALVKFLAKALGVSRSSVVLERGGQSRLKMVSVEGLDE 54


>gi|288818090|ref|YP_003432438.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6]
 gi|288787490|dbj|BAI69237.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6]
          Length = 73

 Score = 67.8 bits (165), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR  P AK+  +  +            ++ V   PQ+GKAN+ +  +L+  L + K
Sbjct: 1  MILEVRAKPKAKREYVKKIT-------ESVYEVAVKEPPQEGKANERIAVLLSYHLGIPK 53

Query: 62 SSLRMLSKQSSPLKIIYID 80
          S +++L   +S +K+  +D
Sbjct: 54 SRIKLLKGHTSKIKLFQVD 72


>gi|124515876|gb|EAY57385.1| conserved hypothetical protein [Leptospirillum rubarum]
          Length = 104

 Score = 67.8 bits (165), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 9/91 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V +R+ P  KK  +                  ++A  ++G AN  +L  L+  LA   S
Sbjct: 13 RVRIRVRPGKKKDVLL--------LSGEEFSADLSAPAREGAANDRLLRNLSYWLAWPVS 64

Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
          ++R+   +SS LK I I     +E+ + L  
Sbjct: 65 NIRIEKGESSRLKTIAIRGMTGEEVRKRLLA 95


>gi|297568085|ref|YP_003689429.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus
          AHT2]
 gi|296924000|gb|ADH84810.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus
          AHT2]
          Length = 76

 Score = 67.8 bits (165), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 14/79 (17%), Positives = 38/79 (48%), Gaps = 7/79 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +++ P ++ S +       D S    +K    + P +G+AN+ ++A++A      K
Sbjct: 3  MILQIKVKPRSQSSSLT---QEADGSWLARLK----SPPVEGRANRELIALVADHFRCRK 55

Query: 62 SSLRMLSKQSSPLKIIYID 80
          + + + +  S   K++ ++
Sbjct: 56 ADVEIKAGSSGRTKLVRVE 74


>gi|117927694|ref|YP_872245.1| hypothetical protein Acel_0486 [Acidothermus cellulolyticus 11B]
 gi|117648157|gb|ABK52259.1| protein of unknown function DUF167 [Acidothermus cellulolyticus
          11B]
          Length = 85

 Score = 67.8 bits (165), Expect = 4e-10,   Method: Composition-based stats.
 Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 8/91 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +++ + P + ++ +              + + V    + G+A  A LA +A+   + K
Sbjct: 1  MRLVIHVRPGSSRATVGGSH-------NGALIVAVREPAEHGRATDAALAAVAQAFGVPK 53

Query: 62 SSLRMLSKQSSPLKII-YIDKDCKEITELLQ 91
          S +R++S  +S  KII  ID D   + ELL 
Sbjct: 54 SQVRLVSGATSRRKIIDVIDGDPVRLAELLA 84


>gi|148358546|ref|YP_001249753.1| hypothetical protein LPC_0417 [Legionella pneumophila str. Corby]
 gi|296108365|ref|YP_003620066.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99
          Alcoy]
 gi|148280319|gb|ABQ54407.1| conserved hypothetical protein; DUF167 [Legionella pneumophila
          str. Corby]
 gi|295650267|gb|ADG26114.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99
          Alcoy]
          Length = 70

 Score = 67.8 bits (165), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 15/65 (23%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEIT 87
              + I + A PQ+G+AN  +L  +++   + K+ + ++  +SS  K+I +    + + 
Sbjct: 4  SDDRLHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLIRLP-LSESVF 62

Query: 88 ELLQN 92
            L N
Sbjct: 63 RFLNN 67


>gi|297603606|ref|NP_001054326.2| Os04g0686300 [Oryza sativa Japonica Group]
 gi|215687220|dbj|BAG91785.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222629814|gb|EEE61946.1| hypothetical protein OsJ_16702 [Oryza sativa Japonica Group]
 gi|255675903|dbj|BAF16240.2| Os04g0686300 [Oryza sativa Japonica Group]
          Length = 129

 Score = 67.8 bits (165), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 13/76 (17%), Positives = 35/76 (46%), Gaps = 7/76 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + ++  P +K + I  +           + +++ A  + G+AN A++  ++  L + K  
Sbjct: 42  ISIQAKPGSKLATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 94

Query: 64  LRMLSKQSSPLKIIYI 79
           + + S   S  K++ +
Sbjct: 95  VSIGSGSKSREKVVLV 110


>gi|45358618|ref|NP_988175.1| hypothetical protein MMP1055 [Methanococcus maripaludis S2]
 gi|74554395|sp|Q6LYD6|Y1055_METMP RecName: Full=UPF0235 protein MMP1055
 gi|45047484|emb|CAF30611.1| conserved hypothetical protein [Methanococcus maripaludis S2]
          Length = 101

 Score = 67.8 bits (165), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 23/91 (25%), Positives = 44/91 (48%), Gaps = 9/91 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + +  NAKK+ I  +           ++I++   P +GKANKA++  L     + KS 
Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIKEQPIEGKANKAIIKFLK---GIFKSE 66

Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
          + + S  +S  K + I DK   ++  +L+  
Sbjct: 67 ILINSGTTSSQKTVLIPDKTKDDVVTILKKE 97


>gi|149185290|ref|ZP_01863607.1| hypothetical protein ED21_19592 [Erythrobacter sp. SD-21]
 gi|148831401|gb|EDL49835.1| hypothetical protein ED21_19592 [Erythrobacter sp. SD-21]
          Length = 93

 Score = 67.5 bits (164), Expect = 5e-10,   Method: Composition-based stats.
 Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 8/77 (10%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V +R+ P A+   +          +   +  KV A PQ G AN A+  ++A    +  +
Sbjct: 23 RVAIRVTPGARTESLT--------LEGGVLAAKVRAKPQDGAANDAVRKLIAAAYRVPPT 74

Query: 63 SLRMLSKQSSPLKIIYI 79
           + +L   +S  K++ I
Sbjct: 75 RVELLRGATSREKLLRI 91


>gi|149057385|gb|EDM08708.1| similar to RIKEN cDNA 3110040N11, isoform CRA_e [Rattus
          norvegicus]
          Length = 130

 Score = 67.5 bits (164), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 14/67 (20%), Positives = 28/67 (41%), Gaps = 7/67 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +   P +K++ +  L             + + A P +G+AN  +   L+K L L K
Sbjct: 36 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88

Query: 62 SSLRMLS 68
          S + +  
Sbjct: 89 SDVVLDK 95


>gi|331695278|ref|YP_004331517.1| hypothetical protein Psed_1425 [Pseudonocardia dioxanivorans
          CB1190]
 gi|326949967|gb|AEA23664.1| UPF0235 protein yggU [Pseudonocardia dioxanivorans CB1190]
          Length = 89

 Score = 67.5 bits (164), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 18/89 (20%), Positives = 41/89 (46%), Gaps = 8/89 (8%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + +R+ P A ++ +         +    + ++V      G A  A L  +A    + + 
Sbjct: 7  RLTIRVRPGASRTSVGG-------AYADALVVRVNERAVDGAATAAALRAVAGAFGVPRG 59

Query: 63 SLRMLSKQSSPLKIIYID-KDCKEITELL 90
          ++R++S  +S  KI+ ++  D + + ELL
Sbjct: 60 AVRLVSGATSRTKILDVEGGDPERLRELL 88


>gi|296242831|ref|YP_003650318.1| hypothetical protein Tagg_1097 [Thermosphaera aggregans DSM 11486]
 gi|296095415|gb|ADG91366.1| protein of unknown function DUF167 [Thermosphaera aggregans DSM
           11486]
          Length = 111

 Score = 67.5 bits (164), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 44/96 (45%), Gaps = 9/96 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + VR+ P +        E    T ++  +  + +   ++G+AN +++  L+++L +  
Sbjct: 23  VILQVRVKPGS--------EPEGFTIESDELVFRTSEPAERGRANASLIKYLSRELKIPV 74

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSL 96
           S + ++  Q   LK + I  +  + I E L    +L
Sbjct: 75  SKIDIVYGQREKLKKVLIMDEPADKIIEKLARVLNL 110


>gi|322708042|gb|EFY99619.1| DUF167 domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 126

 Score = 67.5 bits (164), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 25/104 (24%), Positives = 44/104 (42%), Gaps = 12/104 (11%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +R+     K     +    D      +++ V A P+ G+ANKA++  L+  + + K
Sbjct: 26  VQLQLRVKAGTSKDR-EGILAVTDR----AIELCVAAQPRHGEANKAVVQALSNAIGIPK 80

Query: 62  SSLRMLSKQSSPLKIIYI-----DKD--CKEITELLQNNDSLTL 98
           S  R +S   S  K++ I     D     + I  LL+     TL
Sbjct: 81  SRFRFVSGIKSRDKVVAIGDIQGDGPEYTETILRLLREASHRTL 124


>gi|121603294|ref|YP_980623.1| hypothetical protein Pnap_0379 [Polaromonas naphthalenivorans CJ2]
 gi|120592263|gb|ABM35702.1| protein of unknown function DUF167 [Polaromonas naphthalenivorans
           CJ2]
          Length = 107

 Score = 67.5 bits (164), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 18/82 (21%), Positives = 34/82 (41%), Gaps = 8/82 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P+A K  I              +K+ VTA P  GKA   M+  LA    ++ + + ++ 
Sbjct: 29  KPSAGKDAIG-------KPKGTQLKVSVTAAPLAGKATDHMVRFLAPLFGVAVADIEVVF 81

Query: 69  KQSSPLKIIYIDKDCKEITELL 90
            + +  K + I    K++  + 
Sbjct: 82  GRENVNKQLRIRA-PKKLPAVF 102


>gi|297297113|ref|XP_002804962.1| PREDICTED: UPF0235 protein C15orf40-like isoform 2 [Macaca mulatta]
          Length = 150

 Score = 67.5 bits (164), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 64  VTITIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 116

Query: 62  SSLRMLS 68
           S + +  
Sbjct: 117 SDVVLDK 123


>gi|237858670|ref|NP_001153588.1| hypothetical protein LOC123207 isoform e [Homo sapiens]
          Length = 149

 Score = 67.5 bits (164), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 63  VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115

Query: 62  SSLRMLS 68
           S + +  
Sbjct: 116 SDVVLDK 122


>gi|189501744|ref|YP_001957461.1| hypothetical protein Aasi_0294 [Candidatus Amoebophilus asiaticus
          5a2]
 gi|259646928|sp|B3ER80|Y294_AMOA5 RecName: Full=UPF0235 protein Aasi_0294
 gi|189497185|gb|ACE05732.1| hypothetical protein Aasi_0294 [Candidatus Amoebophilus asiaticus
          5a2]
          Length = 97

 Score = 67.1 bits (163), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 21/79 (26%), Positives = 42/79 (53%), Gaps = 7/79 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V+ IP +K S I              + I++T+ P+ GKAN+ ++ ++AK L L +++
Sbjct: 5  IEVKAIPKSKISAIT-------IDKLGRLCIRITSAPENGKANREIIKLIAKTLKLPQAN 57

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          + +++  +  LK I I   
Sbjct: 58 VEIIAGLTIKLKRIRITSS 76


>gi|330507436|ref|YP_004383864.1| hypothetical protein MCON_1367 [Methanosaeta concilii GP-6]
 gi|328928244|gb|AEB68046.1| hypothetical protein MCON_1367 [Methanosaeta concilii GP-6]
          Length = 105

 Score = 67.1 bits (163), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 14/84 (16%), Positives = 40/84 (47%), Gaps = 4/84 (4%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
          C +   ++P + +  +              ++ ++T  P +G+AN+ ++  LA+ L + +
Sbjct: 14 CIIRFEVVPGSSRLAV----PSGFNPWRRSLEARLTEKPSRGRANRQLVEELARILGVDE 69

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
          S + ++  + S  K++ +    K+
Sbjct: 70 SGIEVIKGEKSGRKLLLVKGIEKD 93


>gi|148284889|ref|YP_001248979.1| hypothetical protein OTBS_1640 [Orientia tsutsugamushi str.
           Boryong]
 gi|146740328|emb|CAM80735.1| conserved hypothetical protein [Orientia tsutsugamushi str.
           Boryong]
          Length = 112

 Score = 67.1 bits (163), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 19/95 (20%), Positives = 44/95 (46%), Gaps = 7/95 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +++   AK + I  L    + S    + I +   P+  KAN+ ++  L++ L   +S+
Sbjct: 20  INLKVKAGAKINKIIGLYHINNKS---FLYISINTIPENNKANQLIIKFLSQWLETGRSN 76

Query: 64  LRMLSKQSSPLKIIYIDKD----CKEITELLQNND 94
           ++++    S LK+I +          I   L++ +
Sbjct: 77  IKIVYGLHSNLKVISVMNTNGNISNLIISKLKSIN 111


>gi|307611594|emb|CBX01276.1| hypothetical protein LPW_29741 [Legionella pneumophila 130b]
          Length = 70

 Score = 67.1 bits (163), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 13/52 (25%), Positives = 29/52 (55%)

Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79
              + I + A PQ+G+AN  +L  +++   + K+ + ++  +SS  K+I +
Sbjct: 4  SDDSLHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLIRL 55


>gi|237858664|ref|NP_001153585.1| hypothetical protein LOC123207 isoform b [Homo sapiens]
          Length = 167

 Score = 67.1 bits (163), Expect = 9e-10,   Method: Composition-based stats.
 Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 63  VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115

Query: 62  SSLRMLS 68
           S + +  
Sbjct: 116 SDVVLDK 122


>gi|237858666|ref|NP_001153586.1| hypothetical protein LOC123207 isoform c [Homo sapiens]
          Length = 153

 Score = 66.7 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 63  VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115

Query: 62  SSLRMLSKQSSPLKII 77
           S + +  K    LKI+
Sbjct: 116 SDVVLDKKLRDLLKIV 131


>gi|206602120|gb|EDZ38602.1| Conserved hypothetical protein [Leptospirillum sp. Group II
          '5-way CG']
          Length = 102

 Score = 66.7 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 9/91 (9%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           V +R+ P  KK  +                  ++A  ++G AN  +L  L++ LA   S
Sbjct: 11 RVRIRVRPGKKKDVL--------FLSGEEFSADLSAPAREGAANDRLLRNLSQWLAWPVS 62

Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
           +R+   Q+S LK I I     +EI + L  
Sbjct: 63 KIRLEKGQASRLKTIVIAGMTGEEIRKRLLA 93


>gi|20093713|ref|NP_613560.1| hypothetical protein MK0273 [Methanopyrus kandleri AV19]
 gi|29839574|sp|Q8TYM3|Y273_METKA RecName: Full=UPF0235 protein MK0273
 gi|19886604|gb|AAM01490.1| Uncharacterized conserved protein [Methanopyrus kandleri AV19]
          Length = 96

 Score = 66.7 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 44/88 (50%), Gaps = 9/88 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VR+ P+A  + +  ++          +++ V A P KGKAN+ +L  L +KL    ++
Sbjct: 14 IRVRVNPDADTTDLKGVD-----EWRGVLEVDVAAPPVKGKANRELLEFLGRKLN---TT 65

Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELL 90
            ++S + S  K++   D    E+ E L
Sbjct: 66 CELVSGEKSREKLVLARDVSVDEVKERL 93


>gi|86136531|ref|ZP_01055110.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193]
 gi|85827405|gb|EAQ47601.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193]
          Length = 91

 Score = 66.7 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 8/77 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+ P A  + I        +     +K+ VT+ P+ GKA +A+ ++LAK + ++ 
Sbjct: 20 AKIAVRVTPKAASNSI--------SVSEAGLKVTVTSVPENGKATEAVRSLLAKAMGVAA 71

Query: 62 SSLRMLSKQSSPLKIIY 78
          S L +    +S  K+  
Sbjct: 72 SKLDLSQGATSRNKVFV 88


>gi|150401269|ref|YP_001325035.1| hypothetical protein Maeo_0841 [Methanococcus aeolicus Nankai-3]
 gi|166235100|sp|A6UVA1|Y841_META3 RecName: Full=UPF0235 protein Maeo_0841
 gi|150013972|gb|ABR56423.1| protein of unknown function DUF167 [Methanococcus aeolicus
           Nankai-3]
          Length = 101

 Score = 66.7 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 10/92 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + V +  NAKK+ I  +           + IK+ A P  GKANK +     K        
Sbjct: 17  IDVEISSNAKKNEIGEIN-----EWRKRLIIKIKALPVDGKANKEIAKFFKKTFG---KD 68

Query: 64  LRMLSKQSSPLKIIYIDKDCKE--ITELLQNN 93
           + ++S  +S  K I +    K+  I ++L+NN
Sbjct: 69  IIIVSGLTSSQKTICVIGATKDEIIDKILKNN 100


>gi|312084117|ref|XP_003144143.1| hypothetical protein LOAG_08565 [Loa loa]
 gi|307760694|gb|EFO19928.1| hypothetical protein LOAG_08565 [Loa loa]
          Length = 129

 Score = 66.7 bits (162), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 19/95 (20%), Positives = 43/95 (45%), Gaps = 10/95 (10%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   PNAK + +  +           +++ + A P  G+AN+A++  +   L L K+ 
Sbjct: 39  LRIHAKPNAKTTRVIDI-------GANEVELAIAAPPHDGQANEALINAMMDILELRKNE 91

Query: 64  LRMLSKQSSPLKIIYI---DKDCKEITELLQNNDS 95
           +   +   S  K++ +       +E+ E L+ N +
Sbjct: 92  ITFDTGARSRSKVLRLMSKRITLEEVREKLERNAT 126


>gi|156039347|ref|XP_001586781.1| predicted protein [Sclerotinia sclerotiorum 1980]
 gi|154697547|gb|EDN97285.1| predicted protein [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 128

 Score = 66.3 bits (161), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 23/107 (21%), Positives = 45/107 (42%), Gaps = 15/107 (14%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            ++   + PN   S   +++   DTS    ++I V    +  +AN+ ++ +L   L   K
Sbjct: 20  IHLRCHVKPNVSASRAGAIKPFTDTSP--ILEICVPEPARDNEANEGVIELLGDVLLCPK 77

Query: 62  SSLRMLSKQSSPLKII---YIDK----------DCKEITELLQNNDS 95
           + L ++  + S  K+I    ++             KE+ E LQ   S
Sbjct: 78  TDLEVIRGKKSRDKVIAYRSLEGLWINCAIARMTVKELRERLQEASS 124


>gi|21226924|ref|NP_632846.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1]
 gi|29839564|sp|Q8PYN9|Y822_METMA RecName: Full=UPF0235 protein MM_0822
 gi|20905233|gb|AAM30518.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1]
          Length = 108

 Score = 66.3 bits (161), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 4/78 (5%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V + + P ++   +              +++K+T   QKGKAN+ ++  LA+   +  S 
Sbjct: 19 VDIEVTPGSRSLSV----PSGYNEWRKRIEVKLTRNAQKGKANEQLIESLAELFGICSSD 74

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + + S  +S  K + I  
Sbjct: 75 IFISSGATSSKKSLLIKG 92


>gi|73668032|ref|YP_304047.1| hypothetical protein Mbar_A0485 [Methanosarcina barkeri str.
          Fusaro]
 gi|72395194|gb|AAZ69467.1| conserved hypothetical protein [Methanosarcina barkeri str.
          Fusaro]
          Length = 100

 Score = 66.3 bits (161), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 16/80 (20%), Positives = 38/80 (47%), Gaps = 4/80 (5%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + P ++   +              +++K+T   QKGKAN+ ++  LA+  ++S 
Sbjct: 5  VIIDIEVTPGSRSISV----PSGYNEWRKRIEVKLTKNAQKGKANEQLVECLAELFSISS 60

Query: 62 SSLRMLSKQSSPLKIIYIDK 81
          S++ + S  +S  K + +  
Sbjct: 61 SNILINSGATSSKKSLLLKG 80


>gi|91773909|ref|YP_566601.1| hypothetical protein Mbur_1971 [Methanococcoides burtonii DSM 6242]
 gi|91712924|gb|ABE52851.1| protein of unknown function DUF167 [Methanococcoides burtonii DSM
           6242]
          Length = 107

 Score = 66.3 bits (161), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 19/91 (20%), Positives = 44/91 (48%), Gaps = 5/91 (5%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + + + P +KK+                ++IK+T+  QKGKAN  ++ ++A    + +  
Sbjct: 18  IDLEVTPGSKKACF----PAGYNQWRERIEIKLTSAAQKGKANGQLIEIVADFFNIGQRE 73

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93
           + + S   S  K + I++ D +++   L++ 
Sbjct: 74  VIIGSGAKSSKKTVIINRPDQEQVVLALESI 104


>gi|294010668|ref|YP_003544128.1| hypothetical protein SJA_C1-06820 [Sphingobium japonicum UT26S]
 gi|292673998|dbj|BAI95516.1| conserved hypothetical protein [Sphingobium japonicum UT26S]
          Length = 103

 Score = 66.3 bits (161), Expect = 1e-09,   Method: Composition-based stats.
 Identities = 23/85 (27%), Positives = 44/85 (51%), Gaps = 2/85 (2%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + VRL P A +  +  +    D      +  +V A P++GKAN A++A+LAK+L   + +
Sbjct: 13 LSVRLTPGAAREEVGGV--WTDDKGANWLSARVRAVPERGKANAALIALLAKRLDWPRGA 70

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88
          + + S  ++ LK + I    + +  
Sbjct: 71 ISLESGDANRLKRLRIKGGGEALAS 95


>gi|134045581|ref|YP_001097067.1| hypothetical protein MmarC5_0538 [Methanococcus maripaludis C5]
 gi|166227318|sp|A4FXC3|Y538_METM5 RecName: Full=UPF0235 protein MmarC5_0538
 gi|132663206|gb|ABO34852.1| protein of unknown function DUF167 [Methanococcus maripaludis C5]
          Length = 101

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 9/91 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + +  NAKK+ I  +           ++I++   P +GKANKA++  L     + KS 
Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIKEQPIEGKANKAIMKFLK---GIFKSE 66

Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
          + + S  +S  K + I DK   +I ++L+  
Sbjct: 67 ILINSGTTSAQKTVLIPDKTKDDIVKILKKE 97


>gi|169629033|ref|YP_001702682.1| hypothetical protein MAB_1946 [Mycobacterium abscessus ATCC
          19977]
 gi|169241000|emb|CAM62028.1| Conserved hypothetical protein [Mycobacterium abscessus]
          Length = 76

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 39/72 (54%), Gaps = 6/72 (8%)

Query: 8  LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
          + P ++K          + +D   + + V      GKANKA +A+LA+ L + KS++R++
Sbjct: 10 IKPGSRK------GPAVEVADDGALTLFVREPAIDGKANKAAIALLAEYLDVPKSTVRLV 63

Query: 68 SKQSSPLKIIYI 79
          + Q+S LK   +
Sbjct: 64 AGQTSRLKRFSV 75


>gi|29839573|sp|Q8TIP5|Y4097_METAC RecName: Full=UPF0235 protein MA_4097
          Length = 109

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 17/80 (21%), Positives = 35/80 (43%), Gaps = 4/80 (5%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P ++   +              + +K+T   QKGKAN+ ++  LA+   +S S 
Sbjct: 20 IEIEVTPGSRSLSV----PSGYNEWRKRIAVKLTKNAQKGKANEQLIESLAELFGISSSE 75

Query: 64 LRMLSKQSSPLKIIYIDKDC 83
          + + S  +S  K + I    
Sbjct: 76 ILINSGATSSKKSLLIKGIS 95


>gi|256810180|ref|YP_003127549.1| protein of unknown function DUF167 [Methanocaldococcus fervens
          AG86]
 gi|256793380|gb|ACV24049.1| protein of unknown function DUF167 [Methanocaldococcus fervens
          AG86]
          Length = 98

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 9/92 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + +   AKK  I  +           + IK+ A   +GKANK ++    +     K
Sbjct: 13 VLIDIDVQAGAKKDEITGIN-----EWRKRLSIKIKAPATEGKANKEIIKFFKEIF---K 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCK-EITELLQN 92
            + +++ + +P K I +    K E+ E L+ 
Sbjct: 65 KDIEIVAGKLNPQKTILVKDIKKDEVIETLKK 96


>gi|86740601|ref|YP_481001.1| hypothetical protein Francci3_1896 [Frankia sp. CcI3]
 gi|86567463|gb|ABD11272.1| protein of unknown function DUF167 [Frankia sp. CcI3]
          Length = 97

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 17/95 (17%), Positives = 41/95 (43%), Gaps = 2/95 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +++R+ P A ++ +      +D      + ++VT     G+A +A L  L   L + +
Sbjct: 1  MRLMIRVQPGAGRTAVGG--RREDPLHGPLLIVRVTEPAVDGRATEAALRALCAALRVRR 58

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96
            +R++   +  +K + ++    +   L    D L
Sbjct: 59 GDVRLVRGATGRVKAVEVEVAAVDEPALRIRIDEL 93


>gi|289192506|ref|YP_003458447.1| protein of unknown function DUF167 [Methanocaldococcus sp.
          FS406-22]
 gi|288938956|gb|ADC69711.1| protein of unknown function DUF167 [Methanocaldococcus sp.
          FS406-22]
          Length = 98

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 8/84 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + +  NAKK+ I  +           + IK+ A   +GKANK ++  L       K
Sbjct: 13 VLIDIDVQANAKKNEIIGIN-----EWRKRLTIKIKAPATEGKANKEIIKFLKDIF---K 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85
            + +++ + +P K + +    K+
Sbjct: 65 KDVEIVAGKLNPQKTVLVKDIKKD 88


>gi|255540269|ref|XP_002511199.1| conserved hypothetical protein [Ricinus communis]
 gi|223550314|gb|EEF51801.1| conserved hypothetical protein [Ricinus communis]
          Length = 232

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    ++  K++ ++    +++ E L
Sbjct: 196 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKL 226


>gi|268326229|emb|CBH39817.1| conserved hypothetical protein, DUF167 family [uncultured
          archaeon]
          Length = 70

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 12/80 (15%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
          M  + +++IPN+K   I   E          M I+V   P KGKANKA++ +L++     
Sbjct: 1  MKRIAIKVIPNSKTEEIIYAEP---------MIIRVKEPPTKGKANKAVVMLLSRYFN-- 49

Query: 61 KSSLRMLSKQSSPLKIIYID 80
           + +R++S   S  KI+ ++
Sbjct: 50 -ADVRIVSGAKSRRKIVEVE 68


>gi|289641324|ref|ZP_06473490.1| protein of unknown function DUF167 [Frankia symbiont of Datisca
           glomerata]
 gi|289508922|gb|EFD29855.1| protein of unknown function DUF167 [Frankia symbiont of Datisca
           glomerata]
          Length = 122

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 15/119 (12%), Positives = 38/119 (31%), Gaps = 24/119 (20%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIH-----------------------MKIKVTA 38
             + +R+ P + ++ +                                     + ++VTA
Sbjct: 1   MRLTIRVSPRSARTSVGGSSPASGAGAPGESPAENPTAADDPTNGSPPDAATPLVVRVTA 60

Query: 39  TPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK-EITELLQNNDSL 96
               G+A ++ L  LA    + +  + ++   +S  KI+ I    +  +   L    + 
Sbjct: 61  PAVDGQATESALRALATAFGVRRREVVLVRGATSRTKIVEISGLPESALRSRLAELTAT 119


>gi|312197091|ref|YP_004017152.1| hypothetical protein FraEuI1c_3270 [Frankia sp. EuI1c]
 gi|311228427|gb|ADP81282.1| protein of unknown function DUF167 [Frankia sp. EuI1c]
          Length = 97

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 2/88 (2%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V +R+ P A  + +       D      + ++V A    GKA +A L  LA  L L +
Sbjct: 1  MRVTIRVRPGASGTAVGGELGGPDGEP--SLVVRVCARAVDGKATEAALRALADALGLRR 58

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89
          + + ++   +S  K++ I     +   L
Sbjct: 59 ADVSLVHGATSRTKLVEIAASPADEPAL 86


>gi|237858668|ref|NP_001153587.1| hypothetical protein LOC123207 isoform d [Homo sapiens]
          Length = 167

 Score = 65.9 bits (160), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + +   P +K++ +  L           + + + A P +G+AN  +   L+K L L K
Sbjct: 63  VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115

Query: 62  SSLRMLS 68
           S + +  
Sbjct: 116 SDVVLDK 122


>gi|30697845|ref|NP_568972.2| unknown protein [Arabidopsis thaliana]
 gi|26452404|dbj|BAC43287.1| unknown protein [Arabidopsis thaliana]
 gi|332010365|gb|AED97748.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 232

 Score = 65.5 bits (159), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + + L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    +S  K++ ++    +++ E L
Sbjct: 196 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 226


>gi|288917345|ref|ZP_06411712.1| protein of unknown function DUF167 [Frankia sp. EUN1f]
 gi|288351210|gb|EFC85420.1| protein of unknown function DUF167 [Frankia sp. EUN1f]
          Length = 118

 Score = 65.5 bits (159), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 14/93 (15%), Positives = 40/93 (43%), Gaps = 3/93 (3%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +V +R+ P + ++ +  +    D      + ++V     +G+AN+A L  +A+ L + +
Sbjct: 17  VHVAIRVRPASDRTAVGPV--TADPIHGQLLVVRVREPAVEGRANEAALRAIAQALGVRR 74

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           + + +       +K + +D     +   +    
Sbjct: 75  ADVTLSR-SIGRVKFVAVDAPDDIVAARVAELA 106


>gi|225456297|ref|XP_002283689.1| PREDICTED: hypothetical protein [Vitis vinifera]
 gi|147823132|emb|CAN75279.1| hypothetical protein VITISV_030868 [Vitis vinifera]
 gi|297734405|emb|CBI15652.3| unnamed protein product [Vitis vinifera]
          Length = 232

 Score = 65.5 bits (159), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    ++  K++ ++    +++ E L
Sbjct: 196 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKL 226


>gi|297793925|ref|XP_002864847.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310682|gb|EFH41106.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 232

 Score = 65.5 bits (159), Expect = 2e-09,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + + L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    +S  K++ ++    +++ E L
Sbjct: 196 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 226


>gi|198282577|ref|YP_002218898.1| hypothetical protein Lferr_0437 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|218665107|ref|YP_002424767.1| conserved hypothetical protein TIGR00251, putative
           [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|198247098|gb|ACH82691.1| protein of unknown function DUF167 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|218517320|gb|ACK77906.1| conserved hypothetical protein TIGR00251, putative
           [Acidithiobacillus ferrooxidans ATCC 23270]
          Length = 116

 Score = 65.1 bits (158), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 7/89 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V + + P AK   I              +KI++ A P  G AN A+ A+L   L L +  
Sbjct: 20  VRIHVQPGAKTDAIGGCH-------GDALKIRLRARPVDGAANAALSALLCATLRLKRQQ 72

Query: 64  LRMLSKQSSPLKIIYIDKDCKEITELLQN 92
           + ++  +S+  K++ I      +   L  
Sbjct: 73  ITLVQGESARDKVLRISAASTHVQTQLAK 101


>gi|328772106|gb|EGF82145.1| hypothetical protein BATDEDRAFT_86894 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 128

 Score = 65.1 bits (158), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 16/73 (21%), Positives = 35/73 (47%), Gaps = 7/73 (9%)

Query: 8   LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67
           + P  K S +  ++          + I++ A  ++G+AN  ++  +A  L L K  + ++
Sbjct: 45  VKPGTKVSQVIDIQ-------GDAVGIQIAAVAREGEANAELIQTVADVLKLRKYQVAIV 97

Query: 68  SKQSSPLKIIYID 80
           +   S  K++ ID
Sbjct: 98  AGHKSRTKVLKID 110


>gi|326403567|ref|YP_004283649.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301]
 gi|325050429|dbj|BAJ80767.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301]
          Length = 131

 Score = 65.1 bits (158), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 15/83 (18%), Positives = 37/83 (44%), Gaps = 2/83 (2%)

Query: 4   VIVRLIPNAKKSGIASL--EIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           V +++ P A+++ +  +             +++ V   P+ G+AN A+L  LA  L +  
Sbjct: 23  VALKVQPGARRARLGPVVPAAAAPGWPPARLRLAVVVPPEDGRANDAVLKALAAWLGVGA 82

Query: 62  SSLRMLSKQSSPLKIIYIDKDCK 84
           + L + +   +  K++ +     
Sbjct: 83  ARLALRAGGQARDKLVLVVGGSA 105


>gi|224133940|ref|XP_002321697.1| predicted protein [Populus trichocarpa]
 gi|222868693|gb|EEF05824.1| predicted protein [Populus trichocarpa]
          Length = 231

 Score = 64.8 bits (157), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 142 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFVGKVLGLK 194

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    ++  K++ ++    +++ E L
Sbjct: 195 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKL 225


>gi|9758285|dbj|BAB08809.1| unnamed protein product [Arabidopsis thaliana]
          Length = 213

 Score = 64.8 bits (157), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + + L L 
Sbjct: 124 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 176

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    +S  K++ ++    +++ E L
Sbjct: 177 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 207


>gi|251772211|gb|EES52781.1| conserved hypothetical protein [Leptospirillum ferrodiazotrophum]
          Length = 120

 Score = 64.8 bits (157), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 15/81 (18%), Positives = 37/81 (45%), Gaps = 3/81 (3%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +++ + P   +     L             + +    ++G+AN+ +L  LA+ L +S SS
Sbjct: 28  LVIEVKPAQSR---VFLRRGASHGPAPIWVVGIREPAREGQANEGVLRTLAEFLGVSPSS 84

Query: 64  LRMLSKQSSPLKIIYIDKDCK 84
           + ++S  +S +K + +    +
Sbjct: 85  VALVSGHTSRIKRLSVRGIDE 105


>gi|187251200|ref|YP_001875682.1| hypothetical protein Emin_0790 [Elusimicrobium minutum Pei191]
 gi|186971360|gb|ACC98345.1| Conserved hypothetical protein DUF167 [Elusimicrobium minutum
          Pei191]
          Length = 93

 Score = 64.8 bits (157), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 14/89 (15%), Positives = 36/89 (40%), Gaps = 7/89 (7%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VR+IP A ++ +        +     +++KV     + +AN  +   LA+   +  
Sbjct: 7  MIIKVRVIPTAGENEVV-------SRIGSVLRVKVKTKSIEDEANNIIQYFLAEFFGVEN 59

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          + + ++       K + I    +E  + +
Sbjct: 60 NYINIVKGAKGKEKTVEIRGKSEEHLKKV 88


>gi|302763307|ref|XP_002965075.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii]
 gi|300167308|gb|EFJ33913.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii]
          Length = 233

 Score = 64.8 bits (157), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 17/93 (18%), Positives = 43/93 (46%), Gaps = 8/93 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  +AK L+L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMAKVLSLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            + + +    ++  K++ ++    +++ E L  
Sbjct: 196 VTQMTLQRGWNNKSKLLVVEDLSVRDVYEKLLA 228


>gi|255628955|gb|ACU14822.1| unknown [Glycine max]
          Length = 227

 Score = 64.8 bits (157), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 7/80 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  + + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQLAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYID 80
            S + +    ++  K++ + 
Sbjct: 196 LSQMTLQRGWNNKSKLLVVS 215


>gi|300871287|ref|YP_003786160.1| hypothetical protein BP951000_1678 [Brachyspira pilosicoli
          95/1000]
 gi|300688988|gb|ADK31659.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000]
          Length = 83

 Score = 64.8 bits (157), Expect = 4e-09,   Method: Composition-based stats.
 Identities = 21/87 (24%), Positives = 42/87 (48%), Gaps = 8/87 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           N+ V++  +AK +            +     I+++A    GKANKA++  L+ +L L K
Sbjct: 1  MNIEVKVTASAKSNSF--------KKENGIYYIRISAKAIDGKANKAIIDFLSSELNLKK 52

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITE 88
            + +L  + S  K+I ++ D  ++  
Sbjct: 53 KDVEILKGEKSSKKLISLNIDEYKLES 79


>gi|302756509|ref|XP_002961678.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii]
 gi|300170337|gb|EFJ36938.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii]
          Length = 233

 Score = 64.4 bits (156), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 17/93 (18%), Positives = 43/93 (46%), Gaps = 8/93 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  +AK L+L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMAKVLSLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            + + +    ++  K++ ++    +++ E L  
Sbjct: 196 ATQMTLQRGWNNKSKLLVVEDLSVRDVYEKLLA 228


>gi|255630115|gb|ACU15411.1| unknown [Glycine max]
          Length = 225

 Score = 64.4 bits (156), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 7/80 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  + + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQLAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYID 80
            S + +    ++  K++ ++
Sbjct: 196 LSQMTLQRGWNNKSKLLVVE 215


>gi|307199144|gb|EFN79854.1| UPF0235 protein C15orf40-like protein [Harpegnathos saltator]
          Length = 100

 Score = 64.4 bits (156), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 14/67 (20%), Positives = 28/67 (41%), Gaps = 7/67 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + ++  P AK + I  +             + ++A P +G+AN  ++  LA    L K
Sbjct: 35 VTIKIQAKPGAKCNNITDITNEGVG-------VAISAPPTEGEANAELVKYLASIFGLRK 87

Query: 62 SSLRMLS 68
          S + +  
Sbjct: 88 SHVSLDR 94


>gi|115471615|ref|NP_001059406.1| Os07g0295200 [Oryza sativa Japonica Group]
 gi|34394983|dbj|BAC84531.1| unknown protein [Oryza sativa Japonica Group]
 gi|113610942|dbj|BAF21320.1| Os07g0295200 [Oryza sativa Japonica Group]
 gi|215765278|dbj|BAG86975.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218199457|gb|EEC81884.1| hypothetical protein OsI_25695 [Oryza sativa Indica Group]
 gi|222636860|gb|EEE66992.1| hypothetical protein OsJ_23901 [Oryza sativa Japonica Group]
          Length = 232

 Score = 64.0 bits (155), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    ++  K++ ++    +++ E L
Sbjct: 196 LSQMTLQRGWNNKSKLLIVEDLSARQVYEKL 226


>gi|126340695|ref|XP_001369778.1| PREDICTED: similar to chromosome 15 open reading frame 40,
          [Monodelphis domestica]
          Length = 73

 Score = 64.0 bits (155), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 2/65 (3%)

Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEITEL 89
          + + + A P +G+AN  +   L+K L L KS + +     S  K++ I      +EI E 
Sbjct: 6  VSVAIAAPPSEGEANTELCCYLSKVLELRKSDVILDKGGKSHEKVVPILASTTPEEILEK 65

Query: 90 LQNND 94
           +   
Sbjct: 66 FKMQA 70


>gi|194671931|ref|XP_001789164.1| PREDICTED: C21H15orf40 protein-like [Bos taurus]
          Length = 234

 Score = 64.0 bits (155), Expect = 6e-09,   Method: Composition-based stats.
 Identities = 15/86 (17%), Positives = 34/86 (39%), Gaps = 9/86 (10%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            +     +K++ +  +           + + +   P +G+AN  +   L+K L L  S +
Sbjct: 107 AIHDKAGSKQNAMTDVTT-------EAVSVGIAGPPIEGEANVELCCCLSKILELRTSDV 159

Query: 65  RMLSKQSSPLKIIYIDK--DCKEITE 88
            +     S  K++ +      +EI E
Sbjct: 160 VLDKGSKSHEKVVKLLACTPPEEILE 185


>gi|307721935|ref|YP_003893075.1| hypothetical protein Saut_2020 [Sulfurimonas autotrophica DSM
           16294]
 gi|306980028|gb|ADN10063.1| protein of unknown function DUF167 [Sulfurimonas autotrophica DSM
           16294]
          Length = 118

 Score = 64.0 bits (155), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 14/85 (16%), Positives = 33/85 (38%), Gaps = 8/85 (9%)

Query: 9   IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
            P+AK + I              +KI V   P  GKA   ++  LA +  +    + ++ 
Sbjct: 42  KPSAKITKIG-------KPFGNQLKISVACAPVNGKATDHLVKFLAHEFDVKIKDIEVVF 94

Query: 69  KQSSPLKIIYIDKDCKEITELLQNN 93
            + +  K + + +   ++  + +  
Sbjct: 95  GRMNVNKQLRVTR-PNKLPSVFKAK 118


>gi|116784383|gb|ABK23322.1| unknown [Picea sitchensis]
 gi|224284776|gb|ACN40118.1| unknown [Picea sitchensis]
          Length = 232

 Score = 64.0 bits (155), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            + + +    ++  K++ ++    +++ E L
Sbjct: 196 LTQMTLQRGWNNKSKLLVVEDLSARDVYEKL 226


>gi|167648715|ref|YP_001686378.1| hypothetical protein Caul_4760 [Caulobacter sp. K31]
 gi|167351145|gb|ABZ73880.1| protein of unknown function DUF167 [Caulobacter sp. K31]
          Length = 96

 Score = 64.0 bits (155), Expect = 7e-09,   Method: Composition-based stats.
 Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 3/90 (3%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + VRL P   +  I    +  D     ++K++V A P +G AN A+LA LAK L L K
Sbjct: 1  MRLAVRLTPRGGREAIDGWAVDGDGRP--YLKVRVAAPPVEGAANAALLAFLAKALGLPK 58

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
          S+L + S   + LK+I I   D   +  +L
Sbjct: 59 SALTLASGAGARLKLIEIAGCDPLSLERVL 88


>gi|291615278|ref|YP_003525435.1| hypothetical protein Slit_2823 [Sideroxydans lithotrophicus ES-1]
 gi|291585390|gb|ADE13048.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus
          ES-1]
          Length = 102

 Score = 63.6 bits (154), Expect = 8e-09,   Method: Composition-based stats.
 Identities = 16/86 (18%), Positives = 36/86 (41%), Gaps = 8/86 (9%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           P AKK+ I  +           +++ V   P +G+A   ++  LA +  +++SS+ ++ 
Sbjct: 19 RPRAKKTKIGKV-------IGNQLEVHVAENPVRGRATAHLVKFLAGEFDVTESSITVVF 71

Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94
             +  K + I    K +   +    
Sbjct: 72 GVYNVNKQLRIVA-PKRLPAAIAKQQ 96


>gi|145334887|ref|NP_001078789.1| unknown protein [Arabidopsis thaliana]
 gi|332010366|gb|AED97749.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 213

 Score = 63.2 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + + L L 
Sbjct: 124 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 176

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    +S  K++ ++    +++ E L
Sbjct: 177 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 207


>gi|224119652|ref|XP_002318126.1| predicted protein [Populus trichocarpa]
 gi|222858799|gb|EEE96346.1| predicted protein [Populus trichocarpa]
          Length = 231

 Score = 63.2 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + + L L 
Sbjct: 142 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 194

Query: 61  KSSLRMLSKQSSPLKIIYIDKD-CKEITELL 90
            S + +    ++  K++ ++    +++ E L
Sbjct: 195 LSQMTLQRGWNNKSKLLVVEDLYARQVYEKL 225


>gi|297618780|ref|YP_003706885.1| protein of unknown function DUF167 [Methanococcus voltae A3]
 gi|297377757|gb|ADI35912.1| protein of unknown function DUF167 [Methanococcus voltae A3]
          Length = 112

 Score = 63.2 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 23/94 (24%), Positives = 46/94 (48%), Gaps = 7/94 (7%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           V + +  NAKK+ I  +           ++I++   P +GKANKA+L  +  +L L K+ 
Sbjct: 24  VNIDVSTNAKKNEIGKIN-----EWRKRLEIRIKQQPVEGKANKAILKFIKSELNL-KTD 77

Query: 64  LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSL 96
           + + +  ++  K ++    D   I + L   +S+
Sbjct: 78  VEIATGSTNSQKTLFFKDLDKNTILKKLNLLNSI 111


>gi|281348884|gb|EFB24468.1| hypothetical protein PANDA_020551 [Ailuropoda melanoleuca]
          Length = 87

 Score = 63.2 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 13/67 (19%), Positives = 29/67 (43%), Gaps = 7/67 (10%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + +   P +K++ +  +           + + + A P +G+AN  +   L+K L L K
Sbjct: 26 VTIAIHAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 78

Query: 62 SSLRMLS 68
          S + +  
Sbjct: 79 SDVVLDK 85


>gi|167999197|ref|XP_001752304.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696699|gb|EDQ83037.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 231

 Score = 63.2 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 17/93 (18%), Positives = 41/93 (44%), Gaps = 8/93 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 142 LVQVAIEVEDRAQRSQITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 194

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            + + +    ++  K++ ++    +E+ E L  
Sbjct: 195 LTQMTLQRGWNNKSKLLAVEDLSVREVYEKLLA 227


>gi|150399132|ref|YP_001322899.1| hypothetical protein Mevan_0378 [Methanococcus vannielii SB]
 gi|166232643|sp|A6UP65|Y378_METVS RecName: Full=UPF0235 protein Mevan_0378
 gi|150011835|gb|ABR54287.1| protein of unknown function DUF167 [Methanococcus vannielii SB]
          Length = 101

 Score = 62.8 bits (152), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 9/93 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + +  NAKK+ I  +           +++K+   P +G+ANKA+L  L +     K
Sbjct: 13 VLIDIEVTTNAKKNEIGKIN-----KWRKRLEVKIKEQPIEGRANKAILKFLKEIF---K 64

Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE-ITELLQNN 93
          + + +    +SP K + I  D KE +  +L+  
Sbjct: 65 TDVELNPVTTSPQKTVLISCDTKEYVVNILKRE 97


>gi|298243513|ref|ZP_06967320.1| protein of unknown function DUF167 [Ktedonobacter racemifer DSM
          44963]
 gi|297556567|gb|EFH90431.1| protein of unknown function DUF167 [Ktedonobacter racemifer DSM
          44963]
          Length = 90

 Score = 62.8 bits (152), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 25/94 (26%), Positives = 49/94 (52%), Gaps = 9/94 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V VR+IP + ++ +          +   +K ++TA P  G AN A++A+LA+ L+L K
Sbjct: 1  MQVPVRVIPRSNRNTLE--------WEEGAIKARLTAPPVDGAANAALIALLAETLSLPK 52

Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94
           ++ ++   +   KI+ I+  +  EI + L  + 
Sbjct: 53 RAITLIRGTTGRQKIVEIEGLEQVEIIQRLSASS 86


>gi|169600439|ref|XP_001793642.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15]
 gi|111068664|gb|EAT89784.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15]
          Length = 132

 Score = 62.8 bits (152), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 5/77 (6%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P   +     +          H+ + V     +G ANK++  +LAK L++ KS 
Sbjct: 26 ITLFVKPGVSR-----MREGIAAVSDSHVFMNVANYAFEGSANKSVQVLLAKTLSVPKSH 80

Query: 64 LRMLSKQSSPLKIIYID 80
          + ++   +S  K+  + 
Sbjct: 81 VSIVKGLTSREKVAEVK 97


>gi|297470564|ref|XP_002684034.1| PREDICTED: C21H15orf40 protein-like, partial [Bos taurus]
 gi|296491729|gb|DAA33762.1| C21H15orf40 protein-like [Bos taurus]
          Length = 107

 Score = 62.8 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 15/86 (17%), Positives = 34/86 (39%), Gaps = 9/86 (10%)

Query: 5  IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
           +     +K++ +  +           + + +   P +G+AN  +   L+K L L  S +
Sbjct: 17 AIHDKAGSKQNAMTDVTT-------EAVSVGIAGPPIEGEANVELCCCLSKILELRTSDV 69

Query: 65 RMLSKQSSPLKIIYIDK--DCKEITE 88
           +     S  K++ +      +EI E
Sbjct: 70 VLDKGSKSHEKVVKLLACTPPEEILE 95


>gi|322822396|gb|EFZ28456.1| hypothetical protein TCSYLVIO_5305 [Trypanosoma cruzi]
          Length = 156

 Score = 62.4 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 22/141 (15%), Positives = 45/141 (31%), Gaps = 55/141 (39%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAM--------LA 54
           ++ V   P A+ S +A      D      ++++V A P +GKAN  ++          LA
Sbjct: 16  HLTVHAKPGARSSSLACHPAVMDA----ALEVRVGAPPVEGKANAELVDFMQMLLEQELA 71

Query: 55  KK------------------------------------LALS-----KSSLRMLSKQSSP 73
           +                                     +        K  + ++S  S+ 
Sbjct: 72  RVRATQQHTPLESNGAVMNCVYGGHSKKDKKKNKPNTKMECPVNYPDKVRVSLVSGASAR 131

Query: 74  LKIIYI--DKDCKEITELLQN 92
            K + +      +E+  +LQ+
Sbjct: 132 HKTLEVAFPGTQEELISVLQS 152


>gi|145494428|ref|XP_001433208.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400325|emb|CAK65811.1| unnamed protein product [Paramecium tetraurelia]
          Length = 99

 Score = 62.4 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 17/79 (21%), Positives = 39/79 (49%), Gaps = 8/79 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P +K                  + +K  A P    AN+ ++ ML++KL++ +SS
Sbjct: 20 LQLFVKPKSK--------SEMLEFSEEFIIVKTKAQPIDNAANEDVIRMLSEKLSIDQSS 71

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          ++++  Q S  K ++I+ +
Sbjct: 72 IKIVKGQQSKYKTVFIENE 90


>gi|116780073|gb|ABK21543.1| unknown [Picea sitchensis]
          Length = 265

 Score = 62.1 bits (150), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 7/80 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYID 80
            + + +    ++  K++ + 
Sbjct: 196 LTQMTLQRGWNNKSKLLVVS 215


>gi|269860472|ref|XP_002649957.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348]
 gi|220066644|gb|EED44119.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348]
          Length = 88

 Score = 62.1 bits (150), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 16/76 (21%), Positives = 40/76 (52%), Gaps = 7/76 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + +++  +++++ I          +  ++ I++ A P   KAN  ++A+L+K     K +
Sbjct: 18 LNIKVKLSSRETAIL-------CQEDDYLIIQIAAPPVDNKANNELIALLSKTYKTKKEN 70

Query: 64 LRMLSKQSSPLKIIYI 79
          + ++  ++S  KII I
Sbjct: 71 ISIIKGKTSTTKIIKI 86


>gi|221106236|ref|XP_002164251.1| PREDICTED: similar to predicted protein [Hydra magnipapillata]
          Length = 1280

 Score = 62.1 bits (150), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 7/61 (11%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AK++ +  + +           I++ A P+ GKAN  +L+ L++   + KS 
Sbjct: 915 LQIYAKPGAKRNKVTDISVEFIG-------IQLAAQPRDGKANDELLSYLSELFNIKKSG 967

Query: 64  L 64
           +
Sbjct: 968 I 968


>gi|296088594|emb|CBI37585.3| unnamed protein product [Vitis vinifera]
          Length = 89

 Score = 61.7 bits (149), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 11/56 (19%), Positives = 29/56 (51%)

Query: 25 DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80
          D  D   + +++ A  + G+AN A+L  ++  + + +  + + S   S  K++ ++
Sbjct: 16 DYFDDEALGVQIDAPAKDGEANAALLDYISSVVGVKRRQVSISSGSKSRDKVVIVE 71


>gi|168035145|ref|XP_001770071.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678597|gb|EDQ65053.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 232

 Score = 61.3 bits (148), Expect = 4e-08,   Method: Composition-based stats.
 Identities = 17/93 (18%), Positives = 41/93 (44%), Gaps = 8/93 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQVGIEVEDRAQRSQITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92
            + + +    ++  K++ ++    +E+ E L  
Sbjct: 196 LTQMTLQRGWNNKSKLLVVEDLTVREVYEKLLA 228


>gi|167932988|ref|ZP_02520075.1| hypothetical protein cdivTM7_00062 [candidate division TM7
          single-cell isolate TM7b]
 gi|169836860|ref|ZP_02870048.1| hypothetical protein cdivTM_07094 [candidate division TM7
          single-cell isolate TM7a]
          Length = 59

 Score = 60.9 bits (147), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 13/52 (25%), Positives = 27/52 (51%)

Query: 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80
             + +   A   +G+AN A + +LAK   ++ S +++L   +S  K+  +D
Sbjct: 5  DGVLTVYTKAPAIEGRANLATVKLLAKYFGVASSKVKLLRGAASKYKVFEVD 56


>gi|158314062|ref|YP_001506570.1| hypothetical protein Franean1_2229 [Frankia sp. EAN1pec]
 gi|158109467|gb|ABW11664.1| protein of unknown function DUF167 [Frankia sp. EAN1pec]
          Length = 121

 Score = 60.9 bits (147), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 3/93 (3%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             V +R+ P A ++ +       D +    + ++V     +G+AN+A L  LA+ L + +
Sbjct: 20  VRVAIRVRPAADRTAVG--PATSDPTHGRLLVVRVREPAVEGRANEAALRALAQALGVRR 77

Query: 62  SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94
           S + +       +K I +D     IT  ++   
Sbjct: 78  SDVTLTR-SIGRVKFIEVDAPDDVITRRVEELA 109


>gi|212541150|ref|XP_002150730.1| DUF167 domain protein [Penicillium marneffei ATCC 18224]
 gi|210068029|gb|EEA22121.1| DUF167 domain protein [Penicillium marneffei ATCC 18224]
          Length = 124

 Score = 60.5 bits (146), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 16/78 (20%), Positives = 38/78 (48%), Gaps = 8/78 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   + P ++  G   ++   D      + I V + P+KG+AN A++ +L++ L + KS 
Sbjct: 26 IRCHVTPGSR--GFQGVKKICD----EQVYIHVASAPRKGEANAAVVKVLSEVLGIPKSD 79

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + ++       K+  ++ 
Sbjct: 80 ITIM--GKHRDKVGQVNG 95


>gi|312221888|emb|CBY01828.1| similar to DUF167 domain protein [Leptosphaeria maculans]
          Length = 118

 Score = 60.5 bits (146), Expect = 7e-08,   Method: Composition-based stats.
 Identities = 12/78 (15%), Positives = 36/78 (46%), Gaps = 5/78 (6%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
          ++   + P    S +  + + +D      +++ V    + G+AN A++ ++ + L   + 
Sbjct: 19 HLRCHVKPGVSSSRLGIIAVTED-----AVEVGVAEQAKNGEANDAVVHVICRALHAPRD 73

Query: 63 SLRMLSKQSSPLKIIYID 80
           +R++    S +K + + 
Sbjct: 74 EVRIVRGWKSRVKTVVVS 91


>gi|124028038|ref|YP_001013358.1| hypothetical protein Hbut_1179 [Hyperthermus butylicus DSM 5456]
 gi|123978732|gb|ABM81013.1| hypothetical protein Hbut_1179 [Hyperthermus butylicus DSM 5456]
          Length = 108

 Score = 60.1 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 14/99 (14%), Positives = 39/99 (39%), Gaps = 13/99 (13%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +V + + P A  +G+          +   +         +G+ N +++   ++ L +S 
Sbjct: 16  VDVTLYVKPEASFTGL--------RMELGELVFYTEELDVEGRVNASIVMFFSRLLGVSP 67

Query: 62  SSLRMLSKQSSPLKIIYIDKDC-----KEITELLQNNDS 95
           S + ++       K + I         ++I E L+ +++
Sbjct: 68  SMIDIVYGTREKTKRVRIKNVTWNQVFEKIVEALRESEA 106


>gi|145509569|ref|XP_001440723.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407951|emb|CAK73326.1| unnamed protein product [Paramecium tetraurelia]
          Length = 99

 Score = 59.7 bits (144), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 17/79 (21%), Positives = 39/79 (49%), Gaps = 8/79 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + + + P +K                  +  K  A P   +AN+ ++ ML++KL++ +SS
Sbjct: 20 LQLFVKPKSK--------AEMLEFSEEFVIAKTKAQPIDNEANEDVIRMLSEKLSIDQSS 71

Query: 64 LRMLSKQSSPLKIIYIDKD 82
          ++++  Q S  K ++I+ +
Sbjct: 72 IKIVKGQQSKYKTVFIENE 90


>gi|312136610|ref|YP_004003947.1| hypothetical protein Mfer_0383 [Methanothermus fervidus DSM 2088]
 gi|311224329|gb|ADP77185.1| protein of unknown function DUF167 [Methanothermus fervidus DSM
          2088]
          Length = 97

 Score = 59.7 bits (144), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 20/91 (21%), Positives = 43/91 (47%), Gaps = 9/91 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + +IP++KK GI   +     S    + + V +  +KGKANK ++   +K      
Sbjct: 11 VLLQIHVIPSSKKFGIEKYD-----SWRKRLYVTVKSPARKGKANKEIIEEFSKLFN--- 62

Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELLQ 91
            ++++    S  K + +     K+I E+++
Sbjct: 63 KEVKIVKGIKSRDKTLVVKDVEYKKIMEIIR 93


>gi|71660987|ref|XP_817521.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70882718|gb|EAN95670.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 156

 Score = 59.7 bits (144), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 20/141 (14%), Positives = 44/141 (31%), Gaps = 55/141 (39%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAM--------LA 54
           ++ V   P A+ S +A      D      +++++ A P +GKAN  ++          LA
Sbjct: 16  HLTVHAKPGARSSSLACHPAVTDA----ALEVRIGAPPVEGKANAELVDFMQMLLEQELA 71

Query: 55  KK------------------------------------LALS-----KSSLRMLSKQSSP 73
           +                                     +        K  + ++   S+ 
Sbjct: 72  RVRAAQQHTPVESNGAFMNCVYDGHSKKDKKKNKPNTKMECPVNYPDKVRVSLVGGASAR 131

Query: 74  LKIIYI--DKDCKEITELLQN 92
            K + +      +E+  +LQ+
Sbjct: 132 HKTLEVAFPGTEEELISVLQS 152


>gi|238060239|ref|ZP_04604948.1| hypothetical protein MCAG_01205 [Micromonospora sp. ATCC 39149]
 gi|237882050|gb|EEP70878.1| hypothetical protein MCAG_01205 [Micromonospora sp. ATCC 39149]
          Length = 99

 Score = 59.7 bits (144), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 3/88 (3%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V VR+ P A ++ +       D      + I V A P  G+A +A    LA  L +  ++
Sbjct: 7  VAVRVKPGAARARVGGRH---DGPHGPALVIAVNAPPVDGRATEAARRALADALGIRPAA 63

Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91
          + + +  +S  K+  +++    + E L+
Sbjct: 64 VALRAGAASRDKLFLVERPTPGLAEALR 91


>gi|218884121|ref|YP_002428503.1| hypothetical protein DKAM_0810 [Desulfurococcus kamchatkensis
           1221n]
 gi|218765737|gb|ACL11136.1| Uncharacterized conserved protein [Desulfurococcus kamchatkensis
           1221n]
          Length = 115

 Score = 59.4 bits (143), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 17/91 (18%), Positives = 36/91 (39%), Gaps = 9/91 (9%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +R+ P               T +   +        +KG+ N A++  LA++L +  S 
Sbjct: 29  LSIRVKP--------GDVEDYITIEGDELVFHTAEQSEKGRENAALVKYLARELKIPVSK 80

Query: 64  LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
           + ++  +   LK + + D D  E+   L   
Sbjct: 81  IDIVYGRRETLKKVLLNDVDPDELVIKLAKL 111


>gi|71668144|ref|XP_821011.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70886377|gb|EAN99160.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 156

 Score = 59.0 bits (142), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 22/141 (15%), Positives = 45/141 (31%), Gaps = 55/141 (39%)

Query: 3   NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAM--------LA 54
           ++ V   P A+ S +A      D      ++++V A P +GKAN  ++          LA
Sbjct: 16  HLTVHAKPGARSSSLACHPAVTDA----AIEVRVGAPPVEGKANAELVEFMQMLLEQELA 71

Query: 55  KK------------------------------------LALS-----KSSLRMLSKQSSP 73
           +                                     +        K  + ++S  S+ 
Sbjct: 72  RVRAAQQHTPLESNGAVMNCVYGGHSKKDKRKNKPNTKMECPVNYPEKVRVSLVSGASAR 131

Query: 74  LKIIYI--DKDCKEITELLQN 92
            K + +      +E+  +LQ+
Sbjct: 132 HKTLEVAFPGTQEELISVLQS 152


>gi|148550788|ref|YP_001260227.1| hypothetical protein Swit_5352 [Sphingomonas wittichii RW1]
 gi|148503207|gb|ABQ71460.1| protein of unknown function DUF167 [Sphingomonas wittichii RW1]
          Length = 94

 Score = 58.2 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 6/77 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + VR+ PNA    I         +    ++++ TATP+ GKAN+A+L +LA  L    S
Sbjct: 23 RLPVRVAPNASADAII------LATADGILQVRTTATPEGGKANEAVLRLLAAALGQPVS 76

Query: 63 SLRMLSKQSSPLKIIYI 79
          +L +L   +   KI+ +
Sbjct: 77 ALELLRGSTGRNKIVRV 93


>gi|260576559|ref|ZP_05844548.1| protein of unknown function DUF167 [Rhodobacter sp. SW2]
 gi|259021282|gb|EEW24589.1| protein of unknown function DUF167 [Rhodobacter sp. SW2]
          Length = 66

 Score = 58.2 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 16/60 (26%), Positives = 32/60 (53%), Gaps = 8/60 (13%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           +  VR+ P A ++ +          +   +++ VT  P+ GKA + ++A+LAK L ++K
Sbjct: 14 ADFAVRVTPKASRNAVV--------VEEGAIRVYVTCVPEDGKATREVVALLAKALGVAK 65


>gi|327289071|ref|XP_003229248.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Anolis
           carolinensis]
          Length = 104

 Score = 58.2 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 13/60 (21%), Positives = 24/60 (40%), Gaps = 7/60 (11%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             + V   P +K++ +  L             I + A P  G+AN  +   L+K L + +
Sbjct: 48  VTIAVHAKPGSKQNAVTDLSAEAVG-------IAIAAPPSDGEANAELCRYLSKVLEVKR 100


>gi|194688626|gb|ACF78397.1| unknown [Zea mays]
          Length = 232

 Score = 58.2 bits (140), Expect = 4e-07,   Method: Composition-based stats.
 Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            + + +    ++  K++ ++    +++ E L
Sbjct: 196 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKL 226


>gi|11499654|ref|NP_070896.1| hypothetical protein AF2072 [Archaeoglobus fulgidus DSM 4304]
 gi|29839698|sp|O28207|Y2072_ARCFU RecName: Full=UPF0235 protein AF_2072
 gi|2648454|gb|AAB89177.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 78

 Score = 57.8 bits (139), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 11/77 (14%), Positives = 33/77 (42%), Gaps = 10/77 (12%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + V + P +K+                 +++++ +  ++GKAN+ +L +  +      
Sbjct: 8  VLISVHVSPGSKE------VSFSYDEWRRAVEVRIKSPAKEGKANRELLGIFRQIFG--- 58

Query: 62 SSLRMLSKQSSPLKIIY 78
            + ++S + S  K++ 
Sbjct: 59 -EVELVSGEKSRSKVLK 74


>gi|325959811|ref|YP_004291277.1| hypothetical protein Metbo_2086 [Methanobacterium sp. AL-21]
 gi|325331243|gb|ADZ10305.1| UPF0235 protein yggU [Methanobacterium sp. AL-21]
          Length = 100

 Score = 57.8 bits (139), Expect = 5e-07,   Method: Composition-based stats.
 Identities = 21/93 (22%), Positives = 41/93 (44%), Gaps = 9/93 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + +   +    I               +IK+ A PQKGKANK ++   AK   L+ 
Sbjct: 14 VLLNIEVGTKSDNFRITGYND-----WRKSFEIKIKAVPQKGKANKEIILEFAK---LTN 65

Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93
            + ++S   S  K + I D + +++ +L++  
Sbjct: 66 KRVEIISGHKSHRKTLKIYDINEEDLLKLIEQE 98


>gi|1730921|sp|P52064|YPI4_VIBAL RecName: Full=UPF0235 protein in proC 3'region
          Length = 54

 Score = 57.4 bits (138), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 13/48 (27%), Positives = 20/48 (41%), Gaps = 7/48 (14%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLA 51
          + + + P A +  I  L           +KI +TA P  GKAN  +  
Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTK 54


>gi|325120013|emb|CBZ55566.1| hypothetical protein NCLIV_059910 [Neospora caninum Liverpool]
          Length = 531

 Score = 57.4 bits (138), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 5/56 (8%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59
           + V   P AK+S I S+           + +++ A  ++G AN+ +   LA   +L
Sbjct: 52  LAVHAKPGAKQSQIPSINEQA-----EQLDVQIDAPAREGAANEELCDFLADACSL 102


>gi|218889133|ref|YP_002437997.1| hypothetical protein PLES_03891 [Pseudomonas aeruginosa LESB58]
 gi|218769356|emb|CAW25116.1| conserved hypothetical protein [Pseudomonas aeruginosa LESB58]
          Length = 56

 Score = 57.4 bits (138), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          +GKAN  +LA L K   ++KS + + S + +  K + I +    + E L
Sbjct: 2  EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRI-RHPTRLPEEL 49


>gi|294494893|ref|YP_003541386.1| hypothetical protein Mmah_0207 [Methanohalophilus mahii DSM 5219]
 gi|292665892|gb|ADE35741.1| protein of unknown function DUF167 [Methanohalophilus mahii DSM
           5219]
          Length = 103

 Score = 57.4 bits (138), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 16/92 (17%), Positives = 41/92 (44%), Gaps = 8/92 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
           C +   + P + K     +           ++ K+T + QKGKAN  ++  L+    ++ 
Sbjct: 14  CIIDFEINPGSSK----LVVPSGYNIWRKRVEGKLTESAQKGKANDQLIQRLSHIFQINS 69

Query: 62  SSLRMLSKQSSPLKIIYIDK----DCKEITEL 89
           SS+ +++   +  K ++++       +++ E 
Sbjct: 70  SSITIVAGAKTTKKSVHLENVYPKTAEDVLEQ 101


>gi|116054122|ref|YP_788565.1| hypothetical protein PA14_05120 [Pseudomonas aeruginosa
          UCBPP-PA14]
 gi|115589343|gb|ABJ15358.1| conserved hypothetical protein [Pseudomonas aeruginosa
          UCBPP-PA14]
          Length = 56

 Score = 57.4 bits (138), Expect = 6e-07,   Method: Composition-based stats.
 Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          +GKAN  +LA L K   ++KS + + S + +  K + I +    + E L
Sbjct: 2  EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRIRR-PTRLPEEL 49


>gi|257077103|ref|ZP_05571464.1| hypothetical protein Faci_08566 [Ferroplasma acidarmanus fer1]
          Length = 66

 Score = 56.7 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 16/55 (29%), Positives = 27/55 (49%)

Query: 26 TSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80
           S+   +KI  +A  +  KAN  ++  LA    +    ++++S Q S  KI  ID
Sbjct: 11 ESEGDRLKIYTSAPRENNKANYDIMKQLATYYNVEFYKIKLISGQKSRKKIFSID 65


>gi|107099377|ref|ZP_01363295.1| hypothetical protein PaerPA_01000389 [Pseudomonas aeruginosa
          PACS2]
 gi|152985685|ref|YP_001345886.1| hypothetical protein PSPA7_0491 [Pseudomonas aeruginosa PA7]
 gi|254243491|ref|ZP_04936813.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
 gi|296386890|ref|ZP_06876389.1| hypothetical protein PaerPAb_02102 [Pseudomonas aeruginosa PAb1]
 gi|313111988|ref|ZP_07797775.1| hypothetical protein PA39016_004070010 [Pseudomonas aeruginosa
          39016]
 gi|126196869|gb|EAZ60932.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
 gi|150960843|gb|ABR82868.1| protein VV1_1522 [Pseudomonas aeruginosa PA7]
 gi|310884277|gb|EFQ42871.1| hypothetical protein PA39016_004070010 [Pseudomonas aeruginosa
          39016]
          Length = 56

 Score = 56.7 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 1/49 (2%)

Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90
          +GKAN  +LA L K   ++KS + + S + +  K + I +    + E L
Sbjct: 2  EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRIRR-PTRLPEEL 49


>gi|189913153|ref|YP_001965041.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc
          strain 'Patoc 1 (Ames)']
 gi|189913490|ref|YP_001964718.1| hypothetical protein LEPBI_p0042 [Leptospira biflexa serovar
          Patoc strain 'Patoc 1 (Paris)']
 gi|167777829|gb|ABZ96128.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc
          strain 'Patoc 1 (Ames)']
 gi|167781558|gb|ABZ99854.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc
          strain 'Patoc 1 (Paris)']
          Length = 75

 Score = 56.7 bits (136), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 7/78 (8%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            +++++        I SLE   +T     +K    + P KGKAN+ ++ +L+K   ++K
Sbjct: 1  MKIVIKVK---SNQKIQSLEFKSETECIAKLK----SLPVKGKANQELVGLLSKHYGVTK 53

Query: 62 SSLRMLSKQSSPLKIIYI 79
            ++++S   S +K + I
Sbjct: 54 KEIQIISGHFSNIKTVEI 71


>gi|303245044|ref|ZP_07331365.1| protein of unknown function DUF167 [Methanothermococcus
          okinawensis IH1]
 gi|302484607|gb|EFL47550.1| protein of unknown function DUF167 [Methanothermococcus
          okinawensis IH1]
          Length = 101

 Score = 56.3 bits (135), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 9/93 (9%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            + + + PNAKK+ I  +           + IK+ A P +GKANK ++        + K
Sbjct: 15 VLIDIDISPNAKKNEIGGIN-----EWRKRIIIKIKAQPIEGKANKEIIK---FLKKIFK 66

Query: 62 SSLRMLSKQSSPLKIIY-IDKDCKEITELLQNN 93
            + ++S  +S  K +  I  + +EI  ++   
Sbjct: 67 KDVEIVSGLTSSQKTVLVIGGNREEIINIITKQ 99


>gi|223944751|gb|ACN26459.1| unknown [Zea mays]
          Length = 194

 Score = 55.9 bits (134), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 105 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 157

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            + + +    ++  K++ ++    +++ E L
Sbjct: 158 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKL 188


>gi|283778809|ref|YP_003369564.1| hypothetical protein Psta_1020 [Pirellula staleyi DSM 6068]
 gi|283437262|gb|ADB15704.1| protein of unknown function DUF167 [Pirellula staleyi DSM 6068]
          Length = 102

 Score = 55.5 bits (133), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 12/96 (12%)

Query: 2  CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
            V V+    AKK+ +              +KI VT  P+KGKAN A+  +LA  LA+ +
Sbjct: 11 VLVGVKAQAAAKKNSLRGEHA-------GLLKISVTTAPEKGKANDAIADLLAAALAVRR 63

Query: 62 SSLRMLSKQSSPLKIIYIDKDC-----KEITELLQN 92
          S++++++  + PLK   I         ++I   L+N
Sbjct: 64 SAVQIVAGHTQPLKKFLISGASLDEVREKIARALEN 99


>gi|226529615|ref|NP_001152637.1| hypothetical protein LOC100286278 [Zea mays]
 gi|195658405|gb|ACG48670.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea
           mays]
          Length = 194

 Score = 54.7 bits (131), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 15/91 (16%), Positives = 41/91 (45%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S +  +           +++ V A   +G+AN  +L  + K L L 
Sbjct: 105 LVQVAIEVEDRAQRSAVTRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 157

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            + + +    ++  K++ ++    +++ E L
Sbjct: 158 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKL 188


>gi|225620549|ref|YP_002721806.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1]
 gi|225215368|gb|ACN84102.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1]
          Length = 56

 Score = 54.7 bits (131), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 16/55 (29%), Positives = 29/55 (52%)

Query: 38 ATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92
          A    GKANKA++  LA +L + K  + +L  + +  K+I I+ +  E+ +    
Sbjct: 2  AKAIDGKANKAIIDFLADELNIKKRDVEILKGEKNSKKLISININDNELKKYFNK 56


>gi|307108344|gb|EFN56584.1| hypothetical protein CHLNCDRAFT_34979 [Chlorella variabilis]
          Length = 251

 Score = 54.4 bits (130), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 13/80 (16%), Positives = 34/80 (42%), Gaps = 7/80 (8%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61
             V + +    K++ +  +           +++++ +    G AN+ +L ML   L +  
Sbjct: 142 TQVALEVDDRGKRALVLRVTA-------DFVRVQLKSGANAGHANEELLEMLRGVLGVRL 194

Query: 62  SSLRMLSKQSSPLKIIYIDK 81
             L +   +SS  K++ ++ 
Sbjct: 195 GQLSLQRGESSRHKVLLVEG 214


>gi|261327632|emb|CBH10608.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
          DAL972]
 gi|289742777|gb|ADD20136.1| hypothetical protein Tb927.4.3080 [Glossina morsitans morsitans]
          Length = 165

 Score = 54.0 bits (129), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 4/55 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKL 57
           +++   P A+ + +A+     D      +++++ A P  GKAN  ++  +   L
Sbjct: 16 RLMIHAKPGARSTALAAQPQALD----EALEVRLAAPPVDGKANTELVEFMQTLL 66



 Score = 39.3 bits (91), Expect = 0.16,   Method: Composition-based stats.
 Identities = 6/36 (16%), Positives = 19/36 (52%), Gaps = 2/36 (5%)

Query: 61  KSSLRMLSKQSSPLKIIYID--KDCKEITELLQNND 94
           K  + ++S  +S  K++ +      +++  +L++ D
Sbjct: 128 KVRVSLVSGLTSRNKVLEVTFPGTEEDLLAVLKSAD 163


>gi|317509370|ref|ZP_07966989.1| hypothetical protein HMPREF9336_03361 [Segniliparus rugosus ATCC
          BAA-974]
 gi|316252293|gb|EFV11744.1| hypothetical protein HMPREF9336_03361 [Segniliparus rugosus ATCC
          BAA-974]
          Length = 79

 Score = 53.6 bits (128), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 16/76 (21%), Positives = 36/76 (47%), Gaps = 6/76 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V V + P +++          + ++   + + V     +G+A  A   +LAK L ++KS 
Sbjct: 8  VTVTVKPGSRR------GPSVEAAEDGSLTVCVREPAVEGRATAAAAVVLAKHLGVAKSR 61

Query: 64 LRMLSKQSSPLKIIYI 79
          + ++S  +S +K   +
Sbjct: 62 VALVSGATSRVKRFAV 77


>gi|72388068|ref|XP_844458.1| hypothetical protein [Trypanosoma brucei TREU927]
 gi|62359409|gb|AAX79847.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70800991|gb|AAZ10899.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
          927/4 GUTat10.1]
          Length = 165

 Score = 53.6 bits (128), Expect = 8e-06,   Method: Composition-based stats.
 Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 4/55 (7%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKL 57
           +++   P A+ + +A+     D      +++++ A P  GKAN  ++  +   L
Sbjct: 16 RLMIHAKPGARSTALAAQPQALD----EALEVRLAAPPVDGKANTELVEFMQTLL 66



 Score = 38.9 bits (90), Expect = 0.20,   Method: Composition-based stats.
 Identities = 6/36 (16%), Positives = 19/36 (52%), Gaps = 2/36 (5%)

Query: 61  KSSLRMLSKQSSPLKIIYID--KDCKEITELLQNND 94
           K  + ++S  +S  K++ +      +++  +L++ D
Sbjct: 128 KVRVSLVSGLTSRNKVLEVTFPGTEEDLLAVLKSAD 163


>gi|187251746|ref|YP_001876228.1| hypothetical protein Emin_1341 [Elusimicrobium minutum Pei191]
 gi|186971906|gb|ACC98891.1| Uncharacterized conserved protein DUF167 [Elusimicrobium minutum
          Pei191]
          Length = 71

 Score = 53.6 bits (128), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 17/75 (22%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          + V++  + K++ +          +    +I V A  ++G AN+A+  +LAK++ +    
Sbjct: 3  IKVKVHADEKQNKLI-------KKNEDTFEIWVKAPAERGLANEAVREILAKEIGVGVKK 55

Query: 64 LRMLSKQSSPLKIIY 78
          +R++   +SP KI  
Sbjct: 56 IRLIKGATSPSKIFE 70


>gi|326386210|ref|ZP_08207834.1| hypothetical protein Y88_2102 [Novosphingobium nitrogenifigens
          DSM 19370]
 gi|326209435|gb|EGD60228.1| hypothetical protein Y88_2102 [Novosphingobium nitrogenifigens
          DSM 19370]
          Length = 95

 Score = 53.2 bits (127), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 20/77 (25%), Positives = 41/77 (53%), Gaps = 8/77 (10%)

Query: 3  NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62
           + VR+ P A++  +A +           + +KV A P+ GKA  A+L+++A  L ++ S
Sbjct: 23 RLAVRVTPGAREETVAIV--------DGRVLVKVRAKPEDGKATTAVLSLVAAALGVAAS 74

Query: 63 SLRMLSKQSSPLKIIYI 79
           + +L   +S  K++ +
Sbjct: 75 RVELLRGATSREKLLRL 91


>gi|242799550|ref|XP_002483404.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218716749|gb|EED16170.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 124

 Score = 52.8 bits (126), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 15/78 (19%), Positives = 35/78 (44%), Gaps = 8/78 (10%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          +   + P ++  G   ++          + + V + P+KG+AN A+  +L++ L   KS 
Sbjct: 26 IRCHVTPGSR--GFQGIKQI----HNEQVYVHVGSEPRKGEANTAVARVLSEVLGFPKSD 79

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + ++      +KI  +  
Sbjct: 80 VIIV--GKQRVKIGQVTG 95


>gi|326517290|dbj|BAK00012.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529833|dbj|BAK08196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 232

 Score = 51.7 bits (123), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 15/91 (16%), Positives = 36/91 (39%), Gaps = 8/91 (8%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +                     +G+AN  +L  + K L L 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADDVRVAVAAPA-------ARGEANNELLEFMGKVLGLR 195

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            S + +    ++  K++ ++    +++ E L
Sbjct: 196 LSQMTLQRGWNNKSKLLIVEDLSARQVYEKL 226


>gi|281212114|gb|EFA86275.1| hypothetical protein PPL_00837 [Polysphondylium pallidum PN500]
          Length = 83

 Score = 51.7 bits (123), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 14/56 (25%), Positives = 33/56 (58%), Gaps = 5/56 (8%)

Query: 1  MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56
          +  + V + PNAK+S + S+    D+     + I+++  P +G+AN+ ++  L+++
Sbjct: 9  IVKLKVNVHPNAKQSSVVSVNELADS-----VDIRISQPPTEGRANEEVIEYLSEQ 59


>gi|148260376|ref|YP_001234503.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5]
 gi|146402057|gb|ABQ30584.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5]
          Length = 75

 Score = 51.3 bits (122), Expect = 5e-05,   Method: Composition-based stats.
 Identities = 11/47 (23%), Positives = 23/47 (48%)

Query: 39 TPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKE 85
           P+ G+AN A+L  LA  L +  + L + +   +  K++ +     +
Sbjct: 4  PPEDGRANDAVLKALAAWLGIGAARLALRAGGQARDKLVLVAGGSAD 50


>gi|308456125|ref|XP_003090529.1| hypothetical protein CRE_03518 [Caenorhabditis remanei]
 gi|308262652|gb|EFP06605.1| hypothetical protein CRE_03518 [Caenorhabditis remanei]
          Length = 102

 Score = 50.9 bits (121), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 16/74 (21%), Positives = 32/74 (43%), Gaps = 9/74 (12%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           + +   P AKKS + ++           + + + A P++G AN+ +++ L   L L K+ 
Sbjct: 34  LRIHAKPGAKKSCVVAI-------GESEIDVSIGAAPREGAANEELISYLMAALGLRKNE 86

Query: 64  LRMLS--KQSSPLK 75
           L+           K
Sbjct: 87  LQFDKVLGLMGNDK 100


>gi|183982913|ref|YP_001851204.1| hypothetical protein MMAR_2910 [Mycobacterium marinum M]
 gi|226701561|sp|B2HE69|Y2910_MYCMM RecName: Full=UPF0235 protein MMAR_2910
 gi|183176239|gb|ACC41349.1| conserved hypothetical protein [Mycobacterium marinum M]
          Length = 76

 Score = 48.2 bits (114), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 6/78 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
          V+VR+ P ++K  +       +T     + I V      GKAN+A   +LA  L L +S 
Sbjct: 5  VVVRVKPGSRKGPLV------ETGSDAELTIYVRERAVDGKANEAAARLLAAHLQLPRSR 58

Query: 64 LRMLSKQSSPLKIIYIDK 81
          + +++  +S LK   +++
Sbjct: 59 VELVAGATSRLKRFRVER 76


>gi|322494953|emb|CBZ30256.1| conserved hypothetical protein [Leishmania mexicana
          MHOM/GT/2001/U1103]
          Length = 204

 Score = 47.0 bits (111), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 4/53 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56
          + V   P+A+ S  A+   P  T       +++ A P  G+AN  +L  L + 
Sbjct: 25 LTVHAKPSARASAFAAPVTPTLTEAD----LRIAAPPVDGQANAELLRYLGEL 73



 Score = 34.7 bits (79), Expect = 3.9,   Method: Composition-based stats.
 Identities = 6/34 (17%), Positives = 14/34 (41%), Gaps = 2/34 (5%)

Query: 63  SLRMLSKQSSPLKIIYI--DKDCKEITELLQNND 94
            + ++   +S  K + I       E+T +L+   
Sbjct: 169 EVSLVRGGTSREKTVLIVFPGTRAELTAVLEKVS 202


>gi|302418742|ref|XP_003007202.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261354804|gb|EEY17232.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 116

 Score = 46.7 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 5/53 (9%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56
          +   + P A K+                ++I V A P++G+ANKA+L +L++ 
Sbjct: 24 LHCYVKPGAAKAR-----EGVTGLTEDAIEICVAAQPREGQANKAVLRLLSEA 71


>gi|146098877|ref|XP_001468495.1| hypothetical protein [Leishmania infantum]
 gi|134072863|emb|CAM71579.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 204

 Score = 46.7 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 4/53 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56
          + V   P+A+ S  A+   P  T       +++ A P +G+AN  +L  L + 
Sbjct: 25 LTVHAKPSARASAFAAPLTPALTEAD----LRIAAPPVEGQANAELLRYLGEL 73


>gi|322502521|emb|CBZ37604.1| unnamed protein product [Leishmania donovani BPK282A1]
          Length = 204

 Score = 46.7 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 4/53 (7%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56
          + V   P+A+ S  A+   P  T       +++ A P +G+AN  +L  L + 
Sbjct: 25 LTVHAKPSARASAFAAPLTPALTEAD----LRIAAPPVEGQANAELLRYLGEL 73


>gi|255634374|gb|ACU17552.1| unknown [Glycine max]
          Length = 77

 Score = 46.3 bits (109), Expect = 0.001,   Method: Composition-based stats.
 Identities = 8/51 (15%), Positives = 22/51 (43%), Gaps = 7/51 (13%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLA 54
          + +   P +K + +  +             +++ A  + G+AN A+L  ++
Sbjct: 20 ITIHAKPGSKSASVTDISDEAVG-------VQIDAPARDGEANAALLDYIS 63


>gi|301774793|ref|XP_002922805.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40-like
           [Ailuropoda melanoleuca]
          Length = 129

 Score = 46.3 bits (109), Expect = 0.001,   Method: Composition-based stats.
 Identities = 18/94 (19%), Positives = 37/94 (39%), Gaps = 4/94 (4%)

Query: 2   CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQ-KGKANKAMLAMLAKKLALS 60
             + +   P +K++    +     +           A  Q +G+AN  +   L+K L L 
Sbjct: 36  VTIAIHAKPGSKQNATTDVTAKVVSVAITAPPPTPPAPRQSEGEANAELSWCLSKVLELR 95

Query: 61  KS-SLRMLSKQSSPLKIIYI--DKDCKEITELLQ 91
           KS  + +     S  K++ +      +EI E ++
Sbjct: 96  KSDDVILDKGGXSHEKVVKLLASTTAEEILEKVK 129


>gi|157875650|ref|XP_001686209.1| hypothetical protein [Leishmania major strain Friedlin]
 gi|68129283|emb|CAJ07823.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 204

 Score = 46.3 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 4/50 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAML 53
          + V   P+A+ S  A+   P  T       +++ A P +G+AN  +L  L
Sbjct: 25 LTVHAKPSARASAFAAPLTPVLTEAD----LRIAAPPVEGQANAELLRYL 70



 Score = 34.3 bits (78), Expect = 5.1,   Method: Composition-based stats.
 Identities = 4/34 (11%), Positives = 14/34 (41%), Gaps = 2/34 (5%)

Query: 63  SLRMLSKQSSPLKIIYI--DKDCKEITELLQNND 94
            + ++   +S  K + +       ++T +L+   
Sbjct: 169 EVSLVRGGTSREKTVLVMFPGTRAQLTAVLEKES 202


>gi|197106855|ref|YP_002132232.1| hypothetical protein PHZ_c3394 [Phenylobacterium zucineum HLK1]
 gi|196480275|gb|ACG79803.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
          Length = 86

 Score = 45.9 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 42/76 (55%), Gaps = 2/76 (2%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           P   +  I       D +    +K++V+A P  G AN A++A+LAK L + KS++R+ +
Sbjct: 2  TPKGGRDAIDGW--GADEAGRPVLKVRVSAAPADGAANAAVVALLAKALKVPKSAVRIAA 59

Query: 69 KQSSPLKIIYIDKDCK 84
           +++ +K + ID   +
Sbjct: 60 GETARIKRLEIDGASE 75


>gi|255089425|ref|XP_002506634.1| predicted protein [Micromonas sp. RCC299]
 gi|226521907|gb|ACO67892.1| predicted protein [Micromonas sp. RCC299]
          Length = 319

 Score = 45.5 bits (107), Expect = 0.002,   Method: Composition-based stats.
 Identities = 15/91 (16%), Positives = 36/91 (39%), Gaps = 10/91 (10%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           M  + + +   A+   ++ +   +         + +T  P  G+ +  +L  L K L L 
Sbjct: 141 MIQIAIDVEDKARTKAVSKITADEVG-------VALT-LPV-GQCDDELLEFLGKVLHLR 191

Query: 61  KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90
              + +L   S+  K++ +      ++ E L
Sbjct: 192 LPQMSLLRGWSTRSKLLMVQGLSATQVYERL 222


>gi|320590983|gb|EFX03422.1| dash complex subunit [Grosmannia clavigera kw1407]
          Length = 312

 Score = 45.5 bits (107), Expect = 0.003,   Method: Composition-based stats.
 Identities = 12/81 (14%), Positives = 23/81 (28%), Gaps = 25/81 (30%)

Query: 4   VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +   + P A K     +           ++I V                    L + KSS
Sbjct: 229 LQCHIKPGASK-----IRQGVTAVTDDAIEICV--------------------LDVPKSS 263

Query: 64  LRMLSKQSSPLKIIYIDKDCK 84
           L++     S  K + +    +
Sbjct: 264 LQITRGLKSRDKTVAVAGLSE 284


>gi|111223913|ref|YP_714707.1| hypothetical protein FRAAL4520 [Frankia alni ACN14a]
 gi|111151445|emb|CAJ63162.1| hypothetical protein FRAAL4520 [Frankia alni ACN14a]
          Length = 93

 Score = 45.1 bits (106), Expect = 0.003,   Method: Composition-based stats.
 Identities = 17/75 (22%), Positives = 31/75 (41%)

Query: 22 IPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81
             D      + ++VT     G+A +A L  LA  L L ++ +R++   +S +K   +  
Sbjct: 4  RWTDPRAGSIVIVRVTERAVDGRATEAALRALAGALGLRRTQVRLVRGATSRVKTFELTT 63

Query: 82 DCKEITELLQNNDSL 96
             +   L    D L
Sbjct: 64 PAADEPALRARLDRL 78


>gi|322504678|emb|CBZ14505.1| conserved hypothetical protein [Leishmania braziliensis
          MHOM/BR/75/M2904]
          Length = 193

 Score = 44.7 bits (105), Expect = 0.005,   Method: Composition-based stats.
 Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 4/50 (8%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAML 53
          + V   P A+ S  A+   P  T       +++ A P +G+AN  +L  L
Sbjct: 15 LRVYAKPGARASAFAAPLTPSLTEAD----LRIAAAPVEGQANAELLRYL 60



 Score = 33.6 bits (76), Expect = 9.2,   Method: Composition-based stats.
 Identities = 3/34 (8%), Positives = 13/34 (38%), Gaps = 2/34 (5%)

Query: 63  SLRMLSKQSSPLKIIYI--DKDCKEITELLQNND 94
            + ++   +S  K + +       ++  +L+   
Sbjct: 158 EVSLVRGGTSREKTLLVVFPGTRAQLAAILEKES 191


>gi|7770327|gb|AAF69697.1|AC016041_2 F27J15.6 [Arabidopsis thaliana]
          Length = 91

 Score = 43.6 bits (102), Expect = 0.009,   Method: Composition-based stats.
 Identities = 9/53 (16%), Positives = 22/53 (41%), Gaps = 7/53 (13%)

Query: 4  VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56
          + +   P +K + I  +             +++ A  + G+AN A+L  ++  
Sbjct: 39 ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEYMSSV 84


>gi|30697842|ref|NP_851256.1| unknown protein [Arabidopsis thaliana]
 gi|16226478|gb|AAL16178.1|AF428410_1 AT5g63440/MLE2_7 [Arabidopsis thaliana]
 gi|22137224|gb|AAM91457.1| AT5g63440/MLE2_7 [Arabidopsis thaliana]
 gi|332010364|gb|AED97747.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 205

 Score = 43.2 bits (101), Expect = 0.011,   Method: Composition-based stats.
 Identities = 11/64 (17%), Positives = 27/64 (42%), Gaps = 10/64 (15%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + +   + 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGR---VP 192

Query: 61  KSSL 64
             S+
Sbjct: 193 NQSV 196


>gi|303284124|ref|XP_003061353.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457704|gb|EEH55003.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 224

 Score = 42.0 bits (98), Expect = 0.025,   Method: Composition-based stats.
 Identities = 17/89 (19%), Positives = 33/89 (37%), Gaps = 6/89 (6%)

Query: 8   LIPNAKKSGIASLEIPKDTSDTIHMKIKVTAT--PQKGK---ANKAMLAMLAKKLALSKS 62
           + P    +   +LEI          KI           K   AN  ++  L K L L   
Sbjct: 136 IQPKTPGTVQIALEIEDRKKWRAITKITADEVGVAVNAKCAVANDEIVEFLGKTLHLRLP 195

Query: 63  SLRMLSKQSSPLKIIYIDK-DCKEITELL 90
            + +L+  S+  K++ +     +++ + L
Sbjct: 196 QMSLLAGWSARSKLLVVQGLTPQQVYDRL 224


>gi|212723588|ref|NP_001132046.1| hypothetical protein LOC100193457 [Zea mays]
 gi|194693286|gb|ACF80727.1| unknown [Zea mays]
          Length = 213

 Score = 39.7 bits (92), Expect = 0.13,   Method: Composition-based stats.
 Identities = 11/56 (19%), Positives = 24/56 (42%), Gaps = 7/56 (12%)

Query: 1   MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56
           +  V + +   A++S I  +           +++ V A   +G+AN  +L  + K 
Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKV 191


>gi|315122788|ref|YP_004063277.1| hypothetical protein CKC_05215 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313496190|gb|ADR52789.1| hypothetical protein CKC_05215 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 50

 Score = 38.9 bits (90), Expect = 0.21,   Method: Composition-based stats.
 Identities = 33/50 (66%), Positives = 43/50 (86%)

Query: 49 MLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98
          ML +LA++L+L+KSSL+MLSK SSP+K IYIDKDCKEI EL + N+ +TL
Sbjct: 1  MLTILAERLSLNKSSLKMLSKHSSPIKKIYIDKDCKEIIELFKRNNPVTL 50


>gi|302833257|ref|XP_002948192.1| hypothetical protein VOLCADRAFT_103816 [Volvox carteri f.
           nagariensis]
 gi|300266412|gb|EFJ50599.1| hypothetical protein VOLCADRAFT_103816 [Volvox carteri f.
           nagariensis]
          Length = 243

 Score = 38.6 bits (89), Expect = 0.26,   Method: Composition-based stats.
 Identities = 13/64 (20%), Positives = 30/64 (46%), Gaps = 1/64 (1%)

Query: 30  IHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEITE 88
             +++ VT        ++ +  + AK L +  S L +   +SS  +I+ ++    ++I E
Sbjct: 162 DVVRVHVTGLMANDAVHEELFDLFAKVLNVRLSQLDIRRAKSSRNRIMTVESLTPEQIYE 221

Query: 89  LLQN 92
            L+ 
Sbjct: 222 RLRE 225


>gi|304312491|ref|YP_003812089.1| tRNA pseudouridine synthase D [gamma proteobacterium HdN1]
 gi|301798224|emb|CBL46446.1| tRNA pseudouridine synthase D [gamma proteobacterium HdN1]
          Length = 352

 Score = 37.0 bits (85), Expect = 0.78,   Method: Composition-based stats.
 Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 7/60 (11%)

Query: 15 SGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPL 74
            +   E         H+ ++V A   +G+    ++  LA+   + +SS+   S Q    
Sbjct: 32 DEVLGFEPDG---SGEHLCVQVWA---RGQNTAWLVRQLAQWANVPRSSVSF-SGQKDRH 84


>gi|159465355|ref|XP_001690888.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158279574|gb|EDP05334.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 243

 Score = 37.0 bits (85), Expect = 0.84,   Method: Composition-based stats.
 Identities = 9/65 (13%), Positives = 30/65 (46%), Gaps = 1/65 (1%)

Query: 30  IHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEITE 88
             +++ VT        ++ +  +++K L +  S L +   + +  +I+ ++    +++ E
Sbjct: 162 DVVRVHVTGLMAGDAVHEELFDLISKVLNVRLSQLDIRRAKHNRNRIMTVEGLTPEQVFE 221

Query: 89  LLQNN 93
            L+  
Sbjct: 222 RLREQ 226


>gi|254522602|ref|ZP_05134657.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
 gi|219720193|gb|EED38718.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
          Length = 1475

 Score = 36.6 bits (84), Expect = 1.1,   Method: Composition-based stats.
 Identities = 10/55 (18%), Positives = 24/55 (43%), Gaps = 2/55 (3%)

Query: 11   NAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
             A  + +    + +D +     +I  T    +GKA+   + +L+ +L + K  + 
Sbjct: 974  KASVTAVERASVTRDEALGE--RINTTNAALEGKASTGSVQLLSSELGVQKGRID 1026


>gi|254523366|ref|ZP_05135421.1| p22 [Stenotrophomonas sp. SKA14]
 gi|219720957|gb|EED39482.1| p22 [Stenotrophomonas sp. SKA14]
          Length = 1651

 Score = 36.6 bits (84), Expect = 1.3,   Method: Composition-based stats.
 Identities = 10/54 (18%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 12   AKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
            A  + +    + +D++     +I  T    +GKA+   + +L+ +L + K  + 
Sbjct: 1151 ASVNAVEQASVSRDSALGE--RINTTNAALEGKASTGSVQLLSSELGVQKGRID 1202


>gi|254522586|ref|ZP_05134641.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
 gi|219720177|gb|EED38702.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
          Length = 1671

 Score = 36.6 bits (84), Expect = 1.3,   Method: Composition-based stats.
 Identities = 10/54 (18%), Positives = 25/54 (46%), Gaps = 2/54 (3%)

Query: 12   AKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
            A  + +    + +D++     +I  T    +GKA+   + +L+ +L + K  + 
Sbjct: 1171 ASVNAVEQASVSRDSALGE--RINTTNAALEGKASTGSVQLLSSELGVQKGRID 1222


>gi|254521915|ref|ZP_05133970.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
 gi|219719506|gb|EED38031.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14]
          Length = 1553

 Score = 36.2 bits (83), Expect = 1.5,   Method: Composition-based stats.
 Identities = 9/54 (16%), Positives = 23/54 (42%), Gaps = 2/54 (3%)

Query: 12   AKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65
            A  + +    + +D +    +    T    +GKA+   + +L+ +L + K  + 
Sbjct: 1076 ASVNAVDRASVSRDAALGERL--NTTDAALEGKASTGSVQLLSSELGVQKGRID 1127


>gi|255582001|ref|XP_002531798.1| conserved hypothetical protein [Ricinus communis]
 gi|223528564|gb|EEF30586.1| conserved hypothetical protein [Ricinus communis]
          Length = 176

 Score = 35.9 bits (82), Expect = 1.8,   Method: Composition-based stats.
 Identities = 4/24 (16%), Positives = 11/24 (45%)

Query: 56  KLALSKSSLRMLSKQSSPLKIIYI 79
            L + +  + + S   S  K++ +
Sbjct: 122 VLGVKRRQVSIRSGSKSRDKVVIV 145


>gi|296535729|ref|ZP_06897896.1| carbon-monoxide dehydrogenase large subunit [Roseomonas cervicalis
           ATCC 49957]
 gi|296263942|gb|EFH10400.1| carbon-monoxide dehydrogenase large subunit [Roseomonas cervicalis
           ATCC 49957]
          Length = 813

 Score = 35.5 bits (81), Expect = 2.4,   Method: Composition-based stats.
 Identities = 7/69 (10%), Positives = 20/69 (28%), Gaps = 18/69 (26%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            +R+ P    S            +T                      +++++L +  S +
Sbjct: 505 NIRVHPTGSISVFTGTHSHGQGHETTF------------------AQLVSEQLGVPLSQV 546

Query: 65  RMLSKQSSP 73
            ++   +S 
Sbjct: 547 EIVHGDTSK 555


>gi|103487614|ref|YP_617175.1| hypothetical protein Sala_2133 [Sphingopyxis alaskensis RB2256]
 gi|98977691|gb|ABF53842.1| protein of unknown function DUF167 [Sphingopyxis alaskensis
          RB2256]
          Length = 67

 Score = 35.5 bits (81), Expect = 2.7,   Method: Composition-based stats.
 Identities = 18/74 (24%), Positives = 34/74 (45%), Gaps = 8/74 (10%)

Query: 9  IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68
           P A+   +          +   + ++VT  P  G AN A+L +LA  L +    L ++ 
Sbjct: 2  TPGARSEAV--------RIEGGTVHLRVTVPPADGAANAAVLRLLAAALDVPPRDLTLIR 53

Query: 69 KQSSPLKIIYIDKD 82
            S+ +K+I I ++
Sbjct: 54 GASARIKLIGIARN 67


>gi|219850376|ref|YP_002464809.1| Xanthine dehydrogenase [Chloroflexus aggregans DSM 9485]
 gi|219544635|gb|ACL26373.1| Xanthine dehydrogenase [Chloroflexus aggregans DSM 9485]
          Length = 756

 Score = 35.5 bits (81), Expect = 2.8,   Method: Composition-based stats.
 Identities = 8/71 (11%), Positives = 23/71 (32%), Gaps = 9/71 (12%)

Query: 9   IPNAKKS-GIASLEIPKDTSD-------TIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60
            P +     I         +           +++ V A    G  N + + + A+ L + 
Sbjct: 445 TPGSGIGLAIGGWPCGMSPAAAVCRVDTDGTVRVHVGAVDISG-VNSSFVLVAAEILGVP 503

Query: 61  KSSLRMLSKQS 71
              + +++  +
Sbjct: 504 PEQVEIVAGDT 514


>gi|269837428|ref|YP_003319656.1| aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
           protein [Sphaerobacter thermophilus DSM 20745]
 gi|269786691|gb|ACZ38834.1| aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
           protein [Sphaerobacter thermophilus DSM 20745]
          Length = 785

 Score = 35.1 bits (80), Expect = 3.0,   Method: Composition-based stats.
 Identities = 8/69 (11%), Positives = 20/69 (28%), Gaps = 18/69 (26%)

Query: 5   IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64
            VR+ P+ K S            +T                      ++A++L +    +
Sbjct: 484 TVRVHPSGKVSVFTGSNPHGQGEETTF------------------AQLVAEELGVPLDDI 525

Query: 65  RMLSKQSSP 73
            ++   +  
Sbjct: 526 EIVHGDTGR 534


>gi|293607793|ref|ZP_06690123.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
           43553]
 gi|292813808|gb|EFF72959.1| conserved hypothetical protein [Achromobacter piechaudii ATCC
           43553]
          Length = 383

 Score = 35.1 bits (80), Expect = 3.0,   Method: Composition-based stats.
 Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 10/79 (12%)

Query: 6   VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +R+ P +   K+G+A +E+           I V      GKA   +  +LA+        
Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIDV-----DGKATDDVAKLLARLDDRKVGD 360

Query: 64  LRMLSKQ---SSPLKIIYI 79
           + +LS +    S    + +
Sbjct: 361 VVVLSVERAGKSREVRVEL 379


>gi|241662231|ref|YP_002980591.1| 2-alkenal reductase [Ralstonia pickettii 12D]
 gi|240864258|gb|ACS61919.1| 2-alkenal reductase [Ralstonia pickettii 12D]
          Length = 383

 Score = 35.1 bits (80), Expect = 3.4,   Method: Composition-based stats.
 Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 10/79 (12%)

Query: 6   VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +R+ P +   K+G+A +E+           I V      GKA   +  +LA+        
Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIDV-----DGKATDDVAKLLARLDDRKVGD 360

Query: 64  LRMLSKQ---SSPLKIIYI 79
           + +LS +    S    + +
Sbjct: 361 VVVLSVERAGKSREVRVEL 379


>gi|299136787|ref|ZP_07029970.1| isoleucyl-tRNA synthetase [Acidobacterium sp. MP5ACTX8]
 gi|298601302|gb|EFI57457.1| isoleucyl-tRNA synthetase [Acidobacterium sp. MP5ACTX8]
          Length = 959

 Score = 34.7 bits (79), Expect = 3.9,   Method: Composition-based stats.
 Identities = 11/61 (18%), Positives = 28/61 (45%), Gaps = 5/61 (8%)

Query: 29  TIHMKIKVTATPQKGKANKAML----AMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84
              ++  +    Q+G A+  +L      L +   +S++S++++   ++  K+I I     
Sbjct: 856 GKSLEASIQIMAQEGSADAILLVKYEEALPEFFNVSQASVQVV-GATNEDKVILIRATVA 914

Query: 85  E 85
           E
Sbjct: 915 E 915


>gi|161525478|ref|YP_001580490.1| 2-alkenal reductase [Burkholderia multivorans ATCC 17616]
 gi|189349793|ref|YP_001945421.1| putative trypsin-like serine protease [Burkholderia multivorans
           ATCC 17616]
 gi|160342907|gb|ABX15993.1| 2-alkenal reductase [Burkholderia multivorans ATCC 17616]
 gi|189333815|dbj|BAG42885.1| putative trypsin-like serine protease [Burkholderia multivorans
           ATCC 17616]
          Length = 383

 Score = 34.7 bits (79), Expect = 4.4,   Method: Composition-based stats.
 Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 10/79 (12%)

Query: 6   VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +R+ P +   K+G+A +E+           I V      GKA   +  +LA+        
Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIDV-----DGKATDDVAKLLARLDDRKVGD 360

Query: 64  LRMLSKQ---SSPLKIIYI 79
           + +LS +    S    + +
Sbjct: 361 VVVLSVERAGKSREVRVEL 379


>gi|119476064|ref|ZP_01616416.1| putative coenzyme F390 synthetase [marine gamma proteobacterium
           HTCC2143]
 gi|119450691|gb|EAW31925.1| putative coenzyme F390 synthetase [marine gamma proteobacterium
           HTCC2143]
          Length = 452

 Score = 34.3 bits (78), Expect = 5.6,   Method: Composition-based stats.
 Identities = 15/100 (15%), Positives = 34/100 (34%), Gaps = 23/100 (23%)

Query: 6   VRLIPNAKKSGIASLEIPKDTSDTIHMKIK--VTATPQK----------GKANKAMLAML 53
           V + PNA +  +         +  I +     V A P +          G A+ A+   L
Sbjct: 352 VNVFPNAVRDIVNKHSAQTTGNIRIMLPEPGPVAAPPIRVLVETHQRLQGAADVALCKEL 411

Query: 54  AKKLA---LSKSSLRMLS--------KQSSPLKIIYIDKD 82
           ++ +      ++ + + +          +   K++ I  D
Sbjct: 412 SELVHHHLRFRAKIELQAEGVFQVQTGATGKSKLVEIIGD 451


>gi|78358459|ref|YP_389908.1| DegP2 peptidase [Desulfovibrio desulfuricans subsp. desulfuricans
           str. G20]
 gi|78220864|gb|ABB40213.1| DegP2 peptidase, Serine peptidase, MEROPS family S01B
           [Desulfovibrio desulfuricans subsp. desulfuricans str.
           G20]
          Length = 383

 Score = 33.6 bits (76), Expect = 10.0,   Method: Composition-based stats.
 Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 8/78 (10%)

Query: 6   VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63
           +R+ P +   K+G+A +E+           I V      GKA   +  +LA+        
Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIGV-----DGKATDNVAKLLARLDDRKVGD 360

Query: 64  LRMLSKQS-SPLKIIYID 80
           + +LS +     + + ++
Sbjct: 361 VVVLSVERAGKTREVRVE 378


  Database: nr
    Posted date:  May 13, 2011  4:10 AM
  Number of letters in database: 999,999,932
  Number of sequences in database:  2,987,209
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 13, 2011  4:17 AM
  Number of letters in database: 999,998,956
  Number of sequences in database:  2,896,973
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 13, 2011  4:23 AM
  Number of letters in database: 999,999,979
  Number of sequences in database:  2,907,862
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 13, 2011  4:29 AM
  Number of letters in database: 999,999,513
  Number of sequences in database:  2,932,190
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 13, 2011  4:33 AM
  Number of letters in database: 792,586,372
  Number of sequences in database:  2,260,650
  
Lambda     K      H
   0.309    0.168    0.486 

Lambda     K      H
   0.267   0.0520    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,268,908,179
Number of Sequences: 13984884
Number of extensions: 68017790
Number of successful extensions: 147259
Number of sequences better than 10.0: 860
Number of HSP's better than 10.0 without gapping: 832
Number of HSP's successfully gapped in prelim test: 28
Number of HSP's that attempted gapping in prelim test: 145640
Number of HSP's gapped (non-prelim): 878
length of query: 98
length of database: 4,792,584,752
effective HSP length: 67
effective length of query: 31
effective length of database: 3,855,597,524
effective search space: 119523523244
effective search space used: 119523523244
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (20.8 bits)
S2: 76 (33.5 bits)