BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033057
         (128 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225431557|ref|XP_002282176.1| PREDICTED: UPF0235 protein C15orf40 homolog [Vitis vinifera]
 gi|147866968|emb|CAN83055.1| hypothetical protein VITISV_009894 [Vitis vinifera]
          Length = 128

 Score =  185 bits (470), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 94/128 (73%), Positives = 107/128 (83%), Gaps = 1/128 (0%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           MAP+KKGK+KA S GSTQS    N  + PSCIR VPPSSVSIT+HAKPGSK  SITD  D
Sbjct: 1   MAPSKKGKAKANSGGSTQSSDSVN-SSFPSCIRFVPPSSVSITVHAKPGSKVSSITDFDD 59

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EA+GVQIDAPAKDGEANAALL+Y+SSV+GVKRRQVSI SGSKSRDK+VIVEE+T + V +
Sbjct: 60  EALGVQIDAPAKDGEANAALLDYISSVVGVKRRQVSISSGSKSRDKVVIVEEVTLQGVFD 119

Query: 121 SLGKASSS 128
           +L K S S
Sbjct: 120 ALDKVSKS 127


>gi|30694498|ref|NP_175343.2| uncharacterized protein [Arabidopsis thaliana]
 gi|34365605|gb|AAQ65114.1| At1g49170 [Arabidopsis thaliana]
 gi|51971533|dbj|BAD44431.1| similar to serine/threonine kinase 9 gb|AAD28798.1 [Arabidopsis
           thaliana]
 gi|332194278|gb|AEE32399.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 126

 Score =  180 bits (457), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 91/126 (72%), Positives = 107/126 (84%), Gaps = 3/126 (2%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           MAP KKGK K KSA   +S   T   + P+C+RL+ PSSV+ITIHAKPGSK+ SITDVSD
Sbjct: 1   MAPTKKGK-KTKSAA--ESTTVTESSSFPTCLRLLTPSSVAITIHAKPGSKAASITDVSD 57

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EAVGVQIDAPA+DGEANAALLEYMSSVLGVKRRQVS+GSGSKSRDK+VIVE++T ++V  
Sbjct: 58  EAVGVQIDAPARDGEANAALLEYMSSVLGVKRRQVSLGSGSKSRDKVVIVEDMTQQSVFQ 117

Query: 121 SLGKAS 126
           +L +AS
Sbjct: 118 ALSQAS 123


>gi|351726281|ref|NP_001237633.1| uncharacterized protein LOC100527290 [Glycine max]
 gi|255632017|gb|ACU16361.1| unknown [Glycine max]
          Length = 126

 Score =  176 bits (446), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 90/126 (71%), Positives = 108/126 (85%), Gaps = 3/126 (2%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           M PAK+GK+KA+   +T S+ K ND   PSCIR VPPSSV+ITIHAKPGSKS S+TD+SD
Sbjct: 1   MPPAKRGKAKAEPKVTTPSE-KPND--FPSCIRSVPPSSVAITIHAKPGSKSASVTDISD 57

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EAVGVQIDAPA+DGEANAALL+Y+SSVLGVKRRQVS+G+GSKSRDK VIVE++T + V +
Sbjct: 58  EAVGVQIDAPARDGEANAALLDYISSVLGVKRRQVSLGTGSKSRDKTVIVEDVTQQYVFD 117

Query: 121 SLGKAS 126
           +L K S
Sbjct: 118 ALDKVS 123


>gi|351722711|ref|NP_001238533.1| uncharacterized protein LOC100499954 [Glycine max]
 gi|255627953|gb|ACU14321.1| unknown [Glycine max]
          Length = 126

 Score =  176 bits (445), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 90/126 (71%), Positives = 109/126 (86%), Gaps = 3/126 (2%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           M PAK+GK+KA+S  +T S+ K ND   PSCI+ VPPSSV+ITIHAKPG+KS SITD+SD
Sbjct: 1   MPPAKRGKAKAESKVTTPSE-KPND--FPSCIQSVPPSSVAITIHAKPGAKSASITDISD 57

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EAVGVQIDAPA+DGEANAALL+Y+SSVLGVKRRQVS+G+GSKSRDK VIVE++T + V +
Sbjct: 58  EAVGVQIDAPARDGEANAALLDYISSVLGVKRRQVSLGTGSKSRDKTVIVEDVTQQYVFD 117

Query: 121 SLGKAS 126
           +L K S
Sbjct: 118 ALDKVS 123


>gi|297852564|ref|XP_002894163.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297340005|gb|EFH70422.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 126

 Score =  175 bits (443), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 89/126 (70%), Positives = 106/126 (84%), Gaps = 3/126 (2%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           MAP KKGK K KSA   +S   T     P+C+RL+ PSSV+ITIHAKPGSK+ SITDVSD
Sbjct: 1   MAPTKKGK-KTKSAA--ESTTVTESSIFPACLRLLTPSSVAITIHAKPGSKAASITDVSD 57

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EAVGVQIDAPA+DGEANAALLE++SSVLGVKRRQVS+GSGSKSRDK+VIVE++T ++V  
Sbjct: 58  EAVGVQIDAPARDGEANAALLEFISSVLGVKRRQVSLGSGSKSRDKVVIVEDMTQQSVFQ 117

Query: 121 SLGKAS 126
           +L +AS
Sbjct: 118 ALSQAS 123


>gi|224131202|ref|XP_002328480.1| predicted protein [Populus trichocarpa]
 gi|222838195|gb|EEE76560.1| predicted protein [Populus trichocarpa]
          Length = 131

 Score =  174 bits (442), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 90/113 (79%), Positives = 100/113 (88%), Gaps = 2/113 (1%)

Query: 16  STQSK--IKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKD 73
           STQSK    +   N PSCIR VPPSSV+ITIHAKPGSKS SITD+SDEAVGVQIDAPAKD
Sbjct: 16  STQSKPTAPSPQNNFPSCIRAVPPSSVAITIHAKPGSKSASITDLSDEAVGVQIDAPAKD 75

Query: 74  GEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKAS 126
           GEANAALL+Y+SSVLGVKRRQVSIGSGSKSRDK+VIVEE+T +NV ++L KAS
Sbjct: 76  GEANAALLDYISSVLGVKRRQVSIGSGSKSRDKVVIVEEVTLQNVFDALEKAS 128


>gi|449467753|ref|XP_004151587.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cucumis sativus]
 gi|449513315|ref|XP_004164293.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cucumis sativus]
          Length = 156

 Score =  172 bits (435), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 90/126 (71%), Positives = 107/126 (84%), Gaps = 3/126 (2%)

Query: 1   MAPAKKGKSKA-KSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
           M PAKKGK+KA K+  S QS +K+N  N PSC+R V PSSV+ITIHAKPGSK  SITD  
Sbjct: 29  MPPAKKGKTKAPKATESIQSSVKSN--NYPSCLRSVSPSSVAITIHAKPGSKIASITDFG 86

Query: 60  DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVL 119
           D+A+GVQIDAPAKDGEANAALL+YMSSVLGVKRRQVSIGSGSKSR K+VIVE+++ ++V 
Sbjct: 87  DDALGVQIDAPAKDGEANAALLDYMSSVLGVKRRQVSIGSGSKSRGKVVIVEDVSLQSVF 146

Query: 120 NSLGKA 125
           ++L KA
Sbjct: 147 DALNKA 152


>gi|357464555|ref|XP_003602559.1| hypothetical protein MTR_3g095670 [Medicago truncatula]
 gi|355491607|gb|AES72810.1| hypothetical protein MTR_3g095670 [Medicago truncatula]
          Length = 125

 Score =  163 bits (413), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 79/100 (79%), Positives = 92/100 (92%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
           N+PSCIR +PPSSV+ITIHAKPGSKS SITDVSDEAVGVQIDAPA+DGEANAALL+Y+SS
Sbjct: 24  NIPSCIRCMPPSSVAITIHAKPGSKSASITDVSDEAVGVQIDAPARDGEANAALLDYISS 83

Query: 87  VLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKAS 126
           VLGVKRRQVS+G+GSKSRDK VIVE++T + V ++L K S
Sbjct: 84  VLGVKRRQVSLGTGSKSRDKRVIVEDVTQQYVFDALDKVS 123


>gi|226496211|ref|NP_001150562.1| LOC100284194 [Zea mays]
 gi|195640232|gb|ACG39584.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea
           mays]
          Length = 129

 Score =  154 bits (390), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 73/124 (58%), Positives = 98/124 (79%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           MAP K+GK+K  +         T     P C+RL+PPS+V+I+IHAKPGSK  +IT++ D
Sbjct: 1   MAPGKRGKAKGAAPPPPTXAASTGAVGFPGCLRLMPPSTVAISIHAKPGSKVATITEIGD 60

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EAVGVQIDAPA+DGEANAAL++++SSVLGVK+R+VSIGSGSKSR+K+V+V++ T E V +
Sbjct: 61  EAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLVQDATLEGVYD 120

Query: 121 SLGK 124
           +L K
Sbjct: 121 ALKK 124


>gi|297603606|ref|NP_001054326.2| Os04g0686300 [Oryza sativa Japonica Group]
 gi|215687220|dbj|BAG91785.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222629814|gb|EEE61946.1| hypothetical protein OsJ_16702 [Oryza sativa Japonica Group]
 gi|255675903|dbj|BAF16240.2| Os04g0686300 [Oryza sativa Japonica Group]
          Length = 129

 Score =  144 bits (364), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 69/128 (53%), Positives = 97/128 (75%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           MAP K+      +     +         P+C+RL+PPS+V+I+I AKPGSK  +IT++ D
Sbjct: 1   MAPGKQRGKAKGAPPPPAAPNAAAAGGFPACLRLMPPSTVAISIQAKPGSKLATITEIGD 60

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EAVGVQIDAPA+DGEANAAL++++SSVLGVK+R+VSIGSGSKSR+K+V+V++ T + V +
Sbjct: 61  EAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLVQDATLQGVFD 120

Query: 121 SLGKASSS 128
           +L KA +S
Sbjct: 121 ALKKACAS 128


>gi|38345823|emb|CAD41928.2| OSJNBa0070M12.6 [Oryza sativa Japonica Group]
          Length = 207

 Score =  144 bits (363), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 69/128 (53%), Positives = 97/128 (75%)

Query: 1   MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
           MAP K+      +     +         P+C+RL+PPS+V+I+I AKPGSK  +IT++ D
Sbjct: 79  MAPGKQRGKAKGAPPPPAAPNAAAAGGFPACLRLMPPSTVAISIQAKPGSKLATITEIGD 138

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EAVGVQIDAPA+DGEANAAL++++SSVLGVK+R+VSIGSGSKSR+K+V+V++ T + V +
Sbjct: 139 EAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLVQDATLQGVFD 198

Query: 121 SLGKASSS 128
           +L KA +S
Sbjct: 199 ALKKACAS 206


>gi|357166810|ref|XP_003580862.1| PREDICTED: UPF0235 protein C15orf40 homolog [Brachypodium
           distachyon]
          Length = 127

 Score =  144 bits (362), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 65/102 (63%), Positives = 89/102 (87%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
             P C+RL+PPS+V+I++HAKPGSK  +IT++ +EAVGVQIDAPA+DGEANAAL++++SS
Sbjct: 25  GFPRCLRLMPPSTVAISVHAKPGSKVATITEIGEEAVGVQIDAPARDGEANAALVDFISS 84

Query: 87  VLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKASSS 128
           VLGVK+R+VSIGSGSKSR+K+V+V+E T + V ++L KA  S
Sbjct: 85  VLGVKKREVSIGSGSKSREKVVLVQEATLQGVFDALKKACDS 126


>gi|326522380|dbj|BAK07652.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531514|dbj|BAJ97761.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 129

 Score =  142 bits (359), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 75/126 (59%), Positives = 100/126 (79%), Gaps = 1/126 (0%)

Query: 1   MAPAKKG-KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
           MAP K+G K+KAK A +            P C+RL+PPS+V+I+IHAKPGSK  +IT+V 
Sbjct: 1   MAPGKRGSKAKAKGAPAPAPAAAAAGGEFPGCLRLMPPSTVAISIHAKPGSKMATITEVG 60

Query: 60  DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVL 119
           +EAVGVQIDAPA+DGEANAAL++++SSVLGVK+R+VSIGSGSKSR+K+V+V++ T + V 
Sbjct: 61  EEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLVQDATLKGVF 120

Query: 120 NSLGKA 125
            +L KA
Sbjct: 121 EALKKA 126


>gi|90399179|emb|CAJ86041.1| H0723C07.11 [Oryza sativa Indica Group]
 gi|125550304|gb|EAY96126.1| hypothetical protein OsI_18003 [Oryza sativa Indica Group]
          Length = 125

 Score =  142 bits (357), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 65/98 (66%), Positives = 89/98 (90%)

Query: 31  CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGV 90
           C+RL+PPS+V+I+IHAKPGSK  +IT++ DEAVGVQIDAPA+DGEANAAL++++SSVLGV
Sbjct: 27  CLRLMPPSTVAISIHAKPGSKLATITEIGDEAVGVQIDAPARDGEANAALVDFISSVLGV 86

Query: 91  KRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKASSS 128
           K+R+VSIGSGSKSR+K+V+V++ T + V ++L KA +S
Sbjct: 87  KKREVSIGSGSKSREKVVLVQDATLQGVFDALKKACAS 124


>gi|242077752|ref|XP_002448812.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor]
 gi|241939995|gb|EES13140.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor]
          Length = 131

 Score =  142 bits (357), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 65/97 (67%), Positives = 87/97 (89%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
            P C+RL+PPS+V+I+IHAKPGSK  +IT++ DEAVGVQIDAPA+DGEANAAL++++SSV
Sbjct: 30  FPGCLRLMPPSTVAISIHAKPGSKVATITEIGDEAVGVQIDAPARDGEANAALVDFISSV 89

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGK 124
           LGVK+R+VSIGSGSKSR+K+V+V++ T E V ++L K
Sbjct: 90  LGVKKREVSIGSGSKSREKVVLVQDATLEGVYDALKK 126


>gi|326504128|dbj|BAK02850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 129

 Score =  140 bits (353), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 74/126 (58%), Positives = 99/126 (78%), Gaps = 1/126 (0%)

Query: 1   MAPAKKG-KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
           MAP K+G K+KAK A +            P C+ L+PPS+V+I+IHAKPGSK  +IT+V 
Sbjct: 1   MAPGKRGSKAKAKGAPAPAPAAAAAGGEFPGCLHLMPPSTVAISIHAKPGSKMATITEVG 60

Query: 60  DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVL 119
           +EAVGVQIDAPA+DGEANAAL++++SSVLGVK+R+VSIGSGSKSR+K+V+V++ T + V 
Sbjct: 61  EEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLVQDATLKGVF 120

Query: 120 NSLGKA 125
            +L KA
Sbjct: 121 EALKKA 126


>gi|168032813|ref|XP_001768912.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679824|gb|EDQ66266.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 123

 Score =  135 bits (339), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 66/119 (55%), Positives = 90/119 (75%)

Query: 9   SKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQID 68
           SK  S+G  ++        +P CIR +  S+V+IT+HAKPGSK  +ITD  D AVGVQID
Sbjct: 5   SKGASSGKPEAVATAEKSRIPGCIRSLADSTVAITVHAKPGSKLSAITDTDDGAVGVQID 64

Query: 69  APAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKASS 127
           APA++GEANAALLEY++ VLG+KRRQVS+GSGS+SR+K+V VE +T + V  ++ +AS+
Sbjct: 65  APAREGEANAALLEYIAEVLGIKRRQVSLGSGSRSREKLVTVEGLTVDKVYEAIRRAST 123


>gi|223975045|gb|ACN31710.1| unknown [Zea mays]
          Length = 95

 Score =  130 bits (326), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 61/90 (67%), Positives = 82/90 (91%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           +PPS+V+I+IHAKPGSK  +IT++ DEAVGVQIDAPA+DGEANAAL++++SSVLGVK+R+
Sbjct: 1   MPPSTVAISIHAKPGSKVATITEIGDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 60

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSLGK 124
           VSIGSGSKSR+K+V+V++ T E V ++L K
Sbjct: 61  VSIGSGSKSREKVVLVQDATLEGVYDALKK 90


>gi|7770327|gb|AAF69697.1|AC016041_2 F27J15.6 [Arabidopsis thaliana]
          Length = 91

 Score =  124 bits (312), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 66/94 (70%), Positives = 76/94 (80%), Gaps = 5/94 (5%)

Query: 1  MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
          MAP KKGK K KSA   +S   T   + P+C+RL+ PSSV+ITIHAKPGSK+ SITDVSD
Sbjct: 1  MAPTKKGK-KTKSAA--ESTTVTESSSFPTCLRLLTPSSVAITIHAKPGSKAASITDVSD 57

Query: 61 EAVGVQIDAPAKDGEANAALLEYMSSV--LGVKR 92
          EAVGVQIDAPA+DGEANAALLEYMSSV  L ++R
Sbjct: 58 EAVGVQIDAPARDGEANAALLEYMSSVKFLFLRR 91


>gi|296088594|emb|CBI37585.3| unnamed protein product [Vitis vinifera]
          Length = 89

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 54/75 (72%), Positives = 64/75 (85%)

Query: 54  SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEI 113
           +I    DEA+GVQIDAPAKDGEANAALL+Y+SSV+GVKRRQVSI SGSKSRDK+VIVEE+
Sbjct: 14  TIDYFDDEALGVQIDAPAKDGEANAALLDYISSVVGVKRRQVSISSGSKSRDKVVIVEEV 73

Query: 114 TPENVLNSLGKASSS 128
           T + V ++L K S S
Sbjct: 74  TLQGVFDALDKVSKS 88


>gi|255634374|gb|ACU17552.1| unknown [Glycine max]
          Length = 77

 Score =  107 bits (267), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 52/64 (81%), Positives = 58/64 (90%), Gaps = 2/64 (3%)

Query: 22 KTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALL 81
          K ND   PSCIR VPPSSV+ITIHAKPGSKS S+TD+SDEAVGVQIDAPA+DGEANAALL
Sbjct: 2  KPND--FPSCIRSVPPSSVAITIHAKPGSKSASVTDISDEAVGVQIDAPARDGEANAALL 59

Query: 82 EYMS 85
          +Y+S
Sbjct: 60 DYIS 63


>gi|414586173|tpg|DAA36744.1| TPA: COG1872 containing protein, Uncharacterized ACR, YggU
          family, partial [Zea mays]
          Length = 86

 Score =  100 bits (250), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/86 (56%), Positives = 66/86 (76%)

Query: 1  MAPAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD 60
          MAP K+GK+K  +     +   T     P C+RL+PPS+V+I+IHAKPGSK  +IT++ D
Sbjct: 1  MAPGKRGKAKGAAPPPPTAAASTGAVGFPGCLRLMPPSTVAISIHAKPGSKVATITEIGD 60

Query: 61 EAVGVQIDAPAKDGEANAALLEYMSS 86
          EAVGVQIDAPA+DGEANAAL++++SS
Sbjct: 61 EAVGVQIDAPARDGEANAALVDFISS 86


>gi|440790394|gb|ELR11677.1| hypothetical protein ACA1_260830 [Acanthamoeba castellanii str.
           Neff]
          Length = 116

 Score = 99.0 bits (245), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/95 (47%), Positives = 67/95 (70%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           LP CI      +V +T+H KP +K  ++TD+S EA+GV + AP +DGEANA L+++M+ V
Sbjct: 13  LPQCIGTTKEGNVKLTVHVKPNAKISAVTDMSSEAIGVALAAPPRDGEANAELVDFMAGV 72

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           L  +++QV + SGSKSRDK+++V  +TPE V   L
Sbjct: 73  LEARKKQVELVSGSKSRDKVLLVTAMTPEQVHEKL 107


>gi|322792209|gb|EFZ16226.1| hypothetical protein SINV_80163 [Solenopsis invicta]
          Length = 122

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 49/103 (47%), Positives = 70/103 (67%), Gaps = 2/103 (1%)

Query: 22  KTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAA 79
           KT   N+PS   + L    +V+I I AKPG+K  +ITD+SDEAVG+ I AP  +GEANA 
Sbjct: 14  KTASTNVPSTGPVILDKDGNVAIKIQAKPGAKCNNITDISDEAVGIAISAPPTEGEANAE 73

Query: 80  LLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           L++Y++S  GV++  V++  GS+SR K+V+V  IT + VL  L
Sbjct: 74  LVKYLASTFGVRKSDVTLDRGSRSRQKVVVVSGITTDQVLTKL 116


>gi|259089201|ref|NP_001158638.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss]
 gi|225705490|gb|ACO08591.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss]
          Length = 182

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 48/89 (53%), Positives = 65/89 (73%), Gaps = 1/89 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I++HAKPGSK  +ITDVS EAVGV I AP   GEANA L+ Y+S VL +KR +V +  
Sbjct: 93  VTISVHAKPGSKQNAITDVSIEAVGVAIAAPPTGGEANAELVRYLSKVLELKRSEVVLDK 152

Query: 100 GSKSRDKIV-IVEEITPENVLNSLGKASS 127
           GS+SR+KI+ +   +TPE VL+ L + +S
Sbjct: 153 GSRSREKIIKVTGSLTPEQVLDRLKQEAS 181


>gi|332025796|gb|EGI65953.1| UPF0235 protein C15orf40-like protein [Acromyrmex echinatior]
          Length = 139

 Score = 91.7 bits (226), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 46/100 (46%), Positives = 71/100 (71%), Gaps = 1/100 (1%)

Query: 23  TNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLE 82
           T+D ++ S + L    +V+I I AKPG+K  ++TD+SDEA+G+ I AP  +GEANA L++
Sbjct: 35  TDDPSMGSVV-LNKDGNVAIKIQAKPGAKCNNVTDISDEAIGIAISAPPTEGEANAELVK 93

Query: 83  YMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           Y++S+ GV++  VS+  GS+SR K+VIV  I+ + VL  L
Sbjct: 94  YLASIFGVRKSDVSLDRGSRSRQKVVIVSGISTDQVLTKL 133


>gi|348523449|ref|XP_003449236.1| PREDICTED: UPF0235 protein C15orf40 homolog [Oreochromis niloticus]
          Length = 184

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 46/88 (52%), Positives = 64/88 (72%), Gaps = 1/88 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+I +HAKPGSK  SIT+VS EAV V I AP  DGEANA L+ Y++ VL +K+  +S+ 
Sbjct: 95  AVTIRVHAKPGSKQSSITEVSTEAVSVAISAPPTDGEANAELIRYLAEVLDLKKSHISLD 154

Query: 99  SGSKSRDKIVIVE-EITPENVLNSLGKA 125
            GS+SRDK++ V+  ++PE VL  L +A
Sbjct: 155 KGSRSRDKVIKVDSSLSPEEVLRKLREA 182


>gi|340709930|ref|XP_003393552.1| PREDICTED: UPF0235 protein C15orf40 homolog [Bombus terrestris]
          Length = 144

 Score = 89.7 bits (221), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 65/91 (71%)

Query: 32  IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVK 91
           I L    +V+I I AKPG+K  +ITD+S++AVGV I AP  +GEAN  L++Y++S+LG++
Sbjct: 48  ISLDKNGNVTIKIQAKPGAKHNNITDISEDAVGVAISAPPVEGEANTELVKYLASILGMR 107

Query: 92  RRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           +  VS+  GSKSR K++IV  IT E VL  L
Sbjct: 108 KSDVSLDRGSKSRHKVIIVSGITVEKVLEKL 138


>gi|148236990|ref|NP_001089221.1| uncharacterized protein LOC734268 [Xenopus laevis]
 gi|57920974|gb|AAH89152.1| MGC85153 protein [Xenopus laevis]
          Length = 121

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 49/89 (55%), Positives = 65/89 (73%), Gaps = 1/89 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           SV+I+IHAKPG+K  +ITDV+ +AVGV I AP  +GEANA L  Y+S VL +K+ +VS+ 
Sbjct: 30  SVTISIHAKPGAKQNAITDVTADAVGVAIAAPPTEGEANAELCRYLSKVLVLKKSEVSLD 89

Query: 99  SGSKSRDKIV-IVEEITPENVLNSLGKAS 126
            G KSR+K+V I   ITPE VL  L +A+
Sbjct: 90  KGGKSREKVVKISASITPEVVLERLKEAA 118


>gi|328780498|ref|XP_392249.2| PREDICTED: UPF0235 protein C15orf40 homolog [Apis mellifera]
          Length = 144

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 44/84 (52%), Positives = 63/84 (75%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+I I AKPG+K  +ITD+S++AVGV I AP  +GEAN  L++Y++SVLG+++  V++ 
Sbjct: 55  NVTIKIQAKPGAKHNNITDISEDAVGVAISAPPVEGEANTELVKYLASVLGMRKSDVTLD 114

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            GSKSR KIV+V  I+ E VL  L
Sbjct: 115 RGSKSRQKIVVVSGISVEKVLEKL 138


>gi|432862407|ref|XP_004069840.1| PREDICTED: UPF0235 protein C15orf40 homolog [Oryzias latipes]
          Length = 169

 Score = 88.2 bits (217), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 42/88 (47%), Positives = 64/88 (72%), Gaps = 1/88 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V++++H KPGSK  +ITDVS EAVGV + AP  DGEAN  L+ Y++ VL +K+ Q+S+ 
Sbjct: 80  AVTVSVHVKPGSKQSAITDVSAEAVGVSVGAPPTDGEANTELIRYLADVLDLKKSQISLS 139

Query: 99  SGSKSRDKIVIVE-EITPENVLNSLGKA 125
            GS+SRDK++ V+  ++ + VL  L +A
Sbjct: 140 KGSRSRDKLIRVDSSLSQDEVLRRLQEA 167


>gi|383852310|ref|XP_003701671.1| PREDICTED: UPF0235 protein C15orf40 homolog [Megachile rotundata]
          Length = 144

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 62/84 (73%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+I I AKPG+K  +ITD+S++AVGV I AP  +GEAN  L++Y++SVLG+++  VS+ 
Sbjct: 55  NVTIKIQAKPGAKHNNITDISEDAVGVAISAPPVEGEANTELVKYLASVLGMRKSDVSLD 114

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            GSKSR K+++V   T E VL  L
Sbjct: 115 RGSKSRQKVILVSGTTVEKVLEKL 138


>gi|52345734|ref|NP_001004913.1| chromosome 15 open reading frame 40 [Xenopus (Silurana) tropicalis]
 gi|49523245|gb|AAH75357.1| MGC89060 protein [Xenopus (Silurana) tropicalis]
          Length = 120

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/89 (55%), Positives = 64/89 (71%), Gaps = 1/89 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           SV I+IHAKPG+K  +ITDV+ +AVGV I AP  +GEANA L  Y+S VL +K+ +VS+ 
Sbjct: 29  SVIISIHAKPGAKQNAITDVTADAVGVAIAAPPTEGEANAELCRYLSKVLVLKKSEVSLD 88

Query: 99  SGSKSRDKIV-IVEEITPENVLNSLGKAS 126
            G KSR+K+V I   ITPE VL  L +A+
Sbjct: 89  KGGKSREKVVKISASITPEVVLEKLKEAA 117


>gi|350399290|ref|XP_003485480.1| PREDICTED: UPF0235 protein C15orf40 homolog [Bombus impatiens]
          Length = 144

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/91 (49%), Positives = 64/91 (70%)

Query: 32  IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVK 91
           I L    +V+I I AKPG+K  +ITD+S++AVGV I AP  +GEAN  L++Y++S+LG++
Sbjct: 48  ISLDKNGNVTIKIQAKPGAKHNNITDISEDAVGVAISAPPVEGEANTELVKYLASILGMR 107

Query: 92  RRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           +  VS+  GSKSR K++IV   T E VL  L
Sbjct: 108 KSDVSLDRGSKSRHKVIIVSGTTVEKVLEKL 138


>gi|380028163|ref|XP_003697778.1| PREDICTED: UPF0235 protein C15orf40 homolog [Apis florea]
          Length = 144

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 51/115 (44%), Positives = 73/115 (63%), Gaps = 2/115 (1%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
           K  +K  G  Q    T +E +   I +    +V+I I AKPG+K  +ITD+S++AVGV I
Sbjct: 26  KQGSKKTGKNQKA--TVNEEVKGPISIDKNGNVAIKIQAKPGAKHNNITDISEDAVGVAI 83

Query: 68  DAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
            AP  +GEAN  L++Y++SVLG+K+  V++  GSKSR KIV+V   + E VL  L
Sbjct: 84  SAPPVEGEANTELVKYIASVLGMKKSDVTLDRGSKSRQKIVVVSGTSVEKVLEKL 138


>gi|363737754|ref|XP_413838.2| PREDICTED: UPF0235 protein C15orf40 homolog [Gallus gallus]
          Length = 140

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/90 (51%), Positives = 63/90 (70%), Gaps = 1/90 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V +++ AKPGS+  ++TDV+ EAVGV I AP  +GEANA L  Y+S VLGVK+  V +  
Sbjct: 51  VRVSVRAKPGSRCSAVTDVTAEAVGVAIAAPPSEGEANAELCRYLSKVLGVKKSDVILEK 110

Query: 100 GSKSRDKIV-IVEEITPENVLNSLGKASSS 128
           G KSRDK+V I+  +TP+ VL  L K +S+
Sbjct: 111 GGKSRDKVVKILVSVTPDEVLEKLKKEAST 140


>gi|307102424|gb|EFN50700.1| hypothetical protein CHLNCDRAFT_142618 [Chlorella variabilis]
          Length = 127

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/99 (46%), Positives = 65/99 (65%), Gaps = 1/99 (1%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
            LP+ I    P+S  I +HAKPGSK CS+T +  +A+ V +DA   DGEANAAL+E+++ 
Sbjct: 30  GLPTFITPTGPASCRIGVHAKPGSKVCSVT-LGPDALEVAVDAKPVDGEANAALIEFVAE 88

Query: 87  VLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           VLG+KRR V++ SG+ SR K++ V  I     L  L +A
Sbjct: 89  VLGLKRRDVTLASGTTSRHKVLAVAGIDAHAALQRLRQA 127


>gi|327289069|ref|XP_003229247.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Anolis
           carolinensis]
          Length = 119

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 45/87 (51%), Positives = 60/87 (68%), Gaps = 1/87 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           SV+I +HAKPGSK  ++TD+S EAVG+ I AP  DGEANA L  Y+S VL V++    + 
Sbjct: 29  SVTIAVHAKPGSKQNAVTDLSAEAVGIAIAAPPSDGEANAELCRYLSKVLEVRKSSSLLK 88

Query: 99  SGSKSRDKIV-IVEEITPENVLNSLGK 124
            G +SR+K+V I+  +TPE VL  L K
Sbjct: 89  QGGRSREKLVKILAPLTPEEVLQKLRK 115


>gi|13385576|ref|NP_080353.1| UPF0235 protein C15orf40 homolog [Mus musculus]
 gi|29839616|sp|Q9CRC3.1|CO040_MOUSE RecName: Full=UPF0235 protein C15orf40 homolog
 gi|12851848|dbj|BAB29184.1| unnamed protein product [Mus musculus]
 gi|12857526|dbj|BAB31031.1| unnamed protein product [Mus musculus]
 gi|12859380|dbj|BAB31634.1| unnamed protein product [Mus musculus]
          Length = 126

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 53/119 (44%), Positives = 70/119 (58%), Gaps = 7/119 (5%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSCIRLV---PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
           K+ A S G  Q+K     E  P     V   P   V+I IHAKPGS+  ++TD+S EAVG
Sbjct: 4   KAGATSKGKNQTK---EPETAPPAAGPVATDPKGFVTIAIHAKPGSRQNAVTDLSTEAVG 60

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
           V I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE VL  L
Sbjct: 61  VAIAAPPSEGEANAELCRYLSKVLDLRKSDVVLDKGGKSREKVVKLLASTTPEEVLEKL 119


>gi|328717531|ref|XP_003246233.1| PREDICTED: UPF0235 protein C15orf40 homolog [Acyrthosiphon pisum]
          Length = 136

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 39/72 (54%), Positives = 58/72 (80%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V I I+AKPG+K+ +ITD+S + +GVQI+AP  DGEANA L++Y+S VLG+++  +S+  
Sbjct: 45  VVIKINAKPGAKNNNITDISSDGIGVQINAPPTDGEANAELIKYLSKVLGLRKSDLSLDR 104

Query: 100 GSKSRDKIVIVE 111
           GS+SR+KI+IV 
Sbjct: 105 GSRSRNKILIVH 116


>gi|344284100|ref|XP_003413808.1| PREDICTED: UPF0235 protein C15orf40-like [Loxodonta africana]
          Length = 154

 Score = 85.9 bits (211), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 76/124 (61%), Gaps = 6/124 (4%)

Query: 7   GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
           G    K AG+T   +S+ K ++  LPS   + + P   V+I IHAKPGSK  ++TD++ E
Sbjct: 26  GAEMPKKAGTTNKGKSQSKESERPLPSLGPVGVDPKGCVTIAIHAKPGSKQNAVTDLTAE 85

Query: 62  AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLN 120
           AV + I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V I+   TPE +L 
Sbjct: 86  AVNIAIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGVKSREKVVKILASTTPEEILE 145

Query: 121 SLGK 124
            L K
Sbjct: 146 KLKK 149


>gi|431920276|gb|ELK18311.1| hypothetical protein PAL_GLEAN10009540 [Pteropus alecto]
          Length = 154

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 74/124 (59%), Gaps = 6/124 (4%)

Query: 7   GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
           G    K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  +ITDV+ E
Sbjct: 26  GTEMPKKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAITDVTAE 85

Query: 62  AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLN 120
           AV V I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +L 
Sbjct: 86  AVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEILE 145

Query: 121 SLGK 124
            L K
Sbjct: 146 KLEK 149


>gi|386781826|ref|NP_001248190.1| uncharacterized protein LOC718713 [Macaca mulatta]
 gi|380809518|gb|AFE76634.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
 gi|380809520|gb|AFE76635.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
 gi|380809522|gb|AFE76636.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
 gi|380809524|gb|AFE76637.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
 gi|380809526|gb|AFE76638.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
 gi|380809528|gb|AFE76639.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
 gi|380809530|gb|AFE76640.1| hypothetical protein LOC123207 isoform a [Macaca mulatta]
          Length = 154

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 73/119 (61%), Gaps = 6/119 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +   P    + + P   V+ITIHAKPGSK  ++TD++ EAV V 
Sbjct: 31  KKAGATTKGKSQSKEPERPFPPLGPVAVDPKGCVTITIHAKPGSKQNAVTDLTAEAVNVA 90

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +L  L K
Sbjct: 91  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEILEKLKK 149


>gi|402875136|ref|XP_003901372.1| PREDICTED: UPF0235 protein C15orf40 homolog [Papio anubis]
          Length = 154

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 51/119 (42%), Positives = 73/119 (61%), Gaps = 6/119 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +   P    + + P   V+ITIHAKPGSK  ++TD++ EAV V 
Sbjct: 31  KKAGATTKGKSQSKEPERPFPPLGPVAVDPKGCVTITIHAKPGSKQNAVTDLTAEAVNVA 90

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +L  L K
Sbjct: 91  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEILEKLKK 149


>gi|444722140|gb|ELW62843.1| BTB/POZ domain-containing protein 1 [Tupaia chinensis]
          Length = 441

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 60/90 (66%), Gaps = 1/90 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGSK  ++TD++ EAV V I AP  +GEANA L  Y+S VL V++  V
Sbjct: 347 PKGCVTIAIHAKPGSKQNAVTDLTTEAVNVAIAAPPTEGEANAELCRYLSKVLEVRKSDV 406

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            +  G KSR+K+V ++   TPE VL+ L K
Sbjct: 407 VLDKGGKSREKVVKLLASTTPEEVLDKLKK 436


>gi|20381446|gb|AAH27500.1| RIKEN cDNA 3110040N11 gene [Mus musculus]
 gi|74206730|dbj|BAE41614.1| unnamed protein product [Mus musculus]
 gi|148674975|gb|EDL06922.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus]
 gi|148674977|gb|EDL06924.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus]
          Length = 126

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 53/119 (44%), Positives = 69/119 (57%), Gaps = 7/119 (5%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSCIRLV---PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
           K+ A S G  Q+K     E  P     V   P   V+I IHAKPGS+  ++TD+S EAVG
Sbjct: 4   KAGATSKGKNQTK---EPETAPPAAGPVATDPKGFVTIAIHAKPGSRQNAVTDLSTEAVG 60

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
           V I AP   GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE VL  L
Sbjct: 61  VAIAAPPSQGEANAELCRYLSKVLDLRKSDVVLDKGGKSREKVVKLLASTTPEEVLEKL 119


>gi|355692948|gb|EHH27551.1| hypothetical protein EGK_17775 [Macaca mulatta]
 gi|355778257|gb|EHH63293.1| hypothetical protein EGM_16229 [Macaca fascicularis]
          Length = 127

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 60/90 (66%), Gaps = 1/90 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+ITIHAKPGSK  ++TD++ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 33  PKGCVTITIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDV 92

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            +  G KSR+K+V ++   TPE +L  L K
Sbjct: 93  VLDKGGKSREKVVKLLASTTPEEILEKLKK 122


>gi|311260654|ref|XP_001929217.2| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Sus scrofa]
          Length = 154

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 51/124 (41%), Positives = 73/124 (58%), Gaps = 6/124 (4%)

Query: 7   GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
           G    K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ E
Sbjct: 26  GAEMPKKAGATNKGKSQSKEQERPLPPLGPVTVDPKGCVTIAIHAKPGSKQNAVTDLTTE 85

Query: 62  AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLN 120
           AV V I AP  +GEANA L  Y+S V  +++  V +  G KSR+K+V ++   TPE +L 
Sbjct: 86  AVSVAIAAPPSEGEANAELCRYLSKVFELRKSDVVLDKGGKSREKVVKLLASTTPEEILE 145

Query: 121 SLGK 124
            L K
Sbjct: 146 KLKK 149


>gi|67078514|ref|NP_001019920.1| UPF0235 protein C15orf40 homolog [Rattus norvegicus]
 gi|392341258|ref|XP_003754291.1| PREDICTED: UPF0235 protein C15orf40 homolog [Rattus norvegicus]
 gi|392349088|ref|XP_003750283.1| PREDICTED: UPF0235 protein C15orf40 homolog [Rattus norvegicus]
 gi|81908918|sp|Q505I4.1|CO040_RAT RecName: Full=UPF0235 protein C15orf40 homolog
 gi|63100368|gb|AAH94529.1| Similar to RIKEN cDNA 3110040N11 [Rattus norvegicus]
 gi|149044058|gb|EDL97440.1| rCG63322 [Rattus norvegicus]
 gi|149057383|gb|EDM08706.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus]
 gi|149057387|gb|EDM08710.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus]
          Length = 126

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/116 (42%), Positives = 67/116 (57%), Gaps = 1/116 (0%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
           K+ A S G  Q+K           +       V+I IHAKPGSK  ++TD++ EAVGV I
Sbjct: 4   KAGATSKGKNQTKEPETPPPPTGPVATDSKGFVTIAIHAKPGSKQNAVTDLNTEAVGVAI 63

Query: 68  DAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
            AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE VL  L
Sbjct: 64  AAPPSEGEANAELCRYLSKVLDLRKSDVVLDKGGKSREKVVKLLASTTPEEVLEKL 119


>gi|395747069|ref|XP_002825813.2| PREDICTED: UPF0235 protein C15orf40 homolog [Pongo abelii]
          Length = 147

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 75/122 (61%), Gaps = 6/122 (4%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V 
Sbjct: 24  KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 83

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSLGKA 125
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +L  L K 
Sbjct: 84  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEILEKLKKE 143

Query: 126 SS 127
           ++
Sbjct: 144 AT 145


>gi|307176568|gb|EFN66055.1| UPF0235 protein C15orf40-like protein [Camponotus floridanus]
          Length = 122

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/115 (42%), Positives = 73/115 (63%), Gaps = 1/115 (0%)

Query: 9   SKAKSAGSTQSKIKTNDENLPSC-IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
           SK +SA + + +  T  +N  +  + L    +V I I AKPG+K  +ITD+SDEAVG+ I
Sbjct: 2   SKQRSAKNEKMRKGTTTDNQSTGPVVLNKDGNVVIKIQAKPGAKCNNITDISDEAVGIAI 61

Query: 68  DAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
            AP  +GEANA L++Y++S+  +++  VS+  GS+SR K V V  IT + VL  L
Sbjct: 62  SAPPMEGEANAELVKYLASIFELRKSNVSLDRGSRSRQKTVTVSGITTDQVLAKL 116


>gi|149057384|gb|EDM08707.1| similar to RIKEN cDNA 3110040N11, isoform CRA_d [Rattus norvegicus]
          Length = 181

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 43/84 (51%), Positives = 58/84 (69%), Gaps = 1/84 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I IHAKPGSK  ++TD++ EAVGV I AP  +GEANA L  Y+S VL +++  V +  
Sbjct: 91  VTIAIHAKPGSKQNAVTDLNTEAVGVAIAAPPSEGEANAELCRYLSKVLDLRKSDVVLDK 150

Query: 100 GSKSRDKIV-IVEEITPENVLNSL 122
           G KSR+K+V ++   TPE VL  L
Sbjct: 151 GGKSREKVVKLLASTTPEEVLEKL 174


>gi|395822693|ref|XP_003784647.1| PREDICTED: UPF0235 protein C15orf40 homolog [Otolemur garnettii]
          Length = 154

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/116 (38%), Positives = 73/116 (62%), Gaps = 5/116 (4%)

Query: 12  KSAGSTQSKIKTNDENLP----SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
           K++ +++ K ++N+   P    + + + P   V+I IHAKPGSK  ++TD++ EAV V I
Sbjct: 32  KASATSKGKCQSNEPERPLPPLAPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVAI 91

Query: 68  DAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
            AP  +GEANA L  Y+S +L +++  V +  G KSR+K+V ++   TPE +L  L
Sbjct: 92  AAPPSEGEANAELCRYLSKILELRKSDVVLDKGGKSREKVVKVLASTTPEEILEKL 147


>gi|395502265|ref|XP_003755502.1| PREDICTED: UPF0235 protein C15orf40 homolog [Sarcophilus harrisii]
          Length = 147

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/85 (51%), Positives = 58/85 (68%), Gaps = 1/85 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S++I IHAKPGSK  +ITDV+ E V V I AP  +GEANA L  Y+S VL +++  V + 
Sbjct: 56  SITIAIHAKPGSKQNAITDVTTENVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVILD 115

Query: 99  SGSKSRDKIV-IVEEITPENVLNSL 122
            G KSR+K+V I+  ITPE +L  L
Sbjct: 116 KGGKSREKVVKILASITPEEILEKL 140


>gi|115495117|ref|NP_001068854.1| UPF0235 protein C15orf40 homolog [Bos taurus]
 gi|122140809|sp|Q3ZBP8.1|CO040_BOVIN RecName: Full=UPF0235 protein C15orf40 homolog
 gi|73587092|gb|AAI03180.1| Chromosome 15 open reading frame 40 ortholog [Bos taurus]
 gi|296475539|tpg|DAA17654.1| TPA: hypothetical protein LOC509050 [Bos taurus]
          Length = 126

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/120 (43%), Positives = 70/120 (58%), Gaps = 5/120 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K  A + G  QSK    +  LP    + + P   VSI IHAKPGSK  ++TDV+ EAV V
Sbjct: 4   KCGATNKGKNQSK--EPERPLPPLGPVTVDPKGGVSIAIHAKPGSKQNAVTDVTTEAVSV 61

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++    PE +L  L K
Sbjct: 62  AIAAPPTEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTPPEEILEKLKK 121


>gi|426248144|ref|XP_004017825.1| PREDICTED: UPF0235 protein C15orf40 homolog [Ovis aries]
          Length = 126

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 52/120 (43%), Positives = 70/120 (58%), Gaps = 5/120 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K  A + G  QSK    +  LP    + + P   VSI IHAKPGSK  ++TDV+ EAV V
Sbjct: 4   KGGATNKGKNQSK--EPERPLPPLGPVTVDPKGGVSIAIHAKPGSKQNAVTDVTAEAVSV 61

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++    PE +L  L K
Sbjct: 62  AIAAPPTEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTPPEEILEKLKK 121


>gi|354499521|ref|XP_003511857.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cricetulus griseus]
          Length = 126

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 57/84 (67%), Gaps = 1/84 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGSK  ++TD++ EAVGV I AP  +GEANA L  Y+S VL +++  V
Sbjct: 32  PKGCVTIAIHAKPGSKQNAVTDLTTEAVGVAIAAPPSEGEANAELCRYLSKVLELRKSDV 91

Query: 96  SIGSGSKSRDKIV-IVEEITPENV 118
            +  G KSR+K+V ++   TPE V
Sbjct: 92  VLDKGGKSREKVVKLLASTTPEEV 115


>gi|440913185|gb|ELR62667.1| hypothetical protein M91_21008, partial [Bos grunniens mutus]
          Length = 116

 Score = 82.0 bits (201), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 58/90 (64%), Gaps = 1/90 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   VSI IHAKPGSK  ++TDV+ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 22  PKGGVSIAIHAKPGSKQNAVTDVTTEAVSVAIAAPPTEGEANAELCRYLSKVLELRKSDV 81

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            +  G KSR+K+V ++    PE +L  L K
Sbjct: 82  VLDKGGKSREKVVKLLASTPPEEILEKLKK 111


>gi|157743130|gb|AAI49509.1| C21H15orf40 protein [Bos taurus]
          Length = 132

 Score = 82.0 bits (201), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 58/90 (64%), Gaps = 1/90 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   VSI IHAKPGSK  ++TDV+ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 38  PKGGVSIAIHAKPGSKQNAVTDVTTEAVSVAIAAPPTEGEANAELCRYLSKVLELRKSDV 97

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            +  G KSR+K+V ++    PE +L  L K
Sbjct: 98  VLDKGGKSREKVVKLLASTPPEEILEKLKK 127


>gi|126273662|ref|XP_001365632.1| PREDICTED: UPF0235 protein C15orf40 homolog [Monodelphis domestica]
          Length = 146

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 58/90 (64%), Gaps = 1/90 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P  S++I IHAKPGSK  +ITDV+ E V V I AP  +GEAN  L  Y+S VL +++  V
Sbjct: 52  PSGSITIAIHAKPGSKQNAITDVTTENVSVAIAAPPSEGEANTELCRYLSKVLELRKSDV 111

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            +  G KSR+K+V I+   TPE +L  L +
Sbjct: 112 ILDKGGKSREKVVKILASTTPEEILEKLKR 141


>gi|126306449|ref|XP_001373757.1| PREDICTED: UPF0235 protein C15orf40 homolog [Monodelphis domestica]
          Length = 146

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 43/90 (47%), Positives = 58/90 (64%), Gaps = 1/90 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P  S++I IHAKPGSK  +ITDV+ E V V I AP  +GEAN  L  Y+S VL +++  V
Sbjct: 52  PSGSITIAIHAKPGSKQNAITDVTTENVSVAIAAPPSEGEANTELCRYLSKVLELRKSDV 111

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            +  G KSR+K+V I+   TPE +L  L +
Sbjct: 112 ILDKGGKSREKVVKILASTTPEEILEKLKR 141


>gi|291413562|ref|XP_002723040.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 154

 Score = 81.6 bits (200), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 49/118 (41%), Positives = 73/118 (61%), Gaps = 5/118 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A S G +QSK   ++  LP    + + P   V+I IHAKPG+K  ++TD++ EAV V
Sbjct: 32  KAGATSKGRSQSK--DHEAPLPPSGPVAVDPKGCVTIAIHAKPGAKQNAVTDLTAEAVSV 89

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
            I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TP+ +L  L
Sbjct: 90  AIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPDEILGKL 147


>gi|195428527|ref|XP_002062324.1| GK17477 [Drosophila willistoni]
 gi|194158409|gb|EDW73310.1| GK17477 [Drosophila willistoni]
          Length = 127

 Score = 81.6 bits (200), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 42/85 (49%), Positives = 59/85 (69%)

Query: 26  ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
           E  P  I L    +++I I AKPG+K   ITD+  E VGVQI AP  +GEANA L++Y+S
Sbjct: 23  ETPPLPINLDKSGNIAIKILAKPGAKQNGITDIGLEGVGVQIAAPPSEGEANAELVKYLS 82

Query: 86  SVLGVKRRQVSIGSGSKSRDKIVIV 110
            VLG+++  VS+  GS+SR+KI++V
Sbjct: 83  KVLGLRKSDVSLDKGSRSRNKIILV 107


>gi|156373856|ref|XP_001629526.1| predicted protein [Nematostella vectensis]
 gi|156216528|gb|EDO37463.1| predicted protein [Nematostella vectensis]
          Length = 133

 Score = 81.6 bits (200), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 56/81 (69%)

Query: 30  SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLG 89
           S   ++   S+ + IHAKPG+K   IT++S + VGVQI A  K+GEAN  L+ YMSSV G
Sbjct: 32  SPFEIIGDGSILVKIHAKPGAKQNRITELSPDFVGVQIAAQPKEGEANDELVRYMSSVFG 91

Query: 90  VKRRQVSIGSGSKSRDKIVIV 110
           VK+  V++  G+KSRDKI+ +
Sbjct: 92  VKKSSVTLDKGAKSRDKIIRI 112


>gi|157123809|ref|XP_001653923.1| hypothetical protein AaeL_AAEL009675 [Aedes aegypti]
 gi|108874207|gb|EAT38432.1| AAEL009675-PA [Aedes aegypti]
          Length = 149

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/92 (45%), Positives = 62/92 (67%), Gaps = 2/92 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V I I AKPG+K+  ITD+ +E VGVQI AP  DGEAN  L++Y+S +L +++  VS+ 
Sbjct: 57  NVLIKIQAKPGAKTNGITDIGEEGVGVQIAAPPVDGEANTELVKYLSKLLELRKSDVSLD 116

Query: 99  SGSKSRDKIVIVEE--ITPENVLNSLGKASSS 128
            GSKSR K +++E+    PE VL+   K + +
Sbjct: 117 RGSKSRQKTIVLEKGCRMPEQVLDVFRKEAQN 148


>gi|149456878|ref|XP_001519576.1| PREDICTED: UPF0235 protein C15orf40 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 119

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/91 (43%), Positives = 62/91 (68%), Gaps = 1/91 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+I +HAKPG+K  ++TDVS EAVGV I AP  +GEANA L  Y++ +L +++  V + 
Sbjct: 28  AVTIAVHAKPGAKQNAVTDVSVEAVGVAIAAPPSEGEANAELCRYLAKILELRKSDVVLD 87

Query: 99  SGSKSRDKIV-IVEEITPENVLNSLGKASSS 128
            G KSR+K++ I+   TP+ +L  L K + +
Sbjct: 88  RGGKSREKVIKILSSTTPDEILAKLKKQTET 118


>gi|410960502|ref|XP_003986828.1| PREDICTED: UPF0235 protein C15orf40 homolog [Felis catus]
          Length = 126

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/88 (46%), Positives = 58/88 (65%), Gaps = 1/88 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V++ IHAKPGSK  ++TD++ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 32  PKGCVTVAIHAKPGSKQNAVTDLTPEAVSVAIAAPPSEGEANAELCRYLSKVLKLRKSDV 91

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSL 122
            +  G KSR+K+V ++   TPE +L  L
Sbjct: 92  VLEKGGKSREKVVKLLASTTPEEILEKL 119


>gi|260800225|ref|XP_002595035.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae]
 gi|229280275|gb|EEN51046.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae]
          Length = 98

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 60/84 (71%), Gaps = 2/84 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S+ + IHAKPG+K+ +ITDV+ E VGVQI AP  +GEANA L  Y++ VL VK+  VS+ 
Sbjct: 9   SILVAIHAKPGAKANAITDVTTETVGVQITAPPMEGEANAELCRYLAGVLEVKKSAVSLE 68

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            G+KSR+K V V+  TP   ++++
Sbjct: 69  RGAKSREKTVRVD--TPGTTIDAV 90


>gi|47213227|emb|CAF89748.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 120

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 61/89 (68%), Gaps = 1/89 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+IT+HAKPGSK   +T VS EAV V I AP  DGEAN  L+ +++ VL +K+  + + 
Sbjct: 31  AVTITVHAKPGSKHSRVTAVSTEAVEVAIAAPPVDGEANVELVRFLAEVLELKKGHLHLD 90

Query: 99  SGSKSRDKIVIVEE-ITPENVLNSLGKAS 126
            GS+SRDK V V+  ++PE VL  L +A+
Sbjct: 91  KGSRSRDKQVRVDSPLSPEEVLRRLRQAA 119


>gi|301789543|ref|XP_002930186.1| PREDICTED: UPF0235 protein C15orf40 homolog [Ailuropoda
           melanoleuca]
          Length = 214

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 72/122 (59%), Gaps = 6/122 (4%)

Query: 7   GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
           G    K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TDV+ E
Sbjct: 86  GAEMPKKAGATNKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDVTAE 145

Query: 62  AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLN 120
           AV V I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   T E +L 
Sbjct: 146 AVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTTEEILE 205

Query: 121 SL 122
            L
Sbjct: 206 KL 207


>gi|355735946|gb|AES11838.1| hypothetical protein [Mustela putorius furo]
          Length = 145

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/117 (42%), Positives = 71/117 (60%), Gaps = 6/117 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TDV+ EAV V 
Sbjct: 24  KKAGATNKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDVTAEAVSVA 83

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   T E +L  L
Sbjct: 84  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTAEEILEKL 140


>gi|196001449|ref|XP_002110592.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens]
 gi|190586543|gb|EDV26596.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens]
          Length = 132

 Score = 80.1 bits (196), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/86 (46%), Positives = 61/86 (70%), Gaps = 3/86 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V ITI AKPGSK  ++TD+S + +G+QI APA++GEAN+ L++++SS+L VK+  + +  
Sbjct: 41  VLITIKAKPGSKENAVTDISSDGIGIQIAAPAREGEANSELIKFLSSILKVKKSSILLDK 100

Query: 100 GSKSRDKIVIVE---EITPENVLNSL 122
           GSKSR K + V    ++T + VL  L
Sbjct: 101 GSKSRHKTICVNKNADLTEKQVLELL 126


>gi|397488591|ref|XP_003815342.1| PREDICTED: UPF0235 protein C15orf40 homolog [Pan paniscus]
 gi|410049554|ref|XP_001148978.2| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Pan
           troglodytes]
          Length = 147

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/113 (42%), Positives = 70/113 (61%), Gaps = 6/113 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V 
Sbjct: 24  KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 83

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +
Sbjct: 84  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEI 136


>gi|296204187|ref|XP_002749223.1| PREDICTED: UPF0235 protein C15orf40 [Callithrix jacchus]
          Length = 154

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 71/113 (62%), Gaps = 6/113 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  + +LP    + + P   V+I IHAKPGSK  ++TD++ EA+ V 
Sbjct: 31  KKAGATTKGKSQSKEPERSLPPSGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAINVA 90

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +
Sbjct: 91  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGCKSREKVVKLLASTTPEEI 143


>gi|237858662|ref|NP_653198.2| UPF0235 protein C15orf40 isoform a [Homo sapiens]
 gi|119582835|gb|EAW62431.1| chromosome 15 open reading frame 40, isoform CRA_a [Homo sapiens]
          Length = 153

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/113 (42%), Positives = 70/113 (61%), Gaps = 6/113 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V 
Sbjct: 30  KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 89

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +
Sbjct: 90  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEI 142


>gi|195127445|ref|XP_002008179.1| GI11963 [Drosophila mojavensis]
 gi|193919788|gb|EDW18655.1| GI11963 [Drosophila mojavensis]
          Length = 125

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 69/105 (65%), Gaps = 6/105 (5%)

Query: 6   KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           KGK+ +K+  +T   +     N P  + L    +++I I AKPG+K   ITD+  E VGV
Sbjct: 7   KGKATSKAGEATTPPV----SNTP--VTLDKSGNIAIKILAKPGAKQNGITDIGLEGVGV 60

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           QI AP  +GEANA L++++S VLG+++  VS+  GS+SR+KI++V
Sbjct: 61  QIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDKGSRSRNKIILV 105


>gi|29839587|sp|Q8WUR7.1|CO040_HUMAN RecName: Full=UPF0235 protein C15orf40
 gi|18043732|gb|AAH19820.1| Chromosome 15 open reading frame 40 [Homo sapiens]
 gi|189053288|dbj|BAG35094.1| unnamed protein product [Homo sapiens]
          Length = 126

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 48/113 (42%), Positives = 70/113 (61%), Gaps = 6/113 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V 
Sbjct: 3   KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 62

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +
Sbjct: 63  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEI 115


>gi|194866006|ref|XP_001971712.1| GG14280 [Drosophila erecta]
 gi|190653495|gb|EDV50738.1| GG14280 [Drosophila erecta]
          Length = 127

 Score = 79.3 bits (194), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 69/105 (65%), Gaps = 1/105 (0%)

Query: 6   KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           KGKSKA +  S ++  K+     PS I +    ++ I I AKPG+K   IT +  E VGV
Sbjct: 5   KGKSKA-AVESAKNVAKSTPAKEPSPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           QI AP  +GEANA L++++S VLG+++  VS+  GS+SR+KI+++
Sbjct: 64  QIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDKGSRSRNKIIMI 108


>gi|345485956|ref|XP_001604898.2| PREDICTED: UPF0235 protein C15orf40 homolog [Nasonia vitripennis]
          Length = 147

 Score = 79.3 bits (194), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 59/84 (70%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+I + AKPG+K  +ITD S+E VG+ I AP ++GEANA L++Y++S+L V++  V++ 
Sbjct: 57  NVTIKVQAKPGAKQNNITDFSEETVGIAISAPPQEGEANAELVKYLASILNVRKSDVTLD 116

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            GS+SR K VIV   + E V   L
Sbjct: 117 RGSRSRQKKVIVTGSSVEKVTEKL 140


>gi|118353243|ref|XP_001009893.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila]
 gi|89291659|gb|EAR89647.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila
           SB210]
          Length = 127

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 56/89 (62%), Gaps = 6/89 (6%)

Query: 26  ENLPSCIRLVPPS------SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAA 79
           E  P  +  VP S         I+IHAKP SK   I+ +SDE V + I AP KDGEANA 
Sbjct: 17  EEKPKAVSNVPASIYEKGGKFFISIHAKPNSKISQISGISDEGVDINIAAPPKDGEANAE 76

Query: 80  LLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           L++Y+S VLGVK+  +S+  G KSR+K++
Sbjct: 77  LIDYISQVLGVKKSSLSLDKGGKSRNKLM 105


>gi|426380120|ref|XP_004056728.1| PREDICTED: UPF0235 protein C15orf40 homolog [Gorilla gorilla
           gorilla]
          Length = 153

 Score = 79.0 bits (193), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 70/113 (61%), Gaps = 6/113 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           + AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V 
Sbjct: 30  RKAGATTKDKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 89

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +
Sbjct: 90  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEI 142


>gi|443689915|gb|ELT92199.1| hypothetical protein CAPTEDRAFT_102723 [Capitella teleta]
          Length = 129

 Score = 79.0 bits (193), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 40/71 (56%), Positives = 54/71 (76%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I AKPG+K  +ITDV+ + +GVQI AP  DGEANA L++YMS VLGVK+ +VS+  
Sbjct: 38  MHLQICAKPGAKFNNITDVNADGIGVQIAAPPVDGEANAELVKYMSKVLGVKKSEVSLEK 97

Query: 100 GSKSRDKIVIV 110
           GSKSR+K + V
Sbjct: 98  GSKSRNKCIAV 108


>gi|73951609|ref|XP_545872.2| PREDICTED: UPF0235 protein C15orf40 homolog [Canis lupus
           familiaris]
          Length = 152

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 50/120 (41%), Positives = 67/120 (55%), Gaps = 4/120 (3%)

Query: 7   GKSKAKSAGSTQSKIKTNDENLPSCIRLV---PPSSVSITIHAKPGSKSCSITDVSDEAV 63
           G    K AG+T        E  P  +  V   P  SV+I I AKPGSK  ++TDV+ EAV
Sbjct: 26  GAEMPKKAGATNKGKSQEPERPPPSLGPVTVDPRGSVTIAIRAKPGSKQNAVTDVTAEAV 85

Query: 64  GVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
            V I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   T E +L  L
Sbjct: 86  SVAIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTAEEILEKL 145


>gi|403284558|ref|XP_003933632.1| PREDICTED: UPF0235 protein C15orf40 homolog [Saimiri boliviensis
           boliviensis]
          Length = 141

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 40/87 (45%), Positives = 57/87 (65%), Gaps = 1/87 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++S TIHAKPGSK  ++TD++ EAV V I AP  +GEANA L  Y+S VL +++  V + 
Sbjct: 23  NLSYTIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDVVLD 82

Query: 99  SGSKSRDKIV-IVEEITPENVLNSLGK 124
              KSR+K+V ++   TPE +L    K
Sbjct: 83  KDGKSREKVVKLLASTTPEEILEKFKK 109


>gi|242021057|ref|XP_002430963.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212516183|gb|EEB18225.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 117

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 35/86 (40%), Positives = 63/86 (73%), Gaps = 2/86 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + I AKPG+K+ +IT + +E +GVQI+A   DGEAN+ L+ YMS +LG+++ ++S+ 
Sbjct: 27  NIMLQIFAKPGAKTNAITGIDEEGIGVQINARPVDGEANSELVNYMSCLLGLRKTEISLE 86

Query: 99  SGSKSRDKIVIV--EEITPENVLNSL 122
            GSKSR KI+++  ++++ E ++  L
Sbjct: 87  KGSKSRQKILLISKKDLSTEEIIEKL 112


>gi|170029973|ref|XP_001842865.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167865325|gb|EDS28708.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 157

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/90 (48%), Positives = 59/90 (65%), Gaps = 4/90 (4%)

Query: 34  LVPPSS--VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVK 91
            V P S  V I I AKPGSK   IT + DE VGVQI AP  DGEAN  L++Y++ +L ++
Sbjct: 57  FVEPKSGNVLIKILAKPGSKFNGITGIEDEGVGVQIAAPPIDGEANTELVKYLAKLLDLR 116

Query: 92  RRQVSIGSGSKSRDKIVIVEE--ITPENVL 119
           +  VS+  GSKSR K +++E+   TP+ VL
Sbjct: 117 KSDVSLDRGSKSRQKTIVLEKGCRTPDQVL 146


>gi|194748771|ref|XP_001956818.1| GF24383 [Drosophila ananassae]
 gi|190624100|gb|EDV39624.1| GF24383 [Drosophila ananassae]
          Length = 126

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/72 (50%), Positives = 54/72 (75%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ I I AKPG+K   IT +S E VGVQI AP  +GEANA L++++S VLG+++  VS+ 
Sbjct: 36  NICIQILAKPGAKQNGITGISTEGVGVQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLD 95

Query: 99  SGSKSRDKIVIV 110
            GS+SR+KI+++
Sbjct: 96  KGSRSRNKIILI 107


>gi|332238617|ref|XP_003268500.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Nomascus
           leucogenys]
          Length = 154

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/113 (41%), Positives = 69/113 (61%), Gaps = 6/113 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           K AG+T   +S+ K  +  LP    + + P   V+I IHAKPG K  ++TD++ EAV V 
Sbjct: 31  KKAGATTKGKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGCKQNAVTDLTAEAVNVA 90

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           I AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TPE +
Sbjct: 91  IAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPEEI 143


>gi|241835850|ref|XP_002415074.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215509286|gb|EEC18739.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 104

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 34/72 (47%), Positives = 53/72 (73%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+I +HAKPG+    ITD+  + VGVQI AP  DGEANA L+ +++ VL +++  VS+ 
Sbjct: 14  TVAIRVHAKPGASESRITDIGTDGVGVQIAAPPMDGEANAELVRFLAKVLNLRKSDVSLE 73

Query: 99  SGSKSRDKIVIV 110
            GS+S+DK+V++
Sbjct: 74  KGSRSKDKVVMI 85


>gi|351704948|gb|EHB07867.1| hypothetical protein GW7_13591 [Heterocephalus glaber]
          Length = 164

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/80 (48%), Positives = 55/80 (68%), Gaps = 1/80 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I IHAKPGSK  ++TD++ EAV V I AP  +GEANA L  Y+S VL +++  V +  
Sbjct: 74  VTIAIHAKPGSKQNAVTDLTTEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDVVLDK 133

Query: 100 GSKSRDKIV-IVEEITPENV 118
           G KSR+K+V ++   TPE +
Sbjct: 134 GGKSREKVVKLLASTTPEEI 153


>gi|346466449|gb|AEO33069.1| hypothetical protein [Amblyomma maculatum]
          Length = 137

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 64/88 (72%), Gaps = 1/88 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I +HAKPG+    ITD+S+++VG+QI AP  DGEAN  L+ ++S +L +++  VS+  
Sbjct: 20  VAIRVHAKPGASVSRITDISNDSVGIQIAAPPVDGEANTELVRFLSKLLNLRKTDVSLEK 79

Query: 100 GSKSRDKIVIV-EEITPENVLNSLGKAS 126
           G++S++K+V+V  ++   +VL+ L KA+
Sbjct: 80  GARSKEKVVVVTSQVGASDVLHVLKKAN 107


>gi|195491333|ref|XP_002093518.1| GE20708 [Drosophila yakuba]
 gi|194179619|gb|EDW93230.1| GE20708 [Drosophila yakuba]
          Length = 128

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/105 (43%), Positives = 68/105 (64%), Gaps = 1/105 (0%)

Query: 6   KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           KGKSKA +  S +S  K+      S I +    ++ I I AKPG+K   IT +  E VGV
Sbjct: 5   KGKSKA-AVESAKSDAKSTPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           QI AP  +GEANA L++++S VLG+++  VS+  GS+SR+KI+++
Sbjct: 64  QIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDKGSRSRNKIIMI 108


>gi|294916777|ref|XP_002778391.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886749|gb|EER10186.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 132

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/96 (47%), Positives = 67/96 (69%), Gaps = 4/96 (4%)

Query: 27  NLPSCIRLVPPSS-VSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYM 84
           +L SC+   P +S   I I AKPG+K   +T +  E A+GVQ++APA+DGEAN  LL ++
Sbjct: 27  DLLSCVTKCPRTSQARIAIRAKPGAKVSCLTGIDAEGALGVQLNAPARDGEANEELLSFL 86

Query: 85  S-SVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           S  VLGVK++ V++  GSKSR+K+V I + +T E+V
Sbjct: 87  SKEVLGVKKKDVALVQGSKSREKVVEIADVLTVEDV 122


>gi|338717345|ref|XP_001498126.2| PREDICTED: UPF0235 protein C15orf40 homolog [Equus caballus]
          Length = 126

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGSK  ++TDV+ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 32  PKGCVTIAIHAKPGSKQNAVTDVTAEAVSVAIAAPPSEGEANAELCRYLSKVLDLRKSDV 91

Query: 96  SIGSGSKSRDKIV-IVEEITPENV 118
            +  G KSR+K+V ++   TP+ +
Sbjct: 92  VLDKGGKSREKVVKLLASTTPDEI 115


>gi|349602907|gb|AEP98900.1| UPF0235 protein C15orf40-like protein, partial [Equus caballus]
          Length = 116

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/84 (47%), Positives = 56/84 (66%), Gaps = 1/84 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGSK  ++TDV+ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 22  PKGCVTIAIHAKPGSKQNAVTDVTAEAVSVAIAAPPSEGEANAELCRYLSKVLDLRKSDV 81

Query: 96  SIGSGSKSRDKIV-IVEEITPENV 118
            +  G KSR+K+V ++   TP+ +
Sbjct: 82  VLDKGGKSREKVVKLLASTTPDEI 105


>gi|442751525|gb|JAA67922.1| Hypothetical protein [Ixodes ricinus]
          Length = 145

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 53/72 (73%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+I +HAKPG+    ITD+  + VG+QI AP  DGEANA L+ +++ VL +++  VS+ 
Sbjct: 55  TVAIRVHAKPGASESRITDIGTDGVGIQIAAPPMDGEANAELVRFLAKVLNLRKSDVSLE 114

Query: 99  SGSKSRDKIVIV 110
            GS+S+DK+V++
Sbjct: 115 KGSRSKDKVVMI 126


>gi|195376389|ref|XP_002046979.1| GJ12187 [Drosophila virilis]
 gi|194154137|gb|EDW69321.1| GJ12187 [Drosophila virilis]
          Length = 124

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/72 (51%), Positives = 54/72 (75%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ I I AKPG+K   ITD+  + VGVQI AP  +GEANA L++++S VLG+++  VS+ 
Sbjct: 33  NIVIKILAKPGAKQNGITDIGLDGVGVQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLD 92

Query: 99  SGSKSRDKIVIV 110
            GS+SR+KIV+V
Sbjct: 93  KGSRSRNKIVLV 104


>gi|195587403|ref|XP_002083454.1| GD13347 [Drosophila simulans]
 gi|194195463|gb|EDX09039.1| GD13347 [Drosophila simulans]
          Length = 128

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/105 (42%), Positives = 68/105 (64%), Gaps = 1/105 (0%)

Query: 6   KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           KGKSKA +  S ++  K+      S I +    ++ I I AKPG+K   IT +  E VGV
Sbjct: 5   KGKSKA-AVESAKNDAKSTPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           QI AP  +GEANA L++++S VLG+++  VS+  GS+SR+KI+++
Sbjct: 64  QIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDKGSRSRNKIIMI 108


>gi|410907834|ref|XP_003967396.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40 homolog
           [Takifugu rubripes]
          Length = 115

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 58/85 (68%), Gaps = 1/85 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V+IT+HAKPGSK  SIT++      +++ AP  DGEAN  L+ +++ VL +K+  +S+ 
Sbjct: 31  AVTITVHAKPGSKHSSITEIXWPFCLLELSAPPVDGEANVELIRFLAEVLELKKSHISLD 90

Query: 99  SGSKSRDKIVIVE-EITPENVLNSL 122
            GS+SRDK V V+  ++PE VL  L
Sbjct: 91  KGSRSRDKQVRVDSSLSPEEVLRRL 115


>gi|195160833|ref|XP_002021278.1| GL25246 [Drosophila persimilis]
 gi|198465041|ref|XP_001353470.2| GA13391 [Drosophila pseudoobscura pseudoobscura]
 gi|194118391|gb|EDW40434.1| GL25246 [Drosophila persimilis]
 gi|198149990|gb|EAL30981.2| GA13391 [Drosophila pseudoobscura pseudoobscura]
          Length = 125

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 35/72 (48%), Positives = 54/72 (75%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ I I AKPG+K   IT++  E VGVQI AP  +GEANA L++++S VLG+++  VS+ 
Sbjct: 34  NICIKILAKPGAKHNGITNIDLEGVGVQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLD 93

Query: 99  SGSKSRDKIVIV 110
            GS+SR+KI+++
Sbjct: 94  KGSRSRNKIILI 105


>gi|403343269|gb|EJY70959.1| UPF0235 protein C15orf40-like protein [Oxytricha trifallax]
          Length = 148

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 59/91 (64%), Gaps = 3/91 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I AKPGSKS  I  V D+ +GV + AP  DG AN  +LE+++SVLG+K+R +++  
Sbjct: 58  IFMVIRAKPGSKSDEIFAVEDDYIGVAVQAPPLDGAANEGILEFLASVLGLKKRDLTLVK 117

Query: 100 GSKSRDKIVIVEE---ITPENVLNSLGKASS 127
           GSK  DK++ ++E   +  +NVL  L  A S
Sbjct: 118 GSKGHDKLIQIDEPGSLDVDNVLMKLMAAKS 148


>gi|24656569|ref|NP_647784.1| CG14966 [Drosophila melanogaster]
 gi|7292328|gb|AAF47735.1| CG14966 [Drosophila melanogaster]
 gi|289526411|gb|ADD01328.1| RE68649p [Drosophila melanogaster]
          Length = 140

 Score = 76.3 bits (186), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 45/107 (42%), Positives = 68/107 (63%), Gaps = 5/107 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLP----SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAV 63
           KSK KS    +S  K + + +P    S I +    ++ I I AKPG+K   IT +  E V
Sbjct: 15  KSKGKSKAGVESA-KNDAKAMPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGFEGV 73

Query: 64  GVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           GVQI AP  +GEANA L++++S VLG+++  VS+  GS+SR+KI+++
Sbjct: 74  GVQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDKGSRSRNKIIMI 120


>gi|348580051|ref|XP_003475792.1| PREDICTED: UPF0235 protein C15orf40 homolog [Cavia porcellus]
          Length = 154

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 37/80 (46%), Positives = 54/80 (67%), Gaps = 1/80 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I IHAKPGSK  ++TD++ EAV + I AP  +GEAN  L  Y+S VL +++  V +  
Sbjct: 64  VTIAIHAKPGSKQSAVTDLTSEAVNIAIAAPPTEGEANTELCRYLSKVLELRKSDVVLDK 123

Query: 100 GSKSRDKIV-IVEEITPENV 118
           G KSR+K+V ++   TPE +
Sbjct: 124 GGKSREKVVKLLASTTPEEI 143


>gi|195014316|ref|XP_001984001.1| GH15254 [Drosophila grimshawi]
 gi|193897483|gb|EDV96349.1| GH15254 [Drosophila grimshawi]
          Length = 126

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 34/72 (47%), Positives = 54/72 (75%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ I I AKPG+K   ITD+  E VGVQI AP  +GEANA L++++S VLG+++  VS+ 
Sbjct: 35  NIVIKILAKPGAKQNGITDIGLEGVGVQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLD 94

Query: 99  SGSKSRDKIVIV 110
            GS+S++K++++
Sbjct: 95  KGSRSKNKLILI 106


>gi|349805669|gb|AEQ18307.1| hypothetical protein [Hymenochirus curtipes]
          Length = 84

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 37/70 (52%), Positives = 50/70 (71%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V + IHAKPGSK  +ITDV+ EAV V I AP  +GEAN  L  Y++ VL VK+ +V++ 
Sbjct: 11  TVCVNIHAKPGSKQNAITDVTTEAVVVAIAAPPMEGEANLELCRYLAQVLEVKKSEVTLD 70

Query: 99  SGSKSRDKIV 108
            G KSR+K+V
Sbjct: 71  KGGKSREKVV 80


>gi|290996548|ref|XP_002680844.1| predicted protein [Naegleria gruberi]
 gi|284094466|gb|EFC48100.1| predicted protein [Naegleria gruberi]
          Length = 73

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 36/73 (49%), Positives = 52/73 (71%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S+ +TI AKP S S  I +++DE +GV I AP K+GEAN  L +Y+S VLGV + +V++ 
Sbjct: 1   SIRLTILAKPNSSSSQIANINDEEIGVHIAAPPKEGEANKELCDYVSGVLGVSKSRVTLD 60

Query: 99  SGSKSRDKIVIVE 111
            G KSR K+++VE
Sbjct: 61  RGGKSRHKLLLVE 73


>gi|347966092|ref|XP_321597.4| AGAP001528-PA [Anopheles gambiae str. PEST]
 gi|333470215|gb|EAA01322.4| AGAP001528-PA [Anopheles gambiae str. PEST]
          Length = 164

 Score = 75.1 bits (183), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 4/82 (4%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I AKPG+K+  ITDVS+E +G QI AP  DGEAN  L++Y+S +L +++  +S+  GS
Sbjct: 72  VKILAKPGAKTSGITDVSEEGIGCQIAAPPIDGEANTELIKYLSKLLDLRKSDISLDRGS 131

Query: 102 KSRDKIVIVEEI----TPENVL 119
           KSR K +++++     +PE +L
Sbjct: 132 KSRQKTIVLDKAGCRHSPEQLL 153


>gi|390355950|ref|XP_003728665.1| PREDICTED: UPF0235 protein C15orf40 homolog [Strongylocentrotus
           purpuratus]
          Length = 130

 Score = 74.3 bits (181), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/72 (51%), Positives = 52/72 (72%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S+SI+I AKPG+K   IT + +E VGVQI AP  +G AN  L++Y++SVLG+++  VS+ 
Sbjct: 38  SISISIQAKPGAKHNGITGIEEEGVGVQISAPPVEGAANTELVKYLASVLGLRKSDVSLE 97

Query: 99  SGSKSRDKIVIV 110
            GSKSR K + V
Sbjct: 98  RGSKSRAKTIGV 109


>gi|390355931|ref|XP_003728660.1| PREDICTED: UPF0235 protein C15orf40 homolog [Strongylocentrotus
           purpuratus]
          Length = 147

 Score = 74.3 bits (181), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 36/70 (51%), Positives = 51/70 (72%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S+SI+I AKPG+K   IT + +E VGVQI AP  +G AN  L++Y++SVLG+++  VS+ 
Sbjct: 55  SISISIQAKPGAKHNGITGIEEEGVGVQISAPPVEGAANTELVKYLASVLGLRKSDVSLE 114

Query: 99  SGSKSRDKIV 108
            GSKSR K +
Sbjct: 115 RGSKSRAKTI 124


>gi|391330297|ref|XP_003739600.1| PREDICTED: UPF0235 protein C15orf40 homolog [Metaseiulus
           occidentalis]
          Length = 160

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 52/72 (72%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + IHAKPG+K  ++T +  EAV VQI AP  DGEAN  L++Y++  L +++  VS+  
Sbjct: 71  VGVRIHAKPGAKLSAVTGIGAEAVEVQIGAPPVDGEANTELVKYLAKALDLRKSDVSLDR 130

Query: 100 GSKSRDKIVIVE 111
           GS+SR+K++++E
Sbjct: 131 GSRSREKVILIE 142


>gi|403346375|gb|EJY72583.1| UPF0235 protein C15orf40-like protein [Oxytricha trifallax]
          Length = 148

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 58/91 (63%), Gaps = 3/91 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I AKPGSKS  I  V D+  GV + AP  DG AN  +LE+++SVLG+K+R +++  
Sbjct: 58  IFMVIRAKPGSKSDEIFAVEDDYKGVAVQAPPLDGAANEGILEFLASVLGLKKRDLTLVK 117

Query: 100 GSKSRDKIVIVEE---ITPENVLNSLGKASS 127
           GSK  DK++ ++E   +  +NVL  L  A S
Sbjct: 118 GSKGHDKLIQIDEPGNLDVDNVLMKLMAAKS 148


>gi|195337079|ref|XP_002035160.1| GM14073 [Drosophila sechellia]
 gi|194128253|gb|EDW50296.1| GM14073 [Drosophila sechellia]
          Length = 128

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 44/105 (41%), Positives = 67/105 (63%), Gaps = 1/105 (0%)

Query: 6   KGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           KGKSKA +  S ++  K+      S I +    ++ I I AKPG+K   IT +  E VGV
Sbjct: 5   KGKSKA-AVESAKNDAKSTPAKEASPISVDKSGNICIQILAKPGAKQNGITGIGLEGVGV 63

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           QI AP  +GEANA L++++S VLG+++  VS+  GS+SR+K +++
Sbjct: 64  QIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDKGSRSRNKKIMI 108


>gi|145538297|ref|XP_001454854.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124422631|emb|CAK87457.1| unnamed protein product [Paramecium tetraurelia]
          Length = 106

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/73 (47%), Positives = 49/73 (67%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S  + I+AKP SK   IT +SDEAV + I AP KDGEANA L ++++  LGVK+  + + 
Sbjct: 17  SYYLVINAKPNSKVSQITGISDEAVDINIAAPPKDGEANAELCDFVAQTLGVKKTAIQVN 76

Query: 99  SGSKSRDKIVIVE 111
            G K R+K+V +E
Sbjct: 77  KGGKGRNKLVSIE 89


>gi|294949448|ref|XP_002786202.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239900359|gb|EER17998.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 132

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 44/96 (45%), Positives = 66/96 (68%), Gaps = 4/96 (4%)

Query: 27  NLPSCIRLVPPSS-VSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYM 84
           +L SC+   P +S   I I AKPG+K   +T +  E A+GVQ++A A+DGEAN  LL ++
Sbjct: 27  DLLSCVTKCPRTSQARIAIRAKPGAKVSCLTGIDAEGALGVQLNASARDGEANEELLSFL 86

Query: 85  S-SVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           S  VLGVK++ V++  GSKSR+K+V I + +T ++V
Sbjct: 87  SKEVLGVKKKDVALVQGSKSREKVVEIADVLTVDDV 122


>gi|384253020|gb|EIE26495.1| DUF167-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 120

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 43/100 (43%), Positives = 68/100 (68%), Gaps = 1/100 (1%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
           NLP+ ++ +   SVSI++H KPGSK  S + +++ +V + I APA++GEANAA   +++ 
Sbjct: 19  NLPTYVKQLSEDSVSISVHCKPGSKVNSFS-ITEGSVSLAISAPAREGEANAAAALFLAE 77

Query: 87  VLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKAS 126
           VLGVK+RQV +  GS+SR+K+  V  +  +  L  L +AS
Sbjct: 78  VLGVKKRQVELAIGSRSREKVFKVHGLDSDTALTRLMQAS 117


>gi|145543083|ref|XP_001457228.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425043|emb|CAK89831.1| unnamed protein product [Paramecium tetraurelia]
          Length = 106

 Score = 72.4 bits (176), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 35/73 (47%), Positives = 49/73 (67%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S  + I+AKP SK   IT +SDEAV V I AP KDGEANA L ++++  LGVK+  + + 
Sbjct: 17  SYFLVINAKPNSKVSQITGISDEAVDVNIAAPPKDGEANAELCDFVAQTLGVKKTAIQVQ 76

Query: 99  SGSKSRDKIVIVE 111
            G K R+K++ +E
Sbjct: 77  KGGKGRNKLIKIE 89


>gi|449268046|gb|EMC78919.1| UPF0235 protein C15orf40 like protein, partial [Columba livia]
          Length = 75

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/74 (51%), Positives = 52/74 (70%), Gaps = 1/74 (1%)

Query: 55  ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEI 113
           +TDV+ EAVGV I AP  +GEANA L  Y+S VL VK+ +V++  G KSRDK+V I+  +
Sbjct: 1   VTDVTAEAVGVAIAAPPSEGEANAELCRYLSKVLEVKKSEVTLDKGGKSRDKVVKILVSM 60

Query: 114 TPENVLNSLGKASS 127
           TP+ +L  L K +S
Sbjct: 61  TPDEILEKLKKEAS 74


>gi|327289071|ref|XP_003229248.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Anolis
           carolinensis]
          Length = 104

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/54 (62%), Positives = 41/54 (75%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKR 92
           SV+I +HAKPGSK  ++TD+S EAVG+ I AP  DGEANA L  Y+S VL VKR
Sbjct: 47  SVTIAVHAKPGSKQNAVTDLSAEAVGIAIAAPPSDGEANAELCRYLSKVLEVKR 100


>gi|291412810|ref|XP_002722671.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 154

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 47/116 (40%), Positives = 70/116 (60%), Gaps = 1/116 (0%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQI 67
           K+ A S G +QSK +         + + P   V+I IHAKPG+K  ++TD++ EAV V I
Sbjct: 32  KAGATSKGRSQSKDREAPLPPSGSVAVDPKGCVTIAIHAKPGAKQNAVTDLTAEAVIVAI 91

Query: 68  DAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
            AP  +GEANA L  Y+S VL +++  V +  G KSR+K+V ++   TP+ +L  L
Sbjct: 92  AAPPSEGEANAELCRYLSKVLELRKSDVVLDKGGKSREKVVKLLASTTPDEILGKL 147


>gi|399217812|emb|CCF74699.1| unnamed protein product [Babesia microti strain RI]
          Length = 97

 Score = 68.9 bits (167), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 30/67 (44%), Positives = 47/67 (70%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V +T+  KPG+KS  I ++ +  + +QI AP +DGE N AL++Y+S +LG+K+  +S+  
Sbjct: 10  VFLTVRVKPGAKSTKIVEIDENCLHLQIAAPPRDGECNEALIKYISKILGIKKTGISLVL 69

Query: 100 GSKSRDK 106
           G KSRDK
Sbjct: 70  GHKSRDK 76


>gi|307199144|gb|EFN79854.1| UPF0235 protein C15orf40-like protein [Harpegnathos saltator]
          Length = 100

 Score = 68.2 bits (165), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 46/59 (77%)

Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
          +V+I I AKPG+K  +ITD+++E VGV I AP  +GEANA L++Y++S+ G+++  VS+
Sbjct: 34 NVTIKIQAKPGAKCNNITDITNEGVGVAISAPPTEGEANAELVKYLASIFGLRKSHVSL 92


>gi|237858666|ref|NP_001153586.1| UPF0235 protein C15orf40 isoform c [Homo sapiens]
          Length = 153

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/123 (38%), Positives = 68/123 (55%), Gaps = 4/123 (3%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V
Sbjct: 31  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
            I AP  +GEANA L  Y+S VL +++  V +    +   KIV  +  T   +L  L + 
Sbjct: 89  AIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKKLRDLLKIVPCKLATRTQILGLLVEC 148

Query: 126 SSS 128
           S S
Sbjct: 149 SFS 151


>gi|242279866|ref|YP_002991995.1| hypothetical protein Desal_2400 [Desulfovibrio salexigens DSM 2638]
 gi|242122760|gb|ACS80456.1| protein of unknown function DUF167 [Desulfovibrio salexigens DSM
           2638]
          Length = 106

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 56/84 (66%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           LPS IR     S  +++  +PG+K+  IT    ++V V+I+APA D +AN AL  ++++ 
Sbjct: 8   LPSYIRPCGHGSWRVSVWVQPGAKNEGITGEYQDSVRVRINAPAVDNKANKALAAFVATR 67

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVE 111
           LG+K+R +SI SG  +R K+++VE
Sbjct: 68  LGLKKRNISIASGHSNRKKVLLVE 91


>gi|302879874|ref|YP_003848438.1| hypothetical protein Galf_2678 [Gallionella capsiferriformans ES-2]
 gi|302582663|gb|ADL56674.1| protein of unknown function DUF167 [Gallionella capsiferriformans
           ES-2]
          Length = 96

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 54/83 (65%), Gaps = 2/83 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++T+H +PG+K   I  +  EA+ +++ AP  DG AN ALL+Y++ +  V  RQV +  
Sbjct: 12  ITLTLHVQPGAKRSEICGLHGEALKLKLAAPPIDGRANEALLKYIAELFRVPVRQVELRQ 71

Query: 100 GSKSRDKIVIVEE--ITPENVLN 120
           G++SR K+V V +  I PE++L 
Sbjct: 72  GAQSRHKVVAVTDSAIQPESLLT 94


>gi|226942094|ref|YP_002797168.1| hypothetical protein LHK_03181 [Laribacter hongkongensis HLHK9]
 gi|254801627|sp|C1D6C4.1|Y3181_LARHH RecName: Full=UPF0235 protein LHK_03181
 gi|226717021|gb|ACO76159.1| Uncharacterized conserved protein [Laribacter hongkongensis HLHK9]
          Length = 97

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 50/77 (64%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V +T+H +PG++   +  +  +A+ +++ AP  DG+ANA LL +++  LGV R  V++ S
Sbjct: 11  VRLTLHVQPGARRTEVAGLHGDALKIRLAAPPVDGKANACLLAFLARGLGVSRSAVTLKS 70

Query: 100 GSKSRDKIVIVEEITPE 116
           G  SR K+V +  ITPE
Sbjct: 71  GDCSRHKVVDIRGITPE 87


>gi|426380122|ref|XP_004056729.1| PREDICTED: UPF0235 protein C15orf40 homolog [Gorilla gorilla
           gorilla]
          Length = 153

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 38/93 (40%), Positives = 53/93 (56%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGSK  ++TD++ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 59  PKGCVTIAIHAKPGSKQNAVTDLTAEAVNVAIAAPPSEGEANAELCRYLSKVLELRKSDV 118

Query: 96  SIGSGSKSRDKIVIVEEITPENVLNSLGKASSS 128
            +    +   KIV  +  T   +   L + S S
Sbjct: 119 VLDKKLRDLLKIVPCKLATRTQIFGLLVECSFS 151


>gi|326926706|ref|XP_003209539.1| PREDICTED: UPF0235 protein C15orf40 homolog [Meleagris gallopavo]
          Length = 85

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 37/73 (50%), Positives = 49/73 (67%), Gaps = 1/73 (1%)

Query: 57  DVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITP 115
           +V+ EAVGV I AP  +GEANA L  Y+S VL VK+  V +  G KSRDK+V I+  +TP
Sbjct: 13  NVTAEAVGVAIAAPPSEGEANAELCRYLSKVLQVKKSDVILEKGGKSRDKVVKILVSLTP 72

Query: 116 ENVLNSLGKASSS 128
           + VL  L K +S+
Sbjct: 73  DEVLEKLKKEAST 85


>gi|149057385|gb|EDM08708.1| similar to RIKEN cDNA 3110040N11, isoform CRA_e [Rattus norvegicus]
          Length = 130

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 50/78 (64%), Gaps = 8/78 (10%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I IHAKPGSK  ++TD++ EAVGV I AP  +GEANA L  Y+S VL +++  V +  
Sbjct: 36  VTIAIHAKPGSKQNAVTDLNTEAVGVAIAAPPSEGEANAELCRYLSKVLDLRKSDVVL-- 93

Query: 100 GSKSRDKIVIVEE-ITPE 116
                DK   V + I+PE
Sbjct: 94  -----DKFHWVSKGISPE 106


>gi|341898004|gb|EGT53939.1| hypothetical protein CAEBREN_29926 [Caenorhabditis brenneri]
          Length = 125

 Score = 65.9 bits (159), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 62/122 (50%), Gaps = 8/122 (6%)

Query: 3   PAKKGKSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEA 62
           P+KK K+   +A  T   +        S I       + + IHAKPG+K   +  + D  
Sbjct: 4   PSKKEKTVVPTAQETDKTVSD------SAIFAASEGRIGLRIHAKPGAKKSCVVAIGDSE 57

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE--EITPENVLN 120
           V V I AP ++G AN  L+ Y+   LG+++ ++    G+KSR K+V++E   +T E V  
Sbjct: 58  VDVAIGAPPREGAANEELISYLMQALGLRKNELQFDKGAKSRSKVVLIETKRLTMEEVRQ 117

Query: 121 SL 122
            L
Sbjct: 118 KL 119


>gi|428225074|ref|YP_007109171.1| hypothetical protein GEI7407_1628 [Geitlerinema sp. PCC 7407]
 gi|427984975|gb|AFY66119.1| protein of unknown function DUF167 [Geitlerinema sp. PCC 7407]
          Length = 75

 Score = 65.9 bits (159), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 46/68 (67%)

Query: 41  SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           +IT+  KP +K   IT  SD ++ +Q+ AP  DG+ANAAL+E ++   GV RRQ+SI SG
Sbjct: 3   TITVKVKPRAKVQKITTASDGSLIIQLSAPPVDGKANAALIEILADHFGVSRRQISIKSG 62

Query: 101 SKSRDKIV 108
             SR KIV
Sbjct: 63  QTSRAKIV 70


>gi|335292316|ref|XP_003356704.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Sus scrofa]
          Length = 150

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 39/96 (40%), Positives = 56/96 (58%), Gaps = 5/96 (5%)

Query: 7   GKSKAKSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDE 61
           G    K AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ E
Sbjct: 26  GAEMPKKAGATNKGKSQSKEQERPLPPLGPVTVDPKGCVTIAIHAKPGSKQNAVTDLTTE 85

Query: 62  AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           AV V I AP  +GEANA L  Y+S V  +++  V +
Sbjct: 86  AVSVAIAAPPSEGEANAELCRYLSKVFELRKSDVVL 121


>gi|403258550|ref|XP_003921821.1| PREDICTED: UPF0235 protein C15orf40 homolog [Saimiri boliviensis
           boliviensis]
          Length = 150

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 56/92 (60%), Gaps = 4/92 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V
Sbjct: 32  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 89

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 90  AIAAPPSEGEANAELCRYLSKVLELRKSDVVL 121


>gi|328772106|gb|EGF82145.1| hypothetical protein BATDEDRAFT_86894 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 128

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 60/99 (60%), Gaps = 5/99 (5%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P  I      S+ +    KPG+K   + D+  +AVG+QI A A++GEANA L++ ++ VL
Sbjct: 28  PLWISKFADGSIRLNTLVKPGTKVSQVIDIQGDAVGIQIAAVAREGEANAELIQTVADVL 87

Query: 89  GVKRRQVSIGSGSKSRDKIVIVE-----EITPENVLNSL 122
            +++ QV+I +G KSR K++ ++     E   E +L+S+
Sbjct: 88  KLRKYQVAIVAGHKSRTKVLKIDTLLSIEQIQEMILSSM 126


>gi|410049556|ref|XP_003952769.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Pan
           troglodytes]
          Length = 143

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 56/92 (60%), Gaps = 4/92 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V
Sbjct: 25  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 82

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 83  AIAAPPSEGEANAELCRYLSKVLELRKSDVVL 114


>gi|237858664|ref|NP_001153585.1| UPF0235 protein C15orf40 isoform b [Homo sapiens]
          Length = 167

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 56/92 (60%), Gaps = 4/92 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V
Sbjct: 31  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 89  AIAAPPSEGEANAELCRYLSKVLELRKSDVVL 120


>gi|237858670|ref|NP_001153588.1| UPF0235 protein C15orf40 isoform e [Homo sapiens]
          Length = 149

 Score = 64.3 bits (155), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 56/92 (60%), Gaps = 4/92 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V
Sbjct: 31  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 89  AIAAPPSEGEANAELCRYLSKVLELRKSDVVL 120


>gi|237858668|ref|NP_001153587.1| UPF0235 protein C15orf40 isoform d [Homo sapiens]
          Length = 167

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 56/92 (60%), Gaps = 4/92 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V
Sbjct: 31  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNV 88

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 89  AIAAPPSEGEANAELCRYLSKVLELRKSDVVL 120


>gi|436842528|ref|YP_007326906.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
           AM13 = DSM 14728]
 gi|432171434|emb|CCO24807.1| conserved protein of unknown function [Desulfovibrio hydrothermalis
           AM13 = DSM 14728]
          Length = 106

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 54/84 (64%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           LPS IR     S  +++  +PG+++  +T     +V V+I+APA D +AN AL  ++++ 
Sbjct: 8   LPSYIRPCGSRSWKLSVWVQPGARTEGVTGEYQGSVRVRINAPAVDNKANKALARFVAAR 67

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVE 111
           LG+K R +SI SG  +R K+++VE
Sbjct: 68  LGLKNRNISIASGQTNRKKVLLVE 91


>gi|312084117|ref|XP_003144143.1| hypothetical protein LOAG_08565 [Loa loa]
 gi|307760694|gb|EFO19928.1| hypothetical protein LOAG_08565 [Loa loa]
          Length = 129

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 69/125 (55%), Gaps = 4/125 (3%)

Query: 7   GKSKAKSAGSTQSKIKT-NDENLP-SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVG 64
           GK+K+K  G     + T N E L  S I L     + + IHAKP +K+  + D+    V 
Sbjct: 2   GKNKSKIEGMVSVTMNTSNKEKLKDSAISLNKNGDIILRIHAKPNAKTTRVIDIGANEVE 61

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKI--VIVEEITPENVLNSL 122
           + I AP  DG+AN AL+  M  +L +++ +++  +G++SR K+  ++ + IT E V   L
Sbjct: 62  LAIAAPPHDGQANEALINAMMDILELRKNEITFDTGARSRSKVLRLMSKRITLEEVREKL 121

Query: 123 GKASS 127
            + ++
Sbjct: 122 ERNAT 126


>gi|116784383|gb|ABK23322.1| unknown [Picea sitchensis]
 gi|224284776|gb|ACN40118.1| unknown [Picea sitchensis]
          Length = 232

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 57/99 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLEY
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEY 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           M  VLG++  Q+++  G  ++ K+++VE+++  +V   L
Sbjct: 188 MGKVLGLRLTQMTLQRGWNNKSKLLVVEDLSARDVYEKL 226


>gi|115471615|ref|NP_001059406.1| Os07g0295200 [Oryza sativa Japonica Group]
 gi|34394983|dbj|BAC84531.1| unknown protein [Oryza sativa Japonica Group]
 gi|113610942|dbj|BAF21320.1| Os07g0295200 [Oryza sativa Japonica Group]
 gi|215765278|dbj|BAG86975.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218199457|gb|EEC81884.1| hypothetical protein OsI_25695 [Oryza sativa Indica Group]
 gi|222636860|gb|EEE66992.1| hypothetical protein OsJ_23901 [Oryza sativa Japonica Group]
          Length = 232

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 59/102 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
           +D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K++IVE+++   V   L +A
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEA 229


>gi|168035145|ref|XP_001770071.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162678597|gb|EDQ65053.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 232

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/99 (34%), Positives = 55/99 (55%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++   IT V+ + V V + APA  GEAN  LLEY
Sbjct: 128 QDAPVPPCISQIDGGLVQVGIEVEDRAQRSQITRVNADDVRVTVAAPAARGEANNELLEY 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           M  VLG++  Q+++  G  ++ K+++VE++T   V   L
Sbjct: 188 MGKVLGLRLTQMTLQRGWNNKSKLLVVEDLTVREVYEKL 226


>gi|357111014|ref|XP_003557310.1| PREDICTED: UPF0235 protein C15orf40 homolog [Brachypodium
           distachyon]
          Length = 232

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 59/102 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
           +D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 HDAPVPPCITQLQGGLVQVAIEVEDRAQRSAITRVNADDVRVAVAAPATRGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K++IVE+++   V   L +A
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEA 229


>gi|149057381|gb|EDM08704.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus]
 gi|149057386|gb|EDM08709.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus]
          Length = 114

 Score = 63.9 bits (154), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 49/84 (58%), Gaps = 13/84 (15%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I IHAKPGSK  ++T            AP  +GEANA L  Y+S VL +++  V +  
Sbjct: 36  VTIAIHAKPGSKQNAVT------------APPSEGEANAELCRYLSKVLDLRKSDVVLDK 83

Query: 100 GSKSRDKIV-IVEEITPENVLNSL 122
           G KSR+K+V ++   TPE VL  L
Sbjct: 84  GGKSREKVVKLLASTTPEEVLEKL 107


>gi|145334887|ref|NP_001078789.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332010366|gb|AED97749.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 213

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 58/102 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 109 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 168

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  S+ K+++VE+++   V   L +A
Sbjct: 169 MGRVLGLRLSQMTLQRGWNSKSKLLVVEDLSARQVYEKLLEA 210


>gi|281348884|gb|EFB24468.1| hypothetical protein PANDA_020551 [Ailuropoda melanoleuca]
          Length = 87

 Score = 63.5 bits (153), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/62 (51%), Positives = 42/62 (67%)

Query: 36 PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
          P   V+I IHAKPGSK  ++TDV+ EAV V I AP  +GEANA L  Y+S VL +++  V
Sbjct: 22 PKGCVTIAIHAKPGSKQNAVTDVTAEAVSVAIAAPPSEGEANAELCRYLSKVLELRKSDV 81

Query: 96 SI 97
           +
Sbjct: 82 VL 83


>gi|432095255|gb|ELK26515.1| hypothetical protein MDA_GLEAN10001617 [Myotis davidii]
          Length = 91

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/79 (44%), Positives = 50/79 (63%), Gaps = 2/79 (2%)

Query: 48  PGSKSCSIT-DVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           PG +  ++  D++ EAV V I AP  +GEANA L  Y+S VL +++  V +  GSKSR+K
Sbjct: 8   PGRRWFALVRDLTTEAVSVAIAAPPSEGEANAELCRYLSKVLELRKSDVVLDKGSKSREK 67

Query: 107 IV-IVEEITPENVLNSLGK 124
           +V ++   TPE VL  L K
Sbjct: 68  VVKLLASTTPEEVLEKLEK 86


>gi|426380124|ref|XP_004056730.1| PREDICTED: UPF0235 protein C15orf40 homolog [Gorilla gorilla
           gorilla]
          Length = 149

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/91 (41%), Positives = 56/91 (61%), Gaps = 5/91 (5%)

Query: 12  KSAGST---QSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQ 66
           + AG+T   +S+ K  +  LP    + + P   V+I IHAKPGSK  ++TD++ EAV V 
Sbjct: 30  RKAGATTKDKSQSKEPERPLPPLGPVAVDPKGCVTIAIHAKPGSKQNAVTDLTAEAVNVA 89

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 90  IAAPPSEGEANAELCRYLSKVLELRKSDVVL 120


>gi|9758285|dbj|BAB08809.1| unnamed protein product [Arabidopsis thaliana]
          Length = 213

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 58/102 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 109 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 168

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  S+ K+++VE+++   V   L +A
Sbjct: 169 MGRVLGLRLSQMTLQRGWNSKSKLLVVEDLSARQVYEKLLEA 210


>gi|30697845|ref|NP_568972.2| uncharacterized protein [Arabidopsis thaliana]
 gi|26452404|dbj|BAC43287.1| unknown protein [Arabidopsis thaliana]
 gi|332010365|gb|AED97748.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 232

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 58/102 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  S+ K+++VE+++   V   L +A
Sbjct: 188 MGRVLGLRLSQMTLQRGWNSKSKLLVVEDLSARQVYEKLLEA 229


>gi|297793925|ref|XP_002864847.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297310682|gb|EFH41106.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 232

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 58/102 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  S+ K+++VE+++   V   L +A
Sbjct: 188 MGRVLGLRLSQMTLQRGWNSKSKLLVVEDLSARQVYEKLLEA 229


>gi|358638832|dbj|BAL26129.1| hypothetical protein AZKH_3845 [Azoarcus sp. KH32C]
          Length = 99

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 37/87 (42%), Positives = 52/87 (59%), Gaps = 2/87 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           SV +T+H +PG+K   I  +  EA+ V++ AP  DG+ANAAL  +++   GV R  V++ 
Sbjct: 13  SVVLTLHIQPGAKRTEIVGLHGEALKVRLAAPPVDGKANAALCAFLAEFCGVSRSMVTLV 72

Query: 99  SGSKSRDKIVIVEEITPENV--LNSLG 123
           SG  SR K V VE    E V  L +LG
Sbjct: 73  SGETSRAKRVRVEAPGAEAVARLRALG 99


>gi|156100181|ref|XP_001615818.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804692|gb|EDL46091.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 178

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 63/111 (56%), Gaps = 3/111 (2%)

Query: 12  KSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD-EAVGVQIDAP 70
           KS      K K   ENLPS I+     +  I +  KP +K+ SI   SD E + + I   
Sbjct: 63  KSVAKETVKAKDATENLPSYIK-KKDENFLINLRVKPNAKNTSIYFNSDREVLNINIQEQ 121

Query: 71  AKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNS 121
             + ++N A++ Y S +L +K+R +SI SG KSRDK+++V  I+ ++ LNS
Sbjct: 122 PINNQSNVAIIGYFSDILDLKKRDISIVSGLKSRDKVLMVSNISLDD-LNS 171


>gi|388510890|gb|AFK43511.1| unknown [Lotus japonicus]
          Length = 232

 Score = 62.8 bits (151), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 57/102 (55%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I     ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISHLEGGLVQVAIEVDDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K+++VE++T   V   L +A
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLVVEDLTVRQVYEKLLEA 229


>gi|148674974|gb|EDL06921.1| RIKEN cDNA 3110040N11, isoform CRA_c [Mus musculus]
          Length = 114

 Score = 62.8 bits (151), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/88 (40%), Positives = 49/88 (55%), Gaps = 13/88 (14%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGS+  ++T            AP   GEANA L  Y+S VL +++  V
Sbjct: 32  PKGFVTIAIHAKPGSRQNAVT------------APPSQGEANAELCRYLSKVLDLRKSDV 79

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSL 122
            +  G KSR+K+V ++   TPE VL  L
Sbjct: 80  VLDKGGKSREKVVKLLASTTPEEVLEKL 107


>gi|417407795|gb|JAA50493.1| Hypothetical protein, partial [Desmodus rotundus]
          Length = 116

 Score = 62.8 bits (151), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/90 (45%), Positives = 57/90 (63%), Gaps = 1/90 (1%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGSK  +IT ++ EAV V + AP  +GEANA L  Y+S VL +++  V
Sbjct: 22  PKGCVTIAIHAKPGSKQNAITGLTAEAVSVAVAAPPSEGEANAELCRYLSKVLELRKSDV 81

Query: 96  SIGSGSKSRDKIV-IVEEITPENVLNSLGK 124
            +  G KSR+K+V ++   +P  VL  L K
Sbjct: 82  ILDKGGKSREKVVKLLASTSPGEVLEKLEK 111


>gi|441616624|ref|XP_004088386.1| PREDICTED: UPF0235 protein C15orf40 homolog [Nomascus leucogenys]
          Length = 168

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 55/92 (59%), Gaps = 4/92 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPG K  ++TD++ EAV V
Sbjct: 32  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGCKQNAVTDLTAEAVNV 89

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 90  AIAAPPSEGEANAELCRYLSKVLELRKSDVVL 121


>gi|332238619|ref|XP_003268501.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Nomascus
           leucogenys]
          Length = 150

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 55/92 (59%), Gaps = 4/92 (4%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSC--IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGV 65
           K+ A + G +QSK    +  LP    + + P   V+I IHAKPG K  ++TD++ EAV V
Sbjct: 32  KAGATTKGKSQSK--EPERPLPPLGPVAVDPKGCVTIAIHAKPGCKQNAVTDLTAEAVNV 89

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            I AP  +GEANA L  Y+S VL +++  V +
Sbjct: 90  AIAAPPSEGEANAELCRYLSKVLELRKSDVVL 121


>gi|225456297|ref|XP_002283689.1| PREDICTED: uncharacterized protein LOC100247416 [Vitis vinifera]
 gi|147823132|emb|CAN75279.1| hypothetical protein VITISV_030868 [Vitis vinifera]
 gi|297734405|emb|CBI15652.3| unnamed protein product [Vitis vinifera]
          Length = 232

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 58/102 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K+++VE+++   V   L +A
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLVVEDLSARQVYEKLLEA 229


>gi|68074573|ref|XP_679202.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56499889|emb|CAH95244.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 106

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 63/104 (60%), Gaps = 2/104 (1%)

Query: 22  KTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSIT-DVSDEAVGVQIDAPAKDGEANAAL 80
           KT  E LPS ++    +++ I +  KP SK+ SI  +V  E + + I     + ++N A+
Sbjct: 1   KTITECLPSYLK-KKDNNLLINLRVKPNSKNTSIYFNVDTEVLNINIQEQPVNNQSNVAI 59

Query: 81  LEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGK 124
           + Y S +L +K+R +SI +G KSRDK+++V  I+ E++ N + +
Sbjct: 60  ISYFSDILNLKKRDISIVAGLKSRDKVLMVSNISVEDLSNKINQ 103


>gi|255540269|ref|XP_002511199.1| conserved hypothetical protein [Ricinus communis]
 gi|223550314|gb|EEF51801.1| conserved hypothetical protein [Ricinus communis]
          Length = 232

 Score = 62.8 bits (151), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 58/102 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K+++VE+++   V   L +A
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLVVEDLSARQVYEKLLEA 229


>gi|255630115|gb|ACU15411.1| unknown [Glycine max]
          Length = 225

 Score = 62.4 bits (150), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/95 (33%), Positives = 55/95 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISHLEGGLVQLAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENV 118
           M  VLG++  Q+++  G  ++ K+++VE++T   V
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLVVEDLTARQV 222


>gi|257095142|ref|YP_003168783.1| hypothetical protein CAP2UW1_3597 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
 gi|257047666|gb|ACV36854.1| protein of unknown function DUF167 [Candidatus Accumulibacter
           phosphatis clade IIA str. UW-1]
          Length = 96

 Score = 62.4 bits (150), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 53/84 (63%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++++TIH +PG+K+  I     +A+ V+I AP  DG ANAAL+++++  LG+ R  V + 
Sbjct: 11  AITVTIHLQPGAKANEIAGRHGDALKVRITAPPVDGRANAALVDFLAQRLGLSRSAVELK 70

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
           SG  SR K++ +   + E VL  L
Sbjct: 71  SGLTSRRKVLRISGASAEAVLCLL 94


>gi|291615210|ref|YP_003525367.1| hypothetical protein Slit_2755 [Sideroxydans lithotrophicus ES-1]
 gi|291585322|gb|ADE12980.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus
           ES-1]
          Length = 94

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 52/82 (63%), Gaps = 2/82 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++T+H +PG+K   +  +   A+ +++ AP  +G AN ALL++++   GV  RQV +  
Sbjct: 12  LTLTLHIQPGAKRTEVAGLHGAALKIRLAAPPIEGRANEALLKFIAESFGVPLRQVELKQ 71

Query: 100 GSKSRDKIVIV--EEITPENVL 119
           G +SR K+V V   +I PE++L
Sbjct: 72  GGQSRHKVVAVTASKIEPESLL 93


>gi|85859583|ref|YP_461785.1| cytoplasmic protein [Syntrophus aciditrophicus SB]
 gi|85722674|gb|ABC77617.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB]
          Length = 118

 Score = 62.0 bits (149), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 50/85 (58%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + VS  +H  P S  C++    + A+ +++ AP  DG+AN   LE+++ +LGVK+ Q+ I
Sbjct: 11  NGVSFCVHVLPRSAKCALAGAQEGALRIKLTAPPVDGKANDECLEFLAGILGVKKGQMDI 70

Query: 98  GSGSKSRDKIVIVEEITPENVLNSL 122
            SG  SR KIV +  +  E + N L
Sbjct: 71  ISGHTSRRKIVQIMNVPREPLENRL 95


>gi|389585285|dbj|GAB68016.1| hypothetical protein PCYB_125820 [Plasmodium cynomolgi strain B]
          Length = 176

 Score = 62.0 bits (149), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 59/97 (60%), Gaps = 3/97 (3%)

Query: 26  ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD-EAVGVQIDAPAKDGEANAALLEYM 84
           ENLPS I+     +  I +  KP +K+ SI   SD E + + I     + ++N A++ Y 
Sbjct: 75  ENLPSYIK-KKDENFLINLRVKPNAKNTSIYFNSDREVLNINIQEQPINNQSNVAIIGYF 133

Query: 85  SSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNS 121
           S +L +K+R +SI SG KSRDK+++V  I+ ++ LNS
Sbjct: 134 SDILDLKKRDISIVSGLKSRDKVLMVSNISLDD-LNS 169


>gi|258597870|ref|XP_001348716.2| conserved protein, unknown function [Plasmodium falciparum 3D7]
 gi|255528895|gb|AAN37155.2| conserved protein, unknown function [Plasmodium falciparum 3D7]
          Length = 147

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 65/116 (56%), Gaps = 2/116 (1%)

Query: 8   KSKAKSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD-EAVGVQ 66
           +S   S   ++  +K   E+LPS I+     S  IT+  KP +K  SI    D E + + 
Sbjct: 28  RSNKMSKNISKEALKNVVESLPSYIK-KKNESFLITLRVKPNAKKTSIYFNQDKEVLNIN 86

Query: 67  IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           I     + ++N A++ Y S +L +K+R +SI SG KSRDK+++V  I+ +++ N +
Sbjct: 87  IQEQPVNNQSNVAIIGYFSDILNLKKRDISIVSGLKSRDKVLMVSNISLDDLNNKI 142


>gi|340508740|gb|EGR34382.1| hypothetical protein IMG5_013830 [Ichthyophthirius multifiliis]
          Length = 150

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 44/69 (63%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I I AKP SK   ++ +SD+ V V I AP KDGEAN  L++  SS+L +K+  + +  
Sbjct: 59  LYIQIKAKPNSKISQVSSISDDCVDVNIAAPPKDGEANEELIQLFSSLLNIKKSSLQLDK 118

Query: 100 GSKSRDKIV 108
           G KS+ K++
Sbjct: 119 GGKSKSKLL 127


>gi|223944751|gb|ACN26459.1| unknown [Zea mays]
 gi|414884282|tpg|DAA60296.1| TPA: COG1872 containing protein, Uncharacterized ACR, YggU family
           [Zea mays]
          Length = 194

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 59/102 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
           +D  +P CI  +    V + I  +  ++  +IT V+ + V V + A A  GEAN+ LLE+
Sbjct: 90  HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAALAARGEANSELLEF 149

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K++IVE+++   V   L +A
Sbjct: 150 MGKVLGLRLTQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEA 191


>gi|358394345|gb|EHK43738.1| hypothetical protein TRIATDRAFT_284504 [Trichoderma atroviride IMI
           206040]
          Length = 120

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 59/100 (59%), Gaps = 8/100 (8%)

Query: 36  PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P   + + +H KPG+      I  V++E + + + A AKDGEAN A++E +S  L + + 
Sbjct: 19  PQGILQLRLHVKPGASKNREGIQAVTEETIELCVAAQAKDGEANQAVIEVLSEALDIPKS 78

Query: 94  QVSIGSGSKSRDKIVIVEEITPE------NVLNSLGKASS 127
           ++S+  G++SRDK V+V+EI  +       +L  L KASS
Sbjct: 79  KLSLVQGARSRDKTVVVQEIKGDGSVYAATILELLHKASS 118


>gi|221059313|ref|XP_002260302.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193810375|emb|CAQ41569.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 176

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 59/97 (60%), Gaps = 3/97 (3%)

Query: 26  ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSD-EAVGVQIDAPAKDGEANAALLEYM 84
           ENLPS I+     +  I +  KP +K+ SI   SD E + + I     + ++N A++ Y 
Sbjct: 75  ENLPSYIK-KKDENFLINLRVKPNAKNTSIYFNSDREVLNINIQEQPINNQSNIAIIGYF 133

Query: 85  SSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNS 121
           S +L +K+R +SI SG KSRDK+++V  I+ ++ LNS
Sbjct: 134 SDILDLKKRDISIVSGLKSRDKVLMVSNISLDD-LNS 169


>gi|302756509|ref|XP_002961678.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii]
 gi|300170337|gb|EFJ36938.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii]
          Length = 233

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 60/110 (54%)

Query: 13  SAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAK 72
           S  S  ++    D  +P CI       V + I  +  ++  +IT V+ + V V + APA 
Sbjct: 117 SLSSVAAETNPQDAPVPPCISQTEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAA 176

Query: 73  DGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
            GEAN  LLEYM+ VL ++  Q+++  G  ++ K+++VE+++  +V   L
Sbjct: 177 RGEANNELLEYMAKVLSLRATQMTLQRGWNNKSKLLVVEDLSVRDVYEKL 226


>gi|167999197|ref|XP_001752304.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696699|gb|EDQ83037.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 231

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/99 (33%), Positives = 54/99 (54%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++   IT V+ + V V + APA  GEAN  LLEY
Sbjct: 127 QDAPVPPCISQIDGGLVQVAIEVEDRAQRSQITRVNADDVRVTVAAPAARGEANNELLEY 186

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           M  VLG++  Q+++  G  ++ K++ VE+++   V   L
Sbjct: 187 MGKVLGLRLTQMTLQRGWNNKSKLLAVEDLSVREVYEKL 225


>gi|194688626|gb|ACF78397.1| unknown [Zea mays]
 gi|414884278|tpg|DAA60292.1| TPA: hypothetical protein ZEAMMB73_531342 [Zea mays]
          Length = 232

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 59/102 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
           +D  +P CI  +    V + I  +  ++  +IT V+ + V V + A A  GEAN+ LLE+
Sbjct: 128 HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAALAARGEANSELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K++IVE+++   V   L +A
Sbjct: 188 MGKVLGLRLTQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEA 229


>gi|340518653|gb|EGR48893.1| predicted protein [Trichoderma reesei QM6a]
          Length = 121

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 59/101 (58%), Gaps = 8/101 (7%)

Query: 36  PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P   + + +H KPG+      I  V+++A+ + + A AKDGEAN A++E +S VL + + 
Sbjct: 19  PQGVLQLRLHVKPGASRNREGIQAVTEDAIELCVAAQAKDGEANQAVIEVLSEVLDIPKS 78

Query: 94  QVSIGSGSKSRDKIVIVEEI------TPENVLNSLGKASSS 128
           ++ +  G++SRDK V+V++           VL  L KASS+
Sbjct: 79  KLVLAQGARSRDKTVVVQDFKGDGASYANTVLELLHKASSA 119


>gi|429329316|gb|AFZ81075.1| proton translocating inorganic pyrophosphatase, putative [Babesia
           equi]
          Length = 110

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 53/88 (60%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           SS+ + ++ KPG+K   +    +  + +QI AP ++G  N AL+EY++ VL +K+R +S+
Sbjct: 21  SSLILKVNVKPGAKQTQVVGAQEGQLTLQIAAPPREGACNEALVEYVAEVLKLKKRSISL 80

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLGKA 125
             G KSRDK++ +   T    +  + +A
Sbjct: 81  IHGHKSRDKVLSITGTTLSQAIQHINEA 108


>gi|324527094|gb|ADY48748.1| Unknown [Ascaris suum]
          Length = 181

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 56/89 (62%), Gaps = 2/89 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + IHAKP +K   +T++++  + V I AP   G+AN AL + ++ +LG+++  V   +
Sbjct: 90  ILLKIHAKPNAKISRVTEINETEIEVAIAAPPHKGQANEALTDAIAEILGLRKNDVFFDT 149

Query: 100 GSKSRDKIVIV--EEITPENVLNSLGKAS 126
           G++SR K++++  + IT E V   L K++
Sbjct: 150 GARSRSKLLVINSQRITVEEVREKLKKSA 178


>gi|7508740|pir||T26031 hypothetical protein W01A8.2 - Caenorhabditis elegans
          Length = 263

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 45/72 (62%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + IHAKPG+K   +  + D  V V I A  ++G AN  L+ Y+ S LG+++ ++    
Sbjct: 37  IGLHIHAKPGAKKSCVVAIGDSEVDVAIGAAPREGAANEELISYLMSALGLRKNELQFDK 96

Query: 100 GSKSRDKIVIVE 111
           G+KSR K+V+++
Sbjct: 97  GAKSRSKVVLID 108


>gi|312375313|gb|EFR22710.1| hypothetical protein AND_14314 [Anopheles darlingi]
          Length = 155

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 51/95 (53%), Gaps = 24/95 (25%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQID------------------------APAKDGEAN 77
           + I AKPG+K+  ITDVS+E VG QI                         AP  DGEAN
Sbjct: 39  VKILAKPGAKTSGITDVSEEGVGCQIGMSLVDGSTLTLFDQTIQLHLSPLAAPPIDGEAN 98

Query: 78  AALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEE 112
             L+ Y+S +L +++  +S+  GSKSR K +++++
Sbjct: 99  TELIRYLSKLLELRKSDISLDRGSKSRQKTIVLDK 133


>gi|302763307|ref|XP_002965075.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii]
 gi|300167308|gb|EFJ33913.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii]
          Length = 233

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/110 (30%), Positives = 60/110 (54%)

Query: 13  SAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAK 72
           S  S  ++    D  +P CI       V + I  +  ++  +IT V+ + V V + APA 
Sbjct: 117 SLSSMAAETNPQDAPVPPCISQTEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAA 176

Query: 73  DGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
            GEAN  LLEYM+ VL ++  Q+++  G  ++ K+++VE+++  +V   L
Sbjct: 177 RGEANNELLEYMAKVLSLRVTQMTLQRGWNNKSKLLVVEDLSVRDVYEKL 226


>gi|226529615|ref|NP_001152637.1| uncharacterized protein LOC100286278 [Zea mays]
 gi|195658405|gb|ACG48670.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea
           mays]
          Length = 194

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 59/102 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
           +D  +P CI  +    V + I  +  ++  ++T V+ + V V + A A  GEAN+ LLE+
Sbjct: 90  HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAVTRVNADDVRVTVAALAARGEANSELLEF 149

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K++IVE+++   V   L +A
Sbjct: 150 MGKVLGLRLTQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEA 191


>gi|133930345|ref|NP_001076616.1| Protein W01A8.2 [Caenorhabditis elegans]
 gi|114420882|emb|CAL44973.1| Protein W01A8.2 [Caenorhabditis elegans]
          Length = 127

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/72 (36%), Positives = 45/72 (62%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + IHAKPG+K   +  + D  V V I A  ++G AN  L+ Y+ S LG+++ ++    
Sbjct: 37  IGLHIHAKPGAKKSCVVAIGDSEVDVAIGAAPREGAANEELISYLMSALGLRKNELQFDK 96

Query: 100 GSKSRDKIVIVE 111
           G+KSR K+V+++
Sbjct: 97  GAKSRSKVVLID 108


>gi|449675515|ref|XP_004208423.1| PREDICTED: uncharacterized protein LOC100205424 [Hydra
           magnipapillata]
          Length = 1002

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 51/85 (60%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I+AKPG+K   +TD+S E +G+Q+ A  +DG+AN  LL Y+S +  +K+  + +
Sbjct: 911 NEIILQIYAKPGAKRNKVTDISVEFIGIQLAAQPRDGKANDELLSYLSELFNIKKSGICM 970

Query: 98  GSGSKSRDKIVIVEEITPENVLNSL 122
             G  SR K + V     E+ + SL
Sbjct: 971 IRGDTSRIKTIKVSTDKSEDDIRSL 995


>gi|338715974|ref|XP_003363374.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40 homolog
           [Equus caballus]
          Length = 196

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 50/96 (52%), Gaps = 9/96 (9%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P   V+I IHAKPGSK  +I DV+ EAV + + AP  +GE N  L  Y S VL +++  V
Sbjct: 94  PKGCVTIAIHAKPGSKQHAIPDVTAEAVSMDVAAPPLEGEVNTELCCYCSEVLDLRKSDV 153

Query: 96  S--------IGSGSKSRDKIV-IVEEITPENVLNSL 122
                    +  G KS +K V  +   TPE +L   
Sbjct: 154 RKCPPPPHVLDKGGKSHEKAVKXLASTTPEEILEKF 189


>gi|167386103|ref|XP_001737619.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165899553|gb|EDR26129.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 118

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 46/83 (55%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S+V I I  KP +K+  +  V D  + V IDAP  DG+AN  ++ +M+S  G+K+  VS+
Sbjct: 31  SNVIIEIEIKPNAKTSELQGVEDGILKVAIDAPPIDGKANTEVIAFMASTFGIKKSNVSL 90

Query: 98  GSGSKSRDKIVIVEEITPENVLN 120
             G  S  K +  E  T E VL 
Sbjct: 91  IKGQTSHHKTLQFENWTREKVLQ 113


>gi|393227308|gb|EJD34993.1| DUF167-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 113

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/89 (39%), Positives = 54/89 (60%), Gaps = 6/89 (6%)

Query: 40  VSITIHAKPGSKSCSITDVSDEA-----VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           + + +  +PG+ +  +T V+D A     +  ++ A A DGEAN AL   ++  LGV R +
Sbjct: 23  LQLAVRVQPGASASEVT-VADAAENLLQLSARVTARAHDGEANDALRRLLADCLGVSRSR 81

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSLG 123
           VSI  G KSRDK+V V+ +T ++VL  LG
Sbjct: 82  VSIVRGLKSRDKVVAVDGMTRDDVLAVLG 110


>gi|268567307|ref|XP_002639944.1| Hypothetical protein CBG10764 [Caenorhabditis briggsae]
          Length = 258

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 46/72 (63%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + IHAKPG+K   +  +++  + V I A  ++G AN  L+ Y+ S LG+++ ++    
Sbjct: 33  IGLRIHAKPGAKKSGVVAINESEIDVAIGAAPREGAANEELVSYLMSALGLRKNELQFDK 92

Query: 100 GSKSRDKIVIVE 111
           G+KSR K+V++E
Sbjct: 93  GAKSRSKVVLIE 104


>gi|374584225|ref|ZP_09657317.1| UPF0235 protein yggU [Leptonema illini DSM 21528]
 gi|373873086|gb|EHQ05080.1| UPF0235 protein yggU [Leptonema illini DSM 21528]
          Length = 75

 Score = 59.3 bits (142), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 46/71 (64%)

Query: 43  TIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSK 102
           ++H KP S    +  + D  + V++ +P +DG+ANA L+E ++  LGVKRR VSI SG  
Sbjct: 5   SVHVKPKSSKQGVHVLEDGTLEVRVHSPPEDGKANAELIETLARHLGVKRRHVSIVSGET 64

Query: 103 SRDKIVIVEEI 113
           +R KI+ V ++
Sbjct: 65  ARHKIIEVSDV 75


>gi|372487286|ref|YP_005026851.1| hypothetical protein Dsui_0600 [Dechlorosoma suillum PS]
 gi|359353839|gb|AEV25010.1| TIGR00251 family protein [Dechlorosoma suillum PS]
          Length = 96

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 47/71 (66%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +T+H +PG+K   +  +  +A+ +++ AP  DG+ANAALL +++  LG+ +  VS+ SG 
Sbjct: 13  LTLHIQPGAKKTEVCGLHGDALKIRLAAPPVDGKANAALLAFVADRLGLPKSAVSLKSGQ 72

Query: 102 KSRDKIVIVEE 112
            SR K+V V E
Sbjct: 73  TSRRKVVEVAE 83


>gi|258514375|ref|YP_003190597.1| hypothetical protein Dtox_1088 [Desulfotomaculum acetoxidans DSM
           771]
 gi|257778080|gb|ACV61974.1| protein of unknown function DUF167 [Desulfotomaculum acetoxidans
           DSM 771]
          Length = 98

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 51/86 (59%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + V   +  +P +    +  + ++AV +++ AP  +GEAN AL ++++  LGV R QV +
Sbjct: 9   NGVVFKVRVQPRASKDQVAGLWEDAVKIRLTAPPVEGEANRALCDFLAKHLGVTRAQVDL 68

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLG 123
            +G   R+K+V V  IT E+VL  LG
Sbjct: 69  VTGQTGRNKLVRVSGITAESVLQRLG 94


>gi|120602738|ref|YP_967138.1| hypothetical protein Dvul_1694 [Desulfovibrio vulgaris DP4]
 gi|120562967|gb|ABM28711.1| protein of unknown function DUF167 [Desulfovibrio vulgaris DP4]
          Length = 108

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 50/83 (60%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P+ +  +     ++ +  +PG+K   +  V+D  + V++ APA D +AN  L  YM+S+L
Sbjct: 7   PAYVEAMGDGRWALLVWVQPGAKKDGLAGVADGRLRVRLSAPAVDNKANRGLERYMASLL 66

Query: 89  GVKRRQVSIGSGSKSRDKIVIVE 111
           GV+  +VS+ SG  SR K V++E
Sbjct: 67  GVRPARVSVASGQTSRRKRVVIE 89


>gi|383456913|ref|YP_005370902.1| hypothetical protein COCOR_04940 [Corallococcus coralloides DSM
           2259]
 gi|380730113|gb|AFE06115.1| hypothetical protein COCOR_04940 [Corallococcus coralloides DSM
           2259]
          Length = 99

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 47/76 (61%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + V +T+  +P +    +    D  + +Q+ AP  DGEANAAL+E+++  LGV RRQV++
Sbjct: 11  TGVELTVLVQPRASRTKVVGEHDGQLKIQLAAPPVDGEANAALVEFIAKTLGVPRRQVTL 70

Query: 98  GSGSKSRDKIVIVEEI 113
            +G  SR K + VE +
Sbjct: 71  VAGDTSRRKRLRVEGV 86


>gi|442320552|ref|YP_007360573.1| hypothetical protein MYSTI_03583 [Myxococcus stipitatus DSM 14675]
 gi|441488194|gb|AGC44889.1| hypothetical protein MYSTI_03583 [Myxococcus stipitatus DSM 14675]
          Length = 98

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 44/74 (59%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  V + +  +P +    +    D  + +Q+ AP  DGEANAALLE+++  LGV RRQV+
Sbjct: 10  PEGVELALLIQPRASRTRVVGEHDGLLKIQLAAPPVDGEANAALLEFLAKKLGVPRRQVT 69

Query: 97  IGSGSKSRDKIVIV 110
           + +G  SR K V V
Sbjct: 70  LLAGDTSRRKRVQV 83


>gi|46579785|ref|YP_010593.1| hypothetical protein DVU1374 [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|387153132|ref|YP_005702068.1| hypothetical protein Deval_1269 [Desulfovibrio vulgaris RCH1]
 gi|46449200|gb|AAS95852.1| conserved hypothetical protein [Desulfovibrio vulgaris str.
           Hildenborough]
 gi|311233576|gb|ADP86430.1| protein of unknown function DUF167 [Desulfovibrio vulgaris RCH1]
          Length = 106

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 50/83 (60%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P+ +  +     ++ +  +PG+K   +  V+D  + V++ APA D +AN  L  YM+S+L
Sbjct: 5   PAYVEAMGDGRWALLVWVQPGAKKDGLAGVADGRLRVRLSAPAVDNKANRGLERYMASLL 64

Query: 89  GVKRRQVSIGSGSKSRDKIVIVE 111
           GV+  +VS+ SG  SR K V++E
Sbjct: 65  GVRPARVSVASGQTSRRKRVVIE 87


>gi|422348904|ref|ZP_16429796.1| TIGR00251 family protein [Sutterella wadsworthensis 2_1_59BFAA]
 gi|404658956|gb|EKB31818.1| TIGR00251 family protein [Sutterella wadsworthensis 2_1_59BFAA]
          Length = 112

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 53/83 (63%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +HA+PG+K  ++  V  + + + + +P  DG+ANA L++Y+S  LGV +  V + SG 
Sbjct: 25  IAVHAQPGAKRTAVVGVHGDRLKIALASPPVDGKANATLIKYLSKGLGVSKSSVRLLSGD 84

Query: 102 KSRDKIVIVEEITPENVLNSLGK 124
            SR+K + V  ++ +++L +L K
Sbjct: 85  TSREKRIEVVGLSTDDLLEALKK 107


>gi|406936038|gb|EKD69854.1| hypothetical protein ACD_46C00713G0005 [uncultured bacterium]
          Length = 92

 Score = 58.9 bits (141), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 51/77 (66%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +TI+ +PGSK   I  + D  + ++++AP  DG+ANAAL+ ++S+++G+ +  + + SG 
Sbjct: 7   LTIYLQPGSKKSEICGMHDGHIKIKLNAPPVDGKANAALILFLSNLIGIPKSSIELISGQ 66

Query: 102 KSRDKIVIVEEITPENV 118
           KSR K + +  I+ E +
Sbjct: 67  KSRIKRIKITGISQEMI 83


>gi|167630164|ref|YP_001680663.1| hypothetical protein HM1_2095 [Heliobacterium modesticaldum Ice1]
 gi|259646567|sp|B0TGP1.1|Y2027_HELMI RecName: Full=UPF0235 protein Helmi_20270
 gi|167592904|gb|ABZ84652.1| conserved hypothetical protein, uncharacterized acr, yggu family
           [Heliobacterium modesticaldum Ice1]
          Length = 96

 Score = 58.5 bits (140), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 50/83 (60%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P  S+   I  +P +    +  + D+A+ V++ AP  DGEANAA L++++  LG+ R QV
Sbjct: 8   PGGSIRFRIRVQPRASKNEVCGLLDDALKVRLTAPPVDGEANAACLQFIAKTLGLSRSQV 67

Query: 96  SIGSGSKSRDKIVIVEEITPENV 118
            + +G  SR K + VE ++ E++
Sbjct: 68  RLVAGETSRLKTLEVEGVSAEDL 90


>gi|82701713|ref|YP_411279.1| hypothetical protein Nmul_A0579 [Nitrosospira multiformis ATCC
           25196]
 gi|82409778|gb|ABB73887.1| Protein of unknown function DUF167 [Nitrosospira multiformis ATCC
           25196]
          Length = 105

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 46/70 (65%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +T+H +PG++   +     +A+ +++ AP  +G AN ALL +++ V GV +RQV +  G+
Sbjct: 21  LTLHIQPGARRTEVVGSHGDALKIKLAAPPVEGAANVALLAFLAGVFGVPQRQVILRQGA 80

Query: 102 KSRDKIVIVE 111
           +SR KIV ++
Sbjct: 81  RSRRKIVEID 90


>gi|444725293|gb|ELW65866.1| T-cell surface glycoprotein CD3 delta chain [Tupaia chinensis]
          Length = 178

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 54/81 (66%), Gaps = 3/81 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I IHAKPGSK  ++TD++ +AV V I AP   GEA+A L  ++S VL ++++ V++  
Sbjct: 9   VTIAIHAKPGSKQNAVTDLTTKAVNVAITAPPA-GEASAELCRFLSKVLELRKKDVALDK 67

Query: 100 GSKSRDKIVIVEEITPENVLN 120
           G    +  +IVEE+  + ++N
Sbjct: 68  GVSPFN--IIVEELEDKVLVN 86


>gi|66813694|ref|XP_641026.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4]
 gi|74855711|sp|Q54UW1.1|U235_DICDI RecName: Full=UPF0235 protein
 gi|60469052|gb|EAL67049.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4]
          Length = 124

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 49/79 (62%), Gaps = 1/79 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I ++  P SK  SI    D+ + ++I  P  DG+AN  ++E++S  L +++  + +G 
Sbjct: 32  IKINVNVHPNSKESSIVSFEDQILSLRISEPPIDGKANIGVIEFLSKELNIRKSNIEVGK 91

Query: 100 GSKSRDKIVIVEEITPENV 118
           GSKSR+K V + +I+ EN+
Sbjct: 92  GSKSRNKSVEI-DISSENI 109


>gi|224119652|ref|XP_002318126.1| predicted protein [Populus trichocarpa]
 gi|222858799|gb|EEE96346.1| predicted protein [Populus trichocarpa]
          Length = 231

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 57/102 (55%), Gaps = 1/102 (0%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCITQLG-GLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 186

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K+++VE++    V   L +A
Sbjct: 187 MGRVLGLRLSQMTLQRGWNNKSKLLVVEDLYARQVYEKLLEA 228


>gi|310822209|ref|YP_003954567.1| hypothetical protein STAUR_4962 [Stigmatella aurantiaca DW4/3-1]
 gi|309395281|gb|ADO72740.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1]
          Length = 98

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 56/99 (56%), Gaps = 1/99 (1%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +P  ++++P   V + +  +P +    +    D  + +Q+ AP  DGEANAAL+E+++  
Sbjct: 1   MPPWLKVLP-EGVELAVLVQPRASRTRVVGEHDGMLKLQLAAPPVDGEANAALVEFLAKR 59

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKAS 126
           LG+ RRQV++ +G  +R K V +  +    V   + +AS
Sbjct: 60  LGLPRRQVTLVAGDAARRKRVFLAGVDAARVEAVMSQAS 98


>gi|320108335|ref|YP_004183925.1| hypothetical protein AciPR4_3175 [Terriglobus saanensis SP1PR4]
 gi|319926856|gb|ADV83931.1| protein of unknown function DUF167 [Terriglobus saanensis SP1PR4]
          Length = 104

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 50/83 (60%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+  +  +PG+K   +  +  EAV + + APA DG+AN AL+ +++++L V R  V I S
Sbjct: 16  VTFAVRVQPGAKRSGVVGIYGEAVKIALVAPAVDGKANEALVRFVATLLDVPRMSVEILS 75

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G  SR K+V V  +T   V++ +
Sbjct: 76  GVSSRSKVVKVLGVTSSQVVDGV 98


>gi|224133940|ref|XP_002321697.1| predicted protein [Populus trichocarpa]
 gi|222868693|gb|EEF05824.1| predicted protein [Populus trichocarpa]
          Length = 231

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 62/112 (55%), Gaps = 11/112 (9%)

Query: 22  KTNDENLPSCIRLVPPS------SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGE 75
           +TN ++ P     VPP        V + I  +  ++  +IT V+ + V V + APA  GE
Sbjct: 124 ETNPQDAP-----VPPCISQLGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGE 178

Query: 76  ANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKASS 127
           AN  LLE++  VLG+K  Q+++  G  ++ K+++VE+++   V   L +A+ 
Sbjct: 179 ANNELLEFVGKVLGLKLSQMTLQRGWNNKSKLLVVEDLSARQVYEKLLEAAQ 230


>gi|145590207|ref|YP_001156804.1| hypothetical protein Pnuc_2029 [Polynucleobacter necessarius subsp.
           asymbioticus QLW-P1DMWA-1]
 gi|145048613|gb|ABP35240.1| protein of unknown function DUF167 [Polynucleobacter necessarius
           subsp. asymbioticus QLW-P1DMWA-1]
          Length = 98

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 49/87 (56%), Gaps = 1/87 (1%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P+ +++ +H +PG+K   +  + D  + + + APA + +AN  LL ++S  L + ++Q+ 
Sbjct: 10  PTGITLNLHCQPGAKVTKVVGLHDGCLKISLQAPAIENKANELLLGWLSKQLKIPQKQIQ 69

Query: 97  IGSGSKSRDKIV-IVEEITPENVLNSL 122
             SG  SR K V I   ITPE +   L
Sbjct: 70  FISGQNSRIKRVEIWGSITPEQITQIL 96


>gi|374261190|ref|ZP_09619777.1| hypothetical protein LDG_6154 [Legionella drancourtii LLAP12]
 gi|363538577|gb|EHL31984.1| hypothetical protein LDG_6154 [Legionella drancourtii LLAP12]
          Length = 91

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 2/82 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
            + I ++ +PG+K   I     EA+ +++ AP  +G AN ALL++++ +  V  RQV + 
Sbjct: 9   QIIINLYIQPGAKHTEIAGFHGEALKIRLHAPPIEGRANEALLKFIAQIFAVPTRQVVLK 68

Query: 99  SGSKSRDKIVIVEE--ITPENV 118
            G KSR K +I+    I P N+
Sbjct: 69  RGDKSRLKTLIITGSCIDPANI 90


>gi|196231015|ref|ZP_03129875.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428]
 gi|196224845|gb|EDY19355.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428]
          Length = 93

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 48  PGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKI 107
           P ++   +  V  +AV V++ APA DG+AN AL ++++ VL V  R V I +G KSRDK+
Sbjct: 13  PNARRSEVVGVHGDAVKVKVQAPAMDGKANEALRDFLAEVLTVPARAVEIVAGEKSRDKV 72

Query: 108 VIVEEI-TPENVLNSLGK 124
           V + ++ T E     LGK
Sbjct: 73  VAIADLETDEARRRLLGK 90


>gi|296491729|tpg|DAA33762.1| TPA: C21H15orf40 protein-like [Bos taurus]
          Length = 107

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 50/88 (56%), Gaps = 1/88 (1%)

Query: 34  LVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           + P    ++ IH K GSK  ++TDV+ EAV V I  P  +GEAN  L   +S +L ++  
Sbjct: 8   VYPKGGFTMAIHDKAGSKQNAMTDVTTEAVSVGIAGPPIEGEANVELCCCLSKILELRTS 67

Query: 94  QVSIGSGSKSRDKIVIVEEIT-PENVLN 120
            V +  GSKS +K+V +   T PE +L 
Sbjct: 68  DVVLDKGSKSHEKVVKLLACTPPEEILE 95


>gi|119899753|ref|YP_934966.1| hypothetical protein azo3464 [Azoarcus sp. BH72]
 gi|166232594|sp|A1KB74.1|Y3464_AZOSB RecName: Full=UPF0235 protein azo3464
 gi|119672166|emb|CAL96080.1| conserved hypothetical protein [Azoarcus sp. BH72]
          Length = 98

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 53/94 (56%), Gaps = 2/94 (2%)

Query: 32  IRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVK 91
           +R     S+++T+H +PG++      +  EA+ +++ AP  DG+ANAAL  +++    V 
Sbjct: 5   VREAADGSLTLTLHIQPGARQTGFAGLHGEAMKIRLAAPPVDGKANAALCAFLADFCEVP 64

Query: 92  RRQVSIGSGSKSRDKIVIVEEITP--ENVLNSLG 123
           +  V++ SG  SR K V VE  TP     L +LG
Sbjct: 65  KSAVTLVSGETSRAKRVRVETKTPGLAGRLRALG 98


>gi|116780073|gb|ABK21543.1| unknown [Picea sitchensis]
          Length = 265

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 50/87 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLEY
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEY 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           M  VLG++  Q+++  G  ++ K+++V
Sbjct: 188 MGKVLGLRLTQMTLQRGWNNKSKLLVV 214


>gi|351724451|ref|NP_001237058.1| uncharacterized protein LOC100306581 [Glycine max]
 gi|255628955|gb|ACU14822.1| unknown [Glycine max]
          Length = 227

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 51/90 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISHLEGGLVQLAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEI 113
           M  VLG++  Q+++  G  ++ K+++V  I
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLVVSYI 217


>gi|428182068|gb|EKX50930.1| hypothetical protein GUITHDRAFT_135010 [Guillardia theta CCMP2712]
          Length = 722

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 31/77 (40%), Positives = 46/77 (59%), Gaps = 6/77 (7%)

Query: 39  SVSITIHAKPGSKSCSITDVSD------EAVGVQIDAPAKDGEANAALLEYMSSVLGVKR 92
           SV   +H KPG+K   IT+  D        + VQI AP +DGEAN  ++ +++S+  VKR
Sbjct: 350 SVQTALHVKPGAKVTRITNAEDIRTRRAGFIDVQIAAPPRDGEANEEVVAFIASLFNVKR 409

Query: 93  RQVSIGSGSKSRDKIVI 109
             V I +G +SR+K+ I
Sbjct: 410 GCVKIVAGHRSREKVSI 426


>gi|451947238|ref|YP_007467833.1| TIGR00251 family protein [Desulfocapsa sulfexigens DSM 10523]
 gi|451906586|gb|AGF78180.1| TIGR00251 family protein [Desulfocapsa sulfexigens DSM 10523]
          Length = 99

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 50/77 (64%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   S   + D A+ + I AP  DG+ANAA++++++S L VK++ + I  G 
Sbjct: 14  LRVYVQPRASRNSFAGLHDNAMRLTITAPPVDGKANAAVIQFLASFLNVKKKDLEIKHGL 73

Query: 102 KSRDKIVIVEEITPENV 118
           +SR+K V+++ ++ E +
Sbjct: 74  QSRNKSVLIKGLSAEYI 90


>gi|94266431|ref|ZP_01290126.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1]
 gi|93452973|gb|EAT03472.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1]
          Length = 121

 Score = 56.2 bits (134), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 25/84 (29%), Positives = 47/84 (55%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + + A+PG+    +       + +++ AP  DG+AN ALL +++S  G+ R  V++ 
Sbjct: 35  ALLLRVRAQPGAARTEVAGTYGARLRIRVAAPPVDGKANRALLTFLASRCGLVRNAVTLV 94

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            G + RDK+  +E I PE +   L
Sbjct: 95  GGQRGRDKLFRLEGIGPEQLTTCL 118


>gi|56476391|ref|YP_157980.1| hypothetical protein ebA1762 [Aromatoleum aromaticum EbN1]
 gi|81358142|sp|Q5P6I2.1|Y954_AZOSE RecName: Full=UPF0235 protein AZOSEA09540
 gi|56312434|emb|CAI07079.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1]
          Length = 97

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 53/95 (55%), Gaps = 2/95 (2%)

Query: 31  CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGV 90
            +R     S+ +++H +PG+K         EA+ +++ AP  DG+ANAAL  ++++  GV
Sbjct: 3   WLREAADGSLVLSLHVQPGAKKTEFVGPHGEAMKLRLAAPPVDGKANAALTVFLAAFCGV 62

Query: 91  KRRQVSIGSGSKSRDKIVIVEEITPENV--LNSLG 123
            R  VS+ SG  SR K V +E    E +  L +LG
Sbjct: 63  GRSAVSLLSGETSRAKRVRIEGAGSEALARLRALG 97


>gi|449706804|gb|EMD46572.1| hypothetical protein EHI5A_162950 [Entamoeba histolytica KU27]
          Length = 118

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 45/82 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S+V I +  KP +K+  I  V D  + V I++P  DG+AN  ++ +M+S  G+K+  V +
Sbjct: 31  SNVIIEVEIKPNAKTSEIQGVEDGLLKVSINSPPVDGKANTEVIAFMASTFGIKKSNVKL 90

Query: 98  GSGSKSRDKIVIVEEITPENVL 119
             G  S  K +  E  T E VL
Sbjct: 91  IKGQTSHHKTLQFENWTREKVL 112


>gi|67482694|ref|XP_656664.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56473879|gb|EAL51278.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 118

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 45/82 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S+V I +  KP +K+  I  V D  + V I++P  DG+AN  ++ +M+S  G+K+  V +
Sbjct: 31  SNVIIEVEIKPNAKTSEIQGVEDGLLKVSINSPPVDGKANTEVIAFMASTFGIKKSNVKL 90

Query: 98  GSGSKSRDKIVIVEEITPENVL 119
             G  S  K +  E  T E VL
Sbjct: 91  IKGQTSHHKTLQFENWTREKVL 112


>gi|302916303|ref|XP_003051962.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI
           77-13-4]
 gi|256732901|gb|EEU46249.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI
           77-13-4]
          Length = 75

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 49/74 (66%), Gaps = 2/74 (2%)

Query: 39  SVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           S+ + +H KPG+      +  V+D+A+ + + A A++GEAN A+++ +SSVLGV +  + 
Sbjct: 2   SLHLQLHVKPGASKNREGVIAVTDDAIELCVAAQAREGEANKAVVQVLSSVLGVPKSSLQ 61

Query: 97  IGSGSKSRDKIVIV 110
           +  G KSRDK V++
Sbjct: 62  LTHGLKSRDKTVVL 75


>gi|452819902|gb|EME26952.1| hypothetical protein Gasu_54060 [Galdieria sulphuraria]
          Length = 129

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 63/118 (53%), Gaps = 3/118 (2%)

Query: 12  KSAG--STQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDA 69
           KS G  + Q  +  ++  +PS IR V  + + + +  KPGSK   +   ++E V + + A
Sbjct: 9   KSIGPKNFQRPLMAHNNRIPSYIR-VKENHILLQVLVKPGSKRPGLMQTTEEEVIIHVGA 67

Query: 70  PAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKASS 127
             K GEAN  L+E ++ +L V +  +SI SG K + K V ++ +     ++ + K++ 
Sbjct: 68  QPKQGEANQELVERLAKLLHVPKSDISIESGGKGKKKRVCIKGVVNWQTIDDIFKSTQ 125


>gi|78357213|ref|YP_388662.1| hypothetical protein [Desulfovibrio alaskensis G20]
 gi|78219618|gb|ABB38967.1| protein of unknown function DUF167 [Desulfovibrio alaskensis G20]
          Length = 118

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 46/73 (63%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + + A+PG+K   I  + D  V +++ APA D +AN  L+ +++ + GVK+ +V + SG 
Sbjct: 17  LKVWAQPGAKHSGIAGLYDGRVRIRLSAPAVDNKANKELIRFVAQLCGVKQNRVRLESGV 76

Query: 102 KSRDKIVIVEEIT 114
            SR K++++E  T
Sbjct: 77  SSRKKVLLIERDT 89


>gi|407041472|gb|EKE40757.1| ACR, YggU family COG1872 protein, putative [Entamoeba nuttalli P19]
          Length = 118

 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 44/82 (53%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S V I +  KP +K+  I  V D  + V I++P  DG+AN  ++ +M+S  G+K+  V +
Sbjct: 31  SDVIIEVEIKPNAKTSEIQGVEDGLLKVSINSPPVDGKANTEVIAFMASTFGIKKSNVRL 90

Query: 98  GSGSKSRDKIVIVEEITPENVL 119
             G  S  K +  E  T E VL
Sbjct: 91  IKGQTSHHKTLQFENWTREKVL 112


>gi|358385705|gb|EHK23301.1| hypothetical protein TRIVIDRAFT_139499, partial [Trichoderma virens
           Gv29-8]
          Length = 91

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 47/77 (61%), Gaps = 2/77 (2%)

Query: 36  PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P   + + +H KPG+      I  V+D+ + + + A AKDGEAN A++E +S  L + + 
Sbjct: 15  PQGILQLRLHVKPGASKTREGIQMVTDDVIELCVAAQAKDGEANQAVIEVLSEALDIPKS 74

Query: 94  QVSIGSGSKSRDKIVIV 110
           ++ +  G++SRDK V+V
Sbjct: 75  KLVLAQGARSRDKTVVV 91


>gi|317153487|ref|YP_004121535.1| hypothetical protein Daes_1777 [Desulfovibrio aespoeensis Aspo-2]
 gi|316943738|gb|ADU62789.1| protein of unknown function DUF167 [Desulfovibrio aespoeensis
           Aspo-2]
          Length = 102

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 45/72 (62%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +  +PG++   +  V  + V +++ APA D +AN AL+ +++SVL VK+ QV I SG 
Sbjct: 16  IAVWVQPGARKSEVAGVYQQCVKIRLCAPAVDNKANKALVAFVASVLNVKKSQVVIESGQ 75

Query: 102 KSRDKIVIVEEI 113
            +R K++ +  +
Sbjct: 76  TTRKKLLALNTV 87


>gi|218885259|ref|YP_002434580.1| hypothetical protein DvMF_0151 [Desulfovibrio vulgaris str.
           'Miyazaki F']
 gi|218756213|gb|ACL07112.1| protein of unknown function DUF167 [Desulfovibrio vulgaris str.
           'Miyazaki F']
          Length = 106

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 54/86 (62%), Gaps = 2/86 (2%)

Query: 41  SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           ++ + A PG++  +    +D  + V++ APA D +AN AL E+++S LG++R +V + SG
Sbjct: 21  AVLVRAVPGARKSACEGTADGRLKVRLAAPAVDNKANKALEEFVASALGMRRNRVRLVSG 80

Query: 101 SKSRDKIVIVE-EITP-ENVLNSLGK 124
             SR K +IVE ++ P   +L + GK
Sbjct: 81  HTSRLKKLIVESDVEPSWGLLGTCGK 106


>gi|71909498|ref|YP_287085.1| hypothetical protein Daro_3887 [Dechloromonas aromatica RCB]
 gi|123626353|sp|Q478W6.1|Y3887_DECAR RecName: Full=UPF0235 protein Daro_3887
 gi|71849119|gb|AAZ48615.1| Conserved hypothetical protein 251 [Dechloromonas aromatica RCB]
          Length = 97

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 46/81 (56%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +    R      +++T+H +PG+K      +  +A+ +++ AP  DG+AN AL+ +++  
Sbjct: 1   MSDWFRQAANGCITLTLHIQPGAKKSEFAGLHGDALKIRLAAPPVDGKANEALIRFIADA 60

Query: 88  LGVKRRQVSIGSGSKSRDKIV 108
           LG+ +  V + SG  SR K++
Sbjct: 61  LGLAKSAVHLKSGQTSRRKVL 81


>gi|429857874|gb|ELA32714.1| duf167 domain protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 116

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 51/89 (57%), Gaps = 8/89 (8%)

Query: 30  SCIRLV------PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALL 81
           + IR V      P  ++ +  H KPG+      +T V+D  V + + A A++GEAN A++
Sbjct: 4   TAIRFVAGTKKSPLGTLYLQCHVKPGASRVREGVTAVTDATVELCVSAQAREGEANKAVV 63

Query: 82  EYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           + +S VLG+ +  ++I  G KSRDK V V
Sbjct: 64  KLLSEVLGLPKSDLTITQGLKSRDKTVAV 92


>gi|54298708|ref|YP_125077.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris]
 gi|53752493|emb|CAH13925.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris]
          Length = 95

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 41/71 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S V I I+AKP +K   +  +SD+ + + + A  ++GEAN  LL ++S    + + Q+ +
Sbjct: 8   SHVEIAIYAKPNAKKTKLMAISDDRLHIALHAKPQEGEANNELLFFISQFFKIPKTQIEL 67

Query: 98  GSGSKSRDKIV 108
             G  SR K++
Sbjct: 68  IKGKSSRHKLI 78


>gi|269837719|ref|YP_003319947.1| hypothetical protein Sthe_1691 [Sphaerobacter thermophilus DSM
           20745]
 gi|269786982|gb|ACZ39125.1| protein of unknown function DUF167 [Sphaerobacter thermophilus DSM
           20745]
          Length = 102

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 48/78 (61%)

Query: 41  SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
            +T+   P +    +  V+D A+ V++ AP  +G AN AL E+++++L + +R V + +G
Sbjct: 15  QVTVRVTPRASRTQVDGVADGALRVRLAAPPVEGAANRALTEFLANLLRLPKRDVELVAG 74

Query: 101 SKSRDKIVIVEEITPENV 118
           ++ R K V++  +TP +V
Sbjct: 75  ARGRQKTVLLRGLTPADV 92


>gi|54295557|ref|YP_127972.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens]
 gi|53755389|emb|CAH16885.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens]
          Length = 95

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 42/71 (59%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S V I I+AKP +K   +  +SD+++ + + A  ++GEAN  LL ++S    + + Q+ +
Sbjct: 8   SHVEIAIYAKPNAKKSKLMAISDDSLHIALHAKPQEGEANNELLFFISQFFKIPKTQIEL 67

Query: 98  GSGSKSRDKIV 108
             G  SR K++
Sbjct: 68  IKGKSSRHKLI 78


>gi|388455770|ref|ZP_10138065.1| hypothetical protein FdumT_04323 [Fluoribacter dumoffii Tex-KL]
          Length = 91

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 50/83 (60%), Gaps = 2/83 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
            V++ ++ +PG+K   I  + +  + ++++ P  +G AN ALL+Y++ +  V  RQV + 
Sbjct: 9   DVTLHLYVQPGAKKTEIIGIHEGELKIRLNTPPIEGRANKALLQYIAQLFKVPVRQVILK 68

Query: 99  SGSKSRDKIVIVEE--ITPENVL 119
            G KSR K ++V +  + P+++ 
Sbjct: 69  RGDKSRHKTLLVIDTGVDPDDLF 91


>gi|380496420|emb|CCF31758.1| hypothetical protein CH063_04329 [Colletotrichum higginsianum]
          Length = 117

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 53/87 (60%), Gaps = 3/87 (3%)

Query: 36  PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P  ++ +  H KPG+      +T V++ AV + + A A++GEAN A+++ +S VLG+ + 
Sbjct: 17  PLGTLYLQCHVKPGASRVREGVTAVTEGAVEMCVAAQAREGEANKAVIKLLSEVLGLPKS 76

Query: 94  QVSIGSGSKSRDKIVIVEEI-TPENVL 119
            ++I  G KSRDK V V  +  P +V+
Sbjct: 77  DLTITQGLKSRDKTVAVSIVQNPTDVM 103


>gi|347538136|ref|YP_004845560.1| hypothetical protein NH8B_0297 [Pseudogulbenkiania sp. NH8B]
 gi|345641313|dbj|BAK75146.1| protein of unknown function [Pseudogulbenkiania sp. NH8B]
          Length = 97

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 50/87 (57%), Gaps = 1/87 (1%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S + +T+H +PG+K   +      A+ +++ AP  +G+ANA LL +++    V +R V +
Sbjct: 10  SVIRLTLHVQPGAKKTDLAGEHGGALKLRLAAPPVEGKANAMLLAWLAERFEVPKRDVVL 69

Query: 98  GSGSKSRDKIVIVEEITPE-NVLNSLG 123
            SG KSR KIV ++    E  VL  LG
Sbjct: 70  LSGDKSRHKIVEIKLALDEAAVLARLG 96


>gi|52842921|ref|YP_096720.1| hypothetical protein lpg2716 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|378778610|ref|YP_005187049.1| hypothetical protein lp12_2709 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
 gi|52630032|gb|AAU28773.1| hypothetical protein lpg2716 [Legionella pneumophila subsp.
           pneumophila str. Philadelphia 1]
 gi|364509425|gb|AEW52949.1| hypothetical protein lp12_2709 [Legionella pneumophila subsp.
           pneumophila ATCC 43290]
          Length = 95

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 41/71 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S V I I+AKP +K   +  +SD+ + + + A  ++GEAN  LL ++S    + + Q+ +
Sbjct: 8   SHVEIAIYAKPNAKKSKLMAISDDRLHIALHAKPQEGEANNELLFFISQFFKIPKTQIEL 67

Query: 98  GSGSKSRDKIV 108
             G  SR K++
Sbjct: 68  IKGKSSRHKLI 78


>gi|224826207|ref|ZP_03699310.1| protein of unknown function DUF167 [Pseudogulbenkiania ferrooxidans
           2002]
 gi|224601844|gb|EEG08024.1| protein of unknown function DUF167 [Pseudogulbenkiania ferrooxidans
           2002]
          Length = 108

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 43/71 (60%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S + +T+H +PG+K   +      A+ +++ AP  +G+ANA LL +++    V +R V +
Sbjct: 21  SVIRLTLHVQPGAKKTDLAGEHGGALKLRLAAPPVEGKANAMLLAWLAERFEVPKRDVVL 80

Query: 98  GSGSKSRDKIV 108
            SG KSR KIV
Sbjct: 81  LSGDKSRHKIV 91


>gi|373851494|ref|ZP_09594294.1| UPF0235 protein yggU [Opitutaceae bacterium TAV5]
 gi|391230656|ref|ZP_10266862.1| hypothetical protein OpiT1DRAFT_03204 [Opitutaceae bacterium TAV1]
 gi|372473723|gb|EHP33733.1| UPF0235 protein yggU [Opitutaceae bacterium TAV5]
 gi|391220317|gb|EIP98737.1| hypothetical protein OpiT1DRAFT_03204 [Opitutaceae bacterium TAV1]
          Length = 101

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/88 (34%), Positives = 47/88 (53%), Gaps = 9/88 (10%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
           +LPSC         S+ I A P +    +     +A+ V++ APA +G AN  L E+++ 
Sbjct: 7   SLPSC---------SLAIKAIPNASRTLVAGWLGDALKVKVRAPALEGRANGELCEFLAG 57

Query: 87  VLGVKRRQVSIGSGSKSRDKIVIVEEIT 114
            LG+  R V++ SG KSR K V +  +T
Sbjct: 58  TLGLPHRAVTVASGEKSRQKRVQITGLT 85


>gi|347730736|ref|ZP_08863847.1| hypothetical protein DA2_0105 [Desulfovibrio sp. A2]
 gi|347520448|gb|EGY27582.1| hypothetical protein DA2_0105 [Desulfovibrio sp. A2]
          Length = 106

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 54/86 (62%), Gaps = 2/86 (2%)

Query: 41  SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           ++ + A PG++  +    +D  + V++ APA D +AN AL ++++S LG++R +V I SG
Sbjct: 21  AVLVRAVPGARKSACEGTADGRLKVRLAAPAVDNKANKALEDFVASALGMRRNRVRIVSG 80

Query: 101 SKSRDKIVIVE-EITP-ENVLNSLGK 124
             SR K +IVE ++ P   +L + GK
Sbjct: 81  HTSRLKKLIVESDVEPSWGLLGTCGK 106


>gi|117926928|ref|YP_867545.1| hypothetical protein Mmc1_3654 [Magnetococcus marinus MC-1]
 gi|166990883|sp|A0LDU6.1|Y3654_MAGSM RecName: Full=UPF0235 protein Mmc1_3654
 gi|117610684|gb|ABK46139.1| protein of unknown function DUF167 [Magnetococcus marinus MC-1]
          Length = 98

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 46/77 (59%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ +TI  +P +    +     E + V ++AP  DG AN AL  +++  LG+ + QV++ 
Sbjct: 10  TLHLTIRVQPKAAQERVMGWQGEQLKVALNAPPVDGAANKALCHFLAKQLGIAKGQVTLV 69

Query: 99  SGSKSRDKIVIVEEITP 115
            G KSR+K ++++ I+P
Sbjct: 70  RGEKSREKQLVIQGISP 86


>gi|397668393|ref|YP_006509930.1| hypothetical protein LPV_3066 [Legionella pneumophila subsp.
           pneumophila]
 gi|395131804|emb|CCD10097.1| conserved protein of unknown function [Legionella pneumophila
           subsp. pneumophila]
          Length = 95

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 44/78 (56%), Gaps = 1/78 (1%)

Query: 31  CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGV 90
             +++ P  V I I+AKP +K   +  +SD+ + + + A  ++GEAN  LL ++S    +
Sbjct: 2   WFKIIHPH-VEIAIYAKPNAKKTKLMAISDDRLHIALHAKPQEGEANNELLFFISQFFKI 60

Query: 91  KRRQVSIGSGSKSRDKIV 108
            + Q+ +  G  SR K++
Sbjct: 61  PKTQIELIKGKSSRHKLI 78


>gi|330795787|ref|XP_003285952.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum]
 gi|325084041|gb|EGC37478.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum]
          Length = 110

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 43/72 (59%), Gaps = 2/72 (2%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           +P   ++I +H  P SK   I    +E + ++I  P  DG+AN  ++E++S  LG+++  
Sbjct: 21  LPILKLNINVH--PNSKENQIISFENEILSLRISEPPIDGQANKGVVEFLSKELGLRKSN 78

Query: 95  VSIGSGSKSRDK 106
           + +  GSKSR+K
Sbjct: 79  IQVSKGSKSRNK 90


>gi|83648992|ref|YP_437427.1| hypothetical protein HCH_06357 [Hahella chejuensis KCTC 2396]
 gi|83637035|gb|ABC33002.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
          Length = 102

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 47/84 (55%)

Query: 31  CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGV 90
           C+ L    ++ +  H +PG+K   I     +A+ ++I AP  DG AN  L+ +++ +  V
Sbjct: 9   CVSLQDEQTLILQCHLQPGAKKDEIVGTHGDALKIKISAPPIDGRANQQLVRFLAKLCRV 68

Query: 91  KRRQVSIGSGSKSRDKIVIVEEIT 114
           K++ V I +G  SR K + V+ +T
Sbjct: 69  KQQDVQILAGESSRQKRIRVQNLT 92


>gi|408399550|gb|EKJ78649.1| hypothetical protein FPSE_01137 [Fusarium pseudograminearum CS3096]
          Length = 117

 Score = 53.1 bits (126), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 52/89 (58%), Gaps = 8/89 (8%)

Query: 30  SCIRLV------PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALL 81
           S IR V      P  S+ + +  KPG+      +  V+++A+ + + A A++GEAN A++
Sbjct: 5   SAIRFVTGSKKSPLGSLHLQLRVKPGASKNREGVIAVTEDAIELCVSAQAREGEANKAVV 64

Query: 82  EYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           + +S +LGV R  + +  G KSRDK V++
Sbjct: 65  QVLSGILGVPRSSLQLTHGMKSRDKTVVL 93


>gi|404492205|ref|YP_006716311.1| hypothetical protein Pcar_0617 [Pelobacter carbinolicus DSM 2380]
 gi|123574815|sp|Q3A6Y1.1|Y617_PELCD RecName: Full=UPF0235 protein Pcar_0617
 gi|77544314|gb|ABA87876.1| protein of unknown function DUF167 [Pelobacter carbinolicus DSM
           2380]
          Length = 95

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 51/83 (61%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V +++H +P +    +  +  E++ +++ +P  +G AN    E+++ +LGV + +V++ S
Sbjct: 12  VVLSVHVQPRASRNELAGLQGESLKIRLTSPPVEGAANKLCREFLAKLLGVAKSRVTLVS 71

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G KSR K +++E +T + V N L
Sbjct: 72  GDKSRHKRLLIEGVTLDEVRNKL 94


>gi|342883630|gb|EGU84084.1| hypothetical protein FOXB_05382 [Fusarium oxysporum Fo5176]
          Length = 117

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 52/89 (58%), Gaps = 8/89 (8%)

Query: 30  SCIRLV------PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALL 81
           S IR V      P  SV + +  KPG+      +  V+D+A+ + + A A++GEAN A++
Sbjct: 5   SAIRFVTGTKKSPLGSVHLQLRVKPGASKNREGVIAVADDAIELCVSAQAREGEANKAVV 64

Query: 82  EYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           + +S +LGV +  + +  G KSRDK +++
Sbjct: 65  QVLSGILGVPKSSLQLTHGIKSRDKTIVL 93


>gi|303284124|ref|XP_003061353.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457704|gb|EEH55003.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 224

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 56/95 (58%), Gaps = 2/95 (2%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +P CI+   P +V I +  +   K  +IT ++ + VGV ++  AK   AN  ++E++   
Sbjct: 132 VPPCIQPKTPGTVQIALEIEDRKKWRAITKITADEVGVAVN--AKCAVANDEIVEFLGKT 189

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           L ++  Q+S+ +G  +R K+++V+ +TP+ V + L
Sbjct: 190 LHLRLPQMSLLAGWSARSKLLVVQGLTPQQVYDRL 224


>gi|390956721|ref|YP_006420478.1| hypothetical protein Terro_0811 [Terriglobus roseus DSM 18391]
 gi|390411639|gb|AFL87143.1| TIGR00251 family protein [Terriglobus roseus DSM 18391]
          Length = 100

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 48/88 (54%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           VS  +  +PG+    +     +A+ + + APA DG+AN AL+  ++ +LGV R  V I S
Sbjct: 12  VSFAVRVQPGASREGVVGEYGDALKIALTAPAVDGKANDALVRCLAGLLGVPRLSVEIAS 71

Query: 100 GSKSRDKIVIVEEITPENVLNSLGKASS 127
           G  SR KIV V  +T + V   L +  S
Sbjct: 72  GLLSRSKIVRVVGVTADEVTAKLMRIES 99


>gi|257056705|ref|YP_003134537.1| hypothetical protein Svir_27290 [Saccharomonospora viridis DSM
           43017]
 gi|256586577|gb|ACU97710.1| uncharacterized conserved protein [Saccharomonospora viridis DSM
           43017]
          Length = 118

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 48/84 (57%), Gaps = 4/84 (4%)

Query: 36  PPSSVSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVK 91
           P S +   I  KPG+K  ++  + D A+G    V + APA DG+AN A+   ++  L V+
Sbjct: 15  PMSELRFAIRVKPGAKRDAVGGIWDGALGEALVVSVRAPAVDGKANEAVCRVLAEALSVR 74

Query: 92  RRQVSIGSGSKSRDKIVIVEEITP 115
            R +++  G ++RDK+V + +  P
Sbjct: 75  ARDLTVVKGHRARDKLVELRDPPP 98


>gi|256379751|ref|YP_003103411.1| hypothetical protein Amir_5752 [Actinosynnema mirum DSM 43827]
 gi|255924054|gb|ACU39565.1| protein of unknown function DUF167 [Actinosynnema mirum DSM 43827]
          Length = 90

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 45/73 (61%), Gaps = 1/73 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +   +  KPGSK  ++    DE A+ V + APA +G+AN A+   ++   GV+R+ V I 
Sbjct: 2   LKFAVRVKPGSKRDAVGGRWDERALVVSVAAPAVEGKANEAVRRALAKAFGVRRQDVEIV 61

Query: 99  SGSKSRDKIVIVE 111
           SG + RDK+V+++
Sbjct: 62  SGERGRDKVVVID 74


>gi|94971231|ref|YP_593279.1| hypothetical protein Acid345_4205 [Candidatus Koribacter versatilis
           Ellin345]
 gi|166227235|sp|Q1IIU5.1|Y4205_ACIBL RecName: Full=UPF0235 protein Acid345_4205
 gi|94553281|gb|ABF43205.1| protein of unknown function DUF167 [Candidatus Koribacter
           versatilis Ellin345]
          Length = 96

 Score = 52.8 bits (125), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 48/85 (56%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S VS  +  +P +K  +I    + A+ + +  P  DG AN AL+ +++ +L V R  V+I
Sbjct: 9   SGVSFAVRLQPKAKKTAIIGELNGALKLGVTDPPIDGRANEALIRFVAGLLKVTRSSVTI 68

Query: 98  GSGSKSRDKIVIVEEITPENVLNSL 122
            +G  SR+K++ +E +T E V   L
Sbjct: 69  AAGESSRNKVIRIEGVTAEQVRFRL 93


>gi|357037199|ref|ZP_09098999.1| UPF0235 protein yggU [Desulfotomaculum gibsoniae DSM 7213]
 gi|355361364|gb|EHG09119.1| UPF0235 protein yggU [Desulfotomaculum gibsoniae DSM 7213]
          Length = 95

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 47/85 (55%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
             VSI I  +P +    ++ V + A+ V++ AP  DG AN A   +++ +LGV + +V+I
Sbjct: 9   GGVSIKIRVQPRAAKNQVSGVMEGALKVRLTAPPVDGAANKACCAFVAELLGVAKGRVAI 68

Query: 98  GSGSKSRDKIVIVEEITPENVLNSL 122
             G   R+K V VE +T   VL  L
Sbjct: 69  SQGHTGRNKTVRVEGLTAGQVLEKL 93


>gi|440475136|gb|ELQ43837.1| AhpC/TSA family protein [Magnaporthe oryzae Y34]
 gi|440487065|gb|ELQ66871.1| AhpC/TSA family protein [Magnaporthe oryzae P131]
          Length = 314

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 28/77 (36%), Positives = 48/77 (62%), Gaps = 2/77 (2%)

Query: 36  PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P +++ +  H KPG+      +  V+D+AV + + A A++GEAN A++  +S  LG+ + 
Sbjct: 17  PLAALLLKCHVKPGASKAREGVAAVTDDAVELCVSAQAREGEANKAVVIVLSKALGLAKS 76

Query: 94  QVSIGSGSKSRDKIVIV 110
            V+I  G KSRDK V++
Sbjct: 77  DVTITHGLKSRDKTVLI 93


>gi|303245495|ref|ZP_07331779.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans
           JJ]
 gi|302493344|gb|EFL53206.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans
           JJ]
          Length = 113

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 46/81 (56%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           LP  +        ++ +   PG    ++  ++++ + V++ A A +G+ANAAL  +++  
Sbjct: 14  LPVFVAAAGEGGWTLRVAVAPGGSRDALAGLAEDRLRVRLRAKAVEGQANAALTAFLAEC 73

Query: 88  LGVKRRQVSIGSGSKSRDKIV 108
            GV+ RQV I SG KSR KIV
Sbjct: 74  FGVRPRQVRIVSGEKSRKKIV 94


>gi|326517290|dbj|BAK00012.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326529833|dbj|BAK08196.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 232

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 59/102 (57%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
           +D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 HDAPVPPCITQLQGGLVQVAIEVEDRAQRSAITRVNADDVRVAVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K++IVE+++   V   L +A
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEA 229


>gi|239906817|ref|YP_002953558.1| hypothetical protein DMR_21810 [Desulfovibrio magneticus RS-1]
 gi|239796683|dbj|BAH75672.1| hypothetical protein [Desulfovibrio magneticus RS-1]
          Length = 122

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 52/89 (58%)

Query: 23  TNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLE 82
           + D + P  +      + ++ +   PG    ++  ++++ + V++ A A +G+ANAAL +
Sbjct: 18  SGDADRPVFVAPAGDGAWTLRVAVTPGGAKDALAGLAEDRLRVRLRAKAVEGQANAALTD 77

Query: 83  YMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           +++  LGVK RQV I SG KSR K + +E
Sbjct: 78  FVARCLGVKPRQVRIISGEKSRKKTLRIE 106


>gi|225164758|ref|ZP_03726990.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
 gi|224800632|gb|EEG18996.1| conserved hypothetical protein [Diplosphaera colitermitum TAV2]
          Length = 111

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 42/70 (60%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P++  ++I A P +   +I     +A+ V++ APA +G AN  L ++++  LG+ RR V+
Sbjct: 17  PATCILSIKAIPNASRNAIAGWLGDALKVKVSAPALEGRANEQLCDFLAETLGIPRRAVT 76

Query: 97  IGSGSKSRDK 106
           +  G KSR K
Sbjct: 77  VAGGEKSRQK 86


>gi|374301322|ref|YP_005052961.1| hypothetical protein [Desulfovibrio africanus str. Walvis Bay]
 gi|332554258|gb|EGJ51302.1| UPF0235 protein yggU [Desulfovibrio africanus str. Walvis Bay]
          Length = 109

 Score = 52.0 bits (123), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 47/87 (54%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           LP C+  + P    + I  +PG+       +  +   +++ AP  D +AN AL+ Y++ +
Sbjct: 13  LPGCVISIEPGVWRLNIWVQPGANRNEPVGLYQDCCKIKLSAPPVDNKANKALVVYIAGL 72

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEIT 114
           LG+++ QV + +G  SR K +++   T
Sbjct: 73  LGLRKNQVLLENGLTSRRKSLLIHSAT 99


>gi|84996333|ref|XP_952888.1| proton translocating inorganic pyrophosphatase [Theileria annulata
            strain Ankara]
 gi|65303885|emb|CAI76264.1| proton translocating inorganic pyrophosphatase, putative [Theileria
            annulata]
          Length = 1204

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 18/101 (17%)

Query: 40   VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS------------- 86
            + + ++ KPGS+   I   S+  + VQI AP ++GE N AL+E++S              
Sbjct: 1096 ILLKVNVKPGSRQTQIIGESEGRLSVQIAAPPREGECNKALIEFISKTRNFYYFLAFNTF 1155

Query: 87   -----VLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
                 ++GVK+  V++  G KSRDKI+ +  I  +   N L
Sbjct: 1156 NVLIFLVGVKKGNVTLLHGHKSRDKILSITGIDIQTASNLL 1196


>gi|115380361|ref|ZP_01467361.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
 gi|115362627|gb|EAU61862.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
          Length = 83

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +    +    D  + +Q+ AP  DGEANAAL+E+++  LG+ RRQV++ +G  +R K
Sbjct: 4   QPRASRTRVVGEHDGMLKLQLAAPPVDGEANAALVEFLAKRLGLPRRQVTLVAGDAARRK 63

Query: 107 IVIVEEITPENVLNSLGKAS 126
            V +  +    V   + +AS
Sbjct: 64  RVFLAGVDAARVEAVMSQAS 83


>gi|442771071|gb|AGC71769.1| hypothetical protein [uncultured bacterium A1Q1_fos_504]
          Length = 98

 Score = 52.0 bits (123), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 56/97 (57%), Gaps = 1/97 (1%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +P+ +R      V + I+ +PG+K   +    D A+ ++I+AP  DG+ANAA++ +++  
Sbjct: 3   IPTWLRECA-DGVEVFIYVQPGAKKTELAGDHDGALKLRINAPPVDGKANAAVIAFLADR 61

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGK 124
            GV + +V + SG  +R K + V  IT  +V+  L +
Sbjct: 62  CGVSKGKVLLLSGDTARRKRLQVIGITAASVVKQLAR 98


>gi|182413313|ref|YP_001818379.1| hypothetical protein Oter_1495 [Opitutus terrae PB90-1]
 gi|177840527|gb|ACB74779.1| protein of unknown function DUF167 [Opitutus terrae PB90-1]
          Length = 90

 Score = 51.6 bits (122), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           SS +I I A P +    I     +A+ V++ AP  +G AN  L E+++  LG+ RR VS+
Sbjct: 2   SSCTIAIKAIPNAPRNQIVGWLGDALKVKVHAPPLEGRANEELCEFLADELGLPRRAVSV 61

Query: 98  GSGSKSRDKIVIVE 111
             G  SR K+V +E
Sbjct: 62  LRGDTSRQKLVQIE 75


>gi|298528115|ref|ZP_07015519.1| protein of unknown function DUF167 [Desulfonatronospira
           thiodismutans ASO3-1]
 gi|298511767|gb|EFI35669.1| protein of unknown function DUF167 [Desulfonatronospira
           thiodismutans ASO3-1]
          Length = 119

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 57/105 (54%), Gaps = 2/105 (1%)

Query: 18  QSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEAN 77
           Q  I+ +D  +    RL P + + + +  KPG+    +  +    + + + AP  DG+AN
Sbjct: 13  QGLIRYSDTCMQFLERLDPETWI-LRVVLKPGADRDEVLGIHAGRLKISVKAPPVDGKAN 71

Query: 78  AALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
            AL  ++S  LG++++QV I  G +SR+K +IV  +    V N+L
Sbjct: 72  KALCIFLSRSLGIRKKQVWIQRGLQSRNKDLIVSGVA-GTVFNAL 115


>gi|440299717|gb|ELP92265.1| hypothetical protein EIN_118690 [Entamoeba invadens IP1]
          Length = 129

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 45/75 (60%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I I+ KP S++  I  + D  + + IDAP  DG+AN+ +++++++   VK+  V++  
Sbjct: 46  VTIEINVKPNSRNSEIQGIEDGLLKIAIDAPPVDGKANSEVVDFIATSFSVKKSSVAVVK 105

Query: 100 GSKSRDKIVIVEEIT 114
           G  S  K V +E  T
Sbjct: 106 GQTSHHKTVRIENCT 120


>gi|302342363|ref|YP_003806892.1| hypothetical protein Deba_0928 [Desulfarculus baarsii DSM 2075]
 gi|301638976|gb|ADK84298.1| protein of unknown function DUF167 [Desulfarculus baarsii DSM 2075]
          Length = 95

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 10/103 (9%)

Query: 21  IKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAAL 80
           ++  DEN            +S  +   P +    +      A+ V++ AP  DG+AN AL
Sbjct: 2   LQIKDEN----------GGLSFAVRVSPRASRDQLAGEEGGALKVRLCAPPVDGQANEAL 51

Query: 81  LEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLG 123
           L  ++  L + RR VS+ SG +SR K ++V+ +  E +L  LG
Sbjct: 52  LRLVAKALSLPRRDVSLASGPRSRQKRLLVKGLGREQLLARLG 94


>gi|333978628|ref|YP_004516573.1| hypothetical protein Desku_1188 [Desulfotomaculum kuznetsovii DSM
           6115]
 gi|333822109|gb|AEG14772.1| UPF0235 protein yggU [Desulfotomaculum kuznetsovii DSM 6115]
          Length = 97

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 47/80 (58%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V   +  +P +    +  V ++A+ V++ AP  +GEAN A  ++ + +LGV R +V I 
Sbjct: 10  AVVFKVRVQPRAARNELAGVFEDALKVRLTAPPVEGEANEACRDFFARLLGVPRVRVEII 69

Query: 99  SGSKSRDKIVIVEEITPENV 118
           +G   R+K+V V+ +T E V
Sbjct: 70  AGHTGRNKLVRVQGVTVEQV 89


>gi|339484001|ref|YP_004695787.1| hypothetical protein Nit79A3_2621 [Nitrosomonas sp. Is79A3]
 gi|338806146|gb|AEJ02388.1| UPF0235 protein yggU [Nitrosomonas sp. Is79A3]
          Length = 98

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 50/83 (60%)

Query: 30  SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLG 89
           S  R    +++ +T+H + G+K+     +  +A+ +++ A   +G+ANAALL++++    
Sbjct: 2   SWYRYDDANNLVLTLHIQTGAKNTEAAGLHGDALRIKLAAAPVEGKANAALLKFLAKHFD 61

Query: 90  VKRRQVSIGSGSKSRDKIVIVEE 112
           V   QV +  G KSR K++I+++
Sbjct: 62  VPLSQVILRQGDKSRHKVIIIQQ 84


>gi|428180827|gb|EKX49693.1| hypothetical protein GUITHDRAFT_104656 [Guillardia theta CCMP2712]
          Length = 139

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 49/90 (54%), Gaps = 20/90 (22%)

Query: 42  ITIHAKPGSKSCSITDVSD------EAVGVQIDAPAKDGE--------------ANAALL 81
           +++  KPG+K   + +  +       ++ VQI AP +DGE              AN A++
Sbjct: 27  VSLFVKPGAKKTQVMNSEELKEGGARSIDVQIAAPPRDGEVGLVFSSGFSPAAQANEAVV 86

Query: 82  EYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           E+++ + G+KR+ V I +G KSRDK+  VE
Sbjct: 87  EFVAELCGLKRKDVQIIAGHKSRDKVCRVE 116


>gi|449469637|ref|XP_004152525.1| PREDICTED: uncharacterized protein LOC101215962 [Cucumis sativus]
 gi|449519278|ref|XP_004166662.1| PREDICTED: uncharacterized LOC101215962 [Cucumis sativus]
          Length = 232

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/102 (32%), Positives = 58/102 (56%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
            D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+
Sbjct: 128 QDAPVPPCISQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVAVAAPAARGEANNELLEF 187

Query: 84  MSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           M  VLG++  Q+++  G  ++ K+++VE+++   V   L +A
Sbjct: 188 MGKVLGLRLSQMTLQRGWNNKSKLLVVEDLSARQVYEKLLEA 229


>gi|11499654|ref|NP_070896.1| hypothetical protein AF2072 [Archaeoglobus fulgidus DSM 4304]
 gi|29839698|sp|O28207.1|Y2072_ARCFU RecName: Full=UPF0235 protein AF_2072
 gi|2648454|gb|AAB89177.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 78

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 41/70 (58%), Gaps = 5/70 (7%)

Query: 40  VSITIHAKPGSKSCSIT-DVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           V I++H  PGSK  S + D    AV V+I +PAK+G+AN  LL     + G    +V + 
Sbjct: 8   VLISVHVSPGSKEVSFSYDEWRRAVEVRIKSPAKEGKANRELLGIFRQIFG----EVELV 63

Query: 99  SGSKSRDKIV 108
           SG KSR K++
Sbjct: 64  SGEKSRSKVL 73


>gi|289164111|ref|YP_003454249.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150]
 gi|288857284|emb|CBJ11111.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150]
          Length = 91

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 43/70 (61%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +PG+K   I  + +  + ++++AP  +G AN  LL+Y++ +  V   QV +  G 
Sbjct: 12  LYLYVQPGAKKSEIVGMHEGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQVVLKRGD 71

Query: 102 KSRDKIVIVE 111
           KSR K+++V+
Sbjct: 72  KSRHKVLLVK 81


>gi|375101823|ref|ZP_09748086.1| hypothetical protein SaccyDRAFT_3605 [Saccharomonospora cyanea
           NA-134]
 gi|374662555|gb|EHR62433.1| hypothetical protein SaccyDRAFT_3605 [Saccharomonospora cyanea
           NA-134]
          Length = 99

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/71 (39%), Positives = 43/71 (60%), Gaps = 4/71 (5%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
             +  K G+K   +    D A+G    V + APA DG+ANAA+   +++VLGV+ R V++
Sbjct: 3   FAVRVKAGAKRDVVGGRWDGALGEALVVSVRAPAVDGKANAAVCRVLAAVLGVRARDVAV 62

Query: 98  GSGSKSRDKIV 108
             G +SRDK+V
Sbjct: 63  VKGQRSRDKLV 73


>gi|374313030|ref|YP_005059460.1| UPF0235 protein yggU [Granulicella mallensis MP5ACTX8]
 gi|358755040|gb|AEU38430.1| UPF0235 protein yggU [Granulicella mallensis MP5ACTX8]
          Length = 99

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 43/82 (52%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P   S+ +   PG+K  ++T   D ++ + +  P  DG AN AL+ ++S  L + R  + 
Sbjct: 9   PGGCSLPVRVHPGAKQNAVTGTHDGSLKISLTTPPTDGRANTALIAFLSDRLNIPRAHIE 68

Query: 97  IGSGSKSRDKIVIVEEITPENV 118
           + +G+ SR K + +  +T   V
Sbjct: 69  LLTGATSRSKTLRIAGLTSAEV 90


>gi|71668144|ref|XP_821011.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70886377|gb|EAN99160.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 156

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 40/62 (64%), Gaps = 3/62 (4%)

Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          S I  + P    +T+HAKPG++S S+     V+D A+ V++ AP  +G+ANA L+E+M  
Sbjct: 5  SFISQLRPGWFHLTVHAKPGARSSSLACHPAVTDAAIEVRVGAPPVEGKANAELVEFMQM 64

Query: 87 VL 88
          +L
Sbjct: 65 LL 66


>gi|307108344|gb|EFN56584.1| hypothetical protein CHLNCDRAFT_34979 [Chlorella variabilis]
          Length = 251

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 49/87 (56%), Gaps = 2/87 (2%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P  IR+   + V++ +  +   K   +  V+ + V VQ+ + A  G AN  LLE +  VL
Sbjct: 133 PCIIRVGKATQVALEVDDR--GKRALVLRVTADFVRVQLKSGANAGHANEELLEMLRGVL 190

Query: 89  GVKRRQVSIGSGSKSRDKIVIVEEITP 115
           GV+  Q+S+  G  SR K+++VE ++P
Sbjct: 191 GVRLGQLSLQRGESSRHKVLLVEGLSP 217


>gi|270157502|ref|ZP_06186159.1| conserved hypothetical protein [Legionella longbeachae D-4968]
 gi|269989527|gb|EEZ95781.1| conserved hypothetical protein [Legionella longbeachae D-4968]
          Length = 80

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 47/79 (59%), Gaps = 1/79 (1%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +PG+K   I  + +  + ++++AP  +G AN  LL+Y++ +  V   QV +  G 
Sbjct: 1   MYLYVQPGAKKSEIVGMHEGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQVVLKRGD 60

Query: 102 KSRDKIVIVEE-ITPENVL 119
           KSR K+++V+  +   NVL
Sbjct: 61  KSRHKVLLVKNSLVDPNVL 79


>gi|410462498|ref|ZP_11316074.1| hypothetical protein B193_0572 [Desulfovibrio magneticus str.
           Maddingley MBC34]
 gi|409984389|gb|EKO40702.1| hypothetical protein B193_0572 [Desulfovibrio magneticus str.
           Maddingley MBC34]
          Length = 122

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 46/73 (63%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           + ++ +   PG    ++  ++++ + V++ A A +G+ANAAL  +++  LG+K RQV I 
Sbjct: 34  AWTLRVAVTPGGAKDALAGLAEDRLRVRLRAKAVEGQANAALTNFVARCLGLKPRQVRIV 93

Query: 99  SGSKSRDKIVIVE 111
           SG KSR K + +E
Sbjct: 94  SGEKSRKKTLRIE 106


>gi|313673537|ref|YP_004051648.1| hypothetical protein Calni_1577 [Calditerrivibrio nitroreducens DSM
           19672]
 gi|312940293|gb|ADR19485.1| protein of unknown function DUF167 [Calditerrivibrio nitroreducens
           DSM 19672]
          Length = 86

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 46/79 (58%), Gaps = 4/79 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I I+ +PG+K  +     +  + ++I +P  DG+AN  L+ ++S  L + +++V I S
Sbjct: 1   MRIKIYVQPGAKKTAYDGEFNGCIKIKIKSPPTDGKANEELISFISQSLNLSKKEVGIIS 60

Query: 100 GSKSRDKIVIVEEITPENV 118
           G KSR KI+ V    PEN 
Sbjct: 61  GEKSRYKIIEV----PENY 75


>gi|392512740|emb|CCI73951.1| ECU07_0355 [Encephalitozoon cuniculi GB-M1]
          Length = 100

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 44/71 (61%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +T+ AKP +++ SI +V+D  V V I AP  D +AN  L+E++S +L V +  + I +
Sbjct: 11  LKLTVRAKPNARNTSIREVTDTEVVVNISAPPADNKANKELVEFISKMLKVPKSLIKIVA 70

Query: 100 GSKSRDKIVIV 110
           G+    K V++
Sbjct: 71  GTTCAIKTVVI 81


>gi|336466937|gb|EGO55101.1| hypothetical protein NEUTE1DRAFT_85194 [Neurospora tetrasperma FGSC
           2508]
 gi|350288454|gb|EGZ69690.1| YggU-like protein [Neurospora tetrasperma FGSC 2509]
          Length = 130

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 44/74 (59%), Gaps = 2/74 (2%)

Query: 39  SVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++ I  H KPG+      +T ++DEAV + + A AK+GEAN A+++ +S  L + +  + 
Sbjct: 27  TIYIHCHVKPGASKTREGVTSITDEAVEICVAAQAKEGEANKAVVKVLSEALNLPKSNLE 86

Query: 97  IGSGSKSRDKIVIV 110
           I  G KSR K + V
Sbjct: 87  ITQGLKSRAKTIAV 100


>gi|240949737|ref|ZP_04754069.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305]
 gi|240295769|gb|EER46456.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305]
          Length = 100

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 2/90 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P+ + + I  +P +    I  + DE + + I AP  DG ANA LL+++S +  V +  ++
Sbjct: 11  PNGIRLRIFLQPKASRDQIVGLHDEELKIAITAPPVDGAANAHLLKFLSKLFKVPKSSIA 70

Query: 97  IGSGSKSRDKIVIVEE--ITPENVLNSLGK 124
           +  G   R K + + E    P+ + N L K
Sbjct: 71  LEKGELQRHKQIFIPEPKQIPQEIENLLDK 100


>gi|284039659|ref|YP_003389589.1| hypothetical protein Slin_4812 [Spirosoma linguale DSM 74]
 gi|283818952|gb|ADB40790.1| protein of unknown function DUF167 [Spirosoma linguale DSM 74]
          Length = 86

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/83 (37%), Positives = 49/83 (59%), Gaps = 4/83 (4%)

Query: 42  ITIH--AKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +T+H  AKPGSK   +   +   +  +I APA+DG+ANA L+E+++  LG+ +  VSI +
Sbjct: 1   MTLHLKAKPGSKIDQLFYDAAGQLNAKIRAPAQDGKANAYLIEFLAKQLGIPKSGVSIVA 60

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G  +  K + V+   PE VL   
Sbjct: 61  GFTNPHKRIEVD--VPEEVLTDF 81


>gi|394990021|ref|ZP_10382853.1| hypothetical protein SCD_02446 [Sulfuricella denitrificans skB26]
 gi|393790286|dbj|GAB72492.1| hypothetical protein SCD_02446 [Sulfuricella denitrificans skB26]
          Length = 99

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 49/85 (57%), Gaps = 2/85 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++T+H +PG+K   +  +  +A+ +++ A A +G+AN  LL+++     V   ++S+  
Sbjct: 12  LTLTVHVQPGAKRTEVIGLHGDALKIRVAAAAVEGQANTRLLDFLRKAFKVPASRISLKH 71

Query: 100 GSKSRDKIVIV--EEITPENVLNSL 122
           G  +R K+V +    + PE +L  L
Sbjct: 72  GEHARRKVVEILGSSLAPELLLPGL 96


>gi|378821593|ref|ZP_09844473.1| TIGR00251 family protein [Sutterella parvirubra YIT 11816]
 gi|378599538|gb|EHY32546.1| TIGR00251 family protein [Sutterella parvirubra YIT 11816]
          Length = 109

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 42/69 (60%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + V++ +H +PG+K   +     E + V + AP  DG+AN AL+++++ V GV +  V +
Sbjct: 19  AGVTVALHVQPGAKRSRVVGEHGERLKVALQAPPVDGKANQALVKFLAGVAGVPKSAVEL 78

Query: 98  GSGSKSRDK 106
            +G  SR+K
Sbjct: 79  IAGETSREK 87


>gi|406981760|gb|EKE03161.1| hypothetical protein ACD_20C00238G0001 [uncultured bacterium]
          Length = 103

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 43/72 (59%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I++   P S  C I    D+ + +++D P  +G+AN   ++++S +LGV +  + I S
Sbjct: 18  IKISVKVIPNSSRCEIAGTIDDILRIKLDVPPIEGKANEKCVKFLSKLLGVPKTSIEIVS 77

Query: 100 GSKSRDKIVIVE 111
           G KS+ KI+ ++
Sbjct: 78  GEKSKSKILYIK 89


>gi|384262233|ref|YP_005417420.1| hypothetical protein RSPPHO_01824 [Rhodospirillum photometricum DSM
           122]
 gi|378403334|emb|CCG08450.1| Putative uncharacterized protein [Rhodospirillum photometricum DSM
           122]
          Length = 114

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 54/106 (50%), Gaps = 17/106 (16%)

Query: 34  LVPPSS-----------VSITIHAKPGSKSCSITDVSDEAVGVQI-----DAPAKDGEAN 77
           ++PP+S           V I +   P ++   +T ++ EA G  +      AP +DG+AN
Sbjct: 1   MIPPASGALPWSVHEDGVWIAVRVTPKARRVGVTGLAREADGTLVLKAGVSAPPEDGKAN 60

Query: 78  AALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLG 123
           AAL+E ++   GV +R V++  G   R K+V +    P+ VL  LG
Sbjct: 61  AALVELLAKSWGVPKRDVTVIQGLTDRRKVVRLAGAVPQ-VLERLG 105


>gi|257465218|ref|ZP_05629589.1| hypothetical protein AM202_01815 [Actinobacillus minor 202]
 gi|257450878|gb|EEV24921.1| hypothetical protein AM202_01815 [Actinobacillus minor 202]
          Length = 100

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 2/90 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + DE + + I AP  DG ANA LL+++S +  V +  ++
Sbjct: 11  PDGIRLRIFLQPKASRDQIVGLHDEELKIAITAPPVDGAANAHLLKFLSKLFKVPKSSIA 70

Query: 97  IGSGSKSRDKIVIVEE--ITPENVLNSLGK 124
           +  G   R K + + E    P+ + N L K
Sbjct: 71  LEKGELQRHKQIFIPEPKQIPQEIENLLDK 100


>gi|255022260|ref|ZP_05294254.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC 51756]
 gi|340783351|ref|YP_004749958.1| hypothetical protein Atc_2609 [Acidithiobacillus caldus SM-1]
 gi|254968316|gb|EET25884.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC 51756]
 gi|340557502|gb|AEK59256.1| conserved hypothetical protein [Acidithiobacillus caldus SM-1]
          Length = 115

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 45/73 (61%)

Query: 41  SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           ++T+  +PG++   +     +A+ +++ A A DG ANAALL +++  L +   QV +  G
Sbjct: 14  TLTVQVQPGARDDCVVGYHGDALKIRLRARAVDGAANAALLSFLARRLDLGPGQVVLRHG 73

Query: 101 SKSRDKIVIVEEI 113
           + SR K++++ E+
Sbjct: 74  THSRRKVLVLSEL 86


>gi|404496024|ref|YP_006720130.1| hypothetical protein Gmet_1164 [Geobacter metallireducens GS-15]
 gi|418066794|ref|ZP_12704152.1| protein of unknown function DUF167 [Geobacter metallireducens RCH3]
 gi|78193635|gb|ABB31402.1| protein of unknown function DUF167 [Geobacter metallireducens
           GS-15]
 gi|373559932|gb|EHP86211.1| protein of unknown function DUF167 [Geobacter metallireducens RCH3]
          Length = 102

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 47/83 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+ ++H +P +    I  +  +A+ +++ +P  +GEAN    EY++ +L V +  V+I +
Sbjct: 19  VTFSVHVQPRASKNGICGIQGDAIKLRLTSPPVEGEANRLCTEYLAKLLKVPKSAVTIIA 78

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G KSR K + V   T + V N L
Sbjct: 79  GDKSRHKTIRVSGATAQAVHNLL 101


>gi|39995970|ref|NP_951921.1| hypothetical protein GSU0864 [Geobacter sulfurreducens PCA]
 gi|409911415|ref|YP_006889880.1| hypothetical protein KN400_0845 [Geobacter sulfurreducens KN400]
 gi|39982735|gb|AAR34194.1| protein of unknown function DUF167 [Geobacter sulfurreducens PCA]
 gi|307634758|gb|ADI83708.2| protein of unknown function DUF167 [Geobacter sulfurreducens KN400]
          Length = 107

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 43/71 (60%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+ ++H +P +    I  V  EA+ +++ +P  +GEAN   +E+++  LGV +  V+I +
Sbjct: 20  VTFSVHVQPRASRNEICGVQGEAIKLRLTSPPVEGEANRLCVEFLAKRLGVPKSCVAIIA 79

Query: 100 GSKSRDKIVIV 110
           G KSR K + V
Sbjct: 80  GEKSRHKTIRV 90


>gi|337287121|ref|YP_004626594.1| hypothetical protein Thein_1774 [Thermodesulfatator indicus DSM
           15286]
 gi|335359949|gb|AEH45630.1| protein of unknown function DUF167 [Thermodesulfatator indicus DSM
           15286]
          Length = 99

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/82 (26%), Positives = 48/82 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           ++I+ +P +K   I     EA+ +++ +P  DG+AN AL+ +++  LG+ ++ + + +G 
Sbjct: 11  LSIYLQPKAKKEEIVGFYQEALKIKVKSPPVDGKANEALISFLAKKLGLSKKNLKLLAGF 70

Query: 102 KSRDKIVIVEEITPENVLNSLG 123
            S+ K V +E +  + +   LG
Sbjct: 71  TSKHKKVYIEGLEEKEIKKRLG 92


>gi|85092099|ref|XP_959226.1| hypothetical protein NCU06879 [Neurospora crassa OR74A]
 gi|28920629|gb|EAA29990.1| predicted protein [Neurospora crassa OR74A]
          Length = 130

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 44/74 (59%), Gaps = 2/74 (2%)

Query: 39  SVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++ I  H KPG+      +T ++DEAV + + A AK+GEAN A+++ +S  L + +  + 
Sbjct: 27  TIYIHCHVKPGASKNREGVTSITDEAVEICVAAQAKEGEANKAVVKVLSEALNLPKSNLE 86

Query: 97  IGSGSKSRDKIVIV 110
           I  G KSR K + V
Sbjct: 87  ITQGLKSRAKTIAV 100


>gi|332295644|ref|YP_004437567.1| hypothetical protein Thena_0803 [Thermodesulfobium narugense DSM
           14796]
 gi|332178747|gb|AEE14436.1| UPF0235 protein yggU [Thermodesulfobium narugense DSM 14796]
          Length = 86

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 46/81 (56%), Gaps = 4/81 (4%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +   P +K  SI ++ D  +  ++ AP +DG+AN  ++E +S     KR+ V I SG 
Sbjct: 5   IELKVTPNAKKESI-EIKDGKIYCKVSAPPEDGKANRRVIELISEFFDCKRKDVEIFSGE 63

Query: 102 KSRDKIVIVEEITPENVLNSL 122
           KS++KI++   I  EN+   +
Sbjct: 64  KSKNKILL---IKSENIFKKI 81


>gi|322418850|ref|YP_004198073.1| hypothetical protein GM18_1329 [Geobacter sp. M18]
 gi|320125237|gb|ADW12797.1| protein of unknown function DUF167 [Geobacter sp. M18]
          Length = 104

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 47/84 (55%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  +  T+H +P +    I    D  + +++ +P  D  AN   +E ++  LG+ + +VS
Sbjct: 12  PEGLLFTVHVQPRASRSEICGPKDGELRIRLTSPPVDDAANKQCVELIAKSLGLAKSKVS 71

Query: 97  IGSGSKSRDKIVIVEEITPENVLN 120
           I SG+KSR K+V VE +  +++L 
Sbjct: 72  IKSGAKSRHKVVRVEGVDQDDLLR 95


>gi|428203544|ref|YP_007082133.1| hypothetical protein Ple7327_3360 [Pleurocapsa sp. PCC 7327]
 gi|427980976|gb|AFY78576.1| hypothetical protein Ple7327_3360 [Pleurocapsa sp. PCC 7327]
          Length = 73

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 40/69 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KP SK   I +  DE++ + + +P  DG+AN  L+E ++    V + QV I S
Sbjct: 1   MKIQVKVKPNSKQQRIEEADDESLFIYLKSPPIDGKANQELIELLAKKFKVTKSQVLIKS 60

Query: 100 GSKSRDKIV 108
           G  S++K+V
Sbjct: 61  GLSSKNKLV 69


>gi|320161245|ref|YP_004174469.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1]
 gi|319995098|dbj|BAJ63869.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1]
          Length = 105

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 43/71 (60%), Gaps = 1/71 (1%)

Query: 41  SITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +IT+   P +    I ++ D+  V +++ AP  +G+AN AL++++S VL V R  + I +
Sbjct: 18  AITVRVTPRASKNEIYEILDDGTVKIRLTAPPVEGKANEALIDFLSEVLDVPRTSLEIVA 77

Query: 100 GSKSRDKIVIV 110
           G   RDKIV V
Sbjct: 78  GETGRDKIVTV 88


>gi|375107692|ref|ZP_09753953.1| hypothetical protein BurJ1DRAFT_4417 [Burkholderiales bacterium
           JOSHI_001]
 gi|374668423|gb|EHR73208.1| hypothetical protein BurJ1DRAFT_4417 [Burkholderiales bacterium
           JOSHI_001]
          Length = 124

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 47/83 (56%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P+ +R   P++  + +   PG+K   +  + D A+ +++ AP  DG+ANAAL + ++  L
Sbjct: 15  PTWLRADGPAACVLDVSVVPGAKRSELAGLHDGALRIKLAAPPVDGKANAALEKLLAQWL 74

Query: 89  GVKRRQVSIGSGSKSRDKIVIVE 111
            + +R V +  G  SR K + V+
Sbjct: 75  DLPQRDVQLLRGQTSRRKTLRVQ 97


>gi|406936857|gb|EKD70483.1| hypothetical protein ACD_46C00523G0009 [uncultured bacterium]
          Length = 99

 Score = 49.3 bits (116), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 40/73 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
             V++ I AKP +K   +  V ++ + + + A   +GEAN  L+ Y++ +  + +  V++
Sbjct: 8   HQVTLCIFAKPHAKQTVLLKVDNDGLHITLHAKPHEGEANKELISYLAKLFRLPKSHVNL 67

Query: 98  GSGSKSRDKIVIV 110
             G  SR K+V V
Sbjct: 68  QQGEHSRQKVVRV 80


>gi|37523404|ref|NP_926781.1| hypothetical protein glr3835 [Gloeobacter violaceus PCC 7421]
 gi|47117439|sp|Q7NEP3.1|Y3835_GLOVI RecName: Full=UPF0235 protein glr3835
 gi|35214408|dbj|BAC91776.1| glr3835 [Gloeobacter violaceus PCC 7421]
          Length = 111

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 9/92 (9%)

Query: 35  VPPSSVSI-------TIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
            PP +VS+       T+ A+P +    +      A  V++ AP  +G+ANA  +  +++ 
Sbjct: 14  FPPGAVSVDAGGITLTVWAQPRASCSEVVGWQQNAFKVRLAAPPVEGKANAECVALIAAF 73

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVL 119
            GV RRQVS+  G + R K + +E   P ++L
Sbjct: 74  FGVPRRQVSLVQGQQGRHKKIRIE--APADLL 103


>gi|302038855|ref|YP_003799177.1| hypothetical protein NIDE3568 [Candidatus Nitrospira defluvii]
 gi|300606919|emb|CBK43252.1| conserved protein of unknown function DUF167 [Candidatus Nitrospira
           defluvii]
          Length = 105

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 43/83 (51%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V+I++H +P +       +   AV ++I AP  DG ANA L  +++    V    V I S
Sbjct: 20  VTISVHVQPKASRSECAGLHGHAVKIRIAAPPADGAANAELCRFLARCCEVPLSAVHILS 79

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G+ SR K V+V+  T E V   L
Sbjct: 80  GAGSRQKRVLVKGRTAEQVRAQL 102


>gi|88812461|ref|ZP_01127710.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231]
 gi|88790247|gb|EAR21365.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231]
          Length = 99

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 46/82 (56%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + +T+  +P +    +   +D  + ++I AP  +G+AN  L  ++   LGV R QVS+
Sbjct: 9   TDLILTVRVQPRAARDELKIDADGRLRLRITAPPVEGKANEHLRHFLGHALGVARSQVSV 68

Query: 98  GSGSKSRDKIVIVEEITPENVL 119
            +G+ SR+K ++V+   P   L
Sbjct: 69  ATGATSRNKRIVVQNADPAKAL 90


>gi|154705806|ref|YP_001424486.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111]
 gi|154355092|gb|ABS76554.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111]
          Length = 92

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 45/73 (61%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S+ +T++ +PG+K   I+    E + +++ AP  +G+AN AL+++++  L +    ++I 
Sbjct: 4   SIILTVYIQPGAKQTQISGKHGEHIKIRLQAPPTEGKANKALIDFLAQRLKLNPSSITII 63

Query: 99  SGSKSRDKIVIVE 111
            G K+R K + +E
Sbjct: 64  RGEKARLKTIAIE 76


>gi|381159733|ref|ZP_09868965.1| TIGR00251 family protein [Thiorhodovibrio sp. 970]
 gi|380877797|gb|EIC19889.1| TIGR00251 family protein [Thiorhodovibrio sp. 970]
          Length = 94

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 43/72 (59%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ +  +P ++     +   +AV VQ+ AP  DG ANA+L+ +++   GV R QV++ S
Sbjct: 11  LTLNLRVQPRARRDGFAEPIGDAVKVQLRAPPVDGRANASLIAFVAKAFGVPRAQVTLLS 70

Query: 100 GSKSRDKIVIVE 111
           G  SR K + ++
Sbjct: 71  GEHSRSKRLRIQ 82


>gi|336261944|ref|XP_003345758.1| hypothetical protein SMAC_05915 [Sordaria macrospora k-hell]
 gi|380090094|emb|CCC12177.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 133

 Score = 48.9 bits (115), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 45/74 (60%), Gaps = 2/74 (2%)

Query: 39  SVSITIHAKPGS--KSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++ I  H KPG+  +   +T ++DEAV + + A AK+GEAN ++++ +S  L + +  + 
Sbjct: 29  TIYIHCHVKPGASKQREGVTCITDEAVEICVAAQAKEGEANKSVVKVLSEALNIPKSNLE 88

Query: 97  IGSGSKSRDKIVIV 110
           I  G KSR K + V
Sbjct: 89  ITQGLKSRAKTIAV 102


>gi|291286027|ref|YP_003502843.1| hypothetical protein Dacet_0081 [Denitrovibrio acetiphilus DSM
           12809]
 gi|290883187|gb|ADD66887.1| protein of unknown function DUF167 [Denitrovibrio acetiphilus DSM
           12809]
          Length = 84

 Score = 48.9 bits (115), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 51/85 (60%), Gaps = 1/85 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + ++++ +PG+K   ++ + D  + +++ AP  +G AN  L++++S  L + +  + I S
Sbjct: 1   MKLSVYVQPGAKKTELSGMHDGKIKIRVCAPPVEGAANEVLVKFLSKQLKISKSGIKIIS 60

Query: 100 GSKSRDKIVIVEEITPENVLNSLGK 124
           G KSR KIV +   T  +V+N L K
Sbjct: 61  GEKSRHKIVEINMDTL-DVMNCLSK 84


>gi|72388068|ref|XP_844458.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
          GUTat10.1]
 gi|62359409|gb|AAX79847.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70800991|gb|AAZ10899.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
          927/4 GUTat10.1]
          Length = 165

 Score = 48.9 bits (115), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 3/62 (4%)

Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          S I  + P    + IHAKPG++S ++       DEA+ V++ AP  DG+AN  L+E+M +
Sbjct: 5  SFIVQLRPGFFRLMIHAKPGARSTALAAQPQALDEALEVRLAAPPVDGKANTELVEFMQT 64

Query: 87 VL 88
          +L
Sbjct: 65 LL 66


>gi|261327632|emb|CBH10608.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
          DAL972]
 gi|289742777|gb|ADD20136.1| hypothetical protein Tb927.4.3080 [Glossina morsitans morsitans]
          Length = 165

 Score = 48.9 bits (115), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 38/62 (61%), Gaps = 3/62 (4%)

Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          S I  + P    + IHAKPG++S ++       DEA+ V++ AP  DG+AN  L+E+M +
Sbjct: 5  SFIVQLRPGFFRLMIHAKPGARSTALAAQPQALDEALEVRLAAPPVDGKANTELVEFMQT 64

Query: 87 VL 88
          +L
Sbjct: 65 LL 66


>gi|451852870|gb|EMD66164.1| hypothetical protein COCSADRAFT_34741 [Cochliobolus sativus ND90Pr]
          Length = 118

 Score = 48.9 bits (115), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 42/74 (56%), Gaps = 2/74 (2%)

Query: 39  SVSITIHAKPG--SKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++ +  H KPG  +    I  +SDE + + + A A+DGEAN A+   ++  LGV +  V 
Sbjct: 20  TIQLLCHVKPGVSANREGIATISDERIQLCVAAQARDGEANKAVRRVVADALGVPKSAVE 79

Query: 97  IGSGSKSRDKIVIV 110
           +  G KSRDK V V
Sbjct: 80  VIKGMKSRDKTVAV 93


>gi|168703344|ref|ZP_02735621.1| hypothetical protein GobsU_27681 [Gemmata obscuriglobus UQM 2246]
          Length = 101

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 46/78 (58%)

Query: 41  SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           ++ +  +P +K  ++      A+ V + AP +DG AN A+L  +     ++R Q+++ SG
Sbjct: 13  TLAVRVQPKAKKNAVLGERASALRVSVTAPPEDGRANDAVLALLCDHFKLQRSQLALLSG 72

Query: 101 SKSRDKIVIVEEITPENV 118
             +R+K+++V  +TP+ +
Sbjct: 73  QTNRNKVILVRGVTPQQL 90


>gi|386816935|ref|ZP_10104153.1| UPF0235 protein yggU [Thiothrix nivea DSM 5205]
 gi|386421511|gb|EIJ35346.1| UPF0235 protein yggU [Thiothrix nivea DSM 5205]
          Length = 96

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 43/75 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            ++ + I  +P +      D+ ++ + ++I AP  DG+AN  L+++++   GV + +V I
Sbjct: 10  DTLHLFIKVQPKASKDEFADIQEDRIRIRITAPPVDGKANQHLVKFLAKAFGVAKSKVQI 69

Query: 98  GSGSKSRDKIVIVEE 112
            SG   R+K V VE+
Sbjct: 70  ISGETGRNKHVCVED 84


>gi|300087416|ref|YP_003757938.1| hypothetical protein Dehly_0296 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
 gi|299527149|gb|ADJ25617.1| protein of unknown function DUF167 [Dehalogenimonas
           lykanthroporepellens BL-DC-9]
          Length = 96

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 46/83 (55%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +  +PGS    ITD + E + V++ A  + G+AN A+ E ++  LG+ + +V+I  G 
Sbjct: 12  IALKVQPGSGRNEITDTAAEIIRVRVTAAPEHGKANRAVAELLAERLGLPKSRVTIVRGL 71

Query: 102 KSRDKIVIVEEITPENVLNSLGK 124
            SR K+  V  ++   V   LGK
Sbjct: 72  TSRRKVAAVAGLSEAEVREKLGK 94


>gi|167854882|ref|ZP_02477658.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755]
 gi|219871635|ref|YP_002476010.1| hypothetical protein HAPS_1504 [Haemophilus parasuis SH0165]
 gi|254800538|sp|B8F6W0.1|Y1504_HAEPS RecName: Full=UPF0235 protein HAPS_1504
 gi|167853949|gb|EDS25187.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755]
 gi|219691839|gb|ACL33062.1| conserved hypothetical protein [Haemophilus parasuis SH0165]
          Length = 97

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 38/76 (50%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D  + + I AP  DG+ANA LL+Y+S +  V +  + 
Sbjct: 10  PQGIRLRIFLQPKASRDQIVGLHDNELKIAITAPPIDGQANAHLLKYLSKLFKVPKSSIV 69

Query: 97  IGSGSKSRDKIVIVEE 112
           +  G   R K + V E
Sbjct: 70  LEKGELQRHKQIFVPE 85


>gi|328866439|gb|EGG14823.1| hypothetical protein DFA_10696 [Dictyostelium fasciculatum]
          Length = 152

 Score = 48.5 bits (114), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 46/73 (63%), Gaps = 1/73 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           V + ++  P +K  +I   +D+  + ++I  P  DG+AN  +++++S  L +K+R +++ 
Sbjct: 57  VRLNVNVHPNAKQSTIVQFTDDGCLDLRISQPPIDGKANDEVIDFLSDELKLKKRFITVD 116

Query: 99  SGSKSRDKIVIVE 111
            G KSR+K++ ++
Sbjct: 117 KGLKSRNKVIAID 129


>gi|71660987|ref|XP_817521.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70882718|gb|EAN95670.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 156

 Score = 48.5 bits (114), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 40/62 (64%), Gaps = 3/62 (4%)

Query: 30 SCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          S I  + P    +T+HAKPG++S S+     V+D A+ V+I AP  +G+ANA L+++M  
Sbjct: 5  SFISQLRPGWFHLTVHAKPGARSSSLACHPAVTDAALEVRIGAPPVEGKANAELVDFMQM 64

Query: 87 VL 88
          +L
Sbjct: 65 LL 66


>gi|336314737|ref|ZP_08569653.1| hypothetical protein Rhein_1022 [Rheinheimera sp. A13L]
 gi|335881036|gb|EGM78919.1| hypothetical protein Rhein_1022 [Rheinheimera sp. A13L]
          Length = 97

 Score = 48.5 bits (114), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 37/67 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + +H +PG+       +  +A+ V I AP  DG+ANA L ++++    V +RQV +  
Sbjct: 10  ILLQLHVQPGASRDQFLGLHGDAIKVAIKAPPVDGKANAHLQQFLAHSFDVAKRQVLLEK 69

Query: 100 GSKSRDK 106
           G  SR K
Sbjct: 70  GELSRQK 76


>gi|434400838|ref|YP_007134842.1| UPF0235 protein yggU [Stanieria cyanosphaera PCC 7437]
 gi|428271935|gb|AFZ37876.1| UPF0235 protein yggU [Stanieria cyanosphaera PCC 7437]
          Length = 72

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 45/69 (65%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I+I  KP S+   I +++D ++ +++ +P +DG+AN  L+E ++    V + Q++I +
Sbjct: 1   MKISIKVKPNSQQQKIEELADGSLIIRLKSPPRDGKANQELIEMLAKKFQVAKSQITIKA 60

Query: 100 GSKSRDKIV 108
           G  S++K++
Sbjct: 61  GLSSKNKLI 69


>gi|452944550|ref|YP_007500715.1| protein of unknown function DUF167 [Hydrogenobaculum sp. HO]
 gi|452882968|gb|AGG15672.1| protein of unknown function DUF167 [Hydrogenobaculum sp. HO]
          Length = 73

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 41/67 (61%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  KP +K+ S+  + D+++ + I AP  DG+AN  L+  +S  L V + ++ I +G 
Sbjct: 3   LRVKVKPNAKTVSVEQLEDKSLKISIKAPPVDGKANEELINVLSEFLKVSKSKIHIKAGK 62

Query: 102 KSRDKIV 108
            S++K+V
Sbjct: 63  SSKEKLV 69


>gi|111075036|gb|ABH04883.1| uncharacterized conserved protein yggY [Heliobacillus mobilis]
          Length = 101

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 43/83 (51%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  V   I  +P +    +  + D+A+ V++ AP  DGEAN A   + +  L + + QV 
Sbjct: 8   PGGVRFKIRVQPRASKNEVCGLLDDALKVRLTAPPVDGEANGACQAFFAKTLSLPKSQVR 67

Query: 97  IGSGSKSRDKIVIVEEITPENVL 119
           + +G  SR K V V  ++ E +L
Sbjct: 68  LVAGETSRTKTVEVIGVSKEQIL 90


>gi|429966179|gb|ELA48176.1| TIGR00251 family protein [Vavraia culicis 'floridensis']
          Length = 101

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 46/75 (61%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + +  +PGS    +T + D  + V ++A  +  +AN  ++ ++S++L V +  VSI S
Sbjct: 11  VLLNLVIRPGSNCTGLTRIQDGQLHVDVNAVPEKNKANKEVIRFISTLLKVDKSSVSIVS 70

Query: 100 GSKSRDKIVIVEEIT 114
           G KSR+KIV+V + T
Sbjct: 71  GLKSRNKIVMVRDET 85


>gi|254443993|ref|ZP_05057469.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
 gi|198258301|gb|EDY82609.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235]
          Length = 94

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 37/57 (64%)

Query: 58  VSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEIT 114
           + D ++ ++I +P +DG+AN AL+ +++   GV + Q+SI  G  SR K++  E ++
Sbjct: 28  LEDGSLKIRIQSPPQDGKANKALIAFLAKETGVSKNQISIARGETSRQKLIAFERLS 84


>gi|291278644|ref|YP_003495479.1| hypothetical protein DEFDS_0212 [Deferribacter desulfuricans SSM1]
 gi|290753346|dbj|BAI79723.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1]
          Length = 82

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%), Gaps = 1/75 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V IT + +PG+K   +    +    +++ +P  DG AN  L+++++  LGV +  V I S
Sbjct: 3   VRITFYIQPGAKKTEVAGEFNNMTKIKVASPPVDGAANKELIKFLAKKLGVSKSSVKIVS 62

Query: 100 GSKSRDKIV-IVEEI 113
           G KSR K V  +E+I
Sbjct: 63  GEKSRIKTVEFLEDI 77


>gi|401412293|ref|XP_003885594.1| hypothetical protein NCLIV_059910 [Neospora caninum Liverpool]
 gi|325120013|emb|CBZ55566.1| hypothetical protein NCLIV_059910 [Neospora caninum Liverpool]
          Length = 531

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 29/88 (32%), Positives = 53/88 (60%), Gaps = 5/88 (5%)

Query: 2  APAKKGKSKAKSAGSTQSKIKTNDE--NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVS 59
          A  +K K +A +    + +  T D   +LP  +R+V    +++ +HAKPG+K   I  ++
Sbjct: 11 AAVQKKKPEAGNKHEKKDERNTTDSQSDLPRFLRIVD-GGLTLAVHAKPGAKQSQIPSIN 69

Query: 60 DEA--VGVQIDAPAKDGEANAALLEYMS 85
          ++A  + VQIDAPA++G AN  L ++++
Sbjct: 70 EQAEQLDVQIDAPAREGAANEELCDFLA 97


>gi|406940280|gb|EKD73095.1| hypothetical protein ACD_45C00464G0002 [uncultured bacterium]
          Length = 101

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 48/85 (56%), Gaps = 1/85 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++++I  KP +K  +I  + D+A+ + + A    G+AN AL+ Y++ +  + +++V +  
Sbjct: 10  INLSILVKPNAKRTAILKIDDQALTIALHATPHYGKANQALIAYLADLFQLTKKEVILKR 69

Query: 100 GSK-SRDKIVIVEEITPENVLNSLG 123
           G+   R  IVI    T   ++N+L 
Sbjct: 70  GATGKRKHIVIPFTKTTRKIMNNLA 94


>gi|77166447|ref|YP_344972.1| hypothetical protein Noc_3000 [Nitrosococcus oceani ATCC 19707]
 gi|254435182|ref|ZP_05048689.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani
           AFC27]
 gi|123593242|sp|Q3J6V4.1|Y3000_NITOC RecName: Full=UPF0235 protein Noc_3000
 gi|76884761|gb|ABA59442.1| Conserved hypothetical protein 251 [Nitrosococcus oceani ATCC
           19707]
 gi|207088293|gb|EDZ65565.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani
           AFC27]
          Length = 102

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 43/76 (56%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ I I  +P +K   +     + + ++I AP  +G+AN  LL +++    V R QV + 
Sbjct: 12  ALIIQIRLQPRAKGDEVIGPHGDRLKIRITAPPVEGKANTHLLRFLAKTFQVSRNQVYLL 71

Query: 99  SGSKSRDKIVIVEEIT 114
           SG+ SRDK V +E+ T
Sbjct: 72  SGATSRDKRVRIEKPT 87


>gi|300866080|ref|ZP_07110810.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300335941|emb|CBN55968.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 78

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 40/67 (59%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +TI  KP SK  +I    D +  + + +P  DG+AN  L++ ++   G+ + Q++I SG 
Sbjct: 4   LTIKVKPNSKQQNIQQEPDGSFKISLKSPPIDGKANEELIKLLAKKFGIPKSQITIKSGL 63

Query: 102 KSRDKIV 108
            S++K+V
Sbjct: 64  SSKNKLV 70


>gi|197117881|ref|YP_002138308.1| hypothetical protein Gbem_1494 [Geobacter bemidjiensis Bem]
 gi|197087241|gb|ACH38512.1| protein of unknown function DUF167 [Geobacter bemidjiensis Bem]
          Length = 99

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 48/86 (55%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  +  T+H +P +    I    +  + +++ +P  +  AN   +E ++  LGV + +VS
Sbjct: 11  PEGLLFTVHVQPRASRSEICGAKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKVS 70

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSL 122
           I SG+KSR K+V VE +  + +L+ L
Sbjct: 71  IKSGAKSRHKVVKVEGVDHDALLSLL 96


>gi|195953751|ref|YP_002122041.1| hypothetical protein HY04AAS1_1378 [Hydrogenobaculum sp. Y04AAS1]
 gi|226734129|sp|B4U5M3.1|Y1378_HYDS0 RecName: Full=UPF0235 protein HY04AAS1_1378
 gi|195933363|gb|ACG58063.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1]
          Length = 73

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 43/67 (64%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  KP +K+ S+  + D+++ + I +P  +G+AN  L++ +S  L V + +++I +G 
Sbjct: 3   LRVKVKPNAKTVSVEQLEDKSLKISIKSPPVNGKANEELIKVLSEFLKVSKSKINIKAGK 62

Query: 102 KSRDKIV 108
            SR+K+V
Sbjct: 63  SSREKLV 69


>gi|121535253|ref|ZP_01667067.1| protein of unknown function DUF167 [Thermosinus carboxydivorans
           Nor1]
 gi|121306138|gb|EAX47066.1| protein of unknown function DUF167 [Thermosinus carboxydivorans
           Nor1]
          Length = 100

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 47/84 (55%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           V P  +S  I  +P +   ++  ++ +++ V + +P  +GEAN A L + +++ GV + +
Sbjct: 10  VAPDGISFKIKVQPRASRNAVIGLAGDSLKVCVASPPVEGEANQACLAFFAALFGVAKTR 69

Query: 95  VSIGSGSKSRDKIVIVEEITPENV 118
           + + SG KSR K++ +  I  E  
Sbjct: 70  IVLVSGQKSRSKVIKIMGIDMEQF 93


>gi|387769243|ref|ZP_10125509.1| TIGR00251 family protein [Pasteurella bettyae CCUG 2042]
 gi|386907199|gb|EIJ71914.1| TIGR00251 family protein [Pasteurella bettyae CCUG 2042]
          Length = 95

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 38/73 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + DE + V I AP  DG ANA LL+Y+S +  V +  + +  
Sbjct: 11  LRLKIFLQPKASKDQIIGIHDEELKVSITAPPVDGAANAHLLKYLSKLFKVPKSSIVLEK 70

Query: 100 GSKSRDKIVIVEE 112
           G  +R K V + E
Sbjct: 71  GELNRHKQVFIPE 83


>gi|253701327|ref|YP_003022516.1| hypothetical protein GM21_2722 [Geobacter sp. M21]
 gi|251776177|gb|ACT18758.1| protein of unknown function DUF167 [Geobacter sp. M21]
          Length = 99

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 48/86 (55%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  +  T+H +P +    I    +  + +++ +P  +  AN   +E ++  LGV + +VS
Sbjct: 11  PEGLLFTVHVQPRASRSEICGAKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKVS 70

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSL 122
           I SG+KSR K+V VE +  + +L+ L
Sbjct: 71  IRSGAKSRHKVVKVEGVDHDALLSLL 96


>gi|417772443|ref|ZP_12420332.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|418679788|ref|ZP_13241045.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|418706859|ref|ZP_13267697.1| TIGR00251 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|418716666|ref|ZP_13276629.1| TIGR00251 family protein [Leptospira interrogans str. UI 08452]
 gi|418728749|ref|ZP_13287320.1| TIGR00251 family protein [Leptospira interrogans str. UI 12758]
 gi|421118710|ref|ZP_15579045.1| TIGR00251 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|421127704|ref|ZP_15587925.1| TIGR00251 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421134285|ref|ZP_15594426.1| TIGR00251 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|400328389|gb|EJO80621.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|409945814|gb|EKN95829.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|410009734|gb|EKO67890.1| TIGR00251 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|410021584|gb|EKO88368.1| TIGR00251 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410434763|gb|EKP83898.1| TIGR00251 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|410763436|gb|EKR34165.1| TIGR00251 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|410776601|gb|EKR56578.1| TIGR00251 family protein [Leptospira interrogans str. UI 12758]
 gi|410787437|gb|EKR81169.1| TIGR00251 family protein [Leptospira interrogans str. UI 08452]
 gi|455668218|gb|EMF33463.1| TIGR00251 family protein [Leptospira interrogans serovar Pomona
           str. Fox 32256]
          Length = 73

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 40/69 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T++ KP SK        D  + + +  PA +G+AN A++E++S  + V + ++ I S
Sbjct: 1   MKFTVYVKPNSKKVFFRKEEDGVLTIAVREPALEGKANEAVIEFISKEMKVPKSKIRILS 60

Query: 100 GSKSRDKIV 108
           G K++ KI+
Sbjct: 61  GQKNKKKII 69


>gi|367470930|ref|ZP_09470595.1| protein of unknown function DUF167 [Patulibacter sp. I11]
 gi|365814003|gb|EHN09236.1| protein of unknown function DUF167 [Patulibacter sp. I11]
          Length = 92

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 44/75 (58%), Gaps = 2/75 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P+ V++ +  + G  +  I    ++ + V++ AP  DG AN AL + ++  LGV R +V 
Sbjct: 4   PTKVAVRLQPRGGRDA--ILGWREDRLLVRVSAPPVDGRANVALCKLLARQLGVARGRVE 61

Query: 97  IGSGSKSRDKIVIVE 111
           +  G +SRDK+V +E
Sbjct: 62  LLQGHQSRDKLVAIE 76


>gi|95930347|ref|ZP_01313084.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684]
 gi|95133599|gb|EAT15261.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684]
          Length = 103

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 1/100 (1%)

Query: 26  ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
           E L +C+       V I +  +P +   S+  +  E + V++ +P  +G AN     + +
Sbjct: 4   EELSACVN-AQQDGVVIALFVQPRASKNSLCGLQGEELKVRLTSPPVEGAANKLCCTFFA 62

Query: 86  SVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
            +LGV +  V++  G KSR K ++VE ++ + V   L KA
Sbjct: 63  KLLGVSKSSVTLIRGDKSRHKQIVVEGVSLDEVKQRLAKA 102


>gi|254409972|ref|ZP_05023752.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196183008|gb|EDX77992.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 74

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/66 (34%), Positives = 40/66 (60%)

Query: 43  TIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSK 102
           ++  KP SK+ SI +++D  + V + +P  DG+AN  L+E ++    V + QV I SG  
Sbjct: 4   SVKVKPNSKTQSIEEMADGTLKVNLKSPPVDGKANKELIELLAEKFNVTKSQVQIKSGLS 63

Query: 103 SRDKIV 108
           S+ K++
Sbjct: 64  SKIKLI 69


>gi|114778113|ref|ZP_01453000.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1]
 gi|114551531|gb|EAU54085.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1]
          Length = 101

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 43/72 (59%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + IHA+PG++  ++  +  +A+ + +   A+DG+AN A++ +++  L + R  V + S
Sbjct: 14  LYVNIHAQPGARKPALRGMHGDALKIAVAEAAQDGKANEAIVRFIADALNLSRADVDVAS 73

Query: 100 GSKSRDKIVIVE 111
           G  SR K + + 
Sbjct: 74  GHTSRRKRLFLH 85


>gi|340966729|gb|EGS22236.1| hypothetical protein CTHT_0017530 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 120

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 46/77 (59%), Gaps = 2/77 (2%)

Query: 36  PPSSVSITIHAKPGS--KSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P  ++ +  + KPG+  +   I  V D+AV + + A A++GEAN A+++ +S VL + + 
Sbjct: 17  PNGTLYLNCNVKPGASKQREGIASVGDDAVEICVAAQAREGEANKAVIKVLSEVLDLPKS 76

Query: 94  QVSIGSGSKSRDKIVIV 110
            + I  G KSR+K V V
Sbjct: 77  DLQITQGLKSRNKTVAV 93


>gi|328952006|ref|YP_004369340.1| hypothetical protein Desac_0267 [Desulfobacca acetoxidans DSM
           11109]
 gi|328452330|gb|AEB08159.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109]
          Length = 101

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 43/79 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +   + IH  PG+ S  I     + + ++I A  + G AN  LL Y++  LG+ + ++ +
Sbjct: 9   TGYFLRIHVVPGAASNQIMGPHGDRLKIRIAAAPEKGAANKELLNYLAKCLGLPKNRLHL 68

Query: 98  GSGSKSRDKIVIVEEITPE 116
            SG++ R K+V V  + PE
Sbjct: 69  KSGAQDRVKVVEVVGLAPE 87


>gi|226942467|ref|YP_002797540.1| hypothetical protein Avin_03050 [Azotobacter vinelandii DJ]
 gi|259646933|sp|C1DI68.1|Y305_AZOVD RecName: Full=UPF0235 protein Avin_03050
 gi|226717394|gb|ACO76565.1| conserved hypothetical protein [Azotobacter vinelandii DJ]
          Length = 99

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 39/69 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P +       +  E + +++ AP  +G+ANA LL +++ V GV + QVS+ SG 
Sbjct: 13  LACHLQPKASKDEFAGLHGERLKIRLTAPPVEGKANAHLLAFLAGVFGVPKSQVSLESGE 72

Query: 102 KSRDKIVIV 110
            +R K V +
Sbjct: 73  SNRQKRVRI 81


>gi|255089425|ref|XP_002506634.1| predicted protein [Micromonas sp. RCC299]
 gi|226521907|gb|ACO67892.1| predicted protein [Micromonas sp. RCC299]
          Length = 319

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/98 (24%), Positives = 54/98 (55%), Gaps = 2/98 (2%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +P CI+      + I I  +  +++ +++ ++ + VGV +  P   G+ +  LLE++  V
Sbjct: 130 VPPCIQQTASGMIQIAIDVEDKARTKAVSKITADEVGVALTLPV--GQCDDELLEFLGKV 187

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           L ++  Q+S+  G  +R K+++V+ ++   V   L K+
Sbjct: 188 LHLRLPQMSLLRGWSTRSKLLMVQGLSATQVYERLHKS 225


>gi|113461793|ref|YP_719862.1| hypothetical protein HS_1657 [Haemophilus somnus 129PT]
 gi|170718106|ref|YP_001785139.1| hypothetical protein HSM_1819 [Haemophilus somnus 2336]
 gi|123132241|sp|Q0I525.1|Y1657_HAES1 RecName: Full=UPF0235 protein HS_1657
 gi|226696075|sp|B0UWD6.1|Y1819_HAES2 RecName: Full=UPF0235 protein HSM_1819
 gi|112823836|gb|ABI25925.1| conserved hypothetical protein [Haemophilus somnus 129PT]
 gi|168826235|gb|ACA31606.1| protein of unknown function DUF167 [Haemophilus somnus 2336]
          Length = 99

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 40/72 (55%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + I  +P +    +  + D A+ + I AP  DG+ANA LL+++S    V + Q+ + 
Sbjct: 10  NLRLRIFLQPKASKDHLIGLYDNALKISITAPPIDGQANAHLLKFLSKTFKVAKSQIILE 69

Query: 99  SGSKSRDKIVIV 110
            G  SR K +++
Sbjct: 70  KGELSRHKQILI 81


>gi|317050344|ref|YP_004111460.1| hypothetical protein Selin_0146 [Desulfurispirillum indicum S5]
 gi|316945428|gb|ADU64904.1| protein of unknown function DUF167 [Desulfurispirillum indicum S5]
          Length = 92

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/77 (25%), Positives = 45/77 (58%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S+++ +  +P S    +   +D  + ++++AP  DG AN  ++E+ S +L + + ++SI 
Sbjct: 8   SLTLQLKVQPRSSRTELLREADGQLKLRLNAPPVDGAANQQVIEFFSRLLSIPKSRISIV 67

Query: 99  SGSKSRDKIVIVEEITP 115
            G +SR K++ +  + P
Sbjct: 68  QGQQSRRKVIALLGVEP 84


>gi|256828220|ref|YP_003156948.1| hypothetical protein Dbac_0406 [Desulfomicrobium baculatum DSM
           4028]
 gi|256577396|gb|ACU88532.1| protein of unknown function DUF167 [Desulfomicrobium baculatum DSM
           4028]
          Length = 106

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 43/70 (61%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I  +PG++   +  +  + + +++ A A D +AN+AL  ++S +LG+K  QV I SG 
Sbjct: 19  LGIWVQPGARKTEVAGLHGDFLKIRVQARAVDNKANSALTVFVSKILGIKASQVVIESGH 78

Query: 102 KSRDKIVIVE 111
            SR K ++++
Sbjct: 79  ASRQKNLLLD 88


>gi|301774793|ref|XP_002922805.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40-like
           [Ailuropoda melanoleuca]
          Length = 129

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 56/109 (51%), Gaps = 19/109 (17%)

Query: 33  RLVPP---------SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAK--------DGE 75
           RL+PP           V+I IHAKPGSK  + TDV+ + V V I AP          +GE
Sbjct: 20  RLLPPLGPVAVDPKGCVTIAIHAKPGSKQNATTDVTAKVVSVAITAPPPTPPAPRQSEGE 79

Query: 76  ANAALLEYMSSVLGVKRR-QVSIGSGSKSRDKIV-IVEEITPENVLNSL 122
           ANA L   +S VL +++   V +  G  S +K+V ++   T E +L  +
Sbjct: 80  ANAELSWCLSKVLELRKSDDVILDKGGXSHEKVVKLLASTTAEEILEKV 128


>gi|367031698|ref|XP_003665132.1| hypothetical protein MYCTH_2308510 [Myceliophthora thermophila ATCC
           42464]
 gi|347012403|gb|AEO59887.1| hypothetical protein MYCTH_2308510 [Myceliophthora thermophila ATCC
           42464]
          Length = 119

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 36/54 (66%)

Query: 55  ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           IT V+DEAV + + A A++GEAN A++  +S VL + +  + I  G KSR+K V
Sbjct: 37  ITSVNDEAVEICVAAQAREGEANKAVIRVLSEVLDLPKSDLQITQGLKSRNKTV 90


>gi|374814495|ref|ZP_09718232.1| protein C15orf40 [Treponema primitia ZAS-1]
          Length = 97

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 48/81 (59%), Gaps = 1/81 (1%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           + +C+R +  +S+ + I A PGS    I  +S+  + ++I A  +DG+ANA L  +++  
Sbjct: 1   MTNCLR-IAGASILLDIKAVPGSSKSQIAGLSENRLRIKIAAAPEDGKANAELRSFLAKT 59

Query: 88  LGVKRRQVSIGSGSKSRDKIV 108
           L   ++ +++ +G KSR K +
Sbjct: 60  LDCPKKDITLTAGEKSRLKTL 80


>gi|357632882|ref|ZP_09130760.1| UPF0235 protein yggU [Desulfovibrio sp. FW1012B]
 gi|357581436|gb|EHJ46769.1| UPF0235 protein yggU [Desulfovibrio sp. FW1012B]
          Length = 125

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 44/74 (59%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P   ++ +   PG    ++  +++  + V++ A A +G+ANA L  +++  LG++ RQV+
Sbjct: 35  PGVWTLRVVVTPGGSRDTLAGLAEGRLRVRLRAKAVEGQANAGLTVFLAGCLGLRPRQVA 94

Query: 97  IGSGSKSRDKIVIV 110
           I SG KSR K + +
Sbjct: 95  IVSGEKSRKKTLRI 108


>gi|386289122|ref|ZP_10066260.1| hypothetical protein DOK_16883 [gamma proteobacterium BDW918]
 gi|385277854|gb|EIF41828.1| hypothetical protein DOK_16883 [gamma proteobacterium BDW918]
          Length = 99

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 50/83 (60%), Gaps = 2/83 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P + +        + + ++I A A DG+ANA L+ ++++  GV +++VS+ +GS
Sbjct: 16  LFCHIQPKASANEFAGQHGDRLKIRITAAATDGKANAGLIAFVAAAFGVSKQRVSLSAGS 75

Query: 102 KSRDKIVIVEEIT--PENVLNSL 122
           +SR K + ++  T  PE + +++
Sbjct: 76  QSRQKTLKIQAPTSIPEALADTV 98


>gi|384566916|ref|ZP_10014020.1| hypothetical protein SacglDRAFT_03099 [Saccharomonospora glauca
           K62]
 gi|384522770|gb|EIE99965.1| hypothetical protein SacglDRAFT_03099 [Saccharomonospora glauca
           K62]
          Length = 99

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 42/73 (57%), Gaps = 4/73 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           + + +  K G+K   +    D A+G    V + APA DG+AN A+   +++  G++ R V
Sbjct: 1   MRVAVRVKAGAKRDGVGGRWDGALGEALVVSVRAPAVDGKANTAVCRVLAAAFGLRARDV 60

Query: 96  SIGSGSKSRDKIV 108
           ++  G +SRDK+V
Sbjct: 61  TVVRGQRSRDKLV 73


>gi|373466486|ref|ZP_09557800.1| TIGR00251 family protein [Haemophilus sp. oral taxon 851 str.
           F0397]
 gi|371760268|gb|EHO48957.1| TIGR00251 family protein [Haemophilus sp. oral taxon 851 str.
           F0397]
          Length = 95

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 2/85 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S    V +  + +  
Sbjct: 11  IRLRIFLQPKASKDHIAGIHDDELKITITAPPIDGQANAHLLKFLSKSFKVPKSSIILEK 70

Query: 100 GSKSRDKIVIVEE--ITPENVLNSL 122
           G  SR K V + E  + P+ + N L
Sbjct: 71  GELSRHKQVWIPEPKLIPKEIQNLL 95


>gi|417841364|ref|ZP_12487468.1| UPF0235 protein [Haemophilus haemolyticus M19501]
 gi|417844664|ref|ZP_12490705.1| UPF0235 protein [Haemophilus haemolyticus M21639]
 gi|341949402|gb|EGT76006.1| UPF0235 protein [Haemophilus haemolyticus M19501]
 gi|341956623|gb|EGT83044.1| UPF0235 protein [Haemophilus haemolyticus M21639]
          Length = 95

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 44/85 (51%), Gaps = 2/85 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S    V +  + +  
Sbjct: 11  IRLRIFLQPKASKDHIAGIHDDELKITITAPPVDGQANAHLLKFLSKSFKVPKSSIILEK 70

Query: 100 GSKSRDKIVIVEE--ITPENVLNSL 122
           G  SR K V + E  + P+ + N L
Sbjct: 71  GELSRHKQVWIPEPKLIPKEIQNLL 95


>gi|386392438|ref|ZP_10077219.1| hypothetical protein DesU5LDRAFT_1837 [Desulfovibrio sp. U5L]
 gi|385733316|gb|EIG53514.1| hypothetical protein DesU5LDRAFT_1837 [Desulfovibrio sp. U5L]
          Length = 125

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 44/74 (59%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P   ++ +   PG    ++  +++  + V++ A A +G+ANA L  +++  LG++ RQV+
Sbjct: 35  PGVWTLRVVVTPGGSRDTLAGLAEGRLRVRLRAKAVEGQANAGLTVFLAGCLGLRPRQVA 94

Query: 97  IGSGSKSRDKIVIV 110
           I SG KSR K + +
Sbjct: 95  IVSGEKSRKKTLRI 108


>gi|284046655|ref|YP_003396995.1| hypothetical protein Cwoe_5214 [Conexibacter woesei DSM 14684]
 gi|283950876|gb|ADB53620.1| protein of unknown function DUF167 [Conexibacter woesei DSM 14684]
          Length = 88

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 44/81 (54%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  +P +K   I    D A+ V++ AP  DG+ANAAL   ++  LGV    V++  G 
Sbjct: 4   LRVRLQPRAKRNEIVGERDGALVVRVTAPPVDGKANAALCRLLAKALGVAPSTVTVVRGQ 63

Query: 102 KSRDKIVIVEEITPENVLNSL 122
            +RDK+V V+ +     L+ L
Sbjct: 64  SARDKVVHVDAVETGVALHRL 84


>gi|425902121|ref|ZP_18878712.1| hypothetical protein Pchl3084_5573 [Pseudomonas chlororaphis subsp.
           aureofaciens 30-84]
 gi|397881715|gb|EJK98204.1| hypothetical protein Pchl3084_5573 [Pseudomonas chlororaphis subsp.
           aureofaciens 30-84]
          Length = 97

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 40/69 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  E + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGERLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIVIV 110
            +R K V +
Sbjct: 73  LNRQKRVRI 81


>gi|189500987|ref|YP_001960457.1| hypothetical protein Cphamn1_2066 [Chlorobium phaeobacteroides BS1]
 gi|226701149|sp|B3EMY7.1|Y2066_CHLPB RecName: Full=UPF0235 protein Cphamn1_2066
 gi|189496428|gb|ACE04976.1| protein of unknown function DUF167 [Chlorobium phaeobacteroides
           BS1]
          Length = 101

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 42/86 (48%), Gaps = 9/86 (10%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           SV  +I A+P S    IT   D ++ V + AP  DGEAN      ++  LGV R  V I 
Sbjct: 10  SVFFSIKAQPRSSKSMITGEYDGSIKVNLKAPPVDGEANLECCRLLARTLGVARSSVEIV 69

Query: 99  SGSKSRDKIV---------IVEEITP 115
           SG++ + K V           E+ITP
Sbjct: 70  SGTRGKMKRVKVFGLSAVEFTEKITP 95


>gi|374850011|dbj|BAL53011.1| hypothetical conserved protein [uncultured candidate division OP1
           bacterium]
 gi|374857330|dbj|BAL60183.1| hypothetical conserved protein [uncultured candidate division OP1
           bacterium]
          Length = 105

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 46/83 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V +T+  KP ++  +I  V ++A+ V++ AP +  +AN A++  ++  L + + +V + S
Sbjct: 15  VVLTVRVKPRARRNAIIGVRNDALLVEVTAPPEQNKANDAVIALLAEALNISKSRVELLS 74

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G   RDK + +  +TP      L
Sbjct: 75  GQTHRDKRLRIWGLTPSQCWERL 97


>gi|332706094|ref|ZP_08426166.1| hypothetical protein LYNGBM3L_14430 [Moorea producens 3L]
 gi|332355186|gb|EGJ34654.1| hypothetical protein LYNGBM3L_14430 [Moorea producens 3L]
          Length = 75

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 36/65 (55%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I D  D  + V + +P  DG+AN  L+E ++    V + ++ I SG  S
Sbjct: 5   VQVKPNSKHQKIEDSDDGTLKVHLKSPPVDGKANQELIELLAQRFKVPKSRIQIKSGLSS 64

Query: 104 RDKIV 108
           R+K++
Sbjct: 65  RNKLI 69


>gi|222054893|ref|YP_002537255.1| hypothetical protein Geob_1796 [Geobacter daltonii FRC-32]
 gi|221564182|gb|ACM20154.1| protein of unknown function DUF167 [Geobacter daltonii FRC-32]
          Length = 102

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 43/80 (53%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V  T+H +P +    I  V  + + +++ AP  D  AN   +E ++  LGV +  +++ S
Sbjct: 19  VIFTVHVQPRASRNEICGVQGDELRLRLTAPPVDDAANKLCIELLAKALGVAKSHLALTS 78

Query: 100 GSKSRDKIVIVEEITPENVL 119
           G+KSR K +  E ++   VL
Sbjct: 79  GAKSRHKTIRAEGVSKNTVL 98


>gi|399009441|ref|ZP_10711876.1| TIGR00251 family protein [Pseudomonas sp. GM17]
 gi|398112211|gb|EJM02076.1| TIGR00251 family protein [Pseudomonas sp. GM17]
          Length = 97

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 40/69 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  E + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGERLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIVIV 110
            +R K V +
Sbjct: 73  LNRQKRVRI 81


>gi|91773909|ref|YP_566601.1| hypothetical protein Mbur_1971 [Methanococcoides burtonii DSM 6242]
 gi|91712924|gb|ABE52851.1| protein of unknown function DUF167 [Methanococcoides burtonii DSM
           6242]
          Length = 107

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 40  VSITIHAKPGSKSCSI---TDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++I +   PGSK        +   E + +++ + A+ G+AN  L+E ++    + +R+V 
Sbjct: 16  IAIDLEVTPGSKKACFPAGYNQWRERIEIKLTSAAQKGKANGQLIEIVADFFNIGQREVI 75

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSL 122
           IGSG+KS  K VI+     E V+ +L
Sbjct: 76  IGSGAKSSKKTVIINRPDQEQVVLAL 101


>gi|374852514|dbj|BAL55445.1| hypothetical conserved protein [uncultured candidate division OP1
           bacterium]
          Length = 105

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 46/83 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V +T+  KP ++  SI  + ++A+ V++ AP +  +AN A++  ++  L + + +V + S
Sbjct: 15  VVLTVRVKPRARRNSIIGMRNDALLVEVTAPPEQNKANDAVIALLAEALNISKSRVELLS 74

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G   RDK + +  +TP      L
Sbjct: 75  GQTHRDKRLRIWGLTPSQCWERL 97


>gi|114332111|ref|YP_748333.1| hypothetical protein Neut_2146 [Nitrosomonas eutropha C91]
 gi|122313207|sp|Q0AE64.1|Y2146_NITEC RecName: Full=UPF0235 protein Neut_2146
 gi|114309125|gb|ABI60368.1| protein of unknown function DUF167 [Nitrosomonas eutropha C91]
          Length = 99

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 49/87 (56%), Gaps = 2/87 (2%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            ++ + ++ +PG++      V  E + +++ AP  DG+AN AL  +++    V  + +++
Sbjct: 10  HNLILKLYIQPGARQTEAIGVHGEELKIKLAAPPMDGKANRALAVFLAKRFNVPLKHITL 69

Query: 98  GSGSKSRDKIV-IVEEIT-PENVLNSL 122
             G++SR K+V I + +  PE + N +
Sbjct: 70  KWGAQSRHKVVEIYQPVNGPEVLFNEI 96


>gi|33151941|ref|NP_873294.1| hypothetical protein HD0778 [Haemophilus ducreyi 35000HP]
 gi|47117473|sp|Q7VN15.1|Y778_HAEDU RecName: Full=UPF0235 protein HD_0778
 gi|33148163|gb|AAP95683.1| conserved hypothetical protein [Haemophilus ducreyi 35000HP]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 36/74 (48%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D  + V I AP  DG ANA LL+Y+S +  V +  + 
Sbjct: 10  PYGIRLRIFLQPKASRDQIVGLHDNELKVAITAPPVDGAANAYLLKYLSKLFKVPKSSIV 69

Query: 97  IGSGSKSRDKIVIV 110
           +  G   R K + V
Sbjct: 70  LEKGELQRHKQLFV 83


>gi|407693704|ref|YP_006818493.1| hypothetical protein ASU2_10630 [Actinobacillus suis H91-0380]
 gi|407389761|gb|AFU20254.1| hypothetical protein ASU2_10630 [Actinobacillus suis H91-0380]
          Length = 99

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 39/80 (48%)

Query: 33  RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKR 92
           R+  P  + + I  +P +    I  + D  + + I AP  DG ANA LL+Y+S +  V +
Sbjct: 6   RIENPCGIRLRIFLQPKASRDQIVGLHDNELKIAITAPPVDGAANAHLLKYLSKLFKVPK 65

Query: 93  RQVSIGSGSKSRDKIVIVEE 112
             + +  G   R K + V E
Sbjct: 66  SSIVLEKGELQRHKQLFVPE 85


>gi|389682972|ref|ZP_10174306.1| hypothetical protein PchlO6_5853 [Pseudomonas chlororaphis O6]
 gi|388553179|gb|EIM16438.1| hypothetical protein PchlO6_5853 [Pseudomonas chlororaphis O6]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 40/69 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  E + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGERLKIRLTAPPVEGKANAHLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIVIV 110
            +R K V +
Sbjct: 73  LNRQKRVRI 81


>gi|418460992|ref|ZP_13032074.1| hypothetical protein SZMC14600_08574 [Saccharomonospora azurea SZMC
           14600]
 gi|359738947|gb|EHK87825.1| hypothetical protein SZMC14600_08574 [Saccharomonospora azurea SZMC
           14600]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 41/73 (56%), Gaps = 4/73 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           +   +  K G+K   +    + A+G    V + APA DG+ANAA+   ++   GV+ R V
Sbjct: 1   MRFAVRVKAGAKRDVVGGTWNGALGEALVVSVRAPAVDGKANAAVCTVLAHAFGVRARDV 60

Query: 96  SIGSGSKSRDKIV 108
           ++  G +SRDK+V
Sbjct: 61  TVVKGQRSRDKLV 73


>gi|409095286|ref|ZP_11215310.1| hypothetical protein TzilA_01415 [Thermococcus zilligii AN1]
          Length = 94

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 50/85 (58%), Gaps = 7/85 (8%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG---VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           + IH +P +K  +I  + DE  G   V+I AP  +G+AN  ++++ S +LG    +V I 
Sbjct: 13  LFIHVQPNAKENAIEGI-DEWRGRLKVKIKAPPVEGKANREVVKFFSRLLGA---EVEIV 68

Query: 99  SGSKSRDKIVIVEEITPENVLNSLG 123
            G  SR+K ++V+ ++ E+V   LG
Sbjct: 69  KGEASREKDLLVKGLSEEDVRKKLG 93


>gi|372272874|ref|ZP_09508922.1| hypothetical protein MstaS_17414 [Marinobacterium stanieri S30]
          Length = 99

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 42/70 (60%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P +    I+    ++V ++I AP  +G ANAAL+++++   GV +R VSI SG 
Sbjct: 15  LHCHLQPKASRDEISGQHGDSVKIRIAAPPIEGRANAALIKFLAKSFGVAKRDVSIISGE 74

Query: 102 KSRDKIVIVE 111
             R+K V ++
Sbjct: 75  LGREKRVRIQ 84


>gi|190150700|ref|YP_001969225.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae
           serovar 7 str. AP76]
 gi|307264052|ref|ZP_07545650.1| hypothetical protein appser13_14550 [Actinobacillus
           pleuropneumoniae serovar 13 str. N273]
 gi|226734158|sp|B3GYF9.1|Y1431_ACTP7 RecName: Full=UPF0235 protein APP7_1431
 gi|189915831|gb|ACE62083.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae
           serovar 7 str. AP76]
 gi|306870598|gb|EFN02344.1| hypothetical protein appser13_14550 [Actinobacillus
           pleuropneumoniae serovar 13 str. N273]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 37/76 (48%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D  + + I AP  DG ANA LL+Y+S +  V +  + 
Sbjct: 10  PYGIRLRIFLQPKASRDQIVGLHDSELKIAITAPPVDGAANAHLLKYLSKLFKVPKSSIV 69

Query: 97  IGSGSKSRDKIVIVEE 112
           +  G   R K + V E
Sbjct: 70  LEKGELQRHKQLFVPE 85


>gi|325294222|ref|YP_004280736.1| yggU [Desulfurobacterium thermolithotrophum DSM 11699]
 gi|325064670|gb|ADY72677.1| UPF0235 protein yggU [Desulfurobacterium thermolithotrophum DSM
           11699]
          Length = 101

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 45/86 (52%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
            ++ I +  +P +    I  V +  + +++  P + G+AN  ++E +S  L V +R + I
Sbjct: 9   EAIEIEVKVQPKASRNKIEKVEEGRLKIKVTVPPEGGKANQKIIELLSKALKVPKRDIDI 68

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLG 123
             G  SR K+V +E +T  N+   LG
Sbjct: 69  VKGETSRIKVVRIEGVTLANLQEKLG 94


>gi|406992901|gb|EKE12161.1| hypothetical protein ACD_13C00289G0001 [uncultured bacterium]
          Length = 75

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 44/76 (57%), Gaps = 4/76 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KPG+K   I   S+    + + A AK+ EAN  L++ +S   G+ + ++SI S
Sbjct: 1   MRIHVLLKPGAKEPKIEQ-SNGIFRISVKAVAKENEANNELIDALSDYFGIAKSRISIVS 59

Query: 100 GSKSRDKIVIVEEITP 115
           G  SR+K+V   EITP
Sbjct: 60  GFNSRNKVV---EITP 72


>gi|260892905|ref|YP_003239002.1| hypothetical protein Adeg_1019 [Ammonifex degensii KC4]
 gi|260865046|gb|ACX52152.1| protein of unknown function DUF167 [Ammonifex degensii KC4]
          Length = 103

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 44/73 (60%), Gaps = 1/73 (1%)

Query: 44  IHAKPGSKSCSI-TDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSK 102
           IH +   +S ++  +  +  + V++ AP  +G+AN  LLE++S VL +  R++ +  G K
Sbjct: 20  IHLRVTPRSSTLALEAGEGFLRVRLTAPPVEGKANELLLEFLSRVLDIPARRLQLVKGLK 79

Query: 103 SRDKIVIVEEITP 115
            R+K+V+V+   P
Sbjct: 80  GREKVVLVDMPLP 92


>gi|255582001|ref|XP_002531798.1| conserved hypothetical protein [Ricinus communis]
 gi|223528564|gb|EEF30586.1| conserved hypothetical protein [Ricinus communis]
          Length = 176

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/27 (85%), Positives = 25/27 (92%)

Query: 86  SVLGVKRRQVSIGSGSKSRDKIVIVEE 112
           +VLGVKRRQVSI SGSKSRDK+VIV E
Sbjct: 121 NVLGVKRRQVSIRSGSKSRDKVVIVGE 147


>gi|375096334|ref|ZP_09742599.1| hypothetical protein SacmaDRAFT_3687 [Saccharomonospora marina
           XMU15]
 gi|374657067|gb|EHR51900.1| hypothetical protein SacmaDRAFT_3687 [Saccharomonospora marina
           XMU15]
          Length = 92

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 48/89 (53%), Gaps = 5/89 (5%)

Query: 44  IHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  KPG+K  ++  V D A+G    V + A A DG AN AL   +++   V+ R V++  
Sbjct: 1   MRVKPGAKRDAVGGVWDGALGEALVVSVRARAVDGAANDALCRVLAAAFAVRPRDVAVVK 60

Query: 100 GSKSRDKIVIVEEITPENVLNSLGKASSS 128
           G +SRDK+V V    PE   + +G   +S
Sbjct: 61  GLRSRDKVVEVRG-APEGSQDRVGALRNS 88


>gi|381162538|ref|ZP_09871768.1| hypothetical protein SacazDRAFT_01439 [Saccharomonospora azurea
           NA-128]
 gi|379254443|gb|EHY88369.1| hypothetical protein SacazDRAFT_01439 [Saccharomonospora azurea
           NA-128]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 41/73 (56%), Gaps = 4/73 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           +   +  K G+K   +    + A+G    V + APA DG+ANAA+   ++   GV+ R V
Sbjct: 1   MRFAVRVKAGAKRDVVGGTWNGALGEALVVSVRAPAVDGKANAAVCTVLAHAFGVRARDV 60

Query: 96  SIGSGSKSRDKIV 108
           ++  G +SRDK+V
Sbjct: 61  TVVKGQRSRDKLV 73


>gi|221632637|ref|YP_002521858.1| hypothetical protein trd_0618 [Thermomicrobium roseum DSM 5159]
 gi|221157137|gb|ACM06264.1| DUF167 [Thermomicrobium roseum DSM 5159]
          Length = 102

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/85 (25%), Positives = 45/85 (52%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V   +  +P +    +     +A+ V++ AP +DGEAN A+L  ++  L + R  + I 
Sbjct: 12  AVEFWVQVQPRAPRAEVAGSRRDALLVRVTAPPRDGEANEAVLRLLAETLHLPRGSIRII 71

Query: 99  SGSKSRDKIVIVEEITPENVLNSLG 123
           +G+  R K + ++ +T + +L  L 
Sbjct: 72  AGTAQRRKRIRIDGLTSKELLERLA 96


>gi|416067426|ref|ZP_11582302.1| hypothetical protein D18P1_0403 [Aggregatibacter
           actinomycetemcomitans serotype f str. D18P1]
 gi|348002128|gb|EGY42839.1| hypothetical protein D18P1_0403 [Aggregatibacter
           actinomycetemcomitans serotype f str. D18P1]
          Length = 97

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 41/73 (56%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +++ + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +
Sbjct: 9   ANLRLRIFLQPKTAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKVPKSSIVL 68

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V++
Sbjct: 69  EKGELNRHKQVLI 81


>gi|452002436|gb|EMD94894.1| hypothetical protein COCHEDRAFT_1055510, partial [Cochliobolus
           heterostrophus C5]
          Length = 75

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 41/74 (55%), Gaps = 2/74 (2%)

Query: 39  SVSITIHAKPG--SKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++ +  H KPG  +    I  +SDE + + + A  +DGEAN A+   ++  LGV +  V 
Sbjct: 2   TIQLLCHVKPGVSANREGIAAISDERIQLCVAAQTRDGEANKAVRRVVAGALGVPKSDVE 61

Query: 97  IGSGSKSRDKIVIV 110
           +  G KSRDK V +
Sbjct: 62  VIKGMKSRDKTVAI 75


>gi|269127125|ref|YP_003300495.1| hypothetical protein Tcur_2915 [Thermomonospora curvata DSM 43183]
 gi|268312083|gb|ACY98457.1| protein of unknown function DUF167 [Thermomonospora curvata DSM
           43183]
          Length = 91

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 43/73 (58%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V + I   PG+    +     EA+ V++ A A +G+A  A L  ++  LGV+RR V + 
Sbjct: 3   AVRVAIRVGPGASRTKVGGAHGEALVVKVAARAVEGKATEAALRAVADALGVRRRDVRLI 62

Query: 99  SGSKSRDKIVIVE 111
           SG+ SR+K++ V+
Sbjct: 63  SGATSREKLIEVD 75


>gi|365966924|ref|YP_004948486.1| hypothetical protein ANH9381_0735 [Aggregatibacter
           actinomycetemcomitans ANH9381]
 gi|416074797|ref|ZP_11584695.1| hypothetical protein SCC1398_0565 [Aggregatibacter
           actinomycetemcomitans serotype b str. SCC1398]
 gi|416084621|ref|ZP_11587051.1| hypothetical protein I23C_0861 [Aggregatibacter
           actinomycetemcomitans serotype b str. I23C]
 gi|444337775|ref|ZP_21151706.1| hypothetical protein SCC4092_0652 [Aggregatibacter
           actinomycetemcomitans serotype b str. SCC4092]
 gi|348006605|gb|EGY47008.1| hypothetical protein SCC1398_0565 [Aggregatibacter
           actinomycetemcomitans serotype b str. SCC1398]
 gi|348010355|gb|EGY50407.1| hypothetical protein I23C_0861 [Aggregatibacter
           actinomycetemcomitans serotype b str. I23C]
 gi|365745837|gb|AEW76742.1| hypothetical protein ANH9381_0735 [Aggregatibacter
           actinomycetemcomitans ANH9381]
 gi|443546317|gb|ELT55992.1| hypothetical protein SCC4092_0652 [Aggregatibacter
           actinomycetemcomitans serotype b str. SCC4092]
          Length = 97

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 41/73 (56%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +++ + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +
Sbjct: 9   ANLRLRIFLQPKAAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKVPKSSIVL 68

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V++
Sbjct: 69  EKGELNRHKQVLI 81


>gi|30248412|ref|NP_840482.1| hypothetical protein NE0395 [Nitrosomonas europaea ATCC 19718]
 gi|47117515|sp|Q82X93.1|Y395_NITEU RecName: Full=UPF0235 protein NE0395
 gi|30138298|emb|CAD84306.1| DUF167 [Nitrosomonas europaea ATCC 19718]
          Length = 100

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 42/73 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S + + ++ +PG++      +  E + +++ A   DG+AN AL E+++    V R+ +++
Sbjct: 11  SLLILKLYVQPGARQTEAVGICGEELKIKLAALPVDGKANRALTEFLAKRFNVPRKNITL 70

Query: 98  GSGSKSRDKIVIV 110
             G +SR K+V V
Sbjct: 71  KRGEQSRHKVVEV 83


>gi|152979630|ref|YP_001345259.1| hypothetical protein Asuc_1977 [Actinobacillus succinogenes 130Z]
 gi|171704390|sp|A6VQS7.1|Y1977_ACTSZ RecName: Full=UPF0235 protein Asuc_1977
 gi|150841353|gb|ABR75324.1| protein of unknown function DUF167 [Actinobacillus succinogenes
           130Z]
          Length = 98

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 43/85 (50%), Gaps = 2/85 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +   +I  + DE + + I AP  DG ANA L++Y+S    V +  V +  
Sbjct: 11  IRLRIMLQPKASKDAIIGLHDEELKISITAPPVDGAANAHLIKYLSKAFKVPKSAVQLEK 70

Query: 100 GSKSRDKIVIV--EEITPENVLNSL 122
           G  +R K V +   +I PE V   L
Sbjct: 71  GELNRHKQVFIPAPKIIPEAVRQLL 95


>gi|443472281|ref|ZP_21062310.1| Hypothetical protein ppKF707_3298 [Pseudomonas pseudoalcaligenes
           KF707]
 gi|442902623|gb|ELS28139.1| Hypothetical protein ppKF707_3298 [Pseudomonas pseudoalcaligenes
           KF707]
          Length = 98

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 40/71 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P +       +  E + +++ AP  DG+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LDCHLQPKASRDEFAGLHGERLKIRLTAPPVDGKANAHLMAFLAEAFGVAKSQVSLESGE 72

Query: 102 KSRDKIVIVEE 112
            +R K + +++
Sbjct: 73  LNRQKRIRIQK 83


>gi|428215927|ref|YP_007089071.1| hypothetical protein Oscil6304_5676 [Oscillatoria acuminata PCC
           6304]
 gi|428004308|gb|AFY85151.1| hypothetical protein Oscil6304_5676 [Oscillatoria acuminata PCC
           6304]
          Length = 73

 Score = 45.8 bits (107), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 39/65 (60%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           I  KP SK   IT+  D ++ +++ +P  DG+ANA L++ ++    V +  + I SG  S
Sbjct: 6   IKVKPNSKQQQITEEPDGSLTIRLKSPPVDGKANAELIKLLADKFEVPKSAIVIKSGFSS 65

Query: 104 RDKIV 108
           R+K+V
Sbjct: 66  RNKVV 70


>gi|325108674|ref|YP_004269742.1| hypothetical protein Plabr_2117 [Planctomyces brasiliensis DSM
           5305]
 gi|324968942|gb|ADY59720.1| UPF0235 protein yggU [Planctomyces brasiliensis DSM 5305]
          Length = 109

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 11/111 (9%)

Query: 12  KSAGSTQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPA 71
           +++G     I+  DEN            V + +   PG+K  ++  V D A+ V +   A
Sbjct: 3   EASGENAVAIEQTDEN-----------GVLLPLRVTPGAKRNAVGGVHDGALKVAVTQIA 51

Query: 72  KDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           + G+AN  +L+ ++  LG+K+ Q+++ SG  SR+K +   +++   +L  +
Sbjct: 52  ERGKANQQVLKILAKALGLKKSQLTLVSGETSRNKRIACRDVSAAELLQRI 102


>gi|387120571|ref|YP_006286454.1| hypothetical protein D7S_00821 [Aggregatibacter
           actinomycetemcomitans D7S-1]
 gi|416037652|ref|ZP_11573989.1| hypothetical protein H5P1_1770 [Aggregatibacter
           actinomycetemcomitans serotype a str. H5P1]
 gi|416045179|ref|ZP_11575274.1| hypothetical protein I63B_0735 [Aggregatibacter
           actinomycetemcomitans serotype d str. I63B]
 gi|416055471|ref|ZP_11579550.1| hypothetical protein SCC393_0537 [Aggregatibacter
           actinomycetemcomitans serotype e str. SCC393]
 gi|429734958|ref|ZP_19268955.1| TIGR00251 family protein [Aggregatibacter actinomycetemcomitans Y4]
 gi|444334140|ref|ZP_21149763.1| hypothetical protein A160_0973 [Aggregatibacter
           actinomycetemcomitans serotype a str. A160]
 gi|347995606|gb|EGY36772.1| hypothetical protein H5P1_1770 [Aggregatibacter
           actinomycetemcomitans serotype a str. H5P1]
 gi|347995660|gb|EGY36821.1| hypothetical protein I63B_0735 [Aggregatibacter
           actinomycetemcomitans serotype d str. I63B]
 gi|348003148|gb|EGY43804.1| hypothetical protein SCC393_0537 [Aggregatibacter
           actinomycetemcomitans serotype e str. SCC393]
 gi|385875063|gb|AFI86622.1| hypothetical protein D7S_00821 [Aggregatibacter
           actinomycetemcomitans D7S-1]
 gi|429150724|gb|EKX93620.1| TIGR00251 family protein [Aggregatibacter actinomycetemcomitans Y4]
 gi|443550821|gb|ELT58954.1| hypothetical protein A160_0973 [Aggregatibacter
           actinomycetemcomitans serotype a str. A160]
          Length = 97

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 41/73 (56%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +++ + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +
Sbjct: 9   ANLRLRIFLQPKAAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKVPKSSIVL 68

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V++
Sbjct: 69  EKGELNRHKQVLI 81


>gi|410642878|ref|ZP_11353387.1| hypothetical protein GCHA_3644 [Glaciecola chathamensis S18K6]
 gi|410137761|dbj|GAC11574.1| hypothetical protein GCHA_3644 [Glaciecola chathamensis S18K6]
          Length = 98

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 42/73 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I+ +P +    I  +  +A+ + I AP  DG+ANA L +Y++   GV + +V+I
Sbjct: 10  AELQLRIYIQPKASRDEIVGMHGDALKIAITAPPVDGKANAHLCKYLAKQCGVPKSKVAI 69

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V+V
Sbjct: 70  TKGQLNRHKTVVV 82


>gi|170725690|ref|YP_001759716.1| hypothetical protein Swoo_1329 [Shewanella woodyi ATCC 51908]
 gi|226695928|sp|B1KIX3.1|Y1329_SHEWM RecName: Full=UPF0235 protein Swoo_1329
 gi|169811037|gb|ACA85621.1| protein of unknown function DUF167 [Shewanella woodyi ATCC 51908]
          Length = 95

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 2/83 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  V  E + + I AP  DG+ANA L++Y+S    V +  ++I  G 
Sbjct: 13  LNLYIQPKASRDQIVGVHGEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGDINILKGE 72

Query: 102 KSRDKI--VIVEEITPENVLNSL 122
           + R K   VI   + PEN+ + L
Sbjct: 73  QGRHKQVKVISPRVIPENISSQL 95


>gi|410646150|ref|ZP_11356604.1| hypothetical protein GAGA_2150 [Glaciecola agarilytica NO2]
 gi|410134489|dbj|GAC05003.1| hypothetical protein GAGA_2150 [Glaciecola agarilytica NO2]
          Length = 98

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 42/73 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I+ +P +    I  +  +A+ + I AP  DG+ANA L +Y++   GV + +V+I
Sbjct: 10  AELQLRIYIQPKASRDEIVGMHGDALKIAITAPPVDGKANAHLCKYLAKQCGVPKSKVAI 69

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V+V
Sbjct: 70  TKGQLNRHKTVVV 82


>gi|451979710|ref|ZP_21928123.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
 gi|451763079|emb|CCQ89320.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
          Length = 99

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 12/91 (13%)

Query: 21  IKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAAL 80
           IK ND  L          + S+TI  +P +    I  V D A+ V++ +P  +G AN A 
Sbjct: 5   IKKNDHGL----------TFSVTI--QPRTSRNEIAGVHDGALKVRLTSPPVEGAANKAC 52

Query: 81  LEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           L+ +   LG+   ++SI SGS SR+K++ V+
Sbjct: 53  LKLLGKTLGMAPSKLSIVSGSTSRNKVIQVD 83


>gi|162456439|ref|YP_001618806.1| hypothetical protein sce8156 [Sorangium cellulosum So ce56]
 gi|161167021|emb|CAN98326.1| hypothetical protein sce8156 [Sorangium cellulosum So ce56]
          Length = 139

 Score = 45.8 bits (107), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 40/71 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V I++  +P S   +I  V + A+ V + AP  +G ANA L++ +S  L V++  V I  
Sbjct: 52  VRISVQVRPKSSRSAIVGVREGALDVSLTAPPVEGAANAELVKLLSRALDVRKSDVQIAL 111

Query: 100 GSKSRDKIVIV 110
           G+  R K+V V
Sbjct: 112 GASGRSKVVAV 122


>gi|332304862|ref|YP_004432713.1| hypothetical protein Glaag_0482 [Glaciecola sp. 4H-3-7+YE-5]
 gi|332172191|gb|AEE21445.1| protein of unknown function DUF167 [Glaciecola sp. 4H-3-7+YE-5]
          Length = 98

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 42/73 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I+ +P +    I  +  +A+ + I AP  DG+ANA L +Y++   GV + +V+I
Sbjct: 10  AELQLRIYIQPKAARDEIVGMHGDALKIAITAPPVDGKANAHLCKYLAKQCGVAKSKVAI 69

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V+V
Sbjct: 70  TKGQLNRHKTVVV 82


>gi|312136610|ref|YP_004003947.1| hypothetical protein Mfer_0383 [Methanothermus fervidus DSM 2088]
 gi|311224329|gb|ADP77185.1| protein of unknown function DUF167 [Methanothermus fervidus DSM
           2088]
          Length = 97

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 46/85 (54%), Gaps = 5/85 (5%)

Query: 38  SSVSITIHAKPGSKSCSI--TDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           S V + IH  P SK   I   D   + + V + +PA+ G+AN  ++E  S +     ++V
Sbjct: 9   SDVLLQIHVIPSSKKFGIEKYDSWRKRLYVTVKSPARKGKANKEIIEEFSKLFN---KEV 65

Query: 96  SIGSGSKSRDKIVIVEEITPENVLN 120
            I  G KSRDK ++V+++  + ++ 
Sbjct: 66  KIVKGIKSRDKTLVVKDVEYKKIME 90


>gi|374628064|ref|ZP_09700449.1| UPF0235 protein yggU [Methanoplanus limicola DSM 2279]
 gi|373906177|gb|EHQ34281.1| UPF0235 protein yggU [Methanoplanus limicola DSM 2279]
          Length = 105

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 3/88 (3%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDE---AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           + ++IT+   PGSK        +E   A+  +I AP  DG+AN  +   +S    + +R 
Sbjct: 12  TGITITLDVSPGSKKTQFPSGYNEWRKAILCRISAPPVDGKANKEIERAVSDFFSIPKRN 71

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSL 122
           VSI SG  S  K V++E I P+  +  L
Sbjct: 72  VSIISGQTSSLKRVLIEGIEPDRAVELL 99


>gi|378953422|ref|YP_005210910.1| protein YggU [Pseudomonas fluorescens F113]
 gi|359763436|gb|AEV65515.1| YggU [Pseudomonas fluorescens F113]
          Length = 97

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 40/70 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLVSGE 72

Query: 102 KSRDKIVIVE 111
            +R K V + 
Sbjct: 73  LNRQKRVRIH 82


>gi|171057075|ref|YP_001789424.1| hypothetical protein Lcho_0384 [Leptothrix cholodnii SP-6]
 gi|170774520|gb|ACB32659.1| protein of unknown function DUF167 [Leptothrix cholodnii SP-6]
          Length = 125

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 41/70 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +   P ++   +  + D+A+ +++ AP  DG AN AL  +++  LGV ++QVS+  G+
Sbjct: 35  IDVAVVPNARRTEVVGLHDQALRLRLAAPPVDGAANDALQRWLADELGVSKQQVSLLRGA 94

Query: 102 KSRDKIVIVE 111
             R K + V+
Sbjct: 95  SGRRKRLRVQ 104


>gi|330812367|ref|YP_004356829.1| hypothetical protein PSEBR_a5324 [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
 gi|423699875|ref|ZP_17674365.1| hypothetical protein PflQ8_5485 [Pseudomonas fluorescens Q8r1-96]
 gi|327380475|gb|AEA71825.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
 gi|387996343|gb|EIK57673.1| hypothetical protein PflQ8_5485 [Pseudomonas fluorescens Q8r1-96]
          Length = 97

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 40/70 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLVSGE 72

Query: 102 KSRDKIVIVE 111
            +R K V + 
Sbjct: 73  LNRQKRVRIH 82


>gi|327401804|ref|YP_004342643.1| hypothetical protein Arcve_1935 [Archaeoglobus veneficus SNP6]
 gi|327317312|gb|AEA47928.1| UPF0235 protein yggU [Archaeoglobus veneficus SNP6]
          Length = 101

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 48/88 (54%), Gaps = 5/88 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSD--EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           V I+I   P SK   +   ++  +++ V++ AP K G+ANA ++E  S +    R++V I
Sbjct: 14  VYISIEVTPNSKQSCVYGYNEWRKSIAVRVKAPPKGGKANAEIVELFSKIF---RKKVEI 70

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLGKA 125
             G  S  K++ V   +P+ V + L K+
Sbjct: 71  VKGHTSSQKVIFVHSASPQEVESLLVKS 98


>gi|443479055|ref|ZP_21068717.1| UPF0235 protein yggU [Pseudanabaena biceps PCC 7429]
 gi|443015534|gb|ELS30442.1| UPF0235 protein yggU [Pseudanabaena biceps PCC 7429]
          Length = 74

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 41/69 (59%)

Query: 43  TIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSK 102
           T+  KP SK  +I +  D ++ V + +P  DG+AN  L+E ++   GV + +++I SG  
Sbjct: 4   TVKVKPNSKKQAIVEQIDGSLLVHLKSPPIDGKANQELVELLAKEFGVSKSKITIQSGLS 63

Query: 103 SRDKIVIVE 111
            R K+V ++
Sbjct: 64  GRYKLVNID 72


>gi|261416137|ref|YP_003249820.1| hypothetical protein Fisuc_1745 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|385791007|ref|YP_005822130.1| hypothetical protein FSU_2243 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|261372593|gb|ACX75338.1| protein of unknown function DUF167 [Fibrobacter succinogenes subsp.
           succinogenes S85]
 gi|302328014|gb|ADL27215.1| conserved hypothetical protein [Fibrobacter succinogenes subsp.
           succinogenes S85]
          Length = 74

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 41/69 (59%), Gaps = 2/69 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I +HA+  SK  S+T   D +  V++ AP  DG ANAA+ E ++    V +R VS+  
Sbjct: 3   INIKVHAR--SKRESVTPQPDGSYKVEVKAPPVDGAANAAICELIADYFHVHKRDVSVVM 60

Query: 100 GSKSRDKIV 108
           GS +  K++
Sbjct: 61  GSTNNKKVI 69


>gi|338813357|ref|ZP_08625486.1| hypothetical protein ALO_14457 [Acetonema longum DSM 6540]
 gi|337274716|gb|EGO63224.1| hypothetical protein ALO_14457 [Acetonema longum DSM 6540]
          Length = 108

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/83 (27%), Positives = 46/83 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V++ I  +P S    ++ +  ++V   ++AP  DGEAN AL++ +S V  + + ++ I +
Sbjct: 18  VTVKIKLQPRSSRSEVSGLCGDSVKACVNAPPADGEANQALIQLLSDVFHIPKSRIEIIA 77

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G K+R+K +    +    +L  L
Sbjct: 78  GHKNRNKRIKFSGVDSAFILRQL 100


>gi|237749382|ref|ZP_04579862.1| predicted protein [Oxalobacter formigenes OXCC13]
 gi|229380744|gb|EEO30835.1| predicted protein [Oxalobacter formigenes OXCC13]
          Length = 100

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 38/67 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +   P +K   I     EA+ +++ AP  DG+AN AL+++++  L   +R VSI  G 
Sbjct: 15  IAVQVSPNAKKTEIVSSDGEALRIRLQAPPVDGKANEALVQFIAKKLRTPKRNVSITHGL 74

Query: 102 KSRDKIV 108
            ++ K++
Sbjct: 75  SAKHKLL 81


>gi|310801877|gb|EFQ36770.1| hypothetical protein GLRG_11914 [Glomerella graminicola M1.001]
          Length = 92

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 2/74 (2%)

Query: 39  SVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++ +    KPG+      I  V+D  V + + A A++GEAN A+++ +S +LG+ +  + 
Sbjct: 19  TLYLQCRVKPGASRVREGIVAVTDGGVELCVAAQAREGEANRAVIKLLSEILGLPKSDLI 78

Query: 97  IGSGSKSRDKIVIV 110
           I  G KSRDK V V
Sbjct: 79  ISQGLKSRDKTVAV 92


>gi|416050873|ref|ZP_11577157.1| hypothetical protein SC1083_0304 [Aggregatibacter
           actinomycetemcomitans serotype e str. SC1083]
 gi|418464692|ref|ZP_13035631.1| hypothetical protein RHAA1_03356 [Aggregatibacter
           actinomycetemcomitans RhAA1]
 gi|347993686|gb|EGY35027.1| hypothetical protein SC1083_0304 [Aggregatibacter
           actinomycetemcomitans serotype e str. SC1083]
 gi|359756647|gb|EHK90804.1| hypothetical protein RHAA1_03356 [Aggregatibacter
           actinomycetemcomitans RhAA1]
          Length = 97

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 40/73 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +
Sbjct: 9   ADLRLRIFLQPKAAKDQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKVPKSSIML 68

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V++
Sbjct: 69  EKGELNRHKQVLI 81


>gi|116328001|ref|YP_797721.1| hypothetical protein LBL_1291 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|116330879|ref|YP_800597.1| hypothetical protein LBJ_1240 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
 gi|122281367|sp|Q04TD1.1|Y1240_LEPBJ RecName: Full=UPF0235 protein LBJ_1240
 gi|122284207|sp|Q052F7.1|Y1291_LEPBL RecName: Full=UPF0235 protein LBL_1291
 gi|116120745|gb|ABJ78788.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|116124568|gb|ABJ75839.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
          Length = 73

 Score = 45.1 bits (105), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T+  KP SK        D +V + +  PA +G+AN A++E +S  + + +R++ I S
Sbjct: 1   MKFTVRVKPNSKKIFFRKEEDGSVTIAVREPALEGKANEAVIETISREMKIPKRKIRIVS 60

Query: 100 GSKSRDKIVIVEEITP 115
           G K + K +   EI P
Sbjct: 61  GEKGKKKTI---EIDP 73


>gi|340053341|emb|CCC47629.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 160

 Score = 45.1 bits (105), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 45/78 (57%), Gaps = 11/78 (14%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMS 85
           P C RL+        +HAKPG+++ S+       D ++ V+I AP  DG+AN  L+ +M 
Sbjct: 12  PGCFRLI--------VHAKPGARATSLACRPAPGDASLEVKIAAPPVDGKANVELVAFMQ 63

Query: 86  SVLGVKRRQVSIGSGSKS 103
           ++L  +++Q+ + S + S
Sbjct: 64  ALLEQQQKQLKLLSSNTS 81


>gi|121997739|ref|YP_001002526.1| hypothetical protein Hhal_0948 [Halorhodospira halophila SL1]
 gi|121589144|gb|ABM61724.1| protein of unknown function DUF167 [Halorhodospira halophila SL1]
          Length = 101

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 37  PSSVSITIHAK--PGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           P   +  +H +  P +K  S+ DV  E + V+++ P  DG+AN AL + ++   GV +  
Sbjct: 8   PEQDAWLVHVRVTPRAKRESL-DVEGERLRVRLNTPPVDGKANTALRKLLARQFGVAKSA 66

Query: 95  VSIGSGSKSRDKIVIV 110
           VS+  G +SRDK V +
Sbjct: 67  VSLLRGERSRDKTVRI 82


>gi|427715535|ref|YP_007063529.1| hypothetical protein Cal7507_0193 [Calothrix sp. PCC 7507]
 gi|427347971|gb|AFY30695.1| UPF0235 protein yggU [Calothrix sp. PCC 7507]
          Length = 75

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 41/72 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + IT+  KP +K   I + SD ++ V + +P  DG+AN  L++ ++    V +  + + S
Sbjct: 1   MQITVKVKPNAKQQKIEEQSDGSLNVLLKSPPVDGKANEELIKLLAEKFNVPKSNIRVKS 60

Query: 100 GSKSRDKIVIVE 111
           G  SR K+V ++
Sbjct: 61  GVSSRRKLVEIQ 72


>gi|384253623|gb|EIE27097.1| hypothetical protein COCSUDRAFT_11640 [Coccomyxa subellipsoidea
           C-169]
          Length = 234

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 48/95 (50%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +P CI  +  ++  I++     +    I  VS +AV + I        A    L++M SV
Sbjct: 130 VPPCILPIDNNTTQISLEIDDRADRPEIIKVSADAVRIAITHGIAHEAAGEEALDFMRSV 189

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           LGV+  Q+S+  G  +R K+V+V+++ P    + L
Sbjct: 190 LGVRLGQLSLMRGESTRHKLVLVKDLAPAVAFDKL 224


>gi|417839599|ref|ZP_12485773.1| Hypothetical protein GG7_0792 [Haemophilus haemolyticus M19107]
 gi|341952137|gb|EGT78675.1| Hypothetical protein GG7_0792 [Haemophilus haemolyticus M19107]
          Length = 95

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 24/85 (28%), Positives = 44/85 (51%), Gaps = 2/85 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S    V +  + +  
Sbjct: 11  IRLRIFLQPKASKDHIAGIHDDELKITITAPPVDGQANAHLLKFLSKSFKVPKSSIILEK 70

Query: 100 GSKSRDKIVIVEE--ITPENVLNSL 122
           G  SR K V + +  + P+ + N L
Sbjct: 71  GELSRHKQVWIPDPKLIPKEIQNLL 95


>gi|307152217|ref|YP_003887601.1| hypothetical protein Cyan7822_2348 [Cyanothece sp. PCC 7822]
 gi|306982445|gb|ADN14326.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7822]
          Length = 73

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 41/69 (59%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KP +K   I ++ D ++ + + +P  DG+AN  L++ ++    V + Q+SI S
Sbjct: 1   MKIQVKVKPNAKQQKIEELEDGSLVISLKSPPVDGKANEELIKLLAKKYQVSKSQISIQS 60

Query: 100 GSKSRDKIV 108
           G  SR+K++
Sbjct: 61  GLSSRNKLI 69


>gi|386811897|ref|ZP_10099122.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386404167|dbj|GAB62003.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 97

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 48/89 (53%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S V ++I  +P S    I       + + + A A+ G+AN A++E ++    V+   + I
Sbjct: 9   SGVILSIRTQPRSSKNRIIGEYGGRLKLAVTAIAEKGKANKAIIELLADTFHVRESSIHI 68

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLGKAS 126
            SG  SRDK +++E +TPE++ + L   S
Sbjct: 69  ISGESSRDKRLMIEGLTPEDLKSLLNHQS 97


>gi|399522701|ref|ZP_10763364.1| UPF0235 protein PFL_5841 [Pseudomonas pseudoalcaligenes CECT 5344]
 gi|399109565|emb|CCH39925.1| UPF0235 protein PFL_5841 [Pseudomonas pseudoalcaligenes CECT 5344]
          Length = 104

 Score = 45.1 bits (105), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 35/62 (56%)

Query: 45  HAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSR 104
           H +P +       +  E + +++ AP  DG+ANA LL +++ V GV + QV + SG  +R
Sbjct: 24  HLQPKASKDEFAGLQGERLKIRLTAPPVDGKANAHLLAFLAKVFGVAKSQVILESGELNR 83

Query: 105 DK 106
            K
Sbjct: 84  HK 85


>gi|398995154|ref|ZP_10698043.1| TIGR00251 family protein [Pseudomonas sp. GM21]
 gi|398130721|gb|EJM20055.1| TIGR00251 family protein [Pseudomonas sp. GM21]
          Length = 96

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|377577175|ref|ZP_09806158.1| hypothetical protein YggU [Escherichia hermannii NBRC 105704]
 gi|377541703|dbj|GAB51323.1| hypothetical protein YggU [Escherichia hermannii NBRC 105704]
          Length = 95

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 42/73 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ ++ +P +   +I  +  + V + I AP  DG+ANA L+++++    V + QVSI  
Sbjct: 11  LTLRLYIQPKASRDAIIGLHGDEVKIAITAPPVDGQANAHLVKFLAKQFRVPKSQVSIEK 70

Query: 100 GSKSRDKIVIVEE 112
           G   R K +++ E
Sbjct: 71  GETGRHKHIVITE 83


>gi|376295104|ref|YP_005166334.1| hypothetical protein DND132_0313 [Desulfovibrio desulfuricans
           ND132]
 gi|323457665|gb|EGB13530.1| protein of unknown function DUF167 [Desulfovibrio desulfuricans
           ND132]
          Length = 98

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 40/69 (57%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S  ++ + A+PG++   +       + V++ APA D +AN  L+ Y++ +L +K+ QV I
Sbjct: 8   SGWALDVWAQPGARKDEVAGEYQGCLKVRLRAPAVDNKANKGLVAYIARLLQLKKSQVEI 67

Query: 98  GSGSKSRDK 106
            SG  SR K
Sbjct: 68  VSGHASRRK 76


>gi|406890629|gb|EKD36479.1| hypothetical protein ACD_75C01498G0004 [uncultured bacterium]
          Length = 103

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/101 (23%), Positives = 57/101 (56%), Gaps = 1/101 (0%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +P  +  +   +V + ++ +P +    I  + D  + + + AP  DG+AN  ++ ++S +
Sbjct: 1   MPPYLTTLADGAVLLHMYVQPKASKSRIVGLYDGCLKLAVAAPPVDGKANEEVVRFLSRL 60

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKASSS 128
           LG+  R +++ SG++S+ K + +E    ++VL+ + K  +S
Sbjct: 61  LGIPGRNIALHSGAQSKRKKIRIEAAC-DDVLDKIEKILAS 100


>gi|322513549|ref|ZP_08066649.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC
           25976]
 gi|322120620|gb|EFX92514.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC
           25976]
          Length = 99

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 36/76 (47%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D  + + I  P  DG ANA LL+Y+S +  V +  + 
Sbjct: 10  PCGIRLRIFLQPKASRDQIVGLHDNELKIAITTPPVDGAANAHLLKYLSKLFKVPKSSIV 69

Query: 97  IGSGSKSRDKIVIVEE 112
           +  G   R K + V E
Sbjct: 70  LEKGELQRHKQLFVPE 85


>gi|258405784|ref|YP_003198526.1| hypothetical protein Dret_1664 [Desulfohalobium retbaense DSM 5692]
 gi|257798011|gb|ACV68948.1| protein of unknown function DUF167 [Desulfohalobium retbaense DSM
           5692]
          Length = 105

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 44/79 (55%), Gaps = 1/79 (1%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  +P +K  ++  V  + + +++ AP  D +AN A+  ++++ LG++   V +GSG 
Sbjct: 24  LRLWVQPKAKKTAVVGVYQDCLKIKLQAPPVDNKANQAVCRFVAARLGLRPADVDLGSGQ 83

Query: 102 KSRDK-IVIVEEITPENVL 119
            SR K IVI     P+  L
Sbjct: 84  ASRKKTIVIASRDEPDWTL 102


>gi|348619012|ref|ZP_08885471.1| conserved hypothetical protein DUF167 [Candidatus Glomeribacter
           gigasporarum BEG34]
 gi|347815702|emb|CCD30336.1| conserved hypothetical protein DUF167 [Candidatus Glomeribacter
           gigasporarum BEG34]
          Length = 112

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 38/67 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + ++ +P +K+       + A+ V+++AP  DG+AN AL  +++  LGV R  V + +
Sbjct: 20  VQLCVYVQPNAKATETMGEFNGALKVRLNAPPVDGKANQALCAWLAERLGVPRHAVYLEA 79

Query: 100 GSKSRDK 106
           G   R K
Sbjct: 80  GHNRRYK 86


>gi|296269396|ref|YP_003652028.1| hypothetical protein Tbis_1417 [Thermobispora bispora DSM 43833]
 gi|296092183|gb|ADG88135.1| protein of unknown function DUF167 [Thermobispora bispora DSM
           43833]
          Length = 89

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 42/70 (60%), Gaps = 1/70 (1%)

Query: 40  VSITIHAKPG-SKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           + ++I  +PG S+        D A+ V++ APA DG A  A L+ ++S  GV+R  V + 
Sbjct: 1   MRVSIRVRPGASREYVGGTYGDGAIVVRVCAPAVDGRATEAALKAVASAFGVRRGDVRLV 60

Query: 99  SGSKSRDKIV 108
           SG+ +RDK+V
Sbjct: 61  SGATARDKVV 70


>gi|260913359|ref|ZP_05919840.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
 gi|260632590|gb|EEX50760.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325]
          Length = 100

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 44/82 (53%), Gaps = 2/82 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S    V +  + + 
Sbjct: 10  NLRLRIFLQPKASKDQIVGLHDDELKITITAPPIDGQANAHLLKFLSKTFKVPKSSIVLE 69

Query: 99  SGSKSRDKIVIV--EEITPENV 118
            G  +R K ++V   +I PE V
Sbjct: 70  KGELNRHKQILVPNPKIIPEIV 91


>gi|398337232|ref|ZP_10521937.1| hypothetical protein LkmesMB_17675 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 73

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 39/72 (54%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T+  KP SK        D ++ V +  PA +G+AN A++E  S  + + +R++ I S
Sbjct: 1   MKFTVLVKPNSKKVFFRKEEDGSITVAVREPALEGKANEAVIEAFSEAMNLPKRKIRILS 60

Query: 100 GSKSRDKIVIVE 111
           G K + K V V+
Sbjct: 61  GEKGKRKTVEVD 72


>gi|301064653|ref|ZP_07205047.1| conserved hypothetical protein TIGR00251 [delta proteobacterium
           NaphS2]
 gi|300441273|gb|EFK05644.1| conserved hypothetical protein TIGR00251 [delta proteobacterium
           NaphS2]
          Length = 95

 Score = 45.1 bits (105), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 22/85 (25%), Positives = 47/85 (55%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + V I +   P S    I   +D    +++ AP  +G+AN AL+ +++   G  ++++ I
Sbjct: 8   AGVVIRVKVLPRSSRTEIVGKTDGIYRIKLTAPPVEGKANKALINFLAKKTGSPKQKIRI 67

Query: 98  GSGSKSRDKIVIVEEITPENVLNSL 122
             G +SR+K + +E ++ +++L  L
Sbjct: 68  VKGEQSRNKTIRIENLSSDDILKYL 92


>gi|406936430|gb|EKD70152.1| hypothetical protein ACD_46C00619G0007 [uncultured bacterium]
          Length = 104

 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 41/78 (52%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           V  S++ + ++ +P +    I  V D+ + +++  P  DG AN  L ++++    V  + 
Sbjct: 9   VEESALWLHVYLQPRASRDEIVGVHDDCLKIRLTTPPVDGRANQHLCQFLAKFFRVATQS 68

Query: 95  VSIGSGSKSRDKIVIVEE 112
           VSI  G  SR K V +E+
Sbjct: 69  VSIEKGELSRKKWVKIEK 86


>gi|328909063|gb|AEB61199.1| UPF0235, partial [Equus caballus]
          Length = 69

 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 24/58 (41%), Positives = 36/58 (62%), Gaps = 1/58 (1%)

Query: 62  AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV-IVEEITPENV 118
           AV V I AP  +GEANA L  Y+S VL +++  V +  G KS +K+V ++   TP+ +
Sbjct: 1   AVSVAIAAPPSEGEANAELCRYLSKVLDLRKSDVVLDKGGKSCEKVVKLLASTTPDEI 58


>gi|218246502|ref|YP_002371873.1| hypothetical protein PCC8801_1666 [Cyanothece sp. PCC 8801]
 gi|257059535|ref|YP_003137423.1| hypothetical protein Cyan8802_1684 [Cyanothece sp. PCC 8802]
 gi|218166980|gb|ACK65717.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8801]
 gi|256589701|gb|ACV00588.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8802]
          Length = 72

 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 40/68 (58%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP +K   I +  D ++ V + +P  +G+AN  L++ ++   GV + Q+SI SG  S
Sbjct: 5   VKVKPNAKKQKIQEEEDGSLVVYLKSPPIEGKANQELIKLLAKKFGVSQSQISIKSGLSS 64

Query: 104 RDKIVIVE 111
           R+K V +E
Sbjct: 65  RNKWVEIE 72


>gi|402701133|ref|ZP_10849112.1| hypothetical protein PfraA_14931 [Pseudomonas fragi A22]
          Length = 97

 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 38/67 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P + S     +  E + +++ AP  +G+ANA LL +++   G+ + QVS+ SG 
Sbjct: 13  LDCHLQPKASSDEFAGLHGERLKIRLTAPPVEGKANAHLLAFLAKAFGIAKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|163783198|ref|ZP_02178192.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1]
 gi|159881532|gb|EDP75042.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1]
          Length = 73

 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 40/69 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + +  KP SK   + +VS   + V++ AP + G+AN  L+E ++   GV++  V I  
Sbjct: 1   MKLKVKVKPSSKREGVREVSPGELEVRVSAPPERGKANERLIELLAKHYGVRKGAVRILR 60

Query: 100 GSKSRDKIV 108
           G  SR+K+V
Sbjct: 61  GETSREKVV 69


>gi|407368083|ref|ZP_11114615.1| hypothetical protein PmanJ_29961 [Pseudomonas mandelii JR-1]
          Length = 98

 Score = 44.7 bits (104), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|398964943|ref|ZP_10680647.1| TIGR00251 family protein [Pseudomonas sp. GM30]
 gi|424925428|ref|ZP_18348789.1| hypothetical protein I1A_004915 [Pseudomonas fluorescens R124]
 gi|398147745|gb|EJM36444.1| TIGR00251 family protein [Pseudomonas sp. GM30]
 gi|404306588|gb|EJZ60550.1| hypothetical protein I1A_004915 [Pseudomonas fluorescens R124]
          Length = 97

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQVSLLSGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|398893402|ref|ZP_10646116.1| TIGR00251 family protein [Pseudomonas sp. GM55]
 gi|398184302|gb|EJM71758.1| TIGR00251 family protein [Pseudomonas sp. GM55]
          Length = 95

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|418722298|ref|ZP_13281469.1| TIGR00251 family protein [Leptospira borgpetersenii str. UI 09149]
 gi|418736835|ref|ZP_13293234.1| TIGR00251 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|421095976|ref|ZP_15556683.1| TIGR00251 family protein [Leptospira borgpetersenii str. 200801926]
 gi|410360877|gb|EKP11923.1| TIGR00251 family protein [Leptospira borgpetersenii str. 200801926]
 gi|410741608|gb|EKQ90364.1| TIGR00251 family protein [Leptospira borgpetersenii str. UI 09149]
 gi|410747721|gb|EKR00626.1| TIGR00251 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|456889338|gb|EMG00237.1| TIGR00251 family protein [Leptospira borgpetersenii str. 200701203]
          Length = 73

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T+  KP SK        D ++ + +  PA +G+AN A++E +S  + + +R++ I S
Sbjct: 1   MKFTVRVKPNSKKIFFRKEEDGSITIAVREPALEGKANEAVIETISREMKIPKRKIRIVS 60

Query: 100 GSKSRDKIVIVEEITP 115
           G K + K +   EI P
Sbjct: 61  GEKGKKKTI---EIDP 73


>gi|398900555|ref|ZP_10649583.1| TIGR00251 family protein [Pseudomonas sp. GM50]
 gi|398180955|gb|EJM68529.1| TIGR00251 family protein [Pseudomonas sp. GM50]
          Length = 98

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|426412015|ref|YP_007032114.1| hypothetical protein PputUW4_05119 [Pseudomonas sp. UW4]
 gi|426270232|gb|AFY22309.1| hypothetical protein PputUW4_05119 [Pseudomonas sp. UW4]
          Length = 97

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|333998213|ref|YP_004530825.1| protein C15orf40 [Treponema primitia ZAS-2]
 gi|333739567|gb|AEF85057.1| protein C15orf40 [Treponema primitia ZAS-2]
          Length = 99

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 44/79 (55%), Gaps = 1/79 (1%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           +  C R +    + + I A PGS    I  +S+  + ++I A  +DG+AN  L  +++ +
Sbjct: 1   MAECFR-IAGELLLLDIKAVPGSSKSQIAGLSEGRLRIKIAASPEDGKANTELRAFLAKL 59

Query: 88  LGVKRRQVSIGSGSKSRDK 106
           LG  R+ +++ +G KSR K
Sbjct: 60  LGCPRKDITLVAGEKSRLK 78


>gi|429963136|gb|ELA42680.1| TIGR00251 family protein [Vittaforma corneae ATCC 50505]
          Length = 101

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 46/92 (50%), Gaps = 1/92 (1%)

Query: 34  LVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           L+      + I  +P +    I   +DE   + + APA   +AN  LL ++S++ GV++ 
Sbjct: 5   LIKEDYFILDIKVRPNASQTEIRAFTDEEFIIALAAPADKEKANKELLNFLSTIFGVQKD 64

Query: 94  QVSIGSGSK-SRDKIVIVEEITPENVLNSLGK 124
           ++S+  G   SR KI +    + + V N + K
Sbjct: 65  KLSVTGGQHSSRKKIRVSLRASIQEVTNMIAK 96


>gi|398870082|ref|ZP_10625434.1| TIGR00251 family protein [Pseudomonas sp. GM74]
 gi|398209654|gb|EJM96323.1| TIGR00251 family protein [Pseudomonas sp. GM74]
          Length = 98

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|398928809|ref|ZP_10663693.1| TIGR00251 family protein [Pseudomonas sp. GM48]
 gi|398167900|gb|EJM55935.1| TIGR00251 family protein [Pseudomonas sp. GM48]
          Length = 97

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|398853101|ref|ZP_10609731.1| TIGR00251 family protein [Pseudomonas sp. GM80]
 gi|398241826|gb|EJN27463.1| TIGR00251 family protein [Pseudomonas sp. GM80]
          Length = 96

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQVSLLSGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|367048099|ref|XP_003654429.1| hypothetical protein THITE_2051766 [Thielavia terrestris NRRL 8126]
 gi|347001692|gb|AEO68093.1| hypothetical protein THITE_2051766 [Thielavia terrestris NRRL 8126]
          Length = 119

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 35/54 (64%)

Query: 55  ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           IT V DEAV + + A A++GEAN A+++ +S VL + +  + I  G KSR K V
Sbjct: 37  ITSVDDEAVEICVAAQAREGEANKAVIKVLSDVLDLPKSALQITQGLKSRTKTV 90


>gi|77461543|ref|YP_351050.1| hypothetical protein Pfl01_5322 [Pseudomonas fluorescens Pf0-1]
 gi|123602932|sp|Q3K595.1|Y5322_PSEPF RecName: Full=UPF0235 protein Pfl01_5322
 gi|77385546|gb|ABA77059.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1]
          Length = 96

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 40/69 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQVSLLSGE 72

Query: 102 KSRDKIVIV 110
            +R K V +
Sbjct: 73  LNRQKRVRI 81


>gi|398857166|ref|ZP_10612869.1| TIGR00251 family protein [Pseudomonas sp. GM79]
 gi|398241480|gb|EJN27132.1| TIGR00251 family protein [Pseudomonas sp. GM79]
          Length = 97

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|308456125|ref|XP_003090529.1| hypothetical protein CRE_03518 [Caenorhabditis remanei]
 gi|308262652|gb|EFP06605.1| hypothetical protein CRE_03518 [Caenorhabditis remanei]
          Length = 102

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 17/58 (29%), Positives = 34/58 (58%)

Query: 40 VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
          + + IHAKPG+K   +  + +  + V I A  ++G AN  L+ Y+ + LG+++ ++  
Sbjct: 32 IGLRIHAKPGAKKSCVVAIGESEIDVSIGAAPREGAANEELISYLMAALGLRKNELQF 89


>gi|398953878|ref|ZP_10675622.1| TIGR00251 family protein [Pseudomonas sp. GM33]
 gi|398153180|gb|EJM41685.1| TIGR00251 family protein [Pseudomonas sp. GM33]
          Length = 98

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|300115523|ref|YP_003762098.1| hypothetical protein Nwat_3056 [Nitrosococcus watsonii C-113]
 gi|299541460|gb|ADJ29777.1| protein of unknown function DUF167 [Nitrosococcus watsonii C-113]
          Length = 102

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 40/74 (54%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + I  +P ++   +       + ++I AP  +G+AN  LL ++     V R QV + 
Sbjct: 12  ALIVQIRLQPRARGDEVIGPHGNRLKIRITAPPVEGKANTQLLRFLVKTFQVSRNQVYLL 71

Query: 99  SGSKSRDKIVIVEE 112
           SG+ SRDK V +E+
Sbjct: 72  SGTASRDKRVRIEK 85


>gi|398911885|ref|ZP_10655681.1| TIGR00251 family protein [Pseudomonas sp. GM49]
 gi|398183040|gb|EJM70536.1| TIGR00251 family protein [Pseudomonas sp. GM49]
          Length = 98

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|322697515|gb|EFY89294.1| DUF167 domain protein [Metarhizium acridum CQMa 102]
          Length = 122

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 45/78 (57%), Gaps = 6/78 (7%)

Query: 55  ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEI- 113
           I  V+D A+ + + A  + GEAN A+++ +S+ +G+ + +    SG KSRDK+V + +I 
Sbjct: 39  ILAVTDRAIELCVTAQPRHGEANKAVVQALSNAIGIPKSRFRFVSGLKSRDKVVAIGDIQ 98

Query: 114 -----TPENVLNSLGKAS 126
                  E VL  L +AS
Sbjct: 99  GDGPDYTETVLRLLREAS 116


>gi|261867124|ref|YP_003255046.1| hypothetical protein D11S_0417 [Aggregatibacter
           actinomycetemcomitans D11S-1]
 gi|415767506|ref|ZP_11483178.1| hypothetical protein D17P2_0150 [Aggregatibacter
           actinomycetemcomitans D17P-2]
 gi|416107251|ref|ZP_11590338.1| hypothetical protein SCC2302_1544 [Aggregatibacter
           actinomycetemcomitans serotype c str. SCC2302]
 gi|444346864|ref|ZP_21154822.1| hypothetical protein AAS4A_1734 [Aggregatibacter
           actinomycetemcomitans serotype c str. AAS4A]
 gi|261412456|gb|ACX81827.1| hypothetical protein D11S_0417 [Aggregatibacter
           actinomycetemcomitans D11S-1]
 gi|348005581|gb|EGY46058.1| hypothetical protein SCC2302_1544 [Aggregatibacter
           actinomycetemcomitans serotype c str. SCC2302]
 gi|348658442|gb|EGY76010.1| hypothetical protein D17P2_0150 [Aggregatibacter
           actinomycetemcomitans D17P-2]
 gi|443541156|gb|ELT51620.1| hypothetical protein AAS4A_1734 [Aggregatibacter
           actinomycetemcomitans serotype c str. AAS4A]
          Length = 97

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 40/73 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +
Sbjct: 9   AGLRLRIFLQPKAAKDQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKVPKSSIVL 68

Query: 98  GSGSKSRDKIVIV 110
             G  +R K V++
Sbjct: 69  EKGELNRHKQVLI 81


>gi|171679393|ref|XP_001904643.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939322|emb|CAP64550.1| unnamed protein product [Podospora anserina S mat+]
          Length = 119

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 45/77 (58%), Gaps = 2/77 (2%)

Query: 36  PPSSVSITIHAKPGSKSC--SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P  ++ +  + KPG+      +  V ++AV + + A A++GEAN A+++ +S VL + + 
Sbjct: 16  PHGTLYMHCNVKPGASKNREGVASVGEDAVEICVAAQAREGEANKAVIKVLSEVLDLPKS 75

Query: 94  QVSIGSGSKSRDKIVIV 110
            + I  G KSR+K V V
Sbjct: 76  NLEITQGHKSRNKTVAV 92


>gi|212541150|ref|XP_002150730.1| DUF167 domain protein [Talaromyces marneffei ATCC 18224]
 gi|210068029|gb|EEA22121.1| DUF167 domain protein [Talaromyces marneffei ATCC 18224]
          Length = 124

 Score = 44.3 bits (103), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 3/70 (4%)

Query: 39  SVSITIHAKPGSKSCS-ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S+ I  H  PGS+    +  + DE V + + +  + GEANAA+++ +S VLG+ +  ++I
Sbjct: 23  SLLIRCHVTPGSRGFQGVKKICDEQVYIHVASAPRKGEANAAVVKVLSEVLGIPKSDITI 82

Query: 98  GSGSKSRDKI 107
               K RDK+
Sbjct: 83  --MGKHRDKV 90


>gi|427727766|ref|YP_007074003.1| hypothetical protein Nos7524_0495 [Nostoc sp. PCC 7524]
 gi|427363685|gb|AFY46406.1| hypothetical protein Nos7524_0495 [Nostoc sp. PCC 7524]
          Length = 73

 Score = 44.3 bits (103), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 39/69 (56%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I + SD ++ V + +P  DG+AN  L++ ++    V +  + I SG  S
Sbjct: 5   VKVKPNSKQQKIEEQSDGSLTVHLKSPPVDGKANEELIKLLADKFDVPKSYIRIKSGLSS 64

Query: 104 RDKIVIVEE 112
           R K++ V+E
Sbjct: 65  RQKLIEVDE 73


>gi|398991326|ref|ZP_10694471.1| TIGR00251 family protein [Pseudomonas sp. GM24]
 gi|399013687|ref|ZP_10715990.1| TIGR00251 family protein [Pseudomonas sp. GM16]
 gi|398112929|gb|EJM02781.1| TIGR00251 family protein [Pseudomonas sp. GM16]
 gi|398140801|gb|EJM29754.1| TIGR00251 family protein [Pseudomonas sp. GM24]
          Length = 98

 Score = 44.3 bits (103), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQVSLLSGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|220916321|ref|YP_002491625.1| hypothetical protein A2cp1_1215 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|254799985|sp|B8JFX1.1|Y1215_ANAD2 RecName: Full=UPF0235 protein A2cp1_1215
 gi|219954175|gb|ACL64559.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 95

 Score = 44.3 bits (103), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 37/69 (53%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I  +P +         D  + +Q+ AP  DG ANAAL+E++++ LGV+R  V +  G 
Sbjct: 13  LEILVQPRASRTRAVGEHDGRLKIQLAAPPVDGAANAALVEFLAAALGVRRADVELLRGE 72

Query: 102 KSRDKIVIV 110
             R K V V
Sbjct: 73  TGRRKTVRV 81


>gi|407983885|ref|ZP_11164522.1| hypothetical protein C731_2485 [Mycobacterium hassiacum DSM 44199]
 gi|407374462|gb|EKF23441.1| hypothetical protein C731_2485 [Mycobacterium hassiacum DSM 44199]
          Length = 76

 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 43/70 (61%), Gaps = 1/70 (1%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           +S ++ +  KPGS+   + +V D+ A+ V +   A DG+ANAA++  ++   GV R +V 
Sbjct: 2   TSQTVVVRVKPGSRKGPLVEVDDDGALTVYVPERAVDGKANAAVVRLLAEHFGVSRSEVE 61

Query: 97  IGSGSKSRDK 106
           + SG+ +R K
Sbjct: 62  LVSGATARLK 71


>gi|375082825|ref|ZP_09729871.1| hypothetical protein OCC_07259 [Thermococcus litoralis DSM 5473]
 gi|374742522|gb|EHR78914.1| hypothetical protein OCC_07259 [Thermococcus litoralis DSM 5473]
          Length = 92

 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 49/86 (56%), Gaps = 5/86 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSD--EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           V I I+ +P +K   I  V +  + + V++ AP  +G+AN  ++++ S +LG    +V +
Sbjct: 9   VIIQIYVQPNAKKTEIEGVDEWRKRLKVKVKAPPVEGKANKEVVKFFSKLLGT---EVVL 65

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLG 123
             G  SR+K ++V+ +T E V   LG
Sbjct: 66  LRGETSREKDLLVKGLTAEEVKEKLG 91


>gi|24214436|ref|NP_711917.1| hypothetical protein LA_1736 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45657916|ref|YP_002002.1| hypothetical protein LIC12068 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386073870|ref|YP_005988187.1| hypothetical protein LIF_A1404 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417759201|ref|ZP_12407238.1| TIGR00251 family protein [Leptospira interrogans str. 2002000624]
 gi|417768197|ref|ZP_12416130.1| TIGR00251 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|417777249|ref|ZP_12425074.1| TIGR00251 family protein [Leptospira interrogans str. 2002000621]
 gi|417782623|ref|ZP_12430347.1| TIGR00251 family protein [Leptospira interrogans str. C10069]
 gi|418668811|ref|ZP_13230211.1| TIGR00251 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418670835|ref|ZP_13232197.1| TIGR00251 family protein [Leptospira interrogans str. 2002000623]
 gi|418691748|ref|ZP_13252832.1| TIGR00251 family protein [Leptospira interrogans str. FPW2026]
 gi|418698664|ref|ZP_13259636.1| TIGR00251 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|418708003|ref|ZP_13268816.1| TIGR00251 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|418722790|ref|ZP_13281764.1| TIGR00251 family protein [Leptospira interrogans str. UI 12621]
 gi|421085647|ref|ZP_15546498.1| TIGR00251 family protein [Leptospira santarosai str. HAI1594]
 gi|421101805|ref|ZP_15562416.1| TIGR00251 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421122303|ref|ZP_15582586.1| TIGR00251 family protein [Leptospira interrogans str. Brem 329]
 gi|29839712|sp|Q8F5E6.1|Y1736_LEPIN RecName: Full=UPF0235 protein LA_1736
 gi|73921077|sp|Q72QP5.1|Y2068_LEPIC RecName: Full=UPF0235 protein LIC_12068
 gi|24195381|gb|AAN48935.1|AE011350_4 conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45601157|gb|AAS70639.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353457659|gb|AER02204.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400349093|gb|EJP01392.1| TIGR00251 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|400358510|gb|EJP14590.1| TIGR00251 family protein [Leptospira interrogans str. FPW2026]
 gi|409944676|gb|EKN90256.1| TIGR00251 family protein [Leptospira interrogans str. 2002000624]
 gi|409954038|gb|EKO08533.1| TIGR00251 family protein [Leptospira interrogans str. C10069]
 gi|409963624|gb|EKO27347.1| TIGR00251 family protein [Leptospira interrogans str. UI 12621]
 gi|410344203|gb|EKO95369.1| TIGR00251 family protein [Leptospira interrogans str. Brem 329]
 gi|410368478|gb|EKP23855.1| TIGR00251 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410431212|gb|EKP75572.1| TIGR00251 family protein [Leptospira santarosai str. HAI1594]
 gi|410573126|gb|EKQ36183.1| TIGR00251 family protein [Leptospira interrogans str. 2002000621]
 gi|410582264|gb|EKQ50063.1| TIGR00251 family protein [Leptospira interrogans str. 2002000623]
 gi|410755543|gb|EKR17173.1| TIGR00251 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410762222|gb|EKR28388.1| TIGR00251 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410771493|gb|EKR46694.1| TIGR00251 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|455788724|gb|EMF40689.1| TIGR00251 family protein [Leptospira interrogans serovar Lora str.
           TE 1992]
 gi|456822225|gb|EMF70711.1| TIGR00251 family protein [Leptospira interrogans serovar Canicola
           str. LT1962]
 gi|456966928|gb|EMG08398.1| TIGR00251 family protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 73

 Score = 44.3 bits (103), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 39/69 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T++ KP SK        D  + + +  PA +G+AN A++E +S  + V + ++ I S
Sbjct: 1   MKFTVYVKPNSKKVFFRKEEDGVLTIAVREPALEGKANEAVIESISKEMKVPKSKIRILS 60

Query: 100 GSKSRDKIV 108
           G K++ KI+
Sbjct: 61  GQKNKKKII 69


>gi|425066177|ref|ZP_18469297.1| hypothetical protein P1059_01464 [Pasteurella multocida subsp.
           gallicida P1059]
 gi|404382104|gb|EJZ78566.1| hypothetical protein P1059_01464 [Pasteurella multocida subsp.
           gallicida P1059]
          Length = 99

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 2/87 (2%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
             + + I  +P +    I  + D  + + I AP  DG+ANA LL+++S    V +  + +
Sbjct: 12  EHLRLRIFLQPKASKDQIVGLHDNELKITITAPPIDGQANAHLLKFLSKTFKVPKSSIVL 71

Query: 98  GSGSKSRDKIVIV--EEITPENVLNSL 122
             G  +R K +++   ++ PE ++  L
Sbjct: 72  EKGELNRHKQILIPNPKVIPEEIIALL 98


>gi|289208238|ref|YP_003460304.1| hypothetical protein TK90_1056 [Thioalkalivibrio sp. K90mix]
 gi|288943869|gb|ADC71568.1| protein of unknown function DUF167 [Thioalkalivibrio sp. K90mix]
          Length = 85

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 44/81 (54%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +   PGSK  +I     + + +++ AP + G AN A+   ++  LG   R VS+ +G+
Sbjct: 4   LRVRVAPGSKRDAIGPWMGDILKLRVQAPPEKGRANEAVCALLAKALGCPARDVSVVAGA 63

Query: 102 KSRDKIVIVEEITPENVLNSL 122
            +RDK V +E  +  ++  +L
Sbjct: 64  TARDKTVAIEGYSEADLRRAL 84


>gi|399002120|ref|ZP_10704816.1| TIGR00251 family protein [Pseudomonas sp. GM18]
 gi|398125864|gb|EJM15327.1| TIGR00251 family protein [Pseudomonas sp. GM18]
          Length = 95

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFAGLHGDRLKIRLTAPPVEGKANAYLIAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|401826895|ref|XP_003887540.1| hypothetical protein EHEL_070320 [Encephalitozoon hellem ATCC
           50504]
 gi|392998546|gb|AFM98559.1| hypothetical protein EHEL_070320 [Encephalitozoon hellem ATCC
           50504]
          Length = 100

 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 42/71 (59%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +++ A+P ++S  + +V+D  + V I+AP  D +AN  L  ++S +L + +  V I +
Sbjct: 11  LRLSVWARPNARSTGVREVTDTEIVVNINAPPTDNKANKELTGFVSKMLKIPKSSVKILA 70

Query: 100 GSKSRDKIVIV 110
           G+ S  K V V
Sbjct: 71  GATSTHKTVEV 81


>gi|410939455|ref|ZP_11371282.1| TIGR00251 family protein [Leptospira noguchii str. 2006001870]
 gi|410785323|gb|EKR74287.1| TIGR00251 family protein [Leptospira noguchii str. 2006001870]
          Length = 73

 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 42/72 (58%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +T++ KP SK        D  + + +  PA +G+AN A++E +S  + + + ++ I S
Sbjct: 1   MKLTVYVKPNSKKVFFRKEEDGVLTIAVREPALEGKANEAVIESISKEMKIPKSKIRILS 60

Query: 100 GSKSRDKIVIVE 111
           G K++ KI+ V+
Sbjct: 61  GEKNKKKIIEVD 72


>gi|398941348|ref|ZP_10669807.1| TIGR00251 family protein [Pseudomonas sp. GM41(2012)]
 gi|398161817|gb|EJM50037.1| TIGR00251 family protein [Pseudomonas sp. GM41(2012)]
          Length = 98

 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+ SG 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLIAFLAKAFGVSKSQVSLISGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|52424377|ref|YP_087514.1| hypothetical protein MS0322 [Mannheimia succiniciproducens MBEL55E]
 gi|81387574|sp|Q65VT1.1|Y322_MANSM RecName: Full=UPF0235 protein MS0322
 gi|52306429|gb|AAU36929.1| unknown [Mannheimia succiniciproducens MBEL55E]
          Length = 95

 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 41/81 (50%), Gaps = 2/81 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D+ + + I AP  DG ANA LL+Y+S    V +  + +  
Sbjct: 11  LRLRIFLQPKASRDKIIGIHDDELKIAITAPPVDGAANAHLLKYLSKAFKVPKSAIILEK 70

Query: 100 GSKSRDKIVIVEE--ITPENV 118
           G  +R K + + E  + PE +
Sbjct: 71  GELNRHKQLFIPEPKLIPEEL 91


>gi|322708042|gb|EFY99619.1| DUF167 domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 126

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 45/78 (57%), Gaps = 6/78 (7%)

Query: 55  ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEI- 113
           I  V+D A+ + + A  + GEAN A+++ +S+ +G+ + +    SG KSRDK+V + +I 
Sbjct: 43  ILAVTDRAIELCVAAQPRHGEANKAVVQALSNAIGIPKSRFRFVSGIKSRDKVVAIGDIQ 102

Query: 114 -----TPENVLNSLGKAS 126
                  E +L  L +AS
Sbjct: 103 GDGPEYTETILRLLREAS 120


>gi|396498754|ref|XP_003845307.1| similar to DUF167 domain protein [Leptosphaeria maculans JN3]
 gi|312221888|emb|CBY01828.1| similar to DUF167 domain protein [Leptosphaeria maculans JN3]
          Length = 118

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 45  HAKPG--SKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSK 102
           H KPG  S    I  V+++AV V +   AK+GEAN A++  +   L   R +V I  G K
Sbjct: 23  HVKPGVSSSRLGIIAVTEDAVEVGVAEQAKNGEANDAVVHVICRALHAPRDEVRIVRGWK 82

Query: 103 SRDKIVIV 110
           SR K V+V
Sbjct: 83  SRVKTVVV 90


>gi|296126259|ref|YP_003633511.1| hypothetical protein [Brachyspira murdochii DSM 12563]
 gi|296018075|gb|ADG71312.1| protein of unknown function DUF167 [Brachyspira murdochii DSM
           12563]
          Length = 84

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 42/71 (59%), Gaps = 1/71 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I +    G+KS S     + A  V+I A A DG+AN A++E+++  L +K++ + I  
Sbjct: 1   MNIEVKVTAGAKSNSFK-FENGAYYVRIMAKAIDGKANKAIIEFLADELNIKKKDIDILK 59

Query: 100 GSKSRDKIVIV 110
           G KS  KI+ +
Sbjct: 60  GEKSSKKIIAI 70


>gi|408793795|ref|ZP_11205401.1| TIGR00251 family protein [Leptospira meyeri serovar Hardjo str.
           Went 5]
 gi|408462299|gb|EKJ86028.1| TIGR00251 family protein [Leptospira meyeri serovar Hardjo str.
           Went 5]
          Length = 75

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 40/69 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +T+  KP +K   +  +S+    V++ +P  DG+AN  L+E +S    V ++ +SI S
Sbjct: 1   MKLTVKVKPNNKQPGLEFLSETDCIVRLKSPPVDGKANEELIEVLSKHFHVPKKNISILS 60

Query: 100 GSKSRDKIV 108
           G  S+ K+V
Sbjct: 61  GFSSKSKLV 69


>gi|406971932|gb|EKD95856.1| hypothetical protein ACD_24C00300G0003 [uncultured bacterium]
          Length = 75

 Score = 43.9 bits (102), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 43/76 (56%), Gaps = 4/76 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KPG+K   I   S+    + + A AK+ EAN  L++ +S   G+ + ++SI S
Sbjct: 1   MRIHVLLKPGAKEPKIEQ-SNGIFRISVKAVAKENEANNELIDTLSDYFGIAKSRISIVS 59

Query: 100 GSKSRDKIVIVEEITP 115
           G  SR+K+V   EI P
Sbjct: 60  GFNSRNKVV---EIAP 72


>gi|388458165|ref|ZP_10140460.1| hypothetical protein FdumT_16418 [Fluoribacter dumoffii Tex-KL]
          Length = 98

 Score = 43.9 bits (102), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +++  +PG+K   +     E + ++I AP+ + +AN  L+ Y+S +  V + Q+ I  G 
Sbjct: 17  LSLLIQPGAKCNQVVGAVGEELKIKIAAPSIEDKANMELVRYLSVLFKVPKSQIKIKRGL 76

Query: 102 KSRDKIV 108
           KSR KI+
Sbjct: 77  KSRHKII 83


>gi|342903956|ref|ZP_08725758.1| Hypothetical protein GGC_0660 [Haemophilus haemolyticus M21621]
 gi|341953965|gb|EGT80459.1| Hypothetical protein GGC_0660 [Haemophilus haemolyticus M21621]
          Length = 95

 Score = 43.9 bits (102), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 44/85 (51%), Gaps = 2/85 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D+ + + + AP  DG+ANA LL+++S    V +  + +  
Sbjct: 11  IRLRIFLQPKASKDHIAGIHDDELKITLTAPPIDGQANAHLLKFLSKSFKVPKSSIILEK 70

Query: 100 GSKSRDKIVIVEE--ITPENVLNSL 122
           G  SR K V + +  + P+ + N L
Sbjct: 71  GELSRHKQVWLPDPKLIPKEIQNLL 95


>gi|325577506|ref|ZP_08147868.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae
           ATCC 33392]
 gi|325160610|gb|EGC72734.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae
           ATCC 33392]
          Length = 95

 Score = 43.9 bits (102), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S    V +  + 
Sbjct: 8   PEGLRLKIILQPKASKDQIVGLHDDELKITITAPPVDGQANAHLLKFLSKTFKVPKSSIV 67

Query: 97  IGSGSKSRDKIVIV--EEITPENVLNSL 122
           +  G  +R K V +   ++ P  + N L
Sbjct: 68  LEKGELNRHKQVWIPSPKLIPSEIQNLL 95


>gi|332530697|ref|ZP_08406628.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624]
 gi|332039865|gb|EGI76260.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624]
          Length = 115

 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 39/72 (54%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           SV + +H  P +    I  + D A+ V++ AP  DG+AN AL  +++  L +    V++ 
Sbjct: 22  SVLVDLHVMPNASRTQIQGLFDGALKVRLQAPPVDGKANEALRVWLAKTLSIPNSSVTLQ 81

Query: 99  SGSKSRDKIVIV 110
            G+ +R K + V
Sbjct: 82  HGATARRKQLHV 93


>gi|86157514|ref|YP_464299.1| hypothetical protein Adeh_1087 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|123499918|sp|Q2IPY3.1|Y1087_ANADE RecName: Full=UPF0235 protein Adeh_1087
 gi|85774025|gb|ABC80862.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 95

 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 37/69 (53%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I  +P +         D  + +Q+ AP  DG ANAAL+E+++  LGV+R  V++  G 
Sbjct: 13  LEILVQPRASRTRAVGEHDGRLKIQLAAPPVDGAANAALVEFLAVALGVRRADVALLRGE 72

Query: 102 KSRDKIVIV 110
             R K V V
Sbjct: 73  AGRRKTVRV 81


>gi|146309167|ref|YP_001189632.1| hypothetical protein Pmen_4153 [Pseudomonas mendocina ymp]
 gi|205829317|sp|A4XZY4.1|Y4153_PSEMY RecName: Full=UPF0235 protein Pmen_4153
 gi|145577368|gb|ABP86900.1| protein of unknown function DUF167 [Pseudomonas mendocina ymp]
          Length = 98

 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 36/65 (55%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P +       +  E + +++ AP  +G+ANA LL +++   GV + QVS+ SG 
Sbjct: 13  LDCHLQPKASKDEFAGLHGERLKIRLTAPPVEGKANAHLLAFLAKAFGVAKAQVSLESGE 72

Query: 102 KSRDK 106
            +R K
Sbjct: 73  LNRHK 77


>gi|402080667|gb|EJT75812.1| hypothetical protein GGTG_05741 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 122

 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 48/79 (60%), Gaps = 3/79 (3%)

Query: 35  VPPSSVSITIHAKP-GSKSCS-ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKR 92
           VPP+ + +  H +P  SKS   IT +SD+A+ + + AP  + +AN A++  +S  LG  +
Sbjct: 18  VPPA-LMLQCHVRPSASKSREGITSLSDDAINLSVAAPPLENKANQAVVAVLSEALGSPK 76

Query: 93  RQVSIGSGSKSRDKIVIVE 111
             ++I SG KS +K V V+
Sbjct: 77  SDLNIVSGLKSHEKTVAVK 95


>gi|375337667|ref|ZP_09779011.1| hypothetical protein SbacW_12197, partial [Succinivibrionaceae
           bacterium WG-1]
          Length = 87

 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 21/76 (27%), Positives = 41/76 (53%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           ++++I I   P S   +I  +  E + + I AP  DG+ANA + +Y+  +    +  V I
Sbjct: 9   NNITIYIVVTPKSSKDAIVGLIGEEIKITITAPPIDGKANAYIQKYLGKIFKTAKSNVEI 68

Query: 98  GSGSKSRDKIVIVEEI 113
             G  S+ K+V++++ 
Sbjct: 69  QKGETSKHKVVLIKDF 84


>gi|345428817|ref|YP_004821933.1| hypothetical protein PARA_02320 [Haemophilus parainfluenzae T3T1]
 gi|301154876|emb|CBW14339.1| conserved protein [Haemophilus parainfluenzae T3T1]
          Length = 95

 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 44/88 (50%), Gaps = 2/88 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S    V +  + 
Sbjct: 8   PEGLRLKIILQPKASKDQIVGLHDDELKITITAPPVDGQANAHLLKFLSKAFKVPKSSIV 67

Query: 97  IGSGSKSRDKIVIV--EEITPENVLNSL 122
           +  G  +R K V +   ++ P  + N L
Sbjct: 68  LEKGELNRHKQVWIPSPKLIPSEIQNLL 95


>gi|67523595|ref|XP_659857.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4]
 gi|40744782|gb|EAA63938.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4]
 gi|259487644|tpe|CBF86471.1| TPA: DUF167 domain protein (AFU_orthologue; AFUA_5G06647)
           [Aspergillus nidulans FGSC A4]
          Length = 130

 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 7/99 (7%)

Query: 33  RLVPPSSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGV 90
           +L    ++ I+   KP +      IT V +E V V + A  +DGEAN A+ +  + V  V
Sbjct: 23  KLANRYNLHISCRVKPNASGGREGITAVGNETVDVCVAAVPRDGEANLAVSQVFAKVFNV 82

Query: 91  KRRQVSIGSGSKSRDKIVIVEEI-----TPENVLNSLGK 124
            +  V +  G KSRDK++ +  +     T E  L   GK
Sbjct: 83  AKSDVGVIHGLKSRDKVLCIFNLDIGTETEERFLERAGK 121


>gi|315633577|ref|ZP_07888867.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393]
 gi|315477619|gb|EFU68361.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393]
          Length = 97

 Score = 43.9 bits (102), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 39/71 (54%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +  
Sbjct: 11  LRLRIFLQPKAAKDQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKVPKSSIVLEK 70

Query: 100 GSKSRDKIVIV 110
           G  +R K V++
Sbjct: 71  GELNRHKQVLI 81


>gi|443309150|ref|ZP_21038905.1| hypothetical protein Syn7509DRAFT_00046850 [Synechocystis sp. PCC
           7509]
 gi|442780805|gb|ELR90943.1| hypothetical protein Syn7509DRAFT_00046850 [Synechocystis sp. PCC
           7509]
          Length = 77

 Score = 43.9 bits (102), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 41/69 (59%)

Query: 43  TIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSK 102
           +I  KP S+   I + +D ++ + + +P  DG+AN  L++ ++    V + +++I  G  
Sbjct: 4   SIKVKPNSQQQKIIEEADGSLTISLKSPPVDGKANHELIQLLAKKFAVSKSKITIKLGLS 63

Query: 103 SRDKIVIVE 111
           SR K+VI++
Sbjct: 64  SRQKLVIID 72


>gi|54295945|ref|YP_122257.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris]
 gi|53755777|emb|CAH17279.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris]
          Length = 98

 Score = 43.9 bits (102), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 39/67 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +++  +PG+K   +     E + ++I AP+ + +AN  L+ Y+S +  V + Q+ I  G 
Sbjct: 17  LSLLIQPGAKCNQVVGAVGEELKIKIAAPSIEVKANMELVRYLSVLFKVPKSQIKIKRGL 76

Query: 102 KSRDKIV 108
           KSR KI+
Sbjct: 77  KSRHKII 83


>gi|116751493|ref|YP_848180.1| hypothetical protein Sfum_4080 [Syntrophobacter fumaroxidans MPOB]
 gi|116700557|gb|ABK19745.1| protein of unknown function DUF167 [Syntrophobacter fumaroxidans
           MPOB]
          Length = 105

 Score = 43.5 bits (101), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 20/75 (26%), Positives = 39/75 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +       +    + +++ AP  +GEAN   +++++   GV R  V I  G 
Sbjct: 19  LDVYLQPRASKNEWAGMHQGCLKIRLTAPPVEGEANKECVKFLAGAFGVSRSDVEIIRGH 78

Query: 102 KSRDKIVIVEEITPE 116
           KSR K +++   TPE
Sbjct: 79  KSRRKTILIRNSTPE 93


>gi|445062954|ref|ZP_21375239.1| hypothetical protein H263_05997 [Brachyspira hampsonii 30599]
 gi|444505662|gb|ELV06139.1| hypothetical protein H263_05997 [Brachyspira hampsonii 30599]
          Length = 83

 Score = 43.5 bits (101), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 42/69 (60%), Gaps = 1/69 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I +    G+KS S     + A  ++I A A DG+AN A++++++S L +K+R + I  
Sbjct: 1   MNIEVKVTAGAKSNSFK-FENGAYSIRIMAKAIDGKANKAIIDFLASELNIKKRDIEILK 59

Query: 100 GSKSRDKIV 108
           G K+  K++
Sbjct: 60  GEKNSKKLI 68


>gi|354567643|ref|ZP_08986811.1| UPF0235 protein yggU [Fischerella sp. JSC-11]
 gi|353542101|gb|EHC11565.1| UPF0235 protein yggU [Fischerella sp. JSC-11]
          Length = 75

 Score = 43.5 bits (101), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 39/69 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KP SK   I + +D ++ V + +P  DG+AN  L++ +S    V + ++ I S
Sbjct: 1   MQIKVKVKPNSKMQKIEEAADRSLIVYLKSPPVDGKANEELIKVLSENFHVPQSRIRIKS 60

Query: 100 GSKSRDKIV 108
           G  SR K++
Sbjct: 61  GLSSRQKLI 69


>gi|428316755|ref|YP_007114637.1| UPF0235 protein yggU [Oscillatoria nigro-viridis PCC 7112]
 gi|428240435|gb|AFZ06221.1| UPF0235 protein yggU [Oscillatoria nigro-viridis PCC 7112]
          Length = 78

 Score = 43.5 bits (101), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 37/67 (55%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
            T+  KP SK  SI +  D ++ V + +P  +G+AN  L+  ++    V +  + I SG 
Sbjct: 4   FTVKVKPNSKQQSIAEQPDGSLTVHLKSPPVEGKANQELIVLLAKKFKVPKSAIGIKSGF 63

Query: 102 KSRDKIV 108
            SR+K+V
Sbjct: 64  SSRNKLV 70


>gi|407801849|ref|ZP_11148692.1| hypothetical protein S7S_00925 [Alcanivorax sp. W11-5]
 gi|407024166|gb|EKE35910.1| hypothetical protein S7S_00925 [Alcanivorax sp. W11-5]
          Length = 95

 Score = 43.5 bits (101), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 42/73 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           ++ + +P +    I    D A+ +++ AP  DG ANA L+ +++ + GV ++QV I SG+
Sbjct: 13  LSCYLQPRASRTEIIGEHDGALKIRLSAPPVDGAANAELIAFLAKLCGVPKQQVVIESGA 72

Query: 102 KSRDKIVIVEEIT 114
             R K V +  ++
Sbjct: 73  TGRRKRVRIRGVS 85


>gi|126657163|ref|ZP_01728329.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110]
 gi|126621434|gb|EAZ92145.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110]
          Length = 75

 Score = 43.5 bits (101), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 19/69 (27%), Positives = 37/69 (53%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + +  KP +K   I +++D    + + +P  DG+AN  L+  ++    V +  + I S
Sbjct: 3   LKLQVKVKPNAKQQKIEEMADNHFKIAVKSPPTDGKANQELITLLAKHFNVPKSHILIKS 62

Query: 100 GSKSRDKIV 108
           G  SR+K+V
Sbjct: 63  GVSSRNKLV 71


>gi|225850940|ref|YP_002731174.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1]
 gi|259646383|sp|C0QR78.1|Y1406_PERMH RecName: Full=UPF0235 protein PERMA_1406
 gi|225645391|gb|ACO03577.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1]
          Length = 73

 Score = 43.5 bits (101), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 17/67 (25%), Positives = 38/67 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  KP +K   I ++  +   +++  P + G+AN+ ++E +S  L + + ++ +  G 
Sbjct: 3   VKVKVKPNAKKEEIREIQKDYFEIRVTVPPEKGKANSRVIELLSKHLKIPKSRIKLKKGE 62

Query: 102 KSRDKIV 108
           KSR+KI 
Sbjct: 63  KSREKIF 69


>gi|197335333|ref|YP_002155183.1| hypothetical protein VFMJ11_0427 [Vibrio fischeri MJ11]
 gi|197316823|gb|ACH66270.1| conserved hypothetical protein [Vibrio fischeri MJ11]
          Length = 95

 Score = 43.5 bits (101), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 43/84 (51%), Gaps = 2/84 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  E + + I AP  DG+ANA L++Y S +  V + ++++  G 
Sbjct: 12  LRLYLQPKASRDQIVGIHGEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGKITVEKGE 71

Query: 102 KSRDKIVIVE--EITPENVLNSLG 123
            +R K V +   EI P  +   L 
Sbjct: 72  LNRHKQVRIHSPEIIPNEIQQLLA 95


>gi|378775560|ref|YP_005177803.1| hypothetical protein Pmu_19760 [Pasteurella multocida 36950]
 gi|383311567|ref|YP_005364377.1| hypothetical protein PMCN06_1980 [Pasteurella multocida subsp.
           multocida str. HN06]
 gi|417851328|ref|ZP_12497083.1| hypothetical protein GEW_08058 [Pasteurella multocida subsp.
           gallicida str. Anand1_poultry]
 gi|417854102|ref|ZP_12499426.1| hypothetical protein AAUPMG_07823 [Pasteurella multocida subsp.
           multocida str. Anand1_goat]
 gi|425064007|ref|ZP_18467132.1| hypothetical protein X73_01370 [Pasteurella multocida subsp.
           gallicida X73]
 gi|338218448|gb|EGP04215.1| hypothetical protein AAUPMG_07823 [Pasteurella multocida subsp.
           multocida str. Anand1_goat]
 gi|338219658|gb|EGP05286.1| hypothetical protein GEW_08058 [Pasteurella multocida subsp.
           gallicida str. Anand1_poultry]
 gi|356598108|gb|AET16834.1| hypothetical protein Pmu_19760 [Pasteurella multocida 36950]
 gi|380872839|gb|AFF25206.1| hypothetical protein PMCN06_1980 [Pasteurella multocida subsp.
           multocida str. HN06]
 gi|404381975|gb|EJZ78439.1| hypothetical protein X73_01370 [Pasteurella multocida subsp.
           gallicida X73]
          Length = 99

 Score = 43.1 bits (100), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 19/73 (26%), Positives = 37/73 (50%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
             + + I  +P +    I  + D  + + I AP  DG+ANA LL+++S    V +  + +
Sbjct: 12  EHLRLRIFLQPKASKDQIVGLHDNELKITITAPPIDGQANAHLLKFLSKTFKVPKSSIVL 71

Query: 98  GSGSKSRDKIVIV 110
             G  +R K +++
Sbjct: 72  EKGELNRHKQILI 84


>gi|337284005|ref|YP_004623479.1| hypothetical protein PYCH_05170 [Pyrococcus yayanosii CH1]
 gi|334899939|gb|AEH24207.1| hypothetical protein PYCH_05170 [Pyrococcus yayanosii CH1]
          Length = 92

 Score = 43.1 bits (100), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 49/86 (56%), Gaps = 5/86 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSD--EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           V ++++ +P +K  ++  + +  E + V + AP   G+AN  L++++S +LG    +V I
Sbjct: 9   VLLSLYVRPNAKKNAVEGIDEWRERIKVSVTAPPVGGKANRELVKFLSKLLGA---EVEI 65

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLG 123
             G  SR+K V+V+  T E V   LG
Sbjct: 66  VRGETSREKDVLVKGKTVEEVKKKLG 91


>gi|197121557|ref|YP_002133508.1| hypothetical protein AnaeK_1146 [Anaeromyxobacter sp. K]
 gi|226696231|sp|B4UGV4.1|Y1146_ANASK RecName: Full=UPF0235 protein AnaeK_1146
 gi|196171406|gb|ACG72379.1| protein of unknown function DUF167 [Anaeromyxobacter sp. K]
          Length = 95

 Score = 43.1 bits (100), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 30/46 (65%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           +Q+ AP  DG ANAAL+E+++  LGV+R  V++  G   R K V V
Sbjct: 36  IQLAAPPVDGAANAALVEFLAVALGVRRADVALLRGETGRRKTVRV 81


>gi|116623504|ref|YP_825660.1| hypothetical protein Acid_4414 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116226666|gb|ABJ85375.1| protein of unknown function DUF167 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 88

 Score = 43.1 bits (100), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 43/81 (53%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +T+   P ++   IT    +A  + + AP  DG+AN   + +++   GV R +V I +G 
Sbjct: 4   LTVRVHPRARRSEITGRLGDAWKLALAAPPVDGKANDECVRFLAGWAGVPRSRVRIVTGL 63

Query: 102 KSRDKIVIVEEITPENVLNSL 122
            SR K+V +E +  E++   L
Sbjct: 64  TSRIKVVEIEGVPQEDLERRL 84


>gi|315230060|ref|YP_004070496.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP]
 gi|315183088|gb|ADT83273.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP]
          Length = 92

 Score = 43.1 bits (100), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 5/86 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEA--VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           V + +H +P +K  SI  V      + V++ AP   G+AN  L +++S +LG   ++V I
Sbjct: 9   VILLVHVQPNTKRNSIEGVDKWKGRIKVKVSAPPVGGKANKELTKFLSKLLG---KEVVI 65

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLG 123
             G  SR+K ++++  T E V   LG
Sbjct: 66  LRGETSREKDLLIKGATIEEVKEKLG 91


>gi|15603178|ref|NP_246251.1| hypothetical protein PM1313 [Pasteurella multocida subsp. multocida
           str. Pm70]
 gi|29839744|sp|Q9CLC6.1|Y1313_PASMU RecName: Full=UPF0235 protein PM1313
 gi|12721676|gb|AAK03397.1| unknown [Pasteurella multocida subsp. multocida str. Pm70]
          Length = 99

 Score = 43.1 bits (100), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 19/73 (26%), Positives = 37/73 (50%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
             + + I  +P +    I  + D  + + I AP  DG+ANA LL+++S    V +  + +
Sbjct: 12  EHLRLRIFLQPKASKDQIVGLHDNELKITITAPPIDGQANAHLLKFLSKTFKVPKSSIVL 71

Query: 98  GSGSKSRDKIVIV 110
             G  +R K +++
Sbjct: 72  EKGELNRHKQILI 84


>gi|333990437|ref|YP_004523051.1| hypothetical protein JDM601_1797 [Mycobacterium sp. JDM601]
 gi|333486405|gb|AEF35797.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
          Length = 78

 Score = 43.1 bits (100), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S ++ +  KPGS+   + +  D+  + V +  PA DG+ANAA++  ++    V R +V +
Sbjct: 4   SETVVVKVKPGSRKGPLVETDDDGQLTVYVREPAVDGKANAAVIRLLAQYFEVPRSRVEL 63

Query: 98  GSGSKSRDK 106
            SG+ +R K
Sbjct: 64  ASGAAARVK 72


>gi|212223789|ref|YP_002307025.1| hypothetical protein TON_0641 [Thermococcus onnurineus NA1]
 gi|226707988|sp|B6YUU2.1|Y641_THEON RecName: Full=UPF0235 protein TON_0641
 gi|212008746|gb|ACJ16128.1| hypothetical protein, conserved [Thermococcus onnurineus NA1]
          Length = 94

 Score = 43.1 bits (100), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 47/85 (55%), Gaps = 7/85 (8%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG---VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           I ++ +P +K   I  V DE  G   V+I AP  +G+AN  ++ + S +LG    +V I 
Sbjct: 13  ILLYVQPKAKKNEIEGV-DEWRGRLKVKIKAPPVEGKANKEVVRFFSKMLGT---EVEII 68

Query: 99  SGSKSRDKIVIVEEITPENVLNSLG 123
            G  SR+K ++V+  + + VL  LG
Sbjct: 69  RGGTSREKDLLVKGFSSKEVLKKLG 93


>gi|374336915|ref|YP_005093602.1| hypothetical protein GU3_15495 [Oceanimonas sp. GK1]
 gi|372986602|gb|AEY02852.1| hypothetical protein GU3_15495 [Oceanimonas sp. GK1]
          Length = 103

 Score = 43.1 bits (100), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 21/73 (28%), Positives = 42/73 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + ++++ +P S   +      + + V I AP  DG+ANA LL++++   GV + +V + +
Sbjct: 15  LYLSLYLQPRSSRDAFLGRHGDELKVAITAPPVDGQANAHLLKWLAKECGVAKSRVELVA 74

Query: 100 GSKSRDKIVIVEE 112
           G  SR K V+++ 
Sbjct: 75  GDSSRHKRVVIDH 87


>gi|386390193|ref|ZP_10074989.1| TIGR00251 family protein [Haemophilus paraphrohaemolyticus HK411]
 gi|385693877|gb|EIG24509.1| TIGR00251 family protein [Haemophilus paraphrohaemolyticus HK411]
          Length = 98

 Score = 43.1 bits (100), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 36/74 (48%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D  + + I +P  DG ANA LL+Y+S +  V +  + 
Sbjct: 10  PDGIRLRIFLQPKASRDQIIGLHDGELKIAITSPPVDGAANAHLLKYLSKLFKVPKSSIV 69

Query: 97  IGSGSKSRDKIVIV 110
           +  G   R K + V
Sbjct: 70  LEKGELQRHKQIFV 83


>gi|29839573|sp|Q8TIP5.2|Y4097_METAC RecName: Full=UPF0235 protein MA_4097
          Length = 109

 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 50/89 (56%), Gaps = 3/89 (3%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEA---VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           S + I I   PGS+S S+    +E    + V++   A+ G+AN  L+E ++ + G+   +
Sbjct: 16  SGIIIEIEVTPGSRSLSVPSGYNEWRKRIAVKLTKNAQKGKANEQLIESLAELFGISSSE 75

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSLG 123
           + I SG+ S  K ++++ I+ +  ++ LG
Sbjct: 76  ILINSGATSSKKSLLIKGISYQQAVSVLG 104


>gi|152991960|ref|YP_001357681.1| hypothetical protein SUN_0364 [Sulfurovum sp. NBC37-1]
 gi|151423821|dbj|BAF71324.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
          Length = 95

 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           + VS+ I A+P +      D+  E A+ ++I APA +G AN  L+++++    V +  + 
Sbjct: 8   TKVSMRIKAQPAASRNEFCDIYGEDAIKIRIKAPAVEGAANKELMKFLAKSFKVPKSDII 67

Query: 97  IGSGSKSRDKIV 108
             SG  S+ KIV
Sbjct: 68  FKSGQNSKIKIV 79


>gi|397781473|ref|YP_006545946.1| hypothetical protein BN140_2307 [Methanoculleus bourgensis MS2]
 gi|396939975|emb|CCJ37230.1| hypothetical protein BN140_2307 [Methanoculleus bourgensis MS2]
          Length = 105

 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 46/88 (52%), Gaps = 3/88 (3%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDE---AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           S V+I +    G++  S     +E   ++  Q+ APA DG+AN A+++ ++   GV R  
Sbjct: 13  SGVTILLDVTAGARRTSFPAGYNEWRKSIRCQVAAPAVDGKANRAIIDLIAETFGVSRAD 72

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSL 122
           VSI SG  S  K V +  ++   +L  L
Sbjct: 73  VSIISGHTSTSKTVAISGLSKPEILKLL 100


>gi|237747224|ref|ZP_04577704.1| predicted protein [Oxalobacter formigenes HOxBLS]
 gi|229378575|gb|EEO28666.1| predicted protein [Oxalobacter formigenes HOxBLS]
          Length = 100

 Score = 43.1 bits (100), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 47/86 (54%), Gaps = 2/86 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +   P ++   I     E + +++ A   DG+AN AL++ ++  L V R+QVSI  G 
Sbjct: 15  IAVQVIPNARKSEIVSSEGETLRIRLQAQPVDGKANEALVQLLAKKLRVPRKQVSITHGL 74

Query: 102 KSRDKI--VIVEEITPENVLNSLGKA 125
            ++ K+  VIV + + E+++  L  A
Sbjct: 75  ANKRKLLEVIVSDRSQEDIVKQLQPA 100


>gi|323702139|ref|ZP_08113806.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans
           DSM 574]
 gi|333924455|ref|YP_004498035.1| hypothetical protein Desca_2288 [Desulfotomaculum carboxydivorans
           CO-1-SRB]
 gi|323532826|gb|EGB22698.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans
           DSM 574]
 gi|333750016|gb|AEF95123.1| UPF0235 protein yggU [Desulfotomaculum carboxydivorans CO-1-SRB]
          Length = 95

 Score = 43.1 bits (100), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 40/76 (52%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S V + +  +P +   S+    + A+ V++ AP  DG AN A   +++ V GV +  V I
Sbjct: 9   SGVVLRVRVQPRAAKNSLAGEMEGALKVRLTAPPVDGAANEACCRFLAEVFGVAKSNVEI 68

Query: 98  GSGSKSRDKIVIVEEI 113
            SG   R+K+V V  I
Sbjct: 69  ISGHTGRNKVVRVAGI 84


>gi|428768766|ref|YP_007160556.1| hypothetical protein Cyan10605_0367 [Cyanobacterium aponinum PCC
           10605]
 gi|428683045|gb|AFZ52512.1| UPF0235 protein yggU [Cyanobacterium aponinum PCC 10605]
          Length = 72

 Score = 43.1 bits (100), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 38/69 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I++  KP SK  +I    ++   V + +P  DG+AN  L+  ++   GVK+ QV I S
Sbjct: 1   MKISVQVKPRSKKQTIEKKDNDTWIVNLKSPPVDGKANQELITVIAKQFGVKKSQVIIKS 60

Query: 100 GSKSRDKIV 108
           G  S  KI+
Sbjct: 61  GLSSPKKII 69


>gi|153874132|ref|ZP_02002465.1| conserved hypothetical protein [Beggiatoa sp. PS]
 gi|152069404|gb|EDN67535.1| conserved hypothetical protein [Beggiatoa sp. PS]
          Length = 91

 Score = 43.1 bits (100), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 40/70 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +++H +P S   S+  V  + + V+I A   DG+ANA + +  S   GV + ++ I +G 
Sbjct: 13  LSVHVQPRSSQTSVVGVHGDRLKVKIMAAPVDGKANAEVCKLFSKQFGVAKSKIIIENGH 72

Query: 102 KSRDKIVIVE 111
            SRDK + ++
Sbjct: 73  TSRDKRICIK 82


>gi|334116614|ref|ZP_08490706.1| UPF0235 protein yggU [Microcoleus vaginatus FGP-2]
 gi|333461434|gb|EGK90039.1| UPF0235 protein yggU [Microcoleus vaginatus FGP-2]
          Length = 80

 Score = 43.1 bits (100), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 36/67 (53%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
            T+  KP SK  SI +  D ++   + +P  DG+AN  L+  ++    V +  + I SG 
Sbjct: 4   FTVKVKPNSKQQSIAEQPDGSLTAHLKSPPVDGKANQELIVLLAKKFKVPKSAIGIKSGF 63

Query: 102 KSRDKIV 108
            SR+K+V
Sbjct: 64  FSRNKLV 70


>gi|332288575|ref|YP_004419427.1| hypothetical protein UMN179_00494 [Gallibacterium anatis UMN179]
 gi|330431471|gb|AEC16530.1| hypothetical protein UMN179_00494 [Gallibacterium anatis UMN179]
          Length = 95

 Score = 42.7 bits (99), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I  +P +    I  +  + + + I AP  DG+ANA LL+++S    V + Q+ +
Sbjct: 8   NGLLLNIILQPKAGKDQIVGLYGDELKITITAPPIDGKANAHLLKFLSKQFKVAKTQIEL 67

Query: 98  GSGSKSRDKIVIV--EEITPENVLNSL 122
             G  SR K V +   E  P+ +L+ L
Sbjct: 68  RKGELSRHKQVFIPSPEQIPQPILDLL 94


>gi|21674646|ref|NP_662711.1| hypothetical protein CT1832 [Chlorobium tepidum TLS]
 gi|29839718|sp|Q8KBF5.1|Y1832_CHLTE RecName: Full=UPF0235 protein CT1832
 gi|21647849|gb|AAM73053.1| conserved hypothetical protein [Chlorobium tepidum TLS]
          Length = 105

 Score = 42.7 bits (99), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 43/84 (51%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           +V +++  +P S    +  +  E + + + +   D  AN    E ++  LGV R  VS+ 
Sbjct: 10  AVCLSVRVQPRSSKSGVAGMYGEQLKICLKSAPVDNAANKECCELLAKALGVPRSSVSVM 69

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            G+ SR K++ VE +TP  V  +L
Sbjct: 70  KGASSRSKVLKVEGVTPAAVREAL 93


>gi|416893474|ref|ZP_11924662.1| hypothetical protein ATCC33389_1782 [Aggregatibacter aphrophilus
           ATCC 33389]
 gi|347814028|gb|EGY30680.1| hypothetical protein ATCC33389_1782 [Aggregatibacter aphrophilus
           ATCC 33389]
          Length = 97

 Score = 42.7 bits (99), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 39/72 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +
Sbjct: 9   ADLRLRIFLQPKAAKDHIVGLHDDELKISITAPPIDGQANAHLLKFLSKLFKVPKSSIVL 68

Query: 98  GSGSKSRDKIVI 109
             G  +R K V+
Sbjct: 69  EKGELNRHKQVL 80


>gi|20092890|ref|NP_618965.1| hypothetical protein MA4097 [Methanosarcina acetivorans C2A]
 gi|19918198|gb|AAM07445.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 123

 Score = 42.7 bits (99), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 50/89 (56%), Gaps = 3/89 (3%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEA---VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           S + I I   PGS+S S+    +E    + V++   A+ G+AN  L+E ++ + G+   +
Sbjct: 30  SGIIIEIEVTPGSRSLSVPSGYNEWRKRIAVKLTKNAQKGKANEQLIESLAELFGISSSE 89

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSLG 123
           + I SG+ S  K ++++ I+ +  ++ LG
Sbjct: 90  ILINSGATSSKKSLLIKGISYQQAVSVLG 118


>gi|119498429|ref|XP_001265972.1| YggU family protein [Neosartorya fischeri NRRL 181]
 gi|119414136|gb|EAW24075.1| YggU family protein [Neosartorya fischeri NRRL 181]
          Length = 133

 Score = 42.7 bits (99), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 45/95 (47%), Gaps = 4/95 (4%)

Query: 21  IKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANA 78
           I TN  N P    L    S+ I  H KP + S    I  V  E V V + A  ++GEANA
Sbjct: 15  ITTNSRNKPQ--HLPKKYSLQIACHVKPNASSSREGIIAVGAEKVDVCVAAVPRNGEANA 72

Query: 79  ALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEI 113
           A+    + +  V +    +  G KSRDK + + ++
Sbjct: 73  AVSRVFAQIFDVPKSNAEVIRGLKSRDKTLCITDL 107


>gi|384916266|ref|ZP_10016435.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
           SolV]
 gi|384526319|emb|CCG92308.1| conserved hypothetical protein [Methylacidiphilum fumariolicum
           SolV]
          Length = 99

 Score = 42.7 bits (99), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 45/86 (52%), Gaps = 4/86 (4%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +S  + I     +K   +  +  EA+ +++ AP  +G+AN  LL ++S  L V +  + I
Sbjct: 2   ASARLLIKVLANAKKTELVGMHGEAIKIKLSAPPVEGKANELLLSFLSDRLEVPKGLLRI 61

Query: 98  GSGSKSRDKIVIVEEI----TPENVL 119
             G KSR K +++ E     +P++ L
Sbjct: 62  EKGEKSRLKTIVINEWLEKDSPQDFL 87


>gi|292493759|ref|YP_003529198.1| hypothetical protein Nhal_3796 [Nitrosococcus halophilus Nc4]
 gi|291582354|gb|ADE16811.1| protein of unknown function DUF167 [Nitrosococcus halophilus Nc4]
          Length = 102

 Score = 42.7 bits (99), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 41/74 (55%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ I I  +P +    I     + + V+I AP  +G+ANA L+ +++    V + QV + 
Sbjct: 12  ALIIQIRLQPRASCDEIIGPHGDRLKVRITAPPVEGKANADLIRFLAKTFRVSKSQVRLL 71

Query: 99  SGSKSRDKIVIVEE 112
           SG+  RDK V +E+
Sbjct: 72  SGATGRDKRVCIEK 85


>gi|386285479|ref|ZP_10062694.1| hypothetical protein SULAR_09564 [Sulfurovum sp. AR]
 gi|385343590|gb|EIF50311.1| hypothetical protein SULAR_09564 [Sulfurovum sp. AR]
          Length = 97

 Score = 42.7 bits (99), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 1/75 (1%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSD-EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           +   +VS+ I A+P +      +V D +A+ ++I APA +G AN  L+++++    V + 
Sbjct: 5   IEDDTVSLRIKAQPNASKNEFCEVYDNDAIKIRIKAPAVEGAANKELVKFLAKSFKVSKS 64

Query: 94  QVSIGSGSKSRDKIV 108
            +   +G  S+ KIV
Sbjct: 65  DILFKTGQHSKIKIV 79


>gi|398975871|ref|ZP_10685897.1| TIGR00251 family protein [Pseudomonas sp. GM25]
 gi|398139863|gb|EJM28851.1| TIGR00251 family protein [Pseudomonas sp. GM25]
          Length = 97

 Score = 42.7 bits (99), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 38/67 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P ++S     +  + + +++ AP  +G+ANA L+ +++   GV + QVS+  G 
Sbjct: 13  LECHLQPAARSDDFCGLHGDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQVSLIGGE 72

Query: 102 KSRDKIV 108
            +R K V
Sbjct: 73  LNRQKRV 79


>gi|332524657|ref|ZP_08400856.1| hypothetical protein RBXJA2T_02622 [Rubrivivax benzoatilyticus JA2]
 gi|332107965|gb|EGJ09189.1| hypothetical protein RBXJA2T_02622 [Rubrivivax benzoatilyticus JA2]
          Length = 103

 Score = 42.7 bits (99), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 24/96 (25%), Positives = 47/96 (48%), Gaps = 1/96 (1%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P C+     ++  + +   P ++      + D A+ V+++AP  DG+AN  LL +++  L
Sbjct: 5   PPCLSAAGAAASRLRVAVVPNARRTGADGLHDGALRVRLNAPPVDGKANETLLAWLADEL 64

Query: 89  GVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGK 124
            + +R V +  G   R K + ++   PE V   L +
Sbjct: 65  DLPKRAVRLTHGQTGRRKTIELDA-APEAVAAWLAR 99


>gi|340789054|ref|YP_004754519.1| hypothetical protein CFU_3872 [Collimonas fungivorans Ter331]
 gi|340554321|gb|AEK63696.1| hypothetical protein CFU_3872 [Collimonas fungivorans Ter331]
          Length = 106

 Score = 42.7 bits (99), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 6/87 (6%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + +   P +K   I    +  + +++ A   +G+AN AL+ Y++ +L V R QV I  
Sbjct: 17  VRLALQVTPNAKKSQIVGPLENVLKLRLQAQPIEGKANEALVRYLAELLDVPRSQVVITH 76

Query: 100 GSKSRDKIVIVEEITP----ENVLNSL 122
           G  +R K  IVE + P    E V+ SL
Sbjct: 77  GLANRRK--IVEALGPSLKAEAVMRSL 101


>gi|75910098|ref|YP_324394.1| hypothetical protein Ava_3894 [Anabaena variabilis ATCC 29413]
 gi|123608489|sp|Q3M687.1|Y3894_ANAVT RecName: Full=UPF0235 protein Ava_3894
 gi|75703823|gb|ABA23499.1| Protein of unknown function DUF167 [Anabaena variabilis ATCC 29413]
          Length = 75

 Score = 42.7 bits (99), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 20/68 (29%), Positives = 38/68 (55%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I +  D ++ V + +P  DG+AN  L++ ++    V +  ++I SG  S
Sbjct: 5   VKVKPNSKQQKIAEQDDGSLTVHLKSPPVDGKANEELIKLLAEKFAVPKSHITIKSGLSS 64

Query: 104 RDKIVIVE 111
           R K++ ++
Sbjct: 65  RQKLIEID 72


>gi|372279206|ref|ZP_09515242.1| hypothetical protein OS124_06108 [Oceanicola sp. S124]
          Length = 88

 Score = 42.7 bits (99), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 39/74 (52%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +   I +   P +    IT  +D  + V +  P +DG+AN A+   ++  LG+ + ++S+
Sbjct: 14  TGTEIAVRVTPRASRDRITREADGTIRVYVTTPPEDGKANEAVRRLLAKALGLAKSRLSL 73

Query: 98  GSGSKSRDKIVIVE 111
             G  +RDK+  VE
Sbjct: 74  VRGQTARDKLFRVE 87


>gi|365847856|ref|ZP_09388338.1| TIGR00251 family protein [Yokenella regensburgei ATCC 43003]
 gi|364571712|gb|EHM49289.1| TIGR00251 family protein [Yokenella regensburgei ATCC 43003]
          Length = 96

 Score = 42.7 bits (99), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 44/80 (55%), Gaps = 2/80 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + ++ +P +   SI  +  + + V I AP  DG+ANA L++Y++    V + QV I 
Sbjct: 10  ALVLRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLVKYLAKQFKVAKSQVIIE 69

Query: 99  SGSKSRDKIVIVEEITPENV 118
            G   R K V +  + P+N+
Sbjct: 70  KGELGRHKQVRI--MNPQNI 87


>gi|333994253|ref|YP_004526866.1| hypothetical protein TREAZ_1008 [Treponema azotonutricium ZAS-9]
 gi|333735277|gb|AEF81226.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
          Length = 104

 Score = 42.7 bits (99), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 41/69 (59%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + ++I A P S    + +V +  + V+I A  +DG+AN  L  +++  LG+ ++ V++ +
Sbjct: 12  LYLSIKAVPSSSKSQLGEVKEGRLKVKIAAAPEDGKANEELRSFLAKSLGLAKKDVTLTA 71

Query: 100 GSKSRDKIV 108
           G KSR K +
Sbjct: 72  GEKSRLKTI 80


>gi|374287486|ref|YP_005034571.1| hypothetical protein BMS_0696 [Bacteriovorax marinus SJ]
 gi|301166027|emb|CBW25601.1| conserved hypothetical protein [Bacteriovorax marinus SJ]
          Length = 109

 Score = 42.7 bits (99), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 2/69 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S  + I AKPG+K    + V DE  + + I     DG+AN A ++Y+++ L + +  VS+
Sbjct: 14  SFELNIWAKPGAK-VEKSIVGDEGEIIIYIKERPIDGQANKAFIKYLAAQLSITKSSVSL 72

Query: 98  GSGSKSRDK 106
             GSKSR K
Sbjct: 73  SRGSKSRFK 81


>gi|414076863|ref|YP_006996181.1| hypothetical protein ANA_C11595 [Anabaena sp. 90]
 gi|413970279|gb|AFW94368.1| hypothetical protein ANA_C11595 [Anabaena sp. 90]
          Length = 74

 Score = 42.4 bits (98), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 41/70 (58%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP S+   I +++D ++ V + +P  DG+AN  L++ ++    V +  + I SG+ S
Sbjct: 5   VKVKPNSQQQKIEELADGSLTVHLKSPPVDGKANEELIKLLAKKFDVAKSSIRIKSGTTS 64

Query: 104 RDKIVIVEEI 113
           R K++ ++ I
Sbjct: 65  RQKVIEIDGI 74


>gi|59711034|ref|YP_203810.1| hypothetical protein VF_0427 [Vibrio fischeri ES114]
 gi|423685140|ref|ZP_17659948.1| hypothetical protein VFSR5_0414 [Vibrio fischeri SR5]
 gi|59479135|gb|AAW84922.1| conserved protein [Vibrio fischeri ES114]
 gi|371495641|gb|EHN71236.1| hypothetical protein VFSR5_0414 [Vibrio fischeri SR5]
          Length = 95

 Score = 42.4 bits (98), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 22/81 (27%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  E + + I AP  DG+ANA L++Y S +  V + ++++  G 
Sbjct: 12  LRLYLQPKASRDQIVGIHGEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGKITVEKGE 71

Query: 102 KSRDKIVIVEEITPENVLNSL 122
            +R K V +   +PE + N +
Sbjct: 72  LNRHKQVRIH--SPELIPNEI 90


>gi|403224187|dbj|BAM42317.1| uncharacterized protein TOT_040000685 [Theileria orientalis strain
            Shintoku]
          Length = 1360

 Score = 42.4 bits (98), Expect = 0.055,   Method: Composition-based stats.
 Identities = 20/61 (32%), Positives = 34/61 (55%)

Query: 42   ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
            + ++ KPG+K   +    +  + +QI AP +DGE N ALLEY++ +  +   Q S  +  
Sbjct: 1291 LKVNVKPGAKQTQVVGEVEGLLSLQISAPPRDGECNKALLEYVAELCKLSAYQFSFSAFE 1350

Query: 102  K 102
            K
Sbjct: 1351 K 1351


>gi|398332965|ref|ZP_10517670.1| hypothetical protein LalesM3_16160 [Leptospira alexanderi serovar
           Manhao 3 str. L 60]
          Length = 73

 Score = 42.4 bits (98), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T+  KP SK        D ++ + +   A +G+AN A++E +S  + + +R++ I S
Sbjct: 1   MKFTVRVKPNSKKIFFRKEEDGSITIAVRELALEGKANEAVIEMISREMEIPKRKIRIVS 60

Query: 100 GSKSRDKIVIVEEITP 115
           G KS+ K +   EI P
Sbjct: 61  GEKSKRKTI---EIDP 73


>gi|165976805|ref|YP_001652398.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae
           serovar 3 str. JL03]
 gi|226734144|sp|B0BQW9.1|Y1398_ACTPJ RecName: Full=UPF0235 protein APJL_1398
 gi|165876906|gb|ABY69954.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae
           serovar 3 str. JL03]
          Length = 98

 Score = 42.4 bits (98), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 20/68 (29%), Positives = 33/68 (48%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D  + + I AP  DG ANA LL+Y+S +  V +  + 
Sbjct: 10  PYGIRLRIFLQPKASRDQIVGLHDSELKIAITAPPVDGAANAHLLKYLSKLFKVPKSSIV 69

Query: 97  IGSGSKSR 104
           +  G   R
Sbjct: 70  LEKGELQR 77


>gi|307261870|ref|ZP_07543532.1| hypothetical protein appser12_14270 [Actinobacillus
           pleuropneumoniae serovar 12 str. 1096]
 gi|306868417|gb|EFN00232.1| hypothetical protein appser12_14270 [Actinobacillus
           pleuropneumoniae serovar 12 str. 1096]
          Length = 97

 Score = 42.4 bits (98), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 36/76 (47%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + I  +P +    I  + D  + + I A   DG ANA LL+Y+S +  V +  + 
Sbjct: 10  PYGIRLRIFLQPKASRDQIVGLHDNELKIAITALPVDGAANAHLLKYLSKLFKVPKSSIV 69

Query: 97  IGSGSKSRDKIVIVEE 112
           +  G   R K + V E
Sbjct: 70  LEKGELQRHKQLFVPE 85


>gi|406932365|gb|EKD67381.1| hypothetical protein ACD_48C00442G0003 [uncultured bacterium]
          Length = 75

 Score = 42.4 bits (98), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 1/69 (1%)

Query: 41  SITIHAKPGSKSCSIT-DVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++TI   P S    I   + D  + +++ AP  DGEAN  L+E +S   GV + +V I  
Sbjct: 3   TLTIKVLPRSSRNEIVGTLPDGTLKIKLTAPPVDGEANKKLIELLSKEFGVAKSKVKIVK 62

Query: 100 GSKSRDKIV 108
           G  SR K+V
Sbjct: 63  GETSRMKVV 71


>gi|430746371|ref|YP_007205500.1| hypothetical protein Sinac_5674 [Singulisphaera acidiphila DSM
           18658]
 gi|430018091|gb|AGA29805.1| hypothetical protein Sinac_5674 [Singulisphaera acidiphila DSM
           18658]
          Length = 103

 Score = 42.4 bits (98), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 44/86 (51%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P    + + A+PG++  +I      A+ V +    + G+AN A+   ++  LG K  Q+ 
Sbjct: 8   PRGTVLPVRARPGARKDAILGEHAGALRVAVATKPEAGKANVAIQGVLAQALGFKASQID 67

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSL 122
           + SG  SR+K  ++   TPE V + L
Sbjct: 68  LLSGQTSREKRFLIVGETPEEVRSRL 93


>gi|329890913|ref|ZP_08269256.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568]
 gi|328846214|gb|EGF95778.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568]
          Length = 93

 Score = 42.4 bits (98), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 36/58 (62%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           V++ A   +GEANAAL  +++  LGV +R V++  G +SR K++ V+ +T   V   L
Sbjct: 33  VRVRAQPVEGEANAALTAFLAKALGVPKRDVALARGGQSRLKMIEVDGLTDAEVRARL 90


>gi|33359419|ref|NP_877861.1| hypothetical protein PH1669.1n [Pyrococcus horikoshii OT3]
          Length = 95

 Score = 42.4 bits (98), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 43/83 (51%), Gaps = 5/83 (6%)

Query: 42  ITIHAKPGSKSCSITDVSD--EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           I +  +P SK   I  V +    + + I AP   GEAN  L++++S +LG K   V I  
Sbjct: 14  IQVIVRPNSKENKIEGVDNWKNRIRISIKAPPVKGEANKELIKFLSKILGAK---VEIIR 70

Query: 100 GSKSRDKIVIVEEITPENVLNSL 122
           G  SR+K ++V+ I  E V   L
Sbjct: 71  GETSREKDLLVKGIKLEEVKKRL 93


>gi|427737397|ref|YP_007056941.1| hypothetical protein Riv7116_3954 [Rivularia sp. PCC 7116]
 gi|427372438|gb|AFY56394.1| TIGR00251 family protein [Rivularia sp. PCC 7116]
          Length = 74

 Score = 42.4 bits (98), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 20/68 (29%), Positives = 39/68 (57%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I +  D ++ +++ +P  DG+AN  L++ +S    V +  + I SG+ S
Sbjct: 5   VKVKPNSKKQEIIEEDDGSLTIRLKSPPVDGKANEELIKILSKKFNVPKSYIRIKSGASS 64

Query: 104 RDKIVIVE 111
           R K++ ++
Sbjct: 65  RQKLIEID 72


>gi|147678156|ref|YP_001212371.1| hypothetical protein PTH_1821 [Pelotomaculum thermopropionicum SI]
 gi|189039033|sp|A5D180.1|Y1821_PELTS RecName: Full=UPF0235 protein PTH_1821
 gi|146274253|dbj|BAF60002.1| uncharacterized conserved protein [Pelotomaculum thermopropionicum
           SI]
          Length = 95

 Score = 42.4 bits (98), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 44/84 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + +  +P +    +  + ++A+ +++ AP  DGEAN A   +++  L +   +V I S
Sbjct: 11  VLLKVRVQPRAARNQVAGLYEDALKIRLTAPPVDGEANEACRAFLADSLSLPPSKVEIVS 70

Query: 100 GSKSRDKIVIVEEITPENVLNSLG 123
           G  SR K+V +  +  E V  + G
Sbjct: 71  GHASRTKVVKIAGVGAEKVRRAFG 94


>gi|282891527|ref|ZP_06300019.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|338175146|ref|YP_004651956.1| hypothetical protein PUV_11520 [Parachlamydia acanthamoebae UV-7]
 gi|281498618|gb|EFB40945.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|336479504|emb|CCB86102.1| UPF0235 protein Swol_0959 [Parachlamydia acanthamoebae UV-7]
          Length = 93

 Score = 42.4 bits (98), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 20/87 (22%), Positives = 49/87 (56%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S  ++ I   P +   +I    ++ + + I +  + G+AN A++++++  LG++++Q+ I
Sbjct: 4   SQTTLAIKVIPNASRNAILGWENDELKMYIASVPEKGKANEAVIKFLAKFLGLRKQQIQI 63

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLGK 124
             G  +R KI+ +E I    +++ + K
Sbjct: 64  IRGETNRHKILQIEGIDKTKLIDYINK 90


>gi|53728760|ref|ZP_00135410.2| COG1872: Uncharacterized conserved protein [Actinobacillus
           pleuropneumoniae serovar 1 str. 4074]
 gi|126208844|ref|YP_001054069.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae
           serovar 5b str. L20]
 gi|303250781|ref|ZP_07336976.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
 gi|303252447|ref|ZP_07338612.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae
           serovar 2 str. 4226]
 gi|307246303|ref|ZP_07528382.1| hypothetical protein appser1_15050 [Actinobacillus pleuropneumoniae
           serovar 1 str. 4074]
 gi|307248416|ref|ZP_07530437.1| hypothetical protein appser2_13900 [Actinobacillus pleuropneumoniae
           serovar 2 str. S1536]
 gi|307250644|ref|ZP_07532582.1| hypothetical protein appser4_14180 [Actinobacillus pleuropneumoniae
           serovar 4 str. M62]
 gi|307253024|ref|ZP_07534909.1| hypothetical protein appser6_15320 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
 gi|307255287|ref|ZP_07537100.1| hypothetical protein appser9_15200 [Actinobacillus pleuropneumoniae
           serovar 9 str. CVJ13261]
 gi|307257450|ref|ZP_07539217.1| hypothetical protein appser10_14450 [Actinobacillus
           pleuropneumoniae serovar 10 str. D13039]
 gi|307259722|ref|ZP_07541443.1| hypothetical protein appser11_15170 [Actinobacillus
           pleuropneumoniae serovar 11 str. 56153]
 gi|166200384|sp|A3N228.1|Y1380_ACTP2 RecName: Full=UPF0235 protein APL_1380
 gi|126097636|gb|ABN74464.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae
           serovar 5b str. L20]
 gi|302648720|gb|EFL78911.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae
           serovar 2 str. 4226]
 gi|302650386|gb|EFL80547.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
 gi|306852773|gb|EFM84999.1| hypothetical protein appser1_15050 [Actinobacillus pleuropneumoniae
           serovar 1 str. 4074]
 gi|306855058|gb|EFM87240.1| hypothetical protein appser2_13900 [Actinobacillus pleuropneumoniae
           serovar 2 str. S1536]
 gi|306857316|gb|EFM89434.1| hypothetical protein appser4_14180 [Actinobacillus pleuropneumoniae
           serovar 4 str. M62]
 gi|306859482|gb|EFM91510.1| hypothetical protein appser6_15320 [Actinobacillus pleuropneumoniae
           serovar 6 str. Femo]
 gi|306861736|gb|EFM93717.1| hypothetical protein appser9_15200 [Actinobacillus pleuropneumoniae
           serovar 9 str. CVJ13261]
 gi|306864030|gb|EFM95946.1| hypothetical protein appser10_14450 [Actinobacillus
           pleuropneumoniae serovar 10 str. D13039]
 gi|306866190|gb|EFM98057.1| hypothetical protein appser11_15170 [Actinobacillus
           pleuropneumoniae serovar 11 str. 56153]
          Length = 97

 Score = 42.4 bits (98), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 23/80 (28%), Positives = 38/80 (47%)

Query: 33  RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKR 92
           R+  P  + + I  +P +    I  + D  + + I A   DG ANA LL+Y+S +  V +
Sbjct: 6   RIENPCGIRLRIFLQPKASRDQIVGLHDNELKIAITALPVDGAANAHLLKYLSKLFKVPK 65

Query: 93  RQVSIGSGSKSRDKIVIVEE 112
             + +  G   R K + V E
Sbjct: 66  SSIVLEKGELQRHKQLFVPE 85


>gi|414154545|ref|ZP_11410863.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
           = DSM 18033]
 gi|411453942|emb|CCO08767.1| conserved hypothetical protein [Desulfotomaculum hydrothermale Lam5
           = DSM 18033]
          Length = 99

 Score = 42.4 bits (98), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 43/85 (50%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + V I +  +P +    +      AV +++ AP  DG AN A  ++++ + GV +  V+I
Sbjct: 9   TGVVIKVRVQPRASQNGLAGQLAGAVKIRLTAPPVDGAANEACCKFLAQLFGVAKNNVTI 68

Query: 98  GSGSKSRDKIVIVEEITPENVLNSL 122
            +G   R+K V +E I     L ++
Sbjct: 69  IAGHTGRNKTVRIEGINQAQALAAI 93


>gi|374612575|ref|ZP_09685352.1| protein of unknown function DUF167 [Mycobacterium tusciae JS617]
 gi|373547486|gb|EHP74211.1| protein of unknown function DUF167 [Mycobacterium tusciae JS617]
          Length = 75

 Score = 42.0 bits (97), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 41  SITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +I +  KPGS+   + + +D   + + +   A DG+AN A+++ ++  LGV R +V++ S
Sbjct: 4   TIVVRVKPGSRKGPLVEATDNGELTIHVQERAVDGKANEAVVKLLAKHLGVPRSRVALVS 63

Query: 100 GSKSRDK 106
           G+ SR K
Sbjct: 64  GATSRVK 70


>gi|386826307|ref|ZP_10113414.1| TIGR00251 family protein [Beggiatoa alba B18LD]
 gi|386427191|gb|EIJ41019.1| TIGR00251 family protein [Beggiatoa alba B18LD]
          Length = 95

 Score = 42.0 bits (97), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 17/65 (26%), Positives = 37/65 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +++H +P +   +I  V  + + ++  A   DG+ANA L++ ++   GV +  +++  G 
Sbjct: 13  LSVHVQPRASQTNIVGVHQDRLKIRTTATPVDGQANAELIKLLAKTFGVAKSHITLLQGD 72

Query: 102 KSRDK 106
            SR+K
Sbjct: 73  TSREK 77


>gi|17230688|ref|NP_487236.1| hypothetical protein asl3196 [Nostoc sp. PCC 7120]
 gi|29839730|sp|Q8YS95.1|Y3196_ANASP RecName: Full=UPF0235 protein asl3196
 gi|17132291|dbj|BAB74895.1| asl3196 [Nostoc sp. PCC 7120]
          Length = 75

 Score = 42.0 bits (97), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 20/68 (29%), Positives = 38/68 (55%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I +  D ++ V + +P  DG+AN  L++ ++    V +  ++I SG  S
Sbjct: 5   VKVKPNSKQQKIAEQDDGSLTVHLKSPPVDGKANEELIKLLAEKFDVPKSHITIKSGLSS 64

Query: 104 RDKIVIVE 111
           + K++ +E
Sbjct: 65  KQKLIEIE 72


>gi|189219304|ref|YP_001939945.1| hypothetical protein Minf_1293 [Methylacidiphilum infernorum V4]
 gi|189186162|gb|ACD83347.1| Uncharacterized conserved protein [Methylacidiphilum infernorum V4]
          Length = 97

 Score = 42.0 bits (97), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 22/73 (30%), Positives = 40/73 (54%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  +  +K   I     +A+ +++ AP  +G+AN ALL ++S  L V +R + I  G 
Sbjct: 6   LFVKVQANAKKTEICGSYADALKIRLSAPPVEGKANDALLSFLSLRLCVPKRLIRIEKGE 65

Query: 102 KSRDKIVIVEEIT 114
           K+  K V++E  T
Sbjct: 66  KNSKKTVVIEGWT 78


>gi|430760632|ref|YP_007216489.1| protein of unknown function DUF167 [Thioalkalivibrio
           nitratireducens DSM 14787]
 gi|430010256|gb|AGA33008.1| protein of unknown function DUF167 [Thioalkalivibrio
           nitratireducens DSM 14787]
          Length = 80

 Score = 42.0 bits (97), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 35/71 (49%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  KPG++   +    D     ++ AP  DG AN  L+  ++   GV RRQ+ I  G+
Sbjct: 10  LQVRVKPGARVSELRPGDDGTWIARVKAPPVDGRANDELVALIARQFGVARRQIRIRRGA 69

Query: 102 KSRDKIVIVEE 112
             R K V V +
Sbjct: 70  SGRSKWVEVAD 80


>gi|456982598|gb|EMG19154.1| hypothetical protein LEP1GSC150_1390 [Leptospira interrogans
           serovar Copenhageni str. LT2050]
          Length = 67

 Score = 42.0 bits (97), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 37/67 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T++ KP SK        D  + + +  PA +G+AN A++E +S  + V + ++ I S
Sbjct: 1   MKFTVYVKPNSKKVFFRKEEDGVLTIAVREPALEGKANEAVIESISKEMKVPKSKIRILS 60

Query: 100 GSKSRDK 106
           G K++ K
Sbjct: 61  GQKTKRK 67


>gi|134298585|ref|YP_001112081.1| hypothetical protein Dred_0717 [Desulfotomaculum reducens MI-1]
 gi|189040591|sp|A4J2F3.1|Y717_DESRM RecName: Full=UPF0235 protein Dred_0717
 gi|134051285|gb|ABO49256.1| protein of unknown function DUF167 [Desulfotomaculum reducens MI-1]
          Length = 94

 Score = 42.0 bits (97), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 42/77 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + V + +  +P +   S+    + A+ V++ AP  DG AN A  ++   + GV + +V I
Sbjct: 9   NGVVVKVRVQPRASKNSLAGEMEGALKVRLTAPPVDGAANEACCKFFGELFGVAKSKVEI 68

Query: 98  GSGSKSRDKIVIVEEIT 114
            +G   R+K+V ++ +T
Sbjct: 69  IAGHTGRNKLVHIQGVT 85


>gi|387773933|ref|ZP_10129215.1| TIGR00251 family protein [Haemophilus parahaemolyticus HK385]
 gi|386903425|gb|EIJ68240.1| TIGR00251 family protein [Haemophilus parahaemolyticus HK385]
          Length = 97

 Score = 42.0 bits (97), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 36/71 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D  + + I AP  DG ANA LL+Y+S +  V +  +++  
Sbjct: 13  IRLRILLQPKASRDQIVGLHDGELKIAITAPPVDGAANAHLLKYLSKLFQVPKSSIALEK 72

Query: 100 GSKSRDKIVIV 110
           G   R K + V
Sbjct: 73  GELQRHKQIFV 83


>gi|350634001|gb|EHA22365.1| hypothetical protein ASPNIDRAFT_51026 [Aspergillus niger ATCC 1015]
          Length = 126

 Score = 42.0 bits (97), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 36  PPSSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P  ++ I+ H KP + S    IT +  + V V + A  + GEANAA+   ++ +  V + 
Sbjct: 22  PLYTLQISCHVKPNASSNREGITAIGTDRVDVCVAAVPRKGEANAAVSRVLAQIFQVPKS 81

Query: 94  QVSIGSGSKSRDKIVIVEEI 113
            V +  G KSR+K + + E+
Sbjct: 82  NVEVIRGLKSREKTLAISEL 101


>gi|218437671|ref|YP_002376000.1| hypothetical protein PCC7424_0673 [Cyanothece sp. PCC 7424]
 gi|226708012|sp|B7KEV7.1|Y673_CYAP7 RecName: Full=UPF0235 protein PCC7424_0673
 gi|218170399|gb|ACK69132.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7424]
          Length = 73

 Score = 42.0 bits (97), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 40/69 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KP +K   I +  D ++ + + +P  +G+AN  L++ ++    V + Q+SI S
Sbjct: 1   MKIQVKVKPNAKHQKIEEAEDGSLIISLKSPPVEGKANQELIKLLAQKYRVTKSQISIQS 60

Query: 100 GSKSRDKIV 108
           G  SR+K++
Sbjct: 61  GLSSRNKLI 69


>gi|429123157|ref|ZP_19183690.1| hypothetical protein A966_02551 [Brachyspira hampsonii 30446]
 gi|426280970|gb|EKV57973.1| hypothetical protein A966_02551 [Brachyspira hampsonii 30446]
          Length = 84

 Score = 42.0 bits (97), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I +    G+KS S     + A  ++I A A DG+AN A++E+++  L +K++ + I  
Sbjct: 2   MNIEVKVTAGAKSNSFK-FENGAYSIRIMAKAIDGKANKAIIEFLADELNIKKKDIEILK 60

Query: 100 GSKSRDKIV 108
           G K+  K++
Sbjct: 61  GEKNSKKLI 69


>gi|325982093|ref|YP_004294495.1| hypothetical protein NAL212_1446 [Nitrosomonas sp. AL212]
 gi|325531612|gb|ADZ26333.1| protein of unknown function DUF167 [Nitrosomonas sp. AL212]
          Length = 113

 Score = 42.0 bits (97), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 20/83 (24%), Positives = 46/83 (55%)

Query: 30  SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLG 89
           S  R    +++ +T+H + G+K      +   A+ +++ A   +G+AN+ L++++++   
Sbjct: 5   SWYRYDGTNNLVLTLHIQTGAKITKAAGLLGGALKIRLAAAPVEGKANSTLIKFLAAQFD 64

Query: 90  VKRRQVSIGSGSKSRDKIVIVEE 112
           V   QV +  G KSR K++++  
Sbjct: 65  VPIGQVRLKQGGKSRHKVIVIHR 87


>gi|387887904|ref|YP_006318202.1| hypothetical protein EBL_c05660 [Escherichia blattae DSM 4481]
 gi|414594855|ref|ZP_11444488.1| hypothetical protein YggU [Escherichia blattae NBRC 105725]
 gi|386922737|gb|AFJ45691.1| hypothetical protein EBL_c05660 [Escherichia blattae DSM 4481]
 gi|403194160|dbj|GAB82140.1| hypothetical protein YggU [Escherichia blattae NBRC 105725]
          Length = 99

 Score = 42.0 bits (97), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 39/71 (54%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   +I  +  + + V I AP  DG+ANA L +Y++    V + QV+I  G 
Sbjct: 13  LRLYIQPKASRDAIVGLYGDELKVAITAPPVDGQANAHLTKYLARQFRVAKSQVTIEKGE 72

Query: 102 KSRDKIVIVEE 112
             R K V++ +
Sbjct: 73  LGRHKQVLISQ 83


>gi|398339401|ref|ZP_10524104.1| hypothetical protein LkirsB1_07789 [Leptospira kirschneri serovar
           Bim str. 1051]
 gi|418678563|ref|ZP_13239837.1| TIGR00251 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|418687431|ref|ZP_13248590.1| TIGR00251 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|418742351|ref|ZP_13298724.1| TIGR00251 family protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
 gi|421090348|ref|ZP_15551142.1| TIGR00251 family protein [Leptospira kirschneri str. 200802841]
 gi|421132601|ref|ZP_15592767.1| TIGR00251 family protein [Leptospira kirschneri str. 2008720114]
 gi|400321753|gb|EJO69613.1| TIGR00251 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|410000926|gb|EKO51552.1| TIGR00251 family protein [Leptospira kirschneri str. 200802841]
 gi|410355984|gb|EKP03359.1| TIGR00251 family protein [Leptospira kirschneri str. 2008720114]
 gi|410737755|gb|EKQ82494.1| TIGR00251 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|410750709|gb|EKR07689.1| TIGR00251 family protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
          Length = 73

 Score = 42.0 bits (97), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 40/69 (57%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +T++ KP SK   I    +  + + +  PA +G+AN A++E +S  + V + ++ I S
Sbjct: 1   MKLTVYVKPNSKKVFIRKEENGVLTIAVREPALEGKANEAVIESISKEMKVPKSKIRILS 60

Query: 100 GSKSRDKIV 108
           G K++ K +
Sbjct: 61  GQKNKKKTI 69


>gi|145253637|ref|XP_001398331.1| yggU family protein [Aspergillus niger CBS 513.88]
 gi|134083900|emb|CAK48804.1| unnamed protein product [Aspergillus niger]
          Length = 126

 Score = 42.0 bits (97), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 36  PPSSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P  ++ I+ H KP + S    IT +  + V V + A  + GEANAA+   ++ +  V + 
Sbjct: 22  PLYTLQISCHVKPNASSNREGITAIGTDRVDVCVAAVPRKGEANAAVSRVLAQIFQVPKS 81

Query: 94  QVSIGSGSKSRDKIVIVEEI 113
            V +  G KSR+K + + E+
Sbjct: 82  NVEVIRGLKSREKTLAISEL 101


>gi|87311293|ref|ZP_01093415.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM
           3645]
 gi|87286033|gb|EAQ77945.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM
           3645]
          Length = 100

 Score = 42.0 bits (97), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 43/80 (53%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           V    V + + A PGSK   I      A+ V + A  +DG+AN A++E ++  L +++ Q
Sbjct: 6   VHDGGVILPVRALPGSKKNEIRGEQQGALKVSVTAAPEDGKANKAIVELLAKKLVLRKSQ 65

Query: 95  VSIGSGSKSRDKIVIVEEIT 114
           + I +G   R K V+V +I 
Sbjct: 66  LEIIAGHTHRQKRVLVGDIV 85


>gi|406908863|gb|EKD49252.1| hypothetical protein ACD_63C00208G0002 [uncultured bacterium]
          Length = 89

 Score = 42.0 bits (97), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 19/80 (23%), Positives = 47/80 (58%), Gaps = 3/80 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + +   P +++  + +++D ++ V +   A+ G+AN AL++ ++   G+++ ++ I  
Sbjct: 13  LKLRVKVIPNARNIRVKEIADGSLKVWLVERAEKGKANKALIKILADYFGLEKGKIKIVK 72

Query: 100 GSKSRDKIVIVEEITPENVL 119
           G KSR+K++   E+   N L
Sbjct: 73  GEKSREKVI---EVCFLNFL 89


>gi|440681011|ref|YP_007155806.1| UPF0235 protein yggU [Anabaena cylindrica PCC 7122]
 gi|428678130|gb|AFZ56896.1| UPF0235 protein yggU [Anabaena cylindrica PCC 7122]
          Length = 74

 Score = 41.6 bits (96), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 38/69 (55%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I +  D ++ V + +P  DG+AN  L++ ++    V +  + I SG  S
Sbjct: 5   VKVKPNSKQQKIAEQEDGSLTVHLKSPPVDGKANEELIKLLAEKFDVPKSSIRIKSGLTS 64

Query: 104 RDKIVIVEE 112
           R K+V +++
Sbjct: 65  RQKLVEIDD 73


>gi|398803252|ref|ZP_10562358.1| TIGR00251 family protein [Polaromonas sp. CF318]
 gi|398097131|gb|EJL87443.1| TIGR00251 family protein [Polaromonas sp. CF318]
          Length = 122

 Score = 41.6 bits (96), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 21/81 (25%), Positives = 43/81 (53%), Gaps = 4/81 (4%)

Query: 30  SCIRLVPPSSVSITIHAKPGSKSCSITDVSDEA----VGVQIDAPAKDGEANAALLEYMS 85
           + +R+     V + +H  P +       +  EA    + +++ AP  DG+AN AL+++++
Sbjct: 14  AFLRMEKNGDVLVDVHVVPNAAKTQAVGLHGEAGQQALRLRLQAPPVDGKANQALVKWLA 73

Query: 86  SVLGVKRRQVSIGSGSKSRDK 106
             LG+ ++ V++  G  SR K
Sbjct: 74  GCLGIPQQAVTLARGETSRRK 94


>gi|254419593|ref|ZP_05033317.1| conserved hypothetical protein [Brevundimonas sp. BAL3]
 gi|196185770|gb|EDX80746.1| conserved hypothetical protein [Brevundimonas sp. BAL3]
          Length = 69

 Score = 41.6 bits (96), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 32/47 (68%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V++ A   DGEANAAL+++++  LGV R  V +  G +SR K+V VE
Sbjct: 4   VRVRARPVDGEANAALVKFLAKALGVSRSSVVLERGGQSRLKMVSVE 50


>gi|57640703|ref|YP_183181.1| hypothetical protein TK0768 [Thermococcus kodakarensis KOD1]
 gi|73921163|sp|Q5JHB2.1|Y768_PYRKO RecName: Full=UPF0235 protein TK0768
 gi|57159027|dbj|BAD84957.1| hypothetical protein, conserved, YggU family [Thermococcus
           kodakarensis KOD1]
          Length = 94

 Score = 41.6 bits (96), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 49/88 (55%), Gaps = 5/88 (5%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEA--VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
             V I I+ +P +K  +I  V      + V+I AP  +G+AN  ++++ S +LG    +V
Sbjct: 9   EGVLIMIYVQPKAKKNAIEGVDGWRGRLKVRIAAPPVEGKANKEVVKFFSKLLGA---EV 65

Query: 96  SIGSGSKSRDKIVIVEEITPENVLNSLG 123
           +I  G  SR+K ++V+ ++ E V   LG
Sbjct: 66  NIVRGETSREKDLLVKGLSVEEVRKKLG 93


>gi|42522071|ref|NP_967451.1| hypothetical protein Bd0463 [Bdellovibrio bacteriovorus HD100]
 gi|39574602|emb|CAE78444.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100]
          Length = 94

 Score = 41.6 bits (96), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 18/69 (26%), Positives = 36/69 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + +  +P S    +    +  + +++ AP  DG+AN  L+E++S +  + +R V +  
Sbjct: 9   VRLHLFIQPKSSKNEVVGPHNGEIKIKLTAPPVDGKANECLIEFLSDLFDIPKRDVHLIK 68

Query: 100 GSKSRDKIV 108
           G   R K+V
Sbjct: 69  GETGRHKVV 77


>gi|415758824|ref|ZP_11481585.1| hypothetical protein D17P3_1082 [Aggregatibacter
           actinomycetemcomitans D17P-3]
 gi|348655205|gb|EGY70679.1| hypothetical protein D17P3_1082 [Aggregatibacter
           actinomycetemcomitans D17P-3]
          Length = 97

 Score = 41.6 bits (96), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 33/57 (57%)

Query: 54  SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
            I  + D+ + + I AP  DG+ANA LL+++S +  V +  + +  G  +R K V++
Sbjct: 25  QIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKVPKSSIVLEKGELNRHKQVLI 81


>gi|401428443|ref|XP_003878704.1| conserved hypothetical protein [Leishmania mexicana
          MHOM/GT/2001/U1103]
 gi|322494953|emb|CBZ30256.1| conserved hypothetical protein [Leishmania mexicana
          MHOM/GT/2001/U1103]
          Length = 204

 Score = 41.6 bits (96), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 5/62 (8%)

Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          C+  V P +  +T+HAKP +++ +    +T    EA  ++I AP  DG+ANA LL Y+  
Sbjct: 14 CLVQVQPGAYLLTVHAKPSARASAFAAPVTPTLTEA-DLRIAAPPVDGQANAELLRYLGE 72

Query: 87 VL 88
          ++
Sbjct: 73 LI 74


>gi|296242831|ref|YP_003650318.1| hypothetical protein Tagg_1097 [Thermosphaera aggregans DSM 11486]
 gi|296095415|gb|ADG91366.1| protein of unknown function DUF167 [Thermosphaera aggregans DSM
           11486]
          Length = 111

 Score = 41.6 bits (96), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 1/86 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + +  KPGS+    T  SDE V  +   PA+ G ANA+L++Y+S  L +   ++ I  
Sbjct: 23  VILQVRVKPGSEPEGFTIESDELV-FRTSEPAERGRANASLIKYLSRELKIPVSKIDIVY 81

Query: 100 GSKSRDKIVIVEEITPENVLNSLGKA 125
           G + + K V++ +   + ++  L + 
Sbjct: 82  GQREKLKKVLIMDEPADKIIEKLARV 107


>gi|358373229|dbj|GAA89828.1| hypothetical protein AKAW_07942 [Aspergillus kawachii IFO 4308]
          Length = 125

 Score = 41.6 bits (96), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 44/77 (57%), Gaps = 2/77 (2%)

Query: 39  SVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           ++ I+ H KP + +    IT ++ + V V + A  + GEANAA+   M+ +  V + +V 
Sbjct: 24  TLQISCHVKPNASNNREGITSITADRVDVCVAAVPRKGEANAAVSRVMAQIFQVPKSKVE 83

Query: 97  IGSGSKSRDKIVIVEEI 113
           +  G KSR+K + + E+
Sbjct: 84  VIRGLKSREKTLAISEL 100


>gi|395236297|ref|ZP_10414494.1| hypothetical protein A936_21497 [Enterobacter sp. Ag1]
 gi|394728928|gb|EJF28948.1| hypothetical protein A936_21497 [Enterobacter sp. Ag1]
          Length = 99

 Score = 41.6 bits (96), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + + ++ +P +   S+  +  + + V I AP  DG+ANA L++Y++    V + QV 
Sbjct: 8   PDGLVLRLYIQPKASRDSLVGMHGDELKVAITAPPVDGQANAHLVKYLAKQFRVAKGQVV 67

Query: 97  IGSGSKSRDKIVIVEEITPENV 118
           I  G   R K V +  I P+ +
Sbjct: 68  IEKGELGRHKQVKI--IGPQQI 87


>gi|300175116|emb|CBK20427.2| unnamed protein product [Blastocystis hominis]
          Length = 152

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 61/98 (62%), Gaps = 1/98 (1%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           L S IR++ PSSVS+++H KP  K   I +  +E + +++ +P  + +AN  + E +S +
Sbjct: 10  LESVIRVLSPSSVSLSVHVKPNCKKTGI-EWEEEQLQIRLSSPPTENKANKEVCEVVSDI 68

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
             + + QVS+  G KSRDK +++  ++ E ++ SL +A
Sbjct: 69  ADIPKSQVSLIRGGKSRDKELMLNGVSSEGLIQSLLQA 106


>gi|146312998|ref|YP_001178072.1| hypothetical protein Ent638_3359 [Enterobacter sp. 638]
 gi|166990826|sp|A4WE91.1|Y3359_ENT38 RecName: Full=UPF0235 protein Ent638_3359
 gi|145319874|gb|ABP62021.1| protein of unknown function DUF167 [Enterobacter sp. 638]
          Length = 96

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  + P+N+
Sbjct: 73  LGRHKQVKI--LNPQNI 87


>gi|419802120|ref|ZP_14327321.1| TIGR00251 family protein [Haemophilus parainfluenzae HK262]
 gi|419845244|ref|ZP_14368522.1| TIGR00251 family protein [Haemophilus parainfluenzae HK2019]
 gi|385191442|gb|EIF38856.1| TIGR00251 family protein [Haemophilus parainfluenzae HK262]
 gi|386416167|gb|EIJ30677.1| TIGR00251 family protein [Haemophilus parainfluenzae HK2019]
          Length = 95

 Score = 41.6 bits (96), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 2/85 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S    V +  + +  
Sbjct: 11  LRLKIILQPKASKDQIVGLHDDELKITITAPPVDGQANAHLLKFLSKTFKVPKSSIVLEK 70

Query: 100 GSKSRDKIVIV--EEITPENVLNSL 122
           G  +R K V +   ++ P  + N L
Sbjct: 71  GELNRHKQVWIPSPKLIPSEIQNLL 95


>gi|406979039|gb|EKE00895.1| hypothetical protein ACD_21C00256G0007 [uncultured bacterium]
          Length = 98

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 37/63 (58%), Gaps = 2/63 (3%)

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           E + V I APA +G+AN  L+++++    V ++QV I  G+ SR K +++ +  P+  L 
Sbjct: 36  ERLKVTITAPAVEGKANKYLIKFLAQYFDVPQKQVEITKGNNSRYKSILISD--PKTHLV 93

Query: 121 SLG 123
             G
Sbjct: 94  EFG 96


>gi|396081665|gb|AFN83280.1| hypothetical protein EROM_070290 [Encephalitozoon romaleae SJ-2008]
          Length = 100

 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/76 (27%), Positives = 44/76 (57%), Gaps = 3/76 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +++ A+P ++S  I ++S   + V ++AP  + +AN  L+ ++S +L V +  + I  
Sbjct: 11  LRLSVRARPSARSTGIREISGTEIVVNVNAPPVENKANKELVRFISKMLRVPKSSIKIVI 70

Query: 100 GSKSRDKIVIVEEITP 115
           G+ S  K V   E++P
Sbjct: 71  GTTSMQKTV---EVSP 83


>gi|449136203|ref|ZP_21771596.1| protein containing DUF167 [Rhodopirellula europaea 6C]
 gi|448885103|gb|EMB15562.1| protein containing DUF167 [Rhodopirellula europaea 6C]
          Length = 107

 Score = 41.2 bits (95), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 31  CIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGV 90
           C  L     V +T    P +K  S+  + D A+ V + A  + G+AN A++  ++  LGV
Sbjct: 12  CEGLTWKFRVRVT----PKAKKASVGGLHDGALKVSVHAVPEGGKANKAVIASLAKWLGV 67

Query: 91  KRRQVSIGSGSKSRDKIVIVE 111
            + +V+I +G  SR K+V VE
Sbjct: 68  SKNRVAITAGDTSRLKMVQVE 88


>gi|304313340|ref|YP_003812938.1| hypothetical protein HDN1F_37330 [gamma proteobacterium HdN1]
 gi|301799073|emb|CBL47316.1| Conserved hypothetical protein [gamma proteobacterium HdN1]
          Length = 108

 Score = 41.2 bits (95), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 2/58 (3%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           ++I AP  DG+ANA L+ +++   GV ++QV I  G   R K + +   TP  +   L
Sbjct: 41  IRITAPPVDGQANAHLIRFLAKAFGVPQQQVQIEQGETGRSKRIRIR--TPSKIPQEL 96


>gi|406994076|gb|EKE13137.1| hypothetical protein ACD_13C00090G0003 [uncultured bacterium]
          Length = 72

 Score = 41.2 bits (95), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 3/75 (4%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I + A P +K   I       + V ++AP  +G+AN A+ E ++    VK+ Q+ + S
Sbjct: 1   MKIAVIAHPNAKRPRIEQDLLGTLHVYVNAPPLEGKANRAIEEALAQYFKVKKSQIELTS 60

Query: 100 GSKSRDKIVIVEEIT 114
           G KS++K+    EIT
Sbjct: 61  GQKSKNKLF---EIT 72


>gi|331695278|ref|YP_004331517.1| hypothetical protein Psed_1425 [Pseudonocardia dioxanivorans
           CB1190]
 gi|326949967|gb|AEA23664.1| UPF0235 protein yggU [Pseudonocardia dioxanivorans CB1190]
          Length = 89

 Score = 41.2 bits (95), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 48/86 (55%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +++ +TI  +PG+   S+     +A+ V+++  A DG A AA L  ++   GV R  V +
Sbjct: 4   TTLRLTIRVRPGASRTSVGGAYADALVVRVNERAVDGAATAAALRAVAGAFGVPRGAVRL 63

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLG 123
            SG+ SR KI+ VE   PE +   LG
Sbjct: 64  VSGATSRTKILDVEGGDPERLRELLG 89


>gi|404421607|ref|ZP_11003321.1| hypothetical protein MFORT_14335 [Mycobacterium fortuitum subsp.
           fortuitum DSM 46621]
 gi|403658835|gb|EJZ13533.1| hypothetical protein MFORT_14335 [Mycobacterium fortuitum subsp.
           fortuitum DSM 46621]
          Length = 75

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 41  SITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++ +  KPGS+   + +V D+ A+ + +   A DG+AN A+ + ++  LGV R +V + S
Sbjct: 4   TVVVRVKPGSRKGPLVEVDDDGALTIYVQERAVDGKANDAVTKLLAQHLGVPRSRVELIS 63

Query: 100 GSKSRDK 106
           G+ +R K
Sbjct: 64  GATARLK 70


>gi|330444266|ref|YP_004377252.1| hypothetical protein G5S_0576 [Chlamydophila pecorum E58]
 gi|328807376|gb|AEB41549.1| conserved hypothetical protein [Chlamydophila pecorum E58]
          Length = 100

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 19/69 (27%), Positives = 38/69 (55%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +   P +K   I     E + V++  P + G+AN A++  ++  LG+ +R V++ SG 
Sbjct: 19  LEVQVTPKAKENKIVGFQGEVLKVRVTEPPEKGKANEAVVSLLAKALGIPKRDVTLVSGE 78

Query: 102 KSRDKIVIV 110
            SR K +++
Sbjct: 79  SSRKKKLMI 87


>gi|119493536|ref|ZP_01624202.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106]
 gi|119452653|gb|EAW33834.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106]
          Length = 81

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 38/69 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + +  KP SK  S+    +    + + +P  DG+AN  L++ ++    + + QVSI S
Sbjct: 2   VVLHVKVKPNSKQQSMVKNEEGTYIIHLKSPPIDGKANQELIKILAKQFNIPKSQVSIKS 61

Query: 100 GSKSRDKIV 108
           G  S++K++
Sbjct: 62  GLSSKNKLI 70


>gi|343492473|ref|ZP_08730837.1| hypothetical protein VINI7043_12916 [Vibrio nigripulchritudo ATCC
           27043]
 gi|342827138|gb|EGU61535.1| hypothetical protein VINI7043_12916 [Vibrio nigripulchritudo ATCC
           27043]
          Length = 96

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 2/84 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
            V + ++ +P +    I     + + + I AP  DG+AN+ L++Y++    V + QV I 
Sbjct: 11  EVLLRLYIQPKASRDKIVGKHGDELKIAITAPPVDGKANSHLIKYLAKQFKVPKGQVKIE 70

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            G   R K V +  I+P+N+ N +
Sbjct: 71  KGELGRHKQVRI--ISPQNLPNEI 92


>gi|289547963|ref|YP_003472951.1| hypothetical protein Thal_0188 [Thermocrinis albus DSM 14484]
 gi|289181580|gb|ADC88824.1| protein of unknown function DUF167 [Thermocrinis albus DSM 14484]
          Length = 77

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 2/76 (2%)

Query: 40  VSITIH--AKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + I IH  AKP +    + ++S     V +  P +DG+AN  +LE +S  L V + ++ +
Sbjct: 2   IMIVIHVKAKPKASKEYVKELSPNFYEVAVKEPPEDGKANERILELLSKHLKVPKSRIKL 61

Query: 98  GSGSKSRDKIVIVEEI 113
             G+ SR K+  + ++
Sbjct: 62  LRGTSSRIKVFCISQL 77


>gi|392969675|ref|ZP_10335090.1| protein of unknown function DUF167 [Fibrisoma limi BUZ 3]
 gi|387841869|emb|CCH57148.1| protein of unknown function DUF167 [Fibrisoma limi BUZ 3]
          Length = 87

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 39/61 (63%), Gaps = 4/61 (6%)

Query: 42  ITIHAKPGSKSCSITDVSDEA--VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +  KPGSK   +    DEA  +  +I APA+DG+ANA L+E ++  LG+ + +V+I +
Sbjct: 3   VHLKVKPGSKVDQL--FYDEAGLLIAKIKAPAQDGKANAYLVELLAKQLGLAKSKVTIVA 60

Query: 100 G 100
           G
Sbjct: 61  G 61


>gi|209694133|ref|YP_002262061.1| hypothetical protein VSAL_I0540 [Aliivibrio salmonicida LFI1238]
 gi|208008084|emb|CAQ78225.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238]
          Length = 83

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 36/65 (55%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  E + V I AP  DG+ANA L++Y S +  V + ++++  G 
Sbjct: 12  LRLYLQPKASRDQIVGIHGEELKVAITAPPVDGKANAHLIKYFSKLFKVAKGKITVEKGE 71

Query: 102 KSRDK 106
            +R K
Sbjct: 72  LNRHK 76


>gi|22299537|ref|NP_682784.1| hypothetical protein tsr1994 [Thermosynechococcus elongatus BP-1]
 gi|29839707|sp|Q8DHG5.1|Y1994_THEEB RecName: Full=UPF0235 protein tsr1994
 gi|22295720|dbj|BAC09546.1| tsr1994 [Thermosynechococcus elongatus BP-1]
          Length = 74

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 18/62 (29%), Positives = 35/62 (56%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           KP ++  S++      + V + APA DG+AN  L+  +++  GV + ++ +  G  SR K
Sbjct: 9   KPNARQASVSITPAGQLLVTVRAPASDGQANQELIALLAAYFGVPKSRIQLVKGHTSRHK 68

Query: 107 IV 108
           ++
Sbjct: 69  VI 70


>gi|406965462|gb|EKD91091.1| hypothetical protein ACD_30C00047G0016 [uncultured bacterium]
          Length = 93

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 38/69 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I++ AKP SK   I  + D +  V +   AK+G AN A+++ +S    + + ++ I +
Sbjct: 1   MKISVSAKPKSKKEFIKKLKDFSYSVAVKEEAKEGRANEAVIKALSEYFNIPKSEIKIVT 60

Query: 100 GSKSRDKIV 108
           G   + KI+
Sbjct: 61  GLSFKQKII 69


>gi|313223261|emb|CBY43439.1| unnamed protein product [Oikopleura dioica]
          Length = 180

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 30/89 (33%), Positives = 49/89 (55%), Gaps = 12/89 (13%)

Query: 33  RLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQID----------APAKDGEANAALLE 82
           RL   + V +  H  P SK+ SI+   D+++   +           AP   G+AN A+L+
Sbjct: 72  RLNRMNHVLLNCHVTPKSKTPSIS--VDDSINDTLSVVHVVNVRVSAPPDKGKANKAVLK 129

Query: 83  YMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
            ++  LGVK  ++SI SG+ SR K+V++E
Sbjct: 130 SLADKLGVKPSKLSIQSGTTSRSKVVLLE 158


>gi|386281996|ref|ZP_10059655.1| UPF0235 protein yggU [Escherichia sp. 4_1_40B]
 gi|386121187|gb|EIG69805.1| UPF0235 protein yggU [Escherichia sp. 4_1_40B]
          Length = 96

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++  +  V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKLFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|443292991|ref|ZP_21032085.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
 gi|385884201|emb|CCH20236.1| conserved hypothetical protein [Micromonospora lupini str. Lupac
           08]
          Length = 103

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 35  VPPS-SVSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLG 89
           VPP  + ++ +  KPG+   ++    D   G    + + APA DG A  A    ++  LG
Sbjct: 3   VPPQDAFTVAVRVKPGAPRNTVGGRFDGPHGPALVIAVHAPAVDGRATEAARRALADALG 62

Query: 90  VKRRQVSIGSGSKSRDKIVIVEEITP--ENVLNSLGKASSS 128
           V+   VS+ +G+ SRDK+ +V+   P   +VL+ L   S+ 
Sbjct: 63  VRAATVSLRTGATSRDKLFLVDRPGPGLTDVLHRLRDGSAG 103


>gi|434402268|ref|YP_007145153.1| hypothetical protein Cylst_0095 [Cylindrospermum stagnale PCC 7417]
 gi|428256523|gb|AFZ22473.1| hypothetical protein Cylst_0095 [Cylindrospermum stagnale PCC 7417]
          Length = 75

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 35/65 (53%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I +  D ++ V + +P  DG+AN  L++ ++    V +  V I SG  S
Sbjct: 5   VKVKPNSKKQKIAEQLDGSLTVHLKSPPVDGKANEELIKLLAEKFAVPKSYVRIKSGLSS 64

Query: 104 RDKIV 108
           R K+V
Sbjct: 65  RQKLV 69


>gi|429101687|ref|ZP_19163661.1| UPF0235 protein VC0458 [Cronobacter turicensis 564]
 gi|426288336|emb|CCJ89774.1| UPF0235 protein VC0458 [Cronobacter turicensis 564]
          Length = 96

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L++Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIIGLHGDELKVAITAPPVDGQANAHLVKYLAKQFRVAKSQVMIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V + E  P+ +
Sbjct: 73  LGRHKQVKITE--PQQI 87


>gi|157875650|ref|XP_001686209.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68129283|emb|CAJ07823.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 204

 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 37/62 (59%), Gaps = 5/62 (8%)

Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          C+  V P    +T+HAKP +++ +    +T V  EA  ++I AP  +G+ANA LL Y+  
Sbjct: 14 CLVQVQPGKYLLTVHAKPSARASAFAAPLTPVLTEA-DLRIAAPPVEGQANAELLRYLDE 72

Query: 87 VL 88
          ++
Sbjct: 73 LI 74


>gi|189485621|ref|YP_001956562.1| conserved hypothetical protein [uncultured Termite group 1
           bacterium phylotype Rs-D17]
 gi|254806541|sp|B1GYK8.1|Y618_UNCTG RecName: Full=UPF0235 protein TGRD_618
 gi|170287580|dbj|BAG14101.1| conserved hypothetical protein [uncultured Termite group 1
           bacterium phylotype Rs-D17]
          Length = 87

 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 33/67 (49%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I +   P SK   +       + V+I APA +G AN  L +++S    VKR  + +  G 
Sbjct: 3   IKVRVIPNSKRNEVVSRVGSILRVKITAPAIEGRANEELCDFLSDFFDVKRSMIFLRKGE 62

Query: 102 KSRDKIV 108
           + R+K +
Sbjct: 63  RGREKTI 69


>gi|312116202|ref|YP_004013798.1| hypothetical protein Rvan_3519 [Rhodomicrobium vannielii ATCC
           17100]
 gi|311221331|gb|ADP72699.1| protein of unknown function DUF167 [Rhodomicrobium vannielii ATCC
           17100]
          Length = 123

 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 7/88 (7%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDV----SDEAVGVQIDAPAKDGEANAALLE 82
           +LP+  +   P  V + +   P + S  +  V        +   +  P +DG+ANAAL+ 
Sbjct: 12  DLPATAK---PDGVLLHVRLTPKASSARVAGVEAFDGKPVLKAYVTTPPEDGKANAALVV 68

Query: 83  YMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
            ++S LGV +  VS+ +G KSR K V V
Sbjct: 69  LVASWLGVPKSSVSMAAGQKSRLKTVAV 96


>gi|256394873|ref|YP_003116437.1| hypothetical protein Caci_5738 [Catenulispora acidiphila DSM 44928]
 gi|256361099|gb|ACU74596.1| protein of unknown function DUF167 [Catenulispora acidiphila DSM
           44928]
          Length = 92

 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 42/80 (52%), Gaps = 1/80 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           + I I  KPGS    +     E ++ V + A A DG A  A L  ++  LG+ RR V + 
Sbjct: 4   LRIPIRVKPGSSRTKVGGRHGERSLIVAVTAKAVDGAATEAALRAVAEALGMPRRAVQLI 63

Query: 99  SGSKSRDKIVIVEEITPENV 118
           +G+ SRDK++ V    P+ V
Sbjct: 64  TGATSRDKVLGVSSEDPDTV 83


>gi|121712776|ref|XP_001273999.1| YggU family protein [Aspergillus clavatus NRRL 1]
 gi|119402152|gb|EAW12573.1| YggU family protein [Aspergillus clavatus NRRL 1]
          Length = 133

 Score = 40.8 bits (94), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 39  SVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           S+ I  H KP + S    +  +  E V V + A  ++GEAN A+    + +  V +    
Sbjct: 31  SLQIACHVKPNASSSREGVVAIGPEKVDVCVAAVPRNGEANIAVARVFAQIFDVPKSNAE 90

Query: 97  IGSGSKSRDKIVIVEEI 113
           +  G KSRDKI+ + E+
Sbjct: 91  VIRGLKSRDKILCITEL 107


>gi|384917752|ref|ZP_10017862.1| hypothetical protein C357_01805 [Citreicella sp. 357]
 gi|384468393|gb|EIE52828.1| hypothetical protein C357_01805 [Citreicella sp. 357]
          Length = 86

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 44/80 (55%), Gaps = 1/80 (1%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P    LV P  V++ +   P +   +I    D  + V + A  +DG+ANAA+ + ++  L
Sbjct: 4   PPLAHLVQPG-VTLALRVTPKASRNAIRAEDDGTLRVFVTAVPEDGKANAAVHKLLARAL 62

Query: 89  GVKRRQVSIGSGSKSRDKIV 108
           GV + ++++  G+ SRDK+ 
Sbjct: 63  GVPKSRLTLVRGATSRDKLF 82


>gi|421098832|ref|ZP_15559494.1| hypothetical protein LEP1GSC125_2558 [Leptospira borgpetersenii
           str. 200901122]
 gi|410798093|gb|EKS00191.1| hypothetical protein LEP1GSC125_2558 [Leptospira borgpetersenii
           str. 200901122]
          Length = 73

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 38/69 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  T+  KP SK        D ++ + +   A +G+AN A++E +S  + + +R++ I S
Sbjct: 1   MKFTVRVKPNSKKTFFRKEDDGSITIAVRELALEGKANEAVIETISREMEIPKRKIRIVS 60

Query: 100 GSKSRDKIV 108
           G KS+ K +
Sbjct: 61  GEKSKRKTI 69


>gi|297617010|ref|YP_003702169.1| hypothetical protein Slip_0824 [Syntrophothermus lipocalidus DSM
           12680]
 gi|297144847|gb|ADI01604.1| protein of unknown function DUF167 [Syntrophothermus lipocalidus
           DSM 12680]
          Length = 96

 Score = 40.8 bits (94), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 42/74 (56%), Gaps = 1/74 (1%)

Query: 50  SKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVI 109
           SK+  I +V + AV +++ A   +GEAN AL+ +++ + GV R+ V+I  G  SR K+V 
Sbjct: 21  SKNEVIGEV-EGAVKIKLTAAPLEGEANQALISFLAKISGVARKNVTIIKGETSRHKLVE 79

Query: 110 VEEITPENVLNSLG 123
           +  I     L   G
Sbjct: 80  ITGIDKTEFLKRTG 93


>gi|389878956|ref|YP_006372521.1| hypothetical protein TMO_3099 [Tistrella mobilis KA081020-065]
 gi|388529740|gb|AFK54937.1| hypothetical protein TMO_3099 [Tistrella mobilis KA081020-065]
          Length = 116

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALLEYMSSVLGVK 91
           P++VSI +  +P ++  +I    D+  G     V++ A  +DG+ANAA+   ++  LG+ 
Sbjct: 18  PANVSIFVKVQPKARKAAIEPARDDGHGRMRLQVKVTAAPEDGKANAAVERLVAEALGIA 77

Query: 92  RRQVSIGSGSKSRDKIVIVE 111
           R +V +  G   R K + +E
Sbjct: 78  RNRVRVVQGVTDRAKRLGIE 97


>gi|126178968|ref|YP_001046933.1| hypothetical protein Memar_1018 [Methanoculleus marisnigri JR1]
 gi|125861762|gb|ABN56951.1| protein of unknown function DUF167 [Methanoculleus marisnigri JR1]
          Length = 105

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 3/89 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDE---AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           V+IT+    G+K  S     +E   ++  QI APA  G+AN A+ + ++   GV R  VS
Sbjct: 15  VTITLDVTAGAKRSSFPAGYNEWRKSIKCQIAAPAVGGKANRAITDLLAETFGVPRADVS 74

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           I +G  S  K V +  ++  + L  L  A
Sbjct: 75  IITGHTSSSKTVAIAGVSKSHALVRLNAA 103


>gi|359689446|ref|ZP_09259447.1| hypothetical protein LlicsVM_13712 [Leptospira licerasiae serovar
           Varillal str. MMD0835]
 gi|418749592|ref|ZP_13305880.1| TIGR00251 family protein [Leptospira licerasiae str. MMD4847]
 gi|418759038|ref|ZP_13315218.1| TIGR00251 family protein [Leptospira licerasiae serovar Varillal
           str. VAR 010]
 gi|384113529|gb|EID99793.1| TIGR00251 family protein [Leptospira licerasiae serovar Varillal
           str. VAR 010]
 gi|404274477|gb|EJZ41795.1| TIGR00251 family protein [Leptospira licerasiae str. MMD4847]
          Length = 79

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 39/73 (53%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S V I +  KP SK  S+T   D    V +  PA +G+AN A++  ++  LG+   +V I
Sbjct: 5   SPVKIQVRVKPNSKKPSVTKAEDGVWIVAVKEPATEGKANDAVVRAVAEELGLAPSKVKI 64

Query: 98  GSGSKSRDKIVIV 110
             G K + K++ V
Sbjct: 65  LRGEKGKLKLLEV 77


>gi|223937408|ref|ZP_03629313.1| protein of unknown function DUF167 [bacterium Ellin514]
 gi|223893959|gb|EEF60415.1| protein of unknown function DUF167 [bacterium Ellin514]
          Length = 100

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/94 (25%), Positives = 47/94 (50%), Gaps = 1/94 (1%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P  +R++P   + ++I  +P + +  I +     + +++ AP  D  AN ALL  ++ +L
Sbjct: 4   PGFLRILP-DGLLLSIKLQPRASANQIGEPLGNELRIKVTAPPVDAAANEALLRLLADIL 62

Query: 89  GVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
              R +V +  G  SR K++ +  +    VL  L
Sbjct: 63  KCPRGKVELVRGHTSRHKVIKLHGLEANAVLTRL 96


>gi|116754806|ref|YP_843924.1| hypothetical protein Mthe_1512 [Methanosaeta thermophila PT]
 gi|116754813|ref|YP_843931.1| hypothetical protein Mthe_1519 [Methanosaeta thermophila PT]
 gi|116754820|ref|YP_843938.1| hypothetical protein Mthe_1526 [Methanosaeta thermophila PT]
 gi|116666257|gb|ABK15284.1| protein of unknown function DUF167 [Methanosaeta thermophila PT]
 gi|116666264|gb|ABK15291.1| protein of unknown function DUF167 [Methanosaeta thermophila PT]
 gi|116666271|gb|ABK15298.1| protein of unknown function DUF167 [Methanosaeta thermophila PT]
          Length = 122

 Score = 40.8 bits (94), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 48/92 (52%), Gaps = 3/92 (3%)

Query: 37  PSSVSITIHAKPGSKSCSI---TDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P  V + I   PGS    +    +    ++ V++   A  G+AN  LL+ +S   G+ + 
Sbjct: 29  PDGVVLHIDVIPGSSELVVPAGFNAWRGSIEVRLTERADKGKANRQLLQEISRAFGLTQG 88

Query: 94  QVSIGSGSKSRDKIVIVEEITPENVLNSLGKA 125
            V I SG +S+ K+V++  I  E+VL +L ++
Sbjct: 89  DVEIMSGHRSQRKVVLMRGIDAESVLAALRRS 120


>gi|433608631|ref|YP_007041000.1| hypothetical protein BN6_68910 [Saccharothrix espanaensis DSM
           44229]
 gi|407886484|emb|CCH34127.1| hypothetical protein BN6_68910 [Saccharothrix espanaensis DSM
           44229]
          Length = 93

 Score = 40.8 bits (94), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 2/89 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITD-VSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P    + +  KP +K  ++       A+ V + APA +G+AN A+   ++   GV+R+ V
Sbjct: 2   PEPFRLGVRVKPSAKRDAVGGRWGPLALIVAVAAPAVEGKANEAVRRAVAKAFGVRRQDV 61

Query: 96  SIGSGSKSRDKIVIVEEITPENVLNSLGK 124
            I  G + RDK+V ++   P++    L +
Sbjct: 62  RIVHGERGRDKVVELDP-APDDAAQRLAR 89


>gi|288933577|ref|YP_003437636.1| hypothetical protein Kvar_0694 [Klebsiella variicola At-22]
 gi|290511356|ref|ZP_06550725.1| conserved hypothetical protein [Klebsiella sp. 1_1_55]
 gi|288888306|gb|ADC56624.1| protein of unknown function DUF167 [Klebsiella variicola At-22]
 gi|289776349|gb|EFD84348.1| conserved hypothetical protein [Klebsiella sp. 1_1_55]
          Length = 96

 Score = 40.4 bits (93), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  V  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGVHGDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQVLIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  I P+ +
Sbjct: 73  LGRHKQVKI--IAPQQI 87


>gi|429769490|ref|ZP_19301596.1| hypothetical protein HMPREF0185_01881 [Brevundimonas diminuta
           470-4]
 gi|429186915|gb|EKY27843.1| hypothetical protein HMPREF0185_01881 [Brevundimonas diminuta
           470-4]
          Length = 93

 Score = 40.4 bits (93), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 35/58 (60%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           V++ A   +GEANAAL  +++  LGV +R V++  G +SR K++ V  +   +V   L
Sbjct: 33  VRVRAQPVEGEANAALTAFLAKALGVPKRDVTLARGGQSRLKMIEVAGLDDADVRTRL 90


>gi|381397003|ref|ZP_09922417.1| protein of unknown function DUF167 [Microbacterium laevaniformans
           OR221]
 gi|380775962|gb|EIC09252.1| protein of unknown function DUF167 [Microbacterium laevaniformans
           OR221]
          Length = 73

 Score = 40.4 bits (93), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 38/72 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  ++H KPGS+   + +   E + V +   A DG AN  ++  +++  GV  R V+I  
Sbjct: 1   MQFSVHVKPGSRKGPLVEEDAEGLTVFVRERAVDGAANDGVVRALAAHFGVPPRDVTILR 60

Query: 100 GSKSRDKIVIVE 111
           G  SR K V V+
Sbjct: 61  GHGSRHKRVEVD 72


>gi|307611594|emb|CBX01276.1| hypothetical protein LPW_29741 [Legionella pneumophila 130b]
          Length = 70

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 15/51 (29%), Positives = 31/51 (60%)

Query: 58  VSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           +SD+++ + + A  ++GEAN  LL ++S    + + Q+ +  G  SR K++
Sbjct: 3   ISDDSLHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLI 53


>gi|300871287|ref|YP_003786160.1| hypothetical protein BP951000_1678 [Brachyspira pilosicoli 95/1000]
 gi|300688988|gb|ADK31659.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000]
          Length = 83

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 31/44 (70%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           ++I A A DG+AN A+++++SS L +K++ V I  G KS  K++
Sbjct: 25  IRISAKAIDGKANKAIIDFLSSELNLKKKDVEILKGEKSSKKLI 68


>gi|158520306|ref|YP_001528176.1| hypothetical protein Dole_0289 [Desulfococcus oleovorans Hxd3]
 gi|226701656|sp|A8ZS81.1|Y289_DESOH RecName: Full=UPF0235 protein Dole_0289
 gi|158509132|gb|ABW66099.1| protein of unknown function DUF167 [Desulfococcus oleovorans Hxd3]
          Length = 103

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 35/63 (55%)

Query: 48  PGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKI 107
           P S    +    D A+ ++I AP  DG AN   + ++S +LG+ +  ++I +G+ S+ K 
Sbjct: 19  PRSSKNMVVGAHDNALKIKITAPPVDGRANEMCVAFLSRLLGIPKTSITIAAGAASKRKE 78

Query: 108 VIV 110
           V +
Sbjct: 79  VCL 81


>gi|427707793|ref|YP_007050170.1| hypothetical protein Nos7107_2411 [Nostoc sp. PCC 7107]
 gi|427360298|gb|AFY43020.1| UPF0235 protein yggU [Nostoc sp. PCC 7107]
          Length = 75

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 36/65 (55%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I +  D ++ V + +P  DG+AN  L++ ++    V +  ++I SG  S
Sbjct: 5   VKVKPNSKQQKIEEQVDGSLTVHLKSPPVDGKANEELIKLLADKFDVPKSYITIKSGLAS 64

Query: 104 RDKIV 108
           R K++
Sbjct: 65  RQKLI 69


>gi|152971905|ref|YP_001337014.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp.
           pneumoniae MGH 78578]
 gi|206579395|ref|YP_002236595.1| hypothetical protein KPK_0722 [Klebsiella pneumoniae 342]
 gi|238896484|ref|YP_002921222.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae subsp.
           pneumoniae NTUH-K2044]
 gi|262042605|ref|ZP_06015761.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|329998593|ref|ZP_08303177.1| TIGR00251 family protein [Klebsiella sp. MS 92-3]
 gi|365140448|ref|ZP_09346503.1| UPF0235 protein [Klebsiella sp. 4_1_44FAA]
 gi|378980616|ref|YP_005228757.1| hypothetical protein KPHS_44570 [Klebsiella pneumoniae subsp.
           pneumoniae HS11286]
 gi|386036535|ref|YP_005956448.1| hypothetical protein KPN2242_20010 [Klebsiella pneumoniae KCTC
           2242]
 gi|402779017|ref|YP_006634563.1| hypothetical protein A79E_0731 [Klebsiella pneumoniae subsp.
           pneumoniae 1084]
 gi|419974909|ref|ZP_14490324.1| hypothetical protein KPNIH1_16209 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH1]
 gi|419979016|ref|ZP_14494310.1| hypothetical protein KPNIH2_07991 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH2]
 gi|419985956|ref|ZP_14501093.1| hypothetical protein KPNIH4_13829 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH4]
 gi|419990782|ref|ZP_14505752.1| hypothetical protein KPNIH5_08952 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH5]
 gi|419996378|ref|ZP_14511180.1| hypothetical protein KPNIH6_07986 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH6]
 gi|420002251|ref|ZP_14516903.1| hypothetical protein KPNIH7_08516 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH7]
 gi|420008269|ref|ZP_14522759.1| hypothetical protein KPNIH8_09697 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH8]
 gi|420014387|ref|ZP_14528694.1| hypothetical protein KPNIH9_11234 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH9]
 gi|420019546|ref|ZP_14533738.1| hypothetical protein KPNIH10_08443 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH10]
 gi|420025408|ref|ZP_14539417.1| hypothetical protein KPNIH11_08712 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH11]
 gi|420030980|ref|ZP_14544804.1| hypothetical protein KPNIH12_07821 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH12]
 gi|420036691|ref|ZP_14550350.1| hypothetical protein KPNIH14_07883 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH14]
 gi|420042783|ref|ZP_14556275.1| hypothetical protein KPNIH16_09784 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH16]
 gi|420048444|ref|ZP_14561757.1| hypothetical protein KPNIH17_09284 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH17]
 gi|420054208|ref|ZP_14567382.1| hypothetical protein KPNIH18_09618 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH18]
 gi|420059710|ref|ZP_14572715.1| hypothetical protein KPNIH19_08477 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH19]
 gi|420065481|ref|ZP_14578286.1| hypothetical protein KPNIH20_08669 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH20]
 gi|420073381|ref|ZP_14586008.1| hypothetical protein KPNIH21_19594 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH21]
 gi|420079799|ref|ZP_14592238.1| hypothetical protein KPNIH22_22595 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH22]
 gi|420084933|ref|ZP_14597177.1| hypothetical protein KPNIH23_19620 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH23]
 gi|421913266|ref|ZP_16342956.1| UPF0235 protein VC0458 [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K26BO]
 gi|421917700|ref|ZP_16347249.1| UPF0235 protein VC0458 [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K28BO]
 gi|424832375|ref|ZP_18257103.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae
           subsp. pneumoniae Ecl8]
 gi|424931819|ref|ZP_18350191.1| UPF0235 protein [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
 gi|425074931|ref|ZP_18478034.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW1]
 gi|425083178|ref|ZP_18486275.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW2]
 gi|425085567|ref|ZP_18488660.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW3]
 gi|425093261|ref|ZP_18496345.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW5]
 gi|428148775|ref|ZP_18996623.1| UPF0235 protein VC0458 [Klebsiella pneumoniae subsp. pneumoniae
           ST512-K30BO]
 gi|428935142|ref|ZP_19008632.1| hypothetical protein MTE1_20052 [Klebsiella pneumoniae JHCK1]
 gi|428938005|ref|ZP_19011138.1| hypothetical protein MTE2_00785 [Klebsiella pneumoniae VA360]
 gi|449049949|ref|ZP_21731545.1| hypothetical protein G057_07177 [Klebsiella pneumoniae hvKP1]
 gi|166990767|sp|A6TDW3.1|Y3323_KLEP7 RecName: Full=UPF0235 protein KPN78578_33230
 gi|226708043|sp|B5XU96.1|Y722_KLEP3 RecName: Full=UPF0235 protein KPK_0722
 gi|150956754|gb|ABR78784.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp.
           pneumoniae MGH 78578]
 gi|206568453|gb|ACI10229.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae
           342]
 gi|238548804|dbj|BAH65155.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae subsp.
           pneumoniae NTUH-K2044]
 gi|259040039|gb|EEW41154.1| conserved hypothetical protein [Klebsiella pneumoniae subsp.
           rhinoscleromatis ATCC 13884]
 gi|328538612|gb|EGF64712.1| TIGR00251 family protein [Klebsiella sp. MS 92-3]
 gi|339763663|gb|AEJ99883.1| hypothetical protein KPN2242_20010 [Klebsiella pneumoniae KCTC
           2242]
 gi|363653764|gb|EHL92713.1| UPF0235 protein [Klebsiella sp. 4_1_44FAA]
 gi|364520027|gb|AEW63155.1| hypothetical protein KPHS_44570 [Klebsiella pneumoniae subsp.
           pneumoniae HS11286]
 gi|397344394|gb|EJJ37528.1| hypothetical protein KPNIH1_16209 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH1]
 gi|397349836|gb|EJJ42928.1| hypothetical protein KPNIH4_13829 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH4]
 gi|397350594|gb|EJJ43682.1| hypothetical protein KPNIH2_07991 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH2]
 gi|397365067|gb|EJJ57693.1| hypothetical protein KPNIH6_07986 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH6]
 gi|397366026|gb|EJJ58646.1| hypothetical protein KPNIH5_08952 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH5]
 gi|397371087|gb|EJJ63630.1| hypothetical protein KPNIH7_08516 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH7]
 gi|397378488|gb|EJJ70700.1| hypothetical protein KPNIH9_11234 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH9]
 gi|397383322|gb|EJJ75463.1| hypothetical protein KPNIH8_09697 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH8]
 gi|397388759|gb|EJJ80718.1| hypothetical protein KPNIH10_08443 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH10]
 gi|397397412|gb|EJJ89088.1| hypothetical protein KPNIH11_08712 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH11]
 gi|397401213|gb|EJJ92845.1| hypothetical protein KPNIH12_07821 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH12]
 gi|397406517|gb|EJJ97937.1| hypothetical protein KPNIH14_07883 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH14]
 gi|397414983|gb|EJK06174.1| hypothetical protein KPNIH17_09284 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH17]
 gi|397415830|gb|EJK07010.1| hypothetical protein KPNIH16_09784 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH16]
 gi|397423026|gb|EJK13967.1| hypothetical protein KPNIH18_09618 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH18]
 gi|397431353|gb|EJK22029.1| hypothetical protein KPNIH20_08669 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH20]
 gi|397435051|gb|EJK25677.1| hypothetical protein KPNIH19_08477 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH19]
 gi|397438019|gb|EJK28549.1| hypothetical protein KPNIH21_19594 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH21]
 gi|397443275|gb|EJK33601.1| hypothetical protein KPNIH22_22595 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH22]
 gi|397449720|gb|EJK39846.1| hypothetical protein KPNIH23_19620 [Klebsiella pneumoniae subsp.
           pneumoniae KPNIH23]
 gi|402539962|gb|AFQ64111.1| hypothetical protein A79E_0731 [Klebsiella pneumoniae subsp.
           pneumoniae 1084]
 gi|405595134|gb|EKB68524.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW1]
 gi|405599497|gb|EKB72673.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW2]
 gi|405607599|gb|EKB80568.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW3]
 gi|405610806|gb|EKB83595.1| TIGR00251 family protein [Klebsiella pneumoniae subsp. pneumoniae
           WGLW5]
 gi|407806006|gb|EKF77257.1| UPF0235 protein [Klebsiella pneumoniae subsp. pneumoniae KpQ3]
 gi|410112806|emb|CCM85581.1| UPF0235 protein VC0458 [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K26BO]
 gi|410119985|emb|CCM89874.1| UPF0235 protein VC0458 [Klebsiella pneumoniae subsp. pneumoniae
           ST258-K28BO]
 gi|414709816|emb|CCN31520.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae
           subsp. pneumoniae Ecl8]
 gi|426301223|gb|EKV63471.1| hypothetical protein MTE1_20052 [Klebsiella pneumoniae JHCK1]
 gi|426306426|gb|EKV68529.1| hypothetical protein MTE2_00785 [Klebsiella pneumoniae VA360]
 gi|427541201|emb|CCM92761.1| UPF0235 protein VC0458 [Klebsiella pneumoniae subsp. pneumoniae
           ST512-K30BO]
 gi|448876692|gb|EMB11675.1| hypothetical protein G057_07177 [Klebsiella pneumoniae hvKP1]
          Length = 96

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  V  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGVHGDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQVLIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  I P+ +
Sbjct: 73  LGRHKQVKI--IAPQQI 87


>gi|90412037|ref|ZP_01220044.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK]
 gi|90327015|gb|EAS43394.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK]
          Length = 97

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +  E + + I AP  DG+ANA L ++++    V + QV I  G 
Sbjct: 15  IRLYIQPKASRDQIVGLHGEELKIAITAPPVDGKANAHLSKFLAKQFRVAKSQVLIEKGM 74

Query: 102 KSRDKIVIVE---EITP 115
           + R K V +E   EI P
Sbjct: 75  QGRHKQVRIESPREIPP 91


>gi|320539543|ref|ZP_08039210.1| putative conserved protein [Serratia symbiotica str. Tucson]
 gi|320030396|gb|EFW12408.1| putative conserved protein [Serratia symbiotica str. Tucson]
          Length = 97

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 36/67 (53%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I ++ +P +    I  +  + V V I AP  DG+ANA L+++++    V R  V+I  
Sbjct: 11  LAIRLYIQPKASRDKIIGLHGDEVKVAITAPPVDGQANAHLIKFLAKQFKVARSNVTIEK 70

Query: 100 GSKSRDK 106
           G   R K
Sbjct: 71  GELGRHK 77


>gi|254784473|ref|YP_003071901.1| hypothetical protein TERTU_0220 [Teredinibacter turnerae T7901]
 gi|237683524|gb|ACR10788.1| conserved hypothetical protein [Teredinibacter turnerae T7901]
          Length = 99

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 19/66 (28%), Positives = 36/66 (54%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P + S +   +  + + ++I AP  DG+ANA L++Y++   GV +  + I  G  SR K
Sbjct: 19  QPKASSDAFAGLQADRLKIRITAPPTDGKANAHLVKYLARQFGVAKSDIEIVRGQLSRQK 78

Query: 107 IVIVEE 112
            + +  
Sbjct: 79  TLQINH 84


>gi|406905541|gb|EKD46975.1| hypothetical protein ACD_66C00270G0002 [uncultured bacterium]
          Length = 115

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 21/82 (25%), Positives = 42/82 (51%), Gaps = 10/82 (12%)

Query: 42  ITIHAKPGSKSCSIT----------DVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVK 91
           + +  +PG+K   I           +     + ++I +P  +G+AN  LL Y+S+ LG+ 
Sbjct: 17  LFVKVQPGAKRNEIVGPVQGRTSPANPEGHYLKMKIKSPPVEGKANEVLLNYLSASLGLT 76

Query: 92  RRQVSIGSGSKSRDKIVIVEEI 113
              + I +G  SR+K +++ E+
Sbjct: 77  LSSLQIQNGHSSREKTILISEL 98


>gi|332162814|ref|YP_004299391.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|386309781|ref|YP_006005837.1| hypothetical protein [Yersinia enterocolitica subsp. palearctica
           Y11]
 gi|418240153|ref|ZP_12866695.1| hypothetical protein IOK_01829 [Yersinia enterocolitica subsp.
           palearctica PhRBD_Ye1]
 gi|433551130|ref|ZP_20507173.1| UPF0235 protein VC0458 [Yersinia enterocolitica IP 10393]
 gi|318604345|emb|CBY25843.1| upf0235 protein VC0458 [Yersinia enterocolitica subsp. palearctica
           Y11]
 gi|325667044|gb|ADZ43688.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp.
           palearctica 105.5R(r)]
 gi|330859004|emb|CBX69362.1| UPF0235 protein YE3436 [Yersinia enterocolitica W22703]
 gi|351780413|gb|EHB22487.1| hypothetical protein IOK_01829 [Yersinia enterocolitica subsp.
           palearctica PhRBD_Ye1]
 gi|431788229|emb|CCO70213.1| UPF0235 protein VC0458 [Yersinia enterocolitica IP 10393]
          Length = 96

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 3/80 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  
Sbjct: 11  LALRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQVIIEK 70

Query: 100 GSKSRD---KIVIVEEITPE 116
           G   R    KIV  ++I PE
Sbjct: 71  GELGRHKQIKIVNPQQIPPE 90


>gi|237807736|ref|YP_002892176.1| hypothetical protein Tola_0962 [Tolumonas auensis DSM 9187]
 gi|259710178|sp|C4LCM6.1|Y962_TOLAT RecName: Full=UPF0235 protein Tola_0962
 gi|237499997|gb|ACQ92590.1| protein of unknown function DUF167 [Tolumonas auensis DSM 9187]
          Length = 96

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 35/69 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + ++ +P +    I     E + + I AP  DG+ANA L+++++    V + Q+ I  
Sbjct: 11  VWLDVYIQPKASRDQIQGWHGEELKIAITAPPVDGQANAHLIKFLAKQFKVAKSQIVIHK 70

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 71  GELGRHKTV 79


>gi|123443630|ref|YP_001007602.1| hypothetical protein YE3436 [Yersinia enterocolitica subsp.
           enterocolitica 8081]
 gi|420259863|ref|ZP_14762556.1| hypothetical protein YWA314_13887 [Yersinia enterocolitica subsp.
           enterocolitica WA-314]
 gi|166232591|sp|A1JPU6.1|Y3436_YERE8 RecName: Full=UPF0235 protein YE3436
 gi|122090591|emb|CAL13460.1| conserved hypothetical protein [Yersinia enterocolitica subsp.
           enterocolitica 8081]
 gi|404512604|gb|EKA26446.1| hypothetical protein YWA314_13887 [Yersinia enterocolitica subsp.
           enterocolitica WA-314]
          Length = 96

 Score = 40.4 bits (93), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 3/80 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  
Sbjct: 11  LALRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQVIIEK 70

Query: 100 GSKSRD---KIVIVEEITPE 116
           G   R    KIV  ++I PE
Sbjct: 71  GELGRHKQIKIVNPQQIPPE 90


>gi|38505546|ref|NP_942167.1| hypothetical protein ssr5011 [Synechocystis sp. PCC 6803]
 gi|451816554|ref|YP_007459757.1| hypothetical protein MYO_2120 [Synechocystis sp. PCC 6803]
 gi|38423570|dbj|BAD01781.1| ssr5011 [Synechocystis sp. PCC 6803]
 gi|451782472|gb|AGF53438.1| hypothetical protein MYO_2120 [Synechocystis sp. PCC 6803]
          Length = 73

 Score = 40.4 bits (93), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 38/69 (55%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP +K   +    D ++ + + +P  DG+AN  L++ ++    V ++ + I SG+ S
Sbjct: 5   VKVKPNAKQSKVVYGDDGSLIIHVKSPPVDGKANQELIKLLAKEFNVSQQSIKIKSGAGS 64

Query: 104 RDKIVIVEE 112
           R KIV + E
Sbjct: 65  RQKIVEINE 73


>gi|428310870|ref|YP_007121847.1| hypothetical protein Mic7113_2653 [Microcoleus sp. PCC 7113]
 gi|428252482|gb|AFZ18441.1| hypothetical protein Mic7113_2653 [Microcoleus sp. PCC 7113]
          Length = 75

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 39/68 (57%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP +K  +I +++D ++ V + +   DG+AN  L+E ++    V + Q+ I SG  S
Sbjct: 5   VKVKPNAKKQAIEELADGSLTVYLKSSPVDGKANKELIELLADQFEVPKSQIRIKSGLSS 64

Query: 104 RDKIVIVE 111
           R K V ++
Sbjct: 65  RTKWVEID 72


>gi|404476883|ref|YP_006708314.1| hypothetical protein B2904_orf2238 [Brachyspira pilosicoli B2904]
 gi|434381289|ref|YP_006703072.1| UPF0235 protein yggU [Brachyspira pilosicoli WesB]
 gi|404429938|emb|CCG55984.1| UPF0235 protein yggU [Brachyspira pilosicoli WesB]
 gi|404438372|gb|AFR71566.1| hypothetical protein B2904_orf2238 [Brachyspira pilosicoli B2904]
          Length = 102

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 31/44 (70%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           ++I A A DG+AN A+++++SS L +K++ V I  G KS  K++
Sbjct: 44  IRISAKAIDGKANKAIIDFLSSELNLKKKDVEILKGEKSSKKLI 87


>gi|156932605|ref|YP_001436521.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC BAA-894]
 gi|260599282|ref|YP_003211853.1| hypothetical protein CTU_34900 [Cronobacter turicensis z3032]
 gi|389839661|ref|YP_006341745.1| hypothetical protein ES15_0661 [Cronobacter sakazakii ES15]
 gi|417789840|ref|ZP_12437448.1| hypothetical protein CSE899_04278 [Cronobacter sakazakii E899]
 gi|424800997|ref|ZP_18226539.1| UPF0235 protein VC0458 [Cronobacter sakazakii 696]
 gi|429088023|ref|ZP_19150755.1| UPF0235 protein VC0458 [Cronobacter universalis NCTC 9529]
 gi|429094152|ref|ZP_19156705.1| UPF0235 protein VC0458 [Cronobacter dublinensis 1210]
 gi|429097359|ref|ZP_19159465.1| UPF0235 protein VC0458 [Cronobacter dublinensis 582]
 gi|429106110|ref|ZP_19167979.1| UPF0235 protein VC0458 [Cronobacter malonaticus 681]
 gi|429111540|ref|ZP_19173310.1| UPF0235 protein VC0458 [Cronobacter malonaticus 507]
 gi|429115023|ref|ZP_19175941.1| UPF0235 protein VC0458 [Cronobacter sakazakii 701]
 gi|429119942|ref|ZP_19180640.1| UPF0235 protein VC0458 [Cronobacter sakazakii 680]
 gi|449306929|ref|YP_007439285.1| hypothetical protein CSSP291_01990 [Cronobacter sakazakii SP291]
 gi|166229055|sp|A7MP89.1|Y387_ENTS8 RecName: Full=UPF0235 protein ESA_00387
 gi|156530859|gb|ABU75685.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC BAA-894]
 gi|260218459|emb|CBA33595.1| UPF0235 protein ESA_00387 [Cronobacter turicensis z3032]
 gi|333956039|gb|EGL73734.1| hypothetical protein CSE899_04278 [Cronobacter sakazakii E899]
 gi|387850137|gb|AFJ98234.1| hypothetical protein ES15_0661 [Cronobacter sakazakii ES15]
 gi|423236718|emb|CCK08409.1| UPF0235 protein VC0458 [Cronobacter sakazakii 696]
 gi|426283699|emb|CCJ85578.1| UPF0235 protein VC0458 [Cronobacter dublinensis 582]
 gi|426292833|emb|CCJ94092.1| UPF0235 protein VC0458 [Cronobacter malonaticus 681]
 gi|426312697|emb|CCJ99423.1| UPF0235 protein VC0458 [Cronobacter malonaticus 507]
 gi|426318152|emb|CCK02054.1| UPF0235 protein VC0458 [Cronobacter sakazakii 701]
 gi|426325628|emb|CCK11377.1| UPF0235 protein VC0458 [Cronobacter sakazakii 680]
 gi|426507826|emb|CCK15867.1| UPF0235 protein VC0458 [Cronobacter universalis NCTC 9529]
 gi|426740870|emb|CCJ82818.1| UPF0235 protein VC0458 [Cronobacter dublinensis 1210]
 gi|449096962|gb|AGE84996.1| hypothetical protein CSSP291_01990 [Cronobacter sakazakii SP291]
          Length = 96

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L++Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIIGLHGDELKVAITAPPVDGQANAHLVKYLAKQFRVAKSQVVIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  I P+ +
Sbjct: 73  LGRHKQVKI--IEPQQI 87


>gi|398827877|ref|ZP_10586080.1| hypothetical protein PMI41_00879 [Phyllobacterium sp. YR531]
 gi|398219175|gb|EJN05672.1| hypothetical protein PMI41_00879 [Phyllobacterium sp. YR531]
          Length = 105

 Score = 40.0 bits (92), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 47/93 (50%), Gaps = 7/93 (7%)

Query: 19  SKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANA 78
           S  +T+D  L   +RL P S       AK   +     D S + V  ++ A  +DG+A A
Sbjct: 4   SYFRTDDHGLTLFVRLTPKS-------AKDAIEGVEAVDDSHQRVKARVRAVPEDGKAIA 56

Query: 79  ALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           AL++ +   L V  R ++I +G+ SR K + V+
Sbjct: 57  ALVKLLGRWLDVPPRSITIVAGATSRLKQIKVQ 89


>gi|429082828|ref|ZP_19145884.1| UPF0235 protein VC0458 [Cronobacter condimenti 1330]
 gi|426548354|emb|CCJ71925.1| UPF0235 protein VC0458 [Cronobacter condimenti 1330]
          Length = 96

 Score = 40.0 bits (92), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L++Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIIGLHGDELKVAITAPPVDGQANAHLVKYLAKQFRVAKSQVVIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  I P+ +
Sbjct: 73  LGRHKQVKI--IEPQQI 87


>gi|406985947|gb|EKE06642.1| hypothetical protein ACD_18C00311G0009 [uncultured bacterium]
          Length = 73

 Score = 40.0 bits (92), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 37/70 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I   P S    +  +S+  + V++ +P  DGEAN  L+E +S    V + ++ I  G 
Sbjct: 3   LNIKVIPKSSLNKVIKISETELKVKLTSPPVDGEANKKLIEILSKEYNVAKSKIRIVRGE 62

Query: 102 KSRDKIVIVE 111
            S+ KIV +E
Sbjct: 63  TSKSKIVEIE 72


>gi|117621083|ref|YP_858117.1| hypothetical protein AHA_3661 [Aeromonas hydrophila subsp.
           hydrophila ATCC 7966]
 gi|166232625|sp|A0KPB6.1|Y3661_AERHH RecName: Full=UPF0235 protein AHA_3661
 gi|117562490|gb|ABK39438.1| conserved hypothetical protein [Aeromonas hydrophila subsp.
           hydrophila ATCC 7966]
          Length = 99

 Score = 40.0 bits (92), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 34/65 (52%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +    I  +  E + V I AP  DG+AN+ L++Y++    V + QV I  G   R K
Sbjct: 18  QPKASRDQIVGLHGEELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQVRIVRGELGRHK 77

Query: 107 IVIVE 111
            V +E
Sbjct: 78  TVAIE 82


>gi|238494338|ref|XP_002378405.1| DUF167 domain protein [Aspergillus flavus NRRL3357]
 gi|220695055|gb|EED51398.1| DUF167 domain protein [Aspergillus flavus NRRL3357]
 gi|391871368|gb|EIT80528.1| hypothetical protein Ao3042_02807 [Aspergillus oryzae 3.042]
          Length = 131

 Score = 40.0 bits (92), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 39  SVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           S+ I+ + KP + +    I  V  E V V + A  +DGEANAA+    + +L V +  V 
Sbjct: 30  SLQISCNVKPNASANREGIIAVGPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKSTVD 89

Query: 97  IGSGSKSRDKIVIVEEI 113
           +  G KSRDK + V ++
Sbjct: 90  VIRGLKSRDKTLCVSDL 106


>gi|411012173|ref|ZP_11388502.1| hypothetical protein AaquA_20870 [Aeromonas aquariorum AAK1]
 gi|423198241|ref|ZP_17184824.1| TIGR00251 family protein [Aeromonas hydrophila SSU]
 gi|404630465|gb|EKB27142.1| TIGR00251 family protein [Aeromonas hydrophila SSU]
          Length = 99

 Score = 40.0 bits (92), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 34/65 (52%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +    I  +  E + V I AP  DG+AN+ L++Y++    V + QV I  G   R K
Sbjct: 18  QPKASRDQIVGLHGEELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQVRIVRGELGRHK 77

Query: 107 IVIVE 111
            V +E
Sbjct: 78  TVAIE 82


>gi|330505393|ref|YP_004382262.1| hypothetical protein [Pseudomonas mendocina NK-01]
 gi|328919679|gb|AEB60510.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01]
          Length = 96

 Score = 40.0 bits (92), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 34/65 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  H +P +       +  E + +++ AP  DG+ANA L  +++   GV + QV + SG 
Sbjct: 13  LDCHLQPKASKDEFAGLHGERLKIRLTAPPVDGKANAHLQAFLAKAFGVAKSQVILESGE 72

Query: 102 KSRDK 106
            +R K
Sbjct: 73  LNRQK 77


>gi|167623103|ref|YP_001673397.1| hypothetical protein Shal_1169 [Shewanella halifaxensis HAW-EB4]
 gi|167353125|gb|ABZ75738.1| protein of unknown function DUF167 [Shewanella halifaxensis
           HAW-EB4]
          Length = 90

 Score = 40.0 bits (92), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  E + + I AP  DG+ANA L +Y+S    V +  + I  G 
Sbjct: 8   LQLYIQPKASRDQIVGLHGEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGDIEITKGQ 67

Query: 102 KSRDKIVIVE--EITPENVLNSL 122
             R K V +   ++ P  V N L
Sbjct: 68  MGRHKQVKITAPKLIPAEVQNLL 90


>gi|426402457|ref|YP_007021428.1| hypothetical protein Bdt_0453 [Bdellovibrio bacteriovorus str.
           Tiberius]
 gi|425859125|gb|AFY00161.1| hypothetical protein Bdt_0453 [Bdellovibrio bacteriovorus str.
           Tiberius]
          Length = 72

 Score = 40.0 bits (92), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/60 (28%), Positives = 34/60 (56%)

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           + +++ AP  DG+AN  L+E++S +  + +R V +  G   R K+V +  +  E +  +L
Sbjct: 8   IKIKLTAPPVDGKANECLIEFLSDLFDIPKRDVHLIKGETGRHKVVELTGLDAEKIREAL 67


>gi|238787374|ref|ZP_04631173.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC
           33641]
 gi|238724636|gb|EEQ16277.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC
           33641]
          Length = 90

 Score = 40.0 bits (92), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 3/80 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  
Sbjct: 5   LALRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQVIIEK 64

Query: 100 GSKSRD---KIVIVEEITPE 116
           G   R    KIV  ++I PE
Sbjct: 65  GELGRHKQLKIVNPQQIPPE 84


>gi|302868916|ref|YP_003837553.1| hypothetical protein Micau_4464 [Micromonospora aurantiaca ATCC
           27029]
 gi|315504614|ref|YP_004083501.1| hypothetical protein ML5_3839 [Micromonospora sp. L5]
 gi|302571775|gb|ADL47977.1| protein of unknown function DUF167 [Micromonospora aurantiaca ATCC
           27029]
 gi|315411233|gb|ADU09350.1| protein of unknown function DUF167 [Micromonospora sp. L5]
          Length = 101

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 4/78 (5%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           ++++ +  KPG+    +    D   G    V + APA DG A  A    ++  LG++   
Sbjct: 6   TLTVAVRVKPGAARARVGGRFDGPYGPALVVAVHAPAVDGRATEAARRALADALGIRPAT 65

Query: 95  VSIGSGSKSRDKIVIVEE 112
           VS+ SG+ SRDK+ +VE 
Sbjct: 66  VSLRSGAASRDKLFLVER 83


>gi|238752326|ref|ZP_04613805.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380]
 gi|238709487|gb|EEQ01726.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380]
          Length = 90

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 3/80 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  
Sbjct: 5   LALRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLVKFIAKQFKVAKSQVIIEK 64

Query: 100 GSKSRD---KIVIVEEITPE 116
           G   R    KIV  ++I PE
Sbjct: 65  GELGRHKQLKIVNPQQIPPE 84


>gi|431807957|ref|YP_007234855.1| hypothetical protein BPP43_06830 [Brachyspira pilosicoli P43/6/78]
 gi|430781316|gb|AGA66600.1| hypothetical protein BPP43_06830 [Brachyspira pilosicoli P43/6/78]
          Length = 98

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 31/44 (70%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           ++I A A DG+AN A+++++SS L +K++ V I  G KS  K++
Sbjct: 40  IRISAKAIDGKANKAIIDFLSSELNLKKKDVEILKGEKSSKKLI 83


>gi|30697842|ref|NP_851256.1| uncharacterized protein [Arabidopsis thaliana]
 gi|16226478|gb|AAL16178.1|AF428410_1 AT5g63440/MLE2_7 [Arabidopsis thaliana]
 gi|22137224|gb|AAM91457.1| AT5g63440/MLE2_7 [Arabidopsis thaliana]
 gi|332010364|gb|AED97747.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 205

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 34/63 (53%)

Query: 25  DENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYM 84
           D  +P CI  +    V + I  +  ++  +IT V+ + V V + APA  GEAN  LLE+M
Sbjct: 129 DAPVPPCISQLDGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAAPAARGEANNELLEFM 188

Query: 85  SSV 87
             V
Sbjct: 189 GRV 191


>gi|417328646|ref|ZP_12113721.1| hypothetical protein LTSEADE_4325 [Salmonella enterica subsp.
           enterica serovar Adelaide str. A4-669]
 gi|353567289|gb|EHC32532.1| hypothetical protein LTSEADE_4325 [Salmonella enterica subsp.
           enterica serovar Adelaide str. A4-669]
          Length = 85

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 2   LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 61

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 62  LGRHKQVKIIHPQQIPPE 79


>gi|157148503|ref|YP_001455822.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895]
 gi|166227247|sp|A8APH3.1|Y4329_CITK8 RecName: Full=UPF0235 protein CKO_04329
 gi|157085708|gb|ABV15386.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895]
          Length = 96

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|54310243|ref|YP_131263.1| hypothetical protein PBPRA3146 [Photobacterium profundum SS9]
 gi|46914684|emb|CAG21461.1| conserved hypothetical protein [Photobacterium profundum SS9]
          Length = 101

 Score = 40.0 bits (92), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +  E + + I AP  DG+ANA L ++++    V + QV I  G 
Sbjct: 19  IRLYIQPKASRDQIVGLHGEELKIAITAPPVDGKANAHLSKFLAKQFRVAKGQVLIEKGM 78

Query: 102 KSRDKIVIVE---EITP 115
           + R K V +E   EI P
Sbjct: 79  QGRHKQVRIESPREIPP 95


>gi|157374368|ref|YP_001472968.1| hypothetical protein Ssed_1229 [Shewanella sediminis HAW-EB3]
 gi|189038638|sp|A8FSL7.1|Y1229_SHESH RecName: Full=UPF0235 protein Ssed_1229
 gi|157316742|gb|ABV35840.1| protein of unknown function DUF167 [Shewanella sediminis HAW-EB3]
          Length = 95

 Score = 40.0 bits (92), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 43/83 (51%), Gaps = 3/83 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  V  E + + I AP  DG+ANA L++Y+S    V +  + I  G 
Sbjct: 13  LNLYIQPKASRDQIVGVHGEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGDIVILKGQ 72

Query: 102 KSRDKIVIV--EEITPENVLNSL 122
             R K + +    + PE ++N+L
Sbjct: 73  LGRHKQIKILSPRLIPE-IINAL 94


>gi|350551842|ref|ZP_08921053.1| UPF0235 protein yggU [Thiorhodospira sibirica ATCC 700588]
 gi|349796059|gb|EGZ49850.1| UPF0235 protein yggU [Thiorhodospira sibirica ATCC 700588]
          Length = 107

 Score = 40.0 bits (92), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 39/80 (48%), Gaps = 6/80 (7%)

Query: 35  VPPSSVS------ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           +PP  VS      + I  +P +    I     E + + I AP  DG AN  LL +++   
Sbjct: 13  LPPFLVSRDAQMLLYIRVQPRASRDKIAGRFGERLKICITAPPVDGAANEHLLCFIARHC 72

Query: 89  GVKRRQVSIGSGSKSRDKIV 108
            V RR V++  G+ SRDK +
Sbjct: 73  KVARRDVALLQGNSSRDKTI 92


>gi|110635561|ref|YP_675769.1| hypothetical protein Meso_3232 [Chelativorans sp. BNC1]
 gi|110286545|gb|ABG64604.1| protein of unknown function DUF167 [Chelativorans sp. BNC1]
          Length = 116

 Score = 40.0 bits (92), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 45/87 (51%), Gaps = 13/87 (14%)

Query: 35  VPPSSVS--------ITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALL 81
           +PPSS S        I +   P S S +I  V++   G      ++ A  + G+AN AL 
Sbjct: 1   MPPSSWSRRSKNGIEIFVRLTPKSSSDAIEGVTEGPDGQAFLKARVRAIPEAGKANEALE 60

Query: 82  EYMSSVLGVKRRQVSIGSGSKSRDKIV 108
             ++S LGV RR V++ SG+ SR K V
Sbjct: 61  RLLASALGVPRRDVAVSSGAASRRKTV 87


>gi|383760320|ref|YP_005439306.1| hypothetical protein RGE_44710 [Rubrivivax gelatinosus IL144]
 gi|381380990|dbj|BAL97807.1| hypothetical protein RGE_44710 [Rubrivivax gelatinosus IL144]
          Length = 103

 Score = 40.0 bits (92), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 23/96 (23%), Positives = 47/96 (48%), Gaps = 1/96 (1%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P C+     ++  + +   P ++      + D A+ V+++AP  DG+AN  L+ +++  L
Sbjct: 5   PPCLSAAGAAASRLRVAVVPNARRTGADGLHDGALRVRLNAPPVDGKANDTLVAWLADEL 64

Query: 89  GVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGK 124
            + +R V +  G   R K + ++   PE V   L +
Sbjct: 65  DLPKRAVRLTHGQTGRRKTIELDA-APEAVAAWLQR 99


>gi|421264059|ref|ZP_15715066.1| hypothetical protein KCU_06871, partial [Pasteurella multocida
           subsp. multocida str. P52VAC]
 gi|401688676|gb|EJS84229.1| hypothetical protein KCU_06871, partial [Pasteurella multocida
           subsp. multocida str. P52VAC]
          Length = 91

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 18/72 (25%), Positives = 36/72 (50%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
            + + I  +P +    I  +    + + I AP  DG+ANA LL+++S    V +  + + 
Sbjct: 13  HLRLRIFLQPKASKDQIVGLHGNELKITITAPPIDGQANAHLLKFLSKTFKVPKSSIVLE 72

Query: 99  SGSKSRDKIVIV 110
            G  +R K +++
Sbjct: 73  KGELNRHKQILI 84


>gi|146324075|ref|XP_001481499.1| DUF167 domain protein [Aspergillus fumigatus Af293]
 gi|129558081|gb|EBA27446.1| DUF167 domain protein [Aspergillus fumigatus Af293]
 gi|159126298|gb|EDP51414.1| YggU family protein [Aspergillus fumigatus A1163]
          Length = 133

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 39  SVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           S+ I  H KP + S    I  V  E V V + A  ++GEAN A+    + +  V +  V 
Sbjct: 31  SLQIACHVKPNASSSREGIIAVGAEKVDVCVAAVPRNGEANTAVSRVFAQIFDVPKSNVE 90

Query: 97  IGSGSKSRDKIVIVEEI 113
           +  G KSRDK + +  +
Sbjct: 91  VIRGLKSRDKTLCITNL 107


>gi|385805082|ref|YP_005841480.1| hypothetical protein FFONT_0032 [Fervidicoccus fontis Kam940]
 gi|383794945|gb|AFH42028.1| hypothetical protein FFONT_0032 [Fervidicoccus fontis Kam940]
          Length = 107

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 1/88 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + I+ KP SK   I     E +      P + G+ NA+L++++S + G+   ++ I  
Sbjct: 19  VDLIIYVKPKSKDTGIISERGELI-FHSTQPPERGKVNASLIKFLSKLTGISSNKIEINR 77

Query: 100 GSKSRDKIVIVEEITPENVLNSLGKASS 127
           G K R K V +    PE ++    +  S
Sbjct: 78  GEKDRVKTVRIYCDDPEKIVGLFAEYLS 105


>gi|169777211|ref|XP_001823071.1| yggU family protein [Aspergillus oryzae RIB40]
 gi|83771808|dbj|BAE61938.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 131

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 39  SVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           S+ I+ + KP + +    I  V  E V V + A  +DGEANAA+    + +L V +  V 
Sbjct: 30  SLQISCNVKPNASANREGIIAVGPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKSTVV 89

Query: 97  IGSGSKSRDKIVIVEEI 113
           +  G KSRDK + V ++
Sbjct: 90  VIRGLKSRDKTLCVSDL 106


>gi|300925051|ref|ZP_07140969.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1]
 gi|300418797|gb|EFK02108.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1]
          Length = 96

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  ++ + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLNGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|170290086|ref|YP_001736902.1| hypothetical protein Kcr_0466 [Candidatus Korarchaeum cryptofilum
           OPF8]
 gi|170174166|gb|ACB07219.1| protein of unknown function DUF167 [Candidatus Korarchaeum
           cryptofilum OPF8]
          Length = 72

 Score = 40.0 bits (92), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 29/44 (65%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           V + AP   G+AN A++E ++   GVK+  + I SG +SR+K+V
Sbjct: 25  VYVRAPPVKGKANEAVIEVLAEFFGVKKSDIRIISGERSREKVV 68


>gi|237729884|ref|ZP_04560365.1| conserved hypothetical protein [Citrobacter sp. 30_2]
 gi|283835286|ref|ZP_06355027.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
 gi|365101310|ref|ZP_09331940.1| UPF0235 protein yggU [Citrobacter freundii 4_7_47CFAA]
 gi|395228463|ref|ZP_10406786.1| protein YggU [Citrobacter sp. A1]
 gi|421845263|ref|ZP_16278418.1| hypothetical protein D186_09498 [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
 gi|424731916|ref|ZP_18160497.1| protein YggU [Citrobacter sp. L17]
 gi|226908490|gb|EEH94408.1| conserved hypothetical protein [Citrobacter sp. 30_2]
 gi|291068444|gb|EFE06553.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220]
 gi|363646860|gb|EHL86089.1| UPF0235 protein yggU [Citrobacter freundii 4_7_47CFAA]
 gi|394718112|gb|EJF23756.1| protein YggU [Citrobacter sp. A1]
 gi|411773584|gb|EKS57129.1| hypothetical protein D186_09498 [Citrobacter freundii ATCC 8090 =
           MTCC 1658]
 gi|422893544|gb|EKU33391.1| protein YggU [Citrobacter sp. L17]
 gi|455642801|gb|EMF21952.1| hypothetical protein H262_15472 [Citrobacter freundii GTC 09479]
          Length = 96

 Score = 40.0 bits (92), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|209525443|ref|ZP_03273983.1| protein of unknown function DUF167 [Arthrospira maxima CS-328]
 gi|423064137|ref|ZP_17052927.1| hypothetical protein SPLC1_S171680 [Arthrospira platensis C1]
 gi|209494123|gb|EDZ94438.1| protein of unknown function DUF167 [Arthrospira maxima CS-328]
 gi|406714554|gb|EKD09719.1| hypothetical protein SPLC1_S171680 [Arthrospira platensis C1]
          Length = 73

 Score = 40.0 bits (92), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
             I  KP S    I   +D ++ V + +P  DG+AN  L++ ++  L V +  + I SG 
Sbjct: 4   FNIQVKPNSPQQLIRKEADGSLTVYLKSPPVDGKANQELIKLLAKKLDVPKSNIKIKSGL 63

Query: 102 KSRDKIV 108
            SR K+V
Sbjct: 64  SSRRKLV 70


>gi|238763264|ref|ZP_04624229.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC
           33638]
 gi|238698537|gb|EEP91289.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC
           33638]
          Length = 90

 Score = 39.7 bits (91), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 3/80 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  
Sbjct: 5   LALRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQVIIEK 64

Query: 100 GSKSRD---KIVIVEEITPE 116
           G   R    KIV  ++I PE
Sbjct: 65  GELGRHKQIKIVNPQQIPPE 84


>gi|145224153|ref|YP_001134831.1| hypothetical protein Mflv_3569 [Mycobacterium gilvum PYR-GCK]
 gi|315444488|ref|YP_004077367.1| hypothetical protein Mspyr1_29120 [Mycobacterium gilvum Spyr1]
 gi|189040164|sp|A4T9S3.1|Y3569_MYCGI RecName: Full=UPF0235 protein Mflv_3569
 gi|145216639|gb|ABP46043.1| protein of unknown function DUF167 [Mycobacterium gilvum PYR-GCK]
 gi|315262791|gb|ADT99532.1| uncharacterized conserved protein [Mycobacterium gilvum Spyr1]
          Length = 75

 Score = 39.7 bits (91), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 42  ITIHAKPGSKSCSITDVS-DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           I++  KPGS+   + +   D A+ + +   A DG+AN A+ + ++  LGV R ++ + SG
Sbjct: 5   ISVRVKPGSRKGPLVEAGEDGALTLYVQERAVDGKANEAVTKLLAEHLGVPRSRIELVSG 64

Query: 101 SKSRDK 106
           + SR K
Sbjct: 65  ATSRHK 70


>gi|251793930|ref|YP_003008662.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus
           NJ8700]
 gi|422337117|ref|ZP_16418089.1| hypothetical protein HMPREF9335_01277 [Aggregatibacter aphrophilus
           F0387]
 gi|247535329|gb|ACS98575.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus
           NJ8700]
 gi|353345669|gb|EHB89960.1| hypothetical protein HMPREF9335_01277 [Aggregatibacter aphrophilus
           F0387]
          Length = 97

 Score = 39.7 bits (91), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 19/72 (26%), Positives = 39/72 (54%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + + + I  +P +    I  + D+ + ++I AP  DG+ANA LL+++S +  V +  + +
Sbjct: 9   ADLRLRIFLQPKAAKDHIVGLHDDELKIRITAPPIDGQANAHLLKFLSKLFKVPKSSIVL 68

Query: 98  GSGSKSRDKIVI 109
             G  +  K V+
Sbjct: 69  EKGELNCHKQVL 80


>gi|336247134|ref|YP_004590844.1| hypothetical protein EAE_03150 [Enterobacter aerogenes KCTC 2190]
 gi|444354747|ref|YP_007390891.1| UPF0235 protein VC0458 [Enterobacter aerogenes EA1509E]
 gi|334733190|gb|AEG95565.1| hypothetical protein EAE_03150 [Enterobacter aerogenes KCTC 2190]
 gi|443905577|emb|CCG33351.1| UPF0235 protein VC0458 [Enterobacter aerogenes EA1509E]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   S+  +  + + V I AP  DG+ANA L++Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSLVGLHGDELKVAITAPPVDGQANAHLVKYLAKQFRVAKSQVLIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  I P+ +
Sbjct: 73  LGRHKQVKI--IHPQQI 87


>gi|345864317|ref|ZP_08816519.1| hypothetical protein TevJSym_am00170 [endosymbiont of Tevnia
           jerichonana (vent Tica)]
 gi|345877261|ref|ZP_08829014.1| hypothetical protein Rifp1Sym_aq00240 [endosymbiont of Riftia
           pachyptila (vent Ph05)]
 gi|344225728|gb|EGV52078.1| hypothetical protein Rifp1Sym_aq00240 [endosymbiont of Riftia
           pachyptila (vent Ph05)]
 gi|345124513|gb|EGW54391.1| hypothetical protein TevJSym_am00170 [endosymbiont of Tevnia
           jerichonana (vent Tica)]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 38/72 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + +  +P +          +   ++I AP  DG+ANA LL+ ++   GV ++QV + +
Sbjct: 11  LELQLRIQPKASRNEFIGPHGDEFKLRITAPPVDGKANAHLLKMLAKAFGVSKKQVLLIA 70

Query: 100 GSKSRDKIVIVE 111
           G  SR K V ++
Sbjct: 71  GESSRSKRVRIK 82


>gi|417703802|ref|ZP_12352906.1| hypothetical protein SFK218_3956 [Shigella flexneri K-218]
 gi|333000185|gb|EGK19768.1| hypothetical protein SFK218_3956 [Shigella flexneri K-218]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LQLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|256810180|ref|YP_003127549.1| hypothetical protein Mefer_0210 [Methanocaldococcus fervens AG86]
 gi|256793380|gb|ACV24049.1| protein of unknown function DUF167 [Methanocaldococcus fervens
           AG86]
          Length = 98

 Score = 39.7 bits (91), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 22/87 (25%), Positives = 50/87 (57%), Gaps = 5/87 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSD--EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           V I I  + G+K   IT +++  + + ++I APA +G+AN  ++++   +    ++ + I
Sbjct: 13  VLIDIDVQAGAKKDEITGINEWRKRLSIKIKAPATEGKANKEIIKFFKEIF---KKDIEI 69

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLGK 124
            +G  +  K ++V++I  + V+ +L K
Sbjct: 70  VAGKLNPQKTILVKDIKKDEVIETLKK 96


>gi|56415040|ref|YP_152115.1| hypothetical protein SPA2965 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|168819868|ref|ZP_02831868.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|194446740|ref|YP_002042361.1| hypothetical protein SNSL254_A3349 [Salmonella enterica subsp.
           enterica serovar Newport str. SL254]
 gi|197363969|ref|YP_002143606.1| hypothetical protein SSPA2764 [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
 gi|204928324|ref|ZP_03219524.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|238909900|ref|ZP_04653737.1| hypothetical protein SentesTe_02030 [Salmonella enterica subsp.
           enterica serovar Tennessee str. CDC07-0191]
 gi|340000632|ref|YP_004731516.1| hypothetical protein SBG_2702 [Salmonella bongori NCTC 12419]
 gi|375002856|ref|ZP_09727196.1| hypothetical protein SEENIN0B_03217 [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
 gi|409246800|ref|YP_006887504.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Weltevreden str. 2007-60-3289-1]
 gi|416426479|ref|ZP_11692974.1| hypothetical protein SEEM315_07205 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|416429052|ref|ZP_11694265.1| hypothetical protein SEEM971_19929 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|416439105|ref|ZP_11699982.1| hypothetical protein SEEM973_20050 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|416446061|ref|ZP_11704816.1| hypothetical protein SEEM974_21245 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|416451453|ref|ZP_11708203.1| hypothetical protein SEEM201_12335 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|416459967|ref|ZP_11714412.1| hypothetical protein SEEM202_13733 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|416471971|ref|ZP_11719502.1| hypothetical protein SEEM954_11862 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|416474242|ref|ZP_11720093.1| hypothetical protein SEEM054_10407 [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|416492926|ref|ZP_11727713.1| hypothetical protein SEEM675_04396 [Salmonella enterica subsp.
           enterica serovar Montevideo str. OH_2009072675]
 gi|416500907|ref|ZP_11731769.1| hypothetical protein SEEM965_22046 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CASC_09SCPH15965]
 gi|416504060|ref|ZP_11733007.1| hypothetical protein SEEM031_11075 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|416515657|ref|ZP_11738784.1| hypothetical protein SEEM710_02886 [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|416527174|ref|ZP_11743012.1| hypothetical protein SEEM010_04425 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
 gi|416533894|ref|ZP_11746712.1| hypothetical protein SEEM030_02820 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB30]
 gi|416546782|ref|ZP_11754176.1| hypothetical protein SEEM19N_18161 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|416549627|ref|ZP_11755470.1| hypothetical protein SEEM29N_06577 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 29N]
 gi|416557886|ref|ZP_11759866.1| hypothetical protein SEEM42N_00570 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|416568522|ref|ZP_11764874.1| hypothetical protein SEEM41H_21643 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 4441 H]
 gi|416577713|ref|ZP_11769999.1| hypothetical protein SEEM801_04461 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|416584009|ref|ZP_11773749.1| hypothetical protein SEEM507_10116 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MD_MDA09249507]
 gi|416591655|ref|ZP_11778599.1| hypothetical protein SEEM877_00665 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|416598297|ref|ZP_11782684.1| hypothetical protein SEEM867_02147 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|416606813|ref|ZP_11788054.1| hypothetical protein SEEM180_20639 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|416610590|ref|ZP_11790197.1| hypothetical protein SEEM600_15746 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|416620298|ref|ZP_11795656.1| hypothetical protein SEEM581_15857 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|416628437|ref|ZP_11799602.1| hypothetical protein SEEM501_16285 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|416641813|ref|ZP_11805632.1| hypothetical protein SEEM460_18160 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|416647117|ref|ZP_11808116.1| hypothetical protein SEEM020_011659 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|416657010|ref|ZP_11813466.1| hypothetical protein SEEM6152_10068 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|416670252|ref|ZP_11819966.1| hypothetical protein SEEM0077_03334 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|416675104|ref|ZP_11821427.1| hypothetical protein SEEM0047_09590 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB102109-0047]
 gi|416695473|ref|ZP_11827702.1| hypothetical protein SEEM0055_15033 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|416706008|ref|ZP_11831267.1| hypothetical protein SEEM0052_19874 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|416712313|ref|ZP_11836024.1| hypothetical protein SEEM3312_06118 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|416718509|ref|ZP_11840617.1| hypothetical protein SEEM5258_05995 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|416723136|ref|ZP_11843901.1| hypothetical protein SEEM1156_01212 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|416733123|ref|ZP_11850214.1| hypothetical protein SEEM9199_21270 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|416737622|ref|ZP_11852775.1| hypothetical protein SEEM8282_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|416748574|ref|ZP_11858831.1| hypothetical protein SEEM8283_11916 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|416754736|ref|ZP_11861528.1| hypothetical protein SEEM8284_02121 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008284]
 gi|416761608|ref|ZP_11865659.1| hypothetical protein SEEM8285_21407 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|416771263|ref|ZP_11872528.1| hypothetical protein SEEM8287_10997 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|418481826|ref|ZP_13050849.1| hypothetical protein SEEM906_07666 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|418491214|ref|ZP_13057740.1| hypothetical protein SEEM5278_17561 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|418495810|ref|ZP_13062248.1| hypothetical protein SEEM5318_17111 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|418498626|ref|ZP_13065040.1| hypothetical protein SEEM5320_08430 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|418505602|ref|ZP_13071948.1| hypothetical protein SEEM5321_19409 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|418509894|ref|ZP_13076185.1| hypothetical protein SEEM5327_15499 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|418512439|ref|ZP_13078682.1| hypothetical protein SEEPO729_05936 [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|418524586|ref|ZP_13090571.1| hypothetical protein SEEM8286_16811 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|418788748|ref|ZP_13344541.1| hypothetical protein SEEN447_01107 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19447]
 gi|418795295|ref|ZP_13351004.1| hypothetical protein SEEN449_20424 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19449]
 gi|418797411|ref|ZP_13353097.1| hypothetical protein SEEN567_03789 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19567]
 gi|418806313|ref|ZP_13361885.1| hypothetical protein SEEN550_06407 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21550]
 gi|418810472|ref|ZP_13366012.1| hypothetical protein SEEN513_02361 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22513]
 gi|418818089|ref|ZP_13373568.1| hypothetical protein SEEN538_16505 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|418823158|ref|ZP_13378567.1| hypothetical protein SEEN425_19156 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|418824264|ref|ZP_13379632.1| hypothetical protein SEEN462_09745 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22462]
 gi|418831052|ref|ZP_13386010.1| hypothetical protein SEEN486_12934 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N18486]
 gi|418837215|ref|ZP_13392090.1| hypothetical protein SEEN543_06266 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N1543]
 gi|418842478|ref|ZP_13397288.1| hypothetical protein SEEN554_11949 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21554]
 gi|418847945|ref|ZP_13402685.1| hypothetical protein SEEN978_07400 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 37978]
 gi|418856108|ref|ZP_13410756.1| hypothetical protein SEEN593_15025 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
 gi|421885525|ref|ZP_16316716.1| hypothetical protein SS209_02680 [Salmonella enterica subsp.
           enterica serovar Senftenberg str. SS209]
 gi|437821419|ref|ZP_20843368.1| hypothetical protein SEEERB17_014518 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
 gi|452123083|ref|YP_007473331.1| hypothetical protein CFSAN001992_18065 [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
 gi|81361856|sp|Q5PML0.1|YGGU_SALPA RecName: Full=UPF0235 protein YggU
 gi|226730829|sp|B4T5K9.1|YGGU_SALNS RecName: Full=UPF0235 protein YggU
 gi|226730830|sp|B5BFQ9.1|YGGU_SALPK RecName: Full=UPF0235 protein YggU
 gi|56129297|gb|AAV78803.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. ATCC 9150]
 gi|194405403|gb|ACF65625.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Newport str. SL254]
 gi|197095446|emb|CAR61005.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Paratyphi A str. AKU_12601]
 gi|204322646|gb|EDZ07843.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Javiana str. GA_MM04042433]
 gi|205343351|gb|EDZ30115.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Weltevreden str. HI_N05-537]
 gi|320087534|emb|CBY97299.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Weltevreden str. 2007-60-3289-1]
 gi|322613499|gb|EFY10440.1| hypothetical protein SEEM315_07205 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315996572]
 gi|322621091|gb|EFY17949.1| hypothetical protein SEEM971_19929 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-1]
 gi|322624155|gb|EFY20989.1| hypothetical protein SEEM973_20050 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-3]
 gi|322628106|gb|EFY24895.1| hypothetical protein SEEM974_21245 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 495297-4]
 gi|322633225|gb|EFY29967.1| hypothetical protein SEEM201_12335 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-1]
 gi|322636197|gb|EFY32905.1| hypothetical protein SEEM202_13733 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 515920-2]
 gi|322639535|gb|EFY36223.1| hypothetical protein SEEM954_11862 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 531954]
 gi|322647532|gb|EFY44021.1| hypothetical protein SEEM054_10407 [Salmonella enterica subsp.
           enterica serovar Montevideo str. NC_MB110209-0054]
 gi|322648716|gb|EFY45163.1| hypothetical protein SEEM675_04396 [Salmonella enterica subsp.
           enterica serovar Montevideo str. OH_2009072675]
 gi|322653771|gb|EFY50097.1| hypothetical protein SEEM965_22046 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CASC_09SCPH15965]
 gi|322657877|gb|EFY54145.1| hypothetical protein SEEM19N_18161 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 19N]
 gi|322663980|gb|EFY60179.1| hypothetical protein SEEM801_04461 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 81038-01]
 gi|322669009|gb|EFY65160.1| hypothetical protein SEEM507_10116 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MD_MDA09249507]
 gi|322672997|gb|EFY69104.1| hypothetical protein SEEM877_00665 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 414877]
 gi|322678012|gb|EFY74075.1| hypothetical protein SEEM867_02147 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 366867]
 gi|322681188|gb|EFY77221.1| hypothetical protein SEEM180_20639 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 413180]
 gi|322687882|gb|EFY83849.1| hypothetical protein SEEM600_15746 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 446600]
 gi|323194922|gb|EFZ80109.1| hypothetical protein SEEM581_15857 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609458-1]
 gi|323199626|gb|EFZ84716.1| hypothetical protein SEEM501_16285 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556150-1]
 gi|323202627|gb|EFZ87667.1| hypothetical protein SEEM460_18160 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 609460]
 gi|323212562|gb|EFZ97379.1| hypothetical protein SEEM6152_10068 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 556152]
 gi|323214955|gb|EFZ99703.1| hypothetical protein SEEM0077_03334 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB101509-0077]
 gi|323222685|gb|EGA07050.1| hypothetical protein SEEM0047_09590 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB102109-0047]
 gi|323225428|gb|EGA09660.1| hypothetical protein SEEM0055_15033 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB110209-0055]
 gi|323230557|gb|EGA14675.1| hypothetical protein SEEM0052_19874 [Salmonella enterica subsp.
           enterica serovar Montevideo str. MB111609-0052]
 gi|323235092|gb|EGA19178.1| hypothetical protein SEEM3312_06118 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009083312]
 gi|323239131|gb|EGA23181.1| hypothetical protein SEEM5258_05995 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 2009085258]
 gi|323244511|gb|EGA28517.1| hypothetical protein SEEM1156_01212 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 315731156]
 gi|323247126|gb|EGA31092.1| hypothetical protein SEEM9199_21270 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2009159199]
 gi|323253391|gb|EGA37220.1| hypothetical protein SEEM8282_09961 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008282]
 gi|323256302|gb|EGA40038.1| hypothetical protein SEEM8283_11916 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008283]
 gi|323262522|gb|EGA46078.1| hypothetical protein SEEM8284_02121 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008284]
 gi|323267382|gb|EGA50866.1| hypothetical protein SEEM8285_21407 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008285]
 gi|323269214|gb|EGA52669.1| hypothetical protein SEEM8287_10997 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008287]
 gi|323669738|emb|CBJ94862.1| conserved hypothetical protein [Salmonella bongori]
 gi|327412908|emb|CAX67922.1| conserved hypothetical protein [Salmonella bongori]
 gi|339513994|emb|CCC31753.1| conserved hypothetical protein [Salmonella bongori NCTC 12419]
 gi|353077544|gb|EHB43304.1| hypothetical protein SEENIN0B_03217 [Salmonella enterica subsp.
           enterica serovar Infantis str. SARB27]
 gi|363556829|gb|EHL41042.1| hypothetical protein SEEM010_04425 [Salmonella enterica subsp.
           enterica serovar Montevideo str. LQC 10]
 gi|363558436|gb|EHL42627.1| hypothetical protein SEEM031_11075 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB31]
 gi|363563688|gb|EHL47755.1| hypothetical protein SEEM710_02886 [Salmonella enterica subsp.
           enterica serovar Montevideo str. ATCC BAA710]
 gi|363567518|gb|EHL51516.1| hypothetical protein SEEM030_02820 [Salmonella enterica subsp.
           enterica serovar Montevideo str. SARB30]
 gi|363569576|gb|EHL53526.1| hypothetical protein SEEM29N_06577 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 29N]
 gi|363577867|gb|EHL61686.1| hypothetical protein SEEM41H_21643 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 4441 H]
 gi|363578096|gb|EHL61913.1| hypothetical protein SEEM42N_00570 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 42N]
 gi|366058326|gb|EHN22615.1| hypothetical protein SEEM5318_17111 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035318]
 gi|366062913|gb|EHN27135.1| hypothetical protein SEEM5278_17561 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035278]
 gi|366064559|gb|EHN28756.1| hypothetical protein SEEM906_07666 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 80959-06]
 gi|366067909|gb|EHN32057.1| hypothetical protein SEEM5321_19409 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035321]
 gi|366073378|gb|EHN37451.1| hypothetical protein SEEM5320_08430 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035320]
 gi|366077494|gb|EHN41508.1| hypothetical protein SEEM5327_15499 [Salmonella enterica subsp.
           enterica serovar Montevideo str. CT_02035327]
 gi|366083946|gb|EHN47862.1| hypothetical protein SEEPO729_05936 [Salmonella enterica subsp.
           enterica serovar Pomona str. ATCC 10729]
 gi|366830560|gb|EHN57430.1| hypothetical protein SEEM020_011659 [Salmonella enterica subsp.
           enterica serovar Montevideo str. 507440-20]
 gi|372207445|gb|EHP20944.1| hypothetical protein SEEM8286_16811 [Salmonella enterica subsp.
           enterica serovar Montevideo str. IA_2010008286]
 gi|379984793|emb|CCF88989.1| hypothetical protein SS209_02680 [Salmonella enterica subsp.
           enterica serovar Senftenberg str. SS209]
 gi|392759437|gb|EJA16290.1| hypothetical protein SEEN449_20424 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19449]
 gi|392762414|gb|EJA19229.1| hypothetical protein SEEN447_01107 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19447]
 gi|392768850|gb|EJA25596.1| hypothetical protein SEEN567_03789 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19567]
 gi|392781420|gb|EJA38061.1| hypothetical protein SEEN513_02361 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22513]
 gi|392782930|gb|EJA39560.1| hypothetical protein SEEN550_06407 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21550]
 gi|392786052|gb|EJA42609.1| hypothetical protein SEEN425_19156 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22425]
 gi|392786501|gb|EJA43057.1| hypothetical protein SEEN538_16505 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21538]
 gi|392799291|gb|EJA55550.1| hypothetical protein SEEN543_06266 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N1543]
 gi|392800248|gb|EJA56486.1| hypothetical protein SEEN486_12934 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM N18486]
 gi|392807049|gb|EJA63133.1| hypothetical protein SEEN554_11949 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21554]
 gi|392820458|gb|EJA76308.1| hypothetical protein SEEN593_15025 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19593]
 gi|392823810|gb|EJA79603.1| hypothetical protein SEEN462_09745 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 22462]
 gi|392824004|gb|EJA79795.1| hypothetical protein SEEN978_07400 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 37978]
 gi|435306892|gb|ELO82121.1| hypothetical protein SEEERB17_014518 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SARB17]
 gi|451912087|gb|AGF83893.1| hypothetical protein CFSAN001992_18065 [Salmonella enterica subsp.
           enterica serovar Javiana str. CFSAN001992]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|417356849|ref|ZP_12132309.1| hypothetical protein LTSEGIV_1346 [Salmonella enterica subsp.
           enterica serovar Give str. S5-487]
 gi|417385536|ref|ZP_12150570.1| hypothetical protein LTSEJOH_4459 [Salmonella enterica subsp.
           enterica serovar Johannesburg str. S5-703]
 gi|417469760|ref|ZP_12166057.1| hypothetical protein LTSEMON_4341 [Salmonella enterica subsp.
           enterica serovar Montevideo str. S5-403]
 gi|417513435|ref|ZP_12177488.1| hypothetical protein LTSESEN_4811 [Salmonella enterica subsp.
           enterica serovar Senftenberg str. A4-543]
 gi|417541691|ref|ZP_12193356.1| hypothetical protein LTSEWAN_4728 [Salmonella enterica subsp.
           enterica serovar Wandsworth str. A4-580]
 gi|353595012|gb|EHC52356.1| hypothetical protein LTSEGIV_1346 [Salmonella enterica subsp.
           enterica serovar Give str. S5-487]
 gi|353605541|gb|EHC60022.1| hypothetical protein LTSEJOH_4459 [Salmonella enterica subsp.
           enterica serovar Johannesburg str. S5-703]
 gi|353626808|gb|EHC75271.1| hypothetical protein LTSEMON_4341 [Salmonella enterica subsp.
           enterica serovar Montevideo str. S5-403]
 gi|353636836|gb|EHC82805.1| hypothetical protein LTSESEN_4811 [Salmonella enterica subsp.
           enterica serovar Senftenberg str. A4-543]
 gi|353660261|gb|EHC99930.1| hypothetical protein LTSEWAN_4728 [Salmonella enterica subsp.
           enterica serovar Wandsworth str. A4-580]
          Length = 93

 Score = 39.7 bits (91), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 10  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 69

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 70  LGRHKQVKIIHPQQIPPE 87


>gi|162139546|ref|YP_218029.2| hypothetical protein SC3042 [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SC-B67]
 gi|168242905|ref|ZP_02667837.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|194448304|ref|YP_002047090.1| hypothetical protein SeHA_C3341 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. SL476]
 gi|197249095|ref|YP_002148016.1| hypothetical protein SeAg_B3265 [Salmonella enterica subsp.
           enterica serovar Agona str. SL483]
 gi|375115949|ref|ZP_09761119.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Choleraesuis str. SCSA50]
 gi|386592803|ref|YP_006089203.1| hypothetical protein SU5_03601 [Salmonella enterica subsp. enterica
           serovar Heidelberg str. B182]
 gi|418846828|ref|ZP_13401593.1| hypothetical protein SEEN443_22276 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19443]
 gi|418857579|ref|ZP_13412206.1| hypothetical protein SEEN470_15116 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19470]
 gi|418862654|ref|ZP_13417193.1| hypothetical protein SEEN536_07304 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19536]
 gi|419731355|ref|ZP_14258268.1| hypothetical protein SEEH1579_07600 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|419735810|ref|ZP_14262683.1| hypothetical protein SEEH1563_13866 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|419739579|ref|ZP_14266324.1| hypothetical protein SEEH1573_05248 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|419741975|ref|ZP_14268653.1| hypothetical protein SEEH1566_20677 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|419748807|ref|ZP_14275297.1| hypothetical protein SEEH1565_11531 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|421572981|ref|ZP_16018626.1| hypothetical protein CFSAN00322_22320 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00322]
 gi|421576960|ref|ZP_16022550.1| hypothetical protein CFSAN00325_19109 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|421579458|ref|ZP_16025021.1| hypothetical protein CFSAN00326_08578 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|421583310|ref|ZP_16028834.1| hypothetical protein CFSAN00328_05012 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
 gi|440763988|ref|ZP_20943022.1| hypothetical protein F434_13518 [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|440770015|ref|ZP_20948969.1| hypothetical protein F514_20242 [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|440772716|ref|ZP_20951619.1| hypothetical protein F515_09998 [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
 gi|226730824|sp|B5F5M7.1|YGGU_SALA4 RecName: Full=UPF0235 protein YggU
 gi|226730828|sp|B4THI7.1|YGGU_SALHS RecName: Full=UPF0235 protein YggU
 gi|194406608|gb|ACF66827.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Heidelberg str. SL476]
 gi|197212798|gb|ACH50195.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Agona str. SL483]
 gi|205338138|gb|EDZ24902.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Heidelberg str. SL486]
 gi|322716095|gb|EFZ07666.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Choleraesuis str. SCSA50]
 gi|381291536|gb|EIC32773.1| hypothetical protein SEEH1579_07600 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41579]
 gi|381294134|gb|EIC35274.1| hypothetical protein SEEH1563_13866 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41563]
 gi|381298158|gb|EIC39239.1| hypothetical protein SEEH1573_05248 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41573]
 gi|381312803|gb|EIC53596.1| hypothetical protein SEEH1565_11531 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41565]
 gi|381315342|gb|EIC56105.1| hypothetical protein SEEH1566_20677 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. 41566]
 gi|383799844|gb|AFH46926.1| UPF0235 protein [Salmonella enterica subsp. enterica serovar
           Heidelberg str. B182]
 gi|392809299|gb|EJA65336.1| hypothetical protein SEEN443_22276 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19443]
 gi|392834051|gb|EJA89661.1| hypothetical protein SEEN536_07304 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19536]
 gi|392835053|gb|EJA90653.1| hypothetical protein SEEN470_15116 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 19470]
 gi|402515057|gb|EJW22472.1| hypothetical protein CFSAN00322_22320 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00322]
 gi|402516844|gb|EJW24252.1| hypothetical protein CFSAN00325_19109 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00325]
 gi|402521669|gb|EJW29003.1| hypothetical protein CFSAN00326_08578 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00326]
 gi|402532236|gb|EJW39433.1| hypothetical protein CFSAN00328_05012 [Salmonella enterica subsp.
           enterica serovar Heidelberg str. CFSAN00328]
 gi|436412585|gb|ELP10524.1| hypothetical protein F514_20242 [Salmonella enterica subsp.
           enterica serovar Agona str. SH08SF124]
 gi|436417698|gb|ELP15586.1| hypothetical protein F434_13518 [Salmonella enterica subsp.
           enterica serovar Agona str. SH11G1113]
 gi|436417873|gb|ELP15760.1| hypothetical protein F515_09998 [Salmonella enterica subsp.
           enterica serovar Agona str. SH10GFN094]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|406998223|gb|EKE16166.1| hypothetical protein ACD_11C00029G0046 [uncultured bacterium]
          Length = 73

 Score = 39.7 bits (91), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 39/73 (53%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +   P S   S+  +SD    V++ AP  DG+AN  L++ ++   GV +  + I  
Sbjct: 1   MRIYVRVSPRSSKNSVEKISDGEYKVKLTAPPVDGQANDMLIKVLADYFGVSKSSLKIVG 60

Query: 100 GSKSRDKIVIVEE 112
           G  ++ K++ +E+
Sbjct: 61  GKSAKTKMIDIEK 73


>gi|432676062|ref|ZP_19911516.1| hypothetical protein A1YU_02613 [Escherichia coli KTE142]
 gi|431212767|gb|ELF10693.1| hypothetical protein A1YU_02613 [Escherichia coli KTE142]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|224584895|ref|YP_002638694.1| hypothetical protein SPC_3167 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
 gi|224469423|gb|ACN47253.1| hypothetical protein SPC_3167 [Salmonella enterica subsp. enterica
           serovar Paratyphi C strain RKS4594]
          Length = 93

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 10  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 69

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 70  LGRHKQVKIIHPQQIPPE 87


>gi|440232366|ref|YP_007346159.1| TIGR00251 family protein [Serratia marcescens FGI94]
 gi|440054071|gb|AGB83974.1| TIGR00251 family protein [Serratia marcescens FGI94]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 19/70 (27%), Positives = 36/70 (51%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  + I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  ++
Sbjct: 8   PDGLVIRLYIQPKASRDQIIGLHGDEIKVAITAPPVDGQANAHLVKFLAKQFKVAKSNIT 67

Query: 97  IGSGSKSRDK 106
           I  G   R K
Sbjct: 68  IEKGELGRHK 77


>gi|431930170|ref|YP_007243216.1| hypothetical protein Thimo_0747 [Thioflavicoccus mobilis 8321]
 gi|431828473|gb|AGA89586.1| TIGR00251 family protein [Thioflavicoccus mobilis 8321]
          Length = 100

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 37/72 (51%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +T+  +P +K  +      +   VQI AP  DG+ANA L  +++    V   Q+ + +
Sbjct: 11  LCLTLKVQPRAKRDAFVGPLGDCYRVQITAPPVDGKANAHLRRFLAETFNVPVSQIDLDA 70

Query: 100 GSKSRDKIVIVE 111
           G+ SR K V + 
Sbjct: 71  GALSRIKRVRIH 82


>gi|341581443|ref|YP_004761935.1| hypothetical protein GQS_01780 [Thermococcus sp. 4557]
 gi|340809101|gb|AEK72258.1| hypothetical protein GQS_01780 [Thermococcus sp. 4557]
          Length = 101

 Score = 39.7 bits (91), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 46/84 (54%), Gaps = 7/84 (8%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG---VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           + ++ +P +K   I  + DE  G   V+I AP  +G+AN  L++++S +LG    +V + 
Sbjct: 15  LLVYVQPKAKKNEIEGI-DEWRGRLKVKIKAPPVEGKANKELVKFLSKLLGA---EVELV 70

Query: 99  SGSKSRDKIVIVEEITPENVLNSL 122
            G  SR+K ++V  +  E V   L
Sbjct: 71  RGETSREKDLLVRGLRVEEVKEKL 94


>gi|307310426|ref|ZP_07590074.1| protein of unknown function DUF167 [Escherichia coli W]
 gi|378711596|ref|YP_005276489.1| hypothetical protein [Escherichia coli KO11FL]
 gi|386610342|ref|YP_006125828.1| hypothetical protein ECW_m3211 [Escherichia coli W]
 gi|386700093|ref|YP_006163930.1| hypothetical protein KO11_07985 [Escherichia coli KO11FL]
 gi|386710850|ref|YP_006174571.1| hypothetical protein WFL_15685 [Escherichia coli W]
 gi|306909321|gb|EFN39816.1| protein of unknown function DUF167 [Escherichia coli W]
 gi|315062259|gb|ADT76586.1| conserved hypothetical protein [Escherichia coli W]
 gi|323377157|gb|ADX49425.1| protein of unknown function DUF167 [Escherichia coli KO11FL]
 gi|383391620|gb|AFH16578.1| hypothetical protein KO11_07985 [Escherichia coli KO11FL]
 gi|383406542|gb|AFH12785.1| hypothetical protein WFL_15685 [Escherichia coli W]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSYLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|293412313|ref|ZP_06655036.1| hypothetical protein ECEG_02320 [Escherichia coli B354]
 gi|386620534|ref|YP_006140114.1| hypothetical protein ECNA114_3001 [Escherichia coli NA114]
 gi|432477199|ref|ZP_19719191.1| hypothetical protein A15Q_03400 [Escherichia coli KTE208]
 gi|432519102|ref|ZP_19756284.1| hypothetical protein A17U_02078 [Escherichia coli KTE228]
 gi|432560139|ref|ZP_19796801.1| hypothetical protein A1S7_03795 [Escherichia coli KTE49]
 gi|432776019|ref|ZP_20010283.1| hypothetical protein A1SG_04110 [Escherichia coli KTE54]
 gi|432914273|ref|ZP_20119813.1| hypothetical protein A13Q_03446 [Escherichia coli KTE190]
 gi|433020053|ref|ZP_20208225.1| hypothetical protein WI7_03053 [Escherichia coli KTE105]
 gi|433160037|ref|ZP_20344866.1| hypothetical protein WKU_03118 [Escherichia coli KTE177]
 gi|291469084|gb|EFF11575.1| hypothetical protein ECEG_02320 [Escherichia coli B354]
 gi|333971035|gb|AEG37840.1| hypothetical protein ECNA114_3001 [Escherichia coli NA114]
 gi|431003328|gb|ELD18814.1| hypothetical protein A15Q_03400 [Escherichia coli KTE208]
 gi|431049499|gb|ELD59461.1| hypothetical protein A17U_02078 [Escherichia coli KTE228]
 gi|431089734|gb|ELD95538.1| hypothetical protein A1S7_03795 [Escherichia coli KTE49]
 gi|431316539|gb|ELG04344.1| hypothetical protein A1SG_04110 [Escherichia coli KTE54]
 gi|431437804|gb|ELH19312.1| hypothetical protein A13Q_03446 [Escherichia coli KTE190]
 gi|431529077|gb|ELI05781.1| hypothetical protein WI7_03053 [Escherichia coli KTE105]
 gi|431675574|gb|ELJ41705.1| hypothetical protein WKU_03118 [Escherichia coli KTE177]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|422331977|ref|ZP_16412992.1| UPF0235 protein yggU [Escherichia coli 4_1_47FAA]
 gi|432854081|ref|ZP_20082626.1| hypothetical protein A1YY_02780 [Escherichia coli KTE144]
 gi|373247192|gb|EHP66639.1| UPF0235 protein yggU [Escherichia coli 4_1_47FAA]
 gi|431398496|gb|ELG81916.1| hypothetical protein A1YY_02780 [Escherichia coli KTE144]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|215488251|ref|YP_002330682.1| hypothetical protein E2348C_3206 [Escherichia coli O127:H6 str.
           E2348/69]
 gi|312964784|ref|ZP_07779024.1| conserved hypothetical protein [Escherichia coli 2362-75]
 gi|331684580|ref|ZP_08385172.1| conserved hypothetical protein [Escherichia coli H299]
 gi|417757201|ref|ZP_12405272.1| hypothetical protein ECDEC2B_3543 [Escherichia coli DEC2B]
 gi|418998271|ref|ZP_13545861.1| hypothetical protein ECDEC1A_3296 [Escherichia coli DEC1A]
 gi|419003540|ref|ZP_13551058.1| hypothetical protein ECDEC1B_3457 [Escherichia coli DEC1B]
 gi|419009076|ref|ZP_13556500.1| hypothetical protein ECDEC1C_3390 [Escherichia coli DEC1C]
 gi|419014868|ref|ZP_13562211.1| hypothetical protein ECDEC1D_3735 [Escherichia coli DEC1D]
 gi|419019894|ref|ZP_13567198.1| hypothetical protein ECDEC1E_3625 [Escherichia coli DEC1E]
 gi|419025283|ref|ZP_13572506.1| hypothetical protein ECDEC2A_3435 [Escherichia coli DEC2A]
 gi|419030438|ref|ZP_13577594.1| hypothetical protein ECDEC2C_3491 [Escherichia coli DEC2C]
 gi|419036102|ref|ZP_13583184.1| hypothetical protein ECDEC2D_3461 [Escherichia coli DEC2D]
 gi|419041126|ref|ZP_13588148.1| hypothetical protein ECDEC2E_3455 [Escherichia coli DEC2E]
 gi|450192378|ref|ZP_21891613.1| hypothetical protein A364_14967 [Escherichia coli SEPT362]
 gi|254814154|sp|B7UI00.1|YGGU_ECO27 RecName: Full=UPF0235 protein YggU
 gi|215266323|emb|CAS10754.1| predicted protein [Escherichia coli O127:H6 str. E2348/69]
 gi|312290340|gb|EFR18220.1| conserved hypothetical protein [Escherichia coli 2362-75]
 gi|331078195|gb|EGI49401.1| conserved hypothetical protein [Escherichia coli H299]
 gi|377842221|gb|EHU07276.1| hypothetical protein ECDEC1A_3296 [Escherichia coli DEC1A]
 gi|377842431|gb|EHU07485.1| hypothetical protein ECDEC1C_3390 [Escherichia coli DEC1C]
 gi|377845263|gb|EHU10286.1| hypothetical protein ECDEC1B_3457 [Escherichia coli DEC1B]
 gi|377855550|gb|EHU20421.1| hypothetical protein ECDEC1D_3735 [Escherichia coli DEC1D]
 gi|377859054|gb|EHU23892.1| hypothetical protein ECDEC1E_3625 [Escherichia coli DEC1E]
 gi|377862641|gb|EHU27453.1| hypothetical protein ECDEC2A_3435 [Escherichia coli DEC2A]
 gi|377872579|gb|EHU37225.1| hypothetical protein ECDEC2B_3543 [Escherichia coli DEC2B]
 gi|377875815|gb|EHU40424.1| hypothetical protein ECDEC2C_3491 [Escherichia coli DEC2C]
 gi|377877712|gb|EHU42302.1| hypothetical protein ECDEC2D_3461 [Escherichia coli DEC2D]
 gi|377888228|gb|EHU52700.1| hypothetical protein ECDEC2E_3455 [Escherichia coli DEC2E]
 gi|449318694|gb|EMD08758.1| hypothetical protein A364_14967 [Escherichia coli SEPT362]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|118469743|ref|YP_888136.1| hypothetical protein MSMEG_3845 [Mycobacterium smegmatis str. MC2
           155]
 gi|399988158|ref|YP_006568508.1| hypothetical protein MSMEI_3755 [Mycobacterium smegmatis str. MC2
           155]
 gi|441210781|ref|ZP_20974779.1| hypothetical protein D806_3954 [Mycobacterium smegmatis MKD8]
 gi|226706061|sp|A0QYZ8.1|Y3845_MYCS2 RecName: Full=UPF0235 protein MSMEG_3845
 gi|118171030|gb|ABK71926.1| conserved domain protein [Mycobacterium smegmatis str. MC2 155]
 gi|399232720|gb|AFP40213.1| UPF0235 protein [Mycobacterium smegmatis str. MC2 155]
 gi|440626711|gb|ELQ88539.1| hypothetical protein D806_3954 [Mycobacterium smegmatis MKD8]
          Length = 75

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 41  SITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++ +  KPGS+   + + +D+  + + +   A DG+ANAA+ + +++ LGV   +V + S
Sbjct: 4   TVVVRVKPGSRKGPLVETADDGTLTIYVQERAVDGKANAAVTKLLAAHLGVPPSRVELAS 63

Query: 100 GSKSRDK 106
           G+ +R K
Sbjct: 64  GATARLK 70


>gi|90111518|ref|NP_417428.2| conserved protein, UPF0235 family [Escherichia coli str. K-12
           substr. MG1655]
 gi|161984863|ref|YP_409371.2| hypothetical protein SBO_3037 [Shigella boydii Sb227]
 gi|170082505|ref|YP_001731825.1| hypothetical protein ECDH10B_3128 [Escherichia coli str. K-12
           substr. DH10B]
 gi|238902075|ref|YP_002927871.1| hypothetical protein BWG_2675 [Escherichia coli BW2952]
 gi|253772209|ref|YP_003035040.1| hypothetical protein ECBD_0787 [Escherichia coli
           'BL21-Gold(DE3)pLysS AG']
 gi|300947684|ref|ZP_07161853.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           116-1]
 gi|300954200|ref|ZP_07166665.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1]
 gi|301643693|ref|ZP_07243732.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           146-1]
 gi|386594315|ref|YP_006090715.1| hypothetical protein [Escherichia coli DH1]
 gi|386615684|ref|YP_006135350.1| hypothetical protein UMNK88_3650 [Escherichia coli UMNK88]
 gi|387608596|ref|YP_006097452.1| hypothetical protein EC042_3160 [Escherichia coli 042]
 gi|387613572|ref|YP_006116688.1| hypothetical protein ETEC_3143 [Escherichia coli ETEC H10407]
 gi|387622626|ref|YP_006130254.1| hypothetical protein ECDH1ME8569_2853 [Escherichia coli DH1]
 gi|388478960|ref|YP_491152.1| hypothetical protein Y75_p2883 [Escherichia coli str. K-12 substr.
           W3110]
 gi|415779348|ref|ZP_11490077.1| conserved hypothetical protein [Escherichia coli 3431]
 gi|415811458|ref|ZP_11503808.1| hypothetical protein ECLT68_2152 [Escherichia coli LT-68]
 gi|416272250|ref|ZP_11643157.1| hypothetical protein SDB_03448 [Shigella dysenteriae CDC 74-1112]
 gi|416301416|ref|ZP_11652965.1| hypothetical protein SGF_03464 [Shigella flexneri CDC 796-83]
 gi|417262356|ref|ZP_12049830.1| TIGR00251 family protein [Escherichia coli 2.3916]
 gi|417272760|ref|ZP_12060109.1| TIGR00251 family protein [Escherichia coli 2.4168]
 gi|417279894|ref|ZP_12067198.1| TIGR00251 family protein [Escherichia coli 3.2303]
 gi|417290741|ref|ZP_12078022.1| TIGR00251 family protein [Escherichia coli B41]
 gi|417614423|ref|ZP_12264879.1| hypothetical protein ECSTECEH250_3505 [Escherichia coli STEC_EH250]
 gi|417619564|ref|ZP_12269972.1| hypothetical protein ECG581_3386 [Escherichia coli G58-1]
 gi|417635982|ref|ZP_12286193.1| hypothetical protein ECSTECS1191_3926 [Escherichia coli STEC_S1191]
 gi|417683765|ref|ZP_12333109.1| hypothetical protein SB359474_3531 [Shigella boydii 3594-74]
 gi|417946688|ref|ZP_12589900.1| hypothetical protein IAE_16797 [Escherichia coli XH140A]
 gi|417976686|ref|ZP_12617477.1| hypothetical protein IAM_10157 [Escherichia coli XH001]
 gi|418304514|ref|ZP_12916308.1| hypothetical protein UMNF18_3785 [Escherichia coli UMNF18]
 gi|418956670|ref|ZP_13508595.1| hypothetical protein OQE_08310 [Escherichia coli J53]
 gi|419143898|ref|ZP_13688631.1| hypothetical protein ECDEC6A_3568 [Escherichia coli DEC6A]
 gi|419149893|ref|ZP_13694544.1| hypothetical protein ECDEC6B_3954 [Escherichia coli DEC6B]
 gi|419155392|ref|ZP_13699951.1| hypothetical protein ECDEC6C_3575 [Escherichia coli DEC6C]
 gi|419160704|ref|ZP_13705204.1| hypothetical protein ECDEC6D_3536 [Escherichia coli DEC6D]
 gi|419165753|ref|ZP_13710207.1| hypothetical protein ECDEC6E_3499 [Escherichia coli DEC6E]
 gi|419812213|ref|ZP_14337082.1| hypothetical protein UWO_16940 [Escherichia coli O32:H37 str. P4]
 gi|419939571|ref|ZP_14456362.1| hypothetical protein EC75_09905 [Escherichia coli 75]
 gi|420327218|ref|ZP_14828963.1| hypothetical protein SFCCH060_3557 [Shigella flexneri CCH060]
 gi|420337613|ref|ZP_14839175.1| hypothetical protein SFK315_3375 [Shigella flexneri K-315]
 gi|420354430|ref|ZP_14855516.1| hypothetical protein SB444474_3498 [Shigella boydii 4444-74]
 gi|420381824|ref|ZP_14881264.1| hypothetical protein SD22575_3918 [Shigella dysenteriae 225-75]
 gi|421684068|ref|ZP_16123857.1| hypothetical protein SF148580_3423 [Shigella flexneri 1485-80]
 gi|421775617|ref|ZP_16212226.1| hypothetical protein ECAD30_17350 [Escherichia coli AD30]
 gi|422767548|ref|ZP_16821274.1| yggU [Escherichia coli E1520]
 gi|422818070|ref|ZP_16866283.1| UPF0235 protein yggU [Escherichia coli M919]
 gi|425116483|ref|ZP_18518274.1| hypothetical protein EC80566_3144 [Escherichia coli 8.0566]
 gi|425121239|ref|ZP_18522926.1| hypothetical protein EC80569_3148 [Escherichia coli 8.0569]
 gi|425274131|ref|ZP_18665532.1| hypothetical protein ECTW15901_3347 [Escherichia coli TW15901]
 gi|425284655|ref|ZP_18675687.1| hypothetical protein ECTW00353_3263 [Escherichia coli TW00353]
 gi|432628582|ref|ZP_19864554.1| hypothetical protein A1UQ_03436 [Escherichia coli KTE77]
 gi|432638164|ref|ZP_19874031.1| hypothetical protein A1UY_03533 [Escherichia coli KTE81]
 gi|432662160|ref|ZP_19897798.1| hypothetical protein A1WY_03591 [Escherichia coli KTE111]
 gi|432686766|ref|ZP_19922059.1| hypothetical protein A31A_03627 [Escherichia coli KTE156]
 gi|432705709|ref|ZP_19940805.1| hypothetical protein A31Q_03592 [Escherichia coli KTE171]
 gi|432738432|ref|ZP_19973186.1| hypothetical protein WGE_03691 [Escherichia coli KTE42]
 gi|432876870|ref|ZP_20094739.1| hypothetical protein A317_00964 [Escherichia coli KTE154]
 gi|432956639|ref|ZP_20148297.1| hypothetical protein A155_03593 [Escherichia coli KTE197]
 gi|450221932|ref|ZP_21896647.1| hypothetical protein C202_14402 [Escherichia coli O08]
 gi|450248555|ref|ZP_21901428.1| hypothetical protein C201_13816 [Escherichia coli S17]
 gi|6920084|sp|P52060.2|YGGU_ECOLI RecName: Full=UPF0235 protein YggU
 gi|226730819|sp|B1XFB3.1|YGGU_ECODH RecName: Full=UPF0235 protein YggU
 gi|259710253|sp|C5A0M2.1|YGGU_ECOBW RecName: Full=UPF0235 protein YggU
 gi|85675763|dbj|BAE77016.1| conserved hypothetical protein [Escherichia coli str. K12 substr.
           W3110]
 gi|87082189|gb|AAC75990.2| conserved protein, UPF0235 family [Escherichia coli str. K-12
           substr. MG1655]
 gi|169890340|gb|ACB04047.1| conserved protein [Escherichia coli str. K-12 substr. DH10B]
 gi|238860521|gb|ACR62519.1| conserved protein [Escherichia coli BW2952]
 gi|253323253|gb|ACT27855.1| protein of unknown function DUF167 [Escherichia coli
           'BL21-Gold(DE3)pLysS AG']
 gi|260448004|gb|ACX38426.1| protein of unknown function DUF167 [Escherichia coli DH1]
 gi|284922896|emb|CBG35985.1| conserved hypothetical protein [Escherichia coli 042]
 gi|300318784|gb|EFJ68568.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1]
 gi|300452730|gb|EFK16350.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           116-1]
 gi|301077895|gb|EFK92701.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           146-1]
 gi|309703308|emb|CBJ02644.1| conserved hypothetical protein [Escherichia coli ETEC H10407]
 gi|315137550|dbj|BAJ44709.1| hypothetical protein ECDH1ME8569_2853 [Escherichia coli DH1]
 gi|315614885|gb|EFU95523.1| conserved hypothetical protein [Escherichia coli 3431]
 gi|320174051|gb|EFW49221.1| hypothetical protein SDB_03448 [Shigella dysenteriae CDC 74-1112]
 gi|320184300|gb|EFW59112.1| hypothetical protein SGF_03464 [Shigella flexneri CDC 796-83]
 gi|323173833|gb|EFZ59462.1| hypothetical protein ECLT68_2152 [Escherichia coli LT-68]
 gi|323936044|gb|EGB32339.1| yggU [Escherichia coli E1520]
 gi|332091357|gb|EGI96445.1| hypothetical protein SB359474_3531 [Shigella boydii 3594-74]
 gi|332344853|gb|AEE58187.1| conserved hypothetical protein [Escherichia coli UMNK88]
 gi|339416612|gb|AEJ58284.1| hypothetical protein UMNF18_3785 [Escherichia coli UMNF18]
 gi|342361597|gb|EGU25732.1| hypothetical protein IAE_16797 [Escherichia coli XH140A]
 gi|344193608|gb|EGV47687.1| hypothetical protein IAM_10157 [Escherichia coli XH001]
 gi|345360924|gb|EGW93089.1| hypothetical protein ECSTECEH250_3505 [Escherichia coli STEC_EH250]
 gi|345372694|gb|EGX04657.1| hypothetical protein ECG581_3386 [Escherichia coli G58-1]
 gi|345386852|gb|EGX16685.1| hypothetical protein ECSTECS1191_3926 [Escherichia coli STEC_S1191]
 gi|359333191|dbj|BAL39638.1| conserved protein [Escherichia coli str. K-12 substr. MDS42]
 gi|377990998|gb|EHV54154.1| hypothetical protein ECDEC6B_3954 [Escherichia coli DEC6B]
 gi|377992048|gb|EHV55196.1| hypothetical protein ECDEC6A_3568 [Escherichia coli DEC6A]
 gi|377995241|gb|EHV58361.1| hypothetical protein ECDEC6C_3575 [Escherichia coli DEC6C]
 gi|378005893|gb|EHV68885.1| hypothetical protein ECDEC6D_3536 [Escherichia coli DEC6D]
 gi|378008682|gb|EHV71641.1| hypothetical protein ECDEC6E_3499 [Escherichia coli DEC6E]
 gi|384380464|gb|EIE38330.1| hypothetical protein OQE_08310 [Escherichia coli J53]
 gi|385154950|gb|EIF16957.1| hypothetical protein UWO_16940 [Escherichia coli O32:H37 str. P4]
 gi|385538583|gb|EIF85445.1| UPF0235 protein yggU [Escherichia coli M919]
 gi|386223802|gb|EII46151.1| TIGR00251 family protein [Escherichia coli 2.3916]
 gi|386236460|gb|EII68436.1| TIGR00251 family protein [Escherichia coli 2.4168]
 gi|386237224|gb|EII74170.1| TIGR00251 family protein [Escherichia coli 3.2303]
 gi|386253063|gb|EIJ02753.1| TIGR00251 family protein [Escherichia coli B41]
 gi|388407365|gb|EIL67738.1| hypothetical protein EC75_09905 [Escherichia coli 75]
 gi|391247980|gb|EIQ07224.1| hypothetical protein SFCCH060_3557 [Shigella flexneri CCH060]
 gi|391259487|gb|EIQ18561.1| hypothetical protein SFK315_3375 [Shigella flexneri K-315]
 gi|391275692|gb|EIQ34477.1| hypothetical protein SB444474_3498 [Shigella boydii 4444-74]
 gi|391299331|gb|EIQ57295.1| hypothetical protein SD22575_3918 [Shigella dysenteriae 225-75]
 gi|404337038|gb|EJZ63493.1| hypothetical protein SF148580_3423 [Shigella flexneri 1485-80]
 gi|408191746|gb|EKI17345.1| hypothetical protein ECTW15901_3347 [Escherichia coli TW15901]
 gi|408200844|gb|EKI26020.1| hypothetical protein ECTW00353_3263 [Escherichia coli TW00353]
 gi|408459503|gb|EKJ83285.1| hypothetical protein ECAD30_17350 [Escherichia coli AD30]
 gi|408566011|gb|EKK42092.1| hypothetical protein EC80566_3144 [Escherichia coli 8.0566]
 gi|408567001|gb|EKK43062.1| hypothetical protein EC80569_3148 [Escherichia coli 8.0569]
 gi|431161875|gb|ELE62344.1| hypothetical protein A1UQ_03436 [Escherichia coli KTE77]
 gi|431169579|gb|ELE69798.1| hypothetical protein A1UY_03533 [Escherichia coli KTE81]
 gi|431198234|gb|ELE97059.1| hypothetical protein A1WY_03591 [Escherichia coli KTE111]
 gi|431220740|gb|ELF18073.1| hypothetical protein A31A_03627 [Escherichia coli KTE156]
 gi|431241493|gb|ELF35929.1| hypothetical protein A31Q_03592 [Escherichia coli KTE171]
 gi|431280487|gb|ELF71403.1| hypothetical protein WGE_03691 [Escherichia coli KTE42]
 gi|431418834|gb|ELH01228.1| hypothetical protein A317_00964 [Escherichia coli KTE154]
 gi|431466256|gb|ELH46333.1| hypothetical protein A155_03593 [Escherichia coli KTE197]
 gi|449315572|gb|EMD05713.1| hypothetical protein C202_14402 [Escherichia coli O08]
 gi|449316991|gb|EMD07086.1| hypothetical protein C201_13816 [Escherichia coli S17]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|342180720|emb|CCC90196.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 165

 Score = 39.7 bits (91), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 20/73 (27%), Positives = 43/73 (58%), Gaps = 6/73 (8%)

Query: 37  PSSVSITIHAKPGSKSCSIT---DVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P    + + AKPG+++ ++     + D+A+ V++ AP  DG+AN  L+ +M ++L    +
Sbjct: 12  PGCFHLAVRAKPGARTTALAARPQIIDDALEVRVAAPPVDGKANTELICFMQALL---EQ 68

Query: 94  QVSIGSGSKSRDK 106
           Q+S    ++++ K
Sbjct: 69  QLSTLRKTQTQQK 81


>gi|149177442|ref|ZP_01856046.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797]
 gi|148843775|gb|EDL58134.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797]
          Length = 104

 Score = 39.7 bits (91), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 23/89 (25%), Positives = 46/89 (51%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           +++ + + A+P S    I  V D  + V +    + G+AN ALL+ + + L +KR Q+ +
Sbjct: 12  TAILLPVRAQPRSSKNGIEGVHDGRLKVCVTQVPEKGKANKALLKVLQTALKLKRSQIEL 71

Query: 98  GSGSKSRDKIVIVEEITPENVLNSLGKAS 126
             G  +  KI  + EI+ + +   +  A+
Sbjct: 72  YKGETAALKIFRIHEISRDELYTRITDAA 100


>gi|417598314|ref|ZP_12248945.1| hypothetical protein EC30301_3462 [Escherichia coli 3030-1]
 gi|345351133|gb|EGW83399.1| hypothetical protein EC30301_3462 [Escherichia coli 3030-1]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|329902481|ref|ZP_08273126.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327548773|gb|EGF33410.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 105

 Score = 39.7 bits (91), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 19/74 (25%), Positives = 41/74 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + +     +K   +  V+D+ + +++ A   +G+AN AL+ +++  L V R  VS+  
Sbjct: 16  VRLAVQVAANAKKTEVIGVADDVLKIKLHAQPIEGKANDALVRFVAGQLHVPRTTVSVTH 75

Query: 100 GSKSRDKIVIVEEI 113
           G  S+ K+++V  +
Sbjct: 76  GLTSKRKLLLVRAV 89


>gi|148358546|ref|YP_001249753.1| hypothetical protein LPC_0417 [Legionella pneumophila str. Corby]
 gi|296108365|ref|YP_003620066.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99
           Alcoy]
 gi|148280319|gb|ABQ54407.1| conserved hypothetical protein; DUF167 [Legionella pneumophila str.
           Corby]
 gi|295650267|gb|ADG26114.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99
           Alcoy]
          Length = 70

 Score = 39.7 bits (91), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 15/51 (29%), Positives = 30/51 (58%)

Query: 58  VSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           +SD+ + + + A  ++GEAN  LL ++S    + + Q+ +  G  SR K++
Sbjct: 3   ISDDRLHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLI 53


>gi|110643102|ref|YP_670832.1| hypothetical protein ECP_2947 [Escherichia coli 536]
 gi|117625180|ref|YP_854168.1| hypothetical protein APECO1_3568 [Escherichia coli APEC O1]
 gi|161486124|ref|NP_755414.2| hypothetical protein c3539 [Escherichia coli CFT073]
 gi|161486442|ref|NP_838440.2| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T]
 gi|170018806|ref|YP_001723760.1| hypothetical protein EcolC_0761 [Escherichia coli ATCC 8739]
 gi|170681809|ref|YP_001745114.1| hypothetical protein EcSMS35_3095 [Escherichia coli SMS-3-5]
 gi|191167930|ref|ZP_03029733.1| conserved hypothetical protein [Escherichia coli B7A]
 gi|191171828|ref|ZP_03033374.1| conserved hypothetical protein [Escherichia coli F11]
 gi|193063515|ref|ZP_03044604.1| conserved hypothetical protein [Escherichia coli E22]
 gi|193067470|ref|ZP_03048438.1| conserved hypothetical protein [Escherichia coli E110019]
 gi|194426264|ref|ZP_03058819.1| conserved hypothetical protein [Escherichia coli B171]
 gi|194431742|ref|ZP_03064033.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|194436768|ref|ZP_03068868.1| conserved hypothetical protein [Escherichia coli 101-1]
 gi|209920412|ref|YP_002294496.1| hypothetical protein ECSE_3221 [Escherichia coli SE11]
 gi|218550200|ref|YP_002383991.1| hypothetical protein EFER_2892 [Escherichia fergusonii ATCC 35469]
 gi|218555512|ref|YP_002388425.1| hypothetical protein ECIAI1_3086 [Escherichia coli IAI1]
 gi|218559944|ref|YP_002392857.1| hypothetical protein ECS88_3235 [Escherichia coli S88]
 gi|218691077|ref|YP_002399289.1| hypothetical protein ECED1_3416 [Escherichia coli ED1a]
 gi|218696551|ref|YP_002404218.1| hypothetical protein EC55989_3246 [Escherichia coli 55989]
 gi|218701663|ref|YP_002409292.1| hypothetical protein ECIAI39_3371 [Escherichia coli IAI39]
 gi|218706468|ref|YP_002413987.1| hypothetical protein ECUMN_3305 [Escherichia coli UMN026]
 gi|222157643|ref|YP_002557782.1| hypothetical protein LF82_3194 [Escherichia coli LF82]
 gi|227888508|ref|ZP_04006313.1| protein of hypothetical function DUF167 [Escherichia coli 83972]
 gi|251786206|ref|YP_003000510.1| hypothetical protein B21_02746 [Escherichia coli BL21(DE3)]
 gi|254162863|ref|YP_003045971.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606]
 gi|254289623|ref|YP_003055371.1| hypothetical protein ECD_02783 [Escherichia coli BL21(DE3)]
 gi|260845623|ref|YP_003223401.1| hypothetical protein ECO103_3533 [Escherichia coli O103:H2 str.
           12009]
 gi|260857086|ref|YP_003230977.1| hypothetical protein ECO26_4052 [Escherichia coli O26:H11 str.
           11368]
 gi|260869640|ref|YP_003236042.1| hypothetical protein ECO111_3701 [Escherichia coli O111:H- str.
           11128]
 gi|293406460|ref|ZP_06650386.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412]
 gi|293416214|ref|ZP_06658854.1| hypothetical protein ECDG_03817 [Escherichia coli B185]
 gi|293449283|ref|ZP_06663704.1| hypothetical protein ECCG_02314 [Escherichia coli B088]
 gi|297520255|ref|ZP_06938641.1| hypothetical protein EcolOP_21662 [Escherichia coli OP50]
 gi|298382197|ref|ZP_06991794.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302]
 gi|300815577|ref|ZP_07095801.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           107-1]
 gi|300824812|ref|ZP_07104916.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           119-7]
 gi|300900230|ref|ZP_07118414.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           198-1]
 gi|300906483|ref|ZP_07124178.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1]
 gi|300921295|ref|ZP_07137664.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           115-1]
 gi|300928104|ref|ZP_07143649.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           187-1]
 gi|300940765|ref|ZP_07155311.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 21-1]
 gi|300980105|ref|ZP_07174848.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 45-1]
 gi|300995466|ref|ZP_07181114.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1]
 gi|301027296|ref|ZP_07190642.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 69-1]
 gi|301027722|ref|ZP_07191032.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           196-1]
 gi|301049252|ref|ZP_07196225.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           185-1]
 gi|301306562|ref|ZP_07212624.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1]
 gi|301328107|ref|ZP_07221248.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 78-1]
 gi|306812143|ref|ZP_07446341.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101]
 gi|309794042|ref|ZP_07688467.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           145-7]
 gi|312972805|ref|ZP_07786978.1| conserved hypothetical protein [Escherichia coli 1827-70]
 gi|331659088|ref|ZP_08360030.1| conserved hypothetical protein [Escherichia coli TA206]
 gi|331669699|ref|ZP_08370545.1| conserved hypothetical protein [Escherichia coli TA271]
 gi|386602989|ref|YP_006109289.1| hypothetical protein UM146_01750 [Escherichia coli UM146]
 gi|386625682|ref|YP_006145410.1| hypothetical protein CE10_3395 [Escherichia coli O7:K1 str. CE10]
 gi|386630703|ref|YP_006150423.1| hypothetical protein i02_3257 [Escherichia coli str. 'clone D i2']
 gi|386635623|ref|YP_006155342.1| hypothetical protein i14_3257 [Escherichia coli str. 'clone D i14']
 gi|386640443|ref|YP_006107241.1| hypothetical protein ECABU_c32400 [Escherichia coli ABU 83972]
 gi|387618223|ref|YP_006121245.1| hypothetical protein NRG857_14505 [Escherichia coli O83:H1 str. NRG
           857C]
 gi|404376248|ref|ZP_10981420.1| UPF0235 protein yggU [Escherichia sp. 1_1_43]
 gi|407470831|ref|YP_006782726.1| hypothetical protein O3O_20965 [Escherichia coli O104:H4 str.
           2009EL-2071]
 gi|407480508|ref|YP_006777657.1| hypothetical protein O3K_04685 [Escherichia coli O104:H4 str.
           2011C-3493]
 gi|410481074|ref|YP_006768620.1| hypothetical protein O3M_04730 [Escherichia coli O104:H4 str.
           2009EL-2050]
 gi|415787123|ref|ZP_11493856.1| hypothetical protein ECEPECA14_3462 [Escherichia coli EPECa14]
 gi|415796291|ref|ZP_11497531.1| hypothetical protein ECE128010_1205 [Escherichia coli E128010]
 gi|415818667|ref|ZP_11508389.1| hypothetical protein ECOK1180_1095 [Escherichia coli OK1180]
 gi|415830425|ref|ZP_11516327.1| hypothetical protein ECOK1357_3303 [Escherichia coli OK1357]
 gi|415839559|ref|ZP_11521301.1| hypothetical protein ECRN5871_3073 [Escherichia coli RN587/1]
 gi|415857950|ref|ZP_11532562.1| hypothetical protein SF2457T_3601 [Shigella flexneri 2a str. 2457T]
 gi|415862206|ref|ZP_11535738.1| putative cytoplasmic protein [Escherichia coli MS 85-1]
 gi|415874121|ref|ZP_11541218.1| putative cytoplasmic protein [Escherichia coli MS 79-10]
 gi|416282169|ref|ZP_11646317.1| hypothetical protein SGB_01866 [Shigella boydii ATCC 9905]
 gi|416336938|ref|ZP_11673408.1| hypothetical protein EcoM_02835 [Escherichia coli WV_060327]
 gi|416340380|ref|ZP_11675395.1| hypothetical protein ECoL_00280 [Escherichia coli EC4100B]
 gi|416899261|ref|ZP_11928743.1| hypothetical protein ECSTEC7V_3572 [Escherichia coli STEC_7v]
 gi|417086455|ref|ZP_11953655.1| hypothetical protein i01_04069 [Escherichia coli cloneA_i1]
 gi|417119177|ref|ZP_11969542.1| TIGR00251 family protein [Escherichia coli 1.2741]
 gi|417123267|ref|ZP_11972177.1| TIGR00251 family protein [Escherichia coli 97.0246]
 gi|417135080|ref|ZP_11979865.1| TIGR00251 family protein [Escherichia coli 5.0588]
 gi|417140486|ref|ZP_11983736.1| TIGR00251 family protein [Escherichia coli 97.0259]
 gi|417150988|ref|ZP_11990727.1| TIGR00251 family protein [Escherichia coli 1.2264]
 gi|417158247|ref|ZP_11995871.1| TIGR00251 family protein [Escherichia coli 96.0497]
 gi|417163011|ref|ZP_11998341.1| TIGR00251 family protein [Escherichia coli 99.0741]
 gi|417176487|ref|ZP_12006283.1| TIGR00251 family protein [Escherichia coli 3.2608]
 gi|417186026|ref|ZP_12011169.1| TIGR00251 family protein [Escherichia coli 93.0624]
 gi|417199897|ref|ZP_12017134.1| TIGR00251 family protein [Escherichia coli 4.0522]
 gi|417211552|ref|ZP_12021851.1| TIGR00251 family protein [Escherichia coli JB1-95]
 gi|417223271|ref|ZP_12026711.1| TIGR00251 family protein [Escherichia coli 96.154]
 gi|417228578|ref|ZP_12030336.1| TIGR00251 family protein [Escherichia coli 5.0959]
 gi|417237372|ref|ZP_12035339.1| TIGR00251 family protein [Escherichia coli 9.0111]
 gi|417251713|ref|ZP_12043478.1| TIGR00251 family protein [Escherichia coli 4.0967]
 gi|417268101|ref|ZP_12055462.1| TIGR00251 family protein [Escherichia coli 3.3884]
 gi|417282220|ref|ZP_12069520.1| TIGR00251 family protein [Escherichia coli 3003]
 gi|417285035|ref|ZP_12072326.1| TIGR00251 family protein [Escherichia coli TW07793]
 gi|417296471|ref|ZP_12083718.1| TIGR00251 family protein [Escherichia coli 900105 (10e)]
 gi|417309424|ref|ZP_12096262.1| hypothetical protein PPECC33_28340 [Escherichia coli PCN033]
 gi|417582460|ref|ZP_12233261.1| hypothetical protein ECSTECB2F1_3147 [Escherichia coli STEC_B2F1]
 gi|417587992|ref|ZP_12238757.1| hypothetical protein ECSTECC16502_3647 [Escherichia coli
           STEC_C165-02]
 gi|417593317|ref|ZP_12244010.1| hypothetical protein EC253486_3941 [Escherichia coli 2534-86]
 gi|417603650|ref|ZP_12254217.1| hypothetical protein ECSTEC94C_3472 [Escherichia coli STEC_94C]
 gi|417609576|ref|ZP_12260076.1| hypothetical protein ECSTECDG1313_3994 [Escherichia coli
           STEC_DG131-3]
 gi|417624975|ref|ZP_12275270.1| hypothetical protein ECSTECH18_3746 [Escherichia coli STEC_H.1.8]
 gi|417630301|ref|ZP_12280537.1| hypothetical protein ECSTECMHI813_3244 [Escherichia coli
           STEC_MHI813]
 gi|417640767|ref|ZP_12290905.1| hypothetical protein ECTX1999_3492 [Escherichia coli TX1999]
 gi|417668370|ref|ZP_12317912.1| hypothetical protein ECSTECO31_3202 [Escherichia coli STEC_O31]
 gi|417673788|ref|ZP_12323233.1| hypothetical protein SD15574_3393 [Shigella dysenteriae 155-74]
 gi|417691229|ref|ZP_12340446.1| hypothetical protein SB521682_3508 [Shigella boydii 5216-82]
 gi|417714026|ref|ZP_12362986.1| hypothetical protein SFK272_3776 [Shigella flexneri K-272]
 gi|417718997|ref|ZP_12367889.1| hypothetical protein SFK227_3749 [Shigella flexneri K-227]
 gi|417724553|ref|ZP_12373351.1| hypothetical protein SFK304_3758 [Shigella flexneri K-304]
 gi|417729844|ref|ZP_12378537.1| hypothetical protein SFK671_3527 [Shigella flexneri K-671]
 gi|417735187|ref|ZP_12383834.1| hypothetical protein SF274771_3539 [Shigella flexneri 2747-71]
 gi|417739812|ref|ZP_12388386.1| hypothetical protein SF434370_3174 [Shigella flexneri 4343-70]
 gi|417744792|ref|ZP_12393315.1| hypothetical protein SF293071_3451 [Shigella flexneri 2930-71]
 gi|417806496|ref|ZP_12453437.1| hypothetical protein HUSEC_16453 [Escherichia coli O104:H4 str.
           LB226692]
 gi|417829414|ref|ZP_12475959.1| hypothetical protein SFJ1713_3432 [Shigella flexneri J1713]
 gi|417834245|ref|ZP_12480691.1| hypothetical protein HUSEC41_16103 [Escherichia coli O104:H4 str.
           01-09591]
 gi|417867424|ref|ZP_12512461.1| hypothetical protein C22711_4351 [Escherichia coli O104:H4 str.
           C227-11]
 gi|418041167|ref|ZP_12679393.1| hypothetical protein ECW26_16220 [Escherichia coli W26]
 gi|418258197|ref|ZP_12881598.1| hypothetical protein SF660363_3459 [Shigella flexneri 6603-63]
 gi|418944841|ref|ZP_13497829.1| hypothetical protein T22_17555 [Escherichia coli O157:H43 str. T22]
 gi|419171761|ref|ZP_13715642.1| hypothetical protein ECDEC7A_3437 [Escherichia coli DEC7A]
 gi|419176724|ref|ZP_13720536.1| hypothetical protein ECDEC7B_3218 [Escherichia coli DEC7B]
 gi|419182316|ref|ZP_13725927.1| hypothetical protein ECDEC7C_3475 [Escherichia coli DEC7C]
 gi|419187943|ref|ZP_13731450.1| hypothetical protein ECDEC7D_3697 [Escherichia coli DEC7D]
 gi|419193063|ref|ZP_13736512.1| hypothetical protein ECDEC7E_3363 [Escherichia coli DEC7E]
 gi|419198605|ref|ZP_13741902.1| hypothetical protein ECDEC8A_3641 [Escherichia coli DEC8A]
 gi|419204925|ref|ZP_13748098.1| hypothetical protein ECDEC8B_3944 [Escherichia coli DEC8B]
 gi|419211378|ref|ZP_13754447.1| hypothetical protein ECDEC8C_4606 [Escherichia coli DEC8C]
 gi|419217257|ref|ZP_13760253.1| hypothetical protein ECDEC8D_4040 [Escherichia coli DEC8D]
 gi|419222999|ref|ZP_13765915.1| hypothetical protein ECDEC8E_3816 [Escherichia coli DEC8E]
 gi|419228412|ref|ZP_13771258.1| hypothetical protein ECDEC9A_3836 [Escherichia coli DEC9A]
 gi|419233739|ref|ZP_13776511.1| hypothetical protein ECDEC9B_3573 [Escherichia coli DEC9B]
 gi|419239398|ref|ZP_13782109.1| hypothetical protein ECDEC9C_3637 [Escherichia coli DEC9C]
 gi|419244915|ref|ZP_13787550.1| hypothetical protein ECDEC9D_3518 [Escherichia coli DEC9D]
 gi|419250731|ref|ZP_13793303.1| hypothetical protein ECDEC9E_3963 [Escherichia coli DEC9E]
 gi|419256530|ref|ZP_13799036.1| hypothetical protein ECDEC10A_4056 [Escherichia coli DEC10A]
 gi|419262829|ref|ZP_13805240.1| hypothetical protein ECDEC10B_4430 [Escherichia coli DEC10B]
 gi|419268595|ref|ZP_13810940.1| hypothetical protein ECDEC10C_4483 [Escherichia coli DEC10C]
 gi|419274278|ref|ZP_13816569.1| hypothetical protein ECDEC10D_4053 [Escherichia coli DEC10D]
 gi|419279483|ref|ZP_13821727.1| hypothetical protein ECDEC10E_3459 [Escherichia coli DEC10E]
 gi|419285670|ref|ZP_13827839.1| hypothetical protein ECDEC10F_4355 [Escherichia coli DEC10F]
 gi|419291021|ref|ZP_13833109.1| hypothetical protein ECDEC11A_3402 [Escherichia coli DEC11A]
 gi|419296303|ref|ZP_13838345.1| hypothetical protein ECDEC11B_3399 [Escherichia coli DEC11B]
 gi|419301759|ref|ZP_13843756.1| hypothetical protein ECDEC11C_3663 [Escherichia coli DEC11C]
 gi|419307900|ref|ZP_13849797.1| hypothetical protein ECDEC11D_3494 [Escherichia coli DEC11D]
 gi|419312904|ref|ZP_13854764.1| hypothetical protein ECDEC11E_3459 [Escherichia coli DEC11E]
 gi|419318296|ref|ZP_13860097.1| hypothetical protein ECDEC12A_3618 [Escherichia coli DEC12A]
 gi|419324587|ref|ZP_13866277.1| hypothetical protein ECDEC12B_4100 [Escherichia coli DEC12B]
 gi|419330567|ref|ZP_13872166.1| hypothetical protein ECDEC12C_3786 [Escherichia coli DEC12C]
 gi|419336071|ref|ZP_13877592.1| hypothetical protein ECDEC12D_3844 [Escherichia coli DEC12D]
 gi|419341432|ref|ZP_13882893.1| hypothetical protein ECDEC12E_3575 [Escherichia coli DEC12E]
 gi|419346640|ref|ZP_13888011.1| hypothetical protein ECDEC13A_3221 [Escherichia coli DEC13A]
 gi|419351104|ref|ZP_13892437.1| hypothetical protein ECDEC13B_3068 [Escherichia coli DEC13B]
 gi|419356506|ref|ZP_13897758.1| hypothetical protein ECDEC13C_3563 [Escherichia coli DEC13C]
 gi|419361577|ref|ZP_13902790.1| hypothetical protein ECDEC13D_3377 [Escherichia coli DEC13D]
 gi|419366670|ref|ZP_13907825.1| hypothetical protein ECDEC13E_3403 [Escherichia coli DEC13E]
 gi|419371445|ref|ZP_13912557.1| hypothetical protein ECDEC14A_3211 [Escherichia coli DEC14A]
 gi|419376947|ref|ZP_13917970.1| hypothetical protein ECDEC14B_3547 [Escherichia coli DEC14B]
 gi|419382255|ref|ZP_13923201.1| hypothetical protein ECDEC14C_3425 [Escherichia coli DEC14C]
 gi|419387593|ref|ZP_13928465.1| hypothetical protein ECDEC14D_3419 [Escherichia coli DEC14D]
 gi|419393082|ref|ZP_13933885.1| hypothetical protein ECDEC15A_3704 [Escherichia coli DEC15A]
 gi|419398187|ref|ZP_13938950.1| hypothetical protein ECDEC15B_3506 [Escherichia coli DEC15B]
 gi|419403471|ref|ZP_13944191.1| hypothetical protein ECDEC15C_3416 [Escherichia coli DEC15C]
 gi|419408628|ref|ZP_13949314.1| hypothetical protein ECDEC15D_3361 [Escherichia coli DEC15D]
 gi|419414170|ref|ZP_13954810.1| hypothetical protein ECDEC15E_3693 [Escherichia coli DEC15E]
 gi|419701761|ref|ZP_14229360.1| hypothetical protein OQA_14531 [Escherichia coli SCI-07]
 gi|419803493|ref|ZP_14328664.1| hypothetical protein ECAI27_02950 [Escherichia coli AI27]
 gi|419864661|ref|ZP_14387089.1| hypothetical protein ECO9340_03663 [Escherichia coli O103:H25 str.
           CVM9340]
 gi|419867820|ref|ZP_14390135.1| hypothetical protein ECO9450_27452 [Escherichia coli O103:H2 str.
           CVM9450]
 gi|419878966|ref|ZP_14400419.1| hypothetical protein ECO9534_07139 [Escherichia coli O111:H11 str.
           CVM9534]
 gi|419879914|ref|ZP_14401334.1| hypothetical protein ECO9545_08168 [Escherichia coli O111:H11 str.
           CVM9545]
 gi|419886472|ref|ZP_14407113.1| hypothetical protein ECO9570_29780 [Escherichia coli O111:H8 str.
           CVM9570]
 gi|419892721|ref|ZP_14412728.1| hypothetical protein ECO9574_26878 [Escherichia coli O111:H8 str.
           CVM9574]
 gi|419899172|ref|ZP_14418697.1| hypothetical protein ECO9942_08651 [Escherichia coli O26:H11 str.
           CVM9942]
 gi|419910232|ref|ZP_14428759.1| hypothetical protein ECO10026_04727 [Escherichia coli O26:H11 str.
           CVM10026]
 gi|419916151|ref|ZP_14434482.1| hypothetical protein ECKD1_23174 [Escherichia coli KD1]
 gi|419919889|ref|ZP_14438027.1| hypothetical protein ECKD2_17670 [Escherichia coli KD2]
 gi|419924062|ref|ZP_14441960.1| hypothetical protein EC54115_13548 [Escherichia coli 541-15]
 gi|419927378|ref|ZP_14445115.1| hypothetical protein EC5411_04199 [Escherichia coli 541-1]
 gi|419934762|ref|ZP_14451864.1| hypothetical protein EC5761_13497 [Escherichia coli 576-1]
 gi|419944453|ref|ZP_14460933.1| hypothetical protein ECHM605_10556 [Escherichia coli HM605]
 gi|419948209|ref|ZP_14464509.1| hypothetical protein ECMT8_02821 [Escherichia coli CUMT8]
 gi|420089599|ref|ZP_14601382.1| hypothetical protein ECO9602_03393 [Escherichia coli O111:H8 str.
           CVM9602]
 gi|420094455|ref|ZP_14606046.1| hypothetical protein ECO9634_21047 [Escherichia coli O111:H8 str.
           CVM9634]
 gi|420112075|ref|ZP_14621886.1| hypothetical protein ECO9553_06271 [Escherichia coli O111:H11 str.
           CVM9553]
 gi|420116872|ref|ZP_14626246.1| hypothetical protein ECO10021_10238 [Escherichia coli O26:H11 str.
           CVM10021]
 gi|420120609|ref|ZP_14629807.1| hypothetical protein ECO10030_05141 [Escherichia coli O26:H11 str.
           CVM10030]
 gi|420129325|ref|ZP_14637862.1| hypothetical protein ECO10224_08981 [Escherichia coli O26:H11 str.
           CVM10224]
 gi|420132349|ref|ZP_14640718.1| hypothetical protein ECO9952_09892 [Escherichia coli O26:H11 str.
           CVM9952]
 gi|420321901|ref|ZP_14823725.1| hypothetical protein SF285071_3539 [Shigella flexneri 2850-71]
 gi|420343300|ref|ZP_14844766.1| hypothetical protein SFK404_3907 [Shigella flexneri K-404]
 gi|420348954|ref|ZP_14850335.1| hypothetical protein SB96558_3905 [Shigella boydii 965-58]
 gi|420375165|ref|ZP_14875065.1| hypothetical protein SF123566_5097 [Shigella flexneri 1235-66]
 gi|420387093|ref|ZP_14886437.1| hypothetical protein ECEPECA12_3469 [Escherichia coli EPECa12]
 gi|420392993|ref|ZP_14892240.1| hypothetical protein ECEPECC34262_3841 [Escherichia coli EPEC
           C342-62]
 gi|422010498|ref|ZP_16357456.1| hypothetical protein ECO9455_06755 [Escherichia coli O111:H11 str.
           CVM9455]
 gi|422354817|ref|ZP_16435542.1| hypothetical protein HMPREF9542_04137 [Escherichia coli MS 117-3]
 gi|422356684|ref|ZP_16437357.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           110-3]
 gi|422363328|ref|ZP_16443865.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           153-1]
 gi|422372593|ref|ZP_16452950.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 16-3]
 gi|422376906|ref|ZP_16457152.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 60-1]
 gi|422383294|ref|ZP_16463446.1| hypothetical protein HMPREF9532_04861 [Escherichia coli MS 57-2]
 gi|422750026|ref|ZP_16803937.1| hypothetical protein ERKG_02252 [Escherichia coli H252]
 gi|422754268|ref|ZP_16808094.1| hypothetical protein ERLG_01390 [Escherichia coli H263]
 gi|422760412|ref|ZP_16814172.1| hypothetical protein ERBG_00336 [Escherichia coli E1167]
 gi|422771184|ref|ZP_16824874.1| hypothetical protein ERDG_01736 [Escherichia coli E482]
 gi|422775814|ref|ZP_16829469.1| yggU [Escherichia coli H120]
 gi|422780105|ref|ZP_16832890.1| hypothetical protein ERFG_00343 [Escherichia coli TW10509]
 gi|422787540|ref|ZP_16840278.1| hypothetical protein ERGG_02689 [Escherichia coli H489]
 gi|422791757|ref|ZP_16844459.1| hypothetical protein ERHG_02238 [Escherichia coli TA007]
 gi|422800893|ref|ZP_16849390.1| hypothetical protein ERJG_02059 [Escherichia coli M863]
 gi|422804224|ref|ZP_16852656.1| hypothetical protein ERIG_00360 [Escherichia fergusonii B253]
 gi|422828325|ref|ZP_16876497.1| hypothetical protein ESNG_01002 [Escherichia coli B093]
 gi|422836501|ref|ZP_16884545.1| hypothetical protein ESOG_04146 [Escherichia coli E101]
 gi|422840946|ref|ZP_16888916.1| hypothetical protein ESPG_03602 [Escherichia coli H397]
 gi|422959694|ref|ZP_16971329.1| UPF0235 protein yggU [Escherichia coli H494]
 gi|422969906|ref|ZP_16973699.1| UPF0235 protein yggU [Escherichia coli TA124]
 gi|422989067|ref|ZP_16979840.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. C227-11]
 gi|422995959|ref|ZP_16986723.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. C236-11]
 gi|423001105|ref|ZP_16991859.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 09-7901]
 gi|423004773|ref|ZP_16995519.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 04-8351]
 gi|423011276|ref|ZP_17002010.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-3677]
 gi|423020504|ref|ZP_17011213.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4404]
 gi|423025670|ref|ZP_17016367.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4522]
 gi|423031491|ref|ZP_17022178.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4623]
 gi|423039316|ref|ZP_17029990.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C1]
 gi|423044436|ref|ZP_17035103.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C2]
 gi|423046165|ref|ZP_17036825.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C3]
 gi|423054703|ref|ZP_17043510.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C4]
 gi|423061678|ref|ZP_17050474.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C5]
 gi|423703682|ref|ZP_17678107.1| UPF0235 protein yggU [Escherichia coli H730]
 gi|423707115|ref|ZP_17681498.1| UPF0235 protein yggU [Escherichia coli B799]
 gi|424748330|ref|ZP_18176477.1| hypothetical protein CFSAN001629_07416 [Escherichia coli O26:H11
           str. CFSAN001629]
 gi|424758270|ref|ZP_18185986.1| hypothetical protein CFSAN001630_08138 [Escherichia coli O111:H11
           str. CFSAN001630]
 gi|424773922|ref|ZP_18200973.1| hypothetical protein CFSAN001632_23250 [Escherichia coli O111:H8
           str. CFSAN001632]
 gi|424817490|ref|ZP_18242641.1| hypothetical protein ECD227_2607 [Escherichia fergusonii ECD227]
 gi|425279306|ref|ZP_18670539.1| hypothetical protein ECARS42123_3410 [Escherichia coli ARS4.2123]
 gi|425290088|ref|ZP_18680919.1| hypothetical protein EC3006_3557 [Escherichia coli 3006]
 gi|425301787|ref|ZP_18691672.1| hypothetical protein EC07798_3612 [Escherichia coli 07798]
 gi|425306717|ref|ZP_18696404.1| hypothetical protein ECN1_3116 [Escherichia coli N1]
 gi|425381179|ref|ZP_18765187.1| hypothetical protein ECEC1865_4185 [Escherichia coli EC1865]
 gi|425423803|ref|ZP_18804966.1| hypothetical protein EC01288_3165 [Escherichia coli 0.1288]
 gi|429720535|ref|ZP_19255460.1| hypothetical protein MO3_03245 [Escherichia coli O104:H4 str.
           Ec11-9450]
 gi|429772433|ref|ZP_19304453.1| hypothetical protein C212_02216 [Escherichia coli O104:H4 str.
           11-02030]
 gi|429777380|ref|ZP_19309354.1| hypothetical protein C213_02214 [Escherichia coli O104:H4 str.
           11-02033-1]
 gi|429786105|ref|ZP_19318000.1| hypothetical protein C214_02212 [Escherichia coli O104:H4 str.
           11-02092]
 gi|429791995|ref|ZP_19323849.1| hypothetical protein C215_02213 [Escherichia coli O104:H4 str.
           11-02093]
 gi|429792844|ref|ZP_19324692.1| hypothetical protein C216_02215 [Escherichia coli O104:H4 str.
           11-02281]
 gi|429799419|ref|ZP_19331217.1| hypothetical protein C217_02212 [Escherichia coli O104:H4 str.
           11-02318]
 gi|429803036|ref|ZP_19334796.1| hypothetical protein C218_02212 [Escherichia coli O104:H4 str.
           11-02913]
 gi|429812832|ref|ZP_19344515.1| hypothetical protein C219_02212 [Escherichia coli O104:H4 str.
           11-03439]
 gi|429813380|ref|ZP_19345059.1| hypothetical protein C220_02213 [Escherichia coli O104:H4 str.
           11-04080]
 gi|429818588|ref|ZP_19350222.1| hypothetical protein C221_02212 [Escherichia coli O104:H4 str.
           11-03943]
 gi|429904939|ref|ZP_19370918.1| hypothetical protein MO5_01864 [Escherichia coli O104:H4 str.
           Ec11-9990]
 gi|429909075|ref|ZP_19375039.1| hypothetical protein MO7_01844 [Escherichia coli O104:H4 str.
           Ec11-9941]
 gi|429914949|ref|ZP_19380896.1| hypothetical protein O7C_01867 [Escherichia coli O104:H4 str.
           Ec11-4984]
 gi|429919979|ref|ZP_19385910.1| hypothetical protein O7E_01869 [Escherichia coli O104:H4 str.
           Ec11-5604]
 gi|429925799|ref|ZP_19391712.1| hypothetical protein O7G_02688 [Escherichia coli O104:H4 str.
           Ec11-4986]
 gi|429929735|ref|ZP_19395637.1| hypothetical protein O7I_01560 [Escherichia coli O104:H4 str.
           Ec11-4987]
 gi|429936274|ref|ZP_19402160.1| hypothetical protein O7K_03111 [Escherichia coli O104:H4 str.
           Ec11-4988]
 gi|429941954|ref|ZP_19407828.1| hypothetical protein O7M_03687 [Escherichia coli O104:H4 str.
           Ec11-5603]
 gi|429944635|ref|ZP_19410497.1| hypothetical protein O7O_01182 [Escherichia coli O104:H4 str.
           Ec11-6006]
 gi|429952193|ref|ZP_19418039.1| hypothetical protein S7Y_03643 [Escherichia coli O104:H4 str.
           Ec12-0465]
 gi|429955542|ref|ZP_19421374.1| hypothetical protein S91_01945 [Escherichia coli O104:H4 str.
           Ec12-0466]
 gi|432354880|ref|ZP_19598149.1| hypothetical protein WCA_03872 [Escherichia coli KTE2]
 gi|432359276|ref|ZP_19602492.1| hypothetical protein WCC_03240 [Escherichia coli KTE4]
 gi|432364123|ref|ZP_19607280.1| hypothetical protein WCE_03155 [Escherichia coli KTE5]
 gi|432366419|ref|ZP_19609537.1| hypothetical protein WCM_00343 [Escherichia coli KTE10]
 gi|432378137|ref|ZP_19621122.1| hypothetical protein WCQ_03025 [Escherichia coli KTE12]
 gi|432382656|ref|ZP_19625595.1| hypothetical protein WCU_02817 [Escherichia coli KTE15]
 gi|432388589|ref|ZP_19631470.1| hypothetical protein WCY_03854 [Escherichia coli KTE16]
 gi|432393426|ref|ZP_19636254.1| hypothetical protein WE9_03750 [Escherichia coli KTE21]
 gi|432403232|ref|ZP_19645980.1| hypothetical protein WEK_03437 [Escherichia coli KTE26]
 gi|432413078|ref|ZP_19655737.1| hypothetical protein WG9_03575 [Escherichia coli KTE39]
 gi|432418418|ref|ZP_19661014.1| hypothetical protein WGI_03935 [Escherichia coli KTE44]
 gi|432427508|ref|ZP_19669997.1| hypothetical protein A139_02907 [Escherichia coli KTE181]
 gi|432433150|ref|ZP_19675575.1| hypothetical protein A13K_03451 [Escherichia coli KTE187]
 gi|432437633|ref|ZP_19680020.1| hypothetical protein A13M_03360 [Escherichia coli KTE188]
 gi|432442385|ref|ZP_19684722.1| hypothetical protein A13O_03224 [Escherichia coli KTE189]
 gi|432447499|ref|ZP_19689797.1| hypothetical protein A13S_03557 [Escherichia coli KTE191]
 gi|432451128|ref|ZP_19693386.1| hypothetical protein A13W_02087 [Escherichia coli KTE193]
 gi|432457976|ref|ZP_19700155.1| hypothetical protein A15C_03781 [Escherichia coli KTE201]
 gi|432461965|ref|ZP_19704106.1| hypothetical protein A15I_02839 [Escherichia coli KTE204]
 gi|432467112|ref|ZP_19709197.1| hypothetical protein A15K_03072 [Escherichia coli KTE205]
 gi|432472260|ref|ZP_19714300.1| hypothetical protein A15M_03154 [Escherichia coli KTE206]
 gi|432482282|ref|ZP_19724233.1| hypothetical protein A15U_03414 [Escherichia coli KTE210]
 gi|432486713|ref|ZP_19728623.1| hypothetical protein A15Y_03210 [Escherichia coli KTE212]
 gi|432496971|ref|ZP_19738766.1| hypothetical protein A173_04148 [Escherichia coli KTE214]
 gi|432501402|ref|ZP_19743155.1| hypothetical protein A177_03511 [Escherichia coli KTE216]
 gi|432505716|ref|ZP_19747437.1| hypothetical protein A17E_02788 [Escherichia coli KTE220]
 gi|432515219|ref|ZP_19752440.1| hypothetical protein A17M_03091 [Escherichia coli KTE224]
 gi|432525107|ref|ZP_19762231.1| hypothetical protein A17Y_03237 [Escherichia coli KTE230]
 gi|432527741|ref|ZP_19764825.1| hypothetical protein A191_00978 [Escherichia coli KTE233]
 gi|432535321|ref|ZP_19772288.1| hypothetical protein A193_03767 [Escherichia coli KTE234]
 gi|432539231|ref|ZP_19776127.1| hypothetical protein A195_02861 [Escherichia coli KTE235]
 gi|432544596|ref|ZP_19781436.1| hypothetical protein A197_03190 [Escherichia coli KTE236]
 gi|432550086|ref|ZP_19786850.1| hypothetical protein A199_03565 [Escherichia coli KTE237]
 gi|432554994|ref|ZP_19791713.1| hypothetical protein A1S3_03409 [Escherichia coli KTE47]
 gi|432565223|ref|ZP_19801796.1| hypothetical protein A1SA_03870 [Escherichia coli KTE51]
 gi|432569996|ref|ZP_19806504.1| hypothetical protein A1SE_03592 [Escherichia coli KTE53]
 gi|432575131|ref|ZP_19811605.1| hypothetical protein A1SI_03838 [Escherichia coli KTE55]
 gi|432577150|ref|ZP_19813603.1| hypothetical protein A1SK_00886 [Escherichia coli KTE56]
 gi|432581958|ref|ZP_19818372.1| hypothetical protein A1SM_01163 [Escherichia coli KTE57]
 gi|432589261|ref|ZP_19825614.1| hypothetical protein A1SO_03631 [Escherichia coli KTE58]
 gi|432594129|ref|ZP_19830442.1| hypothetical protein A1SS_03562 [Escherichia coli KTE60]
 gi|432599126|ref|ZP_19835397.1| hypothetical protein A1SW_03867 [Escherichia coli KTE62]
 gi|432603607|ref|ZP_19839849.1| hypothetical protein A1U5_03467 [Escherichia coli KTE66]
 gi|432608795|ref|ZP_19844978.1| hypothetical protein A1U7_03812 [Escherichia coli KTE67]
 gi|432612937|ref|ZP_19849095.1| hypothetical protein A1UG_03315 [Escherichia coli KTE72]
 gi|432618142|ref|ZP_19854250.1| hypothetical protein A1UM_03592 [Escherichia coli KTE75]
 gi|432623175|ref|ZP_19859197.1| hypothetical protein A1UO_03059 [Escherichia coli KTE76]
 gi|432632732|ref|ZP_19868653.1| hypothetical protein A1UW_03118 [Escherichia coli KTE80]
 gi|432642443|ref|ZP_19878271.1| hypothetical protein A1W1_03320 [Escherichia coli KTE83]
 gi|432647489|ref|ZP_19883275.1| hypothetical protein A1W5_03258 [Escherichia coli KTE86]
 gi|432652532|ref|ZP_19888279.1| hypothetical protein A1W7_03554 [Escherichia coli KTE87]
 gi|432657080|ref|ZP_19892780.1| hypothetical protein A1WE_03207 [Escherichia coli KTE93]
 gi|432667433|ref|ZP_19903009.1| hypothetical protein A1Y3_04049 [Escherichia coli KTE116]
 gi|432672037|ref|ZP_19907562.1| hypothetical protein A1Y7_03594 [Escherichia coli KTE119]
 gi|432681572|ref|ZP_19916936.1| hypothetical protein A1YW_03325 [Escherichia coli KTE143]
 gi|432688164|ref|ZP_19923440.1| hypothetical protein A31G_00367 [Escherichia coli KTE161]
 gi|432695734|ref|ZP_19930928.1| hypothetical protein A31I_03217 [Escherichia coli KTE162]
 gi|432700348|ref|ZP_19935498.1| hypothetical protein A31M_03110 [Escherichia coli KTE169]
 gi|432707197|ref|ZP_19942275.1| hypothetical protein WCG_00467 [Escherichia coli KTE6]
 gi|432714672|ref|ZP_19949702.1| hypothetical protein WCI_03050 [Escherichia coli KTE8]
 gi|432720069|ref|ZP_19955034.1| hypothetical protein WCK_03704 [Escherichia coli KTE9]
 gi|432733691|ref|ZP_19968516.1| hypothetical protein WGK_03551 [Escherichia coli KTE45]
 gi|432746913|ref|ZP_19981575.1| hypothetical protein WGG_03035 [Escherichia coli KTE43]
 gi|432751423|ref|ZP_19986006.1| hypothetical protein WEQ_02841 [Escherichia coli KTE29]
 gi|432755811|ref|ZP_19990357.1| hypothetical protein WEA_02807 [Escherichia coli KTE22]
 gi|432760777|ref|ZP_19995267.1| hypothetical protein A1S1_02919 [Escherichia coli KTE46]
 gi|432766315|ref|ZP_20000732.1| hypothetical protein A1S5_03878 [Escherichia coli KTE48]
 gi|432771887|ref|ZP_20006206.1| hypothetical protein A1S9_04684 [Escherichia coli KTE50]
 gi|432779891|ref|ZP_20014112.1| hypothetical protein A1SQ_03553 [Escherichia coli KTE59]
 gi|432784826|ref|ZP_20019004.1| hypothetical protein A1SY_03688 [Escherichia coli KTE63]
 gi|432788883|ref|ZP_20023011.1| hypothetical protein A1U3_03013 [Escherichia coli KTE65]
 gi|432794115|ref|ZP_20028197.1| hypothetical protein A1US_03348 [Escherichia coli KTE78]
 gi|432795616|ref|ZP_20029676.1| hypothetical protein A1UU_00339 [Escherichia coli KTE79]
 gi|432803119|ref|ZP_20037074.1| hypothetical protein A1W3_03371 [Escherichia coli KTE84]
 gi|432807136|ref|ZP_20041051.1| hypothetical protein A1WA_03040 [Escherichia coli KTE91]
 gi|432810656|ref|ZP_20044534.1| hypothetical protein A1WM_01819 [Escherichia coli KTE101]
 gi|432816649|ref|ZP_20050410.1| hypothetical protein A1Y1_03049 [Escherichia coli KTE115]
 gi|432822320|ref|ZP_20056009.1| hypothetical protein A1Y5_03936 [Escherichia coli KTE118]
 gi|432823829|ref|ZP_20057499.1| hypothetical protein A1YA_00496 [Escherichia coli KTE123]
 gi|432828585|ref|ZP_20062203.1| hypothetical protein A1YM_00352 [Escherichia coli KTE135]
 gi|432835886|ref|ZP_20069420.1| hypothetical protein A1YO_03257 [Escherichia coli KTE136]
 gi|432845980|ref|ZP_20078661.1| hypothetical protein A1YS_03425 [Escherichia coli KTE141]
 gi|432864182|ref|ZP_20087909.1| hypothetical protein A311_03663 [Escherichia coli KTE146]
 gi|432870396|ref|ZP_20090853.1| hypothetical protein A313_01686 [Escherichia coli KTE147]
 gi|432888207|ref|ZP_20101959.1| hypothetical protein A31C_03698 [Escherichia coli KTE158]
 gi|432900161|ref|ZP_20110583.1| hypothetical protein A13U_03364 [Escherichia coli KTE192]
 gi|432906314|ref|ZP_20115042.1| hypothetical protein A13Y_03431 [Escherichia coli KTE194]
 gi|432921087|ref|ZP_20124551.1| hypothetical protein A133_03490 [Escherichia coli KTE173]
 gi|432928646|ref|ZP_20129766.1| hypothetical protein A135_03834 [Escherichia coli KTE175]
 gi|432935929|ref|ZP_20135197.1| hypothetical protein A13E_04372 [Escherichia coli KTE184]
 gi|432939439|ref|ZP_20137542.1| hypothetical protein A13C_01983 [Escherichia coli KTE183]
 gi|432949016|ref|ZP_20143939.1| hypothetical protein A153_03719 [Escherichia coli KTE196]
 gi|432963307|ref|ZP_20152726.1| hypothetical protein A15E_03664 [Escherichia coli KTE202]
 gi|432969017|ref|ZP_20157929.1| hypothetical protein A15G_04137 [Escherichia coli KTE203]
 gi|432973094|ref|ZP_20161955.1| hypothetical protein A15O_03678 [Escherichia coli KTE207]
 gi|432975060|ref|ZP_20163895.1| hypothetical protein A15S_00922 [Escherichia coli KTE209]
 gi|432982293|ref|ZP_20171066.1| hypothetical protein A15W_03437 [Escherichia coli KTE211]
 gi|432986678|ref|ZP_20175395.1| hypothetical protein A175_03146 [Escherichia coli KTE215]
 gi|432996619|ref|ZP_20185202.1| hypothetical protein A17A_03696 [Escherichia coli KTE218]
 gi|433001193|ref|ZP_20189714.1| hypothetical protein A17K_03541 [Escherichia coli KTE223]
 gi|433006410|ref|ZP_20194835.1| hypothetical protein A17S_03995 [Escherichia coli KTE227]
 gi|433009078|ref|ZP_20197491.1| hypothetical protein A17W_01797 [Escherichia coli KTE229]
 gi|433015196|ref|ZP_20203534.1| hypothetical protein WI5_03023 [Escherichia coli KTE104]
 gi|433024783|ref|ZP_20212761.1| hypothetical protein WI9_02949 [Escherichia coli KTE106]
 gi|433029848|ref|ZP_20217700.1| hypothetical protein WIA_02955 [Escherichia coli KTE109]
 gi|433034811|ref|ZP_20222512.1| hypothetical protein WIC_03378 [Escherichia coli KTE112]
 gi|433039920|ref|ZP_20227516.1| hypothetical protein WIE_03280 [Escherichia coli KTE113]
 gi|433044494|ref|ZP_20231981.1| hypothetical protein WIG_03032 [Escherichia coli KTE117]
 gi|433049363|ref|ZP_20236703.1| hypothetical protein WII_03300 [Escherichia coli KTE120]
 gi|433054611|ref|ZP_20241779.1| hypothetical protein WIK_03417 [Escherichia coli KTE122]
 gi|433059398|ref|ZP_20246438.1| hypothetical protein WIM_03174 [Escherichia coli KTE124]
 gi|433064374|ref|ZP_20251287.1| hypothetical protein WIO_03200 [Escherichia coli KTE125]
 gi|433069259|ref|ZP_20256037.1| hypothetical protein WIQ_03143 [Escherichia coli KTE128]
 gi|433074155|ref|ZP_20260800.1| hypothetical protein WIS_03117 [Escherichia coli KTE129]
 gi|433079107|ref|ZP_20265629.1| hypothetical protein WIU_02974 [Escherichia coli KTE131]
 gi|433083848|ref|ZP_20270300.1| hypothetical protein WIW_03001 [Escherichia coli KTE133]
 gi|433088593|ref|ZP_20274960.1| hypothetical protein WIY_03054 [Escherichia coli KTE137]
 gi|433093337|ref|ZP_20279595.1| hypothetical protein WK1_02981 [Escherichia coli KTE138]
 gi|433097719|ref|ZP_20283897.1| hypothetical protein WK3_02926 [Escherichia coli KTE139]
 gi|433102503|ref|ZP_20288579.1| hypothetical protein WK5_03060 [Escherichia coli KTE145]
 gi|433107175|ref|ZP_20293142.1| hypothetical protein WK7_03043 [Escherichia coli KTE148]
 gi|433116801|ref|ZP_20302588.1| hypothetical protein WKA_02996 [Escherichia coli KTE153]
 gi|433121492|ref|ZP_20307156.1| hypothetical protein WKC_02924 [Escherichia coli KTE157]
 gi|433126474|ref|ZP_20312026.1| hypothetical protein WKE_02973 [Escherichia coli KTE160]
 gi|433131490|ref|ZP_20316921.1| hypothetical protein WKG_03235 [Escherichia coli KTE163]
 gi|433136153|ref|ZP_20321490.1| hypothetical protein WKI_03098 [Escherichia coli KTE166]
 gi|433140542|ref|ZP_20325792.1| hypothetical protein WKM_02826 [Escherichia coli KTE167]
 gi|433145520|ref|ZP_20330657.1| hypothetical protein WKO_03065 [Escherichia coli KTE168]
 gi|433150461|ref|ZP_20335475.1| hypothetical protein WKQ_03118 [Escherichia coli KTE174]
 gi|433155029|ref|ZP_20339964.1| hypothetical protein WKS_02963 [Escherichia coli KTE176]
 gi|433164914|ref|ZP_20349646.1| hypothetical protein WKW_03131 [Escherichia coli KTE179]
 gi|433169899|ref|ZP_20354522.1| hypothetical protein WKY_03151 [Escherichia coli KTE180]
 gi|433174835|ref|ZP_20359350.1| hypothetical protein WGQ_03104 [Escherichia coli KTE232]
 gi|433179803|ref|ZP_20364191.1| hypothetical protein WGM_03447 [Escherichia coli KTE82]
 gi|433184628|ref|ZP_20368868.1| hypothetical protein WGO_03068 [Escherichia coli KTE85]
 gi|433189702|ref|ZP_20373794.1| hypothetical protein WGS_02787 [Escherichia coli KTE88]
 gi|433195003|ref|ZP_20378984.1| hypothetical protein WGU_03325 [Escherichia coli KTE90]
 gi|433199652|ref|ZP_20383543.1| hypothetical protein WGW_03202 [Escherichia coli KTE94]
 gi|433209035|ref|ZP_20392706.1| hypothetical protein WI1_02816 [Escherichia coli KTE97]
 gi|433213819|ref|ZP_20397407.1| hypothetical protein WI3_03009 [Escherichia coli KTE99]
 gi|433322137|ref|ZP_20399641.1| hypothetical protein B185_002349 [Escherichia coli J96]
 gi|442593122|ref|ZP_21011077.1| UPF0235 protein VC0458 [Escherichia coli O10:K5(L):H4 str. ATCC
           23506]
 gi|442597763|ref|ZP_21015542.1| UPF0235 protein VC0458 [Escherichia coli O5:K4(L):H4 str. ATCC
           23502]
 gi|442605088|ref|ZP_21019926.1| UPF0235 protein VC0458 [Escherichia coli Nissle 1917]
 gi|443619007|ref|YP_007382863.1| hypothetical protein APECO78_18500 [Escherichia coli APEC O78]
 gi|29839713|sp|Q8FE28.2|YGGU_ECOL6 RecName: Full=UPF0235 protein YggU
 gi|47117526|sp|Q83JS1.2|YGGU_SHIFL RecName: Full=UPF0235 protein YggU
 gi|123343643|sp|Q0TDP8.1|YGGU_ECOL5 RecName: Full=UPF0235 protein YggU
 gi|166227348|sp|A1AFD9.1|YGGU_ECOK1 RecName: Full=UPF0235 protein YggU
 gi|189030102|sp|B1IT54.1|YGGU_ECOLC RecName: Full=UPF0235 protein YggU
 gi|226730815|sp|B7MME1.1|YGGU_ECO45 RecName: Full=UPF0235 protein YggU
 gi|226730817|sp|B7NI16.1|YGGU_ECO7I RecName: Full=UPF0235 protein YggU
 gi|226730818|sp|B7LYY3.1|YGGU_ECO8A RecName: Full=UPF0235 protein YggU
 gi|226730820|sp|B7N7K6.1|YGGU_ECOLU RecName: Full=UPF0235 protein YggU
 gi|226730821|sp|B6I789.1|YGGU_ECOSE RecName: Full=UPF0235 protein YggU
 gi|226730822|sp|B1LDG2.1|YGGU_ECOSM RecName: Full=UPF0235 protein YggU
 gi|226730823|sp|B7LPS0.1|YGGU_ESCF3 RecName: Full=UPF0235 protein YggU
 gi|254814155|sp|B7LFL6.1|YGGU_ECO55 RecName: Full=UPF0235 protein YggU
 gi|254814156|sp|B7MZQ2.1|YGGU_ECO81 RecName: Full=UPF0235 protein YggU
 gi|110344694|gb|ABG70931.1| hypothetical protein YggU [Escherichia coli 536]
 gi|115514304|gb|ABJ02379.1| conserved hypothetical protein [Escherichia coli APEC O1]
 gi|169753734|gb|ACA76433.1| protein of unknown function DUF167 [Escherichia coli ATCC 8739]
 gi|170519527|gb|ACB17705.1| conserved hypothetical protein [Escherichia coli SMS-3-5]
 gi|190902015|gb|EDV61761.1| conserved hypothetical protein [Escherichia coli B7A]
 gi|190907863|gb|EDV67456.1| conserved hypothetical protein [Escherichia coli F11]
 gi|192930792|gb|EDV83397.1| conserved hypothetical protein [Escherichia coli E22]
 gi|192959427|gb|EDV89862.1| conserved hypothetical protein [Escherichia coli E110019]
 gi|194415572|gb|EDX31839.1| conserved hypothetical protein [Escherichia coli B171]
 gi|194420098|gb|EDX36176.1| conserved hypothetical protein [Shigella dysenteriae 1012]
 gi|194424250|gb|EDX40237.1| conserved hypothetical protein [Escherichia coli 101-1]
 gi|209913671|dbj|BAG78745.1| conserved hypothetical protein [Escherichia coli SE11]
 gi|218353283|emb|CAU99245.1| conserved hypothetical protein [Escherichia coli 55989]
 gi|218357741|emb|CAQ90385.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469]
 gi|218362280|emb|CAQ99901.1| conserved hypothetical protein [Escherichia coli IAI1]
 gi|218366713|emb|CAR04470.1| conserved hypothetical protein [Escherichia coli S88]
 gi|218371649|emb|CAR19488.1| conserved hypothetical protein [Escherichia coli IAI39]
 gi|218428641|emb|CAR09570.2| conserved hypothetical protein [Escherichia coli ED1a]
 gi|218433565|emb|CAR14468.1| conserved hypothetical protein [Escherichia coli UMN026]
 gi|222034648|emb|CAP77390.1| UPF0235 protein yggU [Escherichia coli LF82]
 gi|227834777|gb|EEJ45243.1| protein of hypothetical function DUF167 [Escherichia coli 83972]
 gi|242378479|emb|CAQ33263.1| conserved protein [Escherichia coli BL21(DE3)]
 gi|253974764|gb|ACT40435.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606]
 gi|253978930|gb|ACT44600.1| hypothetical protein ECD_02783 [Escherichia coli BL21(DE3)]
 gi|257755735|dbj|BAI27237.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368]
 gi|257760770|dbj|BAI32267.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009]
 gi|257765996|dbj|BAI37491.1| conserved predicted protein [Escherichia coli O111:H- str. 11128]
 gi|291322373|gb|EFE61802.1| hypothetical protein ECCG_02314 [Escherichia coli B088]
 gi|291426466|gb|EFE99498.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412]
 gi|291432403|gb|EFF05385.1| hypothetical protein ECDG_03817 [Escherichia coli B185]
 gi|298277337|gb|EFI18853.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302]
 gi|299879158|gb|EFI87369.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           196-1]
 gi|300298948|gb|EFJ55333.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           185-1]
 gi|300304828|gb|EFJ59348.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1]
 gi|300356246|gb|EFJ72116.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           198-1]
 gi|300395102|gb|EFJ78640.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 69-1]
 gi|300401726|gb|EFJ85264.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1]
 gi|300409362|gb|EFJ92900.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 45-1]
 gi|300411757|gb|EFJ95067.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           115-1]
 gi|300454465|gb|EFK17958.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 21-1]
 gi|300463870|gb|EFK27363.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           187-1]
 gi|300522719|gb|EFK43788.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           119-7]
 gi|300531506|gb|EFK52568.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           107-1]
 gi|300838180|gb|EFK65940.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1]
 gi|300845411|gb|EFK73171.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 78-1]
 gi|305854181|gb|EFM54619.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101]
 gi|307554935|gb|ADN47710.1| conserved hypothetical protein [Escherichia coli ABU 83972]
 gi|307625473|gb|ADN69777.1| hypothetical protein UM146_01750 [Escherichia coli UM146]
 gi|308122449|gb|EFO59711.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           145-7]
 gi|310332747|gb|EFP99960.1| conserved hypothetical protein [Escherichia coli 1827-70]
 gi|312947484|gb|ADR28311.1| hypothetical protein NRG857_14505 [Escherichia coli O83:H1 str. NRG
           857C]
 gi|313648003|gb|EFS12449.1| hypothetical protein SF2457T_3601 [Shigella flexneri 2a str. 2457T]
 gi|315256845|gb|EFU36813.1| putative cytoplasmic protein [Escherichia coli MS 85-1]
 gi|315289499|gb|EFU48894.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           110-3]
 gi|315293933|gb|EFU53285.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS
           153-1]
 gi|315295642|gb|EFU54965.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 16-3]
 gi|320181042|gb|EFW55963.1| hypothetical protein SGB_01866 [Shigella boydii ATCC 9905]
 gi|320195072|gb|EFW69701.1| hypothetical protein EcoM_02835 [Escherichia coli WV_060327]
 gi|320202617|gb|EFW77187.1| hypothetical protein ECoL_00280 [Escherichia coli EC4100B]
 gi|323154662|gb|EFZ40861.1| hypothetical protein ECEPECA14_3462 [Escherichia coli EPECa14]
 gi|323162601|gb|EFZ48448.1| hypothetical protein ECE128010_1205 [Escherichia coli E128010]
 gi|323180413|gb|EFZ65965.1| hypothetical protein ECOK1180_1095 [Escherichia coli OK1180]
 gi|323183524|gb|EFZ68921.1| hypothetical protein ECOK1357_3303 [Escherichia coli OK1357]
 gi|323188653|gb|EFZ73938.1| hypothetical protein ECRN5871_3073 [Escherichia coli RN587/1]
 gi|323941961|gb|EGB38140.1| hypothetical protein ERDG_01736 [Escherichia coli E482]
 gi|323946549|gb|EGB42572.1| yggU [Escherichia coli H120]
 gi|323951609|gb|EGB47484.1| hypothetical protein ERKG_02252 [Escherichia coli H252]
 gi|323957323|gb|EGB53045.1| hypothetical protein ERLG_01390 [Escherichia coli H263]
 gi|323960754|gb|EGB56375.1| hypothetical protein ERGG_02689 [Escherichia coli H489]
 gi|323966470|gb|EGB61903.1| hypothetical protein ERJG_02059 [Escherichia coli M863]
 gi|323971760|gb|EGB66987.1| hypothetical protein ERHG_02238 [Escherichia coli TA007]
 gi|323978752|gb|EGB73833.1| hypothetical protein ERFG_00343 [Escherichia coli TW10509]
 gi|324005509|gb|EGB74728.1| hypothetical protein HMPREF9532_04861 [Escherichia coli MS 57-2]
 gi|324011796|gb|EGB81015.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 60-1]
 gi|324017226|gb|EGB86445.1| hypothetical protein HMPREF9542_04137 [Escherichia coli MS 117-3]
 gi|324115032|gb|EGC08997.1| hypothetical protein ERIG_00360 [Escherichia fergusonii B253]
 gi|324119748|gb|EGC13628.1| hypothetical protein ERBG_00336 [Escherichia coli E1167]
 gi|325498510|gb|EGC96369.1| hypothetical protein ECD227_2607 [Escherichia fergusonii ECD227]
 gi|327251721|gb|EGE63407.1| hypothetical protein ECSTEC7V_3572 [Escherichia coli STEC_7v]
 gi|331053670|gb|EGI25699.1| conserved hypothetical protein [Escherichia coli TA206]
 gi|331063367|gb|EGI35280.1| conserved hypothetical protein [Escherichia coli TA271]
 gi|332086882|gb|EGI92018.1| hypothetical protein SB521682_3508 [Shigella boydii 5216-82]
 gi|332087620|gb|EGI92747.1| hypothetical protein SD15574_3393 [Shigella dysenteriae 155-74]
 gi|332752996|gb|EGJ83380.1| hypothetical protein SF434370_3174 [Shigella flexneri 4343-70]
 gi|332753797|gb|EGJ84176.1| hypothetical protein SFK671_3527 [Shigella flexneri K-671]
 gi|332754618|gb|EGJ84984.1| hypothetical protein SF274771_3539 [Shigella flexneri 2747-71]
 gi|332765370|gb|EGJ95588.1| hypothetical protein SF293071_3451 [Shigella flexneri 2930-71]
 gi|333000455|gb|EGK20036.1| hypothetical protein SFK272_3776 [Shigella flexneri K-272]
 gi|333015108|gb|EGK34451.1| hypothetical protein SFK304_3758 [Shigella flexneri K-304]
 gi|333015293|gb|EGK34635.1| hypothetical protein SFK227_3749 [Shigella flexneri K-227]
 gi|335573811|gb|EGM60149.1| hypothetical protein SFJ1713_3432 [Shigella flexneri J1713]
 gi|338769085|gb|EGP23867.1| hypothetical protein PPECC33_28340 [Escherichia coli PCN033]
 gi|340733241|gb|EGR62373.1| hypothetical protein HUSEC41_16103 [Escherichia coli O104:H4 str.
           01-09591]
 gi|340738958|gb|EGR73198.1| hypothetical protein HUSEC_16453 [Escherichia coli O104:H4 str.
           LB226692]
 gi|341920713|gb|EGT70319.1| hypothetical protein C22711_4351 [Escherichia coli O104:H4 str.
           C227-11]
 gi|342930239|gb|EGU98961.1| putative cytoplasmic protein [Escherichia coli MS 79-10]
 gi|345333683|gb|EGW66132.1| hypothetical protein ECSTECC16502_3647 [Escherichia coli
           STEC_C165-02]
 gi|345335409|gb|EGW67848.1| hypothetical protein EC253486_3941 [Escherichia coli 2534-86]
 gi|345335917|gb|EGW68354.1| hypothetical protein ECSTECB2F1_3147 [Escherichia coli STEC_B2F1]
 gi|345349172|gb|EGW81463.1| hypothetical protein ECSTEC94C_3472 [Escherichia coli STEC_94C]
 gi|345356787|gb|EGW88988.1| hypothetical protein ECSTECDG1313_3994 [Escherichia coli
           STEC_DG131-3]
 gi|345371872|gb|EGX03841.1| hypothetical protein ECSTECMHI813_3244 [Escherichia coli
           STEC_MHI813]
 gi|345376061|gb|EGX08007.1| hypothetical protein ECSTECH18_3746 [Escherichia coli STEC_H.1.8]
 gi|345392550|gb|EGX22331.1| hypothetical protein ECTX1999_3492 [Escherichia coli TX1999]
 gi|349739418|gb|AEQ14124.1| conserved protein, UPF0235 family [Escherichia coli O7:K1 str.
           CE10]
 gi|354862794|gb|EHF23232.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. C236-11]
 gi|354868078|gb|EHF28500.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. C227-11]
 gi|354868473|gb|EHF28891.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 04-8351]
 gi|354874076|gb|EHF34453.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 09-7901]
 gi|354880759|gb|EHF41095.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-3677]
 gi|354887913|gb|EHF48178.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4404]
 gi|354892501|gb|EHF52710.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4522]
 gi|354893707|gb|EHF53910.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C1]
 gi|354896510|gb|EHF56681.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4623]
 gi|354897887|gb|EHF58044.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C2]
 gi|354911739|gb|EHF71743.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C5]
 gi|354913688|gb|EHF73678.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C3]
 gi|354916645|gb|EHF76617.1| UPF0235 protein yggU [Escherichia coli O104:H4 str. 11-4632 C4]
 gi|355350611|gb|EHF99808.1| hypothetical protein i01_04069 [Escherichia coli cloneA_i1]
 gi|355421602|gb|AER85799.1| hypothetical protein i02_3257 [Escherichia coli str. 'clone D i2']
 gi|355426522|gb|AER90718.1| hypothetical protein i14_3257 [Escherichia coli str. 'clone D i14']
 gi|371594894|gb|EHN83752.1| UPF0235 protein yggU [Escherichia coli H494]
 gi|371600763|gb|EHN89533.1| UPF0235 protein yggU [Escherichia coli TA124]
 gi|371605442|gb|EHN94056.1| hypothetical protein ESPG_03602 [Escherichia coli H397]
 gi|371608707|gb|EHN97258.1| hypothetical protein ESOG_04146 [Escherichia coli E101]
 gi|371615027|gb|EHO03487.1| hypothetical protein ESNG_01002 [Escherichia coli B093]
 gi|375319850|gb|EHS65907.1| hypothetical protein T22_17555 [Escherichia coli O157:H43 str. T22]
 gi|378013548|gb|EHV76465.1| hypothetical protein ECDEC7A_3437 [Escherichia coli DEC7A]
 gi|378022436|gb|EHV85123.1| hypothetical protein ECDEC7C_3475 [Escherichia coli DEC7C]
 gi|378025692|gb|EHV88332.1| hypothetical protein ECDEC7D_3697 [Escherichia coli DEC7D]
 gi|378030885|gb|EHV93478.1| hypothetical protein ECDEC7B_3218 [Escherichia coli DEC7B]
 gi|378036910|gb|EHV99446.1| hypothetical protein ECDEC7E_3363 [Escherichia coli DEC7E]
 gi|378045150|gb|EHW07556.1| hypothetical protein ECDEC8A_3641 [Escherichia coli DEC8A]
 gi|378046120|gb|EHW08500.1| hypothetical protein ECDEC8B_3944 [Escherichia coli DEC8B]
 gi|378050573|gb|EHW12900.1| hypothetical protein ECDEC8C_4606 [Escherichia coli DEC8C]
 gi|378059846|gb|EHW22045.1| hypothetical protein ECDEC8D_4040 [Escherichia coli DEC8D]
 gi|378063808|gb|EHW25972.1| hypothetical protein ECDEC8E_3816 [Escherichia coli DEC8E]
 gi|378071016|gb|EHW33088.1| hypothetical protein ECDEC9A_3836 [Escherichia coli DEC9A]
 gi|378075546|gb|EHW37560.1| hypothetical protein ECDEC9B_3573 [Escherichia coli DEC9B]
 gi|378082592|gb|EHW44537.1| hypothetical protein ECDEC9C_3637 [Escherichia coli DEC9C]
 gi|378088877|gb|EHW50727.1| hypothetical protein ECDEC9D_3518 [Escherichia coli DEC9D]
 gi|378092600|gb|EHW54422.1| hypothetical protein ECDEC9E_3963 [Escherichia coli DEC9E]
 gi|378098767|gb|EHW60499.1| hypothetical protein ECDEC10A_4056 [Escherichia coli DEC10A]
 gi|378104791|gb|EHW66449.1| hypothetical protein ECDEC10B_4430 [Escherichia coli DEC10B]
 gi|378109101|gb|EHW70712.1| hypothetical protein ECDEC10C_4483 [Escherichia coli DEC10C]
 gi|378114984|gb|EHW76535.1| hypothetical protein ECDEC10D_4053 [Escherichia coli DEC10D]
 gi|378126762|gb|EHW88156.1| hypothetical protein ECDEC10E_3459 [Escherichia coli DEC10E]
 gi|378128033|gb|EHW89419.1| hypothetical protein ECDEC11A_3402 [Escherichia coli DEC11A]
 gi|378129700|gb|EHW91071.1| hypothetical protein ECDEC10F_4355 [Escherichia coli DEC10F]
 gi|378140371|gb|EHX01599.1| hypothetical protein ECDEC11B_3399 [Escherichia coli DEC11B]
 gi|378146827|gb|EHX07977.1| hypothetical protein ECDEC11D_3494 [Escherichia coli DEC11D]
 gi|378149358|gb|EHX10485.1| hypothetical protein ECDEC11C_3663 [Escherichia coli DEC11C]
 gi|378156981|gb|EHX18027.1| hypothetical protein ECDEC11E_3459 [Escherichia coli DEC11E]
 gi|378163802|gb|EHX24754.1| hypothetical protein ECDEC12B_4100 [Escherichia coli DEC12B]
 gi|378168093|gb|EHX29004.1| hypothetical protein ECDEC12A_3618 [Escherichia coli DEC12A]
 gi|378168260|gb|EHX29169.1| hypothetical protein ECDEC12C_3786 [Escherichia coli DEC12C]
 gi|378180474|gb|EHX41161.1| hypothetical protein ECDEC12D_3844 [Escherichia coli DEC12D]
 gi|378184587|gb|EHX45223.1| hypothetical protein ECDEC13A_3221 [Escherichia coli DEC13A]
 gi|378185981|gb|EHX46605.1| hypothetical protein ECDEC12E_3575 [Escherichia coli DEC12E]
 gi|378198331|gb|EHX58802.1| hypothetical protein ECDEC13C_3563 [Escherichia coli DEC13C]
 gi|378198691|gb|EHX59161.1| hypothetical protein ECDEC13B_3068 [Escherichia coli DEC13B]
 gi|378201780|gb|EHX62223.1| hypothetical protein ECDEC13D_3377 [Escherichia coli DEC13D]
 gi|378211144|gb|EHX71488.1| hypothetical protein ECDEC13E_3403 [Escherichia coli DEC13E]
 gi|378214823|gb|EHX75125.1| hypothetical protein ECDEC14A_3211 [Escherichia coli DEC14A]
 gi|378218494|gb|EHX78766.1| hypothetical protein ECDEC14B_3547 [Escherichia coli DEC14B]
 gi|378226751|gb|EHX86937.1| hypothetical protein ECDEC14C_3425 [Escherichia coli DEC14C]
 gi|378229978|gb|EHX90109.1| hypothetical protein ECDEC14D_3419 [Escherichia coli DEC14D]
 gi|378236050|gb|EHX96105.1| hypothetical protein ECDEC15A_3704 [Escherichia coli DEC15A]
 gi|378241121|gb|EHY01088.1| hypothetical protein ECDEC15B_3506 [Escherichia coli DEC15B]
 gi|378245726|gb|EHY05663.1| hypothetical protein ECDEC15C_3416 [Escherichia coli DEC15C]
 gi|378253189|gb|EHY13067.1| hypothetical protein ECDEC15D_3361 [Escherichia coli DEC15D]
 gi|378258153|gb|EHY17984.1| hypothetical protein ECDEC15E_3693 [Escherichia coli DEC15E]
 gi|380347223|gb|EIA35512.1| hypothetical protein OQA_14531 [Escherichia coli SCI-07]
 gi|383475861|gb|EID67814.1| hypothetical protein ECW26_16220 [Escherichia coli W26]
 gi|384473610|gb|EIE57650.1| hypothetical protein ECAI27_02950 [Escherichia coli AI27]
 gi|385707716|gb|EIG44743.1| UPF0235 protein yggU [Escherichia coli H730]
 gi|385710666|gb|EIG47643.1| UPF0235 protein yggU [Escherichia coli B799]
 gi|386137530|gb|EIG78692.1| TIGR00251 family protein [Escherichia coli 1.2741]
 gi|386146658|gb|EIG93103.1| TIGR00251 family protein [Escherichia coli 97.0246]
 gi|386152934|gb|EIH04223.1| TIGR00251 family protein [Escherichia coli 5.0588]
 gi|386156609|gb|EIH12954.1| TIGR00251 family protein [Escherichia coli 97.0259]
 gi|386160482|gb|EIH22293.1| TIGR00251 family protein [Escherichia coli 1.2264]
 gi|386166997|gb|EIH33517.1| TIGR00251 family protein [Escherichia coli 96.0497]
 gi|386173502|gb|EIH45514.1| TIGR00251 family protein [Escherichia coli 99.0741]
 gi|386179179|gb|EIH56658.1| TIGR00251 family protein [Escherichia coli 3.2608]
 gi|386182018|gb|EIH64776.1| TIGR00251 family protein [Escherichia coli 93.0624]
 gi|386187700|gb|EIH76513.1| TIGR00251 family protein [Escherichia coli 4.0522]
 gi|386195126|gb|EIH89362.1| TIGR00251 family protein [Escherichia coli JB1-95]
 gi|386203073|gb|EII02064.1| TIGR00251 family protein [Escherichia coli 96.154]
 gi|386207913|gb|EII12418.1| TIGR00251 family protein [Escherichia coli 5.0959]
 gi|386214457|gb|EII24880.1| TIGR00251 family protein [Escherichia coli 9.0111]
 gi|386218562|gb|EII35045.1| TIGR00251 family protein [Escherichia coli 4.0967]
 gi|386230459|gb|EII57814.1| TIGR00251 family protein [Escherichia coli 3.3884]
 gi|386246549|gb|EII88279.1| TIGR00251 family protein [Escherichia coli 3003]
 gi|386250276|gb|EII96443.1| TIGR00251 family protein [Escherichia coli TW07793]
 gi|386259915|gb|EIJ15389.1| TIGR00251 family protein [Escherichia coli 900105 (10e)]
 gi|388333378|gb|EIL00013.1| hypothetical protein ECO9534_07139 [Escherichia coli O111:H11 str.
           CVM9534]
 gi|388339642|gb|EIL05995.1| hypothetical protein ECO9340_03663 [Escherichia coli O103:H25 str.
           CVM9340]
 gi|388346893|gb|EIL12603.1| hypothetical protein ECO9450_27452 [Escherichia coli O103:H2 str.
           CVM9450]
 gi|388365677|gb|EIL29460.1| hypothetical protein ECO9570_29780 [Escherichia coli O111:H8 str.
           CVM9570]
 gi|388368882|gb|EIL32502.1| hypothetical protein ECO9574_26878 [Escherichia coli O111:H8 str.
           CVM9574]
 gi|388370396|gb|EIL33926.1| hypothetical protein ECO9545_08168 [Escherichia coli O111:H11 str.
           CVM9545]
 gi|388372067|gb|EIL35517.1| hypothetical protein ECO10026_04727 [Escherichia coli O26:H11 str.
           CVM10026]
 gi|388380509|gb|EIL43112.1| hypothetical protein ECO9942_08651 [Escherichia coli O26:H11 str.
           CVM9942]
 gi|388382551|gb|EIL44406.1| hypothetical protein ECKD1_23174 [Escherichia coli KD1]
 gi|388386427|gb|EIL48075.1| hypothetical protein ECKD2_17670 [Escherichia coli KD2]
 gi|388391066|gb|EIL52540.1| hypothetical protein EC54115_13548 [Escherichia coli 541-15]
 gi|388406989|gb|EIL67366.1| hypothetical protein EC5761_13497 [Escherichia coli 576-1]
 gi|388407607|gb|EIL67972.1| hypothetical protein EC5411_04199 [Escherichia coli 541-1]
 gi|388418516|gb|EIL78321.1| hypothetical protein ECHM605_10556 [Escherichia coli HM605]
 gi|388421630|gb|EIL81235.1| hypothetical protein ECMT8_02821 [Escherichia coli CUMT8]
 gi|391246310|gb|EIQ05571.1| hypothetical protein SF285071_3539 [Shigella flexneri 2850-71]
 gi|391264133|gb|EIQ23129.1| hypothetical protein SFK404_3907 [Shigella flexneri K-404]
 gi|391267140|gb|EIQ26077.1| hypothetical protein SB96558_3905 [Shigella boydii 965-58]
 gi|391303973|gb|EIQ61799.1| hypothetical protein ECEPECA12_3469 [Escherichia coli EPECa12]
 gi|391311090|gb|EIQ68736.1| hypothetical protein ECEPECC34262_3841 [Escherichia coli EPEC
           C342-62]
 gi|391313775|gb|EIQ71343.1| hypothetical protein SF123566_5097 [Shigella flexneri 1235-66]
 gi|394383251|gb|EJE60857.1| hypothetical protein ECO10224_08981 [Escherichia coli O26:H11 str.
           CVM10224]
 gi|394387336|gb|EJE64794.1| hypothetical protein ECO9602_03393 [Escherichia coli O111:H8 str.
           CVM9602]
 gi|394394045|gb|EJE70674.1| hypothetical protein ECO9455_06755 [Escherichia coli O111:H11 str.
           CVM9455]
 gi|394396305|gb|EJE72681.1| hypothetical protein ECO9634_21047 [Escherichia coli O111:H8 str.
           CVM9634]
 gi|394397401|gb|EJE73674.1| hypothetical protein ECO9553_06271 [Escherichia coli O111:H11 str.
           CVM9553]
 gi|394402834|gb|EJE78522.1| hypothetical protein ECO10021_10238 [Escherichia coli O26:H11 str.
           CVM10021]
 gi|394428906|gb|EJF01391.1| hypothetical protein ECO10030_05141 [Escherichia coli O26:H11 str.
           CVM10030]
 gi|394430008|gb|EJF02391.1| hypothetical protein ECO9952_09892 [Escherichia coli O26:H11 str.
           CVM9952]
 gi|397784336|gb|EJK95192.1| hypothetical protein ECSTECO31_3202 [Escherichia coli STEC_O31]
 gi|397895891|gb|EJL12316.1| hypothetical protein SF660363_3459 [Shigella flexneri 6603-63]
 gi|404290421|gb|EEH71648.2| UPF0235 protein yggU [Escherichia sp. 1_1_43]
 gi|406776236|gb|AFS55660.1| hypothetical protein O3M_04730 [Escherichia coli O104:H4 str.
           2009EL-2050]
 gi|407052805|gb|AFS72856.1| hypothetical protein O3K_04685 [Escherichia coli O104:H4 str.
           2011C-3493]
 gi|407066866|gb|AFS87913.1| hypothetical protein O3O_20965 [Escherichia coli O104:H4 str.
           2009EL-2071]
 gi|408200161|gb|EKI25349.1| hypothetical protein ECARS42123_3410 [Escherichia coli ARS4.2123]
 gi|408211869|gb|EKI36410.1| hypothetical protein EC07798_3612 [Escherichia coli 07798]
 gi|408212032|gb|EKI36566.1| hypothetical protein EC3006_3557 [Escherichia coli 3006]
 gi|408227057|gb|EKI50677.1| hypothetical protein ECN1_3116 [Escherichia coli N1]
 gi|408295113|gb|EKJ13455.1| hypothetical protein ECEC1865_4185 [Escherichia coli EC1865]
 gi|408342666|gb|EKJ57093.1| hypothetical protein EC01288_3165 [Escherichia coli 0.1288]
 gi|421935420|gb|EKT93112.1| hypothetical protein CFSAN001632_23250 [Escherichia coli O111:H8
           str. CFSAN001632]
 gi|421944960|gb|EKU02199.1| hypothetical protein CFSAN001629_07416 [Escherichia coli O26:H11
           str. CFSAN001629]
 gi|421948783|gb|EKU05787.1| hypothetical protein CFSAN001630_08138 [Escherichia coli O111:H11
           str. CFSAN001630]
 gi|429347635|gb|EKY84408.1| hypothetical protein C214_02212 [Escherichia coli O104:H4 str.
           11-02092]
 gi|429358671|gb|EKY95340.1| hypothetical protein C212_02216 [Escherichia coli O104:H4 str.
           11-02030]
 gi|429360416|gb|EKY97075.1| hypothetical protein C213_02214 [Escherichia coli O104:H4 str.
           11-02033-1]
 gi|429360727|gb|EKY97385.1| hypothetical protein C215_02213 [Escherichia coli O104:H4 str.
           11-02093]
 gi|429364095|gb|EKZ00720.1| hypothetical protein C217_02212 [Escherichia coli O104:H4 str.
           11-02318]
 gi|429375650|gb|EKZ12184.1| hypothetical protein C216_02215 [Escherichia coli O104:H4 str.
           11-02281]
 gi|429378058|gb|EKZ14573.1| hypothetical protein C219_02212 [Escherichia coli O104:H4 str.
           11-03439]
 gi|429389703|gb|EKZ26123.1| hypothetical protein C218_02212 [Escherichia coli O104:H4 str.
           11-02913]
 gi|429393537|gb|EKZ29932.1| hypothetical protein C221_02212 [Escherichia coli O104:H4 str.
           11-03943]
 gi|429403541|gb|EKZ39825.1| hypothetical protein C220_02213 [Escherichia coli O104:H4 str.
           11-04080]
 gi|429404726|gb|EKZ40997.1| hypothetical protein MO5_01864 [Escherichia coli O104:H4 str.
           Ec11-9990]
 gi|429408241|gb|EKZ44481.1| hypothetical protein MO3_03245 [Escherichia coli O104:H4 str.
           Ec11-9450]
 gi|429413345|gb|EKZ49534.1| hypothetical protein O7I_01560 [Escherichia coli O104:H4 str.
           Ec11-4987]
 gi|429416074|gb|EKZ52232.1| hypothetical protein O7C_01867 [Escherichia coli O104:H4 str.
           Ec11-4984]
 gi|429419755|gb|EKZ55890.1| hypothetical protein O7G_02688 [Escherichia coli O104:H4 str.
           Ec11-4986]
 gi|429430594|gb|EKZ66655.1| hypothetical protein O7K_03111 [Escherichia coli O104:H4 str.
           Ec11-4988]
 gi|429434960|gb|EKZ70981.1| hypothetical protein O7M_03687 [Escherichia coli O104:H4 str.
           Ec11-5603]
 gi|429437093|gb|EKZ73105.1| hypothetical protein O7O_01182 [Escherichia coli O104:H4 str.
           Ec11-6006]
 gi|429442042|gb|EKZ78005.1| hypothetical protein O7E_01869 [Escherichia coli O104:H4 str.
           Ec11-5604]
 gi|429446763|gb|EKZ82691.1| hypothetical protein S7Y_03643 [Escherichia coli O104:H4 str.
           Ec12-0465]
 gi|429450375|gb|EKZ86271.1| hypothetical protein MO7_01844 [Escherichia coli O104:H4 str.
           Ec11-9941]
 gi|429456132|gb|EKZ91979.1| hypothetical protein S91_01945 [Escherichia coli O104:H4 str.
           Ec12-0466]
 gi|430873788|gb|ELB97354.1| hypothetical protein WCA_03872 [Escherichia coli KTE2]
 gi|430875138|gb|ELB98681.1| hypothetical protein WCC_03240 [Escherichia coli KTE4]
 gi|430883885|gb|ELC06856.1| hypothetical protein WCE_03155 [Escherichia coli KTE5]
 gi|430891758|gb|ELC14279.1| hypothetical protein WCM_00343 [Escherichia coli KTE10]
 gi|430896575|gb|ELC18803.1| hypothetical protein WCQ_03025 [Escherichia coli KTE12]
 gi|430904822|gb|ELC26521.1| hypothetical protein WCY_03854 [Escherichia coli KTE16]
 gi|430905716|gb|ELC27324.1| hypothetical protein WCU_02817 [Escherichia coli KTE15]
 gi|430916892|gb|ELC37951.1| hypothetical protein WE9_03750 [Escherichia coli KTE21]
 gi|430924391|gb|ELC45112.1| hypothetical protein WEK_03437 [Escherichia coli KTE26]
 gi|430934253|gb|ELC54626.1| hypothetical protein WG9_03575 [Escherichia coli KTE39]
 gi|430937696|gb|ELC57950.1| hypothetical protein WGI_03935 [Escherichia coli KTE44]
 gi|430951332|gb|ELC70552.1| hypothetical protein A13K_03451 [Escherichia coli KTE187]
 gi|430953301|gb|ELC72201.1| hypothetical protein A139_02907 [Escherichia coli KTE181]
 gi|430961806|gb|ELC79813.1| hypothetical protein A13M_03360 [Escherichia coli KTE188]
 gi|430965289|gb|ELC82730.1| hypothetical protein A13O_03224 [Escherichia coli KTE189]
 gi|430972345|gb|ELC89343.1| hypothetical protein A13S_03557 [Escherichia coli KTE191]
 gi|430978409|gb|ELC95220.1| hypothetical protein A13W_02087 [Escherichia coli KTE193]
 gi|430980978|gb|ELC97722.1| hypothetical protein A15C_03781 [Escherichia coli KTE201]
 gi|430987643|gb|ELD04173.1| hypothetical protein A15I_02839 [Escherichia coli KTE204]
 gi|430992357|gb|ELD08730.1| hypothetical protein A15K_03072 [Escherichia coli KTE205]
 gi|430996891|gb|ELD13166.1| hypothetical protein A15M_03154 [Escherichia coli KTE206]
 gi|431004784|gb|ELD19993.1| hypothetical protein A15U_03414 [Escherichia coli KTE210]
 gi|431014400|gb|ELD28108.1| hypothetical protein A15Y_03210 [Escherichia coli KTE212]
 gi|431022664|gb|ELD35925.1| hypothetical protein A173_04148 [Escherichia coli KTE214]
 gi|431026769|gb|ELD39837.1| hypothetical protein A177_03511 [Escherichia coli KTE216]
 gi|431037232|gb|ELD48220.1| hypothetical protein A17E_02788 [Escherichia coli KTE220]
 gi|431040594|gb|ELD51129.1| hypothetical protein A17M_03091 [Escherichia coli KTE224]
 gi|431050253|gb|ELD60004.1| hypothetical protein A17Y_03237 [Escherichia coli KTE230]
 gi|431059175|gb|ELD68551.1| hypothetical protein A193_03767 [Escherichia coli KTE234]
 gi|431061899|gb|ELD71192.1| hypothetical protein A191_00978 [Escherichia coli KTE233]
 gi|431067644|gb|ELD76160.1| hypothetical protein A195_02861 [Escherichia coli KTE235]
 gi|431073531|gb|ELD81182.1| hypothetical protein A197_03190 [Escherichia coli KTE236]
 gi|431078808|gb|ELD85848.1| hypothetical protein A199_03565 [Escherichia coli KTE237]
 gi|431082345|gb|ELD88659.1| hypothetical protein A1S3_03409 [Escherichia coli KTE47]
 gi|431091618|gb|ELD97335.1| hypothetical protein A1SA_03870 [Escherichia coli KTE51]
 gi|431098628|gb|ELE03941.1| hypothetical protein A1SE_03592 [Escherichia coli KTE53]
 gi|431105714|gb|ELE10048.1| hypothetical protein A1SI_03838 [Escherichia coli KTE55]
 gi|431113705|gb|ELE17359.1| hypothetical protein A1SK_00886 [Escherichia coli KTE56]
 gi|431118619|gb|ELE21638.1| hypothetical protein A1SO_03631 [Escherichia coli KTE58]
 gi|431122240|gb|ELE25109.1| hypothetical protein A1SM_01163 [Escherichia coli KTE57]
 gi|431126531|gb|ELE28878.1| hypothetical protein A1SS_03562 [Escherichia coli KTE60]
 gi|431128996|gb|ELE31172.1| hypothetical protein A1SW_03867 [Escherichia coli KTE62]
 gi|431136874|gb|ELE38730.1| hypothetical protein A1U7_03812 [Escherichia coli KTE67]
 gi|431139966|gb|ELE41744.1| hypothetical protein A1U5_03467 [Escherichia coli KTE66]
 gi|431147120|gb|ELE48543.1| hypothetical protein A1UG_03315 [Escherichia coli KTE72]
 gi|431152696|gb|ELE53642.1| hypothetical protein A1UM_03592 [Escherichia coli KTE75]
 gi|431157814|gb|ELE58448.1| hypothetical protein A1UO_03059 [Escherichia coli KTE76]
 gi|431167861|gb|ELE68115.1| hypothetical protein A1UW_03118 [Escherichia coli KTE80]
 gi|431178836|gb|ELE78743.1| hypothetical protein A1W5_03258 [Escherichia coli KTE86]
 gi|431179975|gb|ELE79866.1| hypothetical protein A1W1_03320 [Escherichia coli KTE83]
 gi|431188986|gb|ELE88425.1| hypothetical protein A1W7_03554 [Escherichia coli KTE87]
 gi|431189253|gb|ELE88678.1| hypothetical protein A1WE_03207 [Escherichia coli KTE93]
 gi|431199148|gb|ELE97910.1| hypothetical protein A1Y3_04049 [Escherichia coli KTE116]
 gi|431208884|gb|ELF07005.1| hypothetical protein A1Y7_03594 [Escherichia coli KTE119]
 gi|431218617|gb|ELF16057.1| hypothetical protein A1YW_03325 [Escherichia coli KTE143]
 gi|431232362|gb|ELF28030.1| hypothetical protein A31I_03217 [Escherichia coli KTE162]
 gi|431237617|gb|ELF32611.1| hypothetical protein A31G_00367 [Escherichia coli KTE161]
 gi|431241959|gb|ELF36388.1| hypothetical protein A31M_03110 [Escherichia coli KTE169]
 gi|431254478|gb|ELF47748.1| hypothetical protein WCI_03050 [Escherichia coli KTE8]
 gi|431256307|gb|ELF49381.1| hypothetical protein WCG_00467 [Escherichia coli KTE6]
 gi|431260892|gb|ELF52983.1| hypothetical protein WCK_03704 [Escherichia coli KTE9]
 gi|431272599|gb|ELF63698.1| hypothetical protein WGK_03551 [Escherichia coli KTE45]
 gi|431290025|gb|ELF80750.1| hypothetical protein WGG_03035 [Escherichia coli KTE43]
 gi|431294599|gb|ELF84778.1| hypothetical protein WEQ_02841 [Escherichia coli KTE29]
 gi|431301115|gb|ELF90662.1| hypothetical protein WEA_02807 [Escherichia coli KTE22]
 gi|431306084|gb|ELF94397.1| hypothetical protein A1S1_02919 [Escherichia coli KTE46]
 gi|431308369|gb|ELF96649.1| hypothetical protein A1S5_03878 [Escherichia coli KTE48]
 gi|431312979|gb|ELG00959.1| hypothetical protein A1S9_04684 [Escherichia coli KTE50]
 gi|431325134|gb|ELG12522.1| hypothetical protein A1SQ_03553 [Escherichia coli KTE59]
 gi|431327983|gb|ELG15303.1| hypothetical protein A1SY_03688 [Escherichia coli KTE63]
 gi|431335883|gb|ELG23012.1| hypothetical protein A1U3_03013 [Escherichia coli KTE65]
 gi|431338185|gb|ELG25272.1| hypothetical protein A1US_03348 [Escherichia coli KTE78]
 gi|431347211|gb|ELG34104.1| hypothetical protein A1W3_03371 [Escherichia coli KTE84]
 gi|431350682|gb|ELG37493.1| hypothetical protein A1UU_00339 [Escherichia coli KTE79]
 gi|431353578|gb|ELG40331.1| hypothetical protein A1WA_03040 [Escherichia coli KTE91]
 gi|431361007|gb|ELG47606.1| hypothetical protein A1WM_01819 [Escherichia coli KTE101]
 gi|431361650|gb|ELG48229.1| hypothetical protein A1Y1_03049 [Escherichia coli KTE115]
 gi|431366109|gb|ELG52607.1| hypothetical protein A1Y5_03936 [Escherichia coli KTE118]
 gi|431378354|gb|ELG63345.1| hypothetical protein A1YA_00496 [Escherichia coli KTE123]
 gi|431383439|gb|ELG67563.1| hypothetical protein A1YM_00352 [Escherichia coli KTE135]
 gi|431383941|gb|ELG68064.1| hypothetical protein A1YO_03257 [Escherichia coli KTE136]
 gi|431393490|gb|ELG77054.1| hypothetical protein A1YS_03425 [Escherichia coli KTE141]
 gi|431403463|gb|ELG86744.1| hypothetical protein A311_03663 [Escherichia coli KTE146]
 gi|431409366|gb|ELG92541.1| hypothetical protein A313_01686 [Escherichia coli KTE147]
 gi|431414662|gb|ELG97213.1| hypothetical protein A31C_03698 [Escherichia coli KTE158]
 gi|431423934|gb|ELH06031.1| hypothetical protein A13U_03364 [Escherichia coli KTE192]
 gi|431430705|gb|ELH12536.1| hypothetical protein A13Y_03431 [Escherichia coli KTE194]
 gi|431439237|gb|ELH20573.1| hypothetical protein A133_03490 [Escherichia coli KTE173]
 gi|431442633|gb|ELH23722.1| hypothetical protein A135_03834 [Escherichia coli KTE175]
 gi|431451821|gb|ELH32292.1| hypothetical protein A13E_04372 [Escherichia coli KTE184]
 gi|431455648|gb|ELH36003.1| hypothetical protein A153_03719 [Escherichia coli KTE196]
 gi|431461109|gb|ELH41377.1| hypothetical protein A13C_01983 [Escherichia coli KTE183]
 gi|431468727|gb|ELH48660.1| hypothetical protein A15G_04137 [Escherichia coli KTE203]
 gi|431471882|gb|ELH51774.1| hypothetical protein A15E_03664 [Escherichia coli KTE202]
 gi|431480254|gb|ELH59981.1| hypothetical protein A15O_03678 [Escherichia coli KTE207]
 gi|431487126|gb|ELH66771.1| hypothetical protein A15S_00922 [Escherichia coli KTE209]
 gi|431490417|gb|ELH70034.1| hypothetical protein A15W_03437 [Escherichia coli KTE211]
 gi|431497947|gb|ELH77164.1| hypothetical protein A175_03146 [Escherichia coli KTE215]
 gi|431503414|gb|ELH82149.1| hypothetical protein A17A_03696 [Escherichia coli KTE218]
 gi|431506618|gb|ELH85213.1| hypothetical protein A17K_03541 [Escherichia coli KTE223]
 gi|431512158|gb|ELH90286.1| hypothetical protein A17S_03995 [Escherichia coli KTE227]
 gi|431522110|gb|ELH99345.1| hypothetical protein A17W_01797 [Escherichia coli KTE229]
 gi|431528903|gb|ELI05608.1| hypothetical protein WI5_03023 [Escherichia coli KTE104]
 gi|431533412|gb|ELI09912.1| hypothetical protein WI9_02949 [Escherichia coli KTE106]
 gi|431541530|gb|ELI16969.1| hypothetical protein WIA_02955 [Escherichia coli KTE109]
 gi|431548350|gb|ELI22632.1| hypothetical protein WIC_03378 [Escherichia coli KTE112]
 gi|431550318|gb|ELI24315.1| hypothetical protein WIE_03280 [Escherichia coli KTE113]
 gi|431554239|gb|ELI28120.1| hypothetical protein WIG_03032 [Escherichia coli KTE117]
 gi|431563209|gb|ELI36442.1| hypothetical protein WII_03300 [Escherichia coli KTE120]
 gi|431568040|gb|ELI41032.1| hypothetical protein WIM_03174 [Escherichia coli KTE124]
 gi|431568319|gb|ELI41307.1| hypothetical protein WIK_03417 [Escherichia coli KTE122]
 gi|431579690|gb|ELI52270.1| hypothetical protein WIO_03200 [Escherichia coli KTE125]
 gi|431581319|gb|ELI53772.1| hypothetical protein WIQ_03143 [Escherichia coli KTE128]
 gi|431585316|gb|ELI57268.1| hypothetical protein WIS_03117 [Escherichia coli KTE129]
 gi|431595161|gb|ELI65235.1| hypothetical protein WIU_02974 [Escherichia coli KTE131]
 gi|431599988|gb|ELI69666.1| hypothetical protein WIW_03001 [Escherichia coli KTE133]
 gi|431603609|gb|ELI73034.1| hypothetical protein WIY_03054 [Escherichia coli KTE137]
 gi|431608618|gb|ELI77960.1| hypothetical protein WK1_02981 [Escherichia coli KTE138]
 gi|431614016|gb|ELI83181.1| hypothetical protein WK3_02926 [Escherichia coli KTE139]
 gi|431617755|gb|ELI86766.1| hypothetical protein WK5_03060 [Escherichia coli KTE145]
 gi|431625375|gb|ELI93960.1| hypothetical protein WK7_03043 [Escherichia coli KTE148]
 gi|431632817|gb|ELJ01104.1| hypothetical protein WKA_02996 [Escherichia coli KTE153]
 gi|431640783|gb|ELJ08538.1| hypothetical protein WKC_02924 [Escherichia coli KTE157]
 gi|431642873|gb|ELJ10580.1| hypothetical protein WKE_02973 [Escherichia coli KTE160]
 gi|431644853|gb|ELJ12507.1| hypothetical protein WKG_03235 [Escherichia coli KTE163]
 gi|431654812|gb|ELJ21859.1| hypothetical protein WKI_03098 [Escherichia coli KTE166]
 gi|431658397|gb|ELJ25311.1| hypothetical protein WKM_02826 [Escherichia coli KTE167]
 gi|431659769|gb|ELJ26659.1| hypothetical protein WKO_03065 [Escherichia coli KTE168]
 gi|431669322|gb|ELJ35749.1| hypothetical protein WKQ_03118 [Escherichia coli KTE174]
 gi|431672424|gb|ELJ38695.1| hypothetical protein WKS_02963 [Escherichia coli KTE176]
 gi|431685270|gb|ELJ50845.1| hypothetical protein WKW_03131 [Escherichia coli KTE179]
 gi|431686175|gb|ELJ51741.1| hypothetical protein WKY_03151 [Escherichia coli KTE180]
 gi|431690122|gb|ELJ55606.1| hypothetical protein WGQ_03104 [Escherichia coli KTE232]
 gi|431699054|gb|ELJ64071.1| hypothetical protein WGM_03447 [Escherichia coli KTE82]
 gi|431704068|gb|ELJ68702.1| hypothetical protein WGS_02787 [Escherichia coli KTE88]
 gi|431704229|gb|ELJ68861.1| hypothetical protein WGO_03068 [Escherichia coli KTE85]
 gi|431714388|gb|ELJ78580.1| hypothetical protein WGU_03325 [Escherichia coli KTE90]
 gi|431719435|gb|ELJ83494.1| hypothetical protein WGW_03202 [Escherichia coli KTE94]
 gi|431729190|gb|ELJ92829.1| hypothetical protein WI1_02816 [Escherichia coli KTE97]
 gi|431733732|gb|ELJ97167.1| hypothetical protein WI3_03009 [Escherichia coli KTE99]
 gi|432349344|gb|ELL43773.1| hypothetical protein B185_002349 [Escherichia coli J96]
 gi|441607028|emb|CCP99323.1| UPF0235 protein VC0458 [Escherichia coli O10:K5(L):H4 str. ATCC
           23506]
 gi|441653737|emb|CCQ01432.1| UPF0235 protein VC0458 [Escherichia coli O5:K4(L):H4 str. ATCC
           23502]
 gi|441714179|emb|CCQ05903.1| UPF0235 protein VC0458 [Escherichia coli Nissle 1917]
 gi|443423515|gb|AGC88419.1| hypothetical protein APECO78_18500 [Escherichia coli APEC O78]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|161367529|ref|NP_289525.2| hypothetical protein Z4298 [Escherichia coli O157:H7 str. EDL933]
 gi|162139760|ref|NP_311856.2| hypothetical protein ECs3829 [Escherichia coli O157:H7 str. Sakai]
 gi|168747555|ref|ZP_02772577.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4113]
 gi|168753905|ref|ZP_02778912.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4401]
 gi|168760095|ref|ZP_02785102.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4501]
 gi|168766960|ref|ZP_02791967.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4486]
 gi|168773408|ref|ZP_02798415.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4196]
 gi|168781812|ref|ZP_02806819.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4076]
 gi|168785811|ref|ZP_02810818.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC869]
 gi|168797528|ref|ZP_02822535.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC508]
 gi|195937091|ref|ZP_03082473.1| hypothetical protein EscherichcoliO157_11661 [Escherichia coli
           O157:H7 str. EC4024]
 gi|208807147|ref|ZP_03249484.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. EC4206]
 gi|208813482|ref|ZP_03254811.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. EC4045]
 gi|208818265|ref|ZP_03258585.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. EC4042]
 gi|209399373|ref|YP_002272433.1| hypothetical protein ECH74115_4256 [Escherichia coli O157:H7 str.
           EC4115]
 gi|217327282|ref|ZP_03443365.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. TW14588]
 gi|254794905|ref|YP_003079742.1| hypothetical protein ECSP_3924 [Escherichia coli O157:H7 str.
           TW14359]
 gi|261226265|ref|ZP_05940546.1| hypothetical protein EscherichiacoliO157_16963 [Escherichia coli
           O157:H7 str. FRIK2000]
 gi|261256477|ref|ZP_05949010.1| hypothetical protein EscherichiacoliO157EcO_11656 [Escherichia coli
           O157:H7 str. FRIK966]
 gi|291284274|ref|YP_003501092.1| hypothetical protein G2583_3612 [Escherichia coli O55:H7 str.
           CB9615]
 gi|387508306|ref|YP_006160562.1| hypothetical protein ECO55CA74_17220 [Escherichia coli O55:H7 str.
           RM12579]
 gi|387884144|ref|YP_006314446.1| hypothetical protein CDCO157_3580 [Escherichia coli Xuzhou21]
 gi|416314424|ref|ZP_11658659.1| hypothetical protein ECoA_04497 [Escherichia coli O157:H7 str.
           1044]
 gi|416322122|ref|ZP_11663970.1| hypothetical protein ECoD_04306 [Escherichia coli O157:H7 str.
           EC1212]
 gi|416327862|ref|ZP_11667782.1| hypothetical protein ECF_02672 [Escherichia coli O157:H7 str. 1125]
 gi|416777060|ref|ZP_11875094.1| hypothetical protein ECO5101_04264 [Escherichia coli O157:H7 str.
           G5101]
 gi|416788520|ref|ZP_11880019.1| hypothetical protein ECO9389_23751 [Escherichia coli O157:H- str.
           493-89]
 gi|416800507|ref|ZP_11884931.1| hypothetical protein ECO2687_11738 [Escherichia coli O157:H- str. H
           2687]
 gi|416811070|ref|ZP_11889695.1| hypothetical protein ECO7815_01895 [Escherichia coli O55:H7 str.
           3256-97]
 gi|416821760|ref|ZP_11894345.1| hypothetical protein ECO5905_09833 [Escherichia coli O55:H7 str.
           USDA 5905]
 gi|416832151|ref|ZP_11899441.1| hypothetical protein ECOSU61_08919 [Escherichia coli O157:H7 str.
           LSU-61]
 gi|419046863|ref|ZP_13593798.1| hypothetical protein ECDEC3A_3851 [Escherichia coli DEC3A]
 gi|419052714|ref|ZP_13599581.1| hypothetical protein ECDEC3B_4023 [Escherichia coli DEC3B]
 gi|419058709|ref|ZP_13605512.1| hypothetical protein ECDEC3C_4306 [Escherichia coli DEC3C]
 gi|419064205|ref|ZP_13610928.1| hypothetical protein ECDEC3D_4010 [Escherichia coli DEC3D]
 gi|419071152|ref|ZP_13616767.1| hypothetical protein ECDEC3E_4250 [Escherichia coli DEC3E]
 gi|419077235|ref|ZP_13622738.1| hypothetical protein ECDEC3F_4159 [Escherichia coli DEC3F]
 gi|419082177|ref|ZP_13627624.1| hypothetical protein ECDEC4A_3803 [Escherichia coli DEC4A]
 gi|419088016|ref|ZP_13633369.1| hypothetical protein ECDEC4B_3958 [Escherichia coli DEC4B]
 gi|419094010|ref|ZP_13639292.1| hypothetical protein ECDEC4C_3881 [Escherichia coli DEC4C]
 gi|419099822|ref|ZP_13645015.1| hypothetical protein ECDEC4D_3801 [Escherichia coli DEC4D]
 gi|419105522|ref|ZP_13650649.1| hypothetical protein ECDEC4E_3851 [Escherichia coli DEC4E]
 gi|419110987|ref|ZP_13656041.1| hypothetical protein ECDEC4F_3820 [Escherichia coli DEC4F]
 gi|419116348|ref|ZP_13661363.1| hypothetical protein ECDEC5A_3542 [Escherichia coli DEC5A]
 gi|419122039|ref|ZP_13666985.1| hypothetical protein ECDEC5B_3872 [Escherichia coli DEC5B]
 gi|419127601|ref|ZP_13672477.1| hypothetical protein ECDEC5C_3652 [Escherichia coli DEC5C]
 gi|419132976|ref|ZP_13677810.1| hypothetical protein ECDEC5D_3755 [Escherichia coli DEC5D]
 gi|419138125|ref|ZP_13682916.1| hypothetical protein ECDEC5E_3645 [Escherichia coli DEC5E]
 gi|420271166|ref|ZP_14773520.1| hypothetical protein ECPA22_4226 [Escherichia coli PA22]
 gi|420276992|ref|ZP_14779274.1| hypothetical protein ECPA40_4245 [Escherichia coli PA40]
 gi|420281956|ref|ZP_14784189.1| hypothetical protein ECTW06591_3758 [Escherichia coli TW06591]
 gi|420288534|ref|ZP_14790718.1| hypothetical protein ECTW10246_4309 [Escherichia coli TW10246]
 gi|420293995|ref|ZP_14796110.1| hypothetical protein ECTW11039_4141 [Escherichia coli TW11039]
 gi|420299911|ref|ZP_14801957.1| hypothetical protein ECTW09109_4399 [Escherichia coli TW09109]
 gi|420305689|ref|ZP_14807679.1| hypothetical protein ECTW10119_4591 [Escherichia coli TW10119]
 gi|420311200|ref|ZP_14813130.1| hypothetical protein ECEC1738_4060 [Escherichia coli EC1738]
 gi|420316986|ref|ZP_14818859.1| hypothetical protein ECEC1734_4074 [Escherichia coli EC1734]
 gi|421813978|ref|ZP_16249690.1| hypothetical protein EC80416_3758 [Escherichia coli 8.0416]
 gi|421819797|ref|ZP_16255288.1| hypothetical protein EC100821_3681 [Escherichia coli 10.0821]
 gi|421825804|ref|ZP_16261159.1| hypothetical protein ECFRIK920_4215 [Escherichia coli FRIK920]
 gi|421832502|ref|ZP_16267785.1| hypothetical protein ECPA7_4671 [Escherichia coli PA7]
 gi|423726841|ref|ZP_17700802.1| hypothetical protein ECPA31_4027 [Escherichia coli PA31]
 gi|424079098|ref|ZP_17816072.1| hypothetical protein ECFDA505_4029 [Escherichia coli FDA505]
 gi|424085553|ref|ZP_17822048.1| hypothetical protein ECFDA517_4393 [Escherichia coli FDA517]
 gi|424091965|ref|ZP_17827898.1| hypothetical protein ECFRIK1996_4130 [Escherichia coli FRIK1996]
 gi|424098613|ref|ZP_17833902.1| hypothetical protein ECFRIK1985_4334 [Escherichia coli FRIK1985]
 gi|424104839|ref|ZP_17839590.1| hypothetical protein ECFRIK1990_4242 [Escherichia coli FRIK1990]
 gi|424111490|ref|ZP_17845726.1| hypothetical protein EC93001_4195 [Escherichia coli 93-001]
 gi|424117428|ref|ZP_17851266.1| hypothetical protein ECPA3_4203 [Escherichia coli PA3]
 gi|424123613|ref|ZP_17856929.1| hypothetical protein ECPA5_4064 [Escherichia coli PA5]
 gi|424129768|ref|ZP_17862675.1| hypothetical protein ECPA9_4241 [Escherichia coli PA9]
 gi|424136086|ref|ZP_17868541.1| hypothetical protein ECPA10_4385 [Escherichia coli PA10]
 gi|424142634|ref|ZP_17874511.1| hypothetical protein ECPA14_4229 [Escherichia coli PA14]
 gi|424149041|ref|ZP_17880417.1| hypothetical protein ECPA15_4351 [Escherichia coli PA15]
 gi|424154874|ref|ZP_17885814.1| hypothetical protein ECPA24_3941 [Escherichia coli PA24]
 gi|424252709|ref|ZP_17891375.1| hypothetical protein ECPA25_3934 [Escherichia coli PA25]
 gi|424331062|ref|ZP_17897281.1| hypothetical protein ECPA28_4270 [Escherichia coli PA28]
 gi|424451316|ref|ZP_17902998.1| hypothetical protein ECPA32_4089 [Escherichia coli PA32]
 gi|424457508|ref|ZP_17908628.1| hypothetical protein ECPA33_4089 [Escherichia coli PA33]
 gi|424463960|ref|ZP_17914358.1| hypothetical protein ECPA39_4166 [Escherichia coli PA39]
 gi|424470275|ref|ZP_17920094.1| hypothetical protein ECPA41_4178 [Escherichia coli PA41]
 gi|424476788|ref|ZP_17926106.1| hypothetical protein ECPA42_4251 [Escherichia coli PA42]
 gi|424482551|ref|ZP_17931530.1| hypothetical protein ECTW07945_4093 [Escherichia coli TW07945]
 gi|424488720|ref|ZP_17937275.1| hypothetical protein ECTW09098_4167 [Escherichia coli TW09098]
 gi|424495334|ref|ZP_17942991.1| hypothetical protein ECTW09195_4222 [Escherichia coli TW09195]
 gi|424502080|ref|ZP_17948971.1| hypothetical protein ECEC4203_4169 [Escherichia coli EC4203]
 gi|424508326|ref|ZP_17954720.1| hypothetical protein ECEC4196_4218 [Escherichia coli EC4196]
 gi|424515671|ref|ZP_17960321.1| hypothetical protein ECTW14313_4022 [Escherichia coli TW14313]
 gi|424521880|ref|ZP_17966000.1| hypothetical protein ECTW14301_3950 [Escherichia coli TW14301]
 gi|424527760|ref|ZP_17971477.1| hypothetical protein ECEC4421_4009 [Escherichia coli EC4421]
 gi|424533913|ref|ZP_17977261.1| hypothetical protein ECEC4422_4142 [Escherichia coli EC4422]
 gi|424539965|ref|ZP_17982909.1| hypothetical protein ECEC4013_4277 [Escherichia coli EC4013]
 gi|424546078|ref|ZP_17988458.1| hypothetical protein ECEC4402_4140 [Escherichia coli EC4402]
 gi|424552307|ref|ZP_17994156.1| hypothetical protein ECEC4439_4101 [Escherichia coli EC4439]
 gi|424558487|ref|ZP_17999900.1| hypothetical protein ECEC4436_4041 [Escherichia coli EC4436]
 gi|424564825|ref|ZP_18005829.1| hypothetical protein ECEC4437_4200 [Escherichia coli EC4437]
 gi|424570967|ref|ZP_18011517.1| hypothetical protein ECEC4448_4114 [Escherichia coli EC4448]
 gi|424577125|ref|ZP_18017183.1| hypothetical protein ECEC1845_4082 [Escherichia coli EC1845]
 gi|424582945|ref|ZP_18022592.1| hypothetical protein ECEC1863_3815 [Escherichia coli EC1863]
 gi|425099619|ref|ZP_18502351.1| hypothetical protein EC34870_4161 [Escherichia coli 3.4870]
 gi|425105713|ref|ZP_18508032.1| hypothetical protein EC52239_4116 [Escherichia coli 5.2239]
 gi|425111730|ref|ZP_18513651.1| hypothetical protein EC60172_4274 [Escherichia coli 6.0172]
 gi|425127649|ref|ZP_18528818.1| hypothetical protein EC80586_4420 [Escherichia coli 8.0586]
 gi|425133386|ref|ZP_18534236.1| hypothetical protein EC82524_4031 [Escherichia coli 8.2524]
 gi|425139971|ref|ZP_18540352.1| hypothetical protein EC100833_4404 [Escherichia coli 10.0833]
 gi|425145680|ref|ZP_18545677.1| hypothetical protein EC100869_3943 [Escherichia coli 10.0869]
 gi|425151795|ref|ZP_18551410.1| hypothetical protein EC880221_4075 [Escherichia coli 88.0221]
 gi|425157668|ref|ZP_18556932.1| hypothetical protein ECPA34_4228 [Escherichia coli PA34]
 gi|425164018|ref|ZP_18562905.1| hypothetical protein ECFDA506_4431 [Escherichia coli FDA506]
 gi|425169761|ref|ZP_18568235.1| hypothetical protein ECFDA507_4169 [Escherichia coli FDA507]
 gi|425175824|ref|ZP_18573944.1| hypothetical protein ECFDA504_4107 [Escherichia coli FDA504]
 gi|425181863|ref|ZP_18579559.1| hypothetical protein ECFRIK1999_4285 [Escherichia coli FRIK1999]
 gi|425188126|ref|ZP_18585401.1| hypothetical protein ECFRIK1997_4346 [Escherichia coli FRIK1997]
 gi|425194897|ref|ZP_18591666.1| hypothetical protein ECNE1487_4495 [Escherichia coli NE1487]
 gi|425201366|ref|ZP_18597575.1| hypothetical protein ECNE037_4482 [Escherichia coli NE037]
 gi|425207757|ref|ZP_18603554.1| hypothetical protein ECFRIK2001_4501 [Escherichia coli FRIK2001]
 gi|425213510|ref|ZP_18608912.1| hypothetical protein ECPA4_4245 [Escherichia coli PA4]
 gi|425219632|ref|ZP_18614596.1| hypothetical protein ECPA23_4114 [Escherichia coli PA23]
 gi|425226184|ref|ZP_18620652.1| hypothetical protein ECPA49_4249 [Escherichia coli PA49]
 gi|425232443|ref|ZP_18626484.1| hypothetical protein ECPA45_4297 [Escherichia coli PA45]
 gi|425238366|ref|ZP_18632086.1| hypothetical protein ECTT12B_3994 [Escherichia coli TT12B]
 gi|425244604|ref|ZP_18637910.1| hypothetical protein ECMA6_4304 [Escherichia coli MA6]
 gi|425250740|ref|ZP_18643682.1| hypothetical protein EC5905_4364 [Escherichia coli 5905]
 gi|425256575|ref|ZP_18649090.1| hypothetical protein ECCB7326_4160 [Escherichia coli CB7326]
 gi|425262830|ref|ZP_18654834.1| hypothetical protein ECEC96038_4055 [Escherichia coli EC96038]
 gi|425268831|ref|ZP_18660461.1| hypothetical protein EC5412_4089 [Escherichia coli 5412]
 gi|425296277|ref|ZP_18686454.1| hypothetical protein ECPA38_3948 [Escherichia coli PA38]
 gi|425312969|ref|ZP_18702150.1| hypothetical protein ECEC1735_4083 [Escherichia coli EC1735]
 gi|425318956|ref|ZP_18707746.1| hypothetical protein ECEC1736_4035 [Escherichia coli EC1736]
 gi|425325039|ref|ZP_18713401.1| hypothetical protein ECEC1737_4017 [Escherichia coli EC1737]
 gi|425331407|ref|ZP_18719249.1| hypothetical protein ECEC1846_4134 [Escherichia coli EC1846]
 gi|425337585|ref|ZP_18724945.1| hypothetical protein ECEC1847_4159 [Escherichia coli EC1847]
 gi|425343907|ref|ZP_18730798.1| hypothetical protein ECEC1848_4277 [Escherichia coli EC1848]
 gi|425349713|ref|ZP_18736182.1| hypothetical protein ECEC1849_4014 [Escherichia coli EC1849]
 gi|425356012|ref|ZP_18742080.1| hypothetical protein ECEC1850_4267 [Escherichia coli EC1850]
 gi|425361975|ref|ZP_18747623.1| hypothetical protein ECEC1856_4092 [Escherichia coli EC1856]
 gi|425368178|ref|ZP_18753312.1| hypothetical protein ECEC1862_4098 [Escherichia coli EC1862]
 gi|425374504|ref|ZP_18759148.1| hypothetical protein ECEC1864_4237 [Escherichia coli EC1864]
 gi|425387398|ref|ZP_18770957.1| hypothetical protein ECEC1866_3999 [Escherichia coli EC1866]
 gi|425394050|ref|ZP_18777159.1| hypothetical protein ECEC1868_4272 [Escherichia coli EC1868]
 gi|425400185|ref|ZP_18782892.1| hypothetical protein ECEC1869_4258 [Escherichia coli EC1869]
 gi|425406275|ref|ZP_18788498.1| hypothetical protein ECEC1870_4053 [Escherichia coli EC1870]
 gi|425412659|ref|ZP_18794423.1| hypothetical protein ECNE098_4243 [Escherichia coli NE098]
 gi|425418983|ref|ZP_18800254.1| hypothetical protein ECFRIK523_4102 [Escherichia coli FRIK523]
 gi|425430246|ref|ZP_18810858.1| hypothetical protein EC01304_4217 [Escherichia coli 0.1304]
 gi|428948678|ref|ZP_19020958.1| hypothetical protein EC881467_4170 [Escherichia coli 88.1467]
 gi|428954759|ref|ZP_19026557.1| hypothetical protein EC881042_4122 [Escherichia coli 88.1042]
 gi|428960748|ref|ZP_19032044.1| hypothetical protein EC890511_4065 [Escherichia coli 89.0511]
 gi|428967362|ref|ZP_19038075.1| hypothetical protein EC900091_4460 [Escherichia coli 90.0091]
 gi|428973045|ref|ZP_19043370.1| hypothetical protein EC900039_3958 [Escherichia coli 90.0039]
 gi|428979566|ref|ZP_19049389.1| hypothetical protein EC902281_4071 [Escherichia coli 90.2281]
 gi|428985305|ref|ZP_19054700.1| hypothetical protein EC930055_3997 [Escherichia coli 93.0055]
 gi|428991473|ref|ZP_19060464.1| hypothetical protein EC930056_4051 [Escherichia coli 93.0056]
 gi|428997354|ref|ZP_19065951.1| hypothetical protein EC940618_3950 [Escherichia coli 94.0618]
 gi|429003635|ref|ZP_19071737.1| hypothetical protein EC950183_4137 [Escherichia coli 95.0183]
 gi|429009719|ref|ZP_19077190.1| hypothetical protein EC951288_3842 [Escherichia coli 95.1288]
 gi|429016253|ref|ZP_19083138.1| hypothetical protein EC950943_4236 [Escherichia coli 95.0943]
 gi|429022051|ref|ZP_19088575.1| hypothetical protein EC960428_3960 [Escherichia coli 96.0428]
 gi|429028143|ref|ZP_19094142.1| hypothetical protein EC960427_4113 [Escherichia coli 96.0427]
 gi|429034327|ref|ZP_19099851.1| hypothetical protein EC960939_4157 [Escherichia coli 96.0939]
 gi|429040409|ref|ZP_19105512.1| hypothetical protein EC960932_4194 [Escherichia coli 96.0932]
 gi|429046214|ref|ZP_19110928.1| hypothetical protein EC960107_4006 [Escherichia coli 96.0107]
 gi|429051687|ref|ZP_19116254.1| hypothetical protein EC970003_3802 [Escherichia coli 97.0003]
 gi|429057109|ref|ZP_19121412.1| hypothetical protein EC971742_3614 [Escherichia coli 97.1742]
 gi|429062612|ref|ZP_19126610.1| hypothetical protein EC970007_3445 [Escherichia coli 97.0007]
 gi|429068869|ref|ZP_19132328.1| hypothetical protein EC990672_4109 [Escherichia coli 99.0672]
 gi|429074787|ref|ZP_19138039.1| hypothetical protein EC990678_3880 [Escherichia coli 99.0678]
 gi|429080018|ref|ZP_19143153.1| hypothetical protein EC990713_3838 [Escherichia coli 99.0713]
 gi|429828041|ref|ZP_19359070.1| hypothetical protein EC960109_4182 [Escherichia coli 96.0109]
 gi|429834412|ref|ZP_19364729.1| hypothetical protein EC970010_4091 [Escherichia coli 97.0010]
 gi|444926502|ref|ZP_21245784.1| hypothetical protein EC09BKT78844_4136 [Escherichia coli
           09BKT078844]
 gi|444932261|ref|ZP_21251288.1| hypothetical protein EC990814_3637 [Escherichia coli 99.0814]
 gi|444937683|ref|ZP_21256450.1| hypothetical protein EC990815_3631 [Escherichia coli 99.0815]
 gi|444944641|ref|ZP_21263107.1| hypothetical protein EC990816_5026 [Escherichia coli 99.0816]
 gi|444949982|ref|ZP_21268258.1| hypothetical protein EC990839_4875 [Escherichia coli 99.0839]
 gi|444954355|ref|ZP_21272440.1| hypothetical protein EC990848_3632 [Escherichia coli 99.0848]
 gi|444959864|ref|ZP_21277707.1| hypothetical protein EC991753_3696 [Escherichia coli 99.1753]
 gi|444965031|ref|ZP_21282622.1| hypothetical protein EC991775_3520 [Escherichia coli 99.1775]
 gi|444971019|ref|ZP_21288375.1| hypothetical protein EC991793_3939 [Escherichia coli 99.1793]
 gi|444976289|ref|ZP_21293399.1| hypothetical protein EC991805_3508 [Escherichia coli 99.1805]
 gi|444981695|ref|ZP_21298605.1| hypothetical protein ECATCC700728_3527 [Escherichia coli ATCC
           700728]
 gi|444987084|ref|ZP_21303863.1| hypothetical protein ECPA11_3697 [Escherichia coli PA11]
 gi|444992395|ref|ZP_21309037.1| hypothetical protein ECPA19_3661 [Escherichia coli PA19]
 gi|444997702|ref|ZP_21314199.1| hypothetical protein ECPA13_3492 [Escherichia coli PA13]
 gi|445003275|ref|ZP_21319664.1| hypothetical protein ECPA2_3836 [Escherichia coli PA2]
 gi|445009922|ref|ZP_21326133.1| hypothetical protein ECPA47_4834 [Escherichia coli PA47]
 gi|445013811|ref|ZP_21329917.1| hypothetical protein ECPA48_3518 [Escherichia coli PA48]
 gi|445019711|ref|ZP_21335674.1| hypothetical protein ECPA8_3850 [Escherichia coli PA8]
 gi|445025095|ref|ZP_21340917.1| hypothetical protein EC71982_3761 [Escherichia coli 7.1982]
 gi|445030516|ref|ZP_21346187.1| hypothetical protein EC991781_3918 [Escherichia coli 99.1781]
 gi|445035939|ref|ZP_21351469.1| hypothetical protein EC991762_3888 [Escherichia coli 99.1762]
 gi|445042908|ref|ZP_21358262.1| hypothetical protein ECPA35_5216 [Escherichia coli PA35]
 gi|445046795|ref|ZP_21362045.1| hypothetical protein EC34880_3747 [Escherichia coli 3.4880]
 gi|445052336|ref|ZP_21367373.1| hypothetical protein EC950083_3632 [Escherichia coli 95.0083]
 gi|445058066|ref|ZP_21372924.1| hypothetical protein EC990670_3876 [Escherichia coli 99.0670]
 gi|452970719|ref|ZP_21968946.1| hypothetical protein EC4009_RS19205 [Escherichia coli O157:H7 str.
           EC4009]
 gi|29839727|sp|Q8XCU6.2|YGGU_ECO57 RecName: Full=UPF0235 protein YggU
 gi|226730816|sp|B5YQF1.1|YGGU_ECO5E RecName: Full=UPF0235 protein YggU
 gi|187770933|gb|EDU34777.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4196]
 gi|188017898|gb|EDU56020.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4113]
 gi|189000572|gb|EDU69558.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4076]
 gi|189358605|gb|EDU77024.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4401]
 gi|189363683|gb|EDU82102.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4486]
 gi|189369349|gb|EDU87765.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC4501]
 gi|189374116|gb|EDU92532.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC869]
 gi|189379790|gb|EDU98206.1| conserved hypothetical protein [Escherichia coli O157:H7 str.
           EC508]
 gi|208726948|gb|EDZ76549.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. EC4206]
 gi|208734759|gb|EDZ83446.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. EC4045]
 gi|208738388|gb|EDZ86070.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. EC4042]
 gi|209160773|gb|ACI38206.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. EC4115]
 gi|217319649|gb|EEC28074.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7
           str. TW14588]
 gi|254594305|gb|ACT73666.1| conserved protein [Escherichia coli O157:H7 str. TW14359]
 gi|290764147|gb|ADD58108.1| conserved hypothetical protein [Escherichia coli O55:H7 str.
           CB9615]
 gi|320189302|gb|EFW63961.1| hypothetical protein ECoD_04306 [Escherichia coli O157:H7 str.
           EC1212]
 gi|320640599|gb|EFX10138.1| hypothetical protein ECO5101_04264 [Escherichia coli O157:H7 str.
           G5101]
 gi|320645846|gb|EFX14831.1| hypothetical protein ECO9389_23751 [Escherichia coli O157:H- str.
           493-89]
 gi|320651146|gb|EFX19586.1| hypothetical protein ECO2687_11738 [Escherichia coli O157:H- str. H
           2687]
 gi|320656642|gb|EFX24538.1| hypothetical protein ECO7815_01895 [Escherichia coli O55:H7 str.
           3256-97 TW 07815]
 gi|320662161|gb|EFX29562.1| hypothetical protein ECO5905_09833 [Escherichia coli O55:H7 str.
           USDA 5905]
 gi|320667236|gb|EFX34199.1| hypothetical protein ECOSU61_08919 [Escherichia coli O157:H7 str.
           LSU-61]
 gi|326338959|gb|EGD62774.1| hypothetical protein ECoA_04497 [Escherichia coli O157:H7 str.
           1044]
 gi|326343159|gb|EGD66927.1| hypothetical protein ECF_02672 [Escherichia coli O157:H7 str. 1125]
 gi|374360300|gb|AEZ42007.1| hypothetical protein ECO55CA74_17220 [Escherichia coli O55:H7 str.
           RM12579]
 gi|377891561|gb|EHU56013.1| hypothetical protein ECDEC3B_4023 [Escherichia coli DEC3B]
 gi|377892466|gb|EHU56912.1| hypothetical protein ECDEC3A_3851 [Escherichia coli DEC3A]
 gi|377904303|gb|EHU68590.1| hypothetical protein ECDEC3C_4306 [Escherichia coli DEC3C]
 gi|377908234|gb|EHU72452.1| hypothetical protein ECDEC3D_4010 [Escherichia coli DEC3D]
 gi|377910609|gb|EHU74797.1| hypothetical protein ECDEC3E_4250 [Escherichia coli DEC3E]
 gi|377919313|gb|EHU83356.1| hypothetical protein ECDEC3F_4159 [Escherichia coli DEC3F]
 gi|377925148|gb|EHU89089.1| hypothetical protein ECDEC4A_3803 [Escherichia coli DEC4A]
 gi|377929290|gb|EHU93190.1| hypothetical protein ECDEC4B_3958 [Escherichia coli DEC4B]
 gi|377939778|gb|EHV03532.1| hypothetical protein ECDEC4D_3801 [Escherichia coli DEC4D]
 gi|377941123|gb|EHV04869.1| hypothetical protein ECDEC4C_3881 [Escherichia coli DEC4C]
 gi|377946702|gb|EHV10382.1| hypothetical protein ECDEC4E_3851 [Escherichia coli DEC4E]
 gi|377956556|gb|EHV20106.1| hypothetical protein ECDEC4F_3820 [Escherichia coli DEC4F]
 gi|377959700|gb|EHV23196.1| hypothetical protein ECDEC5A_3542 [Escherichia coli DEC5A]
 gi|377964297|gb|EHV27734.1| hypothetical protein ECDEC5B_3872 [Escherichia coli DEC5B]
 gi|377972011|gb|EHV35362.1| hypothetical protein ECDEC5C_3652 [Escherichia coli DEC5C]
 gi|377974401|gb|EHV37729.1| hypothetical protein ECDEC5D_3755 [Escherichia coli DEC5D]
 gi|377982545|gb|EHV45797.1| hypothetical protein ECDEC5E_3645 [Escherichia coli DEC5E]
 gi|386797602|gb|AFJ30636.1| hypothetical protein CDCO157_3580 [Escherichia coli Xuzhou21]
 gi|390639764|gb|EIN19234.1| hypothetical protein ECFRIK1996_4130 [Escherichia coli FRIK1996]
 gi|390641573|gb|EIN20998.1| hypothetical protein ECFDA517_4393 [Escherichia coli FDA517]
 gi|390641983|gb|EIN21406.1| hypothetical protein ECFDA505_4029 [Escherichia coli FDA505]
 gi|390659408|gb|EIN37175.1| hypothetical protein EC93001_4195 [Escherichia coli 93-001]
 gi|390659644|gb|EIN37399.1| hypothetical protein ECFRIK1985_4334 [Escherichia coli FRIK1985]
 gi|390662116|gb|EIN39743.1| hypothetical protein ECFRIK1990_4242 [Escherichia coli FRIK1990]
 gi|390675859|gb|EIN51982.1| hypothetical protein ECPA3_4203 [Escherichia coli PA3]
 gi|390679363|gb|EIN55275.1| hypothetical protein ECPA5_4064 [Escherichia coli PA5]
 gi|390682868|gb|EIN58611.1| hypothetical protein ECPA9_4241 [Escherichia coli PA9]
 gi|390694588|gb|EIN69160.1| hypothetical protein ECPA10_4385 [Escherichia coli PA10]
 gi|390699412|gb|EIN73762.1| hypothetical protein ECPA14_4229 [Escherichia coli PA14]
 gi|390699564|gb|EIN73907.1| hypothetical protein ECPA15_4351 [Escherichia coli PA15]
 gi|390713502|gb|EIN86440.1| hypothetical protein ECPA22_4226 [Escherichia coli PA22]
 gi|390721138|gb|EIN93839.1| hypothetical protein ECPA25_3934 [Escherichia coli PA25]
 gi|390722451|gb|EIN95122.1| hypothetical protein ECPA24_3941 [Escherichia coli PA24]
 gi|390726062|gb|EIN98539.1| hypothetical protein ECPA28_4270 [Escherichia coli PA28]
 gi|390739901|gb|EIO11059.1| hypothetical protein ECPA31_4027 [Escherichia coli PA31]
 gi|390740595|gb|EIO11715.1| hypothetical protein ECPA32_4089 [Escherichia coli PA32]
 gi|390743981|gb|EIO14926.1| hypothetical protein ECPA33_4089 [Escherichia coli PA33]
 gi|390757340|gb|EIO26829.1| hypothetical protein ECPA40_4245 [Escherichia coli PA40]
 gi|390765386|gb|EIO34555.1| hypothetical protein ECPA39_4166 [Escherichia coli PA39]
 gi|390765642|gb|EIO34805.1| hypothetical protein ECPA41_4178 [Escherichia coli PA41]
 gi|390767516|gb|EIO36599.1| hypothetical protein ECPA42_4251 [Escherichia coli PA42]
 gi|390780117|gb|EIO47817.1| hypothetical protein ECTW06591_3758 [Escherichia coli TW06591]
 gi|390788225|gb|EIO55694.1| hypothetical protein ECTW07945_4093 [Escherichia coli TW07945]
 gi|390789096|gb|EIO56561.1| hypothetical protein ECTW10246_4309 [Escherichia coli TW10246]
 gi|390795609|gb|EIO62893.1| hypothetical protein ECTW11039_4141 [Escherichia coli TW11039]
 gi|390803497|gb|EIO70503.1| hypothetical protein ECTW09098_4167 [Escherichia coli TW09098]
 gi|390806319|gb|EIO73241.1| hypothetical protein ECTW09109_4399 [Escherichia coli TW09109]
 gi|390814954|gb|EIO81503.1| hypothetical protein ECTW10119_4591 [Escherichia coli TW10119]
 gi|390824458|gb|EIO90439.1| hypothetical protein ECEC4203_4169 [Escherichia coli EC4203]
 gi|390827009|gb|EIO92803.1| hypothetical protein ECTW09195_4222 [Escherichia coli TW09195]
 gi|390829402|gb|EIO95003.1| hypothetical protein ECEC4196_4218 [Escherichia coli EC4196]
 gi|390844208|gb|EIP07960.1| hypothetical protein ECTW14313_4022 [Escherichia coli TW14313]
 gi|390844844|gb|EIP08543.1| hypothetical protein ECTW14301_3950 [Escherichia coli TW14301]
 gi|390849621|gb|EIP13043.1| hypothetical protein ECEC4421_4009 [Escherichia coli EC4421]
 gi|390859970|gb|EIP22298.1| hypothetical protein ECEC4422_4142 [Escherichia coli EC4422]
 gi|390864603|gb|EIP26711.1| hypothetical protein ECEC4013_4277 [Escherichia coli EC4013]
 gi|390868977|gb|EIP30685.1| hypothetical protein ECEC4402_4140 [Escherichia coli EC4402]
 gi|390877116|gb|EIP38067.1| hypothetical protein ECEC4439_4101 [Escherichia coli EC4439]
 gi|390882588|gb|EIP43089.1| hypothetical protein ECEC4436_4041 [Escherichia coli EC4436]
 gi|390892397|gb|EIP51985.1| hypothetical protein ECEC4437_4200 [Escherichia coli EC4437]
 gi|390894517|gb|EIP54034.1| hypothetical protein ECEC4448_4114 [Escherichia coli EC4448]
 gi|390899395|gb|EIP58643.1| hypothetical protein ECEC1738_4060 [Escherichia coli EC1738]
 gi|390907243|gb|EIP66112.1| hypothetical protein ECEC1734_4074 [Escherichia coli EC1734]
 gi|390918071|gb|EIP76487.1| hypothetical protein ECEC1863_3815 [Escherichia coli EC1863]
 gi|390919071|gb|EIP77445.1| hypothetical protein ECEC1845_4082 [Escherichia coli EC1845]
 gi|408063465|gb|EKG97957.1| hypothetical protein ECPA7_4671 [Escherichia coli PA7]
 gi|408065897|gb|EKH00367.1| hypothetical protein ECFRIK920_4215 [Escherichia coli FRIK920]
 gi|408069096|gb|EKH03510.1| hypothetical protein ECPA34_4228 [Escherichia coli PA34]
 gi|408078357|gb|EKH12530.1| hypothetical protein ECFDA506_4431 [Escherichia coli FDA506]
 gi|408081739|gb|EKH15746.1| hypothetical protein ECFDA507_4169 [Escherichia coli FDA507]
 gi|408090419|gb|EKH23696.1| hypothetical protein ECFDA504_4107 [Escherichia coli FDA504]
 gi|408096482|gb|EKH29422.1| hypothetical protein ECFRIK1999_4285 [Escherichia coli FRIK1999]
 gi|408103243|gb|EKH35628.1| hypothetical protein ECFRIK1997_4346 [Escherichia coli FRIK1997]
 gi|408107644|gb|EKH39720.1| hypothetical protein ECNE1487_4495 [Escherichia coli NE1487]
 gi|408114369|gb|EKH45931.1| hypothetical protein ECNE037_4482 [Escherichia coli NE037]
 gi|408120108|gb|EKH51138.1| hypothetical protein ECFRIK2001_4501 [Escherichia coli FRIK2001]
 gi|408126249|gb|EKH56809.1| hypothetical protein ECPA4_4245 [Escherichia coli PA4]
 gi|408136403|gb|EKH66150.1| hypothetical protein ECPA23_4114 [Escherichia coli PA23]
 gi|408139190|gb|EKH68824.1| hypothetical protein ECPA49_4249 [Escherichia coli PA49]
 gi|408145519|gb|EKH74697.1| hypothetical protein ECPA45_4297 [Escherichia coli PA45]
 gi|408154115|gb|EKH82485.1| hypothetical protein ECTT12B_3994 [Escherichia coli TT12B]
 gi|408159080|gb|EKH87183.1| hypothetical protein ECMA6_4304 [Escherichia coli MA6]
 gi|408162969|gb|EKH90856.1| hypothetical protein EC5905_4364 [Escherichia coli 5905]
 gi|408172151|gb|EKH99238.1| hypothetical protein ECCB7326_4160 [Escherichia coli CB7326]
 gi|408178731|gb|EKI05428.1| hypothetical protein ECEC96038_4055 [Escherichia coli EC96038]
 gi|408181898|gb|EKI08440.1| hypothetical protein EC5412_4089 [Escherichia coli 5412]
 gi|408215733|gb|EKI40105.1| hypothetical protein ECPA38_3948 [Escherichia coli PA38]
 gi|408225796|gb|EKI49462.1| hypothetical protein ECEC1735_4083 [Escherichia coli EC1735]
 gi|408237009|gb|EKI59876.1| hypothetical protein ECEC1736_4035 [Escherichia coli EC1736]
 gi|408240572|gb|EKI63247.1| hypothetical protein ECEC1737_4017 [Escherichia coli EC1737]
 gi|408245341|gb|EKI67733.1| hypothetical protein ECEC1846_4134 [Escherichia coli EC1846]
 gi|408254075|gb|EKI75635.1| hypothetical protein ECEC1847_4159 [Escherichia coli EC1847]
 gi|408257837|gb|EKI79134.1| hypothetical protein ECEC1848_4277 [Escherichia coli EC1848]
 gi|408264379|gb|EKI85179.1| hypothetical protein ECEC1849_4014 [Escherichia coli EC1849]
 gi|408273019|gb|EKI93085.1| hypothetical protein ECEC1850_4267 [Escherichia coli EC1850]
 gi|408275898|gb|EKI95838.1| hypothetical protein ECEC1856_4092 [Escherichia coli EC1856]
 gi|408284681|gb|EKJ03773.1| hypothetical protein ECEC1862_4098 [Escherichia coli EC1862]
 gi|408290278|gb|EKJ09015.1| hypothetical protein ECEC1864_4237 [Escherichia coli EC1864]
 gi|408306609|gb|EKJ23975.1| hypothetical protein ECEC1868_4272 [Escherichia coli EC1868]
 gi|408307128|gb|EKJ24490.1| hypothetical protein ECEC1866_3999 [Escherichia coli EC1866]
 gi|408317913|gb|EKJ34143.1| hypothetical protein ECEC1869_4258 [Escherichia coli EC1869]
 gi|408323973|gb|EKJ39934.1| hypothetical protein ECEC1870_4053 [Escherichia coli EC1870]
 gi|408325324|gb|EKJ41208.1| hypothetical protein ECNE098_4243 [Escherichia coli NE098]
 gi|408335767|gb|EKJ50605.1| hypothetical protein ECFRIK523_4102 [Escherichia coli FRIK523]
 gi|408345484|gb|EKJ59826.1| hypothetical protein EC01304_4217 [Escherichia coli 0.1304]
 gi|408548244|gb|EKK25629.1| hypothetical protein EC34870_4161 [Escherichia coli 3.4870]
 gi|408548350|gb|EKK25734.1| hypothetical protein EC52239_4116 [Escherichia coli 5.2239]
 gi|408549719|gb|EKK27079.1| hypothetical protein EC60172_4274 [Escherichia coli 6.0172]
 gi|408567340|gb|EKK43400.1| hypothetical protein EC80586_4420 [Escherichia coli 8.0586]
 gi|408577694|gb|EKK53253.1| hypothetical protein EC100833_4404 [Escherichia coli 10.0833]
 gi|408580262|gb|EKK55680.1| hypothetical protein EC82524_4031 [Escherichia coli 8.2524]
 gi|408590339|gb|EKK64821.1| hypothetical protein EC100869_3943 [Escherichia coli 10.0869]
 gi|408595585|gb|EKK69820.1| hypothetical protein EC880221_4075 [Escherichia coli 88.0221]
 gi|408600345|gb|EKK74204.1| hypothetical protein EC80416_3758 [Escherichia coli 8.0416]
 gi|408611792|gb|EKK85152.1| hypothetical protein EC100821_3681 [Escherichia coli 10.0821]
 gi|427203506|gb|EKV73811.1| hypothetical protein EC881042_4122 [Escherichia coli 88.1042]
 gi|427204642|gb|EKV74917.1| hypothetical protein EC890511_4065 [Escherichia coli 89.0511]
 gi|427207235|gb|EKV77413.1| hypothetical protein EC881467_4170 [Escherichia coli 88.1467]
 gi|427219702|gb|EKV88663.1| hypothetical protein EC900091_4460 [Escherichia coli 90.0091]
 gi|427223376|gb|EKV92135.1| hypothetical protein EC902281_4071 [Escherichia coli 90.2281]
 gi|427226047|gb|EKV94655.1| hypothetical protein EC900039_3958 [Escherichia coli 90.0039]
 gi|427240638|gb|EKW08091.1| hypothetical protein EC930056_4051 [Escherichia coli 93.0056]
 gi|427240768|gb|EKW08220.1| hypothetical protein EC930055_3997 [Escherichia coli 93.0055]
 gi|427244519|gb|EKW11838.1| hypothetical protein EC940618_3950 [Escherichia coli 94.0618]
 gi|427258878|gb|EKW24954.1| hypothetical protein EC950183_4137 [Escherichia coli 95.0183]
 gi|427259960|gb|EKW25980.1| hypothetical protein EC950943_4236 [Escherichia coli 95.0943]
 gi|427262613|gb|EKW28477.1| hypothetical protein EC951288_3842 [Escherichia coli 95.1288]
 gi|427275170|gb|EKW39793.1| hypothetical protein EC960428_3960 [Escherichia coli 96.0428]
 gi|427277888|gb|EKW42398.1| hypothetical protein EC960427_4113 [Escherichia coli 96.0427]
 gi|427282071|gb|EKW46351.1| hypothetical protein EC960939_4157 [Escherichia coli 96.0939]
 gi|427290555|gb|EKW54026.1| hypothetical protein EC960932_4194 [Escherichia coli 96.0932]
 gi|427297955|gb|EKW60979.1| hypothetical protein EC960107_4006 [Escherichia coli 96.0107]
 gi|427299439|gb|EKW62413.1| hypothetical protein EC970003_3802 [Escherichia coli 97.0003]
 gi|427310604|gb|EKW72847.1| hypothetical protein EC971742_3614 [Escherichia coli 97.1742]
 gi|427313532|gb|EKW75639.1| hypothetical protein EC970007_3445 [Escherichia coli 97.0007]
 gi|427318089|gb|EKW79972.1| hypothetical protein EC990672_4109 [Escherichia coli 99.0672]
 gi|427326821|gb|EKW88228.1| hypothetical protein EC990678_3880 [Escherichia coli 99.0678]
 gi|427328316|gb|EKW89684.1| hypothetical protein EC990713_3838 [Escherichia coli 99.0713]
 gi|429252444|gb|EKY36982.1| hypothetical protein EC960109_4182 [Escherichia coli 96.0109]
 gi|429253855|gb|EKY38309.1| hypothetical protein EC970010_4091 [Escherichia coli 97.0010]
 gi|444536740|gb|ELV16740.1| hypothetical protein EC990814_3637 [Escherichia coli 99.0814]
 gi|444538377|gb|ELV18245.1| hypothetical protein EC09BKT78844_4136 [Escherichia coli
           09BKT078844]
 gi|444546620|gb|ELV25320.1| hypothetical protein EC990815_3631 [Escherichia coli 99.0815]
 gi|444553536|gb|ELV31152.1| hypothetical protein EC990816_5026 [Escherichia coli 99.0816]
 gi|444553968|gb|ELV31557.1| hypothetical protein EC990839_4875 [Escherichia coli 99.0839]
 gi|444561924|gb|ELV39026.1| hypothetical protein EC990848_3632 [Escherichia coli 99.0848]
 gi|444571265|gb|ELV47753.1| hypothetical protein EC991753_3696 [Escherichia coli 99.1753]
 gi|444574741|gb|ELV51007.1| hypothetical protein EC991775_3520 [Escherichia coli 99.1775]
 gi|444578183|gb|ELV54271.1| hypothetical protein EC991793_3939 [Escherichia coli 99.1793]
 gi|444591720|gb|ELV66991.1| hypothetical protein ECPA11_3697 [Escherichia coli PA11]
 gi|444592534|gb|ELV67793.1| hypothetical protein ECATCC700728_3527 [Escherichia coli ATCC
           700728]
 gi|444593125|gb|ELV68357.1| hypothetical protein EC991805_3508 [Escherichia coli 99.1805]
 gi|444605439|gb|ELV80081.1| hypothetical protein ECPA13_3492 [Escherichia coli PA13]
 gi|444606221|gb|ELV80847.1| hypothetical protein ECPA19_3661 [Escherichia coli PA19]
 gi|444614793|gb|ELV89019.1| hypothetical protein ECPA2_3836 [Escherichia coli PA2]
 gi|444617978|gb|ELV92077.1| hypothetical protein ECPA47_4834 [Escherichia coli PA47]
 gi|444622709|gb|ELV96654.1| hypothetical protein ECPA48_3518 [Escherichia coli PA48]
 gi|444628910|gb|ELW02647.1| hypothetical protein ECPA8_3850 [Escherichia coli PA8]
 gi|444637474|gb|ELW10848.1| hypothetical protein EC71982_3761 [Escherichia coli 7.1982]
 gi|444639967|gb|ELW13264.1| hypothetical protein EC991781_3918 [Escherichia coli 99.1781]
 gi|444644035|gb|ELW17161.1| hypothetical protein EC991762_3888 [Escherichia coli 99.1762]
 gi|444650590|gb|ELW23418.1| hypothetical protein ECPA35_5216 [Escherichia coli PA35]
 gi|444659101|gb|ELW31538.1| hypothetical protein EC34880_3747 [Escherichia coli 3.4880]
 gi|444662049|gb|ELW34318.1| hypothetical protein EC950083_3632 [Escherichia coli 95.0083]
 gi|444669221|gb|ELW41219.1| hypothetical protein EC990670_3876 [Escherichia coli 99.0670]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|331664536|ref|ZP_08365442.1| conserved hypothetical protein [Escherichia coli TA143]
 gi|331058467|gb|EGI30448.1| conserved hypothetical protein [Escherichia coli TA143]
          Length = 96

 Score = 39.7 bits (91), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKHIKIINPQQIPPE 90


>gi|156039347|ref|XP_001586781.1| predicted protein [Sclerotinia sclerotiorum 1980]
 gi|154697547|gb|EDN97285.1| predicted protein [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 128

 Score = 39.3 bits (90), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 21/75 (28%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 39  SVSITIHAKP---GSKSCSITDVSDEA--VGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           ++ +  H KP    S++ +I   +D +  + + +  PA+D EAN  ++E +  VL   + 
Sbjct: 19  TIHLRCHVKPNVSASRAGAIKPFTDTSPILEICVPEPARDNEANEGVIELLGDVLLCPKT 78

Query: 94  QVSIGSGSKSRDKIV 108
            + +  G KSRDK++
Sbjct: 79  DLEVIRGKKSRDKVI 93


>gi|170766160|ref|ZP_02900971.1| conserved hypothetical protein [Escherichia albertii TW07627]
 gi|170125306|gb|EDS94237.1| conserved hypothetical protein [Escherichia albertii TW07627]
          Length = 96

 Score = 39.3 bits (90), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRDK---IVIVEEITPE 116
             R K   I+  ++I PE
Sbjct: 73  LGRHKQIRIINPQQIPPE 90


>gi|146098877|ref|XP_001468495.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134072863|emb|CAM71579.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 204

 Score = 39.3 bits (90), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 5/62 (8%)

Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          C+  V P    +T+HAKP +++ +    +T    EA  ++I AP  +G+ANA LL Y+  
Sbjct: 14 CLVQVQPGKYLLTVHAKPSARASAFAAPLTPALTEA-DLRIAAPPVEGQANAELLRYLGE 72

Query: 87 VL 88
          ++
Sbjct: 73 LI 74


>gi|283788525|ref|YP_003368390.1| hypothetical protein ROD_50331 [Citrobacter rodentium ICC168]
 gi|282951979|emb|CBG91706.1| conserved hypothetical protein [Citrobacter rodentium ICC168]
          Length = 96

 Score = 39.3 bits (90), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 42/78 (53%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+AN+ L++++     V + QV+I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVAIEKGE 72

Query: 102 KSRDKIVIV---EEITPE 116
             R K V +   ++I PE
Sbjct: 73  LGRHKQVKILHPQQIPPE 90


>gi|282162781|ref|YP_003355166.1| hypothetical protein MCP_0111 [Methanocella paludicola SANAE]
 gi|282155095|dbj|BAI60183.1| conserved hypothetical protein [Methanocella paludicola SANAE]
          Length = 104

 Score = 39.3 bits (90), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 3/89 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEA---VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           V I     PG+K   +    +E    +  ++ AP + G AN  L+  +S++LG+   ++ 
Sbjct: 14  VLIDFEVSPGAKETRVPSGYNEWRRRIEARLKAPPERGRANEELIGELSALLGIPESRIE 73

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSLGKA 125
           I SG++   K V V   + E VL  LG A
Sbjct: 74  ITSGARDSRKSVKVLGASREEVLRRLGGA 102


>gi|62129245|gb|AAX66948.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Choleraesuis str. SC-B67]
          Length = 100

 Score = 39.3 bits (90), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 76

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 77  LGRHKQVKIIHPQQIPPE 94


>gi|410623768|ref|ZP_11334579.1| fused signal recognition particle receptor [Glaciecola pallidula
           DSM 14239 = ACAM 615]
 gi|410156669|dbj|GAC29953.1| fused signal recognition particle receptor [Glaciecola pallidula
           DSM 14239 = ACAM 615]
          Length = 414

 Score = 39.3 bits (90), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 47/97 (48%), Gaps = 8/97 (8%)

Query: 11  AKSAGSTQSKIKTNDENLPSCIRLV------PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
           A +AG  Q+K    +E L   +R++       P  V +T+ A  G  + S T + +EAVG
Sbjct: 294 ADTAGRLQNKQHLIEE-LKKIVRVMKKINLDAPHEVMLTLDASTGQNAVSQTKIFNEAVG 352

Query: 65  VQ-IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           +  I+    DG A   ++  ++   G+  R + IG G
Sbjct: 353 ISGINLTKLDGTAKGGIIFSLADQFGIPIRYIGIGEG 389


>gi|398022248|ref|XP_003864286.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322502521|emb|CBZ37604.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 204

 Score = 39.3 bits (90), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 36/62 (58%), Gaps = 5/62 (8%)

Query: 31 CIRLVPPSSVSITIHAKPGSKSCS----ITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
          C+  V P    +T+HAKP +++ +    +T    EA  ++I AP  +G+ANA LL Y+  
Sbjct: 14 CLVQVQPGKYLLTVHAKPSARASAFAAPLTPALTEA-DLRIAAPPVEGQANAELLRYLGE 72

Query: 87 VL 88
          ++
Sbjct: 73 LI 74


>gi|186683679|ref|YP_001866875.1| hypothetical protein Npun_R3528 [Nostoc punctiforme PCC 73102]
 gi|226703856|sp|B2J1T0.1|Y3528_NOSP7 RecName: Full=UPF0235 protein Npun_R3528
 gi|186466131|gb|ACC81932.1| protein of unknown function DUF167 [Nostoc punctiforme PCC 73102]
          Length = 75

 Score = 39.3 bits (90), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 20/68 (29%), Positives = 37/68 (54%)

Query: 44  IHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKS 103
           +  KP SK   I +  D ++ V + +P  DG+AN  L++ ++    V +  + I SG  S
Sbjct: 5   VKVKPNSKQQKIEEQPDGSLTVYLKSPPVDGKANEELIKLLAKKFDVAKSDIRIKSGLSS 64

Query: 104 RDKIVIVE 111
           R K++ ++
Sbjct: 65  RQKLIEID 72


>gi|387830805|ref|YP_003350742.1| hypothetical protein ECSF_2752 [Escherichia coli SE15]
 gi|417663515|ref|ZP_12313095.1| hypothetical protein ECAA86_03160 [Escherichia coli AA86]
 gi|432398880|ref|ZP_19641655.1| hypothetical protein WEI_03820 [Escherichia coli KTE25]
 gi|432408005|ref|ZP_19650709.1| hypothetical protein WEO_03207 [Escherichia coli KTE28]
 gi|432423271|ref|ZP_19665810.1| hypothetical protein A137_03700 [Escherichia coli KTE178]
 gi|432724400|ref|ZP_19959314.1| hypothetical protein WE1_03448 [Escherichia coli KTE17]
 gi|432728980|ref|ZP_19963855.1| hypothetical protein WE3_03450 [Escherichia coli KTE18]
 gi|432742670|ref|ZP_19977385.1| hypothetical protein WEE_03381 [Escherichia coli KTE23]
 gi|432890307|ref|ZP_20103239.1| hypothetical protein A31K_00326 [Escherichia coli KTE165]
 gi|432992033|ref|ZP_20180692.1| hypothetical protein A179_03827 [Escherichia coli KTE217]
 gi|433112164|ref|ZP_20298020.1| hypothetical protein WK9_03041 [Escherichia coli KTE150]
 gi|281179962|dbj|BAI56292.1| conserved hypothetical protein [Escherichia coli SE15]
 gi|330908988|gb|EGH37502.1| hypothetical protein ECAA86_03160 [Escherichia coli AA86]
 gi|430913485|gb|ELC34606.1| hypothetical protein WEI_03820 [Escherichia coli KTE25]
 gi|430928006|gb|ELC48557.1| hypothetical protein WEO_03207 [Escherichia coli KTE28]
 gi|430942580|gb|ELC62711.1| hypothetical protein A137_03700 [Escherichia coli KTE178]
 gi|431263334|gb|ELF55320.1| hypothetical protein WE1_03448 [Escherichia coli KTE17]
 gi|431271576|gb|ELF62695.1| hypothetical protein WE3_03450 [Escherichia coli KTE18]
 gi|431281828|gb|ELF72726.1| hypothetical protein WEE_03381 [Escherichia coli KTE23]
 gi|431431432|gb|ELH13207.1| hypothetical protein A31K_00326 [Escherichia coli KTE165]
 gi|431492302|gb|ELH71903.1| hypothetical protein A179_03827 [Escherichia coli KTE217]
 gi|431626034|gb|ELI94586.1| hypothetical protein WK9_03041 [Escherichia coli KTE150]
          Length = 96

 Score = 39.3 bits (90), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|118430921|ref|NP_147028.2| hypothetical protein APE_0182.1 [Aeropyrum pernix K1]
 gi|150421715|sp|Q9YFR7.2|Y182_AERPE RecName: Full=UPF0235 protein APE_0182.1
 gi|116062246|dbj|BAA79094.2| conserved hypothetical protein [Aeropyrum pernix K1]
          Length = 108

 Score = 39.3 bits (90), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 45/86 (52%), Gaps = 2/86 (2%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + V I ++ KP  +   +     E V    D P  +G ANA+L+ +++  L V  + + I
Sbjct: 17  NRVRIRVYVKPEGRERRLRLEEGELV-FYTDEPPLEGRANASLINFLARGLKVSVKNIEI 75

Query: 98  GSGSKSRDKIVIVEEIT-PENVLNSL 122
             G++SR K+V + ++  P+ +L  L
Sbjct: 76  VHGARSRSKVVEIRDVADPDALLERL 101


>gi|397163642|ref|ZP_10487100.1| hypothetical protein Y71_3939 [Enterobacter radicincitans DSM
           16656]
 gi|396094197|gb|EJI91749.1| hypothetical protein Y71_3939 [Enterobacter radicincitans DSM
           16656]
          Length = 96

 Score = 39.3 bits (90), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 35/65 (53%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   +I  +  + + V I AP  DG+ANA L++Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDNIIGLHGDELKVAITAPPVDGQANAHLVKYLAKQFRVAKNQVVIEKGE 72

Query: 102 KSRDK 106
             R K
Sbjct: 73  LGRHK 77


>gi|417347962|ref|ZP_12127031.1| hypothetical protein SeGA_1270, partial [Salmonella enterica subsp.
           enterica serovar Gaminara str. A4-567]
 gi|353576903|gb|EHC39234.1| hypothetical protein SeGA_1270, partial [Salmonella enterica subsp.
           enterica serovar Gaminara str. A4-567]
          Length = 116

 Score = 39.3 bits (90), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 33  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 92

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 93  LGRHKQVKIIHPQQIPPE 110


>gi|187732088|ref|YP_001881726.1| hypothetical protein SbBS512_E3385 [Shigella boydii CDC 3083-94]
 gi|188496122|ref|ZP_03003392.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638]
 gi|331643646|ref|ZP_08344777.1| conserved hypothetical protein [Escherichia coli H736]
 gi|882482|gb|AAA69120.1| ORF_o100 [Escherichia coli str. K-12 substr. MG1655]
 gi|81246835|gb|ABB67543.1| conserved hypothetical protein [Shigella boydii Sb227]
 gi|187429080|gb|ACD08354.1| conserved hypothetical protein [Shigella boydii CDC 3083-94]
 gi|188491321|gb|EDU66424.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638]
 gi|331037117|gb|EGI09341.1| conserved hypothetical protein [Escherichia coli H736]
          Length = 100

 Score = 39.3 bits (90), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 76

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 77  LGRHKQIKIINPQQIPPE 94


>gi|163851553|ref|YP_001639596.1| hypothetical protein Mext_2130 [Methylobacterium extorquens PA1]
 gi|259646591|sp|A9W4L9.1|Y2130_METEP RecName: Full=UPF0235 protein Mext_2130
 gi|163663158|gb|ABY30525.1| protein of unknown function DUF167 [Methylobacterium extorquens
           PA1]
          Length = 105

 Score = 39.3 bits (90), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 47/99 (47%), Gaps = 17/99 (17%)

Query: 17  TQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPA 71
           TQS        L   +RL P +S               +  V  EA G     +++ AP 
Sbjct: 2   TQSPFTLEANGLVLAVRLTPRAS------------RTGLDGVRTEASGRPVLSLRVAAPP 49

Query: 72  KDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
            +G ANAAL  +++  LG+++ +V++ SG  SR K +++
Sbjct: 50  VEGAANAALTAFVAKSLGLRKAEVTLLSGETSRTKRLLL 88


>gi|417393422|ref|ZP_12155935.1| hypothetical protein LTSEMIN_4774, partial [Salmonella enterica
           subsp. enterica serovar Minnesota str. A4-603]
 gi|353608805|gb|EHC62286.1| hypothetical protein LTSEMIN_4774, partial [Salmonella enterica
           subsp. enterica serovar Minnesota str. A4-603]
          Length = 107

 Score = 39.3 bits (90), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 24  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 83

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 84  LGRHKQVKIIHPQQIPPE 101


>gi|432490720|ref|ZP_19732584.1| hypothetical protein A171_02647 [Escherichia coli KTE213]
 gi|432840746|ref|ZP_20074206.1| hypothetical protein A1YQ_03702 [Escherichia coli KTE140]
 gi|433204645|ref|ZP_20388401.1| hypothetical protein WGY_03224 [Escherichia coli KTE95]
 gi|431018768|gb|ELD32198.1| hypothetical protein A171_02647 [Escherichia coli KTE213]
 gi|431387376|gb|ELG71200.1| hypothetical protein A1YQ_03702 [Escherichia coli KTE140]
 gi|431718082|gb|ELJ82163.1| hypothetical protein WGY_03224 [Escherichia coli KTE95]
          Length = 96

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +  + +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRFYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQIKIINPQQIPPE 90


>gi|91212335|ref|YP_542321.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89]
 gi|110806861|ref|YP_690381.1| hypothetical protein SFV_3007 [Shigella flexneri 5 str. 8401]
 gi|157155173|ref|YP_001464306.1| hypothetical protein EcE24377A_3297 [Escherichia coli E24377A]
 gi|157162414|ref|YP_001459732.1| hypothetical protein EcHS_A3113 [Escherichia coli HS]
 gi|237706394|ref|ZP_04536875.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
 gi|331654465|ref|ZP_08355465.1| conserved hypothetical protein [Escherichia coli M718]
 gi|331674433|ref|ZP_08375193.1| conserved hypothetical protein [Escherichia coli TA280]
 gi|331678947|ref|ZP_08379621.1| conserved hypothetical protein [Escherichia coli H591]
 gi|332280353|ref|ZP_08392766.1| conserved hypothetical protein [Shigella sp. D9]
 gi|344915316|ref|NP_708718.3| hypothetical protein SF2944 [Shigella flexneri 2a str. 301]
 gi|386600951|ref|YP_006102457.1| hypothetical protein ECOK1_3341 [Escherichia coli IHE3034]
 gi|424839247|ref|ZP_18263884.1| hypothetical protein SF5M90T_2933 [Shigella flexneri 5a str. M90T]
 gi|427806133|ref|ZP_18973200.1| hypothetical protein BN16_35611 [Escherichia coli chi7122]
 gi|427810726|ref|ZP_18977791.1| hypothetical protein BN17_28661 [Escherichia coli]
 gi|26109782|gb|AAN81987.1|AE016766_75 Hypothetical protein yggU [Escherichia coli CFT073]
 gi|24053355|gb|AAN44425.1| conserved hypothetical protein [Shigella flexneri 2a str. 301]
 gi|30042526|gb|AAP18250.1| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T]
 gi|91073909|gb|ABE08790.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89]
 gi|110616409|gb|ABF05076.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401]
 gi|157068094|gb|ABV07349.1| conserved hypothetical protein TIGR00251 [Escherichia coli HS]
 gi|157077203|gb|ABV16911.1| conserved hypothetical protein TIGR00251 [Escherichia coli E24377A]
 gi|195183146|dbj|BAG66691.1| predicted protein [Escherichia coli O111:H-]
 gi|226899434|gb|EEH85693.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA]
 gi|294491613|gb|ADE90369.1| conserved hypothetical protein TIGR00251 [Escherichia coli IHE3034]
 gi|331047847|gb|EGI19924.1| conserved hypothetical protein [Escherichia coli M718]
 gi|331068527|gb|EGI39922.1| conserved hypothetical protein [Escherichia coli TA280]
 gi|331073777|gb|EGI45098.1| conserved hypothetical protein [Escherichia coli H591]
 gi|332102705|gb|EGJ06051.1| conserved hypothetical protein [Shigella sp. D9]
 gi|383468299|gb|EID63320.1| hypothetical protein SF5M90T_2933 [Shigella flexneri 5a str. M90T]
 gi|412964315|emb|CCK48243.1| hypothetical protein BN16_35611 [Escherichia coli chi7122]
 gi|412970905|emb|CCJ45557.1| hypothetical protein BN17_28661 [Escherichia coli]
          Length = 100

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 76

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 77  LGRHKQIKIINPQQIPPE 94


>gi|12517499|gb|AAG58084.1|AE005525_10 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933]
 gi|13363301|dbj|BAB37252.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai]
 gi|209760028|gb|ACI78326.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760030|gb|ACI78327.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760032|gb|ACI78328.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760034|gb|ACI78329.1| hypothetical protein ECs3829 [Escherichia coli]
 gi|209760036|gb|ACI78330.1| hypothetical protein ECs3829 [Escherichia coli]
          Length = 100

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 76

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 77  LGRHKQIKIINPQQIPPE 94


>gi|224373471|ref|YP_002607843.1| hypothetical protein NAMH_1451 [Nautilia profundicola AmH]
 gi|223589921|gb|ACM93657.1| conserved hypothetical protein [Nautilia profundicola AmH]
          Length = 94

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 39/71 (54%), Gaps = 1/71 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I I A+P S    I     E++ V I APA +G AN  L++++     + + +++I  
Sbjct: 9   IYIKIKAQPNSSKNKIAGKYGESLKVNIKAPAVEGAANKELIKFIGKEFKIPKSEIAI-K 67

Query: 100 GSKSRDKIVIV 110
           G  S+ K++IV
Sbjct: 68  GETSKQKVLIV 78


>gi|157960952|ref|YP_001500986.1| hypothetical protein Spea_1124 [Shewanella pealeana ATCC 700345]
 gi|157845952|gb|ABV86451.1| protein of unknown function DUF167 [Shewanella pealeana ATCC
           700345]
          Length = 90

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  E + + I AP  DG+ANA L +Y+S    V +  + I  G 
Sbjct: 8   LQLYIQPKASRDQIVGLHGEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGDIDIIKGQ 67

Query: 102 KSRDKIVIVE--EITPENVLNSL 122
             R K V +   ++ P  V N L
Sbjct: 68  MGRHKQVRITAPKLIPAEVQNLL 90


>gi|119775613|ref|YP_928353.1| hypothetical protein Sama_2480 [Shewanella amazonensis SB2B]
 gi|189039696|sp|A1S8H7.1|Y2480_SHEAM RecName: Full=UPF0235 protein Sama_2480
 gi|119768113|gb|ABM00684.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
          Length = 95

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 22/83 (26%), Positives = 43/83 (51%), Gaps = 2/83 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    +  +  E + + I AP  DG+ANA + + ++    V + +VSI  G 
Sbjct: 13  LALYVQPKASRDELVGLHGEELKLAITAPPVDGKANAHICKLLAKAFKVPKGKVSIERGE 72

Query: 102 KSRDKIVIVE--EITPENVLNSL 122
             R K+V ++  EI P++    L
Sbjct: 73  LGRHKLVRIQAPEIIPDDFAQFL 95


>gi|218710632|ref|YP_002418253.1| hypothetical protein VS_2686 [Vibrio splendidus LGP32]
 gi|218323651|emb|CAV19947.1| Hypothetical protein VS_2686 [Vibrio splendidus LGP32]
          Length = 96

 Score = 39.3 bits (90), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 36/69 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L +Y++    V + Q++I  
Sbjct: 12  ILLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQITIEK 71

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 72  GELGRHKQV 80


>gi|161506346|ref|YP_001573458.1| hypothetical protein SARI_04543 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|189030103|sp|A9MQS2.1|YGGU_SALAR RecName: Full=UPF0235 protein YggU
 gi|160867693|gb|ABX24316.1| hypothetical protein SARI_04543 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 96

 Score = 39.3 bits (90), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + V V I AP  DG+AN+ L++++     V + QV+I  G 
Sbjct: 13  LRLYIQPKASRDCIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVAIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|28373966|pdb|1N91|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia
           Coli. Northeast Structural Genomics Consortium Target
           Er14.
 gi|60594359|pdb|1YH5|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia
           Coli. Northeast Structural Genomics Consortium Target
           Er14
          Length = 108

 Score = 39.3 bits (90), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 76

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 77  LGRHKQIKIINPQQIPPE 94


>gi|348027991|ref|YP_004870677.1| signal recognition particle-docking protein FtsY [Glaciecola
           nitratireducens FR1064]
 gi|347945334|gb|AEP28684.1| signal recognition particle-docking protein FtsY [Glaciecola
           nitratireducens FR1064]
          Length = 446

 Score = 39.3 bits (90), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 47/97 (48%), Gaps = 8/97 (8%)

Query: 11  AKSAGSTQSKIKTNDENLPSCIRLV------PPSSVSITIHAKPGSKSCSITDVSDEAVG 64
           A +AG  Q+K    +E L   +R++       P  V +T+ A  G  + S T + +EAVG
Sbjct: 326 ADTAGRLQNKNHLIEE-LKKVVRVMKKINPNAPHEVMLTLDASTGQNAVSQTKIFNEAVG 384

Query: 65  VQ-IDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           +  I+    DG A   ++  ++   G+  R + IG G
Sbjct: 385 ISGINLTKLDGTAKGGIIFSLADQFGIPIRYIGIGEG 421


>gi|331648708|ref|ZP_08349796.1| conserved hypothetical protein [Escherichia coli M605]
 gi|331042455|gb|EGI14597.1| conserved hypothetical protein [Escherichia coli M605]
          Length = 100

 Score = 38.9 bits (89), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 76

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 77  LGRHKQIKIINPQQIPPE 94


>gi|240103723|ref|YP_002960032.1| hypothetical protein TGAM_1666 [Thermococcus gammatolerans EJ3]
 gi|239911277|gb|ACS34168.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
          Length = 124

 Score = 38.9 bits (89), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 48/85 (56%), Gaps = 8/85 (9%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG---VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           + ++ +P +K   I  + DE  G   V++ AP   G+AN  L++++S VLG    +V + 
Sbjct: 44  LLVYVQPKAKKNEIEGI-DEWRGRLKVKVKAPPVGGKANKELVKFLSKVLGA---EVELV 99

Query: 99  SGSKSRDKIVIVEEITPENVLNSLG 123
            G  SR+K ++V  ++ E+V   LG
Sbjct: 100 RGETSREKDLLV-RMSAEDVKKRLG 123


>gi|212723588|ref|NP_001132046.1| uncharacterized protein LOC100193457 [Zea mays]
 gi|194693286|gb|ACF80727.1| unknown [Zea mays]
 gi|414884279|tpg|DAA60293.1| TPA: hypothetical protein ZEAMMB73_531342 [Zea mays]
 gi|414884280|tpg|DAA60294.1| TPA: hypothetical protein ZEAMMB73_531342 [Zea mays]
          Length = 213

 Score = 38.9 bits (89), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 39/77 (50%)

Query: 24  NDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEY 83
           +D  +P CI  +    V + I  +  ++  +IT V+ + V V + A A  GEAN+ LLE+
Sbjct: 128 HDAPVPPCITQLEGGLVQVAIEVEDRAQRSAITRVNADDVRVTVAALAARGEANSELLEF 187

Query: 84  MSSVLGVKRRQVSIGSG 100
           M  V+      + +  G
Sbjct: 188 MGKVIFFNDTLILLWCG 204


>gi|406886119|gb|EKD33196.1| hypothetical protein ACD_76C00070G0002 [uncultured bacterium]
          Length = 72

 Score = 38.9 bits (89), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 36/69 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I + AKP +    +  + D    V I  P   G+AN A++  ++    V R QV+I S
Sbjct: 1   MRIFVKAKPLAGEEKVEKLDDSHFIVSIKEPPVKGQANRAIIGALADYFHVTRAQVAIVS 60

Query: 100 GSKSRDKIV 108
           G  SR+KI+
Sbjct: 61  GGISRNKII 69


>gi|334125563|ref|ZP_08499552.1| protein of hypothetical function DUF167 [Enterobacter hormaechei
           ATCC 49162]
 gi|419958575|ref|ZP_14474638.1| hypothetical protein PGS1_12632 [Enterobacter cloacae subsp.
           cloacae GS1]
 gi|295097492|emb|CBK86582.1| conserved hypothetical protein TIGR00251 [Enterobacter cloacae
           subsp. cloacae NCTC 9394]
 gi|333387026|gb|EGK58230.1| protein of hypothetical function DUF167 [Enterobacter hormaechei
           ATCC 49162]
 gi|388606478|gb|EIM35685.1| hypothetical protein PGS1_12632 [Enterobacter cloacae subsp.
           cloacae GS1]
          Length = 98

 Score = 38.9 bits (89), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  + P+++
Sbjct: 73  LGRHKQVKI--LNPQSI 87


>gi|145300496|ref|YP_001143337.1| hypothetical protein ASA_3628 [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|418362895|ref|ZP_12963513.1| hypothetical protein IYQ_21388 [Aeromonas salmonicida subsp.
           salmonicida 01-B526]
 gi|166232617|sp|A4SRR7.1|Y3628_AERS4 RecName: Full=UPF0235 protein ASA_3628
 gi|142853268|gb|ABO91589.1| conserved hypothetical protein [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|356685901|gb|EHI50520.1| hypothetical protein IYQ_21388 [Aeromonas salmonicida subsp.
           salmonicida 01-B526]
          Length = 99

 Score = 38.9 bits (89), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 34/65 (52%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +    I  +  + + V I AP  DG+AN+ L++Y++    V + QV I  G   R K
Sbjct: 18  QPKASRDQIVGLHGDELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQVRIVRGELGRHK 77

Query: 107 IVIVE 111
            V +E
Sbjct: 78  TVAIE 82


>gi|428778544|ref|YP_007170330.1| hypothetical protein Dacsa_0158 [Dactylococcopsis salina PCC 8305]
 gi|428692823|gb|AFZ48973.1| hypothetical protein Dacsa_0158 [Dactylococcopsis salina PCC 8305]
          Length = 73

 Score = 38.9 bits (89), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 41/70 (58%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +++  KP SK   +    +  + +++ +P  +G+ANA L++ ++   GV++ QV I SG 
Sbjct: 4   LSVKVKPNSKQQKVETGENGDLIIRLKSPPVEGKANAELIKLLAKTYGVRKSQVKIKSGL 63

Query: 102 KSRDKIVIVE 111
            ++ K V +E
Sbjct: 64  TAKIKQVEIE 73


>gi|398344957|ref|ZP_10529660.1| hypothetical protein LinasL1_18322 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 74

 Score = 38.9 bits (89), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KP SK   +    D    V +  PA +G+AN A+L  ++   GV RR V +  
Sbjct: 1   MRIEVRVKPNSKKPFVKKGEDLTWIVAVREPAIEGKANDAVLAAIAEEFGVSRRCVRLVH 60

Query: 100 GSKSRDKIV-IVEE 112
           G K + K++ IVE+
Sbjct: 61  GEKGKRKLIEIVED 74


>gi|193212041|ref|YP_001997994.1| hypothetical protein Cpar_0370 [Chlorobaculum parvum NCIB 8327]
 gi|226705834|sp|B3QL06.1|Y370_CHLP8 RecName: Full=UPF0235 protein Cpar_0370
 gi|193085518|gb|ACF10794.1| protein of unknown function DUF167 [Chlorobaculum parvum NCIB 8327]
          Length = 105

 Score = 38.9 bits (89), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 22/81 (27%), Positives = 40/81 (49%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           +++  +P S    I     + V + + +   D  AN    + ++  LGV R  VS+ +G 
Sbjct: 13  LSVRVQPRSSKTGIAGRYGDQVKICLKSAPVDNAANKECCQLLAKTLGVPRSNVSVMNGQ 72

Query: 102 KSRDKIVIVEEITPENVLNSL 122
            SR K++ VE +TP  +  +L
Sbjct: 73  TSRSKVLKVEGMTPSELRKAL 93


>gi|296284807|ref|ZP_06862805.1| hypothetical protein CbatJ_14361 [Citromicrobium bathyomarinum
           JL354]
          Length = 92

 Score = 38.9 bits (89), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 41/67 (61%), Gaps = 1/67 (1%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +   PG+++ S+  + D  V V++ A  +DG AN A+ E ++  LG+ + + ++  G+
Sbjct: 23  LRVRVTPGARTESLA-IVDGGVQVKVRAKPQDGAANVAVAELVAKALGIPKSRCTLLRGA 81

Query: 102 KSRDKIV 108
            SR+K++
Sbjct: 82  TSREKVL 88


>gi|444348227|ref|ZP_21155939.1| hypothetical protein S23A_0823 [Aggregatibacter
          actinomycetemcomitans serotype b str. S23A]
 gi|443547513|gb|ELT56994.1| hypothetical protein S23A_0823 [Aggregatibacter
          actinomycetemcomitans serotype b str. S23A]
          Length = 73

 Score = 38.9 bits (89), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 16/53 (30%), Positives = 31/53 (58%)

Query: 38 SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGV 90
          +++ + I  +P +    I  + D+ + + I AP  DG+ANA LL+++S +  V
Sbjct: 9  ANLRLRIFLQPKAAKNQIVGLHDDELKISITAPPVDGQANAHLLKFLSKLFKV 61


>gi|238786243|ref|ZP_04630189.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC
           43970]
 gi|238798802|ref|ZP_04642272.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC
           43969]
 gi|238712858|gb|EEQ04924.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC
           43970]
 gi|238717373|gb|EEQ09219.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC
           43969]
          Length = 90

 Score = 38.9 bits (89), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 7   LRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQVIIEKGE 66

Query: 102 KSRD---KIVIVEEITPE 116
             R    KIV  ++I PE
Sbjct: 67  LGRHKQIKIVNPQQIPPE 84


>gi|322834241|ref|YP_004214268.1| hypothetical protein Rahaq_3549 [Rahnella sp. Y9602]
 gi|384259423|ref|YP_005403357.1| hypothetical protein Q7S_17885 [Rahnella aquatilis HX2]
 gi|321169442|gb|ADW75141.1| protein of unknown function DUF167 [Rahnella sp. Y9602]
 gi|380755399|gb|AFE59790.1| hypothetical protein Q7S_17885 [Rahnella aquatilis HX2]
          Length = 101

 Score = 38.9 bits (89), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +   S+  +  + + V I AP  DG+AN  L+++++    V + QVSI  G   R K
Sbjct: 22  QPKASRDSLVGLHGDELKVAITAPPVDGQANTHLVKFLAKQFKVAKSQVSIEKGELGRHK 81

Query: 107 IVIVEEITPENV 118
            V +    P+N+
Sbjct: 82  QVRITH--PQNI 91


>gi|261342370|ref|ZP_05970228.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316]
 gi|288315005|gb|EFC53943.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316]
          Length = 98

 Score = 38.9 bits (89), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  + P+++
Sbjct: 73  LGRHKQVKI--LNPQSI 87


>gi|435852308|ref|YP_007313894.1| TIGR00251 family protein [Methanomethylovorans hollandica DSM
           15978]
 gi|433662938|gb|AGB50364.1| TIGR00251 family protein [Methanomethylovorans hollandica DSM
           15978]
          Length = 105

 Score = 38.9 bits (89), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 50/88 (56%), Gaps = 3/88 (3%)

Query: 40  VSITIHAKPGSKSCSI---TDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           + I +   PGSK   +    +   + + V++   A+ G+AN  L+E+++++ GV    VS
Sbjct: 14  IVIDLEVTPGSKRACVPSGYNPWRKRIEVKLSQAAQKGKANEQLIEHVAALFGVAGSSVS 73

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSLGK 124
           I SG+K+  K ++V+++  +  +  LG+
Sbjct: 74  IISGAKNSQKSILVKKVDMDLAVAVLGR 101


>gi|401765167|ref|YP_006580174.1| hypothetical protein ECENHK_18595 [Enterobacter cloacae subsp.
           cloacae ENHKU01]
 gi|400176701|gb|AFP71550.1| hypothetical protein ECENHK_18595 [Enterobacter cloacae subsp.
           cloacae ENHKU01]
          Length = 98

 Score = 38.9 bits (89), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  + P+++
Sbjct: 73  LGRHKQVKI--LNPQSI 87


>gi|420368917|ref|ZP_14869648.1| hypothetical protein SF123566_10107 [Shigella flexneri 1235-66]
 gi|391321688|gb|EIQ78405.1| hypothetical protein SF123566_10107 [Shigella flexneri 1235-66]
          Length = 96

 Score = 38.9 bits (89), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+AN+ L++++     V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|320101820|ref|YP_004177411.1| hypothetical protein Isop_0265 [Isosphaera pallida ATCC 43644]
 gi|319749102|gb|ADV60862.1| protein of unknown function DUF167 [Isosphaera pallida ATCC 43644]
          Length = 131

 Score = 38.9 bits (89), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 16/49 (32%), Positives = 34/49 (69%)

Query: 74  GEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           G+ANAA+LE ++ +LG+ + ++ + SG+ +R K+V +  + P+ V  ++
Sbjct: 66  GKANAAILEILADLLGIAKSRLDLVSGATARSKVVRIAGLEPDQVRQAI 114


>gi|442751115|gb|JAA67717.1| Hypothetical protein [Ixodes ricinus]
          Length = 172

 Score = 38.9 bits (89), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 39 SVSITIHAKPGSKSCSITDVSDEAVGVQIDAP 70
          +V+I +HAKPG+    ITD+  + VG+QI AP
Sbjct: 59 TVAIRVHAKPGASESRITDIGTDGVGIQIAAP 90


>gi|152991249|ref|YP_001356971.1| hypothetical protein NIS_1507 [Nitratiruptor sp. SB155-2]
 gi|151423110|dbj|BAF70614.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2]
          Length = 95

 Score = 38.9 bits (89), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 19/76 (25%), Positives = 39/76 (51%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           +    V + I A+P +    I  +  +++ + I APA +G AN  L++++S    V +  
Sbjct: 5   IEDDQVHMFIKAQPNASKNKIAGILGDSLKIAIKAPAVEGAANKELVKFLSKTFKVAKSD 64

Query: 95  VSIGSGSKSRDKIVIV 110
           +   SG  S+ K +++
Sbjct: 65  IVFASGETSKRKHIVM 80


>gi|169600439|ref|XP_001793642.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15]
 gi|111068664|gb|EAT89784.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15]
          Length = 132

 Score = 38.9 bits (89), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 42/86 (48%), Gaps = 7/86 (8%)

Query: 38  SSVSITIHAKPGSKSCS--ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           S   IT+  KPG       I  VSD  V + +   A +G AN ++   ++  L V +  V
Sbjct: 22  SQFLITLFVKPGVSRMREGIAAVSDSHVFMNVANYAFEGSANKSVQVLLAKTLSVPKSHV 81

Query: 96  SIGSGSKSRDKI----VIVEEITPEN 117
           SI  G  SR+K+    + V E TPEN
Sbjct: 82  SIVKGLTSREKVAEVKLFVNE-TPEN 106


>gi|375257352|ref|YP_005016522.1| hypothetical protein KOX_02710 [Klebsiella oxytoca KCTC 1686]
 gi|397659952|ref|YP_006500654.1| hypothetical protein A225_4982 [Klebsiella oxytoca E718]
 gi|402840010|ref|ZP_10888481.1| TIGR00251 family protein [Klebsiella sp. OBRC7]
 gi|423104835|ref|ZP_17092537.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5242]
 gi|365906830|gb|AEX02283.1| hypothetical protein KOX_02710 [Klebsiella oxytoca KCTC 1686]
 gi|376382798|gb|EHS95531.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5242]
 gi|394348051|gb|AFN34172.1| hypothetical protein A225_4982 [Klebsiella oxytoca E718]
 gi|402287246|gb|EJU35702.1| TIGR00251 family protein [Klebsiella sp. OBRC7]
          Length = 96

 Score = 38.9 bits (89), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQVLIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K + +  I P+ +
Sbjct: 73  LGRHKQIKI--IHPQQI 87


>gi|56695839|ref|YP_166190.1| hypothetical protein SPO0937 [Ruegeria pomeroyi DSS-3]
 gi|56677576|gb|AAV94242.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3]
          Length = 92

 Score = 38.9 bits (89), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 22/84 (26%), Positives = 45/84 (53%), Gaps = 2/84 (2%)

Query: 26  ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMS 85
           ++LP    L  P    I +   P +   S+T ++ E + + + AP +DG+AN A+ + ++
Sbjct: 7   KSLPDLSHLARPGQ-EIALRVTPKAARDSVT-LAGEGLRITVTAPPEDGKANEAVRKLLA 64

Query: 86  SVLGVKRRQVSIGSGSKSRDKIVI 109
             +GV   ++++  G  +RDK  +
Sbjct: 65  RAMGVAPSRLTLRRGQTARDKTFV 88


>gi|397665310|ref|YP_006506848.1| hypothetical protein LPO_3001 [Legionella pneumophila subsp.
           pneumophila]
 gi|395128721|emb|CCD06939.1| conserved protein of unknown function [Legionella pneumophila
           subsp. pneumophila]
          Length = 70

 Score = 38.9 bits (89), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 15/51 (29%), Positives = 30/51 (58%)

Query: 58  VSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           +SD+ + + + A  ++GEAN  LL ++S    + + Q+ +  G  SR K++
Sbjct: 3   ISDDRLHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKGSRHKLI 53


>gi|345300766|ref|YP_004830124.1| hypothetical protein Entas_3624 [Enterobacter asburiae LF7a]
 gi|345094703|gb|AEN66339.1| UPF0235 protein yggU [Enterobacter asburiae LF7a]
          Length = 98

 Score = 38.9 bits (89), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|354725127|ref|ZP_09039342.1| hypothetical protein EmorL2_19872 [Enterobacter mori LMG 25706]
          Length = 98

 Score = 38.9 bits (89), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|423110314|ref|ZP_17098009.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5243]
 gi|423116248|ref|ZP_17103939.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5245]
 gi|376378430|gb|EHS91189.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5245]
 gi|376380299|gb|EHS93047.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5243]
          Length = 96

 Score = 38.9 bits (89), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQVLIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K + +  I P+ +
Sbjct: 73  LGRHKQIKI--IHPQQI 87


>gi|149200997|ref|ZP_01877972.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035]
 gi|149145330|gb|EDM33356.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035]
          Length = 84

 Score = 38.9 bits (89), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 23/85 (27%), Positives = 46/85 (54%), Gaps = 2/85 (2%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
           ++P    L  P + ++T+   P +    I    D+ + V +    +DG+ANAA+ + ++ 
Sbjct: 2   DIPDLSALAQPGT-TLTLRVTPKAARNRIV-AEDDVLRVYVTTVPEDGKANAAVQKLLAK 59

Query: 87  VLGVKRRQVSIGSGSKSRDKIVIVE 111
            LGV + ++++  G  SRDK+  V+
Sbjct: 60  ALGVPKSRLTLLRGHTSRDKVFQVD 84


>gi|392980616|ref|YP_006479204.1| hypothetical protein A3UG_18950 [Enterobacter cloacae subsp.
           dissolvens SDM]
 gi|392326549|gb|AFM61502.1| hypothetical protein A3UG_18950 [Enterobacter cloacae subsp.
           dissolvens SDM]
          Length = 95

 Score = 38.9 bits (89), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|365972041|ref|YP_004953602.1| protein YggU [Enterobacter cloacae EcWSU1]
 gi|365750954|gb|AEW75181.1| YggU [Enterobacter cloacae EcWSU1]
          Length = 102

 Score = 38.9 bits (89), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 76

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  + P+++
Sbjct: 77  LGRHKQVKI--LNPQSI 91


>gi|417533700|ref|ZP_12187666.1| hypothetical protein LTSEURB_4649, partial [Salmonella enterica
           subsp. enterica serovar Urbana str. R8-2977]
 gi|353660107|gb|EHC99813.1| hypothetical protein LTSEURB_4649, partial [Salmonella enterica
           subsp. enterica serovar Urbana str. R8-2977]
          Length = 117

 Score = 38.9 bits (89), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L++++     V + Q+ I  G 
Sbjct: 34  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQIVIEKGE 93

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 94  LGRHKQVKIIHPQQIPPE 111


>gi|441504382|ref|ZP_20986376.1| Hypothetical protein C942_01104 [Photobacterium sp. AK15]
 gi|441427849|gb|ELR65317.1| Hypothetical protein C942_01104 [Photobacterium sp. AK15]
          Length = 97

 Score = 38.5 bits (88), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 36/70 (51%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 15  IHLYIQPKASRDKIVGLHGDELKVAITAPPVDGKANAHLSKYLAKQFRVAKGQVVIEKGE 74

Query: 102 KSRDKIVIVE 111
             R K V VE
Sbjct: 75  LGRHKQVRVE 84


>gi|421498972|ref|ZP_15946039.1| hypothetical protein B224_003147 [Aeromonas media WS]
 gi|407182012|gb|EKE56002.1| hypothetical protein B224_003147 [Aeromonas media WS]
          Length = 107

 Score = 38.5 bits (88), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 2/87 (2%)

Query: 25  DENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYM 84
           D+ +P+   L+   ++ + +  +P +    I  +  + + V I AP  DG+AN+ L++Y+
Sbjct: 3   DDLMPAV--LLEGDALVLHLMIQPKASRDQIVGLHGDELKVAITAPPVDGQANSHLIKYL 60

Query: 85  SSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           +    V + QV I  G   R K V +E
Sbjct: 61  AKQCKVAKGQVRIVRGELGRHKTVAIE 87


>gi|421726340|ref|ZP_16165514.1| hypothetical protein KOXM_12724 [Klebsiella oxytoca M5al]
 gi|423125762|ref|ZP_17113441.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5250]
 gi|376398843|gb|EHT11466.1| UPF0235 protein yggU [Klebsiella oxytoca 10-5250]
 gi|410372932|gb|EKP27639.1| hypothetical protein KOXM_12724 [Klebsiella oxytoca M5al]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQVLIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K + +  I P+ +
Sbjct: 73  LGRHKQIKI--IHPQQI 87


>gi|32475275|ref|NP_868269.1| hypothetical protein RB8260 [Rhodopirellula baltica SH 1]
 gi|47117454|sp|Q7UFY2.1|Y8260_RHOBA RecName: Full=UPF0235 protein RB8260
 gi|32445816|emb|CAD78547.1| conserved hypothetical protein [Rhodopirellula baltica SH 1]
          Length = 108

 Score = 38.5 bits (88), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 39/70 (55%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
             +   P +K  S+  + D A+ V +    +DG+AN A++  ++  L V + +V+I +G 
Sbjct: 20  FRVRVTPKAKKASVGGLHDGALKVSVHTVPEDGKANKAVIASLAKWLRVSKGRVAIVAGE 79

Query: 102 KSRDKIVIVE 111
            SR K ++VE
Sbjct: 80  TSRLKTIVVE 89


>gi|114046767|ref|YP_737317.1| hypothetical protein Shewmr7_1261 [Shewanella sp. MR-7]
 gi|123131606|sp|Q0HX95.1|Y1261_SHESR RecName: Full=UPF0235 protein Shewmr7_1261
 gi|113888209|gb|ABI42260.1| conserved hypothetical protein [Shewanella sp. MR-7]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  V I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSDVHILKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K+V +    P+NV
Sbjct: 73  LGRHKLVRIS--APKNV 87


>gi|298674092|ref|YP_003725842.1| hypothetical protein Metev_0115 [Methanohalobium evestigatum
           Z-7303]
 gi|298287080|gb|ADI73046.1| protein of unknown function DUF167 [Methanohalobium evestigatum
           Z-7303]
          Length = 109

 Score = 38.5 bits (88), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 3/81 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEA---VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           V I I   P SK+ ++ D  ++    + V++   A+ G+AN  L+E  S   GV +  + 
Sbjct: 14  VVIDISVTPNSKTINVPDNYNQWRNRIEVKLTQKAESGKANNQLIENFSEFFGVNKSSIK 73

Query: 97  IGSGSKSRDKIVIVEEITPEN 117
           I SG KS  K V V+ ++ ++
Sbjct: 74  ITSGEKSSQKSVSVKGLSYDD 94


>gi|295106861|emb|CBL04404.1| Uncharacterized conserved protein [Gordonibacter pamelaeae
           7-10-1-b]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           I +H  P S    ++ V  +A G     V++ AP   G+AN A+ + ++  LGV + +V 
Sbjct: 6   IAVHVTPRSGRDEVSGVRADAAGADEVCVRVTAPPDGGKANKAVCKLVAEALGVPKSRVG 65

Query: 97  IGSGSKSRDKIVIVE 111
           + SG  +R K + VE
Sbjct: 66  VASGHTARRKRLSVE 80


>gi|445151195|ref|ZP_21390145.1| hypothetical protein SEEDHWS_015290 [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
 gi|444856095|gb|ELX81133.1| hypothetical protein SEEDHWS_015290 [Salmonella enterica subsp.
           enterica serovar Dublin str. HWS51]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 13  LRLYIQPKASHDSIVGLHGDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQIVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|389861630|ref|YP_006363870.1| hypothetical protein TCELL_1309 [Thermogladius cellulolyticus 1633]
 gi|388526534|gb|AFK51732.1| hypothetical protein TCELL_1309 [Thermogladius cellulolyticus 1633]
          Length = 117

 Score = 38.5 bits (88), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 1/85 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + I  +PGS +  +    +E V  Q   P   G  NAAL+ Y+S  L +   ++ I  
Sbjct: 25  VILNIIVQPGSPTEGLVIEGNELV-FQTVEPEGRGRENAALVRYLSKNLNIPVSKIDIVY 83

Query: 100 GSKSRDKIVIVEEITPENVLNSLGK 124
           G++ R K V++ +I  E ++  L +
Sbjct: 84  GTRERYKRVLLMDINEERLITLLSR 108


>gi|341613815|ref|ZP_08700684.1| hypothetical protein CJLT1_02642 [Citromicrobium sp. JLT1363]
          Length = 92

 Score = 38.5 bits (88), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 41/69 (59%), Gaps = 1/69 (1%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +   PG+++ SI+ + D  V V++ A  +DG ANAA+   ++  LG+ +    +  G+
Sbjct: 23  LRVRVTPGARTESIS-ILDGGVQVKVRAKPQDGAANAAVEALVAKALGIPKSHCRLLRGA 81

Query: 102 KSRDKIVIV 110
            SR+K++ V
Sbjct: 82  TSREKVLAV 90


>gi|337288806|ref|YP_004628278.1| hypothetical protein TOPB45_1269 [Thermodesulfobacterium sp. OPB45]
 gi|334902544|gb|AEH23350.1| UPF0235 protein yggU [Thermodesulfobacterium geofontis OPF15]
          Length = 91

 Score = 38.5 bits (88), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 18/71 (25%), Positives = 38/71 (53%), Gaps = 1/71 (1%)

Query: 42  ITIHAKPGSKSCSITDVSD-EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           + I  KPGS    +    +   + + + A  +  +AN +L E++ ++  + ++++ I  G
Sbjct: 3   LEIKVKPGSSKDKLLQFKEPNFLEISLKAQPEKNKANESLCEFLGNIFKIPKKEIKILKG 62

Query: 101 SKSRDKIVIVE 111
             SR+KIV +E
Sbjct: 63  KTSRNKIVKIE 73


>gi|378717160|ref|YP_005282049.1| hypothetical protein GPOL_c16340 [Gordonia polyisoprenivorans VH2]
 gi|375751863|gb|AFA72683.1| protein of unknown function DUF167 [Gordonia polyisoprenivorans
           VH2]
          Length = 75

 Score = 38.5 bits (88), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 41/71 (57%), Gaps = 3/71 (4%)

Query: 37  PSSVSITIHAKPGSKSCSITDVS-DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P  V +T+  KP S+   +     D ++ + +  PA +G+AN A+ E +++ LGV + +V
Sbjct: 2   PQQVVVTV--KPNSRKGPLVQTGPDGSITIYVREPATEGKANKAVAELLAAHLGVPKSRV 59

Query: 96  SIGSGSKSRDK 106
           ++  G+ SR K
Sbjct: 60  NLIGGATSRTK 70


>gi|51597526|ref|YP_071717.1| hypothetical protein YPTB3216 [Yersinia pseudotuberculosis IP
           32953]
 gi|153948675|ref|YP_001399811.1| hypothetical protein YpsIP31758_0827 [Yersinia pseudotuberculosis
           IP 31758]
 gi|81638596|sp|Q666N2.1|Y3216_YERPS RecName: Full=UPF0235 protein YPTB3216
 gi|167016819|sp|A7FEY3.1|Y827_YERP3 RecName: Full=UPF0235 protein YpsIP31758_0827
 gi|51590808|emb|CAH22454.1| Conserved hypothetical protein [Yersinia pseudotuberculosis IP
           32953]
 gi|152960170|gb|ABS47631.1| conserved hypothetical protein TIGR00251 [Yersinia
           pseudotuberculosis IP 31758]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LKLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIV---EEITPE 116
             R K + V   ++I PE
Sbjct: 73  LGRHKQIKVINPQQIPPE 90


>gi|421615099|ref|ZP_16056136.1| protein containing DUF167 [Rhodopirellula baltica SH28]
 gi|408494135|gb|EKJ98756.1| protein containing DUF167 [Rhodopirellula baltica SH28]
          Length = 107

 Score = 38.5 bits (88), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 39/70 (55%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
             +   P +K  S+  + D A+ V +    +DG+AN A++  ++  L V + +V+I +G 
Sbjct: 19  FRVRVTPKAKKASVGGLHDGALKVSVHTVPEDGKANKAVIASLAKWLRVSKGRVAIVAGE 78

Query: 102 KSRDKIVIVE 111
            SR K ++VE
Sbjct: 79  TSRLKTIVVE 88


>gi|407642209|ref|YP_006805968.1| hypothetical protein O3I_005155 [Nocardia brasiliensis ATCC 700358]
 gi|407305093|gb|AFT98993.1| hypothetical protein O3I_005155 [Nocardia brasiliensis ATCC 700358]
          Length = 76

 Score = 38.5 bits (88), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 41/71 (57%), Gaps = 3/71 (4%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           P+ V  TI  KP S+   + +  D+  + + + APA +G+AN A +E +++  GV +  V
Sbjct: 2   PTIVRATI--KPNSRKGPLVETLDDGTLQLYVRAPAVEGKANKAAIELLAAHYGVPKSAV 59

Query: 96  SIGSGSKSRDK 106
            + +G+ SR K
Sbjct: 60  RLAAGATSRHK 70


>gi|417949353|ref|ZP_12592489.1| hypothetical protein VISP3789_20455 [Vibrio splendidus ATCC 33789]
 gi|342808311|gb|EGU43471.1| hypothetical protein VISP3789_20455 [Vibrio splendidus ATCC 33789]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 35/69 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L +Y++    V + Q+ I  
Sbjct: 12  ILLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQIKIEK 71

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 72  GELGRHKQV 80


>gi|395778780|ref|ZP_10459292.1| TIGR00251 family protein [Bartonella elizabethae Re6043vi]
 gi|423714906|ref|ZP_17689130.1| TIGR00251 family protein [Bartonella elizabethae F9251]
 gi|395417988|gb|EJF84325.1| TIGR00251 family protein [Bartonella elizabethae Re6043vi]
 gi|395430390|gb|EJF96432.1| TIGR00251 family protein [Bartonella elizabethae F9251]
          Length = 104

 Score = 38.5 bits (88), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 11/87 (12%)

Query: 22  KTNDENLPSCIRLVPPSSVS--ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAA 79
           + +  NL   + L+P SSV   I I +K G K         + + +++ A  +DG+AN A
Sbjct: 4   QVDKNNLILFVYLIPKSSVDKIIGIESKDGEK---------QYLVIRLRAVPEDGKANKA 54

Query: 80  LLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           L++++S    +    +S+ SG+ SR K
Sbjct: 55  LIKFLSKQWKIPSSSISLKSGATSRYK 81


>gi|218530362|ref|YP_002421178.1| hypothetical protein Mchl_2407 [Methylobacterium extorquens CM4]
 gi|259646623|sp|B7KZL8.1|Y2407_METC4 RecName: Full=UPF0235 protein Mchl_2407
 gi|218522665|gb|ACK83250.1| protein of unknown function DUF167 [Methylobacterium extorquens
           CM4]
          Length = 105

 Score = 38.5 bits (88), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 17/95 (17%)

Query: 17  TQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPA 71
           TQS        L   +RL P +S               +  V  EA G     +++ AP 
Sbjct: 2   TQSPFTLEANGLVLAVRLTPRAS------------RTGLDGVRTEASGRPVLSLRVAAPP 49

Query: 72  KDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
            +G ANAAL  +++  LG+++ +V++ SG  SR K
Sbjct: 50  VEGAANAALTAFVAKSLGLRKAEVTLLSGETSRTK 84


>gi|407070393|ref|ZP_11101231.1| hypothetical protein VcycZ_12648 [Vibrio cyclitrophicus ZF14]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 35/69 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L +Y++    V + Q+ I  
Sbjct: 12  ILLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQIKIEK 71

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 72  GELGRHKQV 80


>gi|378961190|ref|YP_005218676.1| hypothetical protein STBHUCCB_31810 [Salmonella enterica subsp.
           enterica serovar Typhi str. P-stx-12]
 gi|417343685|ref|ZP_12124208.1| hypothetical protein LTSEBAI_4180 [Salmonella enterica subsp.
           enterica serovar Baildon str. R6-199]
 gi|417367895|ref|ZP_12139636.1| hypothetical protein LTSEHVI_4116 [Salmonella enterica subsp.
           enterica serovar Hvittingfoss str. A4-620]
 gi|417375806|ref|ZP_12145167.1| hypothetical protein LTSEINV_4610 [Salmonella enterica subsp.
           enterica serovar Inverness str. R8-3668]
 gi|417427681|ref|ZP_12160778.1| hypothetical protein LTSEMIS_4185 [Salmonella enterica subsp.
           enterica serovar Mississippi str. A4-633]
 gi|417481312|ref|ZP_12171931.1| hypothetical protein LTSERUB_4910 [Salmonella enterica subsp.
           enterica serovar Rubislaw str. A4-653]
 gi|353587993|gb|EHC47154.1| hypothetical protein LTSEHVI_4116 [Salmonella enterica subsp.
           enterica serovar Hvittingfoss str. A4-620]
 gi|353595147|gb|EHC52466.1| hypothetical protein LTSEINV_4610 [Salmonella enterica subsp.
           enterica serovar Inverness str. R8-3668]
 gi|353616369|gb|EHC67657.1| hypothetical protein LTSEMIS_4185 [Salmonella enterica subsp.
           enterica serovar Mississippi str. A4-633]
 gi|353635806|gb|EHC82015.1| hypothetical protein LTSERUB_4910 [Salmonella enterica subsp.
           enterica serovar Rubislaw str. A4-653]
 gi|357955106|gb|EHJ81030.1| hypothetical protein LTSEBAI_4180 [Salmonella enterica subsp.
           enterica serovar Baildon str. R6-199]
 gi|374355062|gb|AEZ46823.1| hypothetical protein STBHUCCB_31810 [Salmonella enterica subsp.
           enterica serovar Typhi str. P-stx-12]
          Length = 93

 Score = 38.5 bits (88), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 10  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQIVIEKGE 69

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 70  LGRHKQVKIIHPQQIPPE 87


>gi|326403567|ref|YP_004283649.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301]
 gi|325050429|dbj|BAJ80767.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301]
          Length = 131

 Score = 38.5 bits (88), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 16/41 (39%), Positives = 31/41 (75%)

Query: 70  PAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           P +DG AN A+L+ +++ LGV   ++++ +G ++RDK+V+V
Sbjct: 60  PPEDGRANDAVLKALAAWLGVGAARLALRAGGQARDKLVLV 100


>gi|84394062|ref|ZP_00992798.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01]
 gi|84375304|gb|EAP92215.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 35/69 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L +Y++    V + Q+ I  
Sbjct: 12  ILLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQIKIEK 71

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 72  GELGRHKQV 80


>gi|86146416|ref|ZP_01064740.1| hypothetical protein MED222_22586 [Vibrio sp. MED222]
 gi|85835895|gb|EAQ54029.1| hypothetical protein MED222_22586 [Vibrio sp. MED222]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 35/69 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L +Y++    V + Q+ I  
Sbjct: 12  ILLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQIKIEK 71

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 72  GELGRHKQV 80


>gi|383191437|ref|YP_005201565.1| hypothetical protein Rahaq2_3630 [Rahnella aquatilis CIP 78.65 =
           ATCC 33071]
 gi|371589695|gb|AEX53425.1| TIGR00251 family protein [Rahnella aquatilis CIP 78.65 = ATCC
           33071]
          Length = 101

 Score = 38.5 bits (88), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 2/72 (2%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +   S+  +  + + V I AP  DG+AN  L+++++    V + Q+SI  G   R K
Sbjct: 22  QPKASRDSLVGLHGDELKVAITAPPVDGQANTHLVKFLAKQFKVAKSQISIEKGELGRHK 81

Query: 107 IVIVEEITPENV 118
            V +    P+N+
Sbjct: 82  QVRITH--PQNI 91


>gi|240138720|ref|YP_002963192.1| hypothetical protein MexAM1_META1p2120 [Methylobacterium extorquens
           AM1]
 gi|418059840|ref|ZP_12697776.1| protein of unknown function DUF167 [Methylobacterium extorquens DSM
           13060]
 gi|240008689|gb|ACS39915.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium
           extorquens AM1]
 gi|373566610|gb|EHP92603.1| protein of unknown function DUF167 [Methylobacterium extorquens DSM
           13060]
          Length = 105

 Score = 38.5 bits (88), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 17/95 (17%)

Query: 17  TQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPA 71
           TQS        L   +RL P +S               +  V  EA G     +++ AP 
Sbjct: 2   TQSPFTLEANGLVLAVRLTPRAS------------RTGLDGVRTEASGQPVLSLRVAAPP 49

Query: 72  KDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
            +G ANAAL  +++  LG+++ +V++ SG  SR K
Sbjct: 50  VEGAANAALTAFVAKSLGLRKAEVTLLSGETSRTK 84


>gi|148980491|ref|ZP_01816088.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3]
 gi|145961216|gb|EDK26530.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 35/69 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L +Y++    V + Q+ I  
Sbjct: 12  ILLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQIKIEK 71

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 72  GELGRHKQV 80


>gi|254561320|ref|YP_003068415.1| hypothetical protein METDI2902 [Methylobacterium extorquens DM4]
 gi|254268598|emb|CAX24557.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium
           extorquens DM4]
          Length = 105

 Score = 38.5 bits (88), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 44/95 (46%), Gaps = 17/95 (17%)

Query: 17  TQSKIKTNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPA 71
           TQS        L   +RL P +S               +  V  EA G     +++ AP 
Sbjct: 2   TQSPFTLEANGLVLAVRLTPRAS------------RTGLDGVRTEASGRPVLSLRVAAPP 49

Query: 72  KDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
            +G ANAAL  +++  LG+++ +V++ SG  SR K
Sbjct: 50  VEGAANAALTAFVAKSLGLRKAEVTLVSGETSRTK 84


>gi|170023077|ref|YP_001719582.1| hypothetical protein YPK_0828 [Yersinia pseudotuberculosis YPIII]
 gi|226708089|sp|B1JNN7.1|Y828_YERPY RecName: Full=UPF0235 protein YPK_0828
 gi|169749611|gb|ACA67129.1| protein of unknown function DUF167 [Yersinia pseudotuberculosis
           YPIII]
          Length = 96

 Score = 38.5 bits (88), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LKLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIV---EEITPE 116
             R K + V   ++I PE
Sbjct: 73  LGRHKQIKVINPQQIPPE 90


>gi|297568085|ref|YP_003689429.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus
           AHT2]
 gi|296924000|gb|ADH84810.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus
           AHT2]
          Length = 76

 Score = 38.5 bits (88), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 38/70 (54%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I  KP S+S S+T  +D +   ++ +P  +G AN  L+  ++     ++  V I +GS
Sbjct: 5   LQIKVKPRSQSSSLTQEADGSWLARLKSPPVEGRANRELIALVADHFRCRKADVEIKAGS 64

Query: 102 KSRDKIVIVE 111
             R K+V VE
Sbjct: 65  SGRTKLVRVE 74


>gi|423122090|ref|ZP_17109774.1| TIGR00251 family protein [Klebsiella oxytoca 10-5246]
 gi|376392719|gb|EHT05381.1| TIGR00251 family protein [Klebsiella oxytoca 10-5246]
          Length = 96

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + + V I AP  DG+ANA L+++++    V + QV +  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQVLLEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K + +  I P+ +
Sbjct: 73  LGRHKQIKI--INPQQI 87


>gi|289811236|ref|ZP_06541865.1| hypothetical protein Salmonellaentericaenterica_45672 [Salmonella
           enterica subsp. enterica serovar Typhi str. AG3]
          Length = 94

 Score = 38.1 bits (87), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 11  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQIVIEKGE 70

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 71  LGRHKQVKIIHPQQIPPE 88


>gi|16761877|ref|NP_457494.1| hypothetical protein STY3255 [Salmonella enterica subsp. enterica
           serovar Typhi str. CT18]
 gi|29143364|ref|NP_806706.1| hypothetical protein t3014 [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|161616066|ref|YP_001590031.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|167552006|ref|ZP_02345759.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Saintpaul str. SARA29]
 gi|168234347|ref|ZP_02659405.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|168236170|ref|ZP_02661228.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|168264452|ref|ZP_02686425.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|168463711|ref|ZP_02697628.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|194470418|ref|ZP_03076402.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|194735973|ref|YP_002116050.1| hypothetical protein SeSA_A3276 [Salmonella enterica subsp.
           enterica serovar Schwarzengrund str. CVM19633]
 gi|198244387|ref|YP_002217077.1| hypothetical protein SeD_A3445 [Salmonella enterica subsp. enterica
           serovar Dublin str. CT_02021853]
 gi|200388001|ref|ZP_03214613.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
 gi|205354025|ref|YP_002227826.1| hypothetical protein SG2996 [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|207858363|ref|YP_002245014.1| hypothetical protein SEN2945 [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|213027996|ref|ZP_03342443.1| hypothetical protein Salmonelentericaenterica_39070 [Salmonella
           enterica subsp. enterica serovar Typhi str. 404ty]
 gi|213051778|ref|ZP_03344656.1| hypothetical protein Salmoneentericaenterica_01893 [Salmonella
           enterica subsp. enterica serovar Typhi str. E00-7866]
 gi|213422843|ref|ZP_03355881.1| hypothetical protein Salmonentericaenterica_35767 [Salmonella
           enterica subsp. enterica serovar Typhi str. E01-6750]
 gi|213424105|ref|ZP_03356998.1| hypothetical protein SentesTyphi_00030 [Salmonella enterica subsp.
           enterica serovar Typhi str. E02-1180]
 gi|213580619|ref|ZP_03362445.1| hypothetical protein SentesTyph_05132 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-0664]
 gi|213609099|ref|ZP_03368925.1| hypothetical protein SentesTyp_00567 [Salmonella enterica subsp.
           enterica serovar Typhi str. E98-2068]
 gi|213648208|ref|ZP_03378261.1| hypothetical protein SentesTy_13504 [Salmonella enterica subsp.
           enterica serovar Typhi str. J185]
 gi|213850158|ref|ZP_03381056.1| hypothetical protein SentesT_00598 [Salmonella enterica subsp.
           enterica serovar Typhi str. M223]
 gi|289825938|ref|ZP_06545097.1| hypothetical protein Salmonellentericaenterica_11175 [Salmonella
           enterica subsp. enterica serovar Typhi str. E98-3139]
 gi|375120582|ref|ZP_09765749.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Dublin str. SD3246]
 gi|375124888|ref|ZP_09770052.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Gallinarum str. SG9]
 gi|378956719|ref|YP_005214206.1| hypothetical protein SPUL_3104 [Salmonella enterica subsp. enterica
           serovar Gallinarum/pullorum str. RKS5078]
 gi|417520491|ref|ZP_12182394.1| Hypothetical protein LTSEUGA_4347 [Salmonella enterica subsp.
           enterica serovar Uganda str. R8-3404]
 gi|418760877|ref|ZP_13317029.1| hypothetical protein SEEN185_09925 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35185]
 gi|418766137|ref|ZP_13322216.1| hypothetical protein SEEN199_07843 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35199]
 gi|418771463|ref|ZP_13327470.1| hypothetical protein SEEN539_17297 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21539]
 gi|418773768|ref|ZP_13329741.1| hypothetical protein SEEN953_05911 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 33953]
 gi|418778425|ref|ZP_13334335.1| hypothetical protein SEEN188_20492 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35188]
 gi|418783397|ref|ZP_13339244.1| hypothetical protein SEEN559_07379 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21559]
 gi|418801331|ref|ZP_13356968.1| hypothetical protein SEEN202_18461 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35202]
 gi|418869682|ref|ZP_13424115.1| hypothetical protein SEEN176_07477 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 4176]
 gi|419786962|ref|ZP_14312677.1| hypothetical protein SEENLE01_09156 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|419793356|ref|ZP_14318979.1| hypothetical protein SEENLE15_01345 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|421360688|ref|ZP_15810964.1| hypothetical protein SEEE3139_21675 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|421363462|ref|ZP_15813704.1| hypothetical protein SEEE0166_12594 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|421369786|ref|ZP_15819961.1| hypothetical protein SEEE0631_21444 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|421374229|ref|ZP_15824360.1| hypothetical protein SEEE0424_21088 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|421378833|ref|ZP_15828912.1| hypothetical protein SEEE3076_21526 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|421383497|ref|ZP_15833535.1| hypothetical protein SEEE4917_22116 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|421384856|ref|ZP_15834879.1| hypothetical protein SEEE6622_06164 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|421389501|ref|ZP_15839484.1| hypothetical protein SEEE6670_06791 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|421396787|ref|ZP_15846712.1| hypothetical protein SEEE6426_20833 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|421399566|ref|ZP_15849461.1| hypothetical protein SEEE6437_12540 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|421405944|ref|ZP_15855769.1| hypothetical protein SEEE7246_21923 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|421408528|ref|ZP_15858327.1| hypothetical protein SEEE7250_12208 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|421414841|ref|ZP_15864577.1| hypothetical protein SEEE1427_21225 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|421417556|ref|ZP_15867266.1| hypothetical protein SEEE2659_12186 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|421420895|ref|ZP_15870571.1| hypothetical protein SEEE1757_06240 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|421428540|ref|ZP_15878151.1| hypothetical protein SEEE5101_22057 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|421430983|ref|ZP_15880569.1| hypothetical protein SEEE8B1_11656 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|421435587|ref|ZP_15885123.1| hypothetical protein SEEE5518_11546 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|421440009|ref|ZP_15889489.1| hypothetical protein SEEE1618_11028 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|421443932|ref|ZP_15893371.1| hypothetical protein SEEE3079_07791 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|421449402|ref|ZP_15898786.1| hypothetical protein SEEE6482_12769 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|436586978|ref|ZP_20511757.1| hypothetical protein SEE22704_00050 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|436660635|ref|ZP_20517129.1| hypothetical protein SEE30663_03263 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|436799760|ref|ZP_20524046.1| hypothetical protein SEECHS44_12249 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|436807386|ref|ZP_20527429.1| hypothetical protein SEEE1882_06386 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|436818277|ref|ZP_20534910.1| hypothetical protein SEEE1884_21489 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|436832500|ref|ZP_20536790.1| hypothetical protein SEEE1594_08045 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|436853153|ref|ZP_20543178.1| hypothetical protein SEEE1566_17551 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|436861059|ref|ZP_20548243.1| hypothetical protein SEEE1580_20609 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|436867712|ref|ZP_20552866.1| hypothetical protein SEEE1543_21404 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|436873057|ref|ZP_20555939.1| hypothetical protein SEEE1441_14355 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|436880272|ref|ZP_20560031.1| hypothetical protein SEEE1810_12405 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|436891682|ref|ZP_20566382.1| hypothetical protein SEEE1558_21721 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|436899411|ref|ZP_20570822.1| hypothetical protein SEEE1018_21293 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|436902922|ref|ZP_20573386.1| hypothetical protein SEEE1010_11600 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|436914994|ref|ZP_20579841.1| hypothetical protein SEEE1729_21688 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|436919693|ref|ZP_20582474.1| hypothetical protein SEEE0895_12108 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|436928985|ref|ZP_20588191.1| hypothetical protein SEEE0899_18091 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|436938402|ref|ZP_20593189.1| hypothetical protein SEEE1457_20704 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|436946037|ref|ZP_20597865.1| hypothetical protein SEEE1747_21678 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|436955500|ref|ZP_20602375.1| hypothetical protein SEEE0968_21674 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|436966232|ref|ZP_20606901.1| hypothetical protein SEEE1444_21694 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|436969376|ref|ZP_20608373.1| hypothetical protein SEEE1445_06203 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|436980018|ref|ZP_20613163.1| hypothetical protein SEEE1559_07888 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|436993573|ref|ZP_20618366.1| hypothetical protein SEEE1565_11367 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|437005017|ref|ZP_20622247.1| hypothetical protein SEEE1808_08362 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|437022700|ref|ZP_20628649.1| hypothetical protein SEEE1811_17933 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|437027568|ref|ZP_20630457.1| hypothetical protein SEEE0956_04135 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|437042923|ref|ZP_20636436.1| hypothetical protein SEEE1455_11592 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|437050597|ref|ZP_20640742.1| hypothetical protein SEEE1575_10738 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|437061829|ref|ZP_20647195.1| hypothetical protein SEEE1725_20886 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|437066745|ref|ZP_20649807.1| hypothetical protein SEEE1745_11181 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|437074029|ref|ZP_20653471.1| hypothetical protein SEEE1791_06801 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|437083113|ref|ZP_20658856.1| hypothetical protein SEEE1795_11447 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|437097855|ref|ZP_20665310.1| hypothetical protein SEEE6709_21588 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|437110640|ref|ZP_20667986.1| hypothetical protein SEEE9058_12128 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|437125197|ref|ZP_20673859.1| hypothetical protein SEEE0816_19219 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|437129598|ref|ZP_20676074.1| hypothetical protein SEEE0819_07442 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|437141690|ref|ZP_20683374.1| hypothetical protein SEEE3072_21673 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|437146227|ref|ZP_20686016.1| hypothetical protein SEEE3089_12043 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|437153413|ref|ZP_20690519.1| hypothetical protein SEEE9163_11989 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|437159783|ref|ZP_20694181.1| hypothetical protein SEEE151_07742 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|437169245|ref|ZP_20699638.1| hypothetical protein SEEEN202_12756 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|437175772|ref|ZP_20702948.1| hypothetical protein SEEE3991_06819 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|437184559|ref|ZP_20708424.1| hypothetical protein SEEE3618_11954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|437205626|ref|ZP_20712423.1| hypothetical protein SEEE1831_09567 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|437264803|ref|ZP_20720079.1| hypothetical protein SEEE2490_22031 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|437269338|ref|ZP_20722581.1| hypothetical protein SEEEL909_12114 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|437277550|ref|ZP_20726909.1| hypothetical protein SEEEL913_11084 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|437296938|ref|ZP_20732739.1| hypothetical protein SEEE4941_18052 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|437315935|ref|ZP_20737623.1| hypothetical protein SEEE7015_20209 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|437327767|ref|ZP_20740709.1| hypothetical protein SEEE7927_12773 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|437341835|ref|ZP_20744958.1| hypothetical protein SEEECHS4_11503 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|437374569|ref|ZP_20749722.1| hypothetical protein SEEE2558_14891 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22558]
 gi|437417592|ref|ZP_20754011.1| hypothetical protein SEEE2217_11954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|437445835|ref|ZP_20758557.1| hypothetical protein SEEE4018_12140 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|437463439|ref|ZP_20763121.1| hypothetical protein SEEE6211_12285 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|437480998|ref|ZP_20768703.1| hypothetical protein SEEE4441_17841 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|437492490|ref|ZP_20771721.1| hypothetical protein SEEE4647_10346 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|437509510|ref|ZP_20776649.1| hypothetical protein SEEE9845_12882 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|437533005|ref|ZP_20781108.1| hypothetical protein SEEE9317_12519 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|437567162|ref|ZP_20787433.1| hypothetical protein SEEE0116_21713 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|437580559|ref|ZP_20791962.1| hypothetical protein SEEE1117_21597 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|437583425|ref|ZP_20792515.1| hypothetical protein SEEE1392_01384 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|437605015|ref|ZP_20799194.1| hypothetical protein SEEE0268_12682 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|437619415|ref|ZP_20803567.1| hypothetical protein SEEE0316_11911 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|437643887|ref|ZP_20808520.1| hypothetical protein SEEE0436_14290 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|437665443|ref|ZP_20814594.1| hypothetical protein SEEE1319_21517 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|437679960|ref|ZP_20818264.1| hypothetical protein SEEE4481_17522 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|437699998|ref|ZP_20823585.1| hypothetical protein SEEE6297_21011 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|437703512|ref|ZP_20824555.1| hypothetical protein SEEE4220_02924 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|437729716|ref|ZP_20830848.1| hypothetical protein SEEE1616_11817 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 16-16]
 gi|437779120|ref|ZP_20836338.1| hypothetical protein SEEE2651_17059 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 76-2651]
 gi|437808540|ref|ZP_20840245.1| hypothetical protein SEEE3944_12048 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|437950956|ref|ZP_20852013.1| hypothetical protein SEEE5621_25836 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
 gi|437967922|ref|ZP_20852683.1| hypothetical protein SEEE5646_01953 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
 gi|438092827|ref|ZP_20861372.1| hypothetical protein SEEE2625_19473 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|438101777|ref|ZP_20864604.1| hypothetical protein SEEE1976_12776 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|438116347|ref|ZP_20870866.1| hypothetical protein SEEE3407_21852 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|438125951|ref|ZP_20872792.1| hypothetical protein SEEP9120_04530 [Salmonella enterica subsp.
           enterica serovar Pullorum str. ATCC 9120]
 gi|445135493|ref|ZP_21383245.1| hypothetical protein SEEG9184_019698 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|445145321|ref|ZP_21387283.1| hypothetical protein SEEDSL_011332 [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|445171068|ref|ZP_21395979.1| hypothetical protein SEE8A_013504 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|445191112|ref|ZP_21399762.1| hypothetical protein SE20037_09174 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|445226339|ref|ZP_21403820.1| hypothetical protein SEE10_015161 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|445240391|ref|ZP_21407510.1| hypothetical protein SEE436_000740 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 436]
 gi|445335016|ref|ZP_21415334.1| hypothetical protein SEE18569_012433 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|445343791|ref|ZP_21417254.1| hypothetical protein SEE13_009373 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|445358404|ref|ZP_21422596.1| hypothetical protein SEE23_018981 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|29839732|sp|Q8Z3U7.1|YGGU_SALTI RecName: Full=UPF0235 protein YggU
 gi|189030121|sp|A9N4P8.1|YGGU_SALPB RecName: Full=UPF0235 protein YggU
 gi|226730825|sp|B5FUW5.1|YGGU_SALDC RecName: Full=UPF0235 protein YggU
 gi|226730826|sp|B5QY78.1|YGGU_SALEP RecName: Full=UPF0235 protein YggU
 gi|226730827|sp|B5RE62.1|YGGU_SALG2 RecName: Full=UPF0235 protein YggU
 gi|226730831|sp|B4TV71.1|YGGU_SALSV RecName: Full=UPF0235 protein YggU
 gi|25370114|pir||AF0878 conserved hypothetical protein STY3255 [imported] - Salmonella
           enterica subsp. enterica serovar Typhi (strain CT18)
 gi|16504179|emb|CAD02926.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhi]
 gi|29138998|gb|AAO70566.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhi str. Ty2]
 gi|161365430|gb|ABX69198.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp.
           enterica serovar Paratyphi B str. SPB7]
 gi|194456782|gb|EDX45621.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Kentucky str. CVM29188]
 gi|194711475|gb|ACF90696.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Schwarzengrund str. CVM19633]
 gi|195633251|gb|EDX51665.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Newport str. SL317]
 gi|197290920|gb|EDY30274.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Schwarzengrund str. SL480]
 gi|197938903|gb|ACH76236.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Dublin str. CT_02021853]
 gi|199605099|gb|EDZ03644.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Virchow str. SL491]
 gi|205273806|emb|CAR38801.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Gallinarum str. 287/91]
 gi|205323332|gb|EDZ11171.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Saintpaul str. SARA29]
 gi|205331696|gb|EDZ18460.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Kentucky str. CDC 191]
 gi|205347059|gb|EDZ33690.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar Hadar str. RI_05P066]
 gi|206710166|emb|CAR34522.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Enteritidis str. P125109]
 gi|326624849|gb|EGE31194.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Dublin str. SD3246]
 gi|326629138|gb|EGE35481.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar
           Gallinarum str. SG9]
 gi|353643832|gb|EHC87929.1| Hypothetical protein LTSEUGA_4347 [Salmonella enterica subsp.
           enterica serovar Uganda str. R8-3404]
 gi|357207330|gb|AET55376.1| hypothetical protein SPUL_3104 [Salmonella enterica subsp. enterica
           serovar Gallinarum/pullorum str. RKS5078]
 gi|392617335|gb|EIW99760.1| hypothetical protein SEENLE15_01345 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 15]
 gi|392620905|gb|EIX03271.1| hypothetical protein SEENLE01_09156 [Salmonella enterica subsp.
           enterica serovar Newport str. Levine 1]
 gi|392733991|gb|EIZ91182.1| hypothetical protein SEEN539_17297 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21539]
 gi|392738855|gb|EIZ95995.1| hypothetical protein SEEN199_07843 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35199]
 gi|392741598|gb|EIZ98694.1| hypothetical protein SEEN185_09925 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35185]
 gi|392752808|gb|EJA09748.1| hypothetical protein SEEN953_05911 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 33953]
 gi|392755634|gb|EJA12543.1| hypothetical protein SEEN188_20492 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35188]
 gi|392757245|gb|EJA14135.1| hypothetical protein SEEN559_07379 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 21559]
 gi|392781052|gb|EJA37703.1| hypothetical protein SEEN202_18461 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 35202]
 gi|392836145|gb|EJA91733.1| hypothetical protein SEEN176_07477 [Salmonella enterica subsp.
           enterica serovar Newport str. CVM 4176]
 gi|395981255|gb|EJH90477.1| hypothetical protein SEEE3139_21675 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 622731-39]
 gi|395981909|gb|EJH91130.1| hypothetical protein SEEE0631_21444 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 640631]
 gi|395987923|gb|EJH97085.1| hypothetical protein SEEE0166_12594 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639016-6]
 gi|395994353|gb|EJI03429.1| hypothetical protein SEEE0424_21088 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-0424]
 gi|395995168|gb|EJI04233.1| hypothetical protein SEEE3076_21526 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-6]
 gi|395995731|gb|EJI04795.1| hypothetical protein SEEE4917_22116 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 485549-17]
 gi|396009241|gb|EJI18174.1| hypothetical protein SEEE6426_20833 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-26]
 gi|396017060|gb|EJI25926.1| hypothetical protein SEEE6670_06791 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-70]
 gi|396018488|gb|EJI27350.1| hypothetical protein SEEE6622_06164 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 596866-22]
 gi|396022172|gb|EJI30986.1| hypothetical protein SEEE7246_21923 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-46]
 gi|396027660|gb|EJI36423.1| hypothetical protein SEEE6437_12540 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629164-37]
 gi|396027943|gb|EJI36705.1| hypothetical protein SEEE7250_12208 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 639672-50]
 gi|396034876|gb|EJI43557.1| hypothetical protein SEEE1427_21225 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-1427]
 gi|396042391|gb|EJI51013.1| hypothetical protein SEEE2659_12186 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 77-2659]
 gi|396043940|gb|EJI52538.1| hypothetical protein SEEE1757_06240 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 78-1757]
 gi|396048575|gb|EJI57124.1| hypothetical protein SEEE5101_22057 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22510-1]
 gi|396054809|gb|EJI63301.1| hypothetical protein SEEE8B1_11656 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 8b-1]
 gi|396055999|gb|EJI64475.1| hypothetical protein SEEE5518_11546 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648905 5-18]
 gi|396068144|gb|EJI76492.1| hypothetical protein SEEE1618_11028 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 6-18]
 gi|396069563|gb|EJI77901.1| hypothetical protein SEEE3079_07791 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-3079]
 gi|396070699|gb|EJI79027.1| hypothetical protein SEEE6482_12769 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 58-6482]
 gi|434942486|gb|ELL48767.1| hypothetical protein SEEP9120_04530 [Salmonella enterica subsp.
           enterica serovar Pullorum str. ATCC 9120]
 gi|434959790|gb|ELL53236.1| hypothetical protein SEECHS44_12249 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS44]
 gi|434968342|gb|ELL61094.1| hypothetical protein SEEE1882_06386 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1882]
 gi|434970821|gb|ELL63382.1| hypothetical protein SEEE1884_21489 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1884]
 gi|434981099|gb|ELL72986.1| hypothetical protein SEEE1594_08045 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1594]
 gi|434981618|gb|ELL73481.1| hypothetical protein SEE22704_00050 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22704]
 gi|434984498|gb|ELL76238.1| hypothetical protein SEEE1566_17551 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1566]
 gi|434985503|gb|ELL77190.1| hypothetical protein SEEE1580_20609 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1580]
 gi|434992864|gb|ELL84303.1| hypothetical protein SEEE1543_21404 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1543]
 gi|434999914|gb|ELL91088.1| hypothetical protein SEEE1441_14355 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1441]
 gi|435005116|gb|ELL96038.1| hypothetical protein SEEE1810_12405 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1810]
 gi|435005811|gb|ELL96731.1| hypothetical protein SEEE1558_21721 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1558]
 gi|435012546|gb|ELM03221.1| hypothetical protein SEEE1018_21293 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1018]
 gi|435019352|gb|ELM09796.1| hypothetical protein SEEE1010_11600 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1010]
 gi|435020333|gb|ELM10745.1| hypothetical protein SEE30663_03263 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE30663]
 gi|435023076|gb|ELM13372.1| hypothetical protein SEEE1729_21688 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1729]
 gi|435029528|gb|ELM19586.1| hypothetical protein SEEE0895_12108 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0895]
 gi|435033675|gb|ELM23567.1| hypothetical protein SEEE0899_18091 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0899]
 gi|435033926|gb|ELM23816.1| hypothetical protein SEEE1457_20704 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1457]
 gi|435035609|gb|ELM25454.1| hypothetical protein SEEE1747_21678 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1747]
 gi|435045876|gb|ELM35502.1| hypothetical protein SEEE0968_21674 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0968]
 gi|435046642|gb|ELM36257.1| hypothetical protein SEEE1444_21694 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1444]
 gi|435058694|gb|ELM48001.1| hypothetical protein SEEE1445_06203 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1445]
 gi|435065250|gb|ELM54356.1| hypothetical protein SEEE1565_11367 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1565]
 gi|435068574|gb|ELM57602.1| hypothetical protein SEEE1559_07888 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1559]
 gi|435072308|gb|ELM61237.1| hypothetical protein SEEE1808_08362 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1808]
 gi|435076637|gb|ELM65420.1| hypothetical protein SEEE1811_17933 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1811]
 gi|435083573|gb|ELM72174.1| hypothetical protein SEEE1455_11592 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1455]
 gi|435085627|gb|ELM74180.1| hypothetical protein SEEE0956_04135 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_0956]
 gi|435088313|gb|ELM76770.1| hypothetical protein SEEE1725_20886 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1725]
 gi|435093301|gb|ELM81641.1| hypothetical protein SEEE1575_10738 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1575]
 gi|435097551|gb|ELM85810.1| hypothetical protein SEEE1745_11181 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1745]
 gi|435106499|gb|ELM94516.1| hypothetical protein SEEE6709_21588 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 576709]
 gi|435107830|gb|ELM95813.1| hypothetical protein SEEE1791_06801 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1791]
 gi|435108686|gb|ELM96651.1| hypothetical protein SEEE1795_11447 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CDC_2010K_1795]
 gi|435118542|gb|ELN06194.1| hypothetical protein SEEE0816_19219 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-16]
 gi|435118890|gb|ELN06541.1| hypothetical protein SEEE9058_12128 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 635290-58]
 gi|435126818|gb|ELN14212.1| hypothetical protein SEEE0819_07442 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-19]
 gi|435127858|gb|ELN15218.1| hypothetical protein SEEE3072_21673 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607307-2]
 gi|435136472|gb|ELN23562.1| hypothetical protein SEEE3089_12043 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 607308-9]
 gi|435141164|gb|ELN28106.1| hypothetical protein SEEE9163_11989 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 629163]
 gi|435148562|gb|ELN35278.1| hypothetical protein SEEE151_07742 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE15-1]
 gi|435148973|gb|ELN35687.1| hypothetical protein SEEEN202_12756 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_N202]
 gi|435156443|gb|ELN42933.1| hypothetical protein SEEE3991_06819 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_56-3991]
 gi|435159810|gb|ELN46128.1| hypothetical protein SEEE2490_22031 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_81-2490]
 gi|435161170|gb|ELN47412.1| hypothetical protein SEEE3618_11954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_76-3618]
 gi|435172285|gb|ELN57828.1| hypothetical protein SEEEL909_12114 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL909]
 gi|435172946|gb|ELN58471.1| hypothetical protein SEEEL913_11084 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SL913]
 gi|435179365|gb|ELN64515.1| hypothetical protein SEEE4941_18052 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CVM_69-4941]
 gi|435180411|gb|ELN65519.1| hypothetical protein SEEE7015_20209 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 638970-15]
 gi|435191948|gb|ELN76504.1| hypothetical protein SEEE7927_12773 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 17927]
 gi|435193501|gb|ELN77980.1| hypothetical protein SEEECHS4_11503 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. CHS4]
 gi|435202227|gb|ELN86081.1| hypothetical protein SEEE2217_11954 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 22-17]
 gi|435205340|gb|ELN88941.1| hypothetical protein SEEE2558_14891 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 22558]
 gi|435207581|gb|ELN91033.1| hypothetical protein SEEE1831_09567 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13183-1]
 gi|435210224|gb|ELN93495.1| hypothetical protein SEEE4018_12140 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 40-18]
 gi|435218174|gb|ELO00581.1| hypothetical protein SEEE4441_17841 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 4-1]
 gi|435218716|gb|ELO01117.1| hypothetical protein SEEE6211_12285 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 1-1]
 gi|435228782|gb|ELO10205.1| hypothetical protein SEEE4647_10346 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642046 4-7]
 gi|435232793|gb|ELO13882.1| hypothetical protein SEEE9845_12882 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648898 4-5]
 gi|435234902|gb|ELO15755.1| hypothetical protein SEEE0116_21713 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648900 1-16]
 gi|435240810|gb|ELO21200.1| hypothetical protein SEEE1117_21597 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 1-17]
 gi|435242554|gb|ELO22859.1| hypothetical protein SEEE9317_12519 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648899 3-17]
 gi|435256958|gb|ELO36252.1| hypothetical protein SEEE0268_12682 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648902 6-8]
 gi|435258695|gb|ELO37955.1| hypothetical protein SEEE0316_11911 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648903 1-6]
 gi|435263629|gb|ELO42670.1| hypothetical protein SEEE1392_01384 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 39-2]
 gi|435265030|gb|ELO43915.1| hypothetical protein SEEE1319_21517 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 653049 13-19]
 gi|435272230|gb|ELO50651.1| hypothetical protein SEEE4481_17522 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 642044 8-1]
 gi|435274059|gb|ELO52183.1| hypothetical protein SEEE6297_21011 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 561362 9-7]
 gi|435274442|gb|ELO52555.1| hypothetical protein SEEE0436_14290 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648904 3-6]
 gi|435289828|gb|ELO66778.1| hypothetical protein SEEE1616_11817 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 648901 16-16]
 gi|435293602|gb|ELO70294.1| hypothetical protein SEEE4220_02924 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 543463 42-20]
 gi|435300208|gb|ELO76303.1| hypothetical protein SEEE3944_12048 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 33944]
 gi|435302694|gb|ELO78643.1| hypothetical protein SEEE2651_17059 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 76-2651]
 gi|435306441|gb|ELO81733.1| hypothetical protein SEEE5621_25836 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 6.0562-1]
 gi|435315141|gb|ELO88423.1| hypothetical protein SEEE2625_19473 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 81-2625]
 gi|435324460|gb|ELO96393.1| hypothetical protein SEEE1976_12776 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 62-1976]
 gi|435327862|gb|ELO99513.1| hypothetical protein SEEE3407_21852 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 53-407]
 gi|435339710|gb|ELP08501.1| hypothetical protein SEEE5646_01953 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 50-5646]
 gi|444845694|gb|ELX70882.1| hypothetical protein SEEG9184_019698 [Salmonella enterica subsp.
           enterica serovar Gallinarum str. 9184]
 gi|444846094|gb|ELX71275.1| hypothetical protein SEEDSL_011332 [Salmonella enterica subsp.
           enterica serovar Dublin str. SL1438]
 gi|444861738|gb|ELX86611.1| hypothetical protein SEE8A_013504 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE8a]
 gi|444867672|gb|ELX92349.1| hypothetical protein SEE10_015161 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. SE10]
 gi|444868217|gb|ELX92866.1| hypothetical protein SE20037_09174 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 20037]
 gi|444874597|gb|ELX98832.1| hypothetical protein SEE18569_012433 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 18569]
 gi|444880951|gb|ELY05013.1| hypothetical protein SEE13_009373 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 13-1]
 gi|444885959|gb|ELY09728.1| hypothetical protein SEE23_018981 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. PT23]
 gi|444891162|gb|ELY14434.1| hypothetical protein SEE436_000740 [Salmonella enterica subsp.
           enterica serovar Enteritidis str. 436]
          Length = 96

 Score = 38.1 bits (87), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQIVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|449146497|ref|ZP_21777270.1| hypothetical protein D908_17049 [Vibrio mimicus CAIM 602]
 gi|449077729|gb|EMB48690.1| hypothetical protein D908_17049 [Vibrio mimicus CAIM 602]
          Length = 97

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  + DE + V I AP  DG+ANA L ++++    V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHDEELKVAITAPPIDGKANAHLSKFLAKQCKVAKGSVVIEKGE 72

Query: 102 KSRDKIVIVE---EITPE 116
             R K V ++   +I PE
Sbjct: 73  LGRHKQVRIQQPSQIPPE 90


>gi|410667636|ref|YP_006920007.1| hypothetical protein CHP00251 [Thermacetogenium phaeum DSM 12270]
 gi|409105383|gb|AFV11508.1| hypothetical protein CHP00251 [Thermacetogenium phaeum DSM 12270]
          Length = 101

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 18/78 (23%), Positives = 43/78 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V I +  +P +    +  +++E + +++ AP  DG+AN  L++++  +      +V I  
Sbjct: 11  VRIQVKVQPRASCNEVVGITEEYLRIRLTAPPVDGKANKQLVKFLGQLFRCGAGKVRILH 70

Query: 100 GSKSRDKIVIVEEITPEN 117
           G+  R K+V ++ I+ ++
Sbjct: 71  GTSGRCKLVEIDGISEDD 88


>gi|262172428|ref|ZP_06040106.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451]
 gi|261893504|gb|EEY39490.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451]
          Length = 97

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIVIVE---EITPE 116
             R K V ++   +I PE
Sbjct: 73  LGRHKQVRIQQPNQIPPE 90


>gi|423141601|ref|ZP_17129239.1| hypothetical protein SEHO0A_03158 [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
 gi|379050773|gb|EHY68665.1| hypothetical protein SEHO0A_03158 [Salmonella enterica subsp.
           houtenae str. ATCC BAA-1581]
          Length = 93

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 10  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLAKFLGKQFRVAKSQIVIEKGE 69

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 70  LGRHKQVKIIHPQQIPPE 87


>gi|417303415|ref|ZP_12090473.1| protein containing DUF167 [Rhodopirellula baltica WH47]
 gi|327540387|gb|EGF26973.1| protein containing DUF167 [Rhodopirellula baltica WH47]
          Length = 87

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 38/64 (59%)

Query: 48  PGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKI 107
           P +K  S+  + D A+ V +    +DG+AN A++  ++  L V + +V+I +G  SR K 
Sbjct: 5   PKAKKASVGGLHDGALKVSVHMVPEDGKANKAVIASLAKWLRVSKGRVAIVAGETSRLKT 64

Query: 108 VIVE 111
           ++VE
Sbjct: 65  IVVE 68


>gi|148260376|ref|YP_001234503.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5]
 gi|146402057|gb|ABQ30584.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5]
          Length = 75

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/41 (36%), Positives = 31/41 (75%)

Query: 70  PAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           P +DG AN A+L+ +++ LG+   ++++ +G ++RDK+V+V
Sbjct: 4   PPEDGRANDAVLKALAAWLGIGAARLALRAGGQARDKLVLV 44


>gi|153003993|ref|YP_001378318.1| hypothetical protein Anae109_1126 [Anaeromyxobacter sp. Fw109-5]
 gi|166977708|sp|A7H9D8.1|Y1126_ANADF RecName: Full=UPF0235 protein Anae109_1126
 gi|152027566|gb|ABS25334.1| protein of unknown function DUF167 [Anaeromyxobacter sp. Fw109-5]
          Length = 95

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 29/46 (63%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           +Q+ AP  DG ANAAL+E+++  L V+++ V +  G   R K V V
Sbjct: 36  IQLAAPPVDGAANAALVEFLAEALEVRKQDVVLVRGETGRRKAVRV 81


>gi|417336327|ref|ZP_12118839.1| hypothetical protein LTSEALA_4446 [Salmonella enterica subsp.
           enterica serovar Alachua str. R6-377]
 gi|353568275|gb|EHC33223.1| hypothetical protein LTSEALA_4446 [Salmonella enterica subsp.
           enterica serovar Alachua str. R6-377]
          Length = 100

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V V I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 17  LRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQIVIEKGE 76

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 77  LGRHKQVKIIHPQQIPPE 94


>gi|308050656|ref|YP_003914222.1| hypothetical protein Fbal_2946 [Ferrimonas balearica DSM 9799]
 gi|307632846|gb|ADN77148.1| protein of unknown function DUF167 [Ferrimonas balearica DSM 9799]
          Length = 96

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 34/65 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    +  +  E   V I AP  DG+ANA L+++++    V + Q+SI  G 
Sbjct: 14  LKLYIQPKASRDQLVGLHGEEFKVAITAPPVDGKANAHLVKFLAKQFKVAKGQISIVKGE 73

Query: 102 KSRDK 106
             R K
Sbjct: 74  LGRHK 78


>gi|339444553|ref|YP_004710557.1| hypothetical protein EGYY_09690 [Eggerthella sp. YY7918]
 gi|338904305|dbj|BAK44156.1| hypothetical protein EGYY_09690 [Eggerthella sp. YY7918]
          Length = 97

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 33/51 (64%)

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           EAV V++ AP   G+AN A+ + ++  LG+ + +V++ +G  SR K + VE
Sbjct: 31  EAVCVRVCAPPDGGKANKAVCKLIAEALGIPKSRVNVNAGHTSRRKRLAVE 81


>gi|423204104|ref|ZP_17190660.1| TIGR00251 family protein [Aeromonas veronii AMC34]
 gi|404628098|gb|EKB24886.1| TIGR00251 family protein [Aeromonas veronii AMC34]
          Length = 100

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 35/65 (53%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +    I  +  E + V I AP  DG+AN+ L+++++    V + QV+I  G   R K
Sbjct: 18  QPKASRDQIIGLHGEELKVAITAPPVDGQANSHLIKFLAKQFKVAKGQVTIVRGELGRHK 77

Query: 107 IVIVE 111
            V ++
Sbjct: 78  TVAID 82


>gi|323356791|ref|YP_004223187.1| hypothetical protein MTES_0343 [Microbacterium testaceum StLB037]
 gi|323273162|dbj|BAJ73307.1| uncharacterized conserved protein [Microbacterium testaceum
           StLB037]
          Length = 73

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 40/72 (55%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +T+  KPGS+   + + + E + V +   A DG AN+ +++ +++  GV  R V I  
Sbjct: 1   MQLTVRVKPGSRRGPLVEDTAEGLVVHVRERAVDGAANSGVVKALAAHFGVPARDVEILR 60

Query: 100 GSKSRDKIVIVE 111
           G  +R K V V+
Sbjct: 61  GHAARVKRVEVD 72


>gi|330446896|ref|ZP_08310547.1| conserved hypothetical protein [Photobacterium leiognathi subsp.
           mandapamensis svers.1.1.]
 gi|328491087|dbj|GAA05044.1| conserved hypothetical protein [Photobacterium leiognathi subsp.
           mandapamensis svers.1.1.]
          Length = 97

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/70 (27%), Positives = 37/70 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + + I AP  DG+ANA L++Y+S    V +  + +  G 
Sbjct: 15  VRLYIQPKASRDQIVGLHGDEIKIAITAPPVDGKANAHLVKYLSKQFKVAKGLIHVEKGL 74

Query: 102 KSRDKIVIVE 111
           + R K V +E
Sbjct: 75  QGRHKQVRIE 84


>gi|311278130|ref|YP_003940361.1| hypothetical protein Entcl_0802 [Enterobacter cloacae SCF1]
 gi|308747325|gb|ADO47077.1| protein of unknown function DUF167 [Enterobacter cloacae SCF1]
          Length = 96

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   S+  +  + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSLVGLHGDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  I P  +
Sbjct: 73  LGRHKQVRI--IHPHQI 87


>gi|21226924|ref|NP_632846.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1]
 gi|452209405|ref|YP_007489519.1| hypothetical protein MmTuc01_0843 [Methanosarcina mazei Tuc01]
 gi|29839564|sp|Q8PYN9.1|Y822_METMA RecName: Full=UPF0235 protein MM_0822
 gi|20905233|gb|AAM30518.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1]
 gi|452099307|gb|AGF96247.1| hypothetical protein MmTuc01_0843 [Methanosarcina mazei Tuc01]
          Length = 108

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 23/90 (25%), Positives = 48/90 (53%), Gaps = 3/90 (3%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEA---VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           S + + I   PGS+S S+    +E    + V++   A+ G+AN  L+E ++ + G+    
Sbjct: 15  SGIIVDIEVTPGSRSLSVPSGYNEWRKRIEVKLTRNAQKGKANEQLIESLAELFGICSSD 74

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSLGK 124
           + I SG+ S  K ++++ ++ +  +   GK
Sbjct: 75  IFISSGATSSKKSLLIKGVSYQQAVLVFGK 104


>gi|406992513|gb|EKE11865.1| hypothetical protein ACD_15C00011G0004 [uncultured bacterium]
          Length = 72

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 36/69 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I I   P S    +T +++    V++ AP  DG+ANA L++ +S    + +  + I  
Sbjct: 1   MRIYIKVSPRSSKNEVTKIAEGEYKVRLTAPPVDGQANAMLIKILSEYFDIPKSSLRIIG 60

Query: 100 GSKSRDKIV 108
           G  ++ KIV
Sbjct: 61  GKSAKTKIV 69


>gi|428205404|ref|YP_007089757.1| hypothetical protein Chro_0336 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428007325|gb|AFY85888.1| protein of unknown function DUF167 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 74

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 17/61 (27%), Positives = 35/61 (57%)

Query: 48  PGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKI 107
           P SK  +I +  D ++ V + +P  +G+AN  L++ ++    + + ++ I SG  SR K+
Sbjct: 9   PNSKHPAIKEEPDGSLTVHLKSPPINGKANQELIQLLAKKFEIPKTRIKIKSGLVSRQKL 68

Query: 108 V 108
           +
Sbjct: 69  I 69


>gi|390960262|ref|YP_006424096.1| hypothetical protein CL1_0087 [Thermococcus sp. CL1]
 gi|390518570|gb|AFL94302.1| hypothetical protein CL1_0087 [Thermococcus sp. CL1]
          Length = 93

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 42/72 (58%), Gaps = 7/72 (9%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG---VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           + ++ +P +K   I  V D+  G   V+I AP  +G+AN  L++++S VLG    +V I 
Sbjct: 13  LLVYVQPKAKRNEIEGV-DKWRGRLKVKIKAPPVEGKANKELVKFLSKVLGT---EVKII 68

Query: 99  SGSKSRDKIVIV 110
            G  SR+K ++V
Sbjct: 69  RGEASREKDLLV 80


>gi|344923742|ref|ZP_08777203.1| hypothetical protein COdytL_03746 [Candidatus Odyssella
           thessalonicensis L13]
          Length = 101

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 46/97 (47%), Gaps = 5/97 (5%)

Query: 35  VPPSSVSITIHAKPGSKSCSI-----TDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLG 89
           V  S V I I   P S   +I      D     + + + APA+D +AN AL+++++  L 
Sbjct: 5   VQGSKVVIYIRLSPKSSKDAIGGIYKDDRERRMLKISVTAPAEDNKANQALIKFLAKKLK 64

Query: 90  VKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLGKAS 126
           +   Q+++  G   R+K V +E  T   +++     S
Sbjct: 65  IAPSQLTLLQGHTHRNKTVAIESNTISQIVDFFSTIS 101


>gi|293394481|ref|ZP_06638777.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
 gi|291422946|gb|EFE96179.1| conserved hypothetical protein [Serratia odorifera DSM 4582]
          Length = 97

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 34/65 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA LL++++    V +  V+I  G 
Sbjct: 13  VRLYIQPKASRDQIIGLHGDEIKVAITAPPVDGQANAHLLKFIAKQFKVAKSNVTIEKGE 72

Query: 102 KSRDK 106
             R K
Sbjct: 73  LGRHK 77


>gi|258545474|ref|ZP_05705708.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826]
 gi|258519174|gb|EEV88033.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826]
          Length = 97

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/83 (26%), Positives = 42/83 (50%), Gaps = 5/83 (6%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ I A+     C    + DE   + I AP +DG+AN  L  +++    V ++ V++ S
Sbjct: 13  LAVKITARASRDQCQ--GIHDERYKIAITAPPEDGKANKHLTAWLAKTFAVAKKNVALQS 70

Query: 100 GSKSRDKIVIV---EEITPENVL 119
           G+ S  K + +   + + PE  L
Sbjct: 71  GAFSPLKTLRITAPKHLPPEWQL 93


>gi|157372267|ref|YP_001480256.1| hypothetical protein Spro_4033 [Serratia proteamaculans 568]
 gi|166979954|sp|A8GJ38.1|Y4033_SERP5 RecName: Full=UPF0235 protein Spro_4033
 gi|157324031|gb|ABV43128.1| protein of unknown function DUF167 [Serratia proteamaculans 568]
          Length = 96

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 36/67 (53%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  V+I  
Sbjct: 11  LAIRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFLAKQFKVAKGNVTIEK 70

Query: 100 GSKSRDK 106
           G   R K
Sbjct: 71  GELGRHK 77


>gi|440286217|ref|YP_007338982.1| TIGR00251 family protein [Enterobacteriaceae bacterium strain FGI
           57]
 gi|440045739|gb|AGB76797.1| TIGR00251 family protein [Enterobacteriaceae bacterium strain FGI
           57]
          Length = 96

 Score = 37.7 bits (86), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI     + + V I AP  DG+ANA L+++++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDSIVGEHGDELKVAITAPPVDGQANAHLVKFLAKQFKVAKSQVIIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  I P+++
Sbjct: 73  LGRHKQVKI--IHPQHI 87


>gi|294139811|ref|YP_003555789.1| hypothetical protein SVI_1040 [Shewanella violacea DSS12]
 gi|293326280|dbj|BAJ01011.1| conserved hypothetical protein [Shewanella violacea DSS12]
          Length = 100

 Score = 37.7 bits (86), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/86 (26%), Positives = 41/86 (47%), Gaps = 2/86 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           S+ + ++ +P +    I  V    + + I AP  DG+ANA L +++S    V +  + I 
Sbjct: 10  SLLLNLYIQPKASRDKIIGVHGNELKIAITAPPVDGKANAHLTKFLSKAFKVPKGDIIIH 69

Query: 99  SGSKSRDKIV--IVEEITPENVLNSL 122
            G   R K V  +   + PE + + L
Sbjct: 70  KGELGRHKQVEILTPRVIPEQIADLL 95


>gi|269860472|ref|XP_002649957.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348]
 gi|220066644|gb|EED44119.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348]
          Length = 88

 Score = 37.7 bits (86), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 34/69 (49%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + I  K  S+  +I    D+ + +QI AP  D +AN  L+  +S     K+  +SI  
Sbjct: 16  IYLNIKVKLSSRETAILCQEDDYLIIQIAAPPVDNKANNELIALLSKTYKTKKENISIIK 75

Query: 100 GSKSRDKIV 108
           G  S  KI+
Sbjct: 76  GKTSTTKII 84


>gi|393766737|ref|ZP_10355291.1| hypothetical protein WYO_2197 [Methylobacterium sp. GXF4]
 gi|392727831|gb|EIZ85142.1| hypothetical protein WYO_2197 [Methylobacterium sp. GXF4]
          Length = 95

 Score = 37.7 bits (86), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 34/58 (58%)

Query: 66  QIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLG 123
           ++ AP  DG ANAAL + ++  L + R  V I +G +SR KI+ ++ IT  ++    G
Sbjct: 35  RVSAPPVDGAANAALEKLIAKALKIGRSSVRIAAGDQSRLKILEIDGITQADLDRVFG 92


>gi|440716209|ref|ZP_20896720.1| protein containing DUF167 [Rhodopirellula baltica SWK14]
 gi|436438555|gb|ELP32080.1| protein containing DUF167 [Rhodopirellula baltica SWK14]
          Length = 87

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 38/64 (59%)

Query: 48  PGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKI 107
           P +K  S+  + D A+ V +    +DG+AN A++  ++  L V + +++I +G  SR K 
Sbjct: 5   PKAKKASVGGLHDGALKVSVHTVPEDGKANKAVIASLAKWLRVSKGRIAIVAGETSRLKT 64

Query: 108 VIVE 111
           ++VE
Sbjct: 65  IVVE 68


>gi|383497764|ref|YP_005398453.1| hypothetical protein UMN798_3371 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
 gi|380464585|gb|AFD59988.1| hypothetical protein UMN798_3371 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 798]
          Length = 93

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V + I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 10  LRLYIQPKASRDSIVGLHGDEVKIAITAPPVDGQANSHLTKFLGKQFRVAKSQIVIEKGE 69

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 70  LGRHKQVKIIHPQQIPPE 87


>gi|90421586|ref|YP_529956.1| hypothetical protein RPC_0058 [Rhodopseudomonas palustris BisB18]
 gi|122477773|sp|Q21D99.1|Y058_RHOPB RecName: Full=UPF0235 protein RPC_0058
 gi|90103600|gb|ABD85637.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
           BisB18]
          Length = 107

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V V++ A A+ GEAN A+ E ++  LGV +R V + SG+ SR K V V+
Sbjct: 40  VKVRVRAIAEGGEANRAVTELLAKALGVPKRAVRVLSGTTSRLKQVAVD 88


>gi|403738142|ref|ZP_10950870.1| hypothetical protein AUCHE_05_05550 [Austwickia chelonae NBRC
           105200]
 gi|403192254|dbj|GAB77640.1| hypothetical protein AUCHE_05_05550 [Austwickia chelonae NBRC
           105200]
          Length = 92

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 4/76 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQV 95
           + + +  +PG+    +    +   G    V ++  A DG+A  A L+ ++   GV+ RQV
Sbjct: 1   MRLVLRVRPGASRTRVGGAYEAPEGPQLCVAVNDRAVDGQATEAALKAVAKAFGVRPRQV 60

Query: 96  SIGSGSKSRDKIVIVE 111
           ++ SG+ SR KIV V+
Sbjct: 61  TLVSGATSRSKIVEVD 76


>gi|16766403|ref|NP_462018.1| hypothetical protein STM3102 [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|167990371|ref|ZP_02571471.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar 4,[5],12:i:- str. CVM23701]
 gi|197262086|ref|ZP_03162160.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|374979115|ref|ZP_09720454.1| UPF0235 protein VC [Salmonella enterica subsp. enterica serovar
           Typhimurium str. TN061786]
 gi|378446454|ref|YP_005234086.1| hypothetical protein [Salmonella enterica subsp. enterica serovar
           Typhimurium str. D23580]
 gi|378451888|ref|YP_005239248.1| hypothetical protein STM14_3746 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 14028S]
 gi|378701009|ref|YP_005182966.1| hypothetical protein SL1344_3077 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. SL1344]
 gi|378985695|ref|YP_005248851.1| hypothetical protein STMDT12_C31550 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. T000240]
 gi|378990422|ref|YP_005253586.1| hypothetical protein STMUK_3090 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. UK-1]
 gi|379702359|ref|YP_005244087.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|422027320|ref|ZP_16373663.1| hypothetical protein B571_15458 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|422032355|ref|ZP_16378469.1| hypothetical protein B572_15579 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|427554058|ref|ZP_18928960.1| hypothetical protein B576_15515 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|427571612|ref|ZP_18933675.1| hypothetical protein B577_14935 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|427592360|ref|ZP_18938474.1| hypothetical protein B573_14970 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|427615904|ref|ZP_18943364.1| hypothetical protein B574_15364 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|427639750|ref|ZP_18948244.1| hypothetical protein B575_15590 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|427657335|ref|ZP_18952989.1| hypothetical protein B578_15171 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|427662653|ref|ZP_18957954.1| hypothetical protein B579_16078 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|427676276|ref|ZP_18962769.1| hypothetical protein B580_15839 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|427800327|ref|ZP_18968100.1| hypothetical protein B581_18352 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm5]
 gi|29839738|sp|Q8ZM46.1|YGGU_SALTY RecName: Full=UPF0235 protein YggU
 gi|16421655|gb|AAL21977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. LT2]
 gi|197240341|gb|EDY22961.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Saintpaul str. SARA23]
 gi|205331149|gb|EDZ17913.1| conserved hypothetical protein TIGR00251 [Salmonella enterica
           subsp. enterica serovar 4,[5],12:i:- str. CVM23701]
 gi|261248233|emb|CBG26070.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. D23580]
 gi|267995267|gb|ACY90152.1| hypothetical protein STM14_3746 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. 14028S]
 gi|301159657|emb|CBW19176.1| conserved hypothetical protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. SL1344]
 gi|312914124|dbj|BAJ38098.1| hypothetical protein STMDT12_C31550 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. T000240]
 gi|321225775|gb|EFX50829.1| UPF0235 protein VC [Salmonella enterica subsp. enterica serovar
           Typhimurium str. TN061786]
 gi|323131458|gb|ADX18888.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica
           serovar Typhimurium str. ST4/74]
 gi|332989969|gb|AEF08952.1| hypothetical protein STMUK_3090 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. UK-1]
 gi|414015117|gb|EKS98944.1| hypothetical protein B571_15458 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm1]
 gi|414015968|gb|EKS99758.1| hypothetical protein B576_15515 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm8]
 gi|414016645|gb|EKT00408.1| hypothetical protein B572_15579 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm2]
 gi|414029395|gb|EKT12555.1| hypothetical protein B577_14935 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm9]
 gi|414030889|gb|EKT13970.1| hypothetical protein B573_14970 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm3]
 gi|414033996|gb|EKT16937.1| hypothetical protein B574_15364 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm4]
 gi|414044228|gb|EKT26684.1| hypothetical protein B575_15590 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm6]
 gi|414044945|gb|EKT27375.1| hypothetical protein B578_15171 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm10]
 gi|414049697|gb|EKT31896.1| hypothetical protein B579_16078 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm11]
 gi|414057357|gb|EKT39115.1| hypothetical protein B580_15839 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm12]
 gi|414063525|gb|EKT44653.1| hypothetical protein B581_18352 [Salmonella enterica subsp.
           enterica serovar Typhimurium str. STm5]
          Length = 96

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  + V + I AP  DG+AN+ L +++     V + Q+ I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGDEVKIAITAPPVDGQANSHLTKFLGKQFRVAKSQIVIEKGE 72

Query: 102 KSRD---KIVIVEEITPE 116
             R    KI+  ++I PE
Sbjct: 73  LGRHKQVKIIHPQQIPPE 90


>gi|406969213|gb|EKD93911.1| hypothetical protein ACD_28C00032G0008 [uncultured bacterium]
          Length = 79

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 36/63 (57%)

Query: 50  SKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVI 109
           SKS  +  + D  + ++I AP + G+AN  L+ ++S  L + + QV+I SG     K++ 
Sbjct: 16  SKSEIVETMGDGTLKIRIAAPPEKGKANTELIRFLSRHLKIDKSQVNIISGKSDTLKLIK 75

Query: 110 VEE 112
           + E
Sbjct: 76  IHE 78


>gi|120403830|ref|YP_953659.1| hypothetical protein Mvan_2846 [Mycobacterium vanbaalenii PYR-1]
 gi|166200356|sp|A1T903.1|Y2846_MYCVP RecName: Full=UPF0235 protein Mvan_2846
 gi|119956648|gb|ABM13653.1| protein of unknown function DUF167 [Mycobacterium vanbaalenii
           PYR-1]
          Length = 75

 Score = 37.7 bits (86), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 1/66 (1%)

Query: 42  ITIHAKPGSKSCSITDVS-DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           +++  KPGSK   + +   D  + V +   A DG+ANAA++  ++   GV R  V +  G
Sbjct: 5   VSVRVKPGSKKGPLVETGPDGELTVYVRERAVDGKANAAVIRVLAEHFGVPRSLVELTGG 64

Query: 101 SKSRDK 106
           + SR K
Sbjct: 65  ASSRIK 70


>gi|421785440|ref|ZP_16221866.1| hypothetical protein B194_4492 [Serratia plymuthica A30]
 gi|407752457|gb|EKF62614.1| hypothetical protein B194_4492 [Serratia plymuthica A30]
 gi|453063315|gb|EMF04295.1| hypothetical protein F518_17824 [Serratia marcescens VGH107]
          Length = 90

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 36/67 (53%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  V+I  
Sbjct: 5   LAIRLYIQPKASRDQIIGLHGDELKVAITAPPVDGQANAHLIKFIAKQFKVAKSNVTIEK 64

Query: 100 GSKSRDK 106
           G   R K
Sbjct: 65  GELGRHK 71


>gi|332669639|ref|YP_004452647.1| hypothetical protein Celf_1125 [Cellulomonas fimi ATCC 484]
 gi|332338677|gb|AEE45260.1| protein of unknown function DUF167 [Cellulomonas fimi ATCC 484]
          Length = 91

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 41/71 (57%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I  +PG+    +  +  + + V + A A DG A  A L  ++  LGV+RR VS+ SG+
Sbjct: 3   VAIRVRPGASRTRVGGLHGDRLVVAVQARAVDGAATEAALAALADALGVRRRHVSLVSGA 62

Query: 102 KSRDKIVIVEE 112
            SRDK+  V++
Sbjct: 63  TSRDKVAEVDD 73


>gi|330831217|ref|YP_004394169.1| hypothetical protein B565_3517 [Aeromonas veronii B565]
 gi|423203497|ref|ZP_17190075.1| TIGR00251 family protein [Aeromonas veronii AER39]
 gi|423208130|ref|ZP_17194684.1| TIGR00251 family protein [Aeromonas veronii AER397]
 gi|328806353|gb|AEB51552.1| hypothetical protein B565_3517 [Aeromonas veronii B565]
 gi|404612792|gb|EKB09849.1| TIGR00251 family protein [Aeromonas veronii AER39]
 gi|404619177|gb|EKB16093.1| TIGR00251 family protein [Aeromonas veronii AER397]
          Length = 100

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 35/65 (53%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +    I  +  E + V I AP  DG+AN+ L+++++    V + Q++I  G   R K
Sbjct: 18  QPKASRDQIIGLHGEELKVAITAPPVDGQANSHLIKFLAKQFKVAKGQITIVRGELGRHK 77

Query: 107 IVIVE 111
            V ++
Sbjct: 78  TVAID 82


>gi|270263061|ref|ZP_06191331.1| threonine dehydratase [Serratia odorifera 4Rx13]
 gi|333929063|ref|YP_004502642.1| hypothetical protein SerAS12_4237 [Serratia sp. AS12]
 gi|333934016|ref|YP_004507594.1| hypothetical protein SerAS9_4236 [Serratia plymuthica AS9]
 gi|386330886|ref|YP_006027056.1| hypothetical protein [Serratia sp. AS13]
 gi|448243889|ref|YP_007407942.1| hypothetical protein, UPF0235 family [Serratia marcescens WW4]
 gi|270042749|gb|EFA15843.1| threonine dehydratase [Serratia odorifera 4Rx13]
 gi|333475623|gb|AEF47333.1| UPF0235 protein yggU [Serratia plymuthica AS9]
 gi|333493123|gb|AEF52285.1| UPF0235 protein yggU [Serratia sp. AS12]
 gi|333963219|gb|AEG29992.1| UPF0235 protein yggU [Serratia sp. AS13]
 gi|445214253|gb|AGE19923.1| hypothetical protein, UPF0235 family [Serratia marcescens WW4]
          Length = 96

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 36/67 (53%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  V+I  
Sbjct: 11  LAIRLYIQPKASRDQIIGLHGDELKVAITAPPVDGQANAHLIKFIAKQFKVAKSNVTIEK 70

Query: 100 GSKSRDK 106
           G   R K
Sbjct: 71  GELGRHK 77


>gi|85704715|ref|ZP_01035816.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217]
 gi|85670533|gb|EAQ25393.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217]
          Length = 84

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/85 (29%), Positives = 45/85 (52%), Gaps = 2/85 (2%)

Query: 27  NLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSS 86
           ++P    L  P + +I +   P +    I    D A+ V +    +DG+ANAA+ + ++ 
Sbjct: 2   DIPDLSALARPGT-TIALRVTPKAARNRIV-AEDGALRVYVTTVPEDGKANAAVQKLLAK 59

Query: 87  VLGVKRRQVSIGSGSKSRDKIVIVE 111
            LGV + ++S+  G  SRDK+  V+
Sbjct: 60  ALGVPKSRLSLLRGHTSRDKVFQVD 84


>gi|296104617|ref|YP_003614763.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
 gi|295059076|gb|ADF63814.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp. cloacae
           ATCC 13047]
          Length = 95

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   +I  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDNIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|406675560|ref|ZP_11082747.1| TIGR00251 family protein [Aeromonas veronii AMC35]
 gi|404626950|gb|EKB23756.1| TIGR00251 family protein [Aeromonas veronii AMC35]
          Length = 100

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 35/65 (53%)

Query: 47  KPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +P +    I  +  E + V I AP  DG+AN+ L+++++    V + Q++I  G   R K
Sbjct: 18  QPKASRDQIIGLHGEELKVAITAPPVDGQANSHLIKFLAKQFKVAKGQITIVRGELGRHK 77

Query: 107 IVIVE 111
            V ++
Sbjct: 78  TVAID 82


>gi|418696992|ref|ZP_13257993.1| hypothetical protein LEP1GSC081_1952 [Leptospira kirschneri str.
           H1]
 gi|421105842|ref|ZP_15566419.1| hypothetical protein LEP1GSC082_0280 [Leptospira kirschneri str.
           H2]
 gi|409955159|gb|EKO14099.1| hypothetical protein LEP1GSC081_1952 [Leptospira kirschneri str.
           H1]
 gi|410009109|gb|EKO62768.1| hypothetical protein LEP1GSC082_0280 [Leptospira kirschneri str.
           H2]
          Length = 73

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/69 (27%), Positives = 39/69 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + +T++ KP SK   I    +  + + +   A +G+AN A++E +S  + V + ++ I S
Sbjct: 1   MKLTVYVKPNSKKVFIRKEENGVLTIAVRELALEGKANEAVIESISKEMKVPKSKIRILS 60

Query: 100 GSKSRDKIV 108
           G K++ K +
Sbjct: 61  GQKNKKKTI 69


>gi|91975173|ref|YP_567832.1| hypothetical protein RPD_0693 [Rhodopseudomonas palustris BisB5]
 gi|91681629|gb|ABE37931.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
           BisB5]
          Length = 107

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 32/49 (65%)

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V V++ A A  GEAN A++E ++  LGV +R V + SG+ SR K + ++
Sbjct: 39  VKVRVRAIADGGEANRAVIELLAKSLGVPKRNVRLLSGATSRQKQIAID 87


>gi|89074110|ref|ZP_01160609.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34]
 gi|89050046|gb|EAR55572.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34]
          Length = 97

 Score = 37.4 bits (85), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 36/70 (51%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +    V + I AP  DG+ANA L++Y+S    V +  + +  G 
Sbjct: 15  IRLYIQPKASRDQIVGLHGNEVKIAITAPPVDGKANAHLVKYLSKQFKVAKGLIHLEKGL 74

Query: 102 KSRDKIVIVE 111
           + R K + +E
Sbjct: 75  QGRHKQIRIE 84


>gi|242398935|ref|YP_002994359.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739]
 gi|242265328|gb|ACS90010.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739]
          Length = 94

 Score = 37.4 bits (85), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 47/86 (54%), Gaps = 7/86 (8%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG---VQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           V + I+ +P + +  I  V DE  G   V+I AP  +G+AN  ++++ S +LG    + S
Sbjct: 9   VILQIYVQPKANTNEIEGV-DEWRGRLKVKIKAPPVEGKANKEVVKFFSKLLGA---EAS 64

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSL 122
           +  G  SR+K +++  I+ E V   L
Sbjct: 65  LIKGETSREKDLLIRGISIEEVKKKL 90


>gi|145595719|ref|YP_001160016.1| hypothetical protein Strop_3204 [Salinispora tropica CNB-440]
 gi|145305056|gb|ABP55638.1| protein of unknown function DUF167 [Salinispora tropica CNB-440]
          Length = 106

 Score = 37.4 bits (85), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 4/76 (5%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           ++++ +  KPGS    +        G    + + AP  DG A  A    ++   GV+R  
Sbjct: 14  ALTVAVRIKPGSSRSRVGGRYMGPYGPALVIAVTAPPVDGRATEAARRALAEAFGVRRAA 73

Query: 95  VSIGSGSKSRDKIVIV 110
           VS+G+G+ SR+KI  V
Sbjct: 74  VSLGAGAASRNKIFYV 89


>gi|115522078|ref|YP_778989.1| hypothetical protein RPE_0048 [Rhodopseudomonas palustris BisA53]
 gi|115516025|gb|ABJ04009.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
           BisA53]
          Length = 107

 Score = 37.4 bits (85), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 32/49 (65%)

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V V++ A A+ GEAN A+ E ++  LGV +R V + SG+ SR K + V+
Sbjct: 39  VKVRVRAIAEGGEANRAVTELLAKALGVPKRAVRVLSGTTSRLKQIAVD 87


>gi|219852331|ref|YP_002466763.1| hypothetical protein Mpal_1733 [Methanosphaerula palustris E1-9c]
 gi|219546590|gb|ACL17040.1| protein of unknown function DUF167 [Methanosphaerula palustris
           E1-9c]
          Length = 106

 Score = 37.4 bits (85), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 35/62 (56%)

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
            A+G  I  P  +G+AN A++  +S  L + +  +SI SG+ S  K V+++ +T E +  
Sbjct: 39  HAIGCSITTPPVEGKANRAIVALLSRTLTIPQSGISILSGATSSQKRVLIQGMTFEQLAG 98

Query: 121 SL 122
            L
Sbjct: 99  FL 100


>gi|114564027|ref|YP_751541.1| hypothetical protein Sfri_2863 [Shewanella frigidimarina NCIMB 400]
 gi|122299086|sp|Q07Z62.1|Y2863_SHEFN RecName: Full=UPF0235 protein Sfri_2863
 gi|114335320|gb|ABI72702.1| conserved hypothetical protein [Shewanella frigidimarina NCIMB 400]
          Length = 104

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +    + + I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 17  LFVYVQPKASRDQIVGLYGNELKIAITAPPIDGKANAYLSKYLAKACKVAKSQVHIIKGE 76

Query: 102 KSRDKIVIVEE---ITPE 116
           + R K + + +   I PE
Sbjct: 77  QGRHKQIRISQPQVIPPE 94


>gi|57235102|ref|YP_182004.1| hypothetical protein DET1292 [Dehalococcoides ethenogenes 195]
 gi|123618390|sp|Q3Z6Z5.1|Y1292_DEHE1 RecName: Full=UPF0235 protein DET1292
 gi|57225550|gb|AAW40607.1| conserved hypothetical protein [Dehalococcoides ethenogenes 195]
          Length = 97

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 18/71 (25%), Positives = 38/71 (53%)

Query: 38  SSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S   + +   P ++   +T   +  + ++I A  + G+AN AL++Y+S +L   + ++ I
Sbjct: 6   SPFRVNLKILPSAQRNELTGYENGLLKIKIAAQPEKGKANKALVDYLSELLDTPKSEIEI 65

Query: 98  GSGSKSRDKIV 108
             G   R+K+V
Sbjct: 66  CRGLSGRNKVV 76


>gi|307353453|ref|YP_003894504.1| hypothetical protein Mpet_1306 [Methanoplanus petrolearius DSM
           11571]
 gi|307156686|gb|ADN36066.1| protein of unknown function DUF167 [Methanoplanus petrolearius DSM
           11571]
          Length = 105

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 3/86 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDE---AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           V I++   PGSK        +E   A+  +I +PA +G+ANA +++ ++    VK+  V 
Sbjct: 14  VQISLDVSPGSKKTVFPAGYNEWRNAIECRIKSPATEGKANAEIIKTIADYFSVKKSDVV 73

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSL 122
           I SG+ S  K + +  ++ E+ L  L
Sbjct: 74  IVSGAISGQKKIKISGVSLEDALERL 99


>gi|86747236|ref|YP_483732.1| hypothetical protein RPB_0109 [Rhodopseudomonas palustris HaA2]
 gi|123293376|sp|Q2J3Y9.1|Y109_RHOP2 RecName: Full=UPF0235 protein RPB_0109
 gi|86570264|gb|ABD04821.1| Protein of unknown function DUF167 [Rhodopseudomonas palustris
           HaA2]
          Length = 108

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 32/49 (65%)

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V +++ A A  GEAN A++E ++  LGV +R V + SG+ SR K + ++
Sbjct: 40  VKLRVRAIADGGEANRAVIELLAKALGVPKRNVRLLSGATSRQKQIAID 88


>gi|407003809|gb|EKE20334.1| hypothetical protein ACD_8C00020G0003 [uncultured bacterium]
          Length = 72

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I I   P S    +  VS+    V++ AP  DG+AN AL++ ++    V + +V I  G 
Sbjct: 3   IYIRVSPRSSKNEVVKVSEGEYKVKLTAPPVDGQANEALIKLLAQHFKVSKSEVLIVGGK 62

Query: 102 KSRDKIV 108
            ++ KIV
Sbjct: 63  TAKIKIV 69


>gi|242238224|ref|YP_002986405.1| hypothetical protein Dd703_0772 [Dickeya dadantii Ech703]
 gi|242130281|gb|ACS84583.1| protein of unknown function DUF167 [Dickeya dadantii Ech703]
          Length = 99

 Score = 37.4 bits (85), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 38/69 (55%), Gaps = 1/69 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVS-DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           ++ I ++ +P +    I  +  ++ V V I AP  DG+ANA L+++M+    V + +V+I
Sbjct: 10  ALVIRLYIQPKASRDQIVGLHGNDEVKVAITAPPVDGQANAHLIQFMAKQFRVAKSRVTI 69

Query: 98  GSGSKSRDK 106
             G   R K
Sbjct: 70  EKGELGRHK 78


>gi|117927694|ref|YP_872245.1| hypothetical protein Acel_0486 [Acidothermus cellulolyticus 11B]
 gi|117648157|gb|ABK52259.1| protein of unknown function DUF167 [Acidothermus cellulolyticus
           11B]
          Length = 85

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 26/85 (30%), Positives = 44/85 (51%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + IH +PGS   ++    + A+ V +  PA+ G A  A L  ++   GV + QV + S
Sbjct: 1   MRLVIHVRPGSSRATVGGSHNGALIVAVREPAEHGRATDAALAAVAQAFGVPKSQVRLVS 60

Query: 100 GSKSRDKIVIVEEITPENVLNSLGK 124
           G+ SR KI+ V +  P  +   L +
Sbjct: 61  GATSRRKIIDVIDGDPVRLAELLAR 85


>gi|377566380|ref|ZP_09795640.1| hypothetical protein GOSPT_115_00620 [Gordonia sputi NBRC 100414]
 gi|377526448|dbj|GAB40805.1| hypothetical protein GOSPT_115_00620 [Gordonia sputi NBRC 100414]
          Length = 75

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 19/66 (28%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 42  ITIHAKPGSKSCSITDVS-DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           + + AKP S+   + +   D ++ + +  PA +G+AN A+ E ++  LGV + +V++  G
Sbjct: 5   VIVTAKPSSRKGPLVETGPDGSITIYVREPATEGKANKAVAELLAKHLGVPKSKVALIGG 64

Query: 101 SKSRDK 106
           + +R K
Sbjct: 65  ATARTK 70


>gi|401675230|ref|ZP_10807224.1| YggU Protein [Enterobacter sp. SST3]
 gi|400217687|gb|EJO48579.1| YggU Protein [Enterobacter sp. SST3]
          Length = 98

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRLYIQPKASRDGIVGLHGDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQVIIEKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +  + P+++
Sbjct: 73  LGRHKQVKI--LNPQSI 87


>gi|330469281|ref|YP_004407024.1| hypothetical protein VAB18032_26756 [Verrucosispora maris
           AB-18-032]
 gi|328812252|gb|AEB46424.1| hypothetical protein VAB18032_26756 [Verrucosispora maris
           AB-18-032]
          Length = 98

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 7/81 (8%)

Query: 39  SVSITIHAKPGSKSCSITDVSD----EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           ++++ +  KPG+    +    D     A+ + ++APA DG A  A    ++  LGV+   
Sbjct: 5   TLTVAVRVKPGASRARVGGRYDGPHGPALVIAVNAPAVDGRATEAARRALAEALGVRPAT 64

Query: 95  VSIGSGSKSRDKIVIVEEITP 115
           VS+ +G+ SRDK+    +ITP
Sbjct: 65  VSLRTGASSRDKLF---QITP 82


>gi|348618844|ref|ZP_08885346.1| conserved hypothetical protein [Candidatus Glomeribacter
           gigasporarum BEG34]
 gi|347815872|emb|CCD30172.1| conserved hypothetical protein [Candidatus Glomeribacter
           gigasporarum BEG34]
          Length = 93

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 18/74 (24%), Positives = 39/74 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + ++ +P +K+       +  + ++++AP  +G+AN AL  +++  L V    V + +
Sbjct: 5   VQLRVYVQPNAKTTEAAGEFNGTLKIRLNAPPLEGKANKALRAWLAERLSVPHHAVHLET 64

Query: 100 GSKSRDKIVIVEEI 113
           G  SR K + V  +
Sbjct: 65  GHSSRYKQIRVTAV 78


>gi|377559786|ref|ZP_09789322.1| hypothetical protein GOOTI_111_00160 [Gordonia otitidis NBRC
           100426]
 gi|441508771|ref|ZP_20990694.1| hypothetical protein GOACH_06_01520 [Gordonia aichiensis NBRC
           108223]
 gi|377523036|dbj|GAB34487.1| hypothetical protein GOOTI_111_00160 [Gordonia otitidis NBRC
           100426]
 gi|441447212|dbj|GAC48655.1| hypothetical protein GOACH_06_01520 [Gordonia aichiensis NBRC
           108223]
          Length = 75

 Score = 37.4 bits (85), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/66 (28%), Positives = 39/66 (59%), Gaps = 1/66 (1%)

Query: 42  ITIHAKPGSKSCSITDVS-DEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           + + AKP S+   + +   D ++ + +  PA +G+AN A+ E ++  LGV + +V++  G
Sbjct: 5   VIVTAKPSSRKGPLVETGPDGSITIYVREPATEGKANKAVAELLAKHLGVPKSKVALIGG 64

Query: 101 SKSRDK 106
           + +R K
Sbjct: 65  ATARTK 70


>gi|343510789|ref|ZP_08747996.1| hypothetical protein VIS19158_11179 [Vibrio scophthalmi LMG 19158]
 gi|343514967|ref|ZP_08752031.1| hypothetical protein VIBRN418_11355 [Vibrio sp. N418]
 gi|342799111|gb|EGU34691.1| hypothetical protein VIBRN418_11355 [Vibrio sp. N418]
 gi|342800856|gb|EGU36362.1| hypothetical protein VIS19158_11179 [Vibrio scophthalmi LMG 19158]
          Length = 96

 Score = 37.0 bits (84), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 2/79 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E V + I AP  DG+ANA L +Y++    V +  V I  
Sbjct: 12  IVLRLYIQPKASRDKIVGLHGEEVKIAITAPPVDGKANAHLSKYLAKQFKVAKGLVKIEK 71

Query: 100 GSKSRDKIVIVEEITPENV 118
           G   R K V +  +TP  +
Sbjct: 72  GELGRHKQVRI--VTPNQI 88


>gi|257062952|ref|YP_003142624.1| hypothetical protein Shel_02050 [Slackia heliotrinireducens DSM
           20476]
 gi|256790605|gb|ACV21275.1| uncharacterized conserved protein [Slackia heliotrinireducens DSM
           20476]
          Length = 106

 Score = 37.0 bits (84), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           I IHA P ++  ++  V  +  G     V++    + G+AN A+ E ++  +GV + +VS
Sbjct: 16  IPIHATPKAQRNAVAGVKADDTGRLEVQVRVTVAPEGGKANKAVCETLAKAIGVSKSKVS 75

Query: 97  IGSGSKSRDKIVIVE 111
           I  G  SR K+  VE
Sbjct: 76  IVRGETSRHKMAQVE 90


>gi|238754614|ref|ZP_04615968.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473]
 gi|238707245|gb|EEP99608.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473]
          Length = 93

 Score = 37.0 bits (84), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 22/80 (27%), Positives = 42/80 (52%), Gaps = 3/80 (3%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  V I  
Sbjct: 8   LAIRLYIQPKASRDQIIGLHGDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSHVIIEK 67

Query: 100 GSKSRD---KIVIVEEITPE 116
           G   R    KI+  ++I P+
Sbjct: 68  GDLGRHKQIKIINPQQIPPQ 87


>gi|381167909|ref|ZP_09877114.1| conserved hypothetical protein [Phaeospirillum molischianum DSM
           120]
 gi|380682985|emb|CCG41926.1| conserved hypothetical protein [Phaeospirillum molischianum DSM
           120]
          Length = 109

 Score = 37.0 bits (84), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 5/76 (6%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           + ++I   P S    +  V+ EA G     VQ+ A  +DG+ANAAL++ ++    + R  
Sbjct: 15  LRVSIRLTPKSSRDQVRGVAVEADGHAVLKVQVTAVPEDGKANAALVKLLAKTWRLPRTT 74

Query: 95  VSIGSGSKSRDKIVIV 110
           + I  G+  R K+V++
Sbjct: 75  MDIVQGTTDRRKVVLI 90


>gi|119384906|ref|YP_915962.1| hypothetical protein Pden_2174 [Paracoccus denitrificans PD1222]
 gi|189039514|sp|A1B422.1|Y2174_PARDP RecName: Full=UPF0235 protein Pden_2174
 gi|119374673|gb|ABL70266.1| protein of unknown function DUF167 [Paracoccus denitrificans
           PD1222]
          Length = 82

 Score = 37.0 bits (84), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 42/71 (59%), Gaps = 3/71 (4%)

Query: 42  ITIHAKP-GSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           I +   P  S++  I D   EA+ V +    +DG+ANAA+++ ++  LGV + ++ +  G
Sbjct: 14  IAVRVTPRASRNAVILD--GEAIRVTVTTVPEDGKANAAVVKLLAKALGVAKSRLVLVRG 71

Query: 101 SKSRDKIVIVE 111
           + +RDK+  ++
Sbjct: 72  ATARDKLFRID 82


>gi|406877891|gb|EKD26987.1| hypothetical protein ACD_79C00944G0002 [uncultured bacterium]
          Length = 115

 Score = 37.0 bits (84), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/71 (23%), Positives = 40/71 (56%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I   P +    + +V+ + + + I AP  D +AN  L+ +++  L + + ++ I SG+
Sbjct: 35  LHIRVSPMANVSVVKEVTPDFIKIAIKAPPVDNKANKELINFIAKKLKLPKSKILIKSGT 94

Query: 102 KSRDKIVIVEE 112
            S++K++ + E
Sbjct: 95  NSKNKVLEILE 105


>gi|238759331|ref|ZP_04620497.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236]
 gi|238702492|gb|EEP95043.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236]
          Length = 90

 Score = 37.0 bits (84), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V + QV +  G 
Sbjct: 7   LRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQVILEKGE 66

Query: 102 KSRDKIVIVEEITPENV 118
             R K + +  I P+ V
Sbjct: 67  LGRHKQIKI--INPQQV 81


>gi|88859056|ref|ZP_01133697.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2]
 gi|88819282|gb|EAR29096.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2]
          Length = 101

 Score = 37.0 bits (84), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 17/67 (25%), Positives = 35/67 (52%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +++ ++ +P +       +    + V I AP  DG+AN+ L+++++    V + QV I  
Sbjct: 11  LTLRLYVQPKASQDKFIGLHGNELKVAITAPPVDGQANSHLIKFLAKQCKVAKNQVCIKK 70

Query: 100 GSKSRDK 106
           G + R K
Sbjct: 71  GLQGRHK 77


>gi|452963721|gb|EME68780.1| hypothetical protein H261_16555 [Magnetospirillum sp. SO-1]
          Length = 108

 Score = 37.0 bits (84), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 39/76 (51%), Gaps = 5/76 (6%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           V + +   P +    I   +DEA G      Q+ A  +DG+ANAALL+ +S    + +  
Sbjct: 13  VKVALRLTPKASRDRIVGAADEADGSVVLKAQVTAVPEDGKANAALLKLLSKAWKIPKSD 72

Query: 95  VSIGSGSKSRDKIVIV 110
           + I  G+  R K++++
Sbjct: 73  MDIVLGATDRRKVILI 88


>gi|251788394|ref|YP_003003115.1| hypothetical protein Dd1591_0757 [Dickeya zeae Ech1591]
 gi|247537015|gb|ACT05636.1| protein of unknown function DUF167 [Dickeya zeae Ech1591]
          Length = 100

 Score = 37.0 bits (84), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 2/79 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  V+I  G 
Sbjct: 17  IRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGMVTIEKGE 76

Query: 102 KSRDK-IVIVE-EITPENV 118
             R K I IV  ++ P +V
Sbjct: 77  LGRHKQIRIVNPQVIPADV 95


>gi|217972515|ref|YP_002357266.1| hypothetical protein Sbal223_1335 [Shewanella baltica OS223]
 gi|254800042|sp|B8E927.1|Y1335_SHEB2 RecName: Full=UPF0235 protein Sbal223_1335
 gi|217497650|gb|ACK45843.1| protein of unknown function DUF167 [Shewanella baltica OS223]
          Length = 99

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 2/87 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  + I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSDIHIMKGE 72

Query: 102 KSRDKI--VIVEEITPENVLNSLGKAS 126
             R K   VI  +I P  +   +G+ S
Sbjct: 73  LGRHKQIRVIDPKIIPSVITELMGQTS 99


>gi|386000726|ref|YP_005919025.1| hypothetical protein Mhar_0009 [Methanosaeta harundinacea 6Ac]
 gi|357208782|gb|AET63402.1| hypothetical protein Mhar_0009 [Methanosaeta harundinacea 6Ac]
          Length = 111

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 3/89 (3%)

Query: 37  PSSVSITIHAKPGSKSCSI---TDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           P  V I     PGS    +    +   +A+  ++   A+ G AN  L   ++ + G+   
Sbjct: 15  PQGVLIRFEVAPGSSRLLVPSGYNPWRKALEAKLTERAERGRANQQLESAVAEIFGIPSH 74

Query: 94  QVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           +V + SG+KS  K VIV  + PE     L
Sbjct: 75  RVQVASGTKSYRKAVIVLGVGPEEAARRL 103


>gi|20093713|ref|NP_613560.1| hypothetical protein MK0273 [Methanopyrus kandleri AV19]
 gi|29839574|sp|Q8TYM3.1|Y273_METKA RecName: Full=UPF0235 protein MK0273
 gi|19886604|gb|AAM01490.1| Uncharacterized conserved protein [Methanopyrus kandleri AV19]
          Length = 96

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 7/85 (8%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGV-QID--APAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           I +   P + +  +  V DE  GV ++D  AP   G+AN  LLE++   L        + 
Sbjct: 14  IRVRVNPDADTTDLKGV-DEWRGVLEVDVAAPPVKGKANRELLEFLGRKLNT---TCELV 69

Query: 99  SGSKSRDKIVIVEEITPENVLNSLG 123
           SG KSR+K+V+  +++ + V   LG
Sbjct: 70  SGEKSREKLVLARDVSVDEVKERLG 94


>gi|343505675|ref|ZP_08743232.1| hypothetical protein VII00023_16791 [Vibrio ichthyoenteri ATCC
           700023]
 gi|342806439|gb|EGU41661.1| hypothetical protein VII00023_16791 [Vibrio ichthyoenteri ATCC
           700023]
          Length = 96

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 34/69 (49%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E V + I AP  DG+ANA L +Y++    V +  V I  
Sbjct: 12  IVLRLYIQPKASRDKIVGLHGEEVKIAITAPPVDGKANAHLTKYLAKQFKVAKGLVKIEK 71

Query: 100 GSKSRDKIV 108
           G   R K V
Sbjct: 72  GELGRHKQV 80


>gi|86359493|ref|YP_471385.1| hypothetical protein RHE_CH03912 [Rhizobium etli CFN 42]
 gi|123510540|sp|Q2K3C8.1|Y3912_RHIEC RecName: Full=UPF0235 protein RHE_CH03912
 gi|86283595|gb|ABC92658.1| hypothetical conserved protein [Rhizobium etli CFN 42]
          Length = 112

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 12/108 (11%)

Query: 25  DENLPSCIRLVPP-----SSVSITIHAKP--GSKSCSITDVSDEA---VGVQIDAPAKDG 74
           D  LP+  RL PP       + +T+   P  G  +    D   E    +G ++ A  + G
Sbjct: 2   DHPLPAG-RLSPPWKLFADHLRLTVRLTPNGGRDAFDGIDTDSEGETYLGARVTAVPEKG 60

Query: 75  EANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSL 122
           +AN AL+  +S  +GV +  VS+ SG  +R KI+ +E   PE++   L
Sbjct: 61  KANKALIALVSKSVGVAKSSVSVISGETARKKILRIEG-DPEDLARKL 107


>gi|126175230|ref|YP_001051379.1| hypothetical protein Sbal_3028 [Shewanella baltica OS155]
 gi|153001556|ref|YP_001367237.1| hypothetical protein Shew185_3043 [Shewanella baltica OS185]
 gi|160876292|ref|YP_001555608.1| hypothetical protein Sbal195_3186 [Shewanella baltica OS195]
 gi|373950379|ref|ZP_09610340.1| UPF0235 protein yggU [Shewanella baltica OS183]
 gi|378709492|ref|YP_005274386.1| hypothetical protein [Shewanella baltica OS678]
 gi|386323783|ref|YP_006019900.1| hypothetical protein [Shewanella baltica BA175]
 gi|386341982|ref|YP_006038348.1| hypothetical protein [Shewanella baltica OS117]
 gi|418024039|ref|ZP_12663023.1| UPF0235 protein yggU [Shewanella baltica OS625]
 gi|166229359|sp|A3D6Z7.1|Y3028_SHEB5 RecName: Full=UPF0235 protein Sbal_3028
 gi|166229364|sp|A6WQT5.1|Y3043_SHEB8 RecName: Full=UPF0235 protein Shew185_3043
 gi|189039841|sp|A9KXP6.1|Y3186_SHEB9 RecName: Full=UPF0235 protein Sbal195_3186
 gi|125998435|gb|ABN62510.1| protein of unknown function DUF167 [Shewanella baltica OS155]
 gi|151366174|gb|ABS09174.1| protein of unknown function DUF167 [Shewanella baltica OS185]
 gi|160861814|gb|ABX50348.1| protein of unknown function DUF167 [Shewanella baltica OS195]
 gi|315268481|gb|ADT95334.1| protein of unknown function DUF167 [Shewanella baltica OS678]
 gi|333817928|gb|AEG10594.1| UPF0235 protein yggU [Shewanella baltica BA175]
 gi|334864383|gb|AEH14854.1| UPF0235 protein yggU [Shewanella baltica OS117]
 gi|353536912|gb|EHC06470.1| UPF0235 protein yggU [Shewanella baltica OS625]
 gi|373886979|gb|EHQ15871.1| UPF0235 protein yggU [Shewanella baltica OS183]
          Length = 99

 Score = 37.0 bits (84), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 2/87 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  + I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSDIHIMKGE 72

Query: 102 KSRDKI--VIVEEITPENVLNSLGKAS 126
             R K   VI  +I P  +   +G+ S
Sbjct: 73  LGRHKQIRVIDPKIIPSIITELMGQTS 99


>gi|336320054|ref|YP_004600022.1| hypothetical protein Celgi_0935 [[Cellvibrio] gilvus ATCC 13127]
 gi|336103635|gb|AEI11454.1| protein of unknown function DUF167 [[Cellvibrio] gilvus ATCC 13127]
          Length = 89

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 44/77 (57%), Gaps = 1/77 (1%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + ++I  +PG+    +      A+ V + A A DG A  A L  ++  LGV+RR V + +
Sbjct: 1   MRVSIRVRPGASRTRVGGEHAGALVVAVGARAVDGAATEAALAALADALGVRRRHVRLVT 60

Query: 100 GSKSRDKIVIVEEITPE 116
           G+ SRDK++ VE + PE
Sbjct: 61  GATSRDKVLDVEGL-PE 76


>gi|198282577|ref|YP_002218898.1| hypothetical protein Lferr_0437 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|218665107|ref|YP_002424767.1| hypothetical protein AFE_0260 [Acidithiobacillus ferrooxidans ATCC
           23270]
 gi|415987455|ref|ZP_11559767.1| hypothetical protein GGI1_14254 [Acidithiobacillus sp. GGI-221]
 gi|198247098|gb|ACH82691.1| protein of unknown function DUF167 [Acidithiobacillus ferrooxidans
           ATCC 53993]
 gi|218517320|gb|ACK77906.1| conserved hypothetical protein TIGR00251, putative
           [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|339834943|gb|EGQ62668.1| hypothetical protein GGI1_14254 [Acidithiobacillus sp. GGI-221]
          Length = 116

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 23  TNDENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLE 82
           T+ E  P  +R      + + IH +PG+K+ +I     +A+ +++ A   DG ANAAL  
Sbjct: 2   TSSETRPPYLR-ADGDDLYVRIHVQPGAKTDAIGGCHGDALKIRLRARPVDGAANAALSA 60

Query: 83  YMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
            + + L +KR+Q+++  G  +RDK++ +
Sbjct: 61  LLCATLRLKRQQITLVQGESARDKVLRI 88


>gi|365921341|ref|ZP_09445626.1| TIGR00251 family protein [Cardiobacterium valvarum F0432]
 gi|364576368|gb|EHM53697.1| TIGR00251 family protein [Cardiobacterium valvarum F0432]
          Length = 97

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 17/53 (32%), Positives = 31/53 (58%)

Query: 58  VSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           + D+   + I    +DG+ANA L  +++   GV +R V++ SG +S  KI+ +
Sbjct: 29  IHDDRHKIAITTAPEDGKANAYLCRWLAEQFGVSKRAVNVQSGERSPLKILAI 81


>gi|270308462|ref|YP_003330520.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS]
 gi|270154354|gb|ACZ62192.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS]
          Length = 97

 Score = 37.0 bits (84), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 18/74 (24%), Positives = 40/74 (54%), Gaps = 2/74 (2%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P  V++ I   P S+   ++   +  + ++I A  + G+AN  L++Y+S +L   + ++ 
Sbjct: 7   PFRVNLKIF--PSSQRNELSGYENGLLKLRIAAQPEKGKANKELIDYLSELLDTPKAEIE 64

Query: 97  IGSGSKSRDKIVIV 110
           I  G   R+K+++ 
Sbjct: 65  ICRGHTGRNKVLVF 78


>gi|145509569|ref|XP_001440723.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407951|emb|CAK73326.1| unnamed protein product [Paramecium tetraurelia]
          Length = 99

 Score = 36.6 bits (83), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 21/75 (28%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P    + +  KP SK+  + + S+E V  +  A   D EAN  ++  +S  L + +  + 
Sbjct: 15  PKGFYLQLFVKPKSKA-EMLEFSEEFVIAKTKAQPIDNEANEDVIRMLSEKLSIDQSSIK 73

Query: 97  IGSGSKSRDKIVIVE 111
           I  G +S+ K V +E
Sbjct: 74  IVKGQQSKYKTVFIE 88


>gi|399039651|ref|ZP_10735160.1| hypothetical protein PMI09_02708 [Rhizobium sp. CF122]
 gi|398062064|gb|EJL53845.1| hypothetical protein PMI09_02708 [Rhizobium sp. CF122]
          Length = 104

 Score = 36.6 bits (83), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 6/88 (6%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           +S+ +   P     +I  V  +A G     V++ A  + G+AN AL+  ++  LGV +  
Sbjct: 12  ISLAVRLTPNGGRDAIDGVEIDAEGKSHLKVRVTAVPEKGKANKALVALVAKSLGVAKSS 71

Query: 95  VSIGSGSKSRDKIVIVEEITPENVLNSL 122
           V I SG  SR KI+ ++   PE+++  L
Sbjct: 72  VRIASGDTSRKKILRIDS-DPEDLIRKL 98


>gi|374635515|ref|ZP_09707112.1| protein of unknown function DUF167 [Methanotorris formicicus
           Mc-S-70]
 gi|373562052|gb|EHP88273.1| protein of unknown function DUF167 [Methanotorris formicicus
           Mc-S-70]
          Length = 100

 Score = 36.6 bits (83), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 24/85 (28%), Positives = 45/85 (52%), Gaps = 5/85 (5%)

Query: 40  VSITIHAKPGSKSCSITDVSD--EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           + I I   P +K   I  +++    + V+I A AKDG+AN  ++++   +    ++ V I
Sbjct: 13  ILINIDVTPNAKKNEIGSINEWRNKLEVKIKAMAKDGKANKEIVKFFKDIF---KKDVEI 69

Query: 98  GSGSKSRDKIVIVEEITPENVLNSL 122
            SGS S  K +++++   E V+  L
Sbjct: 70  VSGSTSPQKTLLIKDAKKEEVVKIL 94


>gi|271501907|ref|YP_003334933.1| hypothetical protein Dd586_3394 [Dickeya dadantii Ech586]
 gi|270345462|gb|ACZ78227.1| protein of unknown function DUF167 [Dickeya dadantii Ech586]
          Length = 96

 Score = 36.6 bits (83), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 34/65 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  V+I  G 
Sbjct: 13  IRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGMVTIEKGE 72

Query: 102 KSRDK 106
             R K
Sbjct: 73  LGRHK 77


>gi|90580282|ref|ZP_01236089.1| hypothetical protein VAS14_20161 [Photobacterium angustum S14]
 gi|90438584|gb|EAS63768.1| hypothetical protein VAS14_20161 [Vibrio angustum S14]
          Length = 97

 Score = 36.6 bits (83), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 19/70 (27%), Positives = 36/70 (51%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +    V + I AP  DG+ANA L++Y++    V +  + +  G 
Sbjct: 15  IRLYIQPKASRDQIVGLHGNEVKIAITAPPVDGKANAHLVKYLAKQFKVAKGLIHVEKGL 74

Query: 102 KSRDKIVIVE 111
           + R K + +E
Sbjct: 75  QGRHKQIRIE 84


>gi|295691445|ref|YP_003595138.1| hypothetical protein Cseg_4109 [Caulobacter segnis ATCC 21756]
 gi|295433348|gb|ADG12520.1| protein of unknown function DUF167 [Caulobacter segnis ATCC 21756]
          Length = 94

 Score = 36.6 bits (83), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 17/59 (28%), Positives = 34/59 (57%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLNSLG 123
           V++ +P  DG ANAAL+ +++  L + R  V + SG  +R K + ++++   +   + G
Sbjct: 33  VRVASPPVDGAANAALIAFLAKSLKIPRSAVRLASGETARIKRLEIDDVDQADFTRAFG 91


>gi|307132433|ref|YP_003884449.1| hypothetical protein Dda3937_02598 [Dickeya dadantii 3937]
 gi|306529962|gb|ADM99892.1| conserved protein [Dickeya dadantii 3937]
          Length = 100

 Score = 36.6 bits (83), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 34/65 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I ++ +P +    I  +  + + V I AP  DG+ANA L+++++    V +  V+I  G 
Sbjct: 17  IRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLIKFLARQFRVAKGMVTIEKGE 76

Query: 102 KSRDK 106
             R K
Sbjct: 77  LGRHK 81


>gi|339499774|ref|YP_004697809.1| hypothetical protein Spica_1155 [Spirochaeta caldaria DSM 7334]
 gi|338834123|gb|AEJ19301.1| UPF0235 protein yggU [Spirochaeta caldaria DSM 7334]
          Length = 99

 Score = 36.6 bits (83), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/67 (28%), Positives = 36/67 (53%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           I I A PG+    I  +++  + + I A  ++G+AN  L+ +++   G  +  + + SG 
Sbjct: 14  INIKAVPGASKTEIKGLAENRIRITIAAAPENGKANEELIAFLAKKAGCSKSHIKLVSGD 73

Query: 102 KSRDKIV 108
           KSR K +
Sbjct: 74  KSRLKTL 80


>gi|225620549|ref|YP_002721806.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1]
 gi|225215368|gb|ACN84102.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1]
          Length = 56

 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 69  APAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           A A DG+AN A++++++  L +K+R V I  G K+  K++
Sbjct: 2   AKAIDGKANKAIIDFLADELNIKKRDVEILKGEKNSKKLI 41


>gi|406993875|gb|EKE12956.1| hypothetical protein ACD_13C00110G0004 [uncultured bacterium]
          Length = 72

 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 39/69 (56%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + ++I A P +K+  I      A  V ++AP  +G+AN A++E ++    V R +V +  
Sbjct: 1   MKLSIIAHPNTKNPRIEKDLLGATHVYVNAPPLEGKANRAVVEALAKHFEVSRGKVILLW 60

Query: 100 GSKSRDKIV 108
           G KS++K+ 
Sbjct: 61  GEKSKNKVF 69


>gi|384207702|ref|YP_005593422.1| hypothetical protein Bint_0208 [Brachyspira intermedia PWS/A]
 gi|343385352|gb|AEM20842.1| hypothetical protein Bint_0208 [Brachyspira intermedia PWS/A]
          Length = 56

 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 27/40 (67%)

Query: 69  APAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIV 108
           A A DG+AN A++++++  L +K+R V I  G K+  K++
Sbjct: 2   AKAIDGKANKAIIDFLADELNIKKRDVEILKGEKNSKKLI 41


>gi|113969534|ref|YP_733327.1| hypothetical protein Shewmr4_1190 [Shewanella sp. MR-4]
 gi|123130613|sp|Q0HKZ7.1|Y1190_SHESM RecName: Full=UPF0235 protein Shewmr4_1190
 gi|113884218|gb|ABI38270.1| conserved hypothetical protein [Shewanella sp. MR-4]
          Length = 96

 Score = 36.6 bits (83), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  V I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSDVHILKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +    P+NV
Sbjct: 73  LGRHKQVRIS--APKNV 87


>gi|171914386|ref|ZP_02929856.1| hypothetical protein VspiD_24445 [Verrucomicrobium spinosum DSM
           4136]
          Length = 92

 Score = 36.6 bits (83), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 5/77 (6%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVG-----VQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           SV++     P ++   I     +  G     V++ APA +G+AN  L+ +++  LG  + 
Sbjct: 5   SVNLACKVTPNARRSEIVGWGADEQGRGVLLVKLAAPALEGKANKELVRFLAEQLGCAKG 64

Query: 94  QVSIGSGSKSRDKIVIV 110
           +VS+  G  SR K++ V
Sbjct: 65  EVSLLRGDASRTKLLRV 81


>gi|225849498|ref|YP_002729663.1| hypothetical protein SULAZ_1705 [Sulfurihydrogenibium azorense
           Az-Fu1]
 gi|225644124|gb|ACN99174.1| conserved domain protein [Sulfurihydrogenibium azorense Az-Fu1]
          Length = 72

 Score = 36.6 bits (83), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + I +  KPGS    +  + +    V+     + G+AN  ++E MS  L V + ++ I  
Sbjct: 1   MRIKVKVKPGSSKNEVKKIEENFYEVRCTTIPEKGKANDKVIELMSEYLDVPKSRIKIIK 60

Query: 100 GSKSRDKIVIVE 111
           G  SR+K + +E
Sbjct: 61  GHSSREKEIEIE 72


>gi|407001629|gb|EKE18576.1| hypothetical protein ACD_9C00291G0007 [uncultured bacterium]
          Length = 75

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 33/67 (49%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I   P S    +  VS+    V++ AP  DG AN AL++ +S    + +  + I  G 
Sbjct: 3   VYIKVSPRSSKNEVVKVSEGGYKVKLTAPPVDGAANDALIKLLSQYFKISKSSIKIVGGK 62

Query: 102 KSRDKIV 108
            ++ KIV
Sbjct: 63  TAKTKIV 69


>gi|84685306|ref|ZP_01013204.1| hypothetical protein 1099457000258_RB2654_10573 [Maritimibacter
           alkaliphilus HTCC2654]
 gi|84666463|gb|EAQ12935.1| hypothetical protein RB2654_10573 [Rhodobacterales bacterium
           HTCC2654]
          Length = 85

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 20/77 (25%), Positives = 44/77 (57%), Gaps = 2/77 (2%)

Query: 35  VPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQ 94
           VP S +++ +  K    + S+ +  D  + V +    +DG+A AA+++ ++  LGV +  
Sbjct: 11  VPGSELTVRVTPKASRNAVSVDE--DGHLKVSVTTVPEDGKATAAVVKLLAHALGVAKSD 68

Query: 95  VSIGSGSKSRDKIVIVE 111
           +++  G+ SRDK+  ++
Sbjct: 69  LTLVRGATSRDKVFRID 85


>gi|318041795|ref|ZP_07973751.1| hypothetical protein SCB01_08799 [Synechococcus sp. CB0101]
          Length = 97

 Score = 36.6 bits (83), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 22/81 (27%), Positives = 39/81 (48%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I  +P +    +  +  EA+ + + AP  DG AN ALL+ ++  L V    V +  G+
Sbjct: 12  VAIRVQPRASRERVLGLRGEAIAIALKAPPVDGAANEALLKLIARQLKVSAAAVELVRGA 71

Query: 102 KSRDKIVIVEEITPENVLNSL 122
             R K + V   + + V  +L
Sbjct: 72  SGRSKWIRVAGWSADQVRAAL 92


>gi|212557909|gb|ACJ30363.1| Cytosolic protein, putative [Shewanella piezotolerans WP3]
          Length = 90

 Score = 36.2 bits (82), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 2/86 (2%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + ++ +P +    I  +    + + I AP  DG+AN  L +Y+S    V +  + I 
Sbjct: 5   NLLLQLYVQPKASRDQIVGLHGNELKIAITAPPIDGKANIHLAKYLSKAFKVPKGDIDIL 64

Query: 99  SGSKSRDKIVIVE--EITPENVLNSL 122
            G   R K V++   ++ P  + N L
Sbjct: 65  KGLTGRHKQVLISSPKVIPPEIQNLL 90


>gi|406996655|gb|EKE14955.1| hypothetical protein ACD_12C00223G0003 [uncultured bacterium]
          Length = 72

 Score = 36.2 bits (82), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 18/69 (26%), Positives = 35/69 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + + AKPG+K   I  + +    + +  P   G+AN A++E ++      +    + S
Sbjct: 1   MKVFVKAKPGAKEEKIEKIDETHFVIIVKEPPVRGKANKAIIEVLAQYFKTNKSACRLIS 60

Query: 100 GSKSRDKIV 108
           G  S++KIV
Sbjct: 61  GFSSKNKIV 69


>gi|350561267|ref|ZP_08930106.1| protein of unknown function DUF167 [Thioalkalivibrio
           thiocyanoxidans ARh 4]
 gi|349781374|gb|EGZ35682.1| protein of unknown function DUF167 [Thioalkalivibrio
           thiocyanoxidans ARh 4]
          Length = 79

 Score = 36.2 bits (82), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 32/72 (44%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P    + +  KPG++   +    +     ++ AP  DG AN  L+  ++    V R QV 
Sbjct: 4   PGKRLLQVRVKPGARVSELQSDPNGPWTARVKAPPVDGRANDELVALIARYFKVGRSQVR 63

Query: 97  IGSGSKSRDKIV 108
           I  G+  R K V
Sbjct: 64  IRRGASGRSKWV 75


>gi|206889548|ref|YP_002248254.1| hypothetical protein THEYE_A0407 [Thermodesulfovibrio yellowstonii
           DSM 11347]
 gi|206741486|gb|ACI20543.1| conserved hypothetical protein [Thermodesulfovibrio yellowstonii
           DSM 11347]
          Length = 87

 Score = 36.2 bits (82), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 18/67 (26%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  K G+K   I  +    + +++ A   DG AN  L+E +S +L + + ++ I  G 
Sbjct: 14  LKVLVKTGAKITGIGGIEGNTLKLRLAAQPHDGLANKELIEMLSEILNIPKSRIEIIKGK 73

Query: 102 KSRDKIV 108
            S+ KI+
Sbjct: 74  TSKHKII 80


>gi|117919640|ref|YP_868832.1| hypothetical protein Shewana3_1191 [Shewanella sp. ANA-3]
 gi|166228912|sp|A0KUF7.1|Y1191_SHESA RecName: Full=UPF0235 protein Shewana3_1191
 gi|117611972|gb|ABK47426.1| conserved hypothetical protein [Shewanella sp. ANA-3]
          Length = 96

 Score = 36.2 bits (82), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  V I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSDVHILKGE 72

Query: 102 KSRDKIVIVEEITPENV 118
             R K V +    P+NV
Sbjct: 73  LGRHKQVRIS--PPKNV 87


>gi|383776702|ref|YP_005461268.1| hypothetical protein AMIS_15320 [Actinoplanes missouriensis 431]
 gi|381369934|dbj|BAL86752.1| hypothetical protein AMIS_15320 [Actinoplanes missouriensis 431]
          Length = 88

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 36/75 (48%), Gaps = 4/75 (5%)

Query: 44  IHAKPGSKSCSITDVSDEAVG----VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           +  +PG+    +    D   G    V + APA DG A  A+   ++  LGV+  +V +  
Sbjct: 2   VRVRPGAGRNRVGGRYDGPHGPALIVAVSAPAVDGRATEAVRRALAEALGVRPAEVELRI 61

Query: 100 GSKSRDKIVIVEEIT 114
           G+ SRDK+  V   T
Sbjct: 62  GATSRDKVFTVTAAT 76


>gi|359425843|ref|ZP_09216935.1| hypothetical protein GOAMR_54_00270 [Gordonia amarae NBRC 15530]
 gi|358238840|dbj|GAB06517.1| hypothetical protein GOAMR_54_00270 [Gordonia amarae NBRC 15530]
          Length = 76

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 39  SVSITIHAKPGSKSCSITDVSDE-AVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
           S ++ +  KP S    + +  +E A+ + +  PA +G+AN A+ E ++  LGV + +V++
Sbjct: 3   SQTVVVTVKPNSSKGPLVEADEEGAITIFVREPATEGKANKAVGELLAKHLGVPKSKVAL 62

Query: 98  GSGSKSRDK 106
             G+ +R K
Sbjct: 63  IGGATARTK 71


>gi|428300912|ref|YP_007139218.1| hypothetical protein Cal6303_4339 [Calothrix sp. PCC 6303]
 gi|428237456|gb|AFZ03246.1| UPF0235 protein yggU [Calothrix sp. PCC 6303]
          Length = 74

 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           IT+  KP S    I ++ D ++ V + +   DG+AN  L++ ++    V +  + I  G 
Sbjct: 4   ITVVVKPNSHRQKIEELEDGSLKVYLKSSPVDGKANKELIQVLADRYNVPKSHILIKLGL 63

Query: 102 KSRDKIV 108
            SR K+V
Sbjct: 64  ASRQKVV 70


>gi|424658356|ref|ZP_18095613.1| hypothetical protein VCHE16_0509 [Vibrio cholerae HE-16]
 gi|408055246|gb|EKG90184.1| hypothetical protein VCHE16_0509 [Vibrio cholerae HE-16]
          Length = 96

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|422921712|ref|ZP_16954922.1| hypothetical protein VCBJG01_0465 [Vibrio cholerae BJG-01]
 gi|341648215|gb|EGS72280.1| hypothetical protein VCBJG01_0465 [Vibrio cholerae BJG-01]
          Length = 96

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|242799550|ref|XP_002483404.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218716749|gb|EED16170.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 124

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 39 SVSITIHAKPGSKSCS-ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSI 97
          ++ I  H  PGS+    I  + +E V V + +  + GEAN A+   +S VLG  +  V I
Sbjct: 23 TLLIRCHVTPGSRGFQGIKQIHNEQVYVHVGSEPRKGEANTAVARVLSEVLGFPKSDVII 82


>gi|297580597|ref|ZP_06942523.1| conserved hypothetical protein [Vibrio cholerae RC385]
 gi|297535013|gb|EFH73848.1| conserved hypothetical protein [Vibrio cholerae RC385]
          Length = 96

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|87198097|ref|YP_495354.1| hypothetical protein Saro_0071 [Novosphingobium aromaticivorans DSM
           12444]
 gi|87133778|gb|ABD24520.1| protein of unknown function DUF167 [Novosphingobium aromaticivorans
           DSM 12444]
          Length = 95

 Score = 36.2 bits (82), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +   PG+KS  I ++    V V++ A  +DG+A AA+ E ++  LG+   +V +  G+
Sbjct: 24  LAVRVTPGAKSEGI-EIDGGRVLVKVRAKPEDGKATAAVQELLARALGLAPSKVEMLRGA 82

Query: 102 KSRDKIV 108
            SR+K+ 
Sbjct: 83  TSREKLF 89


>gi|294634379|ref|ZP_06712916.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685]
 gi|451966566|ref|ZP_21919819.1| hypothetical protein ET1_14_02350 [Edwardsiella tarda NBRC 105688]
 gi|291092187|gb|EFE24748.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685]
 gi|451314867|dbj|GAC65181.1| hypothetical protein ET1_14_02350 [Edwardsiella tarda NBRC 105688]
          Length = 96

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 18/65 (27%), Positives = 34/65 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  E + V I AP  DG+ANA L+++++    V +  ++I  G 
Sbjct: 13  LRLYIQPKASRDQIVGLHGEELKVAITAPPVDGQANAHLIKFIAKQFRVAKSLITIEKGE 72

Query: 102 KSRDK 106
             R K
Sbjct: 73  LGRHK 77


>gi|388602655|ref|ZP_10161051.1| hypothetical protein VcamD_22495 [Vibrio campbellii DS40M4]
          Length = 96

 Score = 36.2 bits (82), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    V +  V I  
Sbjct: 12  VVLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGLVHIEK 71

Query: 100 GSKSRDKIVIVE 111
           G   R K + +E
Sbjct: 72  GELGRHKQIRIE 83


>gi|229530300|ref|ZP_04419688.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)]
 gi|384423759|ref|YP_005633117.1| hypothetical protein VCLMA_A0416 [Vibrio cholerae LMA3984-4]
 gi|229332073|gb|EEN97561.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)]
 gi|327483312|gb|AEA77719.1| UPF0235 protein [Vibrio cholerae LMA3984-4]
          Length = 96

 Score = 36.2 bits (82), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|365541055|ref|ZP_09366230.1| hypothetical protein VordA3_15507 [Vibrio ordalii ATCC 33509]
          Length = 96

 Score = 36.2 bits (82), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 20/79 (25%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + + I AP  DG+ANA L ++++    V +  + I  G 
Sbjct: 14  LKLYIQPKASRDSIVGLHGEELKIAITAPPVDGKANAHLTQFLAKQFKVAKGLIEIEKGE 73

Query: 102 KSRDKIVIVE--EITPENV 118
             R K V +   ++ P+ +
Sbjct: 74  LGRHKQVRIHSPQVIPQPI 92


>gi|153828377|ref|ZP_01981044.1| conserved hypothetical protein [Vibrio cholerae 623-39]
 gi|229519844|ref|ZP_04409278.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80]
 gi|417819343|ref|ZP_12465960.1| hypothetical protein VCHE39_0818 [Vibrio cholerae HE39]
 gi|417823596|ref|ZP_12470188.1| hypothetical protein VCHE48_1515 [Vibrio cholerae HE48]
 gi|419829034|ref|ZP_14352523.1| hypothetical protein VCHC1A2_1398 [Vibrio cholerae HC-1A2]
 gi|419831815|ref|ZP_14355282.1| hypothetical protein VCHC61A2_0442 [Vibrio cholerae HC-61A2]
 gi|421353281|ref|ZP_15803615.1| hypothetical protein VCHE45_0604 [Vibrio cholerae HE-45]
 gi|422305891|ref|ZP_16393078.1| hypothetical protein VCCP1035_0510 [Vibrio cholerae CP1035(8)]
 gi|422908995|ref|ZP_16943647.1| hypothetical protein VCHE09_0473 [Vibrio cholerae HE-09]
 gi|422916199|ref|ZP_16950540.1| hypothetical protein VCHC02A1_0505 [Vibrio cholerae HC-02A1]
 gi|423816157|ref|ZP_17715143.1| hypothetical protein VCHC55C2_0515 [Vibrio cholerae HC-55C2]
 gi|423848221|ref|ZP_17718930.1| hypothetical protein VCHC59A1_0557 [Vibrio cholerae HC-59A1]
 gi|423878800|ref|ZP_17722538.1| hypothetical protein VCHC60A1_0507 [Vibrio cholerae HC-60A1]
 gi|423946497|ref|ZP_17733405.1| hypothetical protein VCHE40_0475 [Vibrio cholerae HE-40]
 gi|423975656|ref|ZP_17736954.1| hypothetical protein VCHE46_0479 [Vibrio cholerae HE-46]
 gi|423996620|ref|ZP_17739886.1| hypothetical protein VCHC02C1_0516 [Vibrio cholerae HC-02C1]
 gi|424015317|ref|ZP_17755167.1| hypothetical protein VCHC55B2_0506 [Vibrio cholerae HC-55B2]
 gi|424018431|ref|ZP_17758233.1| hypothetical protein VCHC59B1_0515 [Vibrio cholerae HC-59B1]
 gi|424623803|ref|ZP_18062283.1| hypothetical protein VCHC50A1_0509 [Vibrio cholerae HC-50A1]
 gi|424628378|ref|ZP_18066687.1| hypothetical protein VCHC51A1_0505 [Vibrio cholerae HC-51A1]
 gi|424632334|ref|ZP_18070453.1| hypothetical protein VCHC52A1_0510 [Vibrio cholerae HC-52A1]
 gi|424635421|ref|ZP_18073445.1| hypothetical protein VCHC55A1_0514 [Vibrio cholerae HC-55A1]
 gi|424639212|ref|ZP_18077112.1| hypothetical protein VCHC56A1_0478 [Vibrio cholerae HC-56A1]
 gi|424647495|ref|ZP_18085175.1| hypothetical protein VCHC57A1_0508 [Vibrio cholerae HC-57A1]
 gi|443526351|ref|ZP_21092436.1| hypothetical protein VCHC78A1_00498 [Vibrio cholerae HC-78A1]
 gi|148876086|gb|EDL74221.1| conserved hypothetical protein [Vibrio cholerae 623-39]
 gi|229343132|gb|EEO08116.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80]
 gi|340041199|gb|EGR02166.1| hypothetical protein VCHE39_0818 [Vibrio cholerae HE39]
 gi|340048225|gb|EGR09147.1| hypothetical protein VCHE48_1515 [Vibrio cholerae HE48]
 gi|341636077|gb|EGS60780.1| hypothetical protein VCHE09_0473 [Vibrio cholerae HE-09]
 gi|341640719|gb|EGS65298.1| hypothetical protein VCHC02A1_0505 [Vibrio cholerae HC-02A1]
 gi|395955054|gb|EJH65659.1| hypothetical protein VCHE45_0604 [Vibrio cholerae HE-45]
 gi|408016251|gb|EKG53804.1| hypothetical protein VCHC50A1_0509 [Vibrio cholerae HC-50A1]
 gi|408021366|gb|EKG58623.1| hypothetical protein VCHC52A1_0510 [Vibrio cholerae HC-52A1]
 gi|408027591|gb|EKG64554.1| hypothetical protein VCHC56A1_0478 [Vibrio cholerae HC-56A1]
 gi|408027669|gb|EKG64628.1| hypothetical protein VCHC55A1_0514 [Vibrio cholerae HC-55A1]
 gi|408037290|gb|EKG73689.1| hypothetical protein VCHC57A1_0508 [Vibrio cholerae HC-57A1]
 gi|408059200|gb|EKG93972.1| hypothetical protein VCHC51A1_0505 [Vibrio cholerae HC-51A1]
 gi|408622223|gb|EKK95211.1| hypothetical protein VCHC1A2_1398 [Vibrio cholerae HC-1A2]
 gi|408627891|gb|EKL00681.1| hypothetical protein VCCP1035_0510 [Vibrio cholerae CP1035(8)]
 gi|408636828|gb|EKL08950.1| hypothetical protein VCHC55C2_0515 [Vibrio cholerae HC-55C2]
 gi|408644094|gb|EKL15800.1| hypothetical protein VCHC60A1_0507 [Vibrio cholerae HC-60A1]
 gi|408645206|gb|EKL16867.1| hypothetical protein VCHC59A1_0557 [Vibrio cholerae HC-59A1]
 gi|408652222|gb|EKL23447.1| hypothetical protein VCHC61A2_0442 [Vibrio cholerae HC-61A2]
 gi|408662087|gb|EKL33059.1| hypothetical protein VCHE40_0475 [Vibrio cholerae HE-40]
 gi|408666193|gb|EKL36990.1| hypothetical protein VCHE46_0479 [Vibrio cholerae HE-46]
 gi|408854525|gb|EKL94278.1| hypothetical protein VCHC02C1_0516 [Vibrio cholerae HC-02C1]
 gi|408862102|gb|EKM01651.1| hypothetical protein VCHC55B2_0506 [Vibrio cholerae HC-55B2]
 gi|408869977|gb|EKM09259.1| hypothetical protein VCHC59B1_0515 [Vibrio cholerae HC-59B1]
 gi|443455344|gb|ELT19126.1| hypothetical protein VCHC78A1_00498 [Vibrio cholerae HC-78A1]
          Length = 96

 Score = 35.8 bits (81), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|147674564|ref|YP_001215984.1| hypothetical protein VC0395_A0010 [Vibrio cholerae O395]
 gi|153216287|ref|ZP_01950380.1| conserved hypothetical protein [Vibrio cholerae 1587]
 gi|227116861|ref|YP_002818757.1| hypothetical protein VC395_0502 [Vibrio cholerae O395]
 gi|262167148|ref|ZP_06034862.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27]
 gi|424589734|ref|ZP_18029181.1| hypothetical protein VCCP103710_0503 [Vibrio cholerae CP1037(10)]
 gi|172047756|sp|A5F9H9.1|Y1210_VIBC3 RecName: Full=UPF0235 protein VC0395_A0010/VC395_0502
 gi|124114376|gb|EAY33196.1| conserved hypothetical protein [Vibrio cholerae 1587]
 gi|146316447|gb|ABQ20986.1| conserved hypothetical protein [Vibrio cholerae O395]
 gi|227012311|gb|ACP08521.1| conserved hypothetical protein [Vibrio cholerae O395]
 gi|262024448|gb|EEY43135.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27]
 gi|408036533|gb|EKG72959.1| hypothetical protein VCCP103710_0503 [Vibrio cholerae CP1037(10)]
          Length = 96

 Score = 35.8 bits (81), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|121728574|ref|ZP_01681595.1| conserved hypothetical protein [Vibrio cholerae V52]
 gi|153802603|ref|ZP_01957189.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
 gi|229512520|ref|ZP_04401991.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21]
 gi|254291169|ref|ZP_04961965.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
 gi|121629130|gb|EAX61573.1| conserved hypothetical protein [Vibrio cholerae V52]
 gi|124121866|gb|EAY40609.1| conserved hypothetical protein [Vibrio cholerae MZO-3]
 gi|150422863|gb|EDN14814.1| conserved hypothetical protein [Vibrio cholerae AM-19226]
 gi|229350413|gb|EEO15362.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21]
          Length = 96

 Score = 35.8 bits (81), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|153835271|ref|ZP_01987938.1| conserved hypothetical protein [Vibrio harveyi HY01]
 gi|156975836|ref|YP_001446743.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116]
 gi|269960447|ref|ZP_06174820.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
 gi|350532460|ref|ZP_08911401.1| hypothetical protein VrotD_15094 [Vibrio rotiferianus DAT722]
 gi|424031935|ref|ZP_17771358.1| hypothetical protein VCHENC01_0161 [Vibrio cholerae HENC-01]
 gi|424036637|ref|ZP_17775617.1| hypothetical protein VCHENC02_1936 [Vibrio cholerae HENC-02]
 gi|424045006|ref|ZP_17782575.1| hypothetical protein VCHENC03_0217 [Vibrio cholerae HENC-03]
 gi|444428100|ref|ZP_21223453.1| hypothetical protein B878_19115 [Vibrio campbellii CAIM 519 = NBRC
           15631]
 gi|166232611|sp|A7MZ92.1|Y3581_VIBHB RecName: Full=UPF0235 protein VIBHAR_03581
 gi|148868246|gb|EDL67386.1| conserved hypothetical protein [Vibrio harveyi HY01]
 gi|156527430|gb|ABU72516.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116]
 gi|269834874|gb|EEZ88960.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
 gi|408876645|gb|EKM15756.1| hypothetical protein VCHENC01_0161 [Vibrio cholerae HENC-01]
 gi|408887049|gb|EKM25691.1| hypothetical protein VCHENC03_0217 [Vibrio cholerae HENC-03]
 gi|408896411|gb|EKM32510.1| hypothetical protein VCHENC02_1936 [Vibrio cholerae HENC-02]
 gi|444238666|gb|ELU50261.1| hypothetical protein B878_19115 [Vibrio campbellii CAIM 519 = NBRC
           15631]
          Length = 96

 Score = 35.8 bits (81), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    V +  V I  
Sbjct: 12  VVLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGLVHIEK 71

Query: 100 GSKSRDKIVIVE 111
           G   R K + +E
Sbjct: 72  GELGRHKQIRIE 83


>gi|154309869|ref|XP_001554267.1| predicted protein [Botryotinia fuckeliana B05.10]
 gi|347836222|emb|CCD50794.1| similar to yggU family protein [Botryotinia fuckeliana]
          Length = 130

 Score = 35.8 bits (81), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 20/75 (26%), Positives = 41/75 (54%), Gaps = 6/75 (8%)

Query: 39  SVSITIHAKPGSKSC-----SITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           ++ ++ H KP S +      + +D S  ++ V +  PA+D EAN  ++E +S +L   + 
Sbjct: 22  TIHLSCHVKPNSSASRVGVKAFSDTSS-SIEVCVAQPARDNEANKGVVEVLSHILKCPKT 80

Query: 94  QVSIGSGSKSRDKIV 108
            + +  G  S++KI+
Sbjct: 81  DLEVIRGKTSKNKII 95


>gi|411117596|ref|ZP_11390063.1| hypothetical protein OsccyDRAFT_1510 [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410712434|gb|EKQ69936.1| hypothetical protein OsccyDRAFT_1510 [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 81

 Score = 35.8 bits (81), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 19/76 (25%), Positives = 37/76 (48%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + +  KP S+   +  + D     ++ +P  DG+ANA LL  ++      +  V+I 
Sbjct: 5   AIILQVKVKPNSRVSLLEQMPDGNWLARLKSPPVDGKANAELLALVAKHFRCPKSAVTIK 64

Query: 99  SGSKSRDKIVIVEEIT 114
           SG   R K +++   T
Sbjct: 65  SGQSGRTKFLVINPPT 80


>gi|163803734|ref|ZP_02197592.1| hypothetical protein 1103602000580_AND4_13653 [Vibrio sp. AND4]
 gi|159172453|gb|EDP57321.1| hypothetical protein AND4_13653 [Vibrio sp. AND4]
          Length = 96

 Score = 35.8 bits (81), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 21/79 (26%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           V + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    V +  V I  
Sbjct: 12  VVLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGLVHIEK 71

Query: 100 GSKSRDKIVIVEEITPENV 118
           G   R K + ++  +PE +
Sbjct: 72  GELGRHKQIRIK--SPEQI 88


>gi|146293784|ref|YP_001184208.1| hypothetical protein Sputcn32_2690 [Shewanella putrefaciens CN-32]
 gi|386314531|ref|YP_006010696.1| hypothetical protein [Shewanella putrefaciens 200]
 gi|166228432|sp|A4Y8X6.1|Y2690_SHEPC RecName: Full=UPF0235 protein Sputcn32_2690
 gi|145565474|gb|ABP76409.1| protein of unknown function DUF167 [Shewanella putrefaciens CN-32]
 gi|319427156|gb|ADV55230.1| protein of unknown function DUF167 [Shewanella putrefaciens 200]
          Length = 96

 Score = 35.8 bits (81), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  V I  G 
Sbjct: 13  LNVYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSDVYIIKGE 72

Query: 102 KSRDK---IVIVEEITPE 116
             R K   IV  + I PE
Sbjct: 73  LGRHKQIRIVTPKLIPPE 90


>gi|220909519|ref|YP_002484830.1| hypothetical protein Cyan7425_4156 [Cyanothece sp. PCC 7425]
 gi|219866130|gb|ACL46469.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7425]
          Length = 89

 Score = 35.8 bits (81), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 19/66 (28%), Positives = 39/66 (59%)

Query: 61  EAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPENVLN 120
           EA+ V++ AP + G+ANAA++  +++ LG+ +  + + SG  S  K++ +E +    +  
Sbjct: 23  EALKVKVKAPPEKGKANAAVIALLATHLGIDQTCIEVLSGHTSAAKVLSIEGLDQTQIRA 82

Query: 121 SLGKAS 126
           +L   S
Sbjct: 83  ALSDVS 88


>gi|120598144|ref|YP_962718.1| hypothetical protein Sputw3181_1321 [Shewanella sp. W3-18-1]
 gi|166200340|sp|A1RHL9.1|Y1321_SHESW RecName: Full=UPF0235 protein Sputw3181_1321
 gi|120558237|gb|ABM24164.1| protein of unknown function DUF167 [Shewanella sp. W3-18-1]
          Length = 96

 Score = 35.8 bits (81), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  V I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSDVYIIKGE 72

Query: 102 KSRDK---IVIVEEITPE 116
             R K   IV  + I PE
Sbjct: 73  LGRHKQIRIVTPKLIPPE 90


>gi|262191069|ref|ZP_06049276.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93]
 gi|429886754|ref|ZP_19368298.1| hypothetical protein OSU_1905 [Vibrio cholerae PS15]
 gi|262033045|gb|EEY51576.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93]
 gi|429226359|gb|EKY32484.1| hypothetical protein OSU_1905 [Vibrio cholerae PS15]
          Length = 96

 Score = 35.8 bits (81), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIIGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|384082350|ref|ZP_09993525.1| hypothetical protein gproHI_03517 [gamma proteobacterium HIMB30]
          Length = 109

 Score = 35.8 bits (81), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 21/84 (25%), Positives = 43/84 (51%)

Query: 28  LPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSV 87
           LPS +     +   + +  + G+K        D  + + I AP  +G AN A+++ ++  
Sbjct: 8   LPSWLIWRGNNQWLVQLAIRSGAKETRANGELDGRLKLSIKAPPIEGRANQAIMQSIALA 67

Query: 88  LGVKRRQVSIGSGSKSRDKIVIVE 111
            G+   +V + SG+KS+ K +++E
Sbjct: 68  AGIPTSRVQLMSGAKSKHKSMVIE 91


>gi|332187012|ref|ZP_08388753.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17]
 gi|332013022|gb|EGI55086.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17]
          Length = 84

 Score = 35.8 bits (81), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 31/52 (59%)

Query: 55  ITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           +T  +++    ++ AP  DG ANAAL+  ++   GV +R V+I +G  +R K
Sbjct: 10  LTAGTEDHFAARLSAPPVDGAANAALVPLVAKHFGVAKRAVTIVAGETARLK 61


>gi|334341814|ref|YP_004546794.1| hypothetical protein [Desulfotomaculum ruminis DSM 2154]
 gi|334093168|gb|AEG61508.1| protein of unknown function DUF167 [Desulfotomaculum ruminis DSM
           2154]
          Length = 95

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 15/52 (28%), Positives = 30/52 (57%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPE 116
           +++ AP  DG AN A  ++++ +    +  V I SG   R+K++ V ++T +
Sbjct: 36  IRLTAPPVDGAANEACCKFLAEIFATAKSNVEILSGHTGRNKLIRVADVTVQ 87


>gi|334703242|ref|ZP_08519108.1| hypothetical protein AcavA_04327 [Aeromonas caviae Ae398]
          Length = 99

 Score = 35.8 bits (81), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 39/78 (50%)

Query: 34  LVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRR 93
           L+    + + +  +P +    I  +  + + V I AP  DG+AN+ L++Y++    V + 
Sbjct: 5   LLEGDELVLHLMIQPKASRDQIIGLHGDELKVAITAPPVDGQANSHLIKYLAKQCKVAKG 64

Query: 94  QVSIGSGSKSRDKIVIVE 111
           QV I  G   R K V ++
Sbjct: 65  QVRILRGELGRHKTVAID 82


>gi|317493545|ref|ZP_07951966.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae bacterium
           9_2_54FAA]
 gi|316918488|gb|EFV39826.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae bacterium
           9_2_54FAA]
          Length = 99

 Score = 35.8 bits (81), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 36/70 (51%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + ++ +P +    I  +  + + V I AP  DG+ANA L ++++    V + QV I 
Sbjct: 12  ALVLRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLQKFIAKQFRVAKSQVVIE 71

Query: 99  SGSKSRDKIV 108
            G   R K V
Sbjct: 72  KGELGRHKQV 81


>gi|357018761|ref|ZP_09081024.1| hypothetical protein KEK_02120 [Mycobacterium thermoresistibile
           ATCC 19527]
 gi|356481479|gb|EHI14584.1| hypothetical protein KEK_02120 [Mycobacterium thermoresistibile
           ATCC 19527]
          Length = 78

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 18/67 (26%), Positives = 39/67 (58%), Gaps = 1/67 (1%)

Query: 41  SITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           ++ +  KP S+   +   +D+  + V +  PA +G+ANAA+++ ++    V R +V + +
Sbjct: 7   TVAVRVKPNSRKGPLVQTADDGTLTVYVREPAVEGKANAAVIKVLADHFDVPRSRVELVA 66

Query: 100 GSKSRDK 106
           G+ +R K
Sbjct: 67  GASARIK 73


>gi|149192342|ref|ZP_01870547.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1]
 gi|148833820|gb|EDL50852.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1]
          Length = 95

 Score = 35.8 bits (81), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 31/65 (47%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + I+ +P +    +     E   V I AP  DG+ANA L +Y++    V + QV I  G 
Sbjct: 13  LRIYVQPKASRDKLVGEHGEEFKVAITAPPVDGKANAHLSKYLAKQFKVAKGQVLIEKGE 72

Query: 102 KSRDK 106
             R K
Sbjct: 73  LGRHK 77


>gi|365836681|ref|ZP_09378069.1| TIGR00251 family protein [Hafnia alvei ATCC 51873]
 gi|364563579|gb|EHM41383.1| TIGR00251 family protein [Hafnia alvei ATCC 51873]
          Length = 102

 Score = 35.8 bits (81), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 36/70 (51%)

Query: 39  SVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIG 98
           ++ + ++ +P +    I  +  + + V I AP  DG+ANA L ++++    V + QV I 
Sbjct: 15  ALVLRLYIQPKASRDQIVGLHGDELKVAITAPPVDGQANAHLQKFIAKQFRVAKSQVVIE 74

Query: 99  SGSKSRDKIV 108
            G   R K V
Sbjct: 75  KGELGRHKQV 84


>gi|378754790|gb|EHY64819.1| hypothetical protein NERG_02222 [Nematocida sp. 1 ERTm2]
          Length = 93

 Score = 35.8 bits (81), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 39/66 (59%), Gaps = 2/66 (3%)

Query: 53  CSITDV--SDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIV 110
           C I  V  ++  + + +++ A + +AN  ++E +  +  V + Q+SI +G KS+ K V+V
Sbjct: 18  CKIFSVEWTNNTLYMSLNSMAVENKANKEVIESVHEIFKVPKAQISIIAGHKSKIKAVLV 77

Query: 111 EEITPE 116
           E IT +
Sbjct: 78  ENITED 83


>gi|269965742|ref|ZP_06179839.1| conserved hypothetical protein [Vibrio alginolyticus 40B]
 gi|269829610|gb|EEZ83847.1| conserved hypothetical protein [Vibrio alginolyticus 40B]
          Length = 96

 Score = 35.4 bits (80), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 19/72 (26%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    + +  V I  
Sbjct: 12  IILKLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGLVHIEK 71

Query: 100 GSKSRDKIVIVE 111
           G   R K V +E
Sbjct: 72  GELGRHKQVRIE 83


>gi|375266706|ref|YP_005024149.1| hypothetical protein VEJY3_13470 [Vibrio sp. EJY3]
 gi|369842026|gb|AEX23170.1| hypothetical protein VEJY3_13470 [Vibrio sp. EJY3]
          Length = 96

 Score = 35.4 bits (80), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 19/72 (26%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    V +  V I  
Sbjct: 12  IVLRLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGLVHIEK 71

Query: 100 GSKSRDKIVIVE 111
           G   R K + +E
Sbjct: 72  GELGRHKQIRIE 83


>gi|338733474|ref|YP_004671947.1| hypothetical protein SNE_A15790 [Simkania negevensis Z]
 gi|336482857|emb|CCB89456.1| UPF0235 protein CCA_00247 [Simkania negevensis Z]
          Length = 85

 Score = 35.4 bits (80), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 20/71 (28%), Positives = 35/71 (49%)

Query: 41  SITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           S  +   P +    I     E + V++ A  + G+AN AL+  +S  L + + ++ I  G
Sbjct: 3   SFVVKVTPKAAKNEIIGWEGELLKVRLTAAPEKGKANEALITLLSKKLKIAKSRIHILKG 62

Query: 101 SKSRDKIVIVE 111
             SR K+V +E
Sbjct: 63  ETSRQKLVEIE 73


>gi|146337438|ref|YP_001202486.1| hypothetical protein BRADO0278 [Bradyrhizobium sp. ORS 278]
 gi|146190244|emb|CAL74236.1| conserved hypothetical protein [Bradyrhizobium sp. ORS 278]
          Length = 109

 Score = 35.4 bits (80), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 30/47 (63%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V++ A A  GEAN A+ E ++  +GV ++ V I SG+ SR K V ++
Sbjct: 41  VRVRAIADGGEANRAVTELLAKAIGVTKKAVRITSGTTSRLKQVAID 87


>gi|336310590|ref|ZP_08565562.1| integral membrane protein YggT, involved in response to
           extracytoplasmic stress (osmotic shock) [Shewanella sp.
           HN-41]
 gi|335866320|gb|EGM71311.1| integral membrane protein YggT, involved in response to
           extracytoplasmic stress (osmotic shock) [Shewanella sp.
           HN-41]
          Length = 96

 Score = 35.4 bits (80), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  V I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSDVHILKGE 72

Query: 102 KSRDK---IVIVEEITPE 116
             R K   I+  + I PE
Sbjct: 73  LGRHKQVRIIAPKLIPPE 90


>gi|419835405|ref|ZP_14358850.1| hypothetical protein VCHC46B1_0562 [Vibrio cholerae HC-46B1]
 gi|421342026|ref|ZP_15792433.1| hypothetical protein VCHC43B1_0567 [Vibrio cholerae HC-43B1]
 gi|423733766|ref|ZP_17706982.1| hypothetical protein VCHC41B1_0540 [Vibrio cholerae HC-41B1]
 gi|424008052|ref|ZP_17751002.1| hypothetical protein VCHC44C1_0535 [Vibrio cholerae HC-44C1]
 gi|395945529|gb|EJH56194.1| hypothetical protein VCHC43B1_0567 [Vibrio cholerae HC-43B1]
 gi|408631923|gb|EKL04439.1| hypothetical protein VCHC41B1_0540 [Vibrio cholerae HC-41B1]
 gi|408858818|gb|EKL98488.1| hypothetical protein VCHC46B1_0562 [Vibrio cholerae HC-46B1]
 gi|408866339|gb|EKM05722.1| hypothetical protein VCHC44C1_0535 [Vibrio cholerae HC-44C1]
          Length = 96

 Score = 35.4 bits (80), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V I  G 
Sbjct: 13  LRLYIQPKASLDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVIEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|15640485|ref|NP_230112.1| hypothetical protein VC0458 [Vibrio cholerae O1 biovar El Tor str.
           N16961]
 gi|153823175|ref|ZP_01975842.1| conserved hypothetical protein [Vibrio cholerae B33]
 gi|229509068|ref|ZP_04398556.1| hypothetical protein VCE_000471 [Vibrio cholerae B33]
 gi|229519736|ref|ZP_04409179.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9]
 gi|229606248|ref|YP_002876896.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236]
 gi|254850689|ref|ZP_05240039.1| conserved hypothetical protein [Vibrio cholerae MO10]
 gi|255744295|ref|ZP_05418248.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101]
 gi|262147280|ref|ZP_06028079.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1]
 gi|360037099|ref|YP_004938862.1| hypothetical protein Vch1786_I2779 [Vibrio cholerae O1 str.
           2010EL-1786]
 gi|379740331|ref|YP_005332300.1| hypothetical protein O3Y_02125 [Vibrio cholerae IEC224]
 gi|417812443|ref|ZP_12459103.1| hypothetical protein VCHC49A2_1432 [Vibrio cholerae HC-49A2]
 gi|417815305|ref|ZP_12461939.1| hypothetical protein VCHCUF01_0533 [Vibrio cholerae HCUF01]
 gi|418331166|ref|ZP_12942116.1| hypothetical protein VCHC06A1_0498 [Vibrio cholerae HC-06A1]
 gi|418336323|ref|ZP_12945222.1| hypothetical protein VCHC23A1_0655 [Vibrio cholerae HC-23A1]
 gi|418342703|ref|ZP_12949503.1| hypothetical protein VCHC28A1_0498 [Vibrio cholerae HC-28A1]
 gi|418347867|ref|ZP_12952603.1| hypothetical protein VCHC43A1_0506 [Vibrio cholerae HC-43A1]
 gi|418354310|ref|ZP_12957034.1| hypothetical protein VCHC61A1_1305 [Vibrio cholerae HC-61A1]
 gi|419824929|ref|ZP_14348436.1| hypothetical protein VCCP10336_0523 [Vibrio cholerae CP1033(6)]
 gi|421315748|ref|ZP_15766320.1| hypothetical protein VCCP10325_0510 [Vibrio cholerae CP1032(5)]
 gi|421319246|ref|ZP_15769805.1| hypothetical protein VCCP103811_0499 [Vibrio cholerae CP1038(11)]
 gi|421323279|ref|ZP_15773808.1| hypothetical protein VCCP104114_0473 [Vibrio cholerae CP1041(14)]
 gi|421327685|ref|ZP_15778201.1| hypothetical protein VCCP104215_1439 [Vibrio cholerae CP1042(15)]
 gi|421330686|ref|ZP_15781168.1| hypothetical protein VCCP104619_0534 [Vibrio cholerae CP1046(19)]
 gi|421334284|ref|ZP_15784754.1| hypothetical protein VCCP104821_0431 [Vibrio cholerae CP1048(21)]
 gi|421338181|ref|ZP_15788620.1| hypothetical protein VCHC20A2_0516 [Vibrio cholerae HC-20A2]
 gi|421345745|ref|ZP_15796130.1| hypothetical protein VCHC46A1_0527 [Vibrio cholerae HC-46A1]
 gi|422890498|ref|ZP_16932923.1| hypothetical protein VCHC40A1_0477 [Vibrio cholerae HC-40A1]
 gi|422901298|ref|ZP_16936676.1| hypothetical protein VCHC48A1_0481 [Vibrio cholerae HC-48A1]
 gi|422905481|ref|ZP_16940339.1| hypothetical protein VCHC70A1_0498 [Vibrio cholerae HC-70A1]
 gi|422912202|ref|ZP_16946732.1| hypothetical protein VCHFU02_0496 [Vibrio cholerae HFU-02]
 gi|422924681|ref|ZP_16957719.1| hypothetical protein VCHC38A1_0501 [Vibrio cholerae HC-38A1]
 gi|423143727|ref|ZP_17131345.1| hypothetical protein VCHC19A1_0496 [Vibrio cholerae HC-19A1]
 gi|423148711|ref|ZP_17136072.1| hypothetical protein VCHC21A1_0503 [Vibrio cholerae HC-21A1]
 gi|423152502|ref|ZP_17139704.1| hypothetical protein VCHC22A1_0485 [Vibrio cholerae HC-22A1]
 gi|423155286|ref|ZP_17142425.1| hypothetical protein VCHC32A1_0499 [Vibrio cholerae HC-32A1]
 gi|423159145|ref|ZP_17146119.1| hypothetical protein VCHC33A2_0485 [Vibrio cholerae HC-33A2]
 gi|423163823|ref|ZP_17150619.1| hypothetical protein VCHC48B2_0476 [Vibrio cholerae HC-48B2]
 gi|423729844|ref|ZP_17703165.1| hypothetical protein VCHC17A1_0514 [Vibrio cholerae HC-17A1]
 gi|423747070|ref|ZP_17711352.1| hypothetical protein VCHC50A2_0521 [Vibrio cholerae HC-50A2]
 gi|423891674|ref|ZP_17725366.1| hypothetical protein VCHC62A1_0496 [Vibrio cholerae HC-62A1]
 gi|423926451|ref|ZP_17729983.1| hypothetical protein VCHC77A1_0502 [Vibrio cholerae HC-77A1]
 gi|424001006|ref|ZP_17744099.1| hypothetical protein VCHC17A2_0504 [Vibrio cholerae HC-17A2]
 gi|424005166|ref|ZP_17748154.1| hypothetical protein VCHC37A1_0630 [Vibrio cholerae HC-37A1]
 gi|424022959|ref|ZP_17762626.1| hypothetical protein VCHC62B1_0497 [Vibrio cholerae HC-62B1]
 gi|424025977|ref|ZP_17765597.1| hypothetical protein VCHC69A1_0496 [Vibrio cholerae HC-69A1]
 gi|424585359|ref|ZP_18024955.1| hypothetical protein VCCP10303_0505 [Vibrio cholerae CP1030(3)]
 gi|424593979|ref|ZP_18033322.1| hypothetical protein VCCP1040_0502 [Vibrio cholerae CP1040(13)]
 gi|424597915|ref|ZP_18037117.1| hypothetical protein VCCP104417_0506 [Vibrio Cholerae CP1044(17)]
 gi|424600679|ref|ZP_18039838.1| hypothetical protein VCCP1047_0494 [Vibrio cholerae CP1047(20)]
 gi|424605596|ref|ZP_18044563.1| hypothetical protein VCCP1050_0509 [Vibrio cholerae CP1050(23)]
 gi|424609311|ref|ZP_18048174.1| hypothetical protein VCHC39A1_0495 [Vibrio cholerae HC-39A1]
 gi|424612231|ref|ZP_18051042.1| hypothetical protein VCHC41A1_0512 [Vibrio cholerae HC-41A1]
 gi|424616108|ref|ZP_18054803.1| hypothetical protein VCHC42A1_0500 [Vibrio cholerae HC-42A1]
 gi|424620871|ref|ZP_18059402.1| hypothetical protein VCHC47A1_0517 [Vibrio cholerae HC-47A1]
 gi|424643688|ref|ZP_18081446.1| hypothetical protein VCHC56A2_0509 [Vibrio cholerae HC-56A2]
 gi|424651610|ref|ZP_18089138.1| hypothetical protein VCHC57A2_0503 [Vibrio cholerae HC-57A2]
 gi|424655558|ref|ZP_18092864.1| hypothetical protein VCHC81A2_0506 [Vibrio cholerae HC-81A2]
 gi|440708664|ref|ZP_20889325.1| hypothetical protein VC4260B_00700 [Vibrio cholerae 4260B]
 gi|443502508|ref|ZP_21069500.1| hypothetical protein VCHC64A1_00491 [Vibrio cholerae HC-64A1]
 gi|443506416|ref|ZP_21073213.1| hypothetical protein VCHC65A1_00495 [Vibrio cholerae HC-65A1]
 gi|443510250|ref|ZP_21076922.1| hypothetical protein VCHC67A1_00494 [Vibrio cholerae HC-67A1]
 gi|443514087|ref|ZP_21080631.1| hypothetical protein VCHC68A1_00490 [Vibrio cholerae HC-68A1]
 gi|443517900|ref|ZP_21084322.1| hypothetical protein VCHC71A1_00490 [Vibrio cholerae HC-71A1]
 gi|443522482|ref|ZP_21088732.1| hypothetical protein VCHC72A2_00493 [Vibrio cholerae HC-72A2]
 gi|443530386|ref|ZP_21096402.1| hypothetical protein VCHC7A1_01504 [Vibrio cholerae HC-7A1]
 gi|443534158|ref|ZP_21100077.1| hypothetical protein VCHC80A1_00463 [Vibrio cholerae HC-80A1]
 gi|443537740|ref|ZP_21103597.1| hypothetical protein VCHC81A1_01284 [Vibrio cholerae HC-81A1]
 gi|449054312|ref|ZP_21732980.1| Hypothetical protein B839_03460 [Vibrio cholerae O1 str. Inaba
           G4222]
 gi|29839647|sp|Q9KUQ7.1|Y458_VIBCH RecName: Full=UPF0235 protein VC_0458
 gi|9654883|gb|AAF93631.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor
           str. N16961]
 gi|126519301|gb|EAZ76524.1| conserved hypothetical protein [Vibrio cholerae B33]
 gi|229344425|gb|EEO09400.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9]
 gi|229353993|gb|EEO18927.1| hypothetical protein VCE_000471 [Vibrio cholerae B33]
 gi|229368903|gb|ACQ59326.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236]
 gi|254846394|gb|EET24808.1| conserved hypothetical protein [Vibrio cholerae MO10]
 gi|255738235|gb|EET93627.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101]
 gi|262031274|gb|EEY49889.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1]
 gi|340043291|gb|EGR04250.1| hypothetical protein VCHCUF01_0533 [Vibrio cholerae HCUF01]
 gi|340043823|gb|EGR04780.1| hypothetical protein VCHC49A2_1432 [Vibrio cholerae HC-49A2]
 gi|341625813|gb|EGS51240.1| hypothetical protein VCHC70A1_0498 [Vibrio cholerae HC-70A1]
 gi|341627186|gb|EGS52512.1| hypothetical protein VCHC48A1_0481 [Vibrio cholerae HC-48A1]
 gi|341627511|gb|EGS52814.1| hypothetical protein VCHC40A1_0477 [Vibrio cholerae HC-40A1]
 gi|341641138|gb|EGS65697.1| hypothetical protein VCHFU02_0496 [Vibrio cholerae HFU-02]
 gi|341648634|gb|EGS72678.1| hypothetical protein VCHC38A1_0501 [Vibrio cholerae HC-38A1]
 gi|356421656|gb|EHH75150.1| hypothetical protein VCHC06A1_0498 [Vibrio cholerae HC-06A1]
 gi|356422010|gb|EHH75497.1| hypothetical protein VCHC21A1_0503 [Vibrio cholerae HC-21A1]
 gi|356426926|gb|EHH80209.1| hypothetical protein VCHC19A1_0496 [Vibrio cholerae HC-19A1]
 gi|356433104|gb|EHH86297.1| hypothetical protein VCHC23A1_0655 [Vibrio cholerae HC-23A1]
 gi|356434758|gb|EHH87928.1| hypothetical protein VCHC22A1_0485 [Vibrio cholerae HC-22A1]
 gi|356438061|gb|EHH91117.1| hypothetical protein VCHC28A1_0498 [Vibrio cholerae HC-28A1]
 gi|356443244|gb|EHH96067.1| hypothetical protein VCHC32A1_0499 [Vibrio cholerae HC-32A1]
 gi|356447978|gb|EHI00763.1| hypothetical protein VCHC43A1_0506 [Vibrio cholerae HC-43A1]
 gi|356450404|gb|EHI03126.1| hypothetical protein VCHC33A2_0485 [Vibrio cholerae HC-33A2]
 gi|356454086|gb|EHI06741.1| hypothetical protein VCHC61A1_1305 [Vibrio cholerae HC-61A1]
 gi|356456470|gb|EHI09069.1| hypothetical protein VCHC48B2_0476 [Vibrio cholerae HC-48B2]
 gi|356648253|gb|AET28308.1| hypothetical protein Vch1786_I2779 [Vibrio cholerae O1 str.
           2010EL-1786]
 gi|378793841|gb|AFC57312.1| hypothetical protein O3Y_02125 [Vibrio cholerae IEC224]
 gi|395922489|gb|EJH33305.1| hypothetical protein VCCP10325_0510 [Vibrio cholerae CP1032(5)]
 gi|395923124|gb|EJH33936.1| hypothetical protein VCCP104114_0473 [Vibrio cholerae CP1041(14)]
 gi|395925571|gb|EJH36368.1| hypothetical protein VCCP103811_0499 [Vibrio cholerae CP1038(11)]
 gi|395931419|gb|EJH42164.1| hypothetical protein VCCP104215_1439 [Vibrio cholerae CP1042(15)]
 gi|395934539|gb|EJH45277.1| hypothetical protein VCCP104619_0534 [Vibrio cholerae CP1046(19)]
 gi|395937814|gb|EJH48525.1| hypothetical protein VCCP104821_0431 [Vibrio cholerae CP1048(21)]
 gi|395946544|gb|EJH57207.1| hypothetical protein VCHC20A2_0516 [Vibrio cholerae HC-20A2]
 gi|395948414|gb|EJH59064.1| hypothetical protein VCHC46A1_0527 [Vibrio cholerae HC-46A1]
 gi|395964018|gb|EJH74261.1| hypothetical protein VCHC57A2_0503 [Vibrio cholerae HC-57A2]
 gi|395964105|gb|EJH74347.1| hypothetical protein VCHC56A2_0509 [Vibrio cholerae HC-56A2]
 gi|395967073|gb|EJH77176.1| hypothetical protein VCHC42A1_0500 [Vibrio cholerae HC-42A1]
 gi|395975687|gb|EJH85168.1| hypothetical protein VCHC47A1_0517 [Vibrio cholerae HC-47A1]
 gi|395977759|gb|EJH87158.1| hypothetical protein VCCP10303_0505 [Vibrio cholerae CP1030(3)]
 gi|395979271|gb|EJH88630.1| hypothetical protein VCCP1047_0494 [Vibrio cholerae CP1047(20)]
 gi|408010207|gb|EKG48079.1| hypothetical protein VCHC39A1_0495 [Vibrio cholerae HC-39A1]
 gi|408016967|gb|EKG54491.1| hypothetical protein VCHC41A1_0512 [Vibrio cholerae HC-41A1]
 gi|408037621|gb|EKG74009.1| hypothetical protein VCCP1040_0502 [Vibrio cholerae CP1040(13)]
 gi|408044972|gb|EKG80848.1| hypothetical protein VCCP104417_0506 [Vibrio Cholerae CP1044(17)]
 gi|408046805|gb|EKG82470.1| hypothetical protein VCCP1050_0509 [Vibrio cholerae CP1050(23)]
 gi|408057530|gb|EKG92375.1| hypothetical protein VCHC81A2_0506 [Vibrio cholerae HC-81A2]
 gi|408611953|gb|EKK85309.1| hypothetical protein VCCP10336_0523 [Vibrio cholerae CP1033(6)]
 gi|408627743|gb|EKL00546.1| hypothetical protein VCHC17A1_0514 [Vibrio cholerae HC-17A1]
 gi|408642288|gb|EKL14038.1| hypothetical protein VCHC50A2_0521 [Vibrio cholerae HC-50A2]
 gi|408658783|gb|EKL29843.1| hypothetical protein VCHC77A1_0502 [Vibrio cholerae HC-77A1]
 gi|408660102|gb|EKL31132.1| hypothetical protein VCHC62A1_0496 [Vibrio cholerae HC-62A1]
 gi|408849210|gb|EKL89238.1| hypothetical protein VCHC37A1_0630 [Vibrio cholerae HC-37A1]
 gi|408849742|gb|EKL89751.1| hypothetical protein VCHC17A2_0504 [Vibrio cholerae HC-17A2]
 gi|408874518|gb|EKM13688.1| hypothetical protein VCHC62B1_0497 [Vibrio cholerae HC-62B1]
 gi|408881450|gb|EKM20335.1| hypothetical protein VCHC69A1_0496 [Vibrio cholerae HC-69A1]
 gi|439975760|gb|ELP51867.1| hypothetical protein VC4260B_00700 [Vibrio cholerae 4260B]
 gi|443433120|gb|ELS75638.1| hypothetical protein VCHC64A1_00491 [Vibrio cholerae HC-64A1]
 gi|443436951|gb|ELS83061.1| hypothetical protein VCHC65A1_00495 [Vibrio cholerae HC-65A1]
 gi|443440824|gb|ELS90505.1| hypothetical protein VCHC67A1_00494 [Vibrio cholerae HC-67A1]
 gi|443444595|gb|ELS97864.1| hypothetical protein VCHC68A1_00490 [Vibrio cholerae HC-68A1]
 gi|443448433|gb|ELT05063.1| hypothetical protein VCHC71A1_00490 [Vibrio cholerae HC-71A1]
 gi|443451551|gb|ELT11805.1| hypothetical protein VCHC72A2_00493 [Vibrio cholerae HC-72A2]
 gi|443458587|gb|ELT25982.1| hypothetical protein VCHC7A1_01504 [Vibrio cholerae HC-7A1]
 gi|443462659|gb|ELT33691.1| hypothetical protein VCHC80A1_00463 [Vibrio cholerae HC-80A1]
 gi|443466565|gb|ELT41222.1| hypothetical protein VCHC81A1_01284 [Vibrio cholerae HC-81A1]
 gi|448266105|gb|EMB03335.1| Hypothetical protein B839_03460 [Vibrio cholerae O1 str. Inaba
           G4222]
          Length = 96

 Score = 35.4 bits (80), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V +  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVVEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|24374867|ref|NP_718910.1| protein of unknown function DUF167 YggU [Shewanella oneidensis
           MR-1]
 gi|29839709|sp|Q8EBY9.1|Y3356_SHEON RecName: Full=UPF0235 protein SO_3356
 gi|24349562|gb|AAN56354.1| protein of unknown function DUF167 YggU [Shewanella oneidensis
           MR-1]
          Length = 96

 Score = 35.4 bits (80), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 33/67 (49%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA L +Y++    V +  V I  G 
Sbjct: 13  LNLYIQPKASRDQIVGLHGDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSDVHILKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|260432898|ref|ZP_05786869.1| conserved hypothetical protein [Silicibacter lacuscaerulensis
           ITI-1157]
 gi|260416726|gb|EEX09985.1| conserved hypothetical protein [Silicibacter lacuscaerulensis
           ITI-1157]
          Length = 91

 Score = 35.4 bits (80), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 4/87 (4%)

Query: 26  ENLPSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEA-VGVQIDAPAKDGEANAALLEYM 84
            NLP    L  P    I +   P +    I   +DE+ V + + APA+ G+AN A+   +
Sbjct: 7   RNLPDLTHLAQPGQ-HIQVRVTPKAARDRIQ--ADESSVHIAVTAPAEGGKANLAVARIL 63

Query: 85  SSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           +  +G+    + +  G  +R+K+ + E
Sbjct: 64  AKAMGIAPSALILKQGQTARNKLFVYE 90


>gi|229525149|ref|ZP_04414554.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis
           VL426]
 gi|229338730|gb|EEO03747.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis
           VL426]
          Length = 96

 Score = 35.4 bits (80), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V +  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVVEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|192288849|ref|YP_001989454.1| hypothetical protein Rpal_0418 [Rhodopseudomonas palustris TIE-1]
 gi|226706140|sp|B3QA92.1|Y418_RHOPT RecName: Full=UPF0235 protein Rpal_0418
 gi|192282598|gb|ACE98978.1| protein of unknown function DUF167 [Rhodopseudomonas palustris
           TIE-1]
          Length = 108

 Score = 35.4 bits (80), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 31/49 (63%)

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V V++ A A  GEAN A+ E ++  +GV +R V + SG+ SR K + ++
Sbjct: 40  VKVRVRAIADGGEANRAVTELLAKAVGVPKRNVRLLSGATSRQKQIAID 88


>gi|406991267|gb|EKE10806.1| hypothetical protein ACD_15C00198G0001, partial [uncultured
           bacterium]
          Length = 63

 Score = 35.4 bits (80), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 29/59 (49%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           I +   P S    +  +SD    V+I AP  DGEAN  L++ +S    V +  ++I  G
Sbjct: 3   IYVRVSPRSSKNEVIKISDGEYKVKITAPPVDGEANKMLIKVLSDHFSVSKSSINIVGG 61


>gi|340027583|ref|ZP_08663646.1| hypothetical protein PaTRP_02636 [Paracoccus sp. TRP]
          Length = 98

 Score = 35.4 bits (80), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 3/68 (4%)

Query: 42  ITIHAKP-GSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSG 100
           I +   P  S++  I D   E + V +    +DG+ANAA+++ ++  LGV + ++ +  G
Sbjct: 30  IAVRVTPRASRNAVILD--GETIRVTVTTVPEDGKANAAVVKLLAKALGVAKSRLVLVRG 87

Query: 101 SKSRDKIV 108
           +  RDKI 
Sbjct: 88  ATGRDKIF 95


>gi|121590704|ref|ZP_01678036.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
 gi|153819156|ref|ZP_01971823.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
 gi|153826562|ref|ZP_01979229.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
 gi|227080668|ref|YP_002809219.1| hypothetical protein VCM66_0443 [Vibrio cholerae M66-2]
 gi|229507096|ref|ZP_04396602.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286]
 gi|298501011|ref|ZP_07010812.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
 gi|421350320|ref|ZP_15800686.1| hypothetical protein VCHE25_1531 [Vibrio cholerae HE-25]
 gi|254803911|sp|C3LRX5.1|Y443_VIBCM RecName: Full=UPF0235 protein VCM66_0443
 gi|121547435|gb|EAX57544.1| conserved hypothetical protein [Vibrio cholerae 2740-80]
 gi|126510301|gb|EAZ72895.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457]
 gi|149739654|gb|EDM53868.1| conserved hypothetical protein [Vibrio cholerae MZO-2]
 gi|227008556|gb|ACP04768.1| conserved hypothetical protein [Vibrio cholerae M66-2]
 gi|229355841|gb|EEO20761.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286]
 gi|297540259|gb|EFH76319.1| conserved hypothetical protein [Vibrio cholerae MAK 757]
 gi|395954442|gb|EJH65052.1| hypothetical protein VCHE25_1531 [Vibrio cholerae HE-25]
          Length = 96

 Score = 35.4 bits (80), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 35/67 (52%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L +Y++ +  V +  V +  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGSVVVEKGE 72

Query: 102 KSRDKIV 108
             R K V
Sbjct: 73  LGRHKQV 79


>gi|39933491|ref|NP_945767.1| hypothetical protein RPA0414 [Rhodopseudomonas palustris CGA009]
 gi|39647337|emb|CAE25858.1| DUF167 [Rhodopseudomonas palustris CGA009]
          Length = 112

 Score = 35.4 bits (80), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 19/49 (38%), Positives = 31/49 (63%)

Query: 63  VGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V V++ A A  GEAN A+ E ++  +GV +R V + SG+ SR K + ++
Sbjct: 44  VKVRVRAIADGGEANRAVTELLAKAVGVPKRNVRLLSGATSRQKQIAID 92


>gi|365884066|ref|ZP_09423146.1| conserved hypothetical protein [Bradyrhizobium sp. ORS 375]
 gi|365287431|emb|CCD95677.1| conserved hypothetical protein [Bradyrhizobium sp. ORS 375]
          Length = 109

 Score = 35.4 bits (80), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 30/47 (63%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVE 111
           V++ A A  GEAN A+ E ++  +GV ++ V I SG+ SR K V ++
Sbjct: 40  VRVRAIADGGEANRAVTELLAKAIGVTKKAVRITSGTTSRLKQVAID 86


>gi|392401666|ref|YP_006438278.1| UPF0235 protein yggU [Turneriella parva DSM 21527]
 gi|390609620|gb|AFM10772.1| UPF0235 protein yggU [Turneriella parva DSM 21527]
          Length = 82

 Score = 35.0 bits (79), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 36/67 (53%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + +  KP SK   +   +D  + ++I  P  DG+AN A++E ++ +  + +  V I SG 
Sbjct: 3   VRVQVKPRSKKPGVELGADGTLILKIHEPPVDGKANQAVIERIAELYDIAKSAVHIVSGE 62

Query: 102 KSRDKIV 108
            SR K V
Sbjct: 63  TSRMKRV 69


>gi|262393226|ref|YP_003285080.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25]
 gi|451972144|ref|ZP_21925356.1| hypothetical protein C408_1968 [Vibrio alginolyticus E0666]
 gi|262336820|gb|ACY50615.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25]
 gi|451931982|gb|EMD79664.1| hypothetical protein C408_1968 [Vibrio alginolyticus E0666]
          Length = 96

 Score = 35.0 bits (79), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 18/72 (25%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    + +  V I  
Sbjct: 12  IILKLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGLVHIEK 71

Query: 100 GSKSRDKIVIVE 111
           G   R K + +E
Sbjct: 72  GELGRHKQIRIE 83


>gi|221640262|ref|YP_002526524.1| hypothetical protein RSKD131_2163 [Rhodobacter sphaeroides KD131]
 gi|429206069|ref|ZP_19197337.1| hypothetical protein D516_2489 [Rhodobacter sp. AKP1]
 gi|221161043|gb|ACM02023.1| Hypothetical Protein RSKD131_2163 [Rhodobacter sphaeroides KD131]
 gi|428190790|gb|EKX59334.1| hypothetical protein D516_2489 [Rhodobacter sp. AKP1]
          Length = 85

 Score = 35.0 bits (79), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 29  PSCIRLVPPSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVL 88
           P+   L  P  V   +   P +    + D+ +  V V +    +DG+AN A+ E ++  L
Sbjct: 5   PTLADLARPG-VPFAVRVTPRAARAKV-DLQEGVVRVHVTCVPEDGKANRAVTEALAKAL 62

Query: 89  GVKRRQVSIGSGSKSRDK 106
           GV + ++++  G+ SRDK
Sbjct: 63  GVAKSRLTLVRGATSRDK 80


>gi|336125242|ref|YP_004567290.1| hypothetical protein VAA_01660 [Vibrio anguillarum 775]
 gi|335342965|gb|AEH34248.1| hypothetical protein VAA_01660 [Vibrio anguillarum 775]
          Length = 101

 Score = 35.0 bits (79), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 20/79 (25%), Positives = 40/79 (50%), Gaps = 2/79 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + + I AP  DG+ANA L ++++    V +  + I  G 
Sbjct: 19  LKLYIQPKASRDSIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGLIEIEKGE 78

Query: 102 KSRDKIVIVE--EITPENV 118
             R K V +   ++ P+ +
Sbjct: 79  LGRHKQVRIHSPQVIPQPI 97


>gi|258620721|ref|ZP_05715756.1| conserved hypothetical protein [Vibrio mimicus VM573]
 gi|258625606|ref|ZP_05720488.1| conserved hypothetical protein [Vibrio mimicus VM603]
 gi|424810421|ref|ZP_18235773.1| hypothetical protein SX4_1010 [Vibrio mimicus SX-4]
 gi|258582108|gb|EEW06975.1| conserved hypothetical protein [Vibrio mimicus VM603]
 gi|258586919|gb|EEW11633.1| conserved hypothetical protein [Vibrio mimicus VM573]
 gi|342322352|gb|EGU18143.1| hypothetical protein SX4_1010 [Vibrio mimicus SX-4]
          Length = 97

 Score = 35.0 bits (79), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +   SI  +  E + V I AP  DG+ANA L ++++    V +  V I  G 
Sbjct: 13  LRLYIQPKASRDSIVGLHGEELKVAITAPPIDGKANAHLSKFLAKQCKVAKGSVVIEKGE 72

Query: 102 KSRDKIVIVE---EITPE 116
             R K V ++   +I PE
Sbjct: 73  LGRHKQVRIQQPSQIPPE 90


>gi|238918244|ref|YP_002931758.1| conserved hypothetical protein TIGR00251 [Edwardsiella ictaluri
           93-146]
 gi|259646912|sp|C5BCR7.1|Y281_EDWI9 RecName: Full=UPF0235 protein NT01EI_0281
 gi|238867812|gb|ACR67523.1| conserved hypothetical protein TIGR00251 [Edwardsiella ictaluri
           93-146]
          Length = 96

 Score = 35.0 bits (79), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 2/84 (2%)

Query: 42  ITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGS 101
           + ++ +P +    I  +  + + V I AP  DG+ANA LL++++    V + ++++  G 
Sbjct: 13  LRLYIQPKASRDLIIGLHGDELKVAITAPPVDGQANAHLLKFIAKQFRVAKSRITLEKGE 72

Query: 102 KSRDKIVIVE--EITPENVLNSLG 123
             R K + +   +  P+ V  +LG
Sbjct: 73  LGRHKQLRISQPQQIPDAVAAALG 96


>gi|417318887|ref|ZP_12105445.1| hypothetical protein VP10329_15255 [Vibrio parahaemolyticus 10329]
 gi|328474077|gb|EGF44882.1| hypothetical protein VP10329_15255 [Vibrio parahaemolyticus 10329]
          Length = 96

 Score = 35.0 bits (79), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 18/72 (25%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    + +  V I  
Sbjct: 12  IVLKLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGLVHIEK 71

Query: 100 GSKSRDKIVIVE 111
           G   R K + +E
Sbjct: 72  GELGRHKQIRIE 83


>gi|338707446|ref|YP_004661647.1| hypothetical protein Zymop_0454 [Zymomonas mobilis subsp. pomaceae
           ATCC 29192]
 gi|336294250|gb|AEI37357.1| protein of unknown function DUF167 [Zymomonas mobilis subsp.
           pomaceae ATCC 29192]
          Length = 114

 Score = 35.0 bits (79), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 18/52 (34%), Positives = 29/52 (55%)

Query: 65  VQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDKIVIVEEITPE 116
           +++ AP  +G +N  L+ Y+S    V +  VSI SG  S+ KI+ +    PE
Sbjct: 50  IRVAAPPVEGASNKNLMAYLSKQFSVSKTAVSIDSGEHSKIKIIHITGNAPE 101


>gi|73748976|ref|YP_308215.1| hypothetical protein cbdb_A1230 [Dehalococcoides sp. CBDB1]
 gi|147669743|ref|YP_001214561.1| hypothetical protein DehaBAV1_1103 [Dehalococcoides sp. BAV1]
 gi|289432972|ref|YP_003462845.1| hypothetical protein DehalGT_1029 [Dehalococcoides sp. GT]
 gi|452203960|ref|YP_007484093.1| hypothetical protein dcmb_1151 [Dehalococcoides mccartyi DCMB5]
 gi|452205459|ref|YP_007485588.1| hypothetical protein btf_1170 [Dehalococcoides mccartyi BTF08]
 gi|123619917|sp|Q3ZYH5.1|Y1230_DEHSC RecName: Full=UPF0235 protein cbdbA1230
 gi|189038739|sp|A5FQ39.1|Y1103_DEHSB RecName: Full=UPF0235 protein DehaBAV1_1103
 gi|73660692|emb|CAI83299.1| conserved hypothetical protein [Dehalococcoides sp. CBDB1]
 gi|146270691|gb|ABQ17683.1| protein of unknown function DUF167 [Dehalococcoides sp. BAV1]
 gi|288946692|gb|ADC74389.1| protein of unknown function DUF167 [Dehalococcoides sp. GT]
 gi|452111019|gb|AGG06751.1| hypothetical protein dcmb_1151 [Dehalococcoides mccartyi DCMB5]
 gi|452112515|gb|AGG08246.1| hypothetical protein btf_1170 [Dehalococcoides mccartyi BTF08]
          Length = 97

 Score = 35.0 bits (79), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 17/76 (22%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 35  VPPSSVSITIHAK--PGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKR 92
           +PP      ++ K  P ++   +    +  + ++I A  + G+AN  L++Y+S +L   +
Sbjct: 1   MPPKESPFKVNLKIIPSARKNELAGYENGLLKLKIAAQPEKGKANKELIDYLSDLLDTPK 60

Query: 93  RQVSIGSGSKSRDKIV 108
            ++ I  G   R+K++
Sbjct: 61  AEIEICHGHTGRNKVL 76


>gi|91228690|ref|ZP_01262604.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01]
 gi|91187761|gb|EAS74079.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01]
          Length = 96

 Score = 35.0 bits (79), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 18/72 (25%), Positives = 36/72 (50%)

Query: 40  VSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGS 99
           + + ++ +P +    I  +  E + + I AP  DG+ANA L ++++    + +  V I  
Sbjct: 12  IILKLYIQPKASRDKIVGLHGEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGLVHIEK 71

Query: 100 GSKSRDKIVIVE 111
           G   R K + +E
Sbjct: 72  GELGRHKQIRIE 83


>gi|406836793|ref|ZP_11096387.1| hypothetical protein SpalD1_34341 [Schlesneria paludicola DSM
           18645]
          Length = 106

 Score = 35.0 bits (79), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 20/88 (22%), Positives = 43/88 (48%)

Query: 37  PSSVSITIHAKPGSKSCSITDVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVS 96
           P    + + A  G++   IT   D A+ V +    + G+AN+A+++ ++  L +   +  
Sbjct: 10  PDGAILRVKAHAGARRNGITGTFDGAIRVAVTQAPEKGKANSAIIQVLADALKIGPSRFE 69

Query: 97  IGSGSKSRDKIVIVEEITPENVLNSLGK 124
           I SG  +  K ++V   +   +L+ L +
Sbjct: 70  ILSGHTANLKRILVRGFSSAELLDRLNR 97


>gi|126463218|ref|YP_001044332.1| hypothetical protein Rsph17029_2458 [Rhodobacter sphaeroides ATCC
           17029]
 gi|126104882|gb|ABN77560.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC
           17029]
          Length = 85

 Score = 35.0 bits (79), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 17/50 (34%), Positives = 31/50 (62%)

Query: 57  DVSDEAVGVQIDAPAKDGEANAALLEYMSSVLGVKRRQVSIGSGSKSRDK 106
           D+ +  V V +    +DG+AN A+ E ++  LGV + ++++  G+ SRDK
Sbjct: 31  DLQEGVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRLTLVRGATSRDK 80


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.304    0.122    0.318 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,837,908,282
Number of Sequences: 23463169
Number of extensions: 65183018
Number of successful extensions: 115750
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 947
Number of HSP's successfully gapped in prelim test: 114
Number of HSP's that attempted gapping in prelim test: 114763
Number of HSP's gapped (non-prelim): 1062
length of query: 128
length of database: 8,064,228,071
effective HSP length: 94
effective length of query: 34
effective length of database: 10,153,657,481
effective search space: 345224354354
effective search space used: 345224354354
T: 11
A: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 71 (32.0 bits)