BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] (98 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done Results from round 1 >gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] gi|254039908|gb|ACT56704.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] Length = 98 Score = 193 bits (490), Expect = 7e-48, Method: Compositional matrix adjust. Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS Sbjct: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL Sbjct: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 >gi|29839726|sp|Q8UC38|Y2660_AGRT5 RecName: Full=UPF0235 protein Atu2660 Length = 112 Score = 71.2 bits (173), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 35/86 (40%), Positives = 55/86 (63%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + I +E +D H+K +V+A P+ GKANKA++ +LAKKL L KSS+ Sbjct: 25 VRLTPNGGRDAIDGVE--QDADGNAHLKARVSAVPEGGKANKALIVLLAKKLGLPKSSIT 82 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 +S +++ KI+ ID D ++ +L + Sbjct: 83 FISGETARKKILRIDTDPEDFEKLFK 108 >gi|159185312|ref|NP_530540.1| hypothetical protein Atu2660 [Agrobacterium tumefaciens str. C58] gi|17741163|gb|AAL43641.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 183 Score = 70.9 bits (172), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 35/86 (40%), Positives = 55/86 (63%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + I +E +D H+K +V+A P+ GKANKA++ +LAKKL L KSS+ Sbjct: 96 VRLTPNGGRDAIDGVE--QDADGNAHLKARVSAVPEGGKANKALIVLLAKKLGLPKSSIT 153 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 +S +++ KI+ ID D ++ +L + Sbjct: 154 FISGETARKKILRIDTDPEDFEKLFK 179 >gi|222150063|ref|YP_002551020.1| hypothetical protein Avi_4153 [Agrobacterium vitis S4] gi|221737045|gb|ACM38008.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 104 Score = 70.9 bits (172), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 31/86 (36%), Positives = 60/86 (69%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + GI +++ + + H+K++V+ P+KG+ANKA++A+LAK+L ++KS++ Sbjct: 16 VRLTPNGGRDGIDGVDV--NANGEAHLKVRVSDVPEKGRANKALIALLAKRLGVAKSAVS 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S ++ KI+ ID D +++ L+ Sbjct: 74 LISGDAARQKILRIDGDPEDLIGRLE 99 >gi|315122788|ref|YP_004063277.1| hypothetical protein CKC_05215 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496190|gb|ADR52789.1| hypothetical protein CKC_05215 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 50 Score = 68.6 bits (166), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 33/50 (66%), Positives = 43/50 (86%) Query: 49 MLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 ML +LA++L+L+KSSL+MLSK SSP+K IYIDKDCKEI EL + N+ +TL Sbjct: 1 MLTILAERLSLNKSSLKMLSKHSSPIKKIYIDKDCKEIIELFKRNNPVTL 50 >gi|222087485|ref|YP_002546022.1| hypothetical protein Arad_4365 [Agrobacterium radiobacter K84] gi|221724933|gb|ACM28089.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 104 Score = 67.8 bits (164), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 34/86 (39%), Positives = 57/86 (66%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + + +E D ++K +V+A P+KGKANKA++A+LAK+L++ KSSL Sbjct: 16 VRLTPNGGRDAVDGIETGADGE--AYLKARVSAVPEKGKANKALIALLAKRLSIPKSSLS 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S ++ KI+ ID D +++ L+ Sbjct: 74 LISGDTARKKILRIDGDPEDLIGRLK 99 >gi|325294029|ref|YP_004279893.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3] gi|325061882|gb|ADY65573.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3] Length = 112 Score = 67.4 bits (163), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 33/86 (38%), Positives = 52/86 (60%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + I +E +D H+K +V+A P+ GKANKA++ +LAKK L KS + Sbjct: 25 VRLTPNGGRDAIDGVE--QDADGNAHLKARVSAVPEGGKANKALVILLAKKFGLPKSPIT 82 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 +S +++ KI+ ID D ++ L + Sbjct: 83 FISGETARKKILRIDTDPEDFETLFR 108 >gi|227823716|ref|YP_002827689.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234] gi|227342718|gb|ACP26936.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234] Length = 105 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 32/85 (37%), Positives = 51/85 (60%), Gaps = 2/85 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + I EI D + H+K++V A P+KGKAN A++ +LAK L+K+ + Sbjct: 17 VRLTPNGGRDAIDGFEIAADGEE--HLKVRVRAVPEKGKANDALIGLLAKAFGLAKNRIA 74 Query: 66 MLSKQSSPLKIIYIDKDCKEITELL 90 ++S + KI+ I+ D + I + L Sbjct: 75 LVSGDTQRKKILRIEADPEAIQKRL 99 >gi|218461695|ref|ZP_03501786.1| hypothetical protein RetlK5_20376 [Rhizobium etli Kim 5] Length = 103 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 29/86 (33%), Positives = 58/86 (67%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + +E D+ ++K +VTA P+KGKANKA++A++++ + ++KSS+ Sbjct: 16 VRLTPNGGRGAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSQSVGVAKSSVS 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S +++ KI+ I+ D +++ + L+ Sbjct: 74 LVSGETARKKILRIEGDPEDLAQKLE 99 >gi|327188872|gb|EGE56064.1| hypothetical protein RHECNPAF_750023 [Rhizobium etli CNPAF512] Length = 103 Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 30/86 (34%), Positives = 58/86 (67%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + +E D+ ++K +VTA P+KGKANKA++A+++K + ++KSS+ Sbjct: 16 VRLTPNGGRDAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSKSVGVAKSSIT 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S +++ KI+ I+ D +++ + L+ Sbjct: 74 LVSGETARKKILRIEGDPEDLAKKLE 99 >gi|209551279|ref|YP_002283196.1| hypothetical protein Rleg2_3707 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|226705832|sp|B5ZTD4|Y3707_RHILW RecName: Full=UPF0235 protein Rleg2_3707 gi|209537035|gb|ACI56970.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 103 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 31/86 (36%), Positives = 54/86 (62%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +RL PN + I E D ++K +VT P+KGKANKA++ ++AK L ++KSS+ Sbjct: 16 IRLTPNGGRDAIDGAET--DGEGEAYLKTRVTTVPEKGKANKALILLIAKSLGIAKSSVS 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S ++ KI+ ID D +++ + L+ Sbjct: 74 LVSGDTARKKILRIDGDPEDLGKKLE 99 >gi|116254232|ref|YP_770070.1| hypothetical protein RL4503 [Rhizobium leguminosarum bv. viciae 3841] gi|166227262|sp|Q1MAP9|Y4503_RHIL3 RecName: Full=UPF0235 protein RL4503 gi|115258880|emb|CAK09988.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 103 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 32/86 (37%), Positives = 55/86 (63%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + + +E D +K +VTA P+KGKANKA++ ++AK L + KSS+ Sbjct: 16 VRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALMLLIAKSLRIPKSSVS 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S +++ KI+ ID D +++ + L+ Sbjct: 74 LVSGETARKKILRIDGDPEDLVKKLE 99 >gi|190893761|ref|YP_001980303.1| hypothetical protein RHECIAT_CH0004196 [Rhizobium etli CIAT 652] gi|226706142|sp|B3PQB3|Y4196_RHIE6 RecName: Full=UPF0235 protein RHECIAT_CH0004196 gi|190699040|gb|ACE93125.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 103 Score = 62.4 bits (150), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 30/86 (34%), Positives = 57/86 (66%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + +E + ++K +VTA P+KGKANKA++A+++K L ++KSS+ Sbjct: 16 VRLTPNGGRDAFDGIET--GSEGETYLKARVTAIPEKGKANKALIALVSKSLGVAKSSIT 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S +++ KI+ I+ D +++ + L+ Sbjct: 74 LVSGETARKKILRIEGDPEDLAKKLE 99 >gi|218672707|ref|ZP_03522376.1| hypothetical protein RetlG_14247 [Rhizobium etli GR56] Length = 89 Score = 61.6 bits (148), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 29/75 (38%), Positives = 51/75 (68%), Gaps = 2/75 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + +E D+ ++K +VTA P+KGKANKA++A+++K L ++KSS+ Sbjct: 16 VRLTPNGGRDAFDGIET--DSQGETYLKARVTAVPEKGKANKALIALVSKSLGVAKSSVS 73 Query: 66 MLSKQSSPLKIIYID 80 ++S +++ KI+ I+ Sbjct: 74 LVSGETARKKILRIE 88 >gi|86359493|ref|YP_471385.1| hypothetical protein RHE_CH03912 [Rhizobium etli CFN 42] gi|123510540|sp|Q2K3C8|Y3912_RHIEC RecName: Full=UPF0235 protein RHE_CH03912 gi|86283595|gb|ABC92658.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 112 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 28/86 (32%), Positives = 56/86 (65%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + ++ D+ ++ +VTA P+KGKANKA++A+++K + ++KSS+ Sbjct: 25 VRLTPNGGRDAFDGIDT--DSEGETYLGARVTAVPEKGKANKALIALVSKSVGVAKSSVS 82 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S +++ KI+ I+ D +++ L+ Sbjct: 83 VISGETARKKILRIEGDPEDLARKLE 108 >gi|241206712|ref|YP_002977808.1| hypothetical protein Rleg_4028 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860602|gb|ACS58269.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 103 Score = 60.5 bits (145), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 31/86 (36%), Positives = 54/86 (62%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + + +E D +K +VTA P+KGKANKA++ ++A+ L + KSS+ Sbjct: 16 VRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALILLIAQSLRIPKSSVS 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S ++ KI+ ID D +++ + L+ Sbjct: 74 LISGDTARKKILRIDGDPEDLVKKLE 99 >gi|218680333|ref|ZP_03528230.1| hypothetical protein RetlC8_16135 [Rhizobium etli CIAT 894] Length = 103 Score = 60.5 bits (145), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 30/86 (34%), Positives = 55/86 (63%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL PN + I +E D ++ +VT+ P+KGKANKA++ ++A+ L + KSS+ Sbjct: 16 VRLTPNGGRDAIDGIEA--DGEGETFLRARVTSVPEKGKANKALILLVAQSLRIPKSSIS 73 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 ++S +++ KI+ ID D +++ + L+ Sbjct: 74 LVSGETARKKILRIDGDPEDLAKKLE 99 >gi|315497739|ref|YP_004086543.1| hypothetical protein Astex_0706 [Asticcacaulis excentricus CB 48] gi|315415751|gb|ADU12392.1| protein of unknown function DUF167 [Asticcacaulis excentricus CB 48] Length = 89 Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 31/80 (38%), Positives = 50/80 (62%), Gaps = 2/80 (2%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M ++VRL P A + ++ D ++K++VTA P +G+AN+A++A LAK+L L Sbjct: 1 MARLVVRLTPKAAADRVDGWDM--DEQGRPYLKVRVTAPPIEGRANEALIAFLAKRLKLP 58 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS L +L+ SS LK I ++ Sbjct: 59 KSRLSLLAGDSSRLKQIEVE 78 >gi|241762300|ref|ZP_04760381.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241373203|gb|EER62833.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 113 Score = 57.4 bits (137), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 29/87 (33%), Positives = 54/87 (62%), Gaps = 2/87 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +R+ A K+GI + KDT+ +I+V A P +G +NK ++A L+K ++ K ++R Sbjct: 23 LRVTARASKTGITMFD--KDTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPKGAVR 80 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQN 92 + S + S +KI++I D K +TE+ ++ Sbjct: 81 IESGEHSKIKILHIAGDVKRLTEIAED 107 >gi|319406122|emb|CBI79752.1| conserved hypothetical protein [Bartonella sp. AR 15-3] Length = 107 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D H+ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRGDGKQ--HLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K +Y K KEI ++LQ Sbjct: 70 ISLGSGATSHYKQLYFSKYIKEIEQILQ 97 >gi|319899260|ref|YP_004159353.1| hypothetical protein BARCL_1102 [Bartonella clarridgeiae 73] gi|319403224|emb|CBI76783.1| conserved protein of unknown function [Bartonella clarridgeiae 73] Length = 110 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 5/96 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D H+ I++ A P+ GKAN+A++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVEDRDDGKQ--HLIIRLRAIPENGKANRALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ---NNDSL 96 + + S +S K +Y K KEI ++LQ N D L Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEIEQILQSFRNYDFL 105 >gi|56551705|ref|YP_162544.1| hypothetical protein ZMO0809 [Zymomonas mobilis subsp. mobilis ZM4] gi|260752718|ref|YP_003225611.1| hypothetical protein Za10_0477 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|56543279|gb|AAV89433.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ZM4] gi|258552081|gb|ACV75027.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 113 Score = 55.5 bits (132), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 28/87 (32%), Positives = 54/87 (62%), Gaps = 2/87 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +R+ A K+GI + KDT+ +I+V A P +G +NK ++A L+K ++ K +++ Sbjct: 23 LRVTARASKTGITMFD--KDTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPKGAVK 80 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQN 92 + S + S +KI++I D K +TE+ ++ Sbjct: 81 IESGEHSKIKILHIAGDVKRLTEIAED 107 >gi|319407608|emb|CBI81258.1| conserved hypothetical protein [Bartonella sp. 1-1C] Length = 101 Score = 53.5 bits (127), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y K KE+ ++LQ+ Sbjct: 70 ISLGSGTTSHYKQLYFSKYLKEVEQILQS 98 >gi|316931646|ref|YP_004106628.1| hypothetical protein Rpdx1_0252 [Rhodopseudomonas palustris DX-1] gi|315599360|gb|ADU41895.1| protein of unknown function DUF167 [Rhodopseudomonas palustris DX-1] Length = 108 Score = 53.5 bits (127), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D +K++V A G+AN+A+ +LAK + + K + Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRPV--LKVRVRAIADGGEANRAVTELLAKAVGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K++ E+L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEVLRR 100 >gi|187735949|ref|YP_001878061.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC BAA-835] gi|187426001|gb|ACD05280.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC BAA-835] Length = 96 Score = 53.5 bits (127), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 31/90 (34%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +++IPNAKKS E +D +K+++ A P +GKANKA++ L+ L + +SS+ Sbjct: 5 LKVIPNAKKSEAVGWE--EDPRAGRALKLRIAAPPVEGKANKAVVLFLSAWLDIPRSSIS 62 Query: 66 MLSKQSSPLKIIYIDKDCK-EITELLQNND 94 L +SS LK++ + C+ ++ LL D Sbjct: 63 FLRGESSRLKVVELPDGCEGKLARLLSAED 92 >gi|86747236|ref|YP_483732.1| hypothetical protein RPB_0109 [Rhodopseudomonas palustris HaA2] gi|123293376|sp|Q2J3Y9|Y109_RHOP2 RecName: Full=UPF0235 protein RPB_0109 gi|86570264|gb|ABD04821.1| Protein of unknown function DUF167 [Rhodopseudomonas palustris HaA2] Length = 108 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A++ +LAK L + K + Sbjct: 14 VAVRVTPRGDRDEIDGLETLSDGRPVV--KLRVRAIADGGEANRAVIELLAKALGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K + E L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKSLGETLRQ 100 >gi|39933491|ref|NP_945767.1| hypothetical protein RPA0414 [Rhodopseudomonas palustris CGA009] gi|39647337|emb|CAE25858.1| DUF167 [Rhodopseudomonas palustris CGA009] Length = 112 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A+ +LAK + + K + Sbjct: 18 VAVRVTPRGGRDDIDGLETLSDGRPVV--KVRVRAIADGGEANRAVTELLAKAVGVPKRN 75 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K++ E L+ Sbjct: 76 VRLLSGATSRQKQIAIDGDPKQLGEALRR 104 >gi|192288849|ref|YP_001989454.1| hypothetical protein Rpal_0418 [Rhodopseudomonas palustris TIE-1] gi|226706140|sp|B3QA92|Y418_RHOPT RecName: Full=UPF0235 protein Rpal_0418 gi|192282598|gb|ACE98978.1| protein of unknown function DUF167 [Rhodopseudomonas palustris TIE-1] Length = 108 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A+ +LAK + + K + Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRPVV--KVRVRAIADGGEANRAVTELLAKAVGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K++ E L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEALRR 100 >gi|319404616|emb|CBI78222.1| conserved hypothetical protein [Bartonella rochalimae ATCC BAA-1498] Length = 101 Score = 52.8 bits (125), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y K KE+ ++LQ+ Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEVEKILQS 98 >gi|90421586|ref|YP_529956.1| hypothetical protein RPC_0058 [Rhodopseudomonas palustris BisB18] gi|122477773|sp|Q21D99|Y058_RHOPB RecName: Full=UPF0235 protein RPC_0058 gi|90103600|gb|ABD85637.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisB18] Length = 107 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 29/89 (32%), Positives = 51/89 (57%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +V +R+ P + I LE + + K++V A + G+AN+A+ +LAK L + K Sbjct: 13 SVALRVTPRGGRDDIDGLETLANGRTVV--KVRVRAIAEGGEANRAVTELLAKALGVPKR 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 ++R+LS +S LK + +D D E+ E L+ Sbjct: 71 AVRVLSGTTSRLKQVAVDGDPNELGEALR 99 >gi|27375661|ref|NP_767190.1| hypothetical protein bsl0550 [Bradyrhizobium japonicum USDA 110] gi|27348798|dbj|BAC45815.1| bsl0550 [Bradyrhizobium japonicum USDA 110] Length = 94 Score = 52.0 bits (123), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 2/85 (2%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P + I +E D +K++V A G+ANKA+L +LAK L + K+S+R+L Sbjct: 1 MTPRGGRDDIDGIEQLADGRSV--LKVRVRAIADGGEANKAVLVLLAKSLGVPKASVRLL 58 Query: 68 SKQSSPLKIIYIDKDCKEITELLQN 92 S +S LK I +D D + E L+ Sbjct: 59 SGATSRLKQIAVDGDPARLGETLRQ 83 >gi|91975173|ref|YP_567832.1| hypothetical protein RPD_0693 [Rhodopseudomonas palustris BisB5] gi|91681629|gb|ABE37931.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisB5] Length = 107 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A++ +LAK L + K + Sbjct: 13 VAVRVTPRGGRDEIDGLETLSDGRPVV--KVRVRAIADGGEANRAVIELLAKSLGVPKRN 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D ++ E L+ Sbjct: 71 VRLLSGATSRQKQIAIDGDPTKLGEALRR 99 >gi|170290086|ref|YP_001736902.1| hypothetical protein Kcr_0466 [Candidatus Korarchaeum cryptofilum OPF8] gi|170174166|gb|ACB07219.1| protein of unknown function DUF167 [Candidatus Korarchaeum cryptofilum OPF8] Length = 72 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 46/74 (62%), Gaps = 8/74 (10%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 ++PNA ++G+ + D H+++ V A P KGKAN+A++ +LA+ + KS +R++ Sbjct: 7 VVPNAGRNGVV------EEGD--HLRVYVRAPPVKGKANEAVIEVLAEFFGVKKSDIRII 58 Query: 68 SKQSSPLKIIYIDK 81 S + S K++ I K Sbjct: 59 SGERSREKVVEIRK 72 >gi|209966140|ref|YP_002299055.1| hypothetical protein RC1_2875 [Rhodospirillum centenum SW] gi|209959606|gb|ACJ00243.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 117 Score = 51.2 bits (121), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 28/86 (32%), Positives = 51/86 (59%), Gaps = 4/86 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDT-IHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 V +R+ P A ++ +++ P D + +K+ VTA P+ GKAN A++A+LAK L Sbjct: 23 VRVALRVTPKASRT---AVQGPMDGPEGRTLLKLAVTAVPEDGKANAAVIALLAKHWRLP 79 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEI 86 KSS+ ++S + K+++I D ++ Sbjct: 80 KSSMSIVSGGTDRTKVLFIAGDAADL 105 >gi|92116228|ref|YP_575957.1| hypothetical protein Nham_0608 [Nitrobacter hamburgensis X14] gi|91799122|gb|ABE61497.1| protein of unknown function DUF167 [Nitrobacter hamburgensis X14] Length = 106 Score = 50.8 bits (120), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 26/89 (29%), Positives = 53/89 (59%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ +R+ P + GI +E+ D + K++V A + G+AN+A++A+LAK L + K Sbjct: 12 SIALRVTPRGGRDGIDGIEMLADGRPVV--KVRVRAIAEGGEANRAVMAVLAKALGVRKI 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +R+L+ +S LK + + D ++ + L+ Sbjct: 70 DVRILAGATSRLKQVAVGGDPVKLGDALR 98 >gi|254295256|ref|YP_003061279.1| hypothetical protein Hbal_2912 [Hirschia baltica ATCC 49814] gi|254043787|gb|ACT60582.1| protein of unknown function DUF167 [Hirschia baltica ATCC 49814] Length = 109 Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 27/77 (35%), Positives = 45/77 (58%), Gaps = 2/77 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++ R+ PNA K + + E +D + ++K++V A P KGKANKA+ +LA L KS Sbjct: 13 DIVARVTPNASKDAVEAPE--QDAAGRTYLKLRVRAIPDKGKANKAVEKLLASHFNLPKS 70 Query: 63 SLRMLSKQSSPLKIIYI 79 + ++ + LK I I Sbjct: 71 KVAVVKGSTDRLKTIRI 87 >gi|329849894|ref|ZP_08264740.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19] gi|328841805|gb|EGF91375.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19] Length = 86 Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 2/78 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P + I +D +K++V A P +G+AN+A++ LAK L + KS L Sbjct: 5 VRLTPRSSADAIDGW--GEDEQGRRFLKVRVRAAPIEGRANEALIVFLAKTLGVPKSRLS 62 Query: 66 MLSKQSSPLKIIYIDKDC 83 +++ +S LK I ID D Sbjct: 63 LVAGDTSRLKQIEIDGDV 80 >gi|297620780|ref|YP_003708917.1| yggU family protein [Waddlia chondrophila WSU 86-1044] gi|297376081|gb|ADI37911.1| yggU family protein [Waddlia chondrophila WSU 86-1044] Length = 94 Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 30/85 (35%), Positives = 49/85 (57%), Gaps = 7/85 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V+LIPNA K I+ E + I +K+++TA P+KGKAN ++ LA +L +SKS + Sbjct: 13 VKLIPNAGKDEISGWE------NGI-LKVRITAVPEKGKANAHLIKFLASQLKVSKSDIT 65 Query: 66 MLSKQSSPLKIIYIDKDCKEITELL 90 ++ + + K + I D + E L Sbjct: 66 LIKGEKNRHKTLLIKGDADVVGERL 90 >gi|23011359|ref|ZP_00051742.1| COG1872: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] Length = 99 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 26/87 (29%), Positives = 49/87 (56%), Gaps = 2/87 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P A ++G+ + + D + ++V A P +G AN A+ A +AK L L K+ + Sbjct: 10 VRLTPRASRTGLDGVRVDADGRPV--LGLRVAAPPVEGAANAALTAFVAKSLKLRKAEVV 67 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQN 92 ++S ++S K +++ D KE+ ++ Sbjct: 68 LVSGEASRTKRLHLTGDAKELAARVET 94 >gi|83594855|ref|YP_428607.1| hypothetical protein Rru_A3526 [Rhodospirillum rubrum ATCC 11170] gi|83577769|gb|ABC24320.1| Protein of unknown function DUF167 [Rhodospirillum rubrum ATCC 11170] Length = 113 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 32/92 (34%), Positives = 53/92 (57%), Gaps = 3/92 (3%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +RL P A + G++ + D S + +K VTA P+ GKAN A+L +L+++ L +SSL Sbjct: 21 LRLTPKAGRDGVSGVVAEADGS--LVVKASVTAVPEDGKANAALLKLLSRQWKLPRSSLA 78 Query: 66 MLSKQSSPLKIIYIDKDCKEIT-ELLQNNDSL 96 ++ Q+ K+I I + +T L+ DSL Sbjct: 79 VVHGQTDRRKVIEISGEPALLTPRLVAWVDSL 110 >gi|144900433|emb|CAM77297.1| protein containing DUF167 [Magnetospirillum gryphiswaldense MSR-1] Length = 107 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 2/88 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +RL P ++ I L D + +K VTA P+ GKAN A++ MLAK+ ++KS Sbjct: 14 RVFIRLTPKGSRNKIDGLAAEADGG--MVLKASVTAVPEDGKANAALIKMLAKEWRVAKS 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 +++ + K + I D E+ L Sbjct: 72 DFEIVAGATDRRKTVLISGDGAEMAARL 99 >gi|254443993|ref|ZP_05057469.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] gi|198258301|gb|EDY82609.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] Length = 94 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/76 (36%), Positives = 49/76 (64%), Gaps = 6/76 (7%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V+++PNA +S IA +D S +KI++ + PQ GKANKA++A LAK+ +SK+ + Sbjct: 13 VKVLPNASRSEIAGWL--EDGS----LKIRIQSPPQDGKANKALIAFLAKETGVSKNQIS 66 Query: 66 MLSKQSSPLKIIYIDK 81 + ++S K+I ++ Sbjct: 67 IARGETSRQKLIAFER 82 >gi|32475275|ref|NP_868269.1| hypothetical protein RB8260 [Rhodopirellula baltica SH 1] gi|47117454|sp|Q7UFY2|Y8260_RHOBA RecName: Full=UPF0235 protein RB8260 gi|32445816|emb|CAD78547.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 108 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 28/75 (37%), Positives = 46/75 (61%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P AKK+ + L D + +K+ V P+ GKANKA++A LAK L +SK + Sbjct: 22 VRVTPKAKKASVGGLH---DGA----LKVSVHTVPEDGKANKAVIASLAKWLRVSKGRVA 74 Query: 66 MLSKQSSPLKIIYID 80 +++ ++S LK I ++ Sbjct: 75 IVAGETSRLKTIVVE 89 >gi|49473825|ref|YP_031867.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse] gi|49239328|emb|CAF25660.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse] Length = 114 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP A I +E D + I++ P+ GKANKA++ LAK+ + S Sbjct: 12 LFVYLIPKASVDKIIGVECRDDGKQ--RLVIRLRTLPENGKANKALIKFLAKQWKIPSSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S ++S K +Y +E+ E+LQ+ Sbjct: 70 ISLKSGETSRYKQLYFSGYLQEVGEILQS 98 >gi|170749303|ref|YP_001755563.1| hypothetical protein Mrad2831_2896 [Methylobacterium radiotolerans JCM 2831] gi|170655825|gb|ACB24880.1| protein of unknown function DUF167 [Methylobacterium radiotolerans JCM 2831] Length = 93 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 25/75 (33%), Positives = 43/75 (57%), Gaps = 2/75 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P + A+ +D ++K +V A P +G AN A++ ++AK L + + S+R Sbjct: 7 VRLTPRGGRD--AAEGWARDEKGQPYLKARVAAPPVEGAANAALVVLIAKALKVGRGSVR 64 Query: 66 MLSKQSSPLKIIYID 80 +++ S LKI+ ID Sbjct: 65 IVTGDQSRLKILEID 79 >gi|163783198|ref|ZP_02178192.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1] gi|159881532|gb|EDP75042.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1] Length = 73 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 22/75 (29%), Positives = 49/75 (65%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P++K+ G+ ++ S ++++V+A P++GKAN+ ++ +LAK + K ++R Sbjct: 5 VKVKPSSKREGV------REVSPG-ELEVRVSAPPERGKANERLIELLAKHYGVRKGAVR 57 Query: 66 MLSKQSSPLKIIYID 80 +L ++S K++ ID Sbjct: 58 ILRGETSREKVVEID 72 >gi|87311293|ref|ZP_01093415.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM 3645] gi|87286033|gb|EAQ77945.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM 3645] Length = 100 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 28/74 (37%), Positives = 44/74 (59%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR +P +KK+ EI + +K+ VTA P+ GKANKA++ +LAKKL L KS L Sbjct: 15 VRALPGSKKN-----EIRGEQQGA--LKVSVTAAPEDGKANKAIVELLAKKLVLRKSQLE 67 Query: 66 MLSKQSSPLKIIYI 79 +++ + K + + Sbjct: 68 IIAGHTHRQKRVLV 81 >gi|49474981|ref|YP_033022.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1] gi|49237786|emb|CAF26979.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1] Length = 111 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 27/88 (30%), Positives = 47/88 (53%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V +IP A I +E D H+ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVYIIPKASGDKIMGIECKNDGKR--HLVIRLRAIPENGKANKALIKFLAKQWKIPSSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K +Y +++ E+L+ Sbjct: 70 ISLKSGGTSRYKQLYFSGYLEKLKEILR 97 >gi|168703344|ref|ZP_02735621.1| hypothetical protein GobsU_27681 [Gemmata obscuriglobus UQM 2246] Length = 101 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + VR+ P AKK+ + +++ VTA P+ G+AN A+LA+L L + Sbjct: 12 CTLAVRVQPKAKKNAVLGERASA-------LRVSVTAPPEDGRANDAVLALLCDHFKLQR 64 Query: 62 SSLRMLSKQSSPLKIIYI 79 S L +LS Q++ K+I + Sbjct: 65 SQLALLSGQTNRNKVILV 82 >gi|307293981|ref|ZP_07573825.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum L-1] gi|306880132|gb|EFN11349.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum L-1] Length = 109 Score = 47.4 bits (111), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 25/79 (31%), Positives = 44/79 (55%), Gaps = 2/79 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P A + I + D + +V A P+KG+AN A++A+LAK+L +S++ Sbjct: 15 VRLTPGAAREDIGG--VWTDEKGAQWLGARVRAVPEKGRANTALIALLAKRLDWPRSAIS 72 Query: 66 MLSKQSSPLKIIYIDKDCK 84 + S ++ LK + I+ + Sbjct: 73 LESGDTNRLKRLRIEGGGE 91 >gi|330444266|ref|YP_004377252.1| hypothetical protein G5S_0576 [Chlamydophila pecorum E58] gi|328807376|gb|AEB41549.1| conserved hypothetical protein [Chlamydophila pecorum E58] Length = 100 Score = 47.4 bits (111), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 28/85 (32%), Positives = 51/85 (60%), Gaps = 7/85 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 +C + V++ P AK++ I + +K++VT P+KGKAN+A++++LAK L + Sbjct: 16 LCILEVQVTPKAKENKIVGFQ-------GEVLKVRVTEPPEKGKANEAVVSLLAKALGIP 68 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE 85 K + ++S +SS K + I K +E Sbjct: 69 KRDVTLVSGESSRKKKLMIPKKVQE 93 >gi|329942568|ref|ZP_08291378.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10] gi|313847795|emb|CBY16785.1| conserved hypothetical protein [Chlamydophila psittaci RD1] gi|325506510|gb|ADZ18148.1| conserved hypothetical protein [Chlamydophila psittaci 6BC] gi|328815478|gb|EGF85466.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10] gi|328914447|gb|AEB55280.1| conserved hypothetical protein [Chlamydophila psittaci 6BC] Length = 92 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 29/81 (35%), Positives = 49/81 (60%), Gaps = 7/81 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P +K++ I E +KI+VT P+KGKAN+A++A+LAK L+L K + Sbjct: 10 VKVTPKSKENKIVGFE-------GEVLKIRVTEVPEKGKANEAVIALLAKTLSLPKRDVT 62 Query: 66 MLSKQSSPLKIIYIDKDCKEI 86 ++S ++S K I + K + I Sbjct: 63 LISGETSKNKRILLPKATESI 83 >gi|163757393|ref|ZP_02164482.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43] gi|162284895|gb|EDQ35177.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43] Length = 63 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 23/56 (41%), Positives = 38/56 (67%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEIT 87 MKI+V A + KAN+A+ A LAK+L L+KS +R++S +S K + ++ D +E+ Sbjct: 1 MKIRVRAVAENNKANRALEAFLAKRLKLAKSRVRVISGANSRTKTVRLEGDPQELA 56 >gi|15604669|ref|NP_221187.1| hypothetical protein RP839 [Rickettsia prowazekii str. Madrid E] gi|6686136|sp|Q9ZCC0|Y839_RICPR RecName: Full=UPF0235 protein RP839 gi|3861364|emb|CAA15263.1| unknown [Rickettsia prowazekii] gi|292572500|gb|ADE30415.1| hypothetical protein rpr22_CDS819 [Rickettsia prowazekii Rp22] Length = 105 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 52/87 (59%), Gaps = 3/87 (3%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P AK++ I + I ++ ++K+ + ATP++GKAN+ ++ LAK+ LS+SS+ Sbjct: 18 VKVKPYAKQNLIGNFVI---INNIPYIKLAIKATPEQGKANEGIIHYLAKEWELSRSSIE 74 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ + LK I I ++ L+ N Sbjct: 75 IIKGHTHSLKTILIKNINEDYLNLIIN 101 >gi|284046655|ref|YP_003396995.1| hypothetical protein Cwoe_5214 [Conexibacter woesei DSM 14684] gi|283950876|gb|ADB53620.1| protein of unknown function DUF167 [Conexibacter woesei DSM 14684] Length = 88 Score = 47.0 bits (110), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 27/80 (33%), Positives = 49/80 (61%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M ++ VRL P AK++ I +D + + ++VTA P GKAN A+ +LAK L ++ Sbjct: 1 MGDLRVRLQPRAKRNEIVG---ERDGA----LVVRVTAPPVDGKANAALCRLLAKALGVA 53 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S++ ++ QS+ K++++D Sbjct: 54 PSTVTVVRGQSARDKVVHVD 73 >gi|296445622|ref|ZP_06887577.1| protein of unknown function DUF167 [Methylosinus trichosporium OB3b] gi|296256867|gb|EFH03939.1| protein of unknown function DUF167 [Methylosinus trichosporium OB3b] Length = 109 Score = 47.0 bits (110), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 24/88 (27%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P + ++ +E D +K +V A P+ G+AN+A++A++A+ L K S++ Sbjct: 18 VRLTPKGGRDALSGVETLADGRAV--LKARVRAAPEDGRANEALVALIAQALGAPKRSVQ 75 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + ++ LK ++I D + L+ + Sbjct: 76 IAAGHTARLKKLFIAGDPASLVAALEKS 103 >gi|327540387|gb|EGF26973.1| protein containing DUF167 [Rhodopirellula baltica WH47] Length = 87 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 27/74 (36%), Positives = 45/74 (60%), Gaps = 7/74 (9%) Query: 7 RLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRM 66 R+ P AKK+ + L D + +K+ V P+ GKANKA++A LAK L +SK + + Sbjct: 2 RVTPKAKKASVGGLH---DGA----LKVSVHMVPEDGKANKAVIASLAKWLRVSKGRVAI 54 Query: 67 LSKQSSPLKIIYID 80 ++ ++S LK I ++ Sbjct: 55 VAGETSRLKTIVVE 68 >gi|237747224|ref|ZP_04577704.1| predicted protein [Oxalobacter formigenes HOxBLS] gi|229378575|gb|EEO28666.1| predicted protein [Oxalobacter formigenes HOxBLS] Length = 100 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 30/91 (32%), Positives = 55/91 (60%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++IPNA+KS I S S+ ++I++ A P GKAN+A++ +LAKKL + + Sbjct: 15 IAVQVIPNARKSEIVS-------SEGETLRIRLQAQPVDGKANEALVQLLAKKLRVPRKQ 67 Query: 64 LRM---LSKQSSPLKIIYIDKDCKEITELLQ 91 + + L+ + L++I D+ ++I + LQ Sbjct: 68 VSITHGLANKRKLLEVIVSDRSQEDIVKQLQ 98 >gi|115522078|ref|YP_778989.1| hypothetical protein RPE_0048 [Rhodopseudomonas palustris BisA53] gi|115516025|gb|ABJ04009.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisA53] Length = 107 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 26/80 (32%), Positives = 46/80 (57%), Gaps = 2/80 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +V +R+ P + I LE + + K++V A + G+AN+A+ +LAK L + K Sbjct: 12 SVALRVTPRGGRDAIDGLETLANGRTVV--KVRVRAIAEGGEANRAVTELLAKALGVPKR 69 Query: 63 SLRMLSKQSSPLKIIYIDKD 82 ++R+LS +S LK I +D + Sbjct: 70 AVRVLSGTTSRLKQIAVDGN 89 >gi|300024715|ref|YP_003757326.1| hypothetical protein Hden_3210 [Hyphomicrobium denitrificans ATCC 51888] gi|299526536|gb|ADJ25005.1| protein of unknown function DUF167 [Hyphomicrobium denitrificans ATCC 51888] Length = 116 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 29/88 (32%), Positives = 45/88 (51%), Gaps = 3/88 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V RL P + K I + TSD + +V A P+ G AN A+ ++A+ L L K S Sbjct: 24 VHFRLTPKSSKDAIEGV---TSTSDGPAFQARVRAVPEHGAANAALEQLVARWLDLPKRS 80 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S LK + ID + +E+ LL+ Sbjct: 81 VSLAKGGKSRLKALQIDGEPEELDRLLE 108 >gi|282891527|ref|ZP_06300019.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281498618|gb|EFB40945.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 93 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 47/75 (62%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +++IPNA ++ I E +D +K+ + + P+KGKAN+A++ LAK L L K ++ Sbjct: 10 IKVIPNASRNAILGWE-----ND--ELKMYIASVPEKGKANEAVIKFLAKFLGLRKQQIQ 62 Query: 66 MLSKQSSPLKIIYID 80 ++ +++ KI+ I+ Sbjct: 63 IIRGETNRHKILQIE 77 >gi|29840014|ref|NP_829120.1| hypothetical protein CCA00247 [Chlamydophila caviae GPIC] gi|33301875|sp|Q824A6|Y247_CHLCV RecName: Full=UPF0235 protein CCA_00247 gi|29834361|gb|AAP04998.1| conserved hypothetical protein [Chlamydophila caviae GPIC] Length = 92 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 23/55 (41%), Positives = 39/55 (70%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEI 86 +KI+VT P+KGKAN+A++A+LAK L+L K + ++S ++S K + + K + I Sbjct: 29 LKIRVTEAPEKGKANEAVIALLAKTLSLPKRDVTLISGETSRKKRLLLPKSTESI 83 >gi|186683679|ref|YP_001866875.1| hypothetical protein Npun_R3528 [Nostoc punctiforme PCC 73102] gi|226703856|sp|B2J1T0|Y3528_NOSP7 RecName: Full=UPF0235 protein Npun_R3528 gi|186466131|gb|ACC81932.1| protein of unknown function DUF167 [Nostoc punctiforme PCC 73102] Length = 75 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 31/78 (39%), Positives = 49/78 (62%), Gaps = 7/78 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K+ I E P D S T+++K + P GKAN+ ++ +LAKK ++KS +R Sbjct: 5 VKVKPNSKQQKIE--EQP-DGSLTVYLK----SPPVDGKANEELIKLLAKKFDVAKSDIR 57 Query: 66 MLSKQSSPLKIIYIDKDC 83 + S SS K+I ID+D Sbjct: 58 IKSGLSSRQKLIEIDRDV 75 >gi|225850940|ref|YP_002731174.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1] gi|259646383|sp|C0QR78|Y1406_PERMH RecName: Full=UPF0235 protein PERMA_1406 gi|225645391|gb|ACO03577.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1] Length = 73 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 30/76 (39%), Positives = 45/76 (59%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V++ PNAKK I EI KD + +I+VT P+KGKAN ++ +L+K L + KS Sbjct: 3 VKVKVKPNAKKEEIR--EIQKD-----YFEIRVTVPPEKGKANSRVIELLSKHLKIPKSR 55 Query: 64 LRMLSKQSSPLKIIYI 79 +++ + S KI I Sbjct: 56 IKLKKGEKSREKIFEI 71 >gi|148251886|ref|YP_001236471.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1] gi|146404059|gb|ABQ32565.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1] Length = 111 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 24/90 (26%), Positives = 49/90 (54%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P + I +E D +K++V A G+AN+A+ +LAK + ++K+ Sbjct: 14 TVALRVTPRGGRDAIDGIETLSDGRSV--LKLRVRAVADGGEANRAVTELLAKAIGVTKA 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++R+ S ++ LK + I D + + L++ Sbjct: 72 AVRITSGATARLKQVTITGDASRLDQALRD 101 >gi|13473522|ref|NP_105090.1| hypothetical protein msl4154 [Mesorhizobium loti MAFF303099] gi|29839606|sp|Q98EP2|Y4154_RHILO RecName: Full=UPF0235 protein msl4154 gi|14024272|dbj|BAB50876.1| msl4154 [Mesorhizobium loti MAFF303099] Length = 102 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 28/90 (31%), Positives = 53/90 (58%), Gaps = 4/90 (4%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDT-IHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ VRL P KS + LE + ++D H+K +V A P+ G AN+A+ ++AK L + Sbjct: 13 DLFVRLTP---KSSLDRLEGVETSADGRSHLKARVRAVPENGAANQALERLVAKTLGVPA 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 SS+ +++ +S LK + I D + + + ++ Sbjct: 70 SSVSVVAGGTSRLKTVRIVGDPEALAQRVE 99 >gi|319408144|emb|CBI81797.1| conserved hypothetical protein [Bartonella schoenbuchensis R1] Length = 108 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 27/88 (30%), Positives = 47/88 (53%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E +D +T H+ I++ P+ GKANKA++ L ++ + S Sbjct: 12 LFVRLIPKASMDSIVGVE-SRD-GETQHLVIRLRTVPEDGKANKALIKFLGRQWKIPPSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K + +E+ + LQ Sbjct: 70 ISLKSGMTSRYKQLRFSGYVEELEQKLQ 97 >gi|75910098|ref|YP_324394.1| hypothetical protein Ava_3894 [Anabaena variabilis ATCC 29413] gi|123608489|sp|Q3M687|Y3894_ANAVT RecName: Full=UPF0235 protein Ava_3894 gi|75703823|gb|ABA23499.1| Protein of unknown function DUF167 [Anabaena variabilis ATCC 29413] Length = 75 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 30/77 (38%), Positives = 47/77 (61%), Gaps = 7/77 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K+ IA + D S T+H+K + P GKAN+ ++ +LA+K A+ KS + Sbjct: 5 VKVKPNSKQQKIAEQD---DGSLTVHLK----SPPVDGKANEELIKLLAEKFAVPKSHIT 57 Query: 66 MLSKQSSPLKIIYIDKD 82 + S SS K+I ID D Sbjct: 58 IKSGLSSRQKLIEIDTD 74 >gi|257062952|ref|YP_003142624.1| hypothetical protein Shel_02050 [Slackia heliotrinireducens DSM 20476] gi|256790605|gb|ACV21275.1| uncharacterized conserved protein [Slackia heliotrinireducens DSM 20476] Length = 106 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 22/92 (23%), Positives = 51/92 (55%), Gaps = 2/92 (2%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + P A+++ +A ++ D + + ++++VT P+ GKANKA+ LAK + +S Sbjct: 13 VTQIPIHATPKAQRNAVAGVK--ADDTGRLEVQVRVTVAPEGGKANKAVCETLAKAIGVS 70 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 KS + ++ ++S K+ ++ +I + Sbjct: 71 KSKVSIVRGETSRHKMAQVEAPSADIEAWMDG 102 >gi|163795292|ref|ZP_02189259.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199] gi|159179278|gb|EDP63809.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199] Length = 112 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 22/70 (31%), Positives = 41/70 (58%), Gaps = 1/70 (1%) Query: 24 KDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI-DKD 82 +D + +++ VTA P+ G+ANKA+ A+LAK+ + KSS+ ++ + K++ + D Sbjct: 34 QDEAGVAWLQVSVTAVPEDGRANKAVTALLAKRWRVPKSSIEIVQGTTERRKVLLVRSDD 93 Query: 83 CKEITELLQN 92 +T LQ Sbjct: 94 TAALTARLQT 103 >gi|220909519|ref|YP_002484830.1| hypothetical protein Cyan7425_4156 [Cyanothece sp. PCC 7425] gi|219866130|gb|ACL46469.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7425] Length = 89 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/49 (40%), Positives = 35/49 (71%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 +K+KV A P+KGKAN A++A+LA L + ++ + +LS +S K++ I+ Sbjct: 25 LKVKVKAPPEKGKANAAVIALLATHLGIDQTCIEVLSGHTSAAKVLSIE 73 >gi|258545474|ref|ZP_05705708.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826] gi|258519174|gb|EEV88033.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826] Length = 97 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 25/59 (42%), Positives = 35/59 (59%), Gaps = 3/59 (5%) Query: 24 KDTSDTIH---MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 +D IH KI +TA P+ GKANK + A LAK A++K ++ + S SPLK + I Sbjct: 23 RDQCQGIHDERYKIAITAPPEDGKANKHLTAWLAKTFAVAKKNVALQSGAFSPLKTLRI 81 >gi|167630164|ref|YP_001680663.1| conserved hypothetical protein, uncharacterized acr, yggu family [Heliobacterium modesticaldum Ice1] gi|259646567|sp|B0TGP1|Y2027_HELMI RecName: Full=UPF0235 protein Helmi_20270 gi|167592904|gb|ABZ84652.1| conserved hypothetical protein, uncharacterized acr, yggu family [Heliobacterium modesticaldum Ice1] Length = 96 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 25/75 (33%), Positives = 45/75 (60%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +R+ P A K+ + L D +K+++TA P G+AN A L +AK L LS+S +R Sbjct: 16 IRVQPRASKNEVCGL-----LDDA--LKVRLTAPPVDGEANAACLQFIAKTLGLSRSQVR 68 Query: 66 MLSKQSSPLKIIYID 80 +++ ++S LK + ++ Sbjct: 69 LVAGETSRLKTLEVE 83 >gi|62184885|ref|YP_219670.1| hypothetical protein CAB243 [Chlamydophila abortus S26/3] gi|81312941|sp|Q5L6M2|Y243_CHLAB RecName: Full=UPF0235 protein CAB243 gi|62147952|emb|CAH63699.1| conserved hypothetical protein [Chlamydophila abortus S26/3] Length = 96 Score = 45.1 bits (105), Expect = 0.003, Method: Compositional matrix adjust. Identities = 28/81 (34%), Positives = 48/81 (59%), Gaps = 7/81 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P +K++ I E + +KI+VT P+KGKAN+A++A+LAK L+L K + Sbjct: 10 VKVTPKSKQNTIVGFE-----GEV--LKIRVTEVPEKGKANEAVIALLAKALSLPKRDIT 62 Query: 66 MLSKQSSPLKIIYIDKDCKEI 86 ++ +S K I + K + I Sbjct: 63 LIPGDTSRKKRILLPKSTESI 83 >gi|15618408|ref|NP_224693.1| hypothetical protein CPn0497 [Chlamydophila pneumoniae CWL029] gi|15836028|ref|NP_300552.1| hypothetical protein CPj0497 [Chlamydophila pneumoniae J138] gi|16752546|ref|NP_444808.1| hypothetical protein CP0257 [Chlamydophila pneumoniae AR39] gi|33241848|ref|NP_876789.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183] gi|29839694|sp|Q9Z854|Y497_CHLPN RecName: Full=UPF0235 protein CPn_0497/CP_0257/CPj0497/CpB0517 gi|4376783|gb|AAD18637.1| CT388 hypothetical protein [Chlamydophila pneumoniae CWL029] gi|7189184|gb|AAF38120.1| conserved hypothetical protein [Chlamydophila pneumoniae AR39] gi|8978867|dbj|BAA98703.1| CT388 hypothetical protein [Chlamydophila pneumoniae J138] gi|33236357|gb|AAP98446.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183] gi|269303374|gb|ACZ33474.1| conserved hypothetical protein TIGR00251 [Chlamydophila pneumoniae LPCoLN] Length = 90 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 23/67 (34%), Positives = 42/67 (62%), Gaps = 7/67 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P AK++ I D +K++VT P+KGKAN A++++LAK L+L K + Sbjct: 9 VKVTPKAKENKIVGF-------DGQALKVRVTEPPEKGKANDAVISLLAKALSLPKRDVT 61 Query: 66 MLSKQSS 72 +++ ++S Sbjct: 62 LIAGETS 68 >gi|257095142|ref|YP_003168783.1| hypothetical protein CAP2UW1_3597 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257047666|gb|ACV36854.1| protein of unknown function DUF167 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 96 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 27/82 (32%), Positives = 45/82 (54%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + L P AK + EI D +K+++TA P G+AN A++ LA++L LS+S+ Sbjct: 14 VTIHLQPGAKAN-----EIAGRHGDA--LKVRITAPPVDGRANAALVDFLAQRLGLSRSA 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + S +S K++ I E Sbjct: 67 VELKSGLTSRRKVLRISGASAE 88 >gi|149177442|ref|ZP_01856046.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797] gi|148843775|gb|EDL58134.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797] Length = 104 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 7/76 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR P + K+GI + + +K+ VT P+KGKANKA+L +L L L +S + Sbjct: 18 VRAQPRSSKNGIEGVHDGR-------LKVCVTQVPEKGKANKALLKVLQTALKLKRSQIE 70 Query: 66 MLSKQSSPLKIIYIDK 81 + +++ LKI I + Sbjct: 71 LYKGETAALKIFRIHE 86 >gi|328952006|ref|YP_004369340.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109] gi|328452330|gb|AEB08159.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109] Length = 101 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 28/76 (36%), Positives = 41/76 (53%), Gaps = 2/76 (2%) Query: 16 GIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLK 75 G AS +I D +KI++ A P+KG ANK +L LAK L L K+ L + S +K Sbjct: 20 GAASNQIMGPHGDR--LKIRIAAAPEKGAANKELLNYLAKCLGLPKNRLHLKSGAQDRVK 77 Query: 76 IIYIDKDCKEITELLQ 91 ++ + E+ E LQ Sbjct: 78 VVEVVGLAPEVQERLQ 93 >gi|157826313|ref|YP_001494033.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford] gi|166228807|sp|A8GQ50|Y6510_RICAH RecName: Full=UPF0235 protein A1C_06510 gi|157800271|gb|ABV75525.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford] Length = 105 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 23/70 (32%), Positives = 45/70 (64%), Gaps = 3/70 (4%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I++ I ++ ++K+ + ATP+KGKAN+ ++ LAK LS+S++ ++ Sbjct: 22 PNSKQNLISNFVI---INNIPYLKLSIKATPEKGKANEEIINYLAKAWKLSRSNIEIIKG 78 Query: 70 QSSPLKIIYI 79 + +K I I Sbjct: 79 HTHSVKTILI 88 >gi|325108674|ref|YP_004269742.1| hypothetical protein Plabr_2117 [Planctomyces brasiliensis DSM 5305] gi|324968942|gb|ADY59720.1| UPF0235 protein yggU [Planctomyces brasiliensis DSM 5305] Length = 109 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 30/88 (34%), Positives = 52/88 (59%), Gaps = 13/88 (14%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +R+ P AK++ + + D + +K+ VT ++GKAN+ +L +LAK L L KS L Sbjct: 24 LRVTPGAKRNAVGGVH---DGA----LKVAVTQIAERGKANQQVLKILAKALGLKKSQLT 76 Query: 66 MLSKQSSPLKIIYIDKDCKEIT--ELLQ 91 ++S ++S K I C++++ ELLQ Sbjct: 77 LVSGETSRNKRIA----CRDVSAAELLQ 100 >gi|163737739|ref|ZP_02145156.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis BS107] gi|163744042|ref|ZP_02151409.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10] gi|161382658|gb|EDQ07060.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10] gi|161389265|gb|EDQ13617.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis BS107] Length = 98 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 25/72 (34%), Positives = 43/72 (59%), Gaps = 3/72 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPK-DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + VR+ P A ++ I L IP+ ++ + +KI VTA P+ GKA A+ A+LA+ + ++ Sbjct: 20 AEIPVRVTPKASRNAI--LPIPQAESGQGVSLKITVTAAPENGKATAAVQALLARAMRIA 77 Query: 61 KSSLRMLSKQSS 72 S L +L +S Sbjct: 78 PSDLELLRGATS 89 >gi|291286027|ref|YP_003502843.1| hypothetical protein Dacet_0081 [Denitrovibrio acetiphilus DSM 12809] gi|290883187|gb|ADD66887.1| protein of unknown function DUF167 [Denitrovibrio acetiphilus DSM 12809] Length = 84 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 27/85 (31%), Positives = 50/85 (58%), Gaps = 7/85 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P AKK+ ++ + K +KI+V A P +G AN+ ++ L+K+L +SKS ++ Sbjct: 5 VYVQPGAKKTELSGMHDGK-------IKIRVCAPPVEGAANEVLVKFLSKQLKISKSGIK 57 Query: 66 MLSKQSSPLKIIYIDKDCKEITELL 90 ++S + S KI+ I+ D ++ L Sbjct: 58 IISGEKSRHKIVEINMDTLDVMNCL 82 >gi|146337438|ref|YP_001202486.1| hypothetical protein BRADO0278 [Bradyrhizobium sp. ORS278] gi|146190244|emb|CAL74236.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 109 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 2/78 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P + I +E D +K++V A G+AN+A+ +LAK + ++K Sbjct: 12 TVALRVTPRGGRDAIDGIETLSDGRSV--LKVRVRAIADGGEANRAVTELLAKAIGVTKK 69 Query: 63 SLRMLSKQSSPLKIIYID 80 ++R+ S +S LK + ID Sbjct: 70 AVRITSGTTSRLKQVAID 87 >gi|154705806|ref|YP_001424486.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111] gi|154355092|gb|ABS76554.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111] Length = 92 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 21/50 (42%), Positives = 36/50 (72%) Query: 31 HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 H+KI++ A P +GKANKA++ LA++L L+ SS+ ++ + + LK I I+ Sbjct: 27 HIKIRLQAPPTEGKANKALIDFLAQRLKLNPSSITIIRGEKARLKTIAIE 76 >gi|254472462|ref|ZP_05085862.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958745|gb|EEA93945.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 105 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 25/90 (27%), Positives = 52/90 (57%), Gaps = 2/90 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P + K I + D + K++ A P+KG ANKA+ A+ AK L++ KSS Sbjct: 16 ITVRLTPKSSKDQIEKIGAQSDGRPLVLAKVR--AVPEKGAANKAVAALFAKALSVPKSS 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ ++ +K + + + +++ + L+++ Sbjct: 74 AELIAGSTARIKTLRVLGEPQDLAKRLEDH 103 >gi|71909498|ref|YP_287085.1| hypothetical protein Daro_3887 [Dechloromonas aromatica RCB] gi|123626353|sp|Q478W6|Y3887_DECAR RecName: Full=UPF0235 protein Daro_3887 gi|71849119|gb|AAZ48615.1| Conserved hypothetical protein 251 [Dechloromonas aromatica RCB] Length = 97 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 27/70 (38%), Positives = 41/70 (58%), Gaps = 7/70 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P AKKS A L D +KI++ A P GKAN+A++ +A L L+KS++ + S Sbjct: 21 PGAKKSEFAGLH-----GDA--LKIRLAAPPVDGKANEALIRFIADALGLAKSAVHLKSG 73 Query: 70 QSSPLKIIYI 79 Q+S K++ I Sbjct: 74 QTSRRKVLEI 83 >gi|89898566|ref|YP_515676.1| hypothetical protein CF0759 [Chlamydophila felis Fe/C-56] gi|89331938|dbj|BAE81531.1| conserved hypothetical protein [Chlamydophila felis Fe/C-56] Length = 65 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 23/55 (41%), Positives = 38/55 (69%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEI 86 +KI+VT P+KG+AN+A++A+LAK L+L K + ++S SS K + + K + I Sbjct: 2 LKIRVTEAPEKGRANEAVIALLAKTLSLPKRDVTLISGDSSRKKRLLLPKAAESI 56 >gi|319780857|ref|YP_004140333.1| hypothetical protein Mesci_1119 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166745|gb|ADV10283.1| protein of unknown function DUF167 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 105 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 23/81 (28%), Positives = 44/81 (54%), Gaps = 2/81 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A + +E D + H+K +V A P+ G AN A+ ++AK + + Sbjct: 12 VELFVRLTPKAALDRLEGIETTAD--ERSHLKARVRAVPENGAANHALEKLIAKAIGVPG 69 Query: 62 SSLRMLSKQSSPLKIIYIDKD 82 S++ +++ ++ LK + I+ D Sbjct: 70 SAVSVVAGGTARLKTVRIEGD 90 >gi|51474006|ref|YP_067763.1| hypothetical protein RT0827 [Rickettsia typhi str. Wilmington] gi|81390320|sp|Q68Y09|Y827_RICTY RecName: Full=UPF0235 protein RT0827 gi|51460318|gb|AAU04281.1| conserved hypothetical protein [Rickettsia typhi str. Wilmington] Length = 105 Score = 44.3 bits (103), Expect = 0.006, Method: Compositional matrix adjust. Identities = 21/62 (33%), Positives = 39/62 (62%) Query: 31 HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 ++K+ +TA P++GKAN+ ++ LAK+ LS+SS+ ++ + LK I I ++ L+ Sbjct: 40 YIKLSITAAPEQGKANEGIINYLAKEWKLSRSSIEIIKGHTHSLKTILIKNINEDYLNLI 99 Query: 91 QN 92 N Sbjct: 100 IN 101 >gi|291278644|ref|YP_003495479.1| hypothetical protein DEFDS_0212 [Deferribacter desulfuricans SSM1] gi|290753346|dbj|BAI79723.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1] Length = 82 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 29/73 (39%), Positives = 43/73 (58%), Gaps = 7/73 (9%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P AKK+ +A E T KIKV + P G ANK ++ LAKKL +SKSS++++S Sbjct: 11 PGAKKTEVAG-EFNNMT------KIKVASPPVDGAANKELIKFLAKKLGVSKSSVKIVSG 63 Query: 70 QSSPLKIIYIDKD 82 + S +K + +D Sbjct: 64 EKSRIKTVEFLED 76 >gi|323136497|ref|ZP_08071579.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242] gi|322398571|gb|EFY01091.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242] Length = 110 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 25/80 (31%), Positives = 44/80 (55%), Gaps = 2/80 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +RL P + I +E D +K +V A P+ G+AN A++ ++AK L K++ Sbjct: 18 VWLRLTPKGGRDAIEGVETLSDGRAV--LKARVRAAPEDGRANAALIELIAKALRAPKNA 75 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + S ++S +K I+I D Sbjct: 76 VSIRSGETSRVKKIFIAGDS 95 >gi|84685306|ref|ZP_01013204.1| hypothetical protein 1099457000258_RB2654_10573 [Maritimibacter alkaliphilus HTCC2654] gi|84666463|gb|EAQ12935.1| hypothetical protein RB2654_10573 [Rhodobacterales bacterium HTCC2654] Length = 85 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 43/75 (57%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A ++ ++ + +D H+K+ VT P+ GKA A++ +LA L ++KS L Sbjct: 18 VRVTPKASRNAVS---VDEDG----HLKVSVTTVPEDGKATAAVVKLLAHALGVAKSDLT 70 Query: 66 MLSKQSSPLKIIYID 80 ++ +S K+ ID Sbjct: 71 LVRGATSRDKVFRID 85 >gi|182680478|ref|YP_001834624.1| hypothetical protein Bind_3579 [Beijerinckia indica subsp. indica ATCC 9039] gi|182636361|gb|ACB97135.1| protein of unknown function DUF167 [Beijerinckia indica subsp. indica ATCC 9039] Length = 87 Score = 43.9 bits (102), Expect = 0.008, Method: Compositional matrix adjust. Identities = 26/81 (32%), Positives = 44/81 (54%), Gaps = 3/81 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A + I +E D S+ ++I VT P+ GKAN+ +L +LAK L + Sbjct: 10 VEIAIRVTPKASANRIV-VETAPDGSE--RLRIYVTTVPENGKANRDVLRLLAKHLDIPP 66 Query: 62 SSLRMLSKQSSPLKIIYIDKD 82 SSL ++ + KI+ +D Sbjct: 67 SSLEIIRGSTGRDKIVRFSRD 87 >gi|67005257|gb|AAY62183.1| Conserved hypothetical protein [Rickettsia felis URRWXCal2] Length = 110 Score = 43.5 bits (101), Expect = 0.009, Method: Compositional matrix adjust. Identities = 24/77 (31%), Positives = 47/77 (61%), Gaps = 6/77 (7%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I+ I ++ ++K+ + ATP++GKAN+ ++ LAK+ LS+ + ++ Sbjct: 27 PNSKQNLISDFVI---INNIPYLKLSIKATPEQGKANEEIINYLAKEWKLSRKDIEIIKG 83 Query: 70 QSSPLKIIY---IDKDC 83 ++ LK I ID+D Sbjct: 84 HTNSLKTILIKNIDEDY 100 >gi|162022116|ref|YP_247348.2| hypothetical protein RF_1332 [Rickettsia felis URRWXCal2] gi|126253831|sp|Q4UJV6|Y1332_RICFE RecName: Full=UPF0235 protein RF_1332 Length = 105 Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust. Identities = 24/77 (31%), Positives = 47/77 (61%), Gaps = 6/77 (7%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I+ I ++ ++K+ + ATP++GKAN+ ++ LAK+ LS+ + ++ Sbjct: 22 PNSKQNLISDFVI---INNIPYLKLSIKATPEQGKANEEIINYLAKEWKLSRKDIEIIKG 78 Query: 70 QSSPLKIIY---IDKDC 83 ++ LK I ID+D Sbjct: 79 HTNSLKTILIKNIDEDY 95 >gi|15893224|ref|NP_360938.1| hypothetical protein RC1301 [Rickettsia conorii str. Malish 7] gi|34581107|ref|ZP_00142587.1| hypothetical protein [Rickettsia sibirica 246] gi|229587206|ref|YP_002845707.1| hypothetical protein RAF_ORF1191 [Rickettsia africae ESF-5] gi|29839741|sp|Q92G24|Y1301_RICCN RecName: Full=UPF0235 protein RC1301 gi|259645748|sp|C3PLX4|Y1191_RICAE RecName: Full=UPF0235 protein RAF_ORF1191 gi|15620440|gb|AAL03839.1| unknown [Rickettsia conorii str. Malish 7] gi|28262492|gb|EAA25996.1| unknown [Rickettsia sibirica 246] gi|228022256|gb|ACP53964.1| Unknown [Rickettsia africae ESF-5] Length = 105 Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust. Identities = 24/83 (28%), Positives = 50/83 (60%), Gaps = 3/83 (3%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I++ I ++ ++K+ + A P++GKAN+ ++ LAK+ LS+S++ ++ Sbjct: 22 PNSKQNLISNFVI---INNIPYLKLSIKAIPEQGKANEEIINYLAKEWKLSRSNIEIIKG 78 Query: 70 QSSPLKIIYIDKDCKEITELLQN 92 + LK I I ++ L+ N Sbjct: 79 HTHSLKTILIKNINEDYLNLIIN 101 >gi|15835282|ref|NP_297041.1| hypothetical protein TC0667 [Chlamydia muridarum Nigg] gi|270285456|ref|ZP_06194850.1| hypothetical protein CmurN_03383 [Chlamydia muridarum Nigg] gi|270289467|ref|ZP_06195769.1| hypothetical protein CmurW_03473 [Chlamydia muridarum Weiss] gi|301336853|ref|ZP_07225055.1| hypothetical protein CmurM_03440 [Chlamydia muridarum MopnTet14] gi|29839670|sp|Q9PK06|Y667_CHLMU RecName: Full=UPF0235 protein TC_0667 gi|7190702|gb|AAF39489.1| conserved hypothetical protein [Chlamydia muridarum Nigg] Length = 100 Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust. Identities = 24/79 (30%), Positives = 51/79 (64%), Gaps = 7/79 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +R+ A+++ + SLE D I ++++VT P++GKAN A++A+LAK L++ K+ + Sbjct: 10 IRVTTKARENKVVSLE------DGI-LRVRVTEAPERGKANDAVVALLAKFLSIPKNDVT 62 Query: 66 MLSKQSSPLKIIYIDKDCK 84 +++ ++S K + + + K Sbjct: 63 LIAGEASRRKKVLLPRAIK 81 >gi|238650830|ref|YP_002916685.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic] gi|259647069|sp|C4K236|Y4990_RICPU RecName: Full=UPF0235 protein RPR_04990 gi|238624928|gb|ACR47634.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic] Length = 105 Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust. Identities = 22/70 (31%), Positives = 45/70 (64%), Gaps = 3/70 (4%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I++ I ++ ++K+ + A P++GKAN+ ++ LAK+ LS+S++ ++ Sbjct: 22 PNSKQNLISNFVI---INNIPYLKLSIKAIPEQGKANEEIINYLAKEWKLSRSNIEIIKG 78 Query: 70 QSSPLKIIYI 79 + LK I I Sbjct: 79 HTHSLKTILI 88 >gi|87198097|ref|YP_495354.1| hypothetical protein Saro_0071 [Novosphingobium aromaticivorans DSM 12444] gi|87133778|gb|ABD24520.1| protein of unknown function DUF167 [Novosphingobium aromaticivorans DSM 12444] Length = 95 Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust. Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 8/78 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P AK GI EI D + +KV A P+ GKA A+ +LA+ L L+ S + Sbjct: 26 VRVTPGAKSEGI---EI-----DGGRVLVKVRAKPEDGKATAAVQELLARALGLAPSKVE 77 Query: 66 MLSKQSSPLKIIYIDKDC 83 ML +S K+ I ++ Sbjct: 78 MLRGATSREKLFRIPREA 95 >gi|312116202|ref|YP_004013798.1| hypothetical protein Rvan_3519 [Rhodomicrobium vannielii ATCC 17100] gi|311221331|gb|ADP72699.1| protein of unknown function DUF167 [Rhodomicrobium vannielii ATCC 17100] Length = 123 Score = 43.1 bits (100), Expect = 0.013, Method: Compositional matrix adjust. Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 3/74 (4%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P A + +A +E +K VT P+ GKAN A++ ++A L + KSS+ Sbjct: 26 VRLTPKASSARVAGVEA---FDGKPVLKAYVTTPPEDGKANAALVVLVASWLGVPKSSVS 82 Query: 66 MLSKQSSPLKIIYI 79 M + Q S LK + + Sbjct: 83 MAAGQKSRLKTVAV 96 >gi|157829135|ref|YP_001495377.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str. 'Sheila Smith'] gi|166228842|sp|A8GTZ4|Y7140_RICRS RecName: Full=UPF0235 protein A1G_07140 gi|157801616|gb|ABV76869.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 105 Score = 42.7 bits (99), Expect = 0.014, Method: Compositional matrix adjust. Identities = 24/83 (28%), Positives = 48/83 (57%), Gaps = 3/83 (3%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I++ I + ++K+ + A P++GKAN ++ LAK+ LS+S++ ++ Sbjct: 22 PNSKQNLISNFVIINNIQ---YLKLSIKAIPEQGKANSEIINYLAKEWKLSRSNIEIIKG 78 Query: 70 QSSPLKIIYIDKDCKEITELLQN 92 + LK I I ++ L+ N Sbjct: 79 HTHSLKTILIKNINEDYLNLIIN 101 >gi|218885259|ref|YP_002434580.1| hypothetical protein DvMF_0151 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218756213|gb|ACL07112.1| protein of unknown function DUF167 [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 106 Score = 42.7 bits (99), Expect = 0.016, Method: Compositional matrix adjust. Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+VR +P A+KS + T+D +K+++ A KANKA+ +A L + ++ Sbjct: 22 VLVRAVPGARKSAC------EGTADG-RLKVRLAAPAVDNKANKALEEFVASALGMRRNR 74 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 +R++S +S LK + ++ D + LL Sbjct: 75 VRLVSGHTSRLKKLIVESDVEPSWGLL 101 >gi|161723222|ref|NP_219898.2| hypothetical protein CT388 [Chlamydia trachomatis D/UW-3/CX] gi|162019865|ref|YP_328205.2| hypothetical protein CTA_0423 [Chlamydia trachomatis A/HAR-13] Length = 100 Score = 42.7 bits (99), Expect = 0.016, Method: Compositional matrix adjust. Identities = 25/79 (31%), Positives = 49/79 (62%), Gaps = 7/79 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ A+++ + LE D I ++++VT P+KGKAN A++A+LA L++ KS + Sbjct: 10 VRVTTKARENRVVCLE------DGI-LRVRVTEVPEKGKANDAVVALLANFLSIPKSDVT 62 Query: 66 MLSKQSSPLKIIYIDKDCK 84 +++ ++S K + + + K Sbjct: 63 LIAGEASRRKKVLLPRSIK 81 >gi|320161245|ref|YP_004174469.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1] gi|319995098|dbj|BAJ63869.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1] Length = 105 Score = 42.7 bits (99), Expect = 0.016, Method: Compositional matrix adjust. Identities = 25/76 (32%), Positives = 48/76 (63%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A K+ I EI D + +KI++TA P +GKAN+A++ L++ L + ++S Sbjct: 19 ITVRVTPRASKNEI--YEILDDGT----VKIRLTAPPVEGKANEALIDFLSEVLDVPRTS 72 Query: 64 LRMLSKQSSPLKIIYI 79 L +++ ++ KI+ + Sbjct: 73 LEIVAGETGRDKIVTV 88 >gi|166154599|ref|YP_001654717.1| hypothetical protein CTL0644 [Chlamydia trachomatis 434/Bu] gi|166155474|ref|YP_001653729.1| hypothetical protein CTLon_0641 [Chlamydia trachomatis L2b/UCH-1/proctitis] gi|237802813|ref|YP_002888007.1| hypothetical protein JALI_3871 [Chlamydia trachomatis B/Jali20/OT] gi|237804735|ref|YP_002888889.1| hypothetical protein CTB_3871 [Chlamydia trachomatis B/TZ1A828/OT] gi|255311194|ref|ZP_05353764.1| hypothetical protein Ctra62_02010 [Chlamydia trachomatis 6276] gi|255317495|ref|ZP_05358741.1| hypothetical protein Ctra6_02000 [Chlamydia trachomatis 6276s] gi|255348753|ref|ZP_05380760.1| hypothetical protein Ctra70_02050 [Chlamydia trachomatis 70] gi|255503293|ref|ZP_05381683.1| hypothetical protein Ctra7_02060 [Chlamydia trachomatis 70s] gi|255506972|ref|ZP_05382611.1| hypothetical protein CtraD_02040 [Chlamydia trachomatis D(s)2923] gi|301335866|ref|ZP_07224110.1| hypothetical protein CtraL_03535 [Chlamydia trachomatis L2tet1] gi|29839454|sp|O84393|Y388_CHLTR RecName: Full=UPF0235 protein CT_388 gi|123606912|sp|Q3KLW5|Y423_CHLTA RecName: Full=UPF0235 protein CTA_0423 gi|226707987|sp|B0BC23|Y641_CHLTB RecName: Full=UPF0235 protein CTLon_0641 gi|226707989|sp|B0B7V8|Y644_CHLT2 RecName: Full=UPF0235 protein CTL0644 gi|3328814|gb|AAC67985.1| hypothetical protein CT_388 [Chlamydia trachomatis D/UW-3/CX] gi|76167649|gb|AAX50657.1| hypothetical cytosolic protein [Chlamydia trachomatis A/HAR-13] gi|165930587|emb|CAP04084.1| conserved hypothetical protein [Chlamydia trachomatis 434/Bu] gi|165931462|emb|CAP07038.1| conserved hypothetical protein [Chlamydia trachomatis L2b/UCH-1/proctitis] gi|231273035|emb|CAX09948.1| conserved hypothetical protein [Chlamydia trachomatis B/TZ1A828/OT] gi|231274047|emb|CAX10841.1| conserved hypothetical protein [Chlamydia trachomatis B/Jali20/OT] gi|289525430|emb|CBJ14907.1| conserved hypothetical protein [Chlamydia trachomatis Sweden2] gi|296434982|gb|ADH17160.1| hypothetical protein E150_02035 [Chlamydia trachomatis E/150] gi|296435909|gb|ADH18083.1| hypothetical protein G9768_02005 [Chlamydia trachomatis G/9768] gi|296436835|gb|ADH19005.1| hypothetical protein G11222_02005 [Chlamydia trachomatis G/11222] gi|296437769|gb|ADH19930.1| hypothetical protein G11074_02005 [Chlamydia trachomatis G/11074] gi|296438702|gb|ADH20855.1| hypothetical protein E11023_02020 [Chlamydia trachomatis E/11023] gi|297140269|gb|ADH97027.1| hypothetical protein CTG9301_02010 [Chlamydia trachomatis G/9301] gi|297748518|gb|ADI51064.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-EC] gi|297749398|gb|ADI52076.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-LC] Length = 115 Score = 42.7 bits (99), Expect = 0.016, Method: Compositional matrix adjust. Identities = 25/79 (31%), Positives = 49/79 (62%), Gaps = 7/79 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ A+++ + LE D I ++++VT P+KGKAN A++A+LA L++ KS + Sbjct: 25 VRVTTKARENRVVCLE------DGI-LRVRVTEVPEKGKANDAVVALLANFLSIPKSDVT 77 Query: 66 MLSKQSSPLKIIYIDKDCK 84 +++ ++S K + + + K Sbjct: 78 LIAGEASRRKKVLLPRSIK 96 >gi|240849835|ref|YP_002971223.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup] gi|240266958|gb|ACS50546.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup] Length = 104 Score = 42.7 bits (99), Expect = 0.016, Method: Compositional matrix adjust. Identities = 26/89 (29%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP + I +E +D + ++ I++ A P+ GKANKA++ LAK+ + SS Sbjct: 12 LFVYLIPKSSVDKIIGVEC-RD-GEKQYLVIRLRAVPEDGKANKALIKFLAKQWKIPSSS 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + + S K +Y +E+ ++ Q+ Sbjct: 70 ISLKNGAISRYKQLYFSTHLEELKQIWQS 98 >gi|163867463|ref|YP_001608662.1| hypothetical protein Btr_0184 [Bartonella tribocorum CIP 105476] gi|161017109|emb|CAK00667.1| conserved hypothetical protein [Bartonella tribocorum CIP 105476] Length = 108 Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust. Identities = 25/89 (28%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP + I +E +D + ++ I++ A P+ GKANKA++ AK+ + SS Sbjct: 12 LFVYLIPKSSVDKIIGIEC-RD-GEKQYLVIRLRAVPEDGKANKALIKFFAKQWKIPSSS 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K ++ +E+ ++ Q+ Sbjct: 70 ISLKSGATSRYKQLHFSTHLEELKQIWQS 98 >gi|165933862|ref|YP_001650651.1| hypothetical protein RrIowa_1526 [Rickettsia rickettsii str. Iowa] gi|189038764|sp|B0BVI5|Y1526_RICRO RecName: Full=UPF0235 protein RrIowa_1526 gi|165908949|gb|ABY73245.1| hypothetical cytosolic protein [Rickettsia rickettsii str. Iowa] Length = 105 Score = 42.7 bits (99), Expect = 0.017, Method: Compositional matrix adjust. Identities = 24/83 (28%), Positives = 48/83 (57%), Gaps = 3/83 (3%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I++ I + ++K+ + A P++GKAN ++ LAK+ LS+S++ ++ Sbjct: 22 PNSKQNLISNFVIINNIQ---YLKLSIKAIPEQGKANAEIINYLAKEWKLSRSNIEIIKG 78 Query: 70 QSSPLKIIYIDKDCKEITELLQN 92 + LK I I ++ L+ N Sbjct: 79 HTHSLKTILIKNINEDYLNLIIN 101 >gi|329890913|ref|ZP_08269256.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568] gi|328846214|gb|EGF95778.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568] Length = 93 Score = 42.7 bits (99), Expect = 0.018, Method: Compositional matrix adjust. Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 2/79 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+L P A I ++ D +K++V A P +G+AN A+ A LAK L + K Sbjct: 3 ARIPVKLTPRASADRIDGWDVDPDGRPV--LKVRVRAQPVEGEANAALTAFLAKALGVPK 60 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + S LK+I +D Sbjct: 61 RDVALARGGQSRLKMIEVD 79 >gi|83313474|ref|YP_423738.1| hypothetical protein amb4375 [Magnetospirillum magneticum AMB-1] gi|82948315|dbj|BAE53179.1| Uncharacterized conserved protein [Magnetospirillum magneticum AMB-1] Length = 108 Score = 42.4 bits (98), Expect = 0.019, Method: Compositional matrix adjust. Identities = 27/89 (30%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VRL P A + I D + + +K +VTA P+ GKAN A+L +L+K + KS Sbjct: 15 VAVRLTPKASRDRINGPAAEADGA--VVLKAQVTAVPEDGKANAALLKLLSKAWKIPKSD 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++ + K+I I + +++ L Sbjct: 73 MDIVLGATDRRKVILISGETEDLRHRLDE 101 >gi|85713678|ref|ZP_01044668.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A] gi|85699582|gb|EAQ37449.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A] Length = 106 Score = 42.4 bits (98), Expect = 0.019, Method: Compositional matrix adjust. Identities = 24/89 (26%), Positives = 53/89 (59%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P + GI +E+ D + ++++ A + G+AN+A++A+LAK L + K Sbjct: 12 RIALRVTPRGGRDGIDGIEMLADGRPVVKVRVRAVA--EGGEANRAVMAVLAKALGVRKV 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +R+L+ +S LK + ++ D ++ + L+ Sbjct: 70 DVRILAGATSRLKQVAVEGDPVQLGDALR 98 >gi|221632637|ref|YP_002521858.1| hypothetical protein trd_0618 [Thermomicrobium roseum DSM 5159] gi|221157137|gb|ACM06264.1| DUF167 [Thermomicrobium roseum DSM 5159] Length = 102 Score = 42.4 bits (98), Expect = 0.020, Method: Compositional matrix adjust. Identities = 23/58 (39%), Positives = 38/58 (65%), Gaps = 1/58 (1%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 ++VTA P+ G+AN+A+L +LA+ L L + S+R+++ + K I ID KE+ E L Sbjct: 38 VRVTAPPRDGEANEAVLRLLAETLHLPRGSIRIIAGTAQRRKRIRIDGLTSKELLERL 95 >gi|258514375|ref|YP_003190597.1| hypothetical protein Dtox_1088 [Desulfotomaculum acetoxidans DSM 771] gi|257778080|gb|ACV61974.1| protein of unknown function DUF167 [Desulfotomaculum acetoxidans DSM 771] Length = 98 Score = 42.4 bits (98), Expect = 0.021, Method: Compositional matrix adjust. Identities = 23/74 (31%), Positives = 44/74 (59%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A K +A L D + KI++TA P +G+AN+A+ LAK L ++++ + Sbjct: 15 VRVQPRASKDQVAGL-----WEDAV--KIRLTAPPVEGEANRALCDFLAKHLGVTRAQVD 67 Query: 66 MLSKQSSPLKIIYI 79 +++ Q+ K++ + Sbjct: 68 LVTGQTGRNKLVRV 81 >gi|262197893|ref|YP_003269102.1| hypothetical protein Hoch_4719 [Haliangium ochraceum DSM 14365] gi|262081240|gb|ACY17209.1| protein of unknown function DUF167 [Haliangium ochraceum DSM 14365] Length = 101 Score = 42.4 bits (98), Expect = 0.022, Method: Compositional matrix adjust. Identities = 20/48 (41%), Positives = 30/48 (62%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 +K+ VTA P G AN A+LA+LAK L + L ++ QSS K +++ Sbjct: 32 LKVAVTAPPADGAANDAVLALLAKHLGRPRRDLCLVGGQSSRRKTVHV 79 >gi|23015143|ref|ZP_00054928.1| COG1872: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] Length = 94 Score = 42.0 bits (97), Expect = 0.025, Method: Compositional matrix adjust. Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 6/92 (6%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P A + I D S + +K +VT P+ GKAN A+L +L+K + KS + Sbjct: 3 VRLTPKASRDRIMGAAPEADGS--VVLKAQVTTVPEDGKANAALLKLLSKAWKIPKSDMD 60 Query: 66 MLSKQSSPLKIIYIDKDC----KEITELLQNN 93 ++ + K+I I D K + E + +N Sbjct: 61 IVLGATDRRKVILISGDSEVLRKRLDEWMASN 92 >gi|292493759|ref|YP_003529198.1| hypothetical protein Nhal_3796 [Nitrosococcus halophilus Nc4] gi|291582354|gb|ADE16811.1| protein of unknown function DUF167 [Nitrosococcus halophilus Nc4] Length = 102 Score = 42.0 bits (97), Expect = 0.026, Method: Compositional matrix adjust. Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 6/68 (8%) Query: 23 PKDTSDTI------HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKI 76 P+ + D I +K+++TA P +GKAN ++ LAK +SKS +R+LS + K Sbjct: 21 PRASCDEIIGPHGDRLKVRITAPPVEGKANADLIRFLAKTFRVSKSQVRLLSGATGRDKR 80 Query: 77 IYIDKDCK 84 + I+K K Sbjct: 81 VCIEKPAK 88 >gi|157964988|ref|YP_001499812.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5] gi|157844764|gb|ABV85265.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5] Length = 107 Score = 42.0 bits (97), Expect = 0.029, Method: Compositional matrix adjust. Identities = 24/83 (28%), Positives = 49/83 (59%), Gaps = 3/83 (3%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K++ I++ I ++ ++K+ + A P +GKAN+ ++ LAK+ LS+S++ ++ Sbjct: 24 PNSKQNLISNFVI---INNIPYLKLSIKAIPAQGKANEEIINYLAKEWKLSRSNIEIIKG 80 Query: 70 QSSPLKIIYIDKDCKEITELLQN 92 + LK I I ++ L+ N Sbjct: 81 HTHSLKTILIKNINEDYLNLIIN 103 >gi|301064653|ref|ZP_07205047.1| conserved hypothetical protein TIGR00251 [delta proteobacterium NaphS2] gi|300441273|gb|EFK05644.1| conserved hypothetical protein TIGR00251 [delta proteobacterium NaphS2] Length = 95 Score = 42.0 bits (97), Expect = 0.030, Method: Compositional matrix adjust. Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V+++P + ++ I +D I+ +IK+TA P +GKANKA++ LAKK K +R Sbjct: 14 VKVLPRSSRTEIVG------KTDGIY-RIKLTAPPVEGKANKALINFLAKKTGSPKQKIR 66 Query: 66 MLSKQSSPLKIIYID 80 ++ + S K I I+ Sbjct: 67 IVKGEQSRNKTIRIE 81 >gi|17230688|ref|NP_487236.1| hypothetical protein asl3196 [Nostoc sp. PCC 7120] gi|29839730|sp|Q8YS95|Y3196_ANASP RecName: Full=UPF0235 protein asl3196 gi|17132291|dbj|BAB74895.1| asl3196 [Nostoc sp. PCC 7120] Length = 75 Score = 42.0 bits (97), Expect = 0.030, Method: Compositional matrix adjust. Identities = 28/77 (36%), Positives = 46/77 (59%), Gaps = 7/77 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K+ IA + D S T+H+K + P GKAN+ ++ +LA+K + KS + Sbjct: 5 VKVKPNSKQQKIAEQD---DGSLTVHLK----SPPVDGKANEELIKLLAEKFDVPKSHIT 57 Query: 66 MLSKQSSPLKIIYIDKD 82 + S SS K+I I+ D Sbjct: 58 IKSGLSSKQKLIEIETD 74 >gi|182413313|ref|YP_001818379.1| hypothetical protein Oter_1495 [Opitutus terrae PB90-1] gi|177840527|gb|ACB74779.1| protein of unknown function DUF167 [Opitutus terrae PB90-1] Length = 90 Score = 41.6 bits (96), Expect = 0.032, Method: Compositional matrix adjust. Identities = 23/79 (29%), Positives = 43/79 (54%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + ++ IPNA ++ I D +K+KV A P +G+AN+ + LA +L L + Sbjct: 4 CTIAIKAIPNAPRNQIVGW-----LGDA--LKVKVHAPPLEGRANEELCEFLADELGLPR 56 Query: 62 SSLRMLSKQSSPLKIIYID 80 ++ +L +S K++ I+ Sbjct: 57 RAVSVLRGDTSRQKLVQIE 75 >gi|288959535|ref|YP_003449876.1| hypothetical protein AZL_026940 [Azospirillum sp. B510] gi|288911843|dbj|BAI73332.1| hypothetical protein AZL_026940 [Azospirillum sp. B510] Length = 113 Score = 41.6 bits (96), Expect = 0.034, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 48/80 (60%), Gaps = 4/80 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIH-MKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P A ++ + + DT+ +K+ VTA P+ GKAN+A++ +L+K + K+ Sbjct: 16 VALRVTPKASRNAVTGM---ADTAAGGRVLKLAVTAVPENGKANEAVIKLLSKAWKVPKT 72 Query: 63 SLRMLSKQSSPLKIIYIDKD 82 SL +++ + KI+++ D Sbjct: 73 SLTVVAGATDRNKILHVAGD 92 >gi|77166447|ref|YP_344972.1| hypothetical protein Noc_3000 [Nitrosococcus oceani ATCC 19707] gi|254435182|ref|ZP_05048689.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani AFC27] gi|123593242|sp|Q3J6V4|Y3000_NITOC RecName: Full=UPF0235 protein Noc_3000 gi|76884761|gb|ABA59442.1| Conserved hypothetical protein 251 [Nitrosococcus oceani ATCC 19707] gi|207088293|gb|EDZ65565.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani AFC27] Length = 102 Score = 41.6 bits (96), Expect = 0.036, Method: Compositional matrix adjust. Identities = 28/79 (35%), Positives = 43/79 (54%), Gaps = 7/79 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +RL P AK E+ D +KI++TA P +GKAN +L LAK +S++ + Sbjct: 17 IRLQPRAKGD-----EVIGPHGD--RLKIRITAPPVEGKANTHLLRFLAKTFQVSRNQVY 69 Query: 66 MLSKQSSPLKIIYIDKDCK 84 +LS +S K + I+K K Sbjct: 70 LLSGATSRDKRVRIEKPTK 88 >gi|91205448|ref|YP_537803.1| hypothetical protein RBE_0633 [Rickettsia bellii RML369-C] gi|157827193|ref|YP_001496257.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389] gi|122425693|sp|Q1RIV0|Y633_RICBR RecName: Full=UPF0235 protein RBE_0633 gi|226706174|sp|A8GWJ3|Y4385_RICB8 RecName: Full=UPF0235 protein A1I_04385 gi|91068992|gb|ABE04714.1| unknown [Rickettsia bellii RML369-C] gi|157802497|gb|ABV79220.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389] Length = 104 Score = 41.6 bits (96), Expect = 0.036, Method: Compositional matrix adjust. Identities = 21/64 (32%), Positives = 38/64 (59%) Query: 27 SDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEI 86 +D +H+K+ + A Q+GKAN+ ++ LAK+ L +S+L + ++ LK I I +E Sbjct: 35 NDVLHLKLSIKAHAQQGKANEEIINFLAKEWQLLRSNLEITKGHTNSLKTILIKNIDEEY 94 Query: 87 TELL 90 L+ Sbjct: 95 LNLI 98 >gi|226942094|ref|YP_002797168.1| hypothetical protein LHK_03181 [Laribacter hongkongensis HLHK9] gi|254801627|sp|C1D6C4|Y3181_LARHH RecName: Full=UPF0235 protein LHK_03181 gi|226717021|gb|ACO76159.1| Uncharacterized conserved protein [Laribacter hongkongensis HLHK9] Length = 97 Score = 41.6 bits (96), Expect = 0.038, Method: Compositional matrix adjust. Identities = 24/70 (34%), Positives = 40/70 (57%), Gaps = 7/70 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P A+++ +A L D +KI++ A P GKAN +LA LA+ L +S+S++ + S Sbjct: 19 PGARRTEVAGLH-----GDA--LKIRLAAPPVDGKANACLLAFLARGLGVSRSAVTLKSG 71 Query: 70 QSSPLKIIYI 79 S K++ I Sbjct: 72 DCSRHKVVDI 81 >gi|158424942|ref|YP_001526234.1| hypothetical protein AZC_3318 [Azorhizobium caulinodans ORS 571] gi|158331831|dbj|BAF89316.1| protein of unknown function [Azorhizobium caulinodans ORS 571] Length = 104 Score = 41.2 bits (95), Expect = 0.046, Method: Compositional matrix adjust. Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VR P + + + D D +KI+V A P+ G A A+ +LA ++ S Sbjct: 12 EVSVRATPKGGRDALDGVSQLSDGRDV--LKIRVRAAPEDGAATAAVAKVLAGAAGVAPS 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 ++R+ S ++ LK+ I D + L+ Sbjct: 70 AVRLASGATARLKVFRISGDAARLRATLE 98 >gi|300087416|ref|YP_003757938.1| hypothetical protein Dehly_0296 [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299527149|gb|ADJ25617.1| protein of unknown function DUF167 [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 96 Score = 41.2 bits (95), Expect = 0.052, Method: Compositional matrix adjust. Identities = 24/71 (33%), Positives = 43/71 (60%), Gaps = 2/71 (2%) Query: 25 DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84 DT+ I ++++VTA P+ GKAN+A+ +LA++L L KS + ++ +S K+ + + Sbjct: 27 DTAAEI-IRVRVTAAPEHGKANRAVAELLAERLGLPKSRVTIVRGLTSRRKVAAVAGLSE 85 Query: 85 -EITELLQNND 94 E+ E L D Sbjct: 86 AEVREKLGKTD 96 >gi|126657163|ref|ZP_01728329.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110] gi|126621434|gb|EAZ92145.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110] Length = 75 Score = 41.2 bits (95), Expect = 0.053, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 42/75 (56%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAK+ I + +D H KI V + P GKAN+ ++ +LAK + KS + Sbjct: 7 VKVKPNAKQQKIEEM------ADN-HFKIAVKSPPTDGKANQELITLLAKHFNVPKSHIL 59 Query: 66 MLSKQSSPLKIIYID 80 + S SS K++ ID Sbjct: 60 IKSGVSSRNKLVEID 74 >gi|299139441|ref|ZP_07032616.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8] gi|298598710|gb|EFI54873.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8] Length = 99 Score = 40.8 bits (94), Expect = 0.054, Method: Compositional matrix adjust. Identities = 23/78 (29%), Positives = 44/78 (56%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C++ VR+ P AK++ + D S +KI +T P G+AN A++A L+ +L + + Sbjct: 12 CSLPVRVHPGAKQNAVTGTH---DGS----LKISLTTPPTDGRANTALIAFLSDRLNIPR 64 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + +L+ +S K + I Sbjct: 65 AHIELLTGATSRSKTLRI 82 >gi|1388023|gb|AAB88056.1| putative [Bartonella bacilliformis] Length = 109 Score = 40.8 bits (94), Expect = 0.058, Method: Compositional matrix adjust. Identities = 26/94 (27%), Positives = 48/94 (51%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 14 LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + + + S K + ++I ++L + + T Sbjct: 72 ISLENGAISRYKQLRFSGGVEKIEKILHSLGNYT 105 >gi|121602415|ref|YP_988648.1| hypothetical protein BARBAKC583_0328 [Bartonella bacilliformis KC583] gi|120614592|gb|ABM45193.1| conserved hypothetical protein TIGR00251 [Bartonella bacilliformis KC583] Length = 107 Score = 40.8 bits (94), Expect = 0.060, Method: Compositional matrix adjust. Identities = 26/94 (27%), Positives = 48/94 (51%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + + + S K + ++I ++L + + T Sbjct: 70 ISLENGAISRYKQLRFSGGVEKIEKILHSLGNYT 103 >gi|119899753|ref|YP_934966.1| hypothetical protein azo3464 [Azoarcus sp. BH72] gi|166232594|sp|A1KB74|Y3464_AZOSB RecName: Full=UPF0235 protein azo3464 gi|119672166|emb|CAL96080.1| conserved hypothetical protein [Azoarcus sp. BH72] Length = 98 Score = 40.8 bits (94), Expect = 0.068, Method: Compositional matrix adjust. Identities = 23/71 (32%), Positives = 39/71 (54%), Gaps = 7/71 (9%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P A+++G A L MKI++ A P GKAN A+ A LA + KS++ ++S Sbjct: 21 PGARQTGFAGLHGEA-------MKIRLAAPPVDGKANAALCAFLADFCEVPKSAVTLVSG 73 Query: 70 QSSPLKIIYID 80 ++S K + ++ Sbjct: 74 ETSRAKRVRVE 84 >gi|119384906|ref|YP_915962.1| hypothetical protein Pden_2174 [Paracoccus denitrificans PD1222] gi|189039514|sp|A1B422|Y2174_PARDP RecName: Full=UPF0235 protein Pden_2174 gi|119374673|gb|ABL70266.1| protein of unknown function DUF167 [Paracoccus denitrificans PD1222] Length = 82 Score = 40.8 bits (94), Expect = 0.071, Method: Compositional matrix adjust. Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + D +++ VT P+ GKAN A++ +LAK L ++K Sbjct: 12 AEIAVRVTPRASRNAV--------ILDGEAIRVTVTTVPEDGKANAAVVKLLAKALGVAK 63 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ ++ K+ ID Sbjct: 64 SRLVLVRGATARDKLFRID 82 >gi|149200997|ref|ZP_01877972.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035] gi|149145330|gb|EDM33356.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035] Length = 84 Score = 40.4 bits (93), Expect = 0.082, Method: Compositional matrix adjust. Identities = 19/49 (38%), Positives = 30/49 (61%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 +++ VT P+ GKAN A+ +LAK L + KS L +L +S K+ +D Sbjct: 36 LRVYVTTVPEDGKANAAVQKLLAKALGVPKSRLTLLRGHTSRDKVFQVD 84 >gi|239948125|ref|ZP_04699878.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] gi|239922401|gb|EER22425.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] Length = 92 Score = 40.0 bits (92), Expect = 0.094, Method: Compositional matrix adjust. Identities = 17/49 (34%), Positives = 33/49 (67%) Query: 31 HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 ++K+ + A P++GKAN+ ++ LAK+ LS+S++ ++ + LK I I Sbjct: 40 YLKLSIKAVPEQGKANEEIINYLAKEWKLSRSNIEIIKGHTHSLKTILI 88 >gi|237749382|ref|ZP_04579862.1| predicted protein [Oxalobacter formigenes OXCC13] gi|229380744|gb|EEO30835.1| predicted protein [Oxalobacter formigenes OXCC13] Length = 100 Score = 40.0 bits (92), Expect = 0.096, Method: Compositional matrix adjust. Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ PNAKK+ I S SD ++I++ A P GKAN+A++ +AKKL K + Sbjct: 15 IAVQVSPNAKKTEIVS-------SDGEALRIRLQAPPVDGKANEALVQFIAKKLRTPKRN 67 Query: 64 LRMLSKQSSPLKIIYI 79 + + S+ K++ I Sbjct: 68 VSITHGLSAKHKLLEI 83 >gi|85704715|ref|ZP_01035816.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217] gi|85670533|gb|EAQ25393.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217] Length = 84 Score = 40.0 bits (92), Expect = 0.097, Method: Compositional matrix adjust. Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I + +D + +++ VT P+ GKAN A+ +LAK L + K Sbjct: 14 TTIALRVTPKAARNRI----VAEDGA----LRVYVTTVPEDGKANAAVQKLLAKALGVPK 65 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L +L +S K+ +D Sbjct: 66 SRLSLLRGHTSRDKVFQVD 84 >gi|146276431|ref|YP_001166590.1| hypothetical protein Rsph17025_0379 [Rhodobacter sphaeroides ATCC 17025] gi|145554672|gb|ABP69285.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC 17025] Length = 84 Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 8/75 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A + I E+ + T +++ VT P+ GKAN+A+ +LAK L ++KS L Sbjct: 18 VRVTPRASRERI---EVQEGT-----VRVHVTCVPEDGKANRAVTEVLAKALGVAKSRLT 69 Query: 66 MLSKQSSPLKIIYID 80 ++ + K +D Sbjct: 70 LVRGATGRDKTFRLD 84 >gi|307152217|ref|YP_003887601.1| hypothetical protein Cyan7822_2348 [Cyanothece sp. PCC 7822] gi|306982445|gb|ADN14326.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7822] Length = 73 Score = 40.0 bits (92), Expect = 0.10, Method: Compositional matrix adjust. Identities = 30/74 (40%), Positives = 43/74 (58%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAK+ I LE D S I +K + P GKAN+ ++ +LAKK +SKS + Sbjct: 5 VKVKPNAKQQKIEELE---DGSLVISLK----SPPVDGKANEELIKLLAKKYQVSKSQIS 57 Query: 66 MLSKQSSPLKIIYI 79 + S SS K+I I Sbjct: 58 IQSGLSSRNKLIEI 71 >gi|38505546|ref|NP_942167.1| hypothetical protein ssr5011 [Synechocystis sp. PCC 6803] gi|38423570|dbj|BAD01781.1| ssr5011 [Synechocystis sp. PCC 6803] Length = 73 Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust. Identities = 26/76 (34%), Positives = 46/76 (60%), Gaps = 7/76 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAK+S + + D S IH+K + P GKAN+ ++ +LAK+ +S+ S++ Sbjct: 5 VKVKPNAKQSKVVYGD---DGSLIIHVK----SPPVDGKANQELIKLLAKEFNVSQQSIK 57 Query: 66 MLSKQSSPLKIIYIDK 81 + S S KI+ I++ Sbjct: 58 IKSGAGSRQKIVEINE 73 >gi|147678156|ref|YP_001212371.1| hypothetical protein PTH_1821 [Pelotomaculum thermopropionicum SI] gi|189039033|sp|A5D180|Y1821_PELTS RecName: Full=UPF0235 protein PTH_1821 gi|146274253|dbj|BAF60002.1| uncharacterized conserved protein [Pelotomaculum thermopropionicum SI] Length = 95 Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust. Identities = 26/74 (35%), Positives = 42/74 (56%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A ++ +A L D +KI++TA P G+AN+A A LA L+L S + Sbjct: 15 VRVQPRAARNQVAGL-----YEDA--LKIRLTAPPVDGEANEACRAFLADSLSLPPSKVE 67 Query: 66 MLSKQSSPLKIIYI 79 ++S +S K++ I Sbjct: 68 IVSGHASRTKVVKI 81 >gi|218246502|ref|YP_002371873.1| hypothetical protein PCC8801_1666 [Cyanothece sp. PCC 8801] gi|257059535|ref|YP_003137423.1| hypothetical protein Cyan8802_1684 [Cyanothece sp. PCC 8802] gi|218166980|gb|ACK65717.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8801] gi|256589701|gb|ACV00588.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8802] Length = 72 Score = 40.0 bits (92), Expect = 0.12, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 44/75 (58%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAKK I E D S +++K + P +GKAN+ ++ +LAKK +S+S + Sbjct: 5 VKVKPNAKKQKIQEEE---DGSLVVYLK----SPPIEGKANQELIKLLAKKFGVSQSQIS 57 Query: 66 MLSKQSSPLKIIYID 80 + S SS K + I+ Sbjct: 58 IKSGLSSRNKWVEIE 72 >gi|148555779|ref|YP_001263361.1| hypothetical protein Swit_2871 [Sphingomonas wittichii RW1] gi|148500969|gb|ABQ69223.1| protein of unknown function DUF167 [Sphingomonas wittichii RW1] Length = 74 Score = 39.7 bits (91), Expect = 0.12, Method: Compositional matrix adjust. Identities = 20/68 (29%), Positives = 36/68 (52%) Query: 25 DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84 D + IK+ A P G AN+A++ ++AK L ++K + + S +S LK +++ D Sbjct: 4 DADGRRWLSIKLAAAPSDGAANEALVRLVAKALGVAKRDVTLASGATSRLKRLHVSGDPA 63 Query: 85 EITELLQN 92 + LQ Sbjct: 64 ALAAALQR 71 >gi|295106861|emb|CBL04404.1| Uncharacterized conserved protein [Gordonibacter pamelaeae 7-10-1-b] Length = 96 Score = 39.7 bits (91), Expect = 0.13, Method: Compositional matrix adjust. Identities = 21/89 (23%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P + + ++ + D + + ++VTA P GKANKA+ ++A+ L + KS Sbjct: 6 IAVHVTPRSGRDEVSGVR--ADAAGADEVCVRVTAPPDGGKANKAVCKLVAEALGVPKSR 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S ++ K + ++ D ++ L + Sbjct: 64 VGVASGHTARRKRLSVEADQAQVDAWLAS 92 >gi|325294222|ref|YP_004280736.1| yggU [Desulfurobacterium thermolithotrophum DSM 11699] gi|325064670|gb|ADY72677.1| UPF0235 protein yggU [Desulfurobacterium thermolithotrophum DSM 11699] Length = 101 Score = 39.7 bits (91), Expect = 0.13, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 34/50 (68%) Query: 31 HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 +KIKVT P+ GKAN+ ++ +L+K L + K + ++ ++S +K++ I+ Sbjct: 33 RLKIKVTVPPEGGKANQKIIELLSKALKVPKRDIDIVKGETSRIKVVRIE 82 >gi|269127125|ref|YP_003300495.1| hypothetical protein Tcur_2915 [Thermomonospora curvata DSM 43183] gi|268312083|gb|ACY98457.1| protein of unknown function DUF167 [Thermomonospora curvata DSM 43183] Length = 91 Score = 39.7 bits (91), Expect = 0.13, Method: Compositional matrix adjust. Identities = 23/87 (26%), Positives = 44/87 (50%), Gaps = 7/87 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A ++ + + + +KV A +GKA +A L +A L + + Sbjct: 4 VRVAIRVGPGASRTKVGG-------AHGEALVVKVAARAVEGKATEAALRAVADALGVRR 56 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITE 88 +R++S +S K+I +D D + +TE Sbjct: 57 RDVRLISGATSREKLIEVDGDERALTE 83 >gi|300115523|ref|YP_003762098.1| hypothetical protein Nwat_3056 [Nitrosococcus watsonii C-113] gi|299541460|gb|ADJ29777.1| protein of unknown function DUF167 [Nitrosococcus watsonii C-113] Length = 102 Score = 39.7 bits (91), Expect = 0.15, Method: Compositional matrix adjust. Identities = 22/68 (32%), Positives = 37/68 (54%), Gaps = 6/68 (8%) Query: 23 PKDTSDTI------HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKI 76 P+ D + +KI++TA P +GKAN +L L K +S++ + +LS +S K Sbjct: 21 PRARGDEVIGPHGNRLKIRITAPPVEGKANTQLLRFLVKTFQVSRNQVYLLSGTASRDKR 80 Query: 77 IYIDKDCK 84 + I+K K Sbjct: 81 VRIEKPAK 88 >gi|329893887|ref|ZP_08269938.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium IMCC3088] gi|328923406|gb|EGG30722.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium IMCC3088] Length = 83 Score = 39.3 bits (90), Expect = 0.17, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 4/80 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P AK++ I +I + +++ VT P+ GKAN+A++ LA + ++ Sbjct: 6 VSVRVTPKAKQAQIKVEDIEGQSV----LRVYVTVAPEDGKANRAVMRALADYFDVPRTR 61 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 +++LS K ID D Sbjct: 62 IKLLSGAKQRDKRFSIDADV 81 >gi|330812367|ref|YP_004356829.1| hypothetical protein PSEBR_a5324 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327380475|gb|AEA71825.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 97 Score = 39.3 bits (90), Expect = 0.17, Method: Compositional matrix adjust. Identities = 17/39 (43%), Positives = 28/39 (71%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQ 70 +KI++TA P +GKAN ++A LAK +SKS + ++S + Sbjct: 34 LKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLVSGE 72 >gi|77461543|ref|YP_351050.1| hypothetical protein Pfl01_5322 [Pseudomonas fluorescens Pf0-1] gi|123602932|sp|Q3K595|Y5322_PSEPF RecName: Full=UPF0235 protein Pfl01_5322 gi|77385546|gb|ABA77059.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1] Length = 96 Score = 39.3 bits (90), Expect = 0.17, Method: Compositional matrix adjust. Identities = 17/39 (43%), Positives = 27/39 (69%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQ 70 +KI++TA P +GKAN ++ LAK +SKS + +LS + Sbjct: 34 LKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQVSLLSGE 72 >gi|70733126|ref|YP_262899.1| hypothetical protein PFL_5841 [Pseudomonas fluorescens Pf-5] gi|123652292|sp|Q4K4D4|Y5841_PSEF5 RecName: Full=UPF0235 protein PFL_5841 gi|68347425|gb|AAY95031.1| conserved hypothetical protein TIGR00251 [Pseudomonas fluorescens Pf-5] Length = 97 Score = 39.3 bits (90), Expect = 0.18, Method: Compositional matrix adjust. Identities = 20/54 (37%), Positives = 33/54 (61%), Gaps = 6/54 (11%) Query: 23 PKDTSDTI------HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQ 70 PK +SD +KI++TA P +GKAN ++A LAK + KS++ ++S + Sbjct: 19 PKASSDEFAGLHGERLKIRLTAPPVEGKANAHLMAFLAKAFGIPKSNVSLVSGE 72 >gi|126463218|ref|YP_001044332.1| hypothetical protein Rsph17029_2458 [Rhodobacter sphaeroides ATCC 17029] gi|126104882|gb|ABN77560.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC 17029] Length = 85 Score = 39.3 bits (90), Expect = 0.18, Method: Compositional matrix adjust. Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 8/75 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A ++ + D + + +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 19 VRVTPRAARAKV-------DLQEGV-VRVHVTCVPEDGKANRAVTEALAKALGVAKSRLT 70 Query: 66 MLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 71 LVRGATSRDKTFRLD 85 >gi|114326941|ref|YP_744098.1| putative cytoplasmic protein [Granulibacter bethesdensis CGDNIH1] gi|122328075|sp|Q0BVH7|Y277_GRABC RecName: Full=UPF0235 protein GbCGDNIH1_0277 gi|114315115|gb|ABI61175.1| hypothetical cytosolic protein [Granulibacter bethesdensis CGDNIH1] Length = 107 Score = 39.3 bits (90), Expect = 0.18, Method: Compositional matrix adjust. Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 2/86 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +R++P A+K G+ D +KI V+A KG+AN+A+ MLAK L + S + Sbjct: 15 LRVVPKARKIGLGGTVPGADGKP--RLKISVSAPADKGQANEAVRDMLAKALRVPASRIT 72 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQ 91 +L ++ K++ ++ D + + ++ Sbjct: 73 LLQGLTARDKLVRVEGDPETLGSTVE 98 >gi|217979671|ref|YP_002363818.1| protein of unknown function DUF167 [Methylocella silvestris BL2] gi|217505047|gb|ACK52456.1| protein of unknown function DUF167 [Methylocella silvestris BL2] Length = 118 Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust. Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 4/78 (5%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIH-MKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VRL P KS +E SD +K +V A PQ G+AN A++ ++AK L L+ S++ Sbjct: 26 VRLTP---KSARDEIEGASQLSDGRAVLKARVRAAPQDGEANAALIRLVAKALRLAPSAV 82 Query: 65 RMLSKQSSPLKIIYIDKD 82 R+ + ++ LK + + D Sbjct: 83 RVEAGATARLKTLCLTGD 100 >gi|297617010|ref|YP_003702169.1| hypothetical protein Slip_0824 [Syntrophothermus lipocalidus DSM 12680] gi|297144847|gb|ADI01604.1| protein of unknown function DUF167 [Syntrophothermus lipocalidus DSM 12680] Length = 96 Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 46/74 (62%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR++P A K+ E+ + + KIK+TA P +G+AN+A+++ LAK +++ ++ Sbjct: 14 VRVLPRASKN-----EVIGEVEGAV--KIKLTAAPLEGEANQALISFLAKISGVARKNVT 66 Query: 66 MLSKQSSPLKIIYI 79 ++ ++S K++ I Sbjct: 67 IIKGETSRHKLVEI 80 >gi|294676098|ref|YP_003576713.1| hypothetical protein RCAP_rcc00541 [Rhodobacter capsulatus SB 1003] gi|294474918|gb|ADE84306.1| protein of unknown function DUF167 [Rhodobacter capsulatus SB 1003] Length = 83 Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust. Identities = 22/79 (27%), Positives = 44/79 (55%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I + +D ++ VT P+ GKAN A++ +LAK L ++K Sbjct: 13 AEIALRVTPKASRNEIV---VAEDG-----LRAYVTVVPEGGKANAAVVKLLAKSLGVAK 64 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ +++ K+ +D Sbjct: 65 SRLTLIRGETARDKVFRLD 83 >gi|119509193|ref|ZP_01628343.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414] gi|119466035|gb|EAW46922.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414] Length = 52 Score = 39.3 bits (90), Expect = 0.19, Method: Compositional matrix adjust. Identities = 20/51 (39%), Positives = 33/51 (64%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD 82 M +++ + P GKAN+ ++ +LA+K + KS +R+ S SS K+I ID D Sbjct: 1 MIVRLKSPPVDGKANEELIKLLAEKFHVPKSHIRIKSGVSSRQKLIEIDTD 51 >gi|193213756|ref|YP_001994955.1| hypothetical protein Ctha_0036 [Chloroherpeton thalassium ATCC 35110] gi|193087233|gb|ACF12508.1| protein of unknown function DUF167 [Chloroherpeton thalassium ATCC 35110] Length = 97 Score = 39.3 bits (90), Expect = 0.20, Method: Compositional matrix adjust. Identities = 27/74 (36%), Positives = 40/74 (54%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P A K+ EI + +KI++ A P + ANKA + LAK ++KS + Sbjct: 15 VRLQPRASKN-----EIVGEYDGA--LKIRIAAPPVENAANKACIEFLAKTFGIAKSQVE 67 Query: 66 MLSKQSSPLKIIYI 79 +LS +S K+I I Sbjct: 68 ILSGDTSRNKLIRI 81 >gi|307824763|ref|ZP_07654986.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] gi|307734121|gb|EFO04975.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] Length = 88 Score = 39.3 bits (90), Expect = 0.21, Method: Compositional matrix adjust. Identities = 20/60 (33%), Positives = 37/60 (61%), Gaps = 3/60 (5%) Query: 24 KDTSDTIH---MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 KD +H +K++V A P GKAN+ ++A +A + +SKS+ ++++ +S K I I+ Sbjct: 11 KDEWAGLHGERLKLRVKAAPVDGKANQHLIAFIADEFGVSKSACKLITGESGREKRIAIN 70 >gi|221640262|ref|YP_002526524.1| hypothetical protein RSKD131_2163 [Rhodobacter sphaeroides KD131] gi|221161043|gb|ACM02023.1| Hypothetical Protein RSKD131_2163 [Rhodobacter sphaeroides KD131] Length = 85 Score = 38.9 bits (89), Expect = 0.21, Method: Compositional matrix adjust. Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 8/75 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A ++ + D + + +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 19 VRVTPRAARAKV-------DLQEGV-VRVHVTCVPEDGKANRAVTEALAKALGVAKSRLT 70 Query: 66 MLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 71 LVRGATSRDKTFRLD 85 >gi|330795787|ref|XP_003285952.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum] gi|325084041|gb|EGC37478.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum] Length = 110 Score = 38.9 bits (89), Expect = 0.22, Method: Compositional matrix adjust. Identities = 25/82 (30%), Positives = 47/82 (57%), Gaps = 7/82 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 + + PN+K++ I S E + I + ++++ P G+ANK ++ L+K+L L KS+++ Sbjct: 28 INVHPNSKENQIISFE------NEI-LSLRISEPPIDGQANKGVVEFLSKELGLRKSNIQ 80 Query: 66 MLSKQSSPLKIIYIDKDCKEIT 87 + S K I ID + + IT Sbjct: 81 VSKGSKSRNKSIEIDLESESIT 102 >gi|328541881|ref|YP_004301990.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1] gi|326411632|gb|ADZ68695.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1] Length = 106 Score = 38.9 bits (89), Expect = 0.23, Method: Compositional matrix adjust. Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 2/79 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VRL P A K + D + +++ A P+KG AN A+ A++AK L + KS+ Sbjct: 17 VDVRLTPRAGKDAVEGCSELSDGRPVVLARVR--AIPEKGAANAALEALIAKALGVPKSA 74 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + + + LK + + D Sbjct: 75 VAIDAGAGARLKTLKVSGD 93 >gi|77464376|ref|YP_353880.1| hypothetical protein RSP_0800 [Rhodobacter sphaeroides 2.4.1] gi|77388794|gb|ABA79979.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 85 Score = 38.9 bits (89), Expect = 0.26, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 8/75 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A + A +E+ + +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 19 VRVTPRAAR---AKVELQEGV-----VRVHVTCVPEDGKANRAVTEALAKALGVAKSRLT 70 Query: 66 MLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 71 LVRGATSRDKTFRLD 85 >gi|288818090|ref|YP_003432438.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6] gi|288787490|dbj|BAI69237.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6] Length = 73 Score = 38.5 bits (88), Expect = 0.31, Method: Compositional matrix adjust. Identities = 22/75 (29%), Positives = 43/75 (57%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR P AK+ E K +++++ ++ V PQ+GKAN+ + +L+ L + KS ++ Sbjct: 5 VRAKPKAKR------EYVKKITESVY-EVAVKEPPQEGKANERIAVLLSYHLGIPKSRIK 57 Query: 66 MLSKQSSPLKIIYID 80 +L +S +K+ +D Sbjct: 58 LLKGHTSKIKLFQVD 72 >gi|254419593|ref|ZP_05033317.1| conserved hypothetical protein [Brevundimonas sp. BAL3] gi|196185770|gb|EDX80746.1| conserved hypothetical protein [Brevundimonas sp. BAL3] Length = 69 Score = 38.5 bits (88), Expect = 0.32, Method: Compositional matrix adjust. Identities = 18/49 (36%), Positives = 32/49 (65%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 +K++V A P G+AN A++ LAK L +S+SS+ + S LK++ ++ Sbjct: 2 LKVRVRARPVDGEANAALVKFLAKALGVSRSSVVLERGGQSRLKMVSVE 50 >gi|254784473|ref|YP_003071901.1| hypothetical protein TERTU_0220 [Teredinibacter turnerae T7901] gi|237683524|gb|ACR10788.1| conserved hypothetical protein [Teredinibacter turnerae T7901] Length = 99 Score = 38.5 bits (88), Expect = 0.33, Method: Compositional matrix adjust. Identities = 21/64 (32%), Positives = 36/64 (56%), Gaps = 6/64 (9%) Query: 23 PKDTSDTI------HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKI 76 PK +SD +KI++TA P GKAN ++ LA++ ++KS + ++ Q S K Sbjct: 20 PKASSDAFAGLQADRLKIRITAPPTDGKANAHLVKYLARQFGVAKSDIEIVRGQLSRQKT 79 Query: 77 IYID 80 + I+ Sbjct: 80 LQIN 83 >gi|126733001|ref|ZP_01748760.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37] gi|126706530|gb|EBA05608.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37] Length = 84 Score = 38.1 bits (87), Expect = 0.38, Method: Compositional matrix adjust. Identities = 22/78 (28%), Positives = 42/78 (53%), Gaps = 8/78 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + + T D +++ VT P+ GKA A++ +LAK L + K Sbjct: 14 ATLAVRVTPKASRNAV------ERTDDA--LRVYVTTVPEGGKATAAVVKLLAKALGVPK 65 Query: 62 SSLRMLSKQSSPLKIIYI 79 S L ++ ++S K+ + Sbjct: 66 SRLELVRGETSRDKVFRV 83 >gi|296533899|ref|ZP_06896426.1| protein of hypothetical function DUF167 [Roseomonas cervicalis ATCC 49957] gi|296265774|gb|EFH11872.1| protein of hypothetical function DUF167 [Roseomonas cervicalis ATCC 49957] Length = 101 Score = 38.1 bits (87), Expect = 0.38, Method: Compositional matrix adjust. Identities = 25/88 (28%), Positives = 45/88 (51%), Gaps = 5/88 (5%) Query: 6 VRLIPNAKKSGI-ASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V+ P A+++G+ + P D +K+ V P+ G+ANKA+ A+LA L + S++ Sbjct: 16 VKAQPKARRAGLQGWIAAP----DGPRLKLAVHEAPEDGRANKAICALLAGALHVPPSAI 71 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ +S K I D + E L+ Sbjct: 72 TVVQGATSREKTCRILGDSARLHETLEG 99 >gi|327399372|ref|YP_004340241.1| hypothetical protein Hipma_1220 [Hippea maritima DSM 10411] gi|327182001|gb|AEA34182.1| UPF0235 protein yggU [Hippea maritima DSM 10411] Length = 84 Score = 38.1 bits (87), Expect = 0.38, Method: Compositional matrix adjust. Identities = 26/85 (30%), Positives = 48/85 (56%), Gaps = 10/85 (11%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K + + K + +K+ P +GKANKA++ LAK+L ++KS + Sbjct: 5 VKVKPNSK---VEQFDFDKGV-----LTLKIKEKPVEGKANKAVVDKLAKRLKVAKSCIE 56 Query: 66 MLSKQSSPLKIIYIDKDCKEITELL 90 ++ + S K++ I DC + E+L Sbjct: 57 IVKGEKSRSKLVRI--DCLDDDEIL 79 >gi|260467362|ref|ZP_05813535.1| protein of unknown function DUF167 [Mesorhizobium opportunistum WSM2075] gi|259028889|gb|EEW30192.1| protein of unknown function DUF167 [Mesorhizobium opportunistum WSM2075] Length = 103 Score = 38.1 bits (87), Expect = 0.39, Method: Compositional matrix adjust. Identities = 25/90 (27%), Positives = 52/90 (57%), Gaps = 4/90 (4%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDT-IHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ VRL P KS + LE + +D H+K++V A P+ G AN+A+ + AK L + Sbjct: 13 DLFVRLTP---KSSVDRLEGVETAADGRSHLKVRVRAVPENGAANQALERLAAKTLGVPV 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S++ +++ ++ LK + + D + ++ ++ Sbjct: 70 SAVSVVAGGTARLKTLRVAGDPEALSRSIE 99 >gi|296109159|ref|YP_003616108.1| protein of unknown function DUF167 [Methanocaldococcus infernus ME] gi|295433973|gb|ADG13144.1| protein of unknown function DUF167 [Methanocaldococcus infernus ME] Length = 92 Score = 38.1 bits (87), Expect = 0.40, Method: Compositional matrix adjust. Identities = 25/83 (30%), Positives = 44/83 (53%), Gaps = 9/83 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PNAKK+ I + ++ +++KV A P +GKANK ++ +K + +++ Sbjct: 17 PNAKKTEIVGRDEWRN-----RLEVKVKAPPVEGKANKEIIKFFSKLFG----DVEIVAG 67 Query: 70 QSSPLKIIYIDKDCKEITELLQN 92 + S K I I K KE+ E+L + Sbjct: 68 EKSSKKTILIRKPLKEVEEILNS 90 >gi|242279866|ref|YP_002991995.1| hypothetical protein Desal_2400 [Desulfovibrio salexigens DSM 2638] gi|242122760|gb|ACS80456.1| protein of unknown function DUF167 [Desulfovibrio salexigens DSM 2638] Length = 106 Score = 38.1 bits (87), Expect = 0.40, Method: Compositional matrix adjust. Identities = 24/81 (29%), Positives = 42/81 (51%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V + P AK GI + D++ ++I A KANKA+ A +A +L L K + Sbjct: 22 VSVWVQPGAKNEGITG-----EYQDSVRVRINAPAV--DNKANKALAAFVATRLGLKKRN 74 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + S S+ K++ ++ D + Sbjct: 75 ISIASGHSNRKKVLLVESDVE 95 >gi|289208238|ref|YP_003460304.1| hypothetical protein TK90_1056 [Thioalkalivibrio sp. K90mix] gi|288943869|gb|ADC71568.1| protein of unknown function DUF167 [Thioalkalivibrio sp. K90mix] Length = 85 Score = 38.1 bits (87), Expect = 0.41, Method: Compositional matrix adjust. Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + VR+ P +K+ I D +K++V A P+KG+AN+A+ A+LAK L Sbjct: 1 MARLRVRVAPGSKRDAIGPW-----MGDI--LKLRVQAPPEKGRANEAVCALLAKALGCP 53 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + +++ ++ K + I+ Sbjct: 54 ARDVSVVAGATARDKTVAIE 73 >gi|284039659|ref|YP_003389589.1| hypothetical protein Slin_4812 [Spirosoma linguale DSM 74] gi|283818952|gb|ADB40790.1| protein of unknown function DUF167 [Spirosoma linguale DSM 74] Length = 86 Score = 38.1 bits (87), Expect = 0.42, Method: Compositional matrix adjust. Identities = 21/66 (31%), Positives = 39/66 (59%), Gaps = 2/66 (3%) Query: 25 DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84 D + ++ KI+ A Q GKAN ++ LAK+L + KS + +++ ++P K I +D + Sbjct: 19 DAAGQLNAKIRAPA--QDGKANAYLIEFLAKQLGIPKSGVSIVAGFTNPHKRIEVDVPEE 76 Query: 85 EITELL 90 +T+ L Sbjct: 77 VLTDFL 82 >gi|226942467|ref|YP_002797540.1| hypothetical protein Avin_03050 [Azotobacter vinelandii DJ] gi|259646933|sp|C1DI68|Y305_AZOVD RecName: Full=UPF0235 protein Avin_03050 gi|226717394|gb|ACO76565.1| conserved hypothetical protein [Azotobacter vinelandii DJ] Length = 99 Score = 37.7 bits (86), Expect = 0.46, Method: Compositional matrix adjust. Identities = 22/61 (36%), Positives = 35/61 (57%), Gaps = 3/61 (4%) Query: 24 KDTSDTIH---MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 KD +H +KI++TA P +GKAN +LA LA + KS + + S +S+ K + I Sbjct: 23 KDEFAGLHGERLKIRLTAPPVEGKANAHLLAFLAGVFGVPKSQVSLESGESNRQKRVRIR 82 Query: 81 K 81 + Sbjct: 83 R 83 >gi|114778113|ref|ZP_01453000.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1] gi|114551531|gb|EAU54085.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1] Length = 101 Score = 37.7 bits (86), Expect = 0.49, Method: Compositional matrix adjust. Identities = 18/55 (32%), Positives = 32/55 (58%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEI 86 +KI V Q GKAN+A++ +A L LS++ + + S +S K +++ D E+ Sbjct: 37 LKIAVAEAAQDGKANEAIVRFIADALNLSRADVDVASGHTSRRKRLFLHGDGSEL 91 >gi|170029973|ref|XP_001842865.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167865325|gb|EDS28708.1| conserved hypothetical protein [Culex quinquefasciatus] Length = 157 Score = 37.7 bits (86), Expect = 0.50, Method: Compositional matrix adjust. Identities = 25/91 (27%), Positives = 48/91 (52%), Gaps = 9/91 (9%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 NV+++++ P +K +GI +E + +++ A P G+AN ++ LAK L L Sbjct: 64 NVLIKILAKPGSKFNGITGIEDE-------GVGVQIAAPPIDGEANTELVKYLAKLLDLR 116 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S K I ++K C+ ++L+ Sbjct: 117 KSDVSLDRGSKSRQKTIVLEKGCRTPDQVLE 147 >gi|54295557|ref|YP_127972.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens] gi|53755389|emb|CAH16885.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens] Length = 95 Score = 37.7 bits (86), Expect = 0.53, Method: Compositional matrix adjust. Identities = 21/68 (30%), Positives = 41/68 (60%), Gaps = 7/68 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PNAKKS + ++ + D++H I + A PQ+G+AN +L +++ + K+ + ++ Sbjct: 18 PNAKKSKLMAI-----SDDSLH--IALHAKPQEGEANNELLFFISQFFKIPKTQIELIKG 70 Query: 70 QSSPLKII 77 +SS K+I Sbjct: 71 KSSRHKLI 78 >gi|83945079|ref|ZP_00957445.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii HTCC2633] gi|83851861|gb|EAP89716.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii HTCC2633] Length = 101 Score = 37.7 bits (86), Expect = 0.53, Method: Compositional matrix adjust. Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 2/77 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VR+ P A ++G D I + +V A P KG AN + A+ AK L + KS Sbjct: 6 RLFVRVQPRASRAGFDGARAGTDGR--IRLAARVRAAPDKGAANTELCALTAKTLGVPKS 63 Query: 63 SLRMLSKQSSPLKIIYI 79 ++ +++ + K + + Sbjct: 64 TVSVIAGATQREKTLLV 80 >gi|298293297|ref|YP_003695236.1| hypothetical protein Snov_3343 [Starkeya novella DSM 506] gi|296929808|gb|ADH90617.1| protein of unknown function DUF167 [Starkeya novella DSM 506] Length = 106 Score = 37.7 bits (86), Expect = 0.56, Method: Compositional matrix adjust. Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + I D + ++ V A + GKAN A+ +LAK ++ S Sbjct: 16 VTVRATPRGGRDAIDGFVELGDGRTALKARVSVAA--EDGKANAALGKLLAKAAGIAPSR 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + ++S + K ++ D EI LQ Sbjct: 74 VDLVSGATGRTKAFKLNGDAAEIAARLQ 101 >gi|195160833|ref|XP_002021278.1| GL25246 [Drosophila persimilis] gi|198465041|ref|XP_001353470.2| GA13391 [Drosophila pseudoobscura pseudoobscura] gi|194118391|gb|EDW40434.1| GL25246 [Drosophila persimilis] gi|198149990|gb|EAL30981.2| GA13391 [Drosophila pseudoobscura pseudoobscura] Length = 125 Score = 37.7 bits (86), Expect = 0.56, Method: Compositional matrix adjust. Identities = 25/91 (27%), Positives = 46/91 (50%), Gaps = 9/91 (9%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK +GI ++ D + +++ A P +G+AN ++ L+K L L Sbjct: 34 NICIKILAKPGAKHNGITNI-------DLEGVGVQIAAPPSEGEANAELVKFLSKVLGLR 86 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S KII I K + + Q Sbjct: 87 KSDVSLDKGSRSRNKIILISKGASTVESIQQ 117 >gi|258593710|emb|CBE70051.1| conserved hypothetical protein [NC10 bacterium 'Dutch sediment'] Length = 102 Score = 37.4 bits (85), Expect = 0.60, Method: Compositional matrix adjust. Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 7/88 (7%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P A + I D ++++V A P +G+AN A L +LAK L L S L Sbjct: 15 VRLQPKASREAI-------DGEVDGVLRLRVNAPPVEGQANDACLRLLAKTLDLPISRLG 67 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ Q + +K I + ++ NN Sbjct: 68 IVAGQQARVKTIRVTDASADLLRTALNN 95 >gi|157804228|ref|YP_001492777.1| hypothetical protein A1E_05380 [Rickettsia canadensis str. McKiel] gi|166227320|sp|A8F059|Y5380_RICCK RecName: Full=UPF0235 protein A1E_05380 gi|157785491|gb|ABV73992.1| hypothetical protein A1E_05380 [Rickettsia canadensis str. McKiel] Length = 105 Score = 37.4 bits (85), Expect = 0.61, Method: Compositional matrix adjust. Identities = 22/77 (28%), Positives = 45/77 (58%), Gaps = 6/77 (7%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P++K++ I+ I ++ ++K+ + P++GKAN+ ++ LAK LS+S++ ++ Sbjct: 22 PDSKQNLISDFVI---INNLPYLKLFIKTAPEQGKANEEIINYLAKAWKLSRSNIEIIKG 78 Query: 70 QSSPLKIIY---IDKDC 83 + LK I ID+D Sbjct: 79 HTHSLKTILIKNIDEDY 95 >gi|146309167|ref|YP_001189632.1| hypothetical protein Pmen_4153 [Pseudomonas mendocina ymp] gi|205829317|sp|A4XZY4|Y4153_PSEMY RecName: Full=UPF0235 protein Pmen_4153 gi|145577368|gb|ABP86900.1| protein of unknown function DUF167 [Pseudomonas mendocina ymp] Length = 98 Score = 37.4 bits (85), Expect = 0.61, Method: Compositional matrix adjust. Identities = 19/50 (38%), Positives = 31/50 (62%), Gaps = 3/50 (6%) Query: 24 KDTSDTIH---MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQ 70 KD +H +KI++TA P +GKAN +LA LAK ++K+ + + S + Sbjct: 23 KDEFAGLHGERLKIRLTAPPVEGKANAHLLAFLAKAFGVAKAQVSLESGE 72 >gi|94971231|ref|YP_593279.1| hypothetical protein Acid345_4205 [Candidatus Koribacter versatilis Ellin345] gi|166227235|sp|Q1IIU5|Y4205_ACIBL RecName: Full=UPF0235 protein Acid345_4205 gi|94553281|gb|ABF43205.1| protein of unknown function DUF167 [Candidatus Koribacter versatilis Ellin345] Length = 96 Score = 37.4 bits (85), Expect = 0.65, Method: Compositional matrix adjust. Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 7/80 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VRL P AKK+ I E+ +K+ VT P G+AN+A++ +A L +++SS+ Sbjct: 15 VRLQPKAKKTAIIG-ELNG------ALKLGVTDPPIDGRANEALIRFVAGLLKVTRSSVT 67 Query: 66 MLSKQSSPLKIIYIDKDCKE 85 + + +SS K+I I+ E Sbjct: 68 IAAGESSRNKVIRIEGVTAE 87 >gi|313673537|ref|YP_004051648.1| hypothetical protein Calni_1577 [Calditerrivibrio nitroreducens DSM 19672] gi|312940293|gb|ADR19485.1| protein of unknown function DUF167 [Calditerrivibrio nitroreducens DSM 19672] Length = 86 Score = 37.4 bits (85), Expect = 0.67, Method: Compositional matrix adjust. Identities = 21/63 (33%), Positives = 40/63 (63%), Gaps = 2/63 (3%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEITEL 89 +KIK+ + P GKAN+ +++ +++ L LSK + ++S + S KII + + D + I E Sbjct: 24 IKIKIKSPPTDGKANEELISFISQSLNLSKKEVGIISGEKSRYKIIEVPENYDMEFIKEK 83 Query: 90 LQN 92 L++ Sbjct: 84 LKD 86 >gi|39995970|ref|NP_951921.1| hypothetical protein GSU0864 [Geobacter sulfurreducens PCA] gi|39982735|gb|AAR34194.1| conserved hypothetical protein TIGR00251 [Geobacter sulfurreducens PCA] gi|307634758|gb|ADI83708.2| protein of unknown function DUF167 [Geobacter sulfurreducens KN400] Length = 107 Score = 37.4 bits (85), Expect = 0.68, Method: Compositional matrix adjust. Identities = 19/63 (30%), Positives = 39/63 (61%), Gaps = 1/63 (1%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI-DKDCKEITELL 90 +K+++T+ P +G+AN+ + LAK+L + KS + +++ + S K I + D + LL Sbjct: 43 IKLRLTSPPVEGEANRLCVEFLAKRLGVPKSCVAIIAGEKSRHKTIRVSGSDAAAVLALL 102 Query: 91 QNN 93 +N+ Sbjct: 103 ENS 105 >gi|94266431|ref|ZP_01290126.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1] gi|93452973|gb|EAT03472.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1] Length = 121 Score = 37.4 bits (85), Expect = 0.69, Method: Compositional matrix adjust. Identities = 20/75 (26%), Positives = 39/75 (52%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR P A ++ +A + ++I+V A P GKAN+A+L LA + L ++++ Sbjct: 40 VRAQPGAARTEVAG-------TYGARLRIRVAAPPVDGKANRALLTFLASRCGLVRNAVT 92 Query: 66 MLSKQSSPLKIIYID 80 ++ Q K+ ++ Sbjct: 93 LVGGQRGRDKLFRLE 107 >gi|320352454|ref|YP_004193793.1| hypothetical protein Despr_0318 [Desulfobulbus propionicus DSM 2032] gi|320120956|gb|ADW16502.1| protein of unknown function DUF167 [Desulfobulbus propionicus DSM 2032] Length = 110 Score = 37.0 bits (84), Expect = 0.85, Method: Compositional matrix adjust. Identities = 21/48 (43%), Positives = 30/48 (62%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 +K++VT P GKAN+A++A LAK L KSS+ + S S K + I Sbjct: 42 LKLRVTTPPVDGKANQAVVAYLAKLFHLPKSSVVLKSGHQSRGKTVVI 89 >gi|254409972|ref|ZP_05023752.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] gi|196183008|gb|EDX77992.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] Length = 74 Score = 37.0 bits (84), Expect = 0.87, Method: Compositional matrix adjust. Identities = 26/77 (33%), Positives = 45/77 (58%), Gaps = 7/77 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K I E+ T +K+ + + P GKANK ++ +LA+K ++KS ++ Sbjct: 5 VKVKPNSKTQSIE--EMADGT-----LKVNLKSPPVDGKANKELIELLAEKFNVTKSQVQ 57 Query: 66 MLSKQSSPLKIIYIDKD 82 + S SS +K+I I D Sbjct: 58 IKSGLSSKIKLIEIVAD 74 >gi|52842921|ref|YP_096720.1| hypothetical protein lpg2716 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630032|gb|AAU28773.1| hypothetical protein lpg2716 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 95 Score = 37.0 bits (84), Expect = 0.92, Method: Compositional matrix adjust. Identities = 21/68 (30%), Positives = 40/68 (58%), Gaps = 7/68 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PNAKKS + ++ + D +H I + A PQ+G+AN +L +++ + K+ + ++ Sbjct: 18 PNAKKSKLMAI-----SDDRLH--IALHAKPQEGEANNELLFFISQFFKIPKTQIELIKG 70 Query: 70 QSSPLKII 77 +SS K+I Sbjct: 71 KSSRHKLI 78 >gi|171914386|ref|ZP_02929856.1| hypothetical protein VspiD_24445 [Verrucomicrobium spinosum DSM 4136] Length = 92 Score = 37.0 bits (84), Expect = 0.96, Method: Compositional matrix adjust. Identities = 21/77 (27%), Positives = 41/77 (53%), Gaps = 2/77 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 N+ ++ PNA++S I D + +K+ A +GKANK ++ LA++L +K Sbjct: 7 NLACKVTPNARRSEIVGWGA--DEQGRGVLLVKLAAPALEGKANKELVRFLAEQLGCAKG 64 Query: 63 SLRMLSKQSSPLKIIYI 79 + +L +S K++ + Sbjct: 65 EVSLLRGDASRTKLLRV 81 >gi|224826207|ref|ZP_03699310.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002] gi|224601844|gb|EEG08024.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002] Length = 108 Score = 36.6 bits (83), Expect = 1.0, Method: Compositional matrix adjust. Identities = 23/70 (32%), Positives = 37/70 (52%), Gaps = 7/70 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P AKK+ +A +K+++ A P +GKAN +LA LA++ + K + +LS Sbjct: 31 PGAKKTDLAG-------EHGGALKLRLAAPPVEGKANAMLLAWLAERFEVPKRDVVLLSG 83 Query: 70 QSSPLKIIYI 79 S KI+ I Sbjct: 84 DKSRHKIVEI 93 >gi|110635561|ref|YP_675769.1| hypothetical protein Meso_3232 [Mesorhizobium sp. BNC1] gi|110286545|gb|ABG64604.1| protein of unknown function DUF167 [Chelativorans sp. BNC1] Length = 116 Score = 36.6 bits (83), Expect = 1.0, Method: Compositional matrix adjust. Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 4/82 (4%) Query: 3 NVIVRLIPNAKKSGIASL-EIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P + I + E P + +K +V A P+ GKAN+A+ +LA L + + Sbjct: 15 EIFVRLTPKSSSDAIEGVTEGPDGQA---FLKARVRAIPEAGKANEALERLLASALGVPR 71 Query: 62 SSLRMLSKQSSPLKIIYIDKDC 83 + + S +S K + I D Sbjct: 72 RDVAVSSGAASRRKTVSITGDA 93 >gi|196231015|ref|ZP_03129875.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428] gi|196224845|gb|EDY19355.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428] Length = 93 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 21/73 (28%), Positives = 41/73 (56%), Gaps = 7/73 (9%) Query: 7 RLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRM 66 R++PNA++S + + D + K+KV A GKAN+A+ LA+ L + ++ + Sbjct: 10 RIVPNARRSEVVGVH-----GDAV--KVKVQAPAMDGKANEALRDFLAEVLTVPARAVEI 62 Query: 67 LSKQSSPLKIIYI 79 ++ + S K++ I Sbjct: 63 VAGEKSRDKVVAI 75 >gi|195376389|ref|XP_002046979.1| GJ12187 [Drosophila virilis] gi|194154137|gb|EDW69321.1| GJ12187 [Drosophila virilis] Length = 124 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 9/91 (9%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N++++++ P AK++GI + + + +++ A P +G+AN ++ L+K L L Sbjct: 33 NIVIKILAKPGAKQNGITDIGLD-------GVGVQIAAPPSEGEANAELVKFLSKVLGLR 85 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S KI+ + K + + Q Sbjct: 86 KSDVSLDKGSRSRNKIVLVTKGASTVEAIEQ 116 >gi|116751493|ref|YP_848180.1| hypothetical protein Sfum_4080 [Syntrophobacter fumaroxidans MPOB] gi|116700557|gb|ABK19745.1| protein of unknown function DUF167 [Syntrophobacter fumaroxidans MPOB] Length = 105 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 19/55 (34%), Positives = 30/55 (54%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEI 86 +KI++TA P +G+ANK + LA +S+S + ++ S K I I EI Sbjct: 40 LKIRLTAPPVEGEANKECVKFLAGAFGVSRSDVEIIRGHKSRRKTILIRNSTPEI 94 >gi|114566491|ref|YP_753645.1| hypothetical protein Swol_0959 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|122318444|sp|Q0AYD0|Y959_SYNWW RecName: Full=UPF0235 protein Swol_0959 gi|114337426|gb|ABI68274.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 102 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 19/48 (39%), Positives = 30/48 (62%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 +KIKV A P +G AN+A+ LA+ L K +R++ ++S KI+ I Sbjct: 38 LKIKVMAPPVEGAANQALQKFLAELFKLPKKDIRIVRGETSRHKIVEI 85 >gi|169629033|ref|YP_001702682.1| hypothetical protein MAB_1946 [Mycobacterium abscessus ATCC 19977] gi|169241000|emb|CAM62028.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 76 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 22/66 (33%), Positives = 41/66 (62%), Gaps = 6/66 (9%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P ++K ++E+ D + T+ ++ GKANKA +A+LA+ L + KS++R+++ Sbjct: 12 PGSRKG--PAVEVADDGALTLFVREPAI----DGKANKAAIALLAEYLDVPKSTVRLVAG 65 Query: 70 QSSPLK 75 Q+S LK Sbjct: 66 QTSRLK 71 >gi|313223261|emb|CBY43439.1| unnamed protein product [Oikopleura dioica] Length = 180 Score = 36.6 bits (83), Expect = 1.1, Method: Compositional matrix adjust. Identities = 23/55 (41%), Positives = 32/55 (58%) Query: 37 TATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +A P KGKANKA+L LA KL + S L + S +S K++ ++ D T L Q Sbjct: 115 SAPPDKGKANKAVLKSLADKLGVKPSKLSIQSGTTSRSKVVLLETDADLSTILAQ 169 >gi|195014316|ref|XP_001984001.1| GH15254 [Drosophila grimshawi] gi|193897483|gb|EDV96349.1| GH15254 [Drosophila grimshawi] Length = 126 Score = 36.2 bits (82), Expect = 1.3, Method: Compositional matrix adjust. Identities = 23/92 (25%), Positives = 47/92 (51%), Gaps = 9/92 (9%) Query: 2 CNVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 N++++++ P AK++GI + + + +++ A P +G+AN ++ L+K L L Sbjct: 34 GNIVIKILAKPGAKQNGITDIGLE-------GVGVQIAAPPSEGEANAELVKFLSKVLGL 86 Query: 60 SKSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S K+I I K + + Q Sbjct: 87 RKSDVSLDKGSRSKNKLILITKGVSTVEAIEQ 118 >gi|56476391|ref|YP_157980.1| hypothetical protein ebA1762 [Aromatoleum aromaticum EbN1] gi|81358142|sp|Q5P6I2|Y954_AZOSE RecName: Full=UPF0235 protein AZOSEA09540 gi|56312434|emb|CAI07079.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1] Length = 97 Score = 36.2 bits (82), Expect = 1.4, Method: Compositional matrix adjust. Identities = 18/54 (33%), Positives = 31/54 (57%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKE 85 MK+++ A P GKAN A+ LA + +S++ +LS ++S K + I+ E Sbjct: 35 MKLRLAAPPVDGKANAALTVFLAAFCGVGRSAVSLLSGETSRAKRVRIEGAGSE 88 >gi|114763599|ref|ZP_01443004.1| hypothetical protein 1100011001330_R2601_13754 [Pelagibaca bermudensis HTCC2601] gi|114543879|gb|EAU46891.1| hypothetical protein R2601_13754 [Roseovarius sp. HTCC2601] Length = 85 Score = 36.2 bits (82), Expect = 1.4, Method: Compositional matrix adjust. Identities = 25/71 (35%), Positives = 39/71 (54%), Gaps = 8/71 (11%) Query: 18 ASLEI---PKDTSDTIH-----MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 A+LE+ PK + + I +++ VT P+ GKAN A+ +LAK L L KS L ++ Sbjct: 14 ATLELRVTPKASRNEIREEGGTLRVYVTTVPEDGKANAAVQKLLAKALGLPKSRLVLVRG 73 Query: 70 QSSPLKIIYID 80 +S K I+ Sbjct: 74 ATSRDKAFRIE 84 >gi|85859583|ref|YP_461785.1| putative cytoplasmic protein [Syntrophus aciditrophicus SB] gi|85722674|gb|ABC77617.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB] Length = 118 Score = 36.2 bits (82), Expect = 1.4, Method: Compositional matrix adjust. Identities = 19/48 (39%), Positives = 28/48 (58%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 ++IK+TA P GKAN L LA L + K + ++S +S KI+ I Sbjct: 36 LRIKLTAPPVDGKANDECLEFLAGILGVKKGQMDIISGHTSRRKIVQI 83 >gi|194748771|ref|XP_001956818.1| GF24383 [Drosophila ananassae] gi|190624100|gb|EDV39624.1| GF24383 [Drosophila ananassae] Length = 126 Score = 36.2 bits (82), Expect = 1.6, Method: Compositional matrix adjust. Identities = 24/83 (28%), Positives = 43/83 (51%), Gaps = 9/83 (10%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK++GI + T + +++ A P +G+AN ++ L+K L L Sbjct: 36 NICIQILAKPGAKQNGITGI-------STEGVGVQIAAPPSEGEANAELVKFLSKVLGLR 88 Query: 61 KSSLRMLSKQSSPLKIIYIDKDC 83 KS + + S KII I K Sbjct: 89 KSDVSLDKGSRSRNKIILITKGA 111 >gi|54298708|ref|YP_125077.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris] gi|53752493|emb|CAH13925.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris] Length = 95 Score = 35.8 bits (81), Expect = 1.8, Method: Compositional matrix adjust. Identities = 20/68 (29%), Positives = 40/68 (58%), Gaps = 7/68 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PNAKK+ + ++ + D +H I + A PQ+G+AN +L +++ + K+ + ++ Sbjct: 18 PNAKKTKLMAI-----SDDRLH--IALHAKPQEGEANNELLFFISQFFKIPKTQIELIKG 70 Query: 70 QSSPLKII 77 +SS K+I Sbjct: 71 KSSRHKLI 78 >gi|117621083|ref|YP_858117.1| hypothetical protein AHA_3661 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|166232625|sp|A0KPB6|Y3661_AERHH RecName: Full=UPF0235 protein AHA_3661 gi|117562490|gb|ABK39438.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 99 Score = 35.8 bits (81), Expect = 1.9, Method: Compositional matrix adjust. Identities = 22/80 (27%), Positives = 41/80 (51%), Gaps = 9/80 (11%) Query: 23 PKDTSDTI------HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKI 76 PK + D I +K+ +TA P G+AN ++ LAK+ ++K +R++ + K Sbjct: 19 PKASRDQIVGLHGEELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQVRIVRGELGRHKT 78 Query: 77 IYIDKDCK---EITELLQNN 93 + I+ + E++ LL N Sbjct: 79 VAIEAPRQIPAEVSALLDNQ 98 >gi|268567307|ref|XP_002639944.1| Hypothetical protein CBG10764 [Caenorhabditis briggsae] Length = 258 Score = 35.8 bits (81), Expect = 1.9, Method: Compositional matrix adjust. Identities = 19/71 (26%), Positives = 37/71 (52%), Gaps = 7/71 (9%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P AKKSG+ ++ + + + + A P++G AN+ +++ L L L K+ L+ Sbjct: 41 PGAKKSGVVAI-------NESEIDVAIGAAPREGAANEELVSYLMSALGLRKNELQFDKG 93 Query: 70 QSSPLKIIYID 80 S K++ I+ Sbjct: 94 AKSRSKVVLIE 104 >gi|192362044|ref|YP_001980615.1| hypothetical protein CJA_0091 [Cellvibrio japonicus Ueda107] gi|226734040|sp|B3PFH5|Y091_CELJU RecName: Full=UPF0235 protein CJA_0091 gi|190688209|gb|ACE85887.1| conserved hypothetical protein TIGR00251 [Cellvibrio japonicus Ueda107] Length = 101 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust. Identities = 17/54 (31%), Positives = 32/54 (59%), Gaps = 6/54 (11%) Query: 23 PKDTSDTI------HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQ 70 PK + D I +KI++TA P GKAN+ ++ L+K+ + K ++++L + Sbjct: 23 PKASGDDIVGVHGDRLKIRITAPPVDGKANEYLIKWLSKQFRVPKGNIKILQGE 76 >gi|195953751|ref|YP_002122041.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1] gi|226734129|sp|B4U5M3|Y1378_HYDS0 RecName: Full=UPF0235 protein HY04AAS1_1378 gi|195933363|gb|ACG58063.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1] Length = 73 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 44/74 (59%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAK + LE D S +KI + + P GKAN+ ++ +L++ L +SKS + Sbjct: 5 VKVKPNAKTVSVEQLE---DKS----LKISIKSPPVNGKANEELIKVLSEFLKVSKSKIN 57 Query: 66 MLSKQSSPLKIIYI 79 + + +SS K++ I Sbjct: 58 IKAGKSSREKLVEI 71 >gi|309361190|emb|CAP30075.2| hypothetical protein CBG_10764 [Caenorhabditis briggsae AF16] Length = 266 Score = 35.8 bits (81), Expect = 2.0, Method: Compositional matrix adjust. Identities = 19/71 (26%), Positives = 37/71 (52%), Gaps = 7/71 (9%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P AKKSG+ ++ + + + + A P++G AN+ +++ L L L K+ L+ Sbjct: 48 PGAKKSGVVAI-------NESEIDVAIGAAPREGAANEELVSYLMSALGLRKNELQFDKG 100 Query: 70 QSSPLKIIYID 80 S K++ I+ Sbjct: 101 AKSRSKVVLIE 111 >gi|75674677|ref|YP_317098.1| hypothetical protein Nwi_0479 [Nitrobacter winogradskyi Nb-255] gi|74419547|gb|ABA03746.1| Protein of unknown function DUF167 [Nitrobacter winogradskyi Nb-255] Length = 106 Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust. Identities = 23/88 (26%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + I +E+ D + ++++ A G+AN+A+ A+LAK L + K Sbjct: 13 IALRVTPRGGRDAIDGIEMLADGRPVVKVRVRAVA--DGGEANRAVTAVLAKALGVRKID 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +R+L+ +S LK + ++ D ++ L+ Sbjct: 71 VRILAGATSRLKQVAVEGDPVQLGNALR 98 >gi|218782636|ref|YP_002433954.1| hypothetical protein Dalk_4809 [Desulfatibacillum alkenivorans AK-01] gi|218764020|gb|ACL06486.1| protein of unknown function DUF167 [Desulfatibacillum alkenivorans AK-01] Length = 106 Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust. Identities = 21/48 (43%), Positives = 30/48 (62%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 +KIK+ A P G ANK + LAK L L KSS+++LS ++ K I + Sbjct: 36 LKIKLKAPPVGGAANKMCIQFLAKTLKLPKSSIKILSGETGRSKQIMV 83 >gi|195127445|ref|XP_002008179.1| GI11963 [Drosophila mojavensis] gi|193919788|gb|EDW18655.1| GI11963 [Drosophila mojavensis] Length = 125 Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust. Identities = 24/87 (27%), Positives = 46/87 (52%), Gaps = 12/87 (13%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK++GI + + + +++ A P +G+AN ++ L+K L L Sbjct: 34 NIAIKILAKPGAKQNGITDIGLE-------GVGVQIAAPPSEGEANAELVKFLSKVLGLR 86 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEIT 87 KS + + S KII + CK +T Sbjct: 87 KSDVSLDKGSRSRNKIILV---CKGVT 110 >gi|310814754|ref|YP_003962718.1| hypothetical protein EIO_0232 [Ketogulonicigenium vulgare Y25] gi|308753489|gb|ADO41418.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 82 Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust. Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 7/63 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ I +D S +++ VT P+ GKAN A+ +LAK L + K Sbjct: 11 TEIAVRVTPKASRA-----RILRDESGV--LRVYVTVVPEDGKANAAVTELLAKALRIPK 63 Query: 62 SSL 64 S L Sbjct: 64 SKL 66 >gi|194866006|ref|XP_001971712.1| GG14280 [Drosophila erecta] gi|190653495|gb|EDV50738.1| GG14280 [Drosophila erecta] Length = 127 Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust. Identities = 27/94 (28%), Positives = 49/94 (52%), Gaps = 12/94 (12%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK++GI + + + +++ A P +G+AN ++ L+K L L Sbjct: 37 NICIQILAKPGAKQNGITGIGLE-------GVGVQIAAPPSEGEANAELVKFLSKVLGLR 89 Query: 61 KSSLRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 KS + + S KII I K + I ELL+ Sbjct: 90 KSDVSLDKGSRSRNKIIMITKGVSTVEAIEELLR 123 >gi|322697515|gb|EFY89294.1| DUF167 domain protein [Metarhizium acridum CQMa 102] Length = 122 Score = 35.8 bits (81), Expect = 2.2, Method: Compositional matrix adjust. Identities = 20/56 (35%), Positives = 31/56 (55%), Gaps = 3/56 (5%) Query: 36 VTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID---KDCKEITE 88 VTA P+ G+ANKA++ L+ + + KS R +S S K++ I D + TE Sbjct: 51 VTAQPRHGEANKAVVQALSNAIGIPKSRFRFVSGLKSRDKVVAIGDIQGDGPDYTE 106 >gi|195587403|ref|XP_002083454.1| GD13347 [Drosophila simulans] gi|194195463|gb|EDX09039.1| GD13347 [Drosophila simulans] Length = 128 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust. Identities = 24/91 (26%), Positives = 46/91 (50%), Gaps = 9/91 (9%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK++GI + + + +++ A P +G+AN ++ L+K L L Sbjct: 37 NICIQILAKPGAKQNGITGIGLE-------GVGVQIAAPPSEGEANAELVKFLSKVLGLR 89 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S KII I K + + Q Sbjct: 90 KSDVSLDKGSRSRNKIIMITKGASTVEAIEQ 120 >gi|219852331|ref|YP_002466763.1| protein of unknown function DUF167 [Methanosphaerula palustris E1-9c] gi|219546590|gb|ACL17040.1| protein of unknown function DUF167 [Methanosphaerula palustris E1-9c] Length = 106 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust. Identities = 18/60 (30%), Positives = 35/60 (58%), Gaps = 1/60 (1%) Query: 21 EIPKDTSDTIH-MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 P ++ H + +T P +GKAN+A++A+L++ L + +S + +LS +S K + I Sbjct: 29 RFPAGYNEWRHAIGCSITTPPVEGKANRAIVALLSRTLTIPQSGISILSGATSSQKRVLI 88 >gi|95930347|ref|ZP_01313084.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684] gi|95133599|gb|EAT15261.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684] Length = 103 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust. Identities = 23/78 (29%), Positives = 42/78 (53%), Gaps = 8/78 (10%) Query: 11 NAKKSG--IASLEIPKDTSDTI------HMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 NA++ G IA P+ + +++ +K+++T+ P +G ANK AK L +SKS Sbjct: 11 NAQQDGVVIALFVQPRASKNSLCGLQGEELKVRLTSPPVEGAANKLCCTFFAKLLGVSKS 70 Query: 63 SLRMLSKQSSPLKIIYID 80 S+ ++ S K I ++ Sbjct: 71 SVTLIRGDKSRHKQIVVE 88 >gi|308270456|emb|CBX27068.1| UPF0235 protein PTH_1821 [uncultured Desulfobacterium sp.] Length = 106 Score = 35.4 bits (80), Expect = 2.5, Method: Compositional matrix adjust. Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 18/89 (20%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V ++P + K+ IA L D +KIK+TA P G AN + LAK L++S S++ Sbjct: 15 VYILPRSSKNMIAGL-----FGDA--LKIKLTAAPVDGSANNMCIKYLAKILSVSASNIE 67 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQNND 94 ++S + K I LL+NN+ Sbjct: 68 IVSGHTGKTKYI-----------LLKNNE 85 >gi|56695839|ref|YP_166190.1| hypothetical protein SPO0937 [Ruegeria pomeroyi DSS-3] gi|56677576|gb|AAV94242.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 92 Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust. Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 5/55 (9%) Query: 23 PKDTSDTI-----HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSS 72 PK D++ ++I VTA P+ GKAN+A+ +LA+ + ++ S L + Q++ Sbjct: 28 PKAARDSVTLAGEGLRITVTAPPEDGKANEAVRKLLARAMGVAPSRLTLRRGQTA 82 >gi|24656569|ref|NP_647784.1| CG14966 [Drosophila melanogaster] gi|7292328|gb|AAF47735.1| CG14966 [Drosophila melanogaster] gi|289526411|gb|ADD01328.1| RE68649p [Drosophila melanogaster] Length = 140 Score = 35.4 bits (80), Expect = 2.6, Method: Compositional matrix adjust. Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 9/91 (9%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK++GI + + +++ A P +G+AN ++ L+K L L Sbjct: 49 NICIQILAKPGAKQNGITGIGFE-------GVGVQIAAPPSEGEANAELVKFLSKVLGLR 101 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S KII I K + + Q Sbjct: 102 KSDVSLDKGSRSRNKIIMITKGVSTVEAIEQ 132 >gi|320101820|ref|YP_004177411.1| hypothetical protein Isop_0265 [Isosphaera pallida ATCC 43644] gi|319749102|gb|ADV60862.1| protein of unknown function DUF167 [Isosphaera pallida ATCC 43644] Length = 131 Score = 35.4 bits (80), Expect = 2.7, Method: Compositional matrix adjust. Identities = 17/40 (42%), Positives = 28/40 (70%) Query: 40 PQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 P+KGKAN A+L +LA L ++KS L ++S ++ K++ I Sbjct: 63 PEKGKANAAILEILADLLGIAKSRLDLVSGATARSKVVRI 102 >gi|195491333|ref|XP_002093518.1| GE20708 [Drosophila yakuba] gi|194179619|gb|EDW93230.1| GE20708 [Drosophila yakuba] Length = 128 Score = 35.4 bits (80), Expect = 2.8, Method: Compositional matrix adjust. Identities = 24/91 (26%), Positives = 46/91 (50%), Gaps = 9/91 (9%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK++GI + + + +++ A P +G+AN ++ L+K L L Sbjct: 37 NICIQILAKPGAKQNGITGIGLE-------GVGVQIAAPPSEGEANAELVKFLSKVLGLR 89 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S KII I K + + Q Sbjct: 90 KSDVSLDKGSRSRNKIIMITKGVSTVEAIEQ 120 >gi|24214436|ref|NP_711917.1| hypothetical protein LA_1736 [Leptospira interrogans serovar Lai str. 56601] gi|45657916|ref|YP_002002.1| hypothetical protein LIC12068 [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|29839712|sp|Q8F5E6|Y1736_LEPIN RecName: Full=UPF0235 protein LA_1736 gi|73921077|sp|Q72QP5|Y2068_LEPIC RecName: Full=UPF0235 protein LIC_12068 gi|24195381|gb|AAN48935.1|AE011350_4 conserved hypothetical protein [Leptospira interrogans serovar Lai str. 56601] gi|45601157|gb|AAS70639.1| conserved hypothetical protein [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 73 Score = 35.4 bits (80), Expect = 2.9, Method: Compositional matrix adjust. Identities = 17/39 (43%), Positives = 29/39 (74%) Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 +GKAN+A++ ++K++ + KS +R+LS Q + KII ID Sbjct: 34 EGKANEAVIESISKEMKVPKSKIRILSGQKNKKKIIEID 72 >gi|86136531|ref|ZP_01055110.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193] gi|85827405|gb|EAQ47601.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193] Length = 91 Score = 35.0 bits (79), Expect = 3.0, Method: Compositional matrix adjust. Identities = 23/70 (32%), Positives = 40/70 (57%), Gaps = 6/70 (8%) Query: 17 IASLEIPKDTSDTI-----HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQS 71 IA PK S++I +K+ VT+ P+ GKA +A+ ++LAK + ++ S L + + Sbjct: 22 IAVRVTPKAASNSISVSEAGLKVTVTSVPENGKATEAVRSLLAKAMGVAASKLDLSQGAT 81 Query: 72 SPLKI-IYID 80 S K+ +Y D Sbjct: 82 SRNKVFVYTD 91 >gi|57235102|ref|YP_182004.1| hypothetical protein DET1292 [Dehalococcoides ethenogenes 195] gi|123618390|sp|Q3Z6Z5|Y1292_DEHE1 RecName: Full=UPF0235 protein DET1292 gi|57225550|gb|AAW40607.1| conserved hypothetical protein [Dehalococcoides ethenogenes 195] Length = 97 Score = 35.0 bits (79), Expect = 3.2, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 42/74 (56%), Gaps = 7/74 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V ++++P+A+++ + E +KIK+ A P+KGKANKA++ L++ L KS Sbjct: 10 VNLKILPSAQRNELTGYE-------NGLLKIKIAAQPEKGKANKALVDYLSELLDTPKSE 62 Query: 64 LRMLSKQSSPLKII 77 + + S K++ Sbjct: 63 IEICRGLSGRNKVV 76 >gi|323702139|ref|ZP_08113806.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans DSM 574] gi|323532826|gb|EGB22698.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans DSM 574] Length = 95 Score = 35.0 bits (79), Expect = 3.3, Method: Compositional matrix adjust. Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A K+ +A E+ +K+++TA P G AN+A LA+ ++KS++ Sbjct: 15 VRVQPRAAKNSLAG-EMEG------ALKVRLTAPPVDGAANEACCRFLAEVFGVAKSNVE 67 Query: 66 MLSKQSSPLKIIYI 79 ++S + K++ + Sbjct: 68 IISGHTGRNKVVRV 81 >gi|268326229|emb|CBH39817.1| conserved hypothetical protein, DUF167 family [uncultured archaeon] Length = 70 Score = 35.0 bits (79), Expect = 3.4, Method: Compositional matrix adjust. Identities = 24/81 (29%), Positives = 43/81 (53%), Gaps = 12/81 (14%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + +++IPN+K I E M I+V P KGKANKA++ +L++ Sbjct: 1 MKRIAIKVIPNSKTEEIIYAE---------PMIIRVKEPPTKGKANKAVVMLLSRYF--- 48 Query: 61 KSSLRMLSKQSSPLKIIYIDK 81 + +R++S S KI+ +++ Sbjct: 49 NADVRIVSGAKSRRKIVEVEE 69 >gi|322418850|ref|YP_004198073.1| hypothetical protein GM18_1329 [Geobacter sp. M18] gi|320125237|gb|ADW12797.1| protein of unknown function DUF167 [Geobacter sp. M18] Length = 104 Score = 35.0 bits (79), Expect = 3.5, Method: Compositional matrix adjust. Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P A +S I PKD ++I++T+ P ANK + ++AK L L+KS + Sbjct: 19 VHVQPRASRSEICG---PKDGE----LRIRLTSPPVDDAANKQCVELIAKSLGLAKSKVS 71 Query: 66 MLSKQSSPLKIIYID 80 + S S K++ ++ Sbjct: 72 IKSGAKSRHKVVRVE 86 >gi|296284807|ref|ZP_06862805.1| hypothetical protein CbatJ_14361 [Citromicrobium bathyomarinum JL354] Length = 92 Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 8/74 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A+ +A ++ +++KV A PQ G AN A+ ++AK L + KS Sbjct: 25 VRVTPGARTESLAIVDG--------GVQVKVRAKPQDGAANVAVAELVAKALGIPKSRCT 76 Query: 66 MLSKQSSPLKIIYI 79 +L +S K++ + Sbjct: 77 LLRGATSREKVLGV 90 >gi|296133363|ref|YP_003640610.1| protein of unknown function DUF167 [Thermincola sp. JR] gi|296031941|gb|ADG82709.1| protein of unknown function DUF167 [Thermincola potens JR] Length = 106 Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust. Identities = 17/57 (29%), Positives = 35/57 (61%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITE 88 +K+K+TA P +G AN+A + A+ +++KS + +++ +S K++ + KE E Sbjct: 34 LKVKLTAPPVEGAANEACIRFFAELFSVAKSQVEIITGHTSRTKLLKVKGLTKEEAE 90 >gi|325982093|ref|YP_004294495.1| hypothetical protein NAL212_1446 [Nitrosomonas sp. AL212] gi|325531612|gb|ADZ26333.1| protein of unknown function DUF167 [Nitrosomonas sp. AL212] Length = 113 Score = 35.0 bits (79), Expect = 3.6, Method: Compositional matrix adjust. Identities = 21/70 (30%), Positives = 34/70 (48%), Gaps = 6/70 (8%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKE------ 85 +KI++ A P +GKAN ++ LA + + +R+ S K+I I + + Sbjct: 38 LKIRLAAAPVEGKANSTLIKFLAAQFDVPIGQVRLKQGGKSRHKVIVIHRSVHDPRVLFN 97 Query: 86 ITELLQNNDS 95 I LL NN S Sbjct: 98 IEGLLMNNSS 107 >gi|73748976|ref|YP_308215.1| hypothetical protein cbdb_A1230 [Dehalococcoides sp. CBDB1] gi|147669743|ref|YP_001214561.1| hypothetical protein DehaBAV1_1103 [Dehalococcoides sp. BAV1] gi|289432972|ref|YP_003462845.1| hypothetical protein DehalGT_1029 [Dehalococcoides sp. GT] gi|123619917|sp|Q3ZYH5|Y1230_DEHSC RecName: Full=UPF0235 protein cbdbA1230 gi|189038739|sp|A5FQ39|Y1103_DEHSB RecName: Full=UPF0235 protein DehaBAV1_1103 gi|73660692|emb|CAI83299.1| conserved hypothetical protein [Dehalococcoides sp. CBDB1] gi|146270691|gb|ABQ17683.1| protein of unknown function DUF167 [Dehalococcoides sp. BAV1] gi|288946692|gb|ADC74389.1| protein of unknown function DUF167 [Dehalococcoides sp. GT] Length = 97 Score = 35.0 bits (79), Expect = 3.7, Method: Compositional matrix adjust. Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 7/74 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +++IP+A+K+ +A E +K+K+ A P+KGKANK ++ L+ L K+ Sbjct: 10 VNLKIIPSARKNELAGYE-------NGLLKLKIAAQPEKGKANKELIDYLSDLLDTPKAE 62 Query: 64 LRMLSKQSSPLKII 77 + + + K++ Sbjct: 63 IEICHGHTGRNKVL 76 >gi|195428527|ref|XP_002062324.1| GK17477 [Drosophila willistoni] gi|194158409|gb|EDW73310.1| GK17477 [Drosophila willistoni] Length = 127 Score = 35.0 bits (79), Expect = 3.8, Method: Compositional matrix adjust. Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 9/91 (9%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+ ++++ P AK++GI + + + +++ A P +G+AN ++ L+K L L Sbjct: 36 NIAIKILAKPGAKQNGITDIGLE-------GVGVQIAAPPSEGEANAELVKYLSKVLGLR 88 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S KII + K + L+ Sbjct: 89 KSDVSLDKGSRSRNKIILVSKGVSTVEACLE 119 >gi|189219304|ref|YP_001939945.1| hypothetical protein Minf_1293 [Methylacidiphilum infernorum V4] gi|189186162|gb|ACD83347.1| Uncharacterized conserved protein [Methylacidiphilum infernorum V4] Length = 97 Score = 35.0 bits (79), Expect = 3.8, Method: Compositional matrix adjust. Identities = 24/77 (31%), Positives = 45/77 (58%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ NAKK+ EI +D +KI+++A P +GKAN A+L+ L+ +L + K Sbjct: 6 LFVKVQANAKKT-----EICGSYADA--LKIRLSAPPVEGKANDALLSFLSLRLCVPKRL 58 Query: 64 LRMLSKQSSPLKIIYID 80 +R+ + + K + I+ Sbjct: 59 IRIEKGEKNSKKTVVIE 75 >gi|152991960|ref|YP_001357681.1| hypothetical protein SUN_0364 [Sulfurovum sp. NBC37-1] gi|151423821|dbj|BAF71324.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1] Length = 95 Score = 34.7 bits (78), Expect = 4.0, Method: Compositional matrix adjust. Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 5/61 (8%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +KI++ A +G ANK ++ LAK + KS + S Q+S +KI+ +TE Q Sbjct: 34 IKIRIKAPAVEGAANKELMKFLAKSFKVPKSDIIFKSGQNSKIKIVEFP-----LTEKFQ 88 Query: 92 N 92 N Sbjct: 89 N 89 >gi|119946689|ref|YP_944369.1| hypothetical protein Ping_3071 [Psychromonas ingrahamii 37] gi|119865293|gb|ABM04770.1| hypothetical protein DUF167 [Psychromonas ingrahamii 37] Length = 111 Score = 34.7 bits (78), Expect = 4.0, Method: Compositional matrix adjust. Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 7/51 (13%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 P AKK + + I ++++VTATP G+A M+ LAK+ ++S Sbjct: 29 PGAKKDAVGKAQ-------GIQLRVRVTATPVAGRATDHMVRFLAKEFSVS 72 >gi|218437671|ref|YP_002376000.1| hypothetical protein PCC7424_0673 [Cyanothece sp. PCC 7424] gi|226708012|sp|B7KEV7|Y673_CYAP7 RecName: Full=UPF0235 protein PCC7424_0673 gi|218170399|gb|ACK69132.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7424] Length = 73 Score = 34.7 bits (78), Expect = 4.1, Method: Compositional matrix adjust. Identities = 27/74 (36%), Positives = 42/74 (56%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAK I E D S I +K + P +GKAN+ ++ +LA+K ++KS + Sbjct: 5 VKVKPNAKHQKIEEAE---DGSLIISLK----SPPVEGKANQELIKLLAQKYRVTKSQIS 57 Query: 66 MLSKQSSPLKIIYI 79 + S SS K+I I Sbjct: 58 IQSGLSSRNKLIEI 71 >gi|206889548|ref|YP_002248254.1| hypothetical protein THEYE_A0407 [Thermodesulfovibrio yellowstonii DSM 11347] gi|206741486|gb|ACI20543.1| conserved hypothetical protein [Thermodesulfovibrio yellowstonii DSM 11347] Length = 87 Score = 34.7 bits (78), Expect = 4.1, Method: Compositional matrix adjust. Identities = 22/67 (32%), Positives = 39/67 (58%), Gaps = 7/67 (10%) Query: 11 NAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQ 70 AK +GI +E +T +K+++ A P G ANK ++ ML++ L + KS + ++ + Sbjct: 21 GAKITGIGGIE-----GNT--LKLRLAAQPHDGLANKELIEMLSEILNIPKSRIEIIKGK 73 Query: 71 SSPLKII 77 +S KII Sbjct: 74 TSKHKII 80 >gi|119493536|ref|ZP_01624202.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106] gi|119452653|gb|EAW33834.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106] Length = 81 Score = 34.7 bits (78), Expect = 4.2, Method: Compositional matrix adjust. Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K+ + E + + IH+K + P GKAN+ ++ +LAK+ + KS + Sbjct: 6 VKVKPNSKQQSMVKNE---EGTYIIHLK----SPPIDGKANQELIKILAKQFNIPKSQVS 58 Query: 66 MLSKQSSPLKIIYIDKDC 83 + S SS K+I + C Sbjct: 59 IKSGLSSKNKLIELPDSC 76 >gi|121535253|ref|ZP_01667067.1| protein of unknown function DUF167 [Thermosinus carboxydivorans Nor1] gi|121306138|gb|EAX47066.1| protein of unknown function DUF167 [Thermosinus carboxydivorans Nor1] Length = 100 Score = 34.7 bits (78), Expect = 4.3, Method: Compositional matrix adjust. Identities = 21/74 (28%), Positives = 41/74 (55%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +++ P A ++ + L D+ +K+ V + P +G+AN+A LA A ++K+ + Sbjct: 19 IKVQPRASRNAVIGL-----AGDS--LKVCVASPPVEGEANQACLAFFAALFGVAKTRIV 71 Query: 66 MLSKQSSPLKIIYI 79 ++S Q S K+I I Sbjct: 72 LVSGQKSRSKVIKI 85 >gi|299133137|ref|ZP_07026332.1| protein of unknown function DUF167 [Afipia sp. 1NLS2] gi|298593274|gb|EFI53474.1| protein of unknown function DUF167 [Afipia sp. 1NLS2] Length = 108 Score = 34.7 bits (78), Expect = 4.3, Method: Compositional matrix adjust. Identities = 24/88 (27%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + + +E + + +K++V + G+AN+A+ + A+ L + KS Sbjct: 14 VAVRVTPRGGRDAVDGIE--ELANGKSVVKVRVRVAAEGGEANRAVTELFAELLRVPKSK 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +R+ S +S +K + I+ D K++ E L+ Sbjct: 72 VRVASGVTSRVKQLTIEGDPKQLGEALK 99 >gi|320108335|ref|YP_004183925.1| hypothetical protein AciPR4_3175 [Terriglobus saanensis SP1PR4] gi|319926856|gb|ADV83931.1| protein of unknown function DUF167 [Terriglobus saanensis SP1PR4] Length = 104 Score = 34.7 bits (78), Expect = 4.4, Method: Compositional matrix adjust. Identities = 23/72 (31%), Positives = 39/72 (54%), Gaps = 7/72 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P AK+SG+ + +KI + A GKAN+A++ +A L + + S+ Sbjct: 20 VRVQPGAKRSGVVGIYGEA-------VKIALVAPAVDGKANEALVRFVATLLDVPRMSVE 72 Query: 66 MLSKQSSPLKII 77 +LS SS K++ Sbjct: 73 ILSGVSSRSKVV 84 >gi|262201697|ref|YP_003272905.1| hypothetical protein Gbro_1753 [Gordonia bronchialis DSM 43247] gi|262085044|gb|ACY21012.1| protein of unknown function DUF167 [Gordonia bronchialis DSM 43247] Length = 75 Score = 34.7 bits (78), Expect = 4.5, Method: Compositional matrix adjust. Identities = 21/70 (30%), Positives = 41/70 (58%), Gaps = 6/70 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V+V + PN++K + +E D + TI+++ T +G+ANKA+ +LAK L + KS Sbjct: 4 QVVVTVKPNSRKGPL--VETGPDGTVTIYVREPAT----EGRANKAVAELLAKHLGVPKS 57 Query: 63 SLRMLSKQSS 72 + ++ ++ Sbjct: 58 KVALVGGATA 67 >gi|189913153|ref|YP_001965041.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913490|ref|YP_001964718.1| hypothetical protein LEPBI_p0042 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777829|gb|ABZ96128.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781558|gb|ABZ99854.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 75 Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust. Identities = 21/76 (27%), Positives = 42/76 (55%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++++ N K I SLE +T +K + P KGKAN+ ++ +L+K ++K Sbjct: 3 IVIKVKSNQK---IQSLEFKSETECIAKLK----SLPVKGKANQELVGLLSKHYGVTKKE 55 Query: 64 LRMLSKQSSPLKIIYI 79 ++++S S +K + I Sbjct: 56 IQIISGHFSNIKTVEI 71 >gi|46446134|ref|YP_007499.1| hypothetical protein pc0500 [Candidatus Protochlamydia amoebophila UWE25] gi|46399775|emb|CAF23224.1| conserved hypothetical protein [Candidatus Protochlamydia amoebophila UWE25] Length = 92 Score = 34.7 bits (78), Expect = 5.1, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 30/50 (60%) Query: 31 HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 +KI++ A P KG+AN ++ L+ + KSS++++ Q+S K I I Sbjct: 31 ELKIRLAAIPDKGQANTELIRFLSSLFKIRKSSIQLIQGQTSRHKKICIQ 80 >gi|303272231|ref|XP_003055477.1| predicted protein [Micromonas pusilla CCMP1545] gi|226463451|gb|EEH60729.1| predicted protein [Micromonas pusilla CCMP1545] Length = 492 Score = 34.3 bits (77), Expect = 5.1, Method: Composition-based stats. Identities = 19/50 (38%), Positives = 30/50 (60%), Gaps = 1/50 (2%) Query: 14 KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 K+GIA+LE+P D ++ K+ + A PQ + +A+L M AKK +S Sbjct: 354 KAGIAALELPSDVAELARAKLCLCA-PQMRRCWEALLFMTAKKAGAPDTS 402 >gi|330505393|ref|YP_004382262.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01] gi|328919679|gb|AEB60510.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01] Length = 96 Score = 34.3 bits (77), Expect = 5.3, Method: Compositional matrix adjust. Identities = 18/44 (40%), Positives = 26/44 (59%), Gaps = 3/44 (6%) Query: 24 KDTSDTIH---MKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 KD +H +KI++TA P GKAN + A LAK ++KS + Sbjct: 23 KDEFAGLHGERLKIRLTAPPVDGKANAHLQAFLAKAFGVAKSQV 66 >gi|254466397|ref|ZP_05079808.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687305|gb|EDZ47787.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 105 Score = 34.3 bits (77), Expect = 5.4, Method: Compositional matrix adjust. Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 9/80 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR P A ++ I + E +KI VTA P+ GKAN+A+ +LA + + Sbjct: 32 AEIAVRATPKAARNAIVAAEG--------VLKISVTAVPENGKANEAIRRLLAAAMGTAA 83 Query: 62 SSLRMLSKQSSPLKI-IYID 80 S L + +S K+ +Y+ Sbjct: 84 SRLELRRGAASRDKLFVYLG 103 >gi|118794573|ref|XP_321597.3| AGAP001528-PA [Anopheles gambiae str. PEST] gi|116116359|gb|EAA01322.3| AGAP001528-PA [Anopheles gambiae str. PEST] Length = 147 Score = 34.3 bits (77), Expect = 5.7, Method: Compositional matrix adjust. Identities = 30/91 (32%), Positives = 46/91 (50%), Gaps = 10/91 (10%) Query: 3 NVIVRLI--PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 N+IV+++ P AK SGI D S+ + ++ A P G+AN ++ L+K L L Sbjct: 52 NLIVKILAKPGAKTSGIT------DVSEE-GIGCQIAAPPIDGEANTELIKYLSKLLDLR 104 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 KS + + S K I +DK C+ E L Sbjct: 105 KSDISLDRGSKSRQKTIVLDKAGCRHSPEQL 135 >gi|209525443|ref|ZP_03273983.1| protein of unknown function DUF167 [Arthrospira maxima CS-328] gi|209494123|gb|EDZ94438.1| protein of unknown function DUF167 [Arthrospira maxima CS-328] Length = 73 Score = 34.3 bits (77), Expect = 5.9, Method: Compositional matrix adjust. Identities = 21/55 (38%), Positives = 36/55 (65%), Gaps = 4/55 (7%) Query: 25 DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 D S T+++K + P GKAN+ ++ +LAKKL + KS++++ S SS K++ I Sbjct: 22 DGSLTVYLK----SPPVDGKANQELIKLLAKKLDVPKSNIKIKSGLSSRRKLVEI 72 >gi|332178747|gb|AEE14436.1| UPF0235 protein yggU [Thermodesulfobium narugense DSM 14796] Length = 86 Score = 34.3 bits (77), Expect = 5.9, Method: Compositional matrix adjust. Identities = 23/74 (31%), Positives = 42/74 (56%), Gaps = 8/74 (10%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +++ PNAKK S+EI KD + KV+A P+ GKAN+ ++ ++++ + + Sbjct: 7 LKVTPNAKKE---SIEI-KDG----KIYCKVSAPPEDGKANRRVIELISEFFDCKRKDVE 58 Query: 66 MLSKQSSPLKIIYI 79 + S + S KI+ I Sbjct: 59 IFSGEKSKNKILLI 72 >gi|209883467|ref|YP_002287324.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5] gi|226706162|sp|B6JAU6|Y4310_OLICO RecName: Full=UPF0235 protein OCAR_4310 gi|209871663|gb|ACI91459.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5] Length = 106 Score = 34.3 bits (77), Expect = 6.2, Method: Compositional matrix adjust. Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 2/93 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + + +E+ + + ++++V A + G+AN+A+ + A L + KS Sbjct: 14 VAVRVTPRGGRDAVDGIEMLANGKSVVKVRVRVAA--EGGEANRAVTELFAGLLRVPKSK 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 +++ S +S +K I I+ D K++ E L+ S+ Sbjct: 72 VKVASGVTSRIKQIAIEGDPKQLGEALKAATSI 104 >gi|330431471|gb|AEC16530.1| hypothetical protein UMN179_00494 [Gallibacterium anatis UMN179] Length = 95 Score = 33.9 bits (76), Expect = 6.8, Method: Compositional matrix adjust. Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 9/79 (11%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + N+I++ P A K I L D +KI +TA P GKAN +L L+K+ ++ Sbjct: 11 LLNIILQ--PKAGKDQIVGL-----YGD--ELKITITAPPIDGKANAHLLKFLSKQFKVA 61 Query: 61 KSSLRMLSKQSSPLKIIYI 79 K+ + + + S K ++I Sbjct: 62 KTQIELRKGELSRHKQVFI 80 >gi|94496543|ref|ZP_01303119.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58] gi|94423903|gb|EAT08928.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58] Length = 107 Score = 33.9 bits (76), Expect = 6.9, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 43/74 (58%), Gaps = 2/74 (2%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +RL P + + GI + +D + +V A P+KG+AN A++A+LA+ L KS++ Sbjct: 15 IRLTPGSARQGIGG--VWRDDRAAPWLTARVRAVPEKGRANTALIALLAQALDWPKSAIM 72 Query: 66 MLSKQSSPLKIIYI 79 + S S+ LK + I Sbjct: 73 LESGDSNRLKRLRI 86 >gi|7508740|pir||T26031 hypothetical protein W01A8.2 - Caenorhabditis elegans Length = 263 Score = 33.9 bits (76), Expect = 7.1, Method: Compositional matrix adjust. Identities = 20/71 (28%), Positives = 37/71 (52%), Gaps = 7/71 (9%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P AKKS + ++ D+ + + + A P++G AN+ +++ L L L K+ L+ Sbjct: 45 PGAKKSCVVAI------GDS-EVDVAIGAAPREGAANEELISYLMSALGLRKNELQFDKG 97 Query: 70 QSSPLKIIYID 80 S K++ ID Sbjct: 98 AKSRSKVVLID 108 >gi|78222380|ref|YP_384127.1| hypothetical protein Gmet_1164 [Geobacter metallireducens GS-15] gi|78193635|gb|ABB31402.1| protein of unknown function DUF167 [Geobacter metallireducens GS-15] Length = 102 Score = 33.9 bits (76), Expect = 7.5, Method: Compositional matrix adjust. Identities = 23/74 (31%), Positives = 41/74 (55%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P A K+GI ++ D I K+++T+ P +G+AN+ LAK L + KS++ Sbjct: 23 VHVQPRASKNGICGIQ-----GDAI--KLRLTSPPVEGEANRLCTEYLAKLLKVPKSAVT 75 Query: 66 MLSKQSSPLKIIYI 79 +++ S K I + Sbjct: 76 IIAGDKSRHKTIRV 89 >gi|21674646|ref|NP_662711.1| hypothetical protein CT1832 [Chlorobium tepidum TLS] gi|29839718|sp|Q8KBF5|Y1832_CHLTE RecName: Full=UPF0235 protein CT1832 gi|21647849|gb|AAM73053.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 105 Score = 33.9 bits (76), Expect = 7.5, Method: Compositional matrix adjust. Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P + KSG+A + +KI + + P ANK +LAK L + +SS+ Sbjct: 15 VRVQPRSSKSGVAGMY-------GEQLKICLKSAPVDNAANKECCELLAKALGVPRSSVS 67 Query: 66 MLSKQSSPLKIIYID 80 ++ SS K++ ++ Sbjct: 68 VMKGASSRSKVLKVE 82 >gi|296126259|ref|YP_003633511.1| hypothetical protein Bmur_1218 [Brachyspira murdochii DSM 12563] gi|296018075|gb|ADG71312.1| protein of unknown function DUF167 [Brachyspira murdochii DSM 12563] Length = 84 Score = 33.9 bits (76), Expect = 8.0, Method: Compositional matrix adjust. Identities = 19/60 (31%), Positives = 33/60 (55%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ A GKANKA++ LA +L + K + +L + S KII I+ + ++ E N Sbjct: 25 VRIMAKAIDGKANKAIIEFLADELNIKKKDIDILKGEKSSKKIIAINIEENKLKEYFSKN 84 >gi|332039865|gb|EGI76260.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624] Length = 115 Score = 33.5 bits (75), Expect = 8.8, Method: Compositional matrix adjust. Identities = 22/76 (28%), Positives = 43/76 (56%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + ++PNA ++ I L D + +K+++ A P GKAN+A+ LAK L++ SS Sbjct: 25 VDLHVMPNASRTQIQGLF---DGA----LKVRLQAPPVDGKANEALRVWLAKTLSIPNSS 77 Query: 64 LRMLSKQSSPLKIIYI 79 + + ++ K +++ Sbjct: 78 VTLQHGATARRKQLHV 93 >gi|83648992|ref|YP_437427.1| hypothetical protein HCH_06357 [Hahella chejuensis KCTC 2396] gi|83637035|gb|ABC33002.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396] Length = 102 Score = 33.5 bits (75), Expect = 9.1, Method: Compositional matrix adjust. Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 7/72 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P AKK EI D +KIK++A P G+AN+ ++ LAK + + +++L Sbjct: 24 LQPGAKKD-----EIVGTHGDA--LKIKISAPPIDGRANQQLVRFLAKLCRVKQQDVQIL 76 Query: 68 SKQSSPLKIIYI 79 + +SS K I + Sbjct: 77 AGESSRQKRIRV 88 >gi|30248412|ref|NP_840482.1| hypothetical protein NE0395 [Nitrosomonas europaea ATCC 19718] gi|47117515|sp|Q82X93|Y395_NITEU RecName: Full=UPF0235 protein NE0395 gi|30138298|emb|CAD84306.1| DUF167 [Nitrosomonas europaea ATCC 19718] Length = 100 Score = 33.5 bits (75), Expect = 9.4, Method: Compositional matrix adjust. Identities = 15/49 (30%), Positives = 29/49 (59%) Query: 31 HMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 +KIK+ A P GKAN+A+ LAK+ + + ++ + + S K++ + Sbjct: 35 ELKIKLAALPVDGKANRALTEFLAKRFNVPRKNITLKRGEQSRHKVVEV 83 >gi|152991249|ref|YP_001356971.1| hypothetical protein NIS_1507 [Nitratiruptor sp. SB155-2] gi|151423110|dbj|BAF70614.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2] Length = 95 Score = 33.5 bits (75), Expect = 9.7, Method: Compositional matrix adjust. Identities = 27/93 (29%), Positives = 47/93 (50%), Gaps = 13/93 (13%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ ++ PNA K+ IA + D++ + IK A +G ANK ++ L+K ++K Sbjct: 10 VHMFIKAQPNASKNKIAGI-----LGDSLKIAIKAPAV--EGAANKELVKFLSKTFKVAK 62 Query: 62 SSLRMLSKQSSPLKIIY------IDKDCKEITE 88 S + S ++S K I ID KE+T+ Sbjct: 63 SDIVFASGETSKRKHIVMPYNKKIDDFIKELTQ 95 >gi|225849498|ref|YP_002729663.1| hypothetical protein SULAZ_1705 [Sulfurihydrogenibium azorense Az-Fu1] gi|225644124|gb|ACN99174.1| conserved domain protein [Sulfurihydrogenibium azorense Az-Fu1] Length = 72 Score = 33.5 bits (75), Expect = 9.8, Method: Compositional matrix adjust. Identities = 15/49 (30%), Positives = 31/49 (63%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 +++ T P+KGKAN ++ ++++ L + KS ++++ SS K I I+ Sbjct: 24 YEVRCTTIPEKGKANDKVIELMSEYLDVPKSRIKIIKGHSSREKEIEIE 72 Searching..................................................done Results from round 2 >gi|159185312|ref|NP_530540.1| hypothetical protein Atu2660 [Agrobacterium tumefaciens str. C58] gi|17741163|gb|AAL43641.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 183 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 35/91 (38%), Positives = 56/91 (61%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + I +E D + H+K +V+A P+ GKANKA++ +LAKKL L K Sbjct: 92 VRLSVRLTPNGGRDAIDGVEQDADGN--AHLKARVSAVPEGGKANKALIVLLAKKLGLPK 149 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 SS+ +S +++ KI+ ID D ++ +L + Sbjct: 150 SSITFISGETARKKILRIDTDPEDFEKLFKK 180 >gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] gi|254039908|gb|ACT56704.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] Length = 98 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS Sbjct: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL Sbjct: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 >gi|29839726|sp|Q8UC38|Y2660_AGRT5 RecName: Full=UPF0235 protein Atu2660 Length = 112 Score = 117 bits (294), Expect = 5e-25, Method: Composition-based stats. Identities = 35/91 (38%), Positives = 56/91 (61%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + I +E D + H+K +V+A P+ GKANKA++ +LAKKL L K Sbjct: 21 VRLSVRLTPNGGRDAIDGVEQDADGN--AHLKARVSAVPEGGKANKALIVLLAKKLGLPK 78 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 SS+ +S +++ KI+ ID D ++ +L + Sbjct: 79 SSITFISGETARKKILRIDTDPEDFEKLFKK 109 >gi|325294029|ref|YP_004279893.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3] gi|325061882|gb|ADY65573.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3] Length = 112 Score = 116 bits (291), Expect = 1e-24, Method: Composition-based stats. Identities = 33/90 (36%), Positives = 53/90 (58%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + I +E D + H+K +V+A P+ GKANKA++ +LAKK L KS Sbjct: 22 RLSVRLTPNGGRDAIDGVEQDADGN--AHLKARVSAVPEGGKANKALVILLAKKFGLPKS 79 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +S +++ KI+ ID D ++ L + Sbjct: 80 PITFISGETARKKILRIDTDPEDFETLFRK 109 >gi|116254232|ref|YP_770070.1| hypothetical protein RL4503 [Rhizobium leguminosarum bv. viciae 3841] gi|166227262|sp|Q1MAP9|Y4503_RHIL3 RecName: Full=UPF0235 protein RL4503 gi|115258880|emb|CAK09988.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 103 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 56/89 (62%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + + +E D +K +VTA P+KGKANKA++ ++AK L + KS Sbjct: 13 RLAVRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALMLLIAKSLRIPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ ++S +++ KI+ ID D +++ + L+ Sbjct: 71 SVSLVSGETARKKILRIDGDPEDLVKKLE 99 >gi|241206712|ref|YP_002977808.1| hypothetical protein Rleg_4028 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860602|gb|ACS58269.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 103 Score = 115 bits (287), Expect = 3e-24, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 55/89 (61%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + + +E D +K +VTA P+KGKANKA++ ++A+ L + KS Sbjct: 13 RLAVRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALILLIAQSLRIPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ ++S ++ KI+ ID D +++ + L+ Sbjct: 71 SVSLISGDTARKKILRIDGDPEDLVKKLE 99 >gi|218680333|ref|ZP_03528230.1| hypothetical protein RetlC8_16135 [Rhizobium etli CIAT 894] Length = 103 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 30/90 (33%), Positives = 56/90 (62%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + I +E D ++ +VT+ P+KGKANKA++ ++A+ L + KS Sbjct: 13 RLAVRLTPNGGRDAIDGIE--ADGEGETFLRARVTSVPEKGKANKALILLVAQSLRIPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ ++S +++ KI+ ID D +++ + L+ Sbjct: 71 SISLVSGETARKKILRIDGDPEDLAKKLET 100 >gi|190893761|ref|YP_001980303.1| hypothetical protein RHECIAT_CH0004196 [Rhizobium etli CIAT 652] gi|226706142|sp|B3PQB3|Y4196_RHIE6 RecName: Full=UPF0235 protein RHECIAT_CH0004196 gi|190699040|gb|ACE93125.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 103 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 30/90 (33%), Positives = 58/90 (64%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E + ++K +VTA P+KGKANKA++A+++K L ++KS Sbjct: 13 RLTVRLTPNGGRDAFDGIETGSEGE--TYLKARVTAIPEKGKANKALIALVSKSLGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ ++S +++ KI+ I+ D +++ + L+ Sbjct: 71 SITLVSGETARKKILRIEGDPEDLAKKLET 100 >gi|222087485|ref|YP_002546022.1| hypothetical protein Arad_4365 [Agrobacterium radiobacter K84] gi|221724933|gb|ACM28089.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 104 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 34/91 (37%), Positives = 58/91 (63%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + + +E D ++K +V+A P+KGKANKA++A+LAK+L++ K Sbjct: 12 VRLSVRLTPNGGRDAVDGIETGADGE--AYLKARVSAVPEKGKANKALIALLAKRLSIPK 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 SSL ++S ++ KI+ ID D +++ L+ Sbjct: 70 SSLSLISGDTARKKILRIDGDPEDLIGRLKA 100 >gi|327188872|gb|EGE56064.1| hypothetical protein RHECNPAF_750023 [Rhizobium etli CNPAF512] Length = 103 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 30/90 (33%), Positives = 59/90 (65%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E D+ ++K +VTA P+KGKANKA++A+++K + ++KS Sbjct: 13 RLTVRLTPNGGRDAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSKSVGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ ++S +++ KI+ I+ D +++ + L+ Sbjct: 71 SITLVSGETARKKILRIEGDPEDLAKKLET 100 >gi|209551279|ref|YP_002283196.1| hypothetical protein Rleg2_3707 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|226705832|sp|B5ZTD4|Y3707_RHILW RecName: Full=UPF0235 protein Rleg2_3707 gi|209537035|gb|ACI56970.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 103 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. Identities = 31/91 (34%), Positives = 55/91 (60%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +RL PN + I E D ++K +VT P+KGKANKA++ ++AK L ++K Sbjct: 12 VRLAIRLTPNGGRDAIDGAET--DGEGEAYLKTRVTTVPEKGKANKALILLIAKSLGIAK 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 SS+ ++S ++ KI+ ID D +++ + L+ Sbjct: 70 SSVSLVSGDTARKKILRIDGDPEDLGKKLET 100 >gi|86359493|ref|YP_471385.1| hypothetical protein RHE_CH03912 [Rhizobium etli CFN 42] gi|123510540|sp|Q2K3C8|Y3912_RHIEC RecName: Full=UPF0235 protein RHE_CH03912 gi|86283595|gb|ABC92658.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 112 Score = 111 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 28/89 (31%), Positives = 57/89 (64%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + ++ D+ ++ +VTA P+KGKANKA++A+++K + ++KS Sbjct: 22 RLTVRLTPNGGRDAFDGIDT--DSEGETYLGARVTAVPEKGKANKALIALVSKSVGVAKS 79 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ ++S +++ KI+ I+ D +++ L+ Sbjct: 80 SVSVISGETARKKILRIEGDPEDLARKLE 108 >gi|222150063|ref|YP_002551020.1| hypothetical protein Avi_4153 [Agrobacterium vitis S4] gi|221737045|gb|ACM38008.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 104 Score = 110 bits (276), Expect = 5e-23, Method: Composition-based stats. Identities = 31/90 (34%), Positives = 60/90 (66%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + GI +++ + H+K++V+ P+KG+ANKA++A+LAK+L ++KS Sbjct: 13 RLAVRLTPNGGRDGIDGVDVNANGE--AHLKVRVSDVPEKGRANKALIALLAKRLGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ ++S ++ KI+ ID D +++ L+ Sbjct: 71 AVSLISGDAARQKILRIDGDPEDLIGRLET 100 >gi|218461695|ref|ZP_03501786.1| hypothetical protein RetlK5_20376 [Rhizobium etli Kim 5] Length = 103 Score = 110 bits (275), Expect = 6e-23, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 59/90 (65%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E D+ ++K +VTA P+KGKANKA++A++++ + ++KS Sbjct: 13 RLTVRLTPNGGRGAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSQSVGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ ++S +++ KI+ I+ D +++ + L+ Sbjct: 71 SVSLVSGETARKKILRIEGDPEDLAQKLEK 100 >gi|49473825|ref|YP_031867.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse] gi|49239328|emb|CAF25660.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse] Length = 114 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP A I +E D + I++ P+ GKANKA++ LAK+ + S Sbjct: 12 LFVYLIPKASVDKIIGVECRDDGKQR--LVIRLRTLPENGKANKALIKFLAKQWKIPSSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S ++S K +Y +E+ E+LQ+ Sbjct: 70 ISLKSGETSRYKQLYFSGYLQEVGEILQS 98 >gi|319899260|ref|YP_004159353.1| hypothetical protein BARCL_1102 [Bartonella clarridgeiae 73] gi|319403224|emb|CBI76783.1| conserved protein of unknown function [Bartonella clarridgeiae 73] Length = 110 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 35/96 (36%), Positives = 52/96 (54%), Gaps = 5/96 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D H+ I++ A P+ GKAN+A++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVEDRDDGKQ--HLIIRLRAIPENGKANRALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ---NNDSL 96 + + S +S K +Y K KEI ++LQ N D L Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEIEQILQSFRNYDFL 105 >gi|49474981|ref|YP_033022.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1] gi|49237786|emb|CAF26979.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1] Length = 111 Score = 107 bits (267), Expect = 5e-22, Method: Composition-based stats. Identities = 27/89 (30%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V +IP A I +E D H+ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVYIIPKASGDKIMGIECKNDGK--RHLVIRLRAIPENGKANKALIKFLAKQWKIPSSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y +++ E+L+ Sbjct: 70 ISLKSGGTSRYKQLYFSGYLEKLKEILRA 98 >gi|319406122|emb|CBI79752.1| conserved hypothetical protein [Bartonella sp. AR 15-3] Length = 107 Score = 106 bits (266), Expect = 7e-22, Method: Composition-based stats. Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D H+ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRGDGKQ--HLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K +Y K KEI ++LQ Sbjct: 70 ISLGSGATSHYKQLYFSKYIKEIEQILQ 97 >gi|227823716|ref|YP_002827689.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234] gi|227342718|gb|ACP26936.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234] Length = 105 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 32/88 (36%), Positives = 52/88 (59%), Gaps = 2/88 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + I EI D + H+K++V A P+KGKAN A++ +LAK L+K+ Sbjct: 14 RLTVRLTPNGGRDAIDGFEIAADGEE--HLKVRVRAVPEKGKANDALIGLLAKAFGLAKN 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++S + KI+ I+ D + I + L Sbjct: 72 RIALVSGDTQRKKILRIEADPEAIQKRL 99 >gi|319407608|emb|CBI81258.1| conserved hypothetical protein [Bartonella sp. 1-1C] Length = 101 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y K KE+ ++LQ+ Sbjct: 70 ISLGSGTTSHYKQLYFSKYLKEVEQILQS 98 >gi|319404616|emb|CBI78222.1| conserved hypothetical protein [Bartonella rochalimae ATCC BAA-1498] Length = 101 Score = 105 bits (261), Expect = 3e-21, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y K KE+ ++LQ+ Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEVEKILQS 98 >gi|91975173|ref|YP_567832.1| hypothetical protein RPD_0693 [Rhodopseudomonas palustris BisB5] gi|91681629|gb|ABE37931.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisB5] Length = 107 Score = 104 bits (259), Expect = 5e-21, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A++ +LAK L + K + Sbjct: 13 VAVRVTPRGGRDEIDGLETLSDGRPVV--KVRVRAIADGGEANRAVIELLAKSLGVPKRN 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D ++ E L+ Sbjct: 71 VRLLSGATSRQKQIAIDGDPTKLGEALRR 99 >gi|86747236|ref|YP_483732.1| hypothetical protein RPB_0109 [Rhodopseudomonas palustris HaA2] gi|123293376|sp|Q2J3Y9|Y109_RHOP2 RecName: Full=UPF0235 protein RPB_0109 gi|86570264|gb|ABD04821.1| Protein of unknown function DUF167 [Rhodopseudomonas palustris HaA2] Length = 108 Score = 103 bits (257), Expect = 9e-21, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A++ +LAK L + K + Sbjct: 14 VAVRVTPRGDRDEIDGLETLSDGRPVV--KLRVRAIADGGEANRAVIELLAKALGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K + E L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKSLGETLRQ 100 >gi|218672707|ref|ZP_03522376.1| hypothetical protein RetlG_14247 [Rhizobium etli GR56] Length = 89 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 29/79 (36%), Positives = 52/79 (65%), Gaps = 2/79 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E D+ ++K +VTA P+KGKANKA++A+++K L ++KS Sbjct: 13 RLTVRLTPNGGRDAFDGIET--DSQGETYLKARVTAVPEKGKANKALIALVSKSLGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDK 81 S+ ++S +++ KI+ I+ Sbjct: 71 SVSLVSGETARKKILRIEG 89 >gi|121602415|ref|YP_988648.1| hypothetical protein BARBAKC583_0328 [Bartonella bacilliformis KC583] gi|120614592|gb|ABM45193.1| conserved hypothetical protein TIGR00251 [Bartonella bacilliformis KC583] Length = 107 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 2/87 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + S K + ++I ++L Sbjct: 70 ISLENGAISRYKQLRFSGGVEKIEKIL 96 >gi|316931646|ref|YP_004106628.1| hypothetical protein Rpdx1_0252 [Rhodopseudomonas palustris DX-1] gi|315599360|gb|ADU41895.1| protein of unknown function DUF167 [Rhodopseudomonas palustris DX-1] Length = 108 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D +K++V A G+AN+A+ +LAK + + K + Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRPV--LKVRVRAIADGGEANRAVTELLAKAVGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K++ E+L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEVLRR 100 >gi|163867463|ref|YP_001608662.1| hypothetical protein Btr_0184 [Bartonella tribocorum CIP 105476] gi|161017109|emb|CAK00667.1| conserved hypothetical protein [Bartonella tribocorum CIP 105476] Length = 108 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP + I +E ++ I++ A P+ GKANKA++ AK+ + SS Sbjct: 12 LFVYLIPKSSVDKIIGIECRDGEKQ--YLVIRLRAVPEDGKANKALIKFFAKQWKIPSSS 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K ++ +E+ ++ Q+ Sbjct: 70 ISLKSGATSRYKQLHFSTHLEELKQIWQS 98 >gi|1388023|gb|AAB88056.1| putative [Bartonella bacilliformis] Length = 109 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 2/87 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 14 LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + S K + ++I ++L Sbjct: 72 ISLENGAISRYKQLRFSGGVEKIEKIL 98 >gi|39933491|ref|NP_945767.1| hypothetical protein RPA0414 [Rhodopseudomonas palustris CGA009] gi|39647337|emb|CAE25858.1| DUF167 [Rhodopseudomonas palustris CGA009] Length = 112 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A+ +LAK + + K + Sbjct: 18 VAVRVTPRGGRDDIDGLETLSDGRPVV--KVRVRAIADGGEANRAVTELLAKAVGVPKRN 75 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K++ E L+ Sbjct: 76 VRLLSGATSRQKQIAIDGDPKQLGEALRR 104 >gi|319408144|emb|CBI81797.1| conserved hypothetical protein [Bartonella schoenbuchensis R1] Length = 108 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 26/88 (29%), Positives = 45/88 (51%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E +T H+ I++ P+ GKANKA++ L ++ + S Sbjct: 12 LFVRLIPKASMDSIVGVESRD--GETQHLVIRLRTVPEDGKANKALIKFLGRQWKIPPSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K + +E+ + LQ Sbjct: 70 ISLKSGMTSRYKQLRFSGYVEELEQKLQ 97 >gi|192288849|ref|YP_001989454.1| hypothetical protein Rpal_0418 [Rhodopseudomonas palustris TIE-1] gi|226706140|sp|B3QA92|Y418_RHOPT RecName: Full=UPF0235 protein Rpal_0418 gi|192282598|gb|ACE98978.1| protein of unknown function DUF167 [Rhodopseudomonas palustris TIE-1] Length = 108 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D + K++V A G+AN+A+ +LAK + + K + Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRPVV--KVRVRAIADGGEANRAVTELLAKAVGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+LS +S K I ID D K++ E L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEALRR 100 >gi|90421586|ref|YP_529956.1| hypothetical protein RPC_0058 [Rhodopseudomonas palustris BisB18] gi|122477773|sp|Q21D99|Y058_RHOPB RecName: Full=UPF0235 protein RPC_0058 gi|90103600|gb|ABD85637.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisB18] Length = 107 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 51/90 (56%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +V +R+ P + I LE + +K++V A + G+AN+A+ +LAK L + K Sbjct: 13 SVALRVTPRGGRDDIDGLETLANGR--TVVKVRVRAIAEGGEANRAVTELLAKALGVPKR 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++R+LS +S LK + +D D E+ E L+ Sbjct: 71 AVRVLSGTTSRLKQVAVDGDPNELGEALRK 100 >gi|188581339|ref|YP_001924784.1| hypothetical protein Mpop_2087 [Methylobacterium populi BJ001] gi|259646581|sp|B1ZLG8|Y2087_METPB RecName: Full=UPF0235 protein Mpop_2087 gi|179344837|gb|ACB80249.1| protein of unknown function DUF167 [Methylobacterium populi BJ001] Length = 110 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 46/86 (53%), Gaps = 2/86 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + D + + +V A P +G AN A+ A +AK L L KS Sbjct: 15 LAVRLTPRAGRTGLDGVRTEPDGRPILCL--RVAAPPVEGAANAALTAFVAKSLGLRKSE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + ++S +++ K +++ D + + Sbjct: 73 VTLVSGETARTKRLHLSGDPQALAAR 98 >gi|144900433|emb|CAM77297.1| protein containing DUF167 [Magnetospirillum gryphiswaldense MSR-1] Length = 107 Score = 99.3 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +RL P ++ I L D + +K VTA P+ GKAN A++ MLAK+ ++KS Sbjct: 14 RVFIRLTPKGSRNKIDGLAAEADG--GMVLKASVTAVPEDGKANAALIKMLAKEWRVAKS 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +++ + K + I D E+ L Sbjct: 72 DFEIVAGATDRRKTVLISGDGAEMAARLDQ 101 >gi|240849835|ref|YP_002971223.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup] gi|240266958|gb|ACS50546.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup] Length = 104 Score = 99.3 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP + I +E ++ I++ A P+ GKANKA++ LAK+ + SS Sbjct: 12 LFVYLIPKSSVDKIIGVECRDGEKQ--YLVIRLRAVPEDGKANKALIKFLAKQWKIPSSS 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + + S K +Y +E+ ++ Q+ Sbjct: 70 ISLKNGAISRYKQLYFSTHLEELKQIWQS 98 >gi|115522078|ref|YP_778989.1| hypothetical protein RPE_0048 [Rhodopseudomonas palustris BisA53] gi|115516025|gb|ABJ04009.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisA53] Length = 107 Score = 98.5 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 50/90 (55%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +V +R+ P + I LE + +K++V A + G+AN+A+ +LAK L + K Sbjct: 12 SVALRVTPRGGRDAIDGLETLANGR--TVVKVRVRAIAEGGEANRAVTELLAKALGVPKR 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++R+LS +S LK I +D + + + L+ Sbjct: 70 AVRVLSGTTSRLKQIAVDGNPELLGAALRK 99 >gi|296445622|ref|ZP_06887577.1| protein of unknown function DUF167 [Methylosinus trichosporium OB3b] gi|296256867|gb|EFH03939.1| protein of unknown function DUF167 [Methylosinus trichosporium OB3b] Length = 109 Score = 98.5 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 50/90 (55%), Gaps = 2/90 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P + ++ +E D +K +V A P+ G+AN+A++A++A+ L K S Sbjct: 16 LWVRLTPKGGRDALSGVETLADGRAV--LKARVRAAPEDGRANEALVALIAQALGAPKRS 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ + ++ LK ++I D + L+ + Sbjct: 74 VQIAAGHTARLKKLFIAGDPASLVAALEKS 103 >gi|209966140|ref|YP_002299055.1| hypothetical protein RC1_2875 [Rhodospirillum centenum SW] gi|209959606|gb|ACJ00243.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 117 Score = 97.0 bits (240), Expect = 7e-19, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 48/89 (53%), Gaps = 2/89 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A ++ + + +K+ VTA P+ GKAN A++A+LAK L K Sbjct: 23 VRVALRVTPKASRTAVQGPMDGPEGR--TLLKLAVTAVPEDGKANAAVIALLAKHWRLPK 80 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 SS+ ++S + K+++I D ++ + Sbjct: 81 SSMSIVSGGTDRTKVLFIAGDAADLLARI 109 >gi|315497739|ref|YP_004086543.1| hypothetical protein Astex_0706 [Asticcacaulis excentricus CB 48] gi|315415751|gb|ADU12392.1| protein of unknown function DUF167 [Asticcacaulis excentricus CB 48] Length = 89 Score = 97.0 bits (240), Expect = 7e-19, Method: Composition-based stats. Identities = 31/81 (38%), Positives = 50/81 (61%), Gaps = 2/81 (2%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M ++VRL P A + ++ D ++K++VTA P +G+AN+A++A LAK+L L Sbjct: 1 MARLVVRLTPKAAADRVDGWDM--DEQGRPYLKVRVTAPPIEGRANEALIAFLAKRLKLP 58 Query: 61 KSSLRMLSKQSSPLKIIYIDK 81 KS L +L+ SS LK I ++ Sbjct: 59 KSRLSLLAGDSSRLKQIEVEG 79 >gi|329849894|ref|ZP_08264740.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19] gi|328841805|gb|EGF91375.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19] Length = 86 Score = 97.0 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 28/82 (34%), Positives = 44/82 (53%), Gaps = 2/82 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL P + I +D +K++V A P +G+AN+A++ LAK L + KS Sbjct: 2 RLAVRLTPRSSADAIDGW--GEDEQGRRFLKVRVRAAPIEGRANEALIVFLAKTLGVPKS 59 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 L +++ +S LK I ID D Sbjct: 60 RLSLVAGDTSRLKQIEIDGDVD 81 >gi|92116228|ref|YP_575957.1| hypothetical protein Nham_0608 [Nitrobacter hamburgensis X14] gi|91799122|gb|ABE61497.1| protein of unknown function DUF167 [Nitrobacter hamburgensis X14] Length = 106 Score = 96.2 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 53/90 (58%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ +R+ P + GI +E+ D + K++V A + G+AN+A++A+LAK L + K Sbjct: 12 SIALRVTPRGGRDGIDGIEMLADGRPVV--KVRVRAIAEGGEANRAVMAVLAKALGVRKI 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+L+ +S LK + + D ++ + L+ Sbjct: 70 DVRILAGATSRLKQVAVGGDPVKLGDALRA 99 >gi|83594855|ref|YP_428607.1| hypothetical protein Rru_A3526 [Rhodospirillum rubrum ATCC 11170] gi|83577769|gb|ABC24320.1| Protein of unknown function DUF167 [Rhodospirillum rubrum ATCC 11170] Length = 113 Score = 95.4 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 32/95 (33%), Positives = 54/95 (56%), Gaps = 3/95 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +RL P A + G++ + D S + +K VTA P+ GKAN A+L +L+++ L +S Sbjct: 18 RLALRLTPKAGRDGVSGVVAEADGS--LVVKASVTAVPEDGKANAALLKLLSRQWKLPRS 75 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEIT-ELLQNNDSL 96 SL ++ Q+ K+I I + +T L+ DSL Sbjct: 76 SLAVVHGQTDRRKVIEISGEPALLTPRLVAWVDSL 110 >gi|83313474|ref|YP_423738.1| hypothetical protein amb4375 [Magnetospirillum magneticum AMB-1] gi|82948315|dbj|BAE53179.1| Uncharacterized conserved protein [Magnetospirillum magneticum AMB-1] Length = 108 Score = 95.4 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 2/88 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VRL P A + I D + + +K +VTA P+ GKAN A+L +L+K + KS Sbjct: 14 KVAVRLTPKASRDRINGPAAEADGA--VVLKAQVTAVPEDGKANAALLKLLSKAWKIPKS 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + K+I I + +++ L Sbjct: 72 DMDIVLGATDRRKVILISGETEDLRHRL 99 >gi|295691445|ref|YP_003595138.1| hypothetical protein Cseg_4109 [Caulobacter segnis ATCC 21756] gi|295433348|gb|ADG12520.1| protein of unknown function DUF167 [Caulobacter segnis ATCC 21756] Length = 94 Score = 95.0 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 47/88 (53%), Gaps = 3/88 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P + + + D ++K++V + P G AN A++A LAK L + +S+ Sbjct: 5 LAIRLTPRGGRDAVEGWAL--DPEGRPYLKVRVASPPVDGAANAALIAFLAKSLKIPRSA 62 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 +R+ S +++ +K + ID D + T Sbjct: 63 VRLASGETARIKRLEIDDVDQADFTRAF 90 >gi|148251886|ref|YP_001236471.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1] gi|146404059|gb|ABQ32565.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1] Length = 111 Score = 94.7 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 49/90 (54%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P + I +E D +K++V A G+AN+A+ +LAK + ++K+ Sbjct: 14 TVALRVTPRGGRDAIDGIETLSDGRSV--LKLRVRAVADGGEANRAVTELLAKAIGVTKA 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++R+ S ++ LK + I D + + L++ Sbjct: 72 AVRITSGATARLKQVTITGDASRLDQALRD 101 >gi|13473522|ref|NP_105090.1| hypothetical protein msl4154 [Mesorhizobium loti MAFF303099] gi|29839606|sp|Q98EP2|Y4154_RHILO RecName: Full=UPF0235 protein msl4154 gi|14024272|dbj|BAB50876.1| msl4154 [Mesorhizobium loti MAFF303099] Length = 102 Score = 94.7 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 49/90 (54%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ VRL P + + +E D H+K +V A P+ G AN+A+ ++AK L + S Sbjct: 13 DLFVRLTPKSSLDRLEGVETSADGRS--HLKARVRAVPENGAANQALERLVAKTLGVPAS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ +++ +S LK + I D + + + ++ Sbjct: 71 SVSVVAGGTSRLKTVRIVGDPEALAQRVEA 100 >gi|300024715|ref|YP_003757326.1| hypothetical protein Hden_3210 [Hyphomicrobium denitrificans ATCC 51888] gi|299526536|gb|ADJ25005.1| protein of unknown function DUF167 [Hyphomicrobium denitrificans ATCC 51888] Length = 116 Score = 94.7 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 3/88 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V RL P + K I + D + +V A P+ G AN A+ ++A+ L L K S Sbjct: 24 VHFRLTPKSSKDAIEGVTSTSDG---PAFQARVRAVPEHGAANAALEQLVARWLDLPKRS 80 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S LK + ID + +E+ LL+ Sbjct: 81 VSLAKGGKSRLKALQIDGEPEELDRLLE 108 >gi|27375661|ref|NP_767190.1| hypothetical protein bsl0550 [Bradyrhizobium japonicum USDA 110] gi|27348798|dbj|BAC45815.1| bsl0550 [Bradyrhizobium japonicum USDA 110] Length = 94 Score = 94.3 bits (233), Expect = 5e-18, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 47/85 (55%), Gaps = 2/85 (2%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P + I +E D +K++V A G+ANKA+L +LAK L + K+S+R+L Sbjct: 1 MTPRGGRDDIDGIEQLADGRSV--LKVRVRAIADGGEANKAVLVLLAKSLGVPKASVRLL 58 Query: 68 SKQSSPLKIIYIDKDCKEITELLQN 92 S +S LK I +D D + E L+ Sbjct: 59 SGATSRLKQIAVDGDPARLGETLRQ 83 >gi|323136497|ref|ZP_08071579.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242] gi|322398571|gb|EFY01091.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242] Length = 110 Score = 94.3 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 47/94 (50%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +RL P + I +E D +K +V A P+ G+AN A++ ++AK L K++ Sbjct: 18 VWLRLTPKGGRDAIEGVETLSDGRAV--LKARVRAAPEDGRANAALIELIAKALRAPKNA 75 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + + S ++S +K I+I D + L + Sbjct: 76 VSIRSGETSRVKKIFIAGDSATYLDALAKLAPTS 109 >gi|254561320|ref|YP_003068415.1| hypothetical protein METDI2902 [Methylobacterium extorquens DM4] gi|254268598|emb|CAX24557.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium extorquens DM4] Length = 105 Score = 93.9 bits (232), Expect = 6e-18, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGRPV--LSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++S ++S K +++ D + + ++ Sbjct: 73 VTLVSGETSRTKRLHLSGDPQMLAARVEA 101 >gi|218530362|ref|YP_002421178.1| hypothetical protein Mchl_2407 [Methylobacterium chloromethanicum CM4] gi|259646623|sp|B7KZL8|Y2407_METC4 RecName: Full=UPF0235 protein Mchl_2407 gi|218522665|gb|ACK83250.1| protein of unknown function DUF167 [Methylobacterium chloromethanicum CM4] Length = 105 Score = 93.9 bits (232), Expect = 6e-18, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGRPV--LSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +LS ++S K +++ D + + ++ Sbjct: 73 VTLLSGETSRTKRLHLSGDPQMLAARVEA 101 >gi|240138720|ref|YP_002963192.1| hypothetical protein MexAM1_META1p2120 [Methylobacterium extorquens AM1] gi|240008689|gb|ACS39915.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium extorquens AM1] Length = 105 Score = 93.5 bits (231), Expect = 8e-18, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGQPV--LSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +LS ++S K +++ D + + ++ Sbjct: 73 VTLLSGETSRTKRLHLSGDPQTLAARVEA 101 >gi|307293981|ref|ZP_07573825.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum L-1] gi|306880132|gb|EFN11349.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum L-1] Length = 109 Score = 93.5 bits (231), Expect = 8e-18, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 46/83 (55%), Gaps = 2/83 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I + D + +V A P+KG+AN A++A+LAK+L +S+ Sbjct: 13 LAVRLTPGAAREDIGGV--WTDEKGAQWLGARVRAVPEKGRANTALIALLAKRLDWPRSA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + + S ++ LK + I+ + + Sbjct: 71 ISLESGDTNRLKRLRIEGGGEPL 93 >gi|16127852|ref|NP_422416.1| hypothetical protein CC_3622 [Caulobacter crescentus CB15] gi|221236673|ref|YP_002519110.1| YggU superfamily protein [Caulobacter crescentus NA1000] gi|47117618|sp|Q9A2E3|Y3622_CAUCR RecName: Full=UPF0235 protein CC_3622 gi|254803798|sp|B8H6E1|Y3737_CAUCN RecName: Full=UPF0235 protein CCNA_03737 gi|13425372|gb|AAK25584.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|220965846|gb|ACL97202.1| YggU superfamily protein [Caulobacter crescentus NA1000] Length = 98 Score = 93.5 bits (231), Expect = 9e-18, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 49/90 (54%), Gaps = 3/90 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++VRL P + + D +++K++V + P +G AN A++A LAK L + + Sbjct: 7 VTLVVRLTPRGGRDAAEGWALDADGR--LYLKVRVASPPVEGAANAALIAFLAKTLKIPR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S++R+ + +++ LK + ++ D ++ Sbjct: 65 SAVRLAAGETARLKRLELEGVDPADVARAF 94 >gi|319780857|ref|YP_004140333.1| hypothetical protein Mesci_1119 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166745|gb|ADV10283.1| protein of unknown function DUF167 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 105 Score = 93.1 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 48/91 (52%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A + +E D H+K +V A P+ G AN A+ ++AK + + Sbjct: 12 VELFVRLTPKAALDRLEGIETTADERS--HLKARVRAVPENGAANHALEKLIAKAIGVPG 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S++ +++ ++ LK + I+ D + + + ++ Sbjct: 70 SAVSVVAGGTARLKTVRIEGDPETLAKSIEA 100 >gi|163851553|ref|YP_001639596.1| hypothetical protein Mext_2130 [Methylobacterium extorquens PA1] gi|259646591|sp|A9W4L9|Y2130_METEP RecName: Full=UPF0235 protein Mext_2130 gi|163663158|gb|ABY30525.1| protein of unknown function DUF167 [Methylobacterium extorquens PA1] Length = 105 Score = 92.3 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGRPV--LSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +LS ++S K + + D + + ++ Sbjct: 73 VTLLSGETSRTKRLLLSGDPQTLAARVEA 101 >gi|288959535|ref|YP_003449876.1| hypothetical protein AZL_026940 [Azospirillum sp. B510] gi|288911843|dbj|BAI73332.1| hypothetical protein AZL_026940 [Azospirillum sp. B510] Length = 113 Score = 92.0 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 48/88 (54%), Gaps = 2/88 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P A ++ + + + +K+ VTA P+ GKAN+A++ +L+K + K+ Sbjct: 15 RVALRVTPKASRNAVTG--MADTAAGGRVLKLAVTAVPENGKANEAVIKLLSKAWKVPKT 72 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 SL +++ + KI+++ D + L Sbjct: 73 SLTVVAGATDRNKILHVAGDPAALLARL 100 >gi|297620780|ref|YP_003708917.1| yggU family protein [Waddlia chondrophila WSU 86-1044] gi|297376081|gb|ADI37911.1| yggU family protein [Waddlia chondrophila WSU 86-1044] Length = 94 Score = 90.4 bits (223), Expect = 7e-17, Method: Composition-based stats. Identities = 29/87 (33%), Positives = 48/87 (55%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+LIPNA K I+ E +K+++TA P+KGKAN ++ LA +L +SKS Sbjct: 11 LPVKLIPNAGKDEISGWEN-------GILKVRITAVPEKGKANAHLIKFLASQLKVSKSD 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + + K + I D + E L Sbjct: 64 ITLIKGEKNRHKTLLIKGDADVVGERL 90 >gi|23015143|ref|ZP_00054928.1| COG1872: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] Length = 94 Score = 90.4 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 45/87 (51%), Gaps = 2/87 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I D S + +K +VT P+ GKAN A+L +L+K + KS Sbjct: 1 MAVRLTPKASRDRIMGAAPEADGS--VVLKAQVTTVPEDGKANAALLKLLSKAWKIPKSD 58 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + K+I I D + + + L Sbjct: 59 MDIVLGATDRRKVILISGDSEVLRKRL 85 >gi|330444266|ref|YP_004377252.1| hypothetical protein G5S_0576 [Chlamydophila pecorum E58] gi|328807376|gb|AEB41549.1| conserved hypothetical protein [Chlamydophila pecorum E58] Length = 100 Score = 90.0 bits (222), Expect = 9e-17, Method: Composition-based stats. Identities = 28/85 (32%), Positives = 51/85 (60%), Gaps = 7/85 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 +C + V++ P AK++ I + +K++VT P+KGKAN+A++++LAK L + Sbjct: 16 LCILEVQVTPKAKENKIVGFQ-------GEVLKVRVTEPPEKGKANEAVVSLLAKALGIP 68 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE 85 K + ++S +SS K + I K +E Sbjct: 69 KRDVTLVSGESSRKKKLMIPKKVQE 93 >gi|62184885|ref|YP_219670.1| hypothetical protein CAB243 [Chlamydophila abortus S26/3] gi|81312941|sp|Q5L6M2|Y243_CHLAB RecName: Full=UPF0235 protein CAB243 gi|62147952|emb|CAH63699.1| conserved hypothetical protein [Chlamydophila abortus S26/3] Length = 96 Score = 90.0 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 28/84 (33%), Positives = 48/84 (57%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P +K++ I E +KI+VT P+KGKAN+A++A+LAK L+L K Sbjct: 8 LEVKVTPKSKQNTIVGFE-------GEVLKIRVTEVPEKGKANEAVIALLAKALSLPKRD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + ++ +S K I + K + I Sbjct: 61 ITLIPGDTSRKKRILLPKSTESIV 84 >gi|254295256|ref|YP_003061279.1| hypothetical protein Hbal_2912 [Hirschia baltica ATCC 49814] gi|254043787|gb|ACT60582.1| protein of unknown function DUF167 [Hirschia baltica ATCC 49814] Length = 109 Score = 89.3 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 2/91 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++ R+ PNA K + + E +D + ++K++V A P KGKANKA+ +LA L KS Sbjct: 13 DIVARVTPNASKDAVEAPE--QDAAGRTYLKLRVRAIPDKGKANKAVEKLLASHFNLPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++L + Sbjct: 71 KVAVVKGSTDRLKTIRISDGADLSSQLAEKY 101 >gi|146337438|ref|YP_001202486.1| hypothetical protein BRADO0278 [Bradyrhizobium sp. ORS278] gi|146190244|emb|CAL74236.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 109 Score = 89.3 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 2/79 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P + I +E D +K++V A G+AN+A+ +LAK + ++K Sbjct: 12 TVALRVTPRGGRDAIDGIETLSDGRSV--LKVRVRAIADGGEANRAVTELLAKAIGVTKK 69 Query: 63 SLRMLSKQSSPLKIIYIDK 81 ++R+ S +S LK + ID Sbjct: 70 AVRITSGTTSRLKQVAIDG 88 >gi|282891527|ref|ZP_06300019.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281498618|gb|EFB40945.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 93 Score = 88.9 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 47/79 (59%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +++IPNA ++ I E + +K+ + + P+KGKAN+A++ LAK L L K Sbjct: 7 TLAIKVIPNASRNAILGWENDE-------LKMYIASVPEKGKANEAVIKFLAKFLGLRKQ 59 Query: 63 SLRMLSKQSSPLKIIYIDK 81 ++++ +++ KI+ I+ Sbjct: 60 QIQIIRGETNRHKILQIEG 78 >gi|241762300|ref|ZP_04760381.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241373203|gb|EER62833.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 113 Score = 88.1 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 55/90 (61%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ A K+GI + KDT+ +I+V A P +G +NK ++A L+K ++ K Sbjct: 20 RLALRVTARASKTGITMFD--KDTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPKG 77 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++R+ S + S +KI++I D K +TE+ ++ Sbjct: 78 AVRIESGEHSKIKILHIAGDVKRLTEIAED 107 >gi|170749303|ref|YP_001755563.1| hypothetical protein Mrad2831_2896 [Methylobacterium radiotolerans JCM 2831] gi|170655825|gb|ACB24880.1| protein of unknown function DUF167 [Methylobacterium radiotolerans JCM 2831] Length = 93 Score = 87.7 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 24/79 (30%), Positives = 42/79 (53%), Gaps = 2/79 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL P + +D ++K +V A P +G AN A++ ++AK L + + Sbjct: 4 RLAVRLTPRGGRDAAEGW--ARDEKGQPYLKARVAAPPVEGAANAALVVLIAKALKVGRG 61 Query: 63 SLRMLSKQSSPLKIIYIDK 81 S+R+++ S LKI+ ID Sbjct: 62 SVRIVTGDQSRLKILEIDG 80 >gi|23011359|ref|ZP_00051742.1| COG1872: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] Length = 99 Score = 87.7 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 50/90 (55%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL P A ++G+ + + D + ++V A P +G AN A+ A +AK L L K+ Sbjct: 7 TLSVRLTPRASRTGLDGVRVDADGRPV--LGLRVAAPPVEGAANAALTAFVAKSLKLRKA 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++S ++S K +++ D KE+ ++ Sbjct: 65 EVVLVSGEASRTKRLHLTGDAKELAARVET 94 >gi|29840014|ref|NP_829120.1| hypothetical protein CCA00247 [Chlamydophila caviae GPIC] gi|33301875|sp|Q824A6|Y247_CHLCV RecName: Full=UPF0235 protein CCA_00247 gi|29834361|gb|AAP04998.1| conserved hypothetical protein [Chlamydophila caviae GPIC] Length = 92 Score = 87.3 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 27/84 (32%), Positives = 50/84 (59%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P ++++ I E +KI+VT P+KGKAN+A++A+LAK L+L K Sbjct: 8 LEVKVTPKSRENKIVGFE-------GEVLKIRVTEAPEKGKANEAVIALLAKTLSLPKRD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + ++S ++S K + + K + I Sbjct: 61 VTLISGETSRKKRLLLPKSTESII 84 >gi|56551705|ref|YP_162544.1| hypothetical protein ZMO0809 [Zymomonas mobilis subsp. mobilis ZM4] gi|260752718|ref|YP_003225611.1| hypothetical protein Za10_0477 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|56543279|gb|AAV89433.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ZM4] gi|258552081|gb|ACV75027.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 113 Score = 87.3 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 28/90 (31%), Positives = 55/90 (61%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ A K+GI + KDT+ +I+V A P +G +NK ++A L+K ++ K Sbjct: 20 RLALRVTARASKTGITMFD--KDTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPKG 77 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++++ S + S +KI++I D K +TE+ ++ Sbjct: 78 AVKIESGEHSKIKILHIAGDVKRLTEIAED 107 >gi|329942568|ref|ZP_08291378.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10] gi|313847795|emb|CBY16785.1| conserved hypothetical protein [Chlamydophila psittaci RD1] gi|325506510|gb|ADZ18148.1| conserved hypothetical protein [Chlamydophila psittaci 6BC] gi|328815478|gb|EGF85466.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10] gi|328914447|gb|AEB55280.1| conserved hypothetical protein [Chlamydophila psittaci 6BC] Length = 92 Score = 86.9 bits (214), Expect = 8e-16, Method: Composition-based stats. Identities = 29/84 (34%), Positives = 50/84 (59%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P +K++ I E +KI+VT P+KGKAN+A++A+LAK L+L K Sbjct: 8 LEVKVTPKSKENKIVGFE-------GEVLKIRVTEVPEKGKANEAVIALLAKTLSLPKRD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + ++S ++S K I + K + I Sbjct: 61 VTLISGETSKNKRILLPKATESIV 84 >gi|15893224|ref|NP_360938.1| hypothetical protein RC1301 [Rickettsia conorii str. Malish 7] gi|34581107|ref|ZP_00142587.1| hypothetical protein [Rickettsia sibirica 246] gi|229587206|ref|YP_002845707.1| hypothetical protein RAF_ORF1191 [Rickettsia africae ESF-5] gi|29839741|sp|Q92G24|Y1301_RICCN RecName: Full=UPF0235 protein RC1301 gi|259645748|sp|C3PLX4|Y1191_RICAE RecName: Full=UPF0235 protein RAF_ORF1191 gi|15620440|gb|AAL03839.1| unknown [Rickettsia conorii str. Malish 7] gi|28262492|gb|EAA25996.1| unknown [Rickettsia sibirica 246] gi|228022256|gb|ACP53964.1| Unknown [Rickettsia africae ESF-5] Length = 105 Score = 86.9 bits (214), Expect = 9e-16, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 54/90 (60%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PN+K++ I++ I ++ ++K+ + A P++GKAN+ ++ LAK+ LS+S+ Sbjct: 16 LSFKVKPNSKQNLISNFVII---NNIPYLKLSIKAIPEQGKANEEIINYLAKEWKLSRSN 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++ L+ N+ Sbjct: 73 IEIIKGHTHSLKTILIKNINEDYLNLIINS 102 >gi|187735949|ref|YP_001878061.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC BAA-835] gi|187426001|gb|ACD05280.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC BAA-835] Length = 96 Score = 86.6 bits (213), Expect = 9e-16, Method: Composition-based stats. Identities = 31/93 (33%), Positives = 54/93 (58%), Gaps = 3/93 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +++IPNAKKS E +D +K+++ A P +GKANKA++ L+ L + +S Sbjct: 2 KLALKVIPNAKKSEAVGWE--EDPRAGRALKLRIAAPPVEGKANKAVVLFLSAWLDIPRS 59 Query: 63 SLRMLSKQSSPLKIIYIDKDCK-EITELLQNND 94 S+ L +SS LK++ + C+ ++ LL D Sbjct: 60 SISFLRGESSRLKVVELPDGCEGKLARLLSAED 92 >gi|329890913|ref|ZP_08269256.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568] gi|328846214|gb|EGF95778.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568] Length = 93 Score = 86.6 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 40/80 (50%), Gaps = 2/80 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+L P A I ++ D +K++V A P +G+AN A+ A LAK L + K Sbjct: 3 ARIPVKLTPRASADRIDGWDVDPDGRPV--LKVRVRAQPVEGEANAALTAFLAKALGVPK 60 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 + + S LK+I +D Sbjct: 61 RDVALARGGQSRLKMIEVDG 80 >gi|15604669|ref|NP_221187.1| hypothetical protein RP839 [Rickettsia prowazekii str. Madrid E] gi|6686136|sp|Q9ZCC0|Y839_RICPR RecName: Full=UPF0235 protein RP839 gi|3861364|emb|CAA15263.1| unknown [Rickettsia prowazekii] gi|292572500|gb|ADE30415.1| hypothetical protein rpr22_CDS819 [Rickettsia prowazekii Rp22] Length = 105 Score = 86.2 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 27/89 (30%), Positives = 53/89 (59%), Gaps = 3/89 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P AK++ I + I ++ ++K+ + ATP++GKAN+ ++ LAK+ LS+SS Sbjct: 16 INVKVKPYAKQNLIGNFVII---NNIPYIKLAIKATPEQGKANEGIIHYLAKEWELSRSS 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++ + LK I I ++ L+ N Sbjct: 73 IEIIKGHTHSLKTILIKNINEDYLNLIIN 101 >gi|163795292|ref|ZP_02189259.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199] gi|159179278|gb|EDP63809.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199] Length = 112 Score = 86.2 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 3/92 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V++ P A I + +D + +++ VTA P+ G+ANKA+ A+LAK+ + K Sbjct: 14 VRAAVKVTPKAAADRIRGVV--QDEAGVAWLQVSVTAVPEDGRANKAVTALLAKRWRVPK 71 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 SS+ ++ + K++ + D +T LQ Sbjct: 72 SSIEIVQGTTERRKVLLVRSDDTAALTARLQT 103 >gi|238650830|ref|YP_002916685.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic] gi|259647069|sp|C4K236|Y4990_RICPU RecName: Full=UPF0235 protein RPR_04990 gi|238624928|gb|ACR47634.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic] Length = 105 Score = 86.2 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PN+K++ I++ I ++ ++K+ + A P++GKAN+ ++ LAK+ LS+S+ Sbjct: 16 LSFKVKPNSKQNLISNFVII---NNIPYLKLSIKAIPEQGKANEEIINYLAKEWKLSRSN 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++ + N+ Sbjct: 73 IEIIKGHTHSLKTILIKNINEDYLNWIINS 102 >gi|85713678|ref|ZP_01044668.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A] gi|85699582|gb|EAQ37449.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A] Length = 106 Score = 85.8 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 53/90 (58%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P + GI +E+ D + ++++ A + G+AN+A++A+LAK L + K Sbjct: 12 RIALRVTPRGGRDGIDGIEMLADGRPVVKVRVRAVA--EGGEANRAVMAVLAKALGVRKV 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+L+ +S LK + ++ D ++ + L+ Sbjct: 70 DVRILAGATSRLKQVAVEGDPVQLGDALRA 99 >gi|217979671|ref|YP_002363818.1| protein of unknown function DUF167 [Methylocella silvestris BL2] gi|217505047|gb|ACK52456.1| protein of unknown function DUF167 [Methylocella silvestris BL2] Length = 118 Score = 85.8 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 47/87 (54%), Gaps = 2/87 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P + + I D +K +V A PQ G+AN A++ ++AK L L+ S+ Sbjct: 24 LTVRLTPKSARDEIEGASQLSDGRAV--LKARVRAAPQDGEANAALIRLVAKALRLAPSA 81 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 +R+ + ++ LK + + D + + + L Sbjct: 82 VRVEAGATARLKTLCLTGDPETLQQSL 108 >gi|110635561|ref|YP_675769.1| hypothetical protein Meso_3232 [Mesorhizobium sp. BNC1] gi|110286545|gb|ABG64604.1| protein of unknown function DUF167 [Chelativorans sp. BNC1] Length = 116 Score = 85.8 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 40/87 (45%), Gaps = 2/87 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P + I + D +K +V A P+ GKAN+A+ +LA L + + Sbjct: 16 IFVRLTPKSSSDAIEGVTEGPDGQ--AFLKARVRAIPEAGKANEALERLLASALGVPRRD 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S +S K + I D + L Sbjct: 74 VAVSSGAASRRKTVSITGDAAPLIASL 100 >gi|254472462|ref|ZP_05085862.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958745|gb|EEA93945.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 105 Score = 85.4 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 51/89 (57%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P + K I + D + K++ A P+KG ANKA+ A+ AK L++ KSS Sbjct: 16 ITVRLTPKSSKDQIEKIGAQSDGRPLVLAKVR--AVPEKGAANKAVAALFAKALSVPKSS 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +++ ++ +K + + + +++ + L++ Sbjct: 74 AELIAGSTARIKTLRVLGEPQDLAKRLED 102 >gi|157964988|ref|YP_001499812.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5] gi|157844764|gb|ABV85265.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5] Length = 107 Score = 85.0 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PN+K++ I++ I ++ ++K+ + A P +GKAN+ ++ LAK+ LS+S+ Sbjct: 18 LSFKVKPNSKQNLISNFVII---NNIPYLKLSIKAIPAQGKANEEIINYLAKEWKLSRSN 74 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++ L+ N+ Sbjct: 75 IEIIKGHTHSLKTILIKNINEDYLNLIINS 104 >gi|165933862|ref|YP_001650651.1| hypothetical protein RrIowa_1526 [Rickettsia rickettsii str. Iowa] gi|189038764|sp|B0BVI5|Y1526_RICRO RecName: Full=UPF0235 protein RrIowa_1526 gi|165908949|gb|ABY73245.1| hypothetical cytosolic protein [Rickettsia rickettsii str. Iowa] Length = 105 Score = 85.0 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PN+K++ I++ I ++ ++K+ + A P++GKAN ++ LAK+ LS+S+ Sbjct: 16 LSFKVKPNSKQNLISNFVII---NNIQYLKLSIKAIPEQGKANAEIINYLAKEWKLSRSN 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++ L+ N+ Sbjct: 73 IEIIKGHTHSLKTILIKNINEDYLNLIINS 102 >gi|157829135|ref|YP_001495377.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str. 'Sheila Smith'] gi|166228842|sp|A8GTZ4|Y7140_RICRS RecName: Full=UPF0235 protein A1G_07140 gi|157801616|gb|ABV76869.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 105 Score = 85.0 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 53/90 (58%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PN+K++ I++ I ++ ++K+ + A P++GKAN ++ LAK+ LS+S+ Sbjct: 16 LSFKVKPNSKQNLISNFVII---NNIQYLKLSIKAIPEQGKANSEIINYLAKEWKLSRSN 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++ L+ N+ Sbjct: 73 IEIIKGHTHSLKTILIKNINEDYLNLIINS 102 >gi|254443993|ref|ZP_05057469.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] gi|198258301|gb|EDY82609.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] Length = 94 Score = 85.0 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 28/78 (35%), Positives = 50/78 (64%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+++PNA +S IA +D S +KI++ + PQ GKANKA++A LAK+ +SK+ Sbjct: 11 LSVKVLPNASRSEIAGW--LEDGS----LKIRIQSPPQDGKANKALIAFLAKETGVSKNQ 64 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + ++S K+I ++ Sbjct: 65 ISIARGETSRQKLIAFER 82 >gi|157826313|ref|YP_001494033.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford] gi|166228807|sp|A8GQ50|Y6510_RICAH RecName: Full=UPF0235 protein A1C_06510 gi|157800271|gb|ABV75525.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford] Length = 105 Score = 84.3 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 54/90 (60%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ PN+K++ I++ I ++ ++K+ + ATP+KGKAN+ ++ LAK LS+S+ Sbjct: 16 LNLKVKPNSKQNLISNFVII---NNIPYLKLSIKATPEKGKANEEIINYLAKAWKLSRSN 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + +K I I ++ + N+ Sbjct: 73 IEIIKGHTHSVKTILIKNINEDYLNFIINS 102 >gi|15618408|ref|NP_224693.1| hypothetical protein CPn0497 [Chlamydophila pneumoniae CWL029] gi|15836028|ref|NP_300552.1| hypothetical protein CPj0497 [Chlamydophila pneumoniae J138] gi|16752546|ref|NP_444808.1| hypothetical protein CP0257 [Chlamydophila pneumoniae AR39] gi|33241848|ref|NP_876789.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183] gi|29839694|sp|Q9Z854|Y497_CHLPN RecName: Full=UPF0235 protein CPn_0497/CP_0257/CPj0497/CpB0517 gi|4376783|gb|AAD18637.1| CT388 hypothetical protein [Chlamydophila pneumoniae CWL029] gi|7189184|gb|AAF38120.1| conserved hypothetical protein [Chlamydophila pneumoniae AR39] gi|8978867|dbj|BAA98703.1| CT388 hypothetical protein [Chlamydophila pneumoniae J138] gi|33236357|gb|AAP98446.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183] gi|269303374|gb|ACZ33474.1| conserved hypothetical protein TIGR00251 [Chlamydophila pneumoniae LPCoLN] Length = 90 Score = 84.3 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 24/84 (28%), Positives = 48/84 (57%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P AK++ I + +K++VT P+KGKAN A++++LAK L+L K Sbjct: 7 LEVKVTPKAKENKIVGFD-------GQALKVRVTEPPEKGKANDAVISLLAKALSLPKRD 59 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + +++ ++S K + ++I Sbjct: 60 VTLIAGETSRKKKFLLPNRVQDII 83 >gi|295106861|emb|CBL04404.1| Uncharacterized conserved protein [Gordonibacter pamelaeae 7-10-1-b] Length = 96 Score = 84.3 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 47/91 (51%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P + + ++ + D + + ++VTA P GKANKA+ ++A+ L + K Sbjct: 4 AIIAVHVTPRSGRDEVSGV--RADAAGADEVCVRVTAPPDGGKANKAVCKLVAEALGVPK 61 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S + + S ++ K + ++ D ++ L + Sbjct: 62 SRVGVASGHTARRKRLSVEADQAQVDAWLAS 92 >gi|168703344|ref|ZP_02735621.1| hypothetical protein GobsU_27681 [Gemmata obscuriglobus UQM 2246] Length = 101 Score = 83.9 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 28/94 (29%), Positives = 51/94 (54%), Gaps = 8/94 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + VR+ P AKK+ + +++ VTA P+ G+AN A+LA+L L + Sbjct: 12 CTLAVRVQPKAKKNAVLGERASA-------LRVSVTAPPEDGRANDAVLALLCDHFKLQR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94 S L +LS Q++ K+I + +++ +L+ +D Sbjct: 65 SQLALLSGQTNRNKVILVRGVTPQQLADLIPASD 98 >gi|75674677|ref|YP_317098.1| hypothetical protein Nwi_0479 [Nitrobacter winogradskyi Nb-255] gi|74419547|gb|ABA03746.1| Protein of unknown function DUF167 [Nitrobacter winogradskyi Nb-255] Length = 106 Score = 83.9 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + I +E+ D + ++++ A G+AN+A+ A+LAK L + K Sbjct: 13 IALRVTPRGGRDAIDGIEMLADGRPVVKVRVRAVA--DGGEANRAVTAVLAKALGVRKID 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+L+ +S LK + ++ D ++ L+ Sbjct: 71 VRILAGATSRLKQVAVEGDPVQLGNALRA 99 >gi|209883467|ref|YP_002287324.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5] gi|226706162|sp|B6JAU6|Y4310_OLICO RecName: Full=UPF0235 protein OCAR_4310 gi|209871663|gb|ACI91459.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5] Length = 106 Score = 83.5 bits (205), Expect = 9e-15, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 2/93 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + + +E+ + + ++++V A + G+AN+A+ + A L + KS Sbjct: 14 VAVRVTPRGGRDAVDGIEMLANGKSVVKVRVRVAA--EGGEANRAVTELFAGLLRVPKSK 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 +++ S +S +K I I+ D K++ E L+ S+ Sbjct: 72 VKVASGVTSRIKQIAIEGDPKQLGEALKAATSI 104 >gi|312116202|ref|YP_004013798.1| hypothetical protein Rvan_3519 [Rhodomicrobium vannielii ATCC 17100] gi|311221331|gb|ADP72699.1| protein of unknown function DUF167 [Rhodomicrobium vannielii ATCC 17100] Length = 123 Score = 83.1 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 26/87 (29%), Positives = 44/87 (50%), Gaps = 3/87 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + +A +E +K VT P+ GKAN A++ ++A L + KSS Sbjct: 24 LHVRLTPKASSARVAGVEAFD---GKPVLKAYVTTPPEDGKANAALVVLVASWLGVPKSS 80 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + M + Q S LK + + ++ + Sbjct: 81 VSMAAGQKSRLKTVAVAGKADDLLAKI 107 >gi|117926928|ref|YP_867545.1| hypothetical protein Mmc1_3654 [Magnetococcus sp. MC-1] gi|166990883|sp|A0LDU6|Y3654_MAGSM RecName: Full=UPF0235 protein Mmc1_3654 gi|117610684|gb|ABK46139.1| protein of unknown function DUF167 [Magnetococcus sp. MC-1] Length = 98 Score = 82.7 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 39/87 (44%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P A + + + +K+ + A P G ANKA+ LAK+L ++K Sbjct: 13 LTIRVQPKAAQERVMGWQ-------GEQLKVALNAPPVDGAANKALCHFLAKQLGIAKGQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + S K + I I + Sbjct: 66 VTLVRGEKSREKQLVIQGISPSIWQQF 92 >gi|67005257|gb|AAY62183.1| Conserved hypothetical protein [Rickettsia felis URRWXCal2] Length = 110 Score = 82.7 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 54/90 (60%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ PN+K++ I+ I ++ ++K+ + ATP++GKAN+ ++ LAK+ LS+ Sbjct: 21 LNLKVKPNSKQNLISDFVII---NNIPYLKLSIKATPEQGKANEEIINYLAKEWKLSRKD 77 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ ++ LK I I ++ L+ N+ Sbjct: 78 IEIIKGHTNSLKTILIKNIDEDYLNLIINS 107 >gi|299133137|ref|ZP_07026332.1| protein of unknown function DUF167 [Afipia sp. 1NLS2] gi|298593274|gb|EFI53474.1| protein of unknown function DUF167 [Afipia sp. 1NLS2] Length = 108 Score = 82.3 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 52/94 (55%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + + +E + + ++++V A + G+AN+A+ + A+ L + KS Sbjct: 14 VAVRVTPRGGRDAVDGIEELANGKSVVKVRVRVAA--EGGEANRAVTELFAELLRVPKSK 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 +R+ S +S +K + I+ D K++ E L+ + T Sbjct: 72 VRVASGVTSRVKQLTIEGDPKQLGEALKAATAAT 105 >gi|162022116|ref|YP_247348.2| hypothetical protein RF_1332 [Rickettsia felis URRWXCal2] gi|126253831|sp|Q4UJV6|Y1332_RICFE RecName: Full=UPF0235 protein RF_1332 Length = 105 Score = 82.3 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 54/90 (60%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ PN+K++ I+ I ++ ++K+ + ATP++GKAN+ ++ LAK+ LS+ Sbjct: 16 LNLKVKPNSKQNLISDFVII---NNIPYLKLSIKATPEQGKANEEIINYLAKEWKLSRKD 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ ++ LK I I ++ L+ N+ Sbjct: 73 IEIIKGHTNSLKTILIKNIDEDYLNLIINS 102 >gi|298293297|ref|YP_003695236.1| hypothetical protein Snov_3343 [Starkeya novella DSM 506] gi|296929808|gb|ADH90617.1| protein of unknown function DUF167 [Starkeya novella DSM 506] Length = 106 Score = 81.9 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + I D +K +V+ + GKAN A+ +LAK ++ S Sbjct: 16 VTVRATPRGGRDAIDGFVELGDGR--TALKARVSVAAEDGKANAALGKLLAKAAGIAPSR 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++S + K ++ D EI LQ Sbjct: 74 VDLVSGATGRTKAFKLNGDAAEIAARLQA 102 >gi|170742084|ref|YP_001770739.1| hypothetical protein M446_3939 [Methylobacterium sp. 4-46] gi|226706076|sp|B0UH52|Y3939_METS4 RecName: Full=UPF0235 protein M446_3939 gi|168196358|gb|ACA18305.1| protein of unknown function DUF167 [Methylobacterium sp. 4-46] Length = 104 Score = 81.9 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 2/87 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + + +E D +K++V A P+ G AN A+ A+LA+ L + Sbjct: 14 VRVRATPRGGRDAVEGIETRADG--LPVLKVRVRAAPEDGAANAAIRAVLAEALGCPARA 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + ++ +K+ + D + + + Sbjct: 72 VTLAAGATARVKLFRVAGDGQALAARI 98 >gi|167630164|ref|YP_001680663.1| conserved hypothetical protein, uncharacterized acr, yggu family [Heliobacterium modesticaldum Ice1] gi|259646567|sp|B0TGP1|Y2027_HELMI RecName: Full=UPF0235 protein Helmi_20270 gi|167592904|gb|ABZ84652.1| conserved hypothetical protein, uncharacterized acr, yggu family [Heliobacterium modesticaldum Ice1] Length = 96 Score = 81.6 bits (200), Expect = 3e-14, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 48/89 (53%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +R+ P A K+ + L +K+++TA P G+AN A L +AK L LS+S Sbjct: 13 RFRIRVQPRASKNEVCGLLDDA-------LKVRLTAPPVDGEANAACLQFIAKTLGLSRS 65 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELL 90 +R+++ ++S LK + ++ +++ + Sbjct: 66 QVRLVAGETSRLKTLEVEGVSAEDLRKRF 94 >gi|182413313|ref|YP_001818379.1| hypothetical protein Oter_1495 [Opitutus terrae PB90-1] gi|177840527|gb|ACB74779.1| protein of unknown function DUF167 [Opitutus terrae PB90-1] Length = 90 Score = 81.2 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 8/94 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + ++ IPNA ++ I +K+KV A P +G+AN+ + LA +L L + Sbjct: 4 CTIAIKAIPNAPRNQIVGW-------LGDALKVKVHAPPLEGRANEELCEFLADELGLPR 56 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94 ++ +L +S K++ I+ D ++ L + D Sbjct: 57 RAVSVLRGDTSRQKLVQIEGLDLAQLKAKLSSLD 90 >gi|260467362|ref|ZP_05813535.1| protein of unknown function DUF167 [Mesorhizobium opportunistum WSM2075] gi|259028889|gb|EEW30192.1| protein of unknown function DUF167 [Mesorhizobium opportunistum WSM2075] Length = 103 Score = 81.2 bits (199), Expect = 4e-14, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 49/90 (54%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ VRL P + + +E D H+K++V A P+ G AN+A+ + AK L + S Sbjct: 13 DLFVRLTPKSSVDRLEGVETAADGRS--HLKVRVRAVPENGAANQALERLAAKTLGVPVS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ +++ ++ LK + + D + ++ ++ Sbjct: 71 AVSVVAGGTARLKTLRVAGDPEALSRSIEA 100 >gi|193213756|ref|YP_001994955.1| hypothetical protein Ctha_0036 [Chloroherpeton thalassium ATCC 35110] gi|193087233|gb|ACF12508.1| protein of unknown function DUF167 [Chloroherpeton thalassium ATCC 35110] Length = 97 Score = 80.8 bits (198), Expect = 5e-14, Method: Composition-based stats. Identities = 27/80 (33%), Positives = 41/80 (51%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A K+ I D + +KI++ A P + ANKA + LAK ++K Sbjct: 11 VDFSVRLQPRASKNEIVG---EYDGA----LKIRIAAPPVENAANKACIEFLAKTFGIAK 63 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S + +LS +S K+I I Sbjct: 64 SQVEILSGDTSRNKLIRIYG 83 >gi|220923538|ref|YP_002498840.1| hypothetical protein Mnod_3628 [Methylobacterium nodulans ORS 2060] gi|219948145|gb|ACL58537.1| protein of unknown function DUF167 [Methylobacterium nodulans ORS 2060] Length = 102 Score = 80.4 bits (197), Expect = 7e-14, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 2/81 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + I +E D +K++V A P+ G AN A+ +L L + Sbjct: 12 VRVRATPRGGRDAIDGIETRADG--LSVLKVRVRAAPEDGAANTAIRDLLKTALGCPARA 69 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 +R+ + ++ +KI I+ D + Sbjct: 70 VRLTAGATARVKIFRIEGDGE 90 >gi|39995970|ref|NP_951921.1| hypothetical protein GSU0864 [Geobacter sulfurreducens PCA] gi|39982735|gb|AAR34194.1| conserved hypothetical protein TIGR00251 [Geobacter sulfurreducens PCA] gi|307634758|gb|ADI83708.2| protein of unknown function DUF167 [Geobacter sulfurreducens KN400] Length = 107 Score = 80.4 bits (197), Expect = 7e-14, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 48/93 (51%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V + P A ++ I ++ +K+++T+ P +G+AN+ + LAK+L + K Sbjct: 20 VTFSVHVQPRASRNEICGVQ-------GEAIKLRLTSPPVEGEANRLCVEFLAKRLGVPK 72 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S + +++ + S K I + D + LL+N+ Sbjct: 73 SCVAIIAGEKSRHKTIRVSGSDAAAVLALLENS 105 >gi|189423484|ref|YP_001950661.1| hypothetical protein Glov_0413 [Geobacter lovleyi SZ] gi|189419743|gb|ACD94141.1| protein of unknown function DUF167 [Geobacter lovleyi SZ] Length = 101 Score = 80.0 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A ++ + +K+++T+ P G AN+ LAK+L + KS+ Sbjct: 18 LRVFVQPRASRNQFCGIH-------EGELKLRLTSPPVDGAANECCREFLAKQLKVPKSA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++S SS K + I + E L Sbjct: 71 VTLISGDSSRHKRLRIAGATTQQIEQL 97 >gi|289208238|ref|YP_003460304.1| hypothetical protein TK90_1056 [Thioalkalivibrio sp. K90mix] gi|288943869|gb|ADC71568.1| protein of unknown function DUF167 [Thioalkalivibrio sp. K90mix] Length = 85 Score = 80.0 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 43/84 (51%), Gaps = 7/84 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + VR+ P +K+ I +K++V A P+KG+AN+A+ A+LAK L Sbjct: 1 MARLRVRVAPGSKRDAIGPW-------MGDILKLRVQAPPEKGRANEAVCALLAKALGCP 53 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCK 84 + +++ ++ K + I+ + Sbjct: 54 ARDVSVVAGATARDKTVAIEGYSE 77 >gi|328541881|ref|YP_004301990.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1] gi|326411632|gb|ADZ68695.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1] Length = 106 Score = 79.2 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 2/79 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VRL P A K + D + +V A P+KG AN A+ A++AK L + KS+ Sbjct: 17 VDVRLTPRAGKDAVEGCSELSDGRPVVL--ARVRAIPEKGAANAALEALIAKALGVPKSA 74 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + + + LK + + D Sbjct: 75 VAIDAGAGARLKTLKVSGD 93 >gi|332187012|ref|ZP_08388753.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17] gi|332013022|gb|EGI55086.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17] Length = 84 Score = 78.9 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 41/85 (48%), Gaps = 7/85 (8%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P + + + H +++A P G AN A++ ++AK ++K ++ ++ Sbjct: 1 MTPRGGRDMLTA-------GTEDHFAARLSAPPVDGAANAALVPLVAKHFGVAKRAVTIV 53 Query: 68 SKQSSPLKIIYIDKDCKEITELLQN 92 + +++ LK ++I D + + + Sbjct: 54 AGETARLKRLHIAGDPHILARMAEA 78 >gi|304313340|ref|YP_003812938.1| hypothetical protein HDN1F_37330 [gamma proteobacterium HdN1] gi|301799073|emb|CBL47316.1| Conserved hypothetical protein [gamma proteobacterium HdN1] Length = 108 Score = 78.9 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 38/88 (43%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A G +KI++TA P G+AN ++ LAK + + Sbjct: 18 LHCYLQPRAANDGFVG-------EHGGRLKIRITAPPVDGQANAHLIRFLAKAFGVPQQQ 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +++ ++ K I I + +I + L+ Sbjct: 71 VQIEQGETGRSKRIRI-RTPSKIPQELR 97 >gi|158424942|ref|YP_001526234.1| hypothetical protein AZC_3318 [Azorhizobium caulinodans ORS 571] gi|158331831|dbj|BAF89316.1| protein of unknown function [Azorhizobium caulinodans ORS 571] Length = 104 Score = 78.9 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + + + D D +KI+V A P+ G A A+ +LA ++ S+ Sbjct: 13 VSVRATPKGGRDALDGVSQLSDGRDV--LKIRVRAAPEDGAATAAVAKVLAGAAGVAPSA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+ S ++ LK+ I D + L+ Sbjct: 71 VRLASGATARLKVFRISGDAARLRATLEA 99 >gi|284046655|ref|YP_003396995.1| hypothetical protein Cwoe_5214 [Conexibacter woesei DSM 14684] gi|283950876|gb|ADB53620.1| protein of unknown function DUF167 [Conexibacter woesei DSM 14684] Length = 88 Score = 78.9 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 26/80 (32%), Positives = 46/80 (57%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M ++ VRL P AK++ I + ++VTA P GKAN A+ +LAK L ++ Sbjct: 1 MGDLRVRLQPRAKRNEIVGERDGA-------LVVRVTAPPVDGKANAALCRLLAKALGVA 53 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S++ ++ QS+ K++++D Sbjct: 54 PSTVTVVRGQSARDKVVHVD 73 >gi|237807736|ref|YP_002892176.1| hypothetical protein Tola_0962 [Tolumonas auensis DSM 9187] gi|259710178|sp|C4LCM6|Y962_TOLAT RecName: Full=UPF0235 protein Tola_0962 gi|237499997|gb|ACQ92590.1| protein of unknown function DUF167 [Tolumonas auensis DSM 9187] Length = 96 Score = 78.9 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A + I +KI +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LDVYIQPKASRDQIQGWH-------GEELKIAITAPPVDGQANAHLIKFLAKQFKVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K + I +++ +L + Sbjct: 66 IVIHKGELGRHKTVRIT-SPQQLPAILDQS 94 >gi|292493759|ref|YP_003529198.1| hypothetical protein Nhal_3796 [Nitrosococcus halophilus Nc4] gi|291582354|gb|ADE16811.1| protein of unknown function DUF167 [Nitrosococcus halophilus Nc4] Length = 102 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P A I +K+++TA P +GKAN ++ LAK +SKS Sbjct: 15 IQIRLQPRASCDEIIGPH-------GDRLKVRITAPPVEGKANADLIRFLAKTFRVSKSQ 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 +R+LS + K + I+K K + Sbjct: 68 VRLLSGATGRDKRVCIEKPAKLL 90 >gi|239948125|ref|ZP_04699878.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] gi|239922401|gb|EER22425.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] Length = 92 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 46/78 (58%), Gaps = 3/78 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ PN+ ++ I+ ++ ++K+ + A P++GKAN+ ++ LAK+ LS+S+ Sbjct: 16 LSLKVKPNSNRNLISDFITI---NNIPYLKLSIKAVPEQGKANEEIINYLAKEWKLSRSN 72 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++ + LK I I Sbjct: 73 IEIIKGHTHSLKTILIKN 90 >gi|114326941|ref|YP_744098.1| putative cytoplasmic protein [Granulibacter bethesdensis CGDNIH1] gi|122328075|sp|Q0BVH7|Y277_GRABC RecName: Full=UPF0235 protein GbCGDNIH1_0277 gi|114315115|gb|ABI61175.1| hypothetical cytosolic protein [Granulibacter bethesdensis CGDNIH1] Length = 107 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 48/90 (53%), Gaps = 2/90 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R++P A+K G+ D +KI V+A KG+AN+A+ MLAK L + S Sbjct: 12 TLALRVVPKARKIGLGGTVPGADGKPR--LKISVSAPADKGQANEAVRDMLAKALRVPAS 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +L ++ K++ ++ D + + ++ Sbjct: 70 RITLLQGLTARDKLVRVEGDPETLGSTVET 99 >gi|114798070|ref|YP_761927.1| hypothetical protein HNE_3254 [Hyphomonas neptunium ATCC 15444] gi|114738244|gb|ABI76369.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 99 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 43/87 (49%), Gaps = 2/87 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + R+ P A + D + +K++V A P +G AN A+ A++AK L + KS Sbjct: 4 RLTARVQPKAASDRLDGW--AADEAGRPFLKLRVRALPAEGAANAAVEALVAKALGVPKS 61 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITEL 89 ++R+++ + LK + I+ + Sbjct: 62 AVRVVTGGKNRLKSLEIEGPPDLAARI 88 >gi|146309167|ref|YP_001189632.1| hypothetical protein Pmen_4153 [Pseudomonas mendocina ymp] gi|205829317|sp|A4XZY4|Y4153_PSEMY RecName: Full=UPF0235 protein Pmen_4153 gi|145577368|gb|ABP86900.1| protein of unknown function DUF167 [Pseudomonas mendocina ymp] Length = 98 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 7/72 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A K A L +KI++TA P +GKAN +LA LAK ++K+ + + Sbjct: 17 LQPKASKDEFAGLH-------GERLKIRLTAPPVEGKANAHLLAFLAKAFGVAKAQVSLE 69 Query: 68 SKQSSPLKIIYI 79 S + + K + I Sbjct: 70 SGELNRHKRLRI 81 >gi|32475275|ref|NP_868269.1| hypothetical protein RB8260 [Rhodopirellula baltica SH 1] gi|47117454|sp|Q7UFY2|Y8260_RHOBA RecName: Full=UPF0235 protein RB8260 gi|32445816|emb|CAD78547.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 108 Score = 78.5 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 VR+ P AKK+ + L +K+ V P+ GKANKA++A LAK L +SK Sbjct: 19 RFRVRVTPKAKKASVGGLHDGA-------LKVSVHTVPEDGKANKAVIASLAKWLRVSKG 71 Query: 63 SLRMLSKQSSPLKIIYID 80 + +++ ++S LK I ++ Sbjct: 72 RVAIVAGETSRLKTIVVE 89 >gi|226942467|ref|YP_002797540.1| hypothetical protein Avin_03050 [Azotobacter vinelandii DJ] gi|259646933|sp|C1DI68|Y305_AZOVD RecName: Full=UPF0235 protein Avin_03050 gi|226717394|gb|ACO76565.1| conserved hypothetical protein [Azotobacter vinelandii DJ] Length = 99 Score = 78.1 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A K A L +KI++TA P +GKAN +LA LA + KS Sbjct: 13 LACHLQPKASKDEFAGLH-------GERLKIRLTAPPVEGKANAHLLAFLAGVFGVPKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + S +S+ K + I + Sbjct: 66 VSLESGESNRQKRVRIRR 83 >gi|192362044|ref|YP_001980615.1| hypothetical protein CJA_0091 [Cellvibrio japonicus Ueda107] gi|226734040|sp|B3PFH5|Y091_CELJU RecName: Full=UPF0235 protein CJA_0091 gi|190688209|gb|ACE85887.1| conserved hypothetical protein TIGR00251 [Cellvibrio japonicus Ueda107] Length = 101 Score = 77.7 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +L P A I + +KI++TA P GKAN+ ++ L+K+ + K + Sbjct: 17 LHCQLQPKASGDDIVGVH-------GDRLKIRITAPPVDGKANEYLIKWLSKQFRVPKGN 69 Query: 64 LRMLSKQSSPLKIIYI 79 +++L + K + I Sbjct: 70 IKILQGELGRHKTLGI 85 >gi|51474006|ref|YP_067763.1| hypothetical protein RT0827 [Rickettsia typhi str. Wilmington] gi|81390320|sp|Q68Y09|Y827_RICTY RecName: Full=UPF0235 protein RT0827 gi|51460318|gb|AAU04281.1| conserved hypothetical protein [Rickettsia typhi str. Wilmington] Length = 105 Score = 77.7 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 51/90 (56%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P +K+ + ++ ++K+ +TA P++GKAN+ ++ LAK+ LS+SS Sbjct: 16 INIKVKPYSKQ---NLINNFVIINNIPYIKLSITAAPEQGKANEGIINYLAKEWKLSRSS 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++ L+ N+ Sbjct: 73 IEIIKGHTHSLKTILIKNINEDYLNLIINS 102 >gi|114564027|ref|YP_751541.1| hypothetical protein Sfri_2863 [Shewanella frigidimarina NCIMB 400] gi|122299086|sp|Q07Z62|Y2863_SHEFN RecName: Full=UPF0235 protein Sfri_2863 gi|114335320|gb|ABI72702.1| conserved hypothetical protein [Shewanella frigidimarina NCIMB 400] Length = 104 Score = 77.3 bits (189), Expect = 6e-13, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A + I L +KI +TA P GKAN + LAK ++KS Sbjct: 17 LFVYVQPKASRDQIVGLY-------GNELKIAITAPPIDGKANAYLSKYLAKACKVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELL 90 + ++ + K I I + EI LL Sbjct: 70 VHIIKGEQGRHKQIRISQPQVIPPEIAALL 99 >gi|70733126|ref|YP_262899.1| hypothetical protein PFL_5841 [Pseudomonas fluorescens Pf-5] gi|123652292|sp|Q4K4D4|Y5841_PSEF5 RecName: Full=UPF0235 protein PFL_5841 gi|68347425|gb|AAY95031.1| conserved hypothetical protein TIGR00251 [Pseudomonas fluorescens Pf-5] Length = 97 Score = 77.3 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 8/82 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A A L +KI++TA P +GKAN ++A LAK + KS++ ++ Sbjct: 17 LQPKASSDEFAGLH-------GERLKIRLTAPPVEGKANAHLMAFLAKAFGIPKSNVSLV 69 Query: 68 SKQSSPLKIIYIDKDCKEITEL 89 S + + K + + K++ +L Sbjct: 70 SGELNRQKRVRLQA-PKKLPDL 90 >gi|111075036|gb|ABH04883.1| uncharacterized conserved protein yggY [Heliobacillus mobilis] Length = 101 Score = 77.3 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +R+ P A K+ + L +K+++TA P G+AN A A AK L+L K Sbjct: 11 VRFKIRVQPRASKNEVCGLLDDA-------LKVRLTAPPVDGEANGACQAFFAKTLSLPK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 S +R+++ ++S K + + KE ++L+ DS Sbjct: 64 SQVRLVAGETSRTKTVEVIGVSKE--QILKLFDS 95 >gi|296533899|ref|ZP_06896426.1| protein of hypothetical function DUF167 [Roseomonas cervicalis ATCC 49957] gi|296265774|gb|EFH11872.1| protein of hypothetical function DUF167 [Roseomonas cervicalis ATCC 49957] Length = 101 Score = 76.9 bits (188), Expect = 8e-13, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 3/90 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+ P A+++G+ D +K+ V P+ G+ANKA+ A+LA L + Sbjct: 12 VELRVKAQPKARRAGLQGWIAAPDG---PRLKLAVHEAPEDGRANKAICALLAGALHVPP 68 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S++ ++ +S K I D + E L+ Sbjct: 69 SAITVVQGATSREKTCRILGDSARLHETLE 98 >gi|254784473|ref|YP_003071901.1| hypothetical protein TERTU_0220 [Teredinibacter turnerae T7901] gi|237683524|gb|ACR10788.1| conserved hypothetical protein [Teredinibacter turnerae T7901] Length = 99 Score = 76.9 bits (188), Expect = 9e-13, Method: Composition-based stats. Identities = 22/72 (30%), Positives = 37/72 (51%), Gaps = 7/72 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A A L+ + +KI++TA P GKAN ++ LA++ ++KS + ++ Sbjct: 18 LQPKASSDAFAGLQADR-------LKIRITAPPTDGKANAHLVKYLARQFGVAKSDIEIV 70 Query: 68 SKQSSPLKIIYI 79 Q S K + I Sbjct: 71 RGQLSRQKTLQI 82 >gi|157804228|ref|YP_001492777.1| hypothetical protein A1E_05380 [Rickettsia canadensis str. McKiel] gi|166227320|sp|A8F059|Y5380_RICCK RecName: Full=UPF0235 protein A1E_05380 gi|157785491|gb|ABV73992.1| hypothetical protein A1E_05380 [Rickettsia canadensis str. McKiel] Length = 105 Score = 76.9 bits (188), Expect = 9e-13, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 51/90 (56%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P++K++ I+ I ++ ++K+ + P++GKAN+ ++ LAK LS+S+ Sbjct: 16 LNLKVKPDSKQNLISDFVII---NNLPYLKLFIKTAPEQGKANEEIINYLAKAWKLSRSN 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I ++ + N+ Sbjct: 73 IEIIKGHTHSLKTILIKNIDEDYLNSIINS 102 >gi|182680478|ref|YP_001834624.1| hypothetical protein Bind_3579 [Beijerinckia indica subsp. indica ATCC 9039] gi|182636361|gb|ACB97135.1| protein of unknown function DUF167 [Beijerinckia indica subsp. indica ATCC 9039] Length = 87 Score = 76.2 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 44/81 (54%), Gaps = 3/81 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A + I +E D S+ ++I VT P+ GKAN+ +L +LAK L + Sbjct: 10 VEIAIRVTPKASANRIV-VETAPDGSER--LRIYVTTVPENGKANRDVLRLLAKHLDIPP 66 Query: 62 SSLRMLSKQSSPLKIIYIDKD 82 SSL ++ + KI+ +D Sbjct: 67 SSLEIIRGSTGRDKIVRFSRD 87 >gi|77166447|ref|YP_344972.1| hypothetical protein Noc_3000 [Nitrosococcus oceani ATCC 19707] gi|254435182|ref|ZP_05048689.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani AFC27] gi|123593242|sp|Q3J6V4|Y3000_NITOC RecName: Full=UPF0235 protein Noc_3000 gi|76884761|gb|ABA59442.1| Conserved hypothetical protein 251 [Nitrosococcus oceani ATCC 19707] gi|207088293|gb|EDZ65565.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani AFC27] Length = 102 Score = 76.2 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P AK + +KI++TA P +GKAN +L LAK +S++ Sbjct: 15 IQIRLQPRAKGDEVIGPH-------GDRLKIRITAPPVEGKANTHLLRFLAKTFQVSRNQ 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + +LS +S K + I+K K + Sbjct: 68 VYLLSGATSRDKRVRIEKPTKLL 90 >gi|85704715|ref|ZP_01035816.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217] gi|85670533|gb|EAQ25393.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217] Length = 84 Score = 76.2 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 8/78 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P A ++ I + + +++ VT P+ GKAN A+ +LAK L + KS Sbjct: 15 TIALRVTPKAARNRIVAED--------GALRVYVTTVPEDGKANAAVQKLLAKALGVPKS 66 Query: 63 SLRMLSKQSSPLKIIYID 80 L +L +S K+ +D Sbjct: 67 RLSLLRGHTSRDKVFQVD 84 >gi|330505393|ref|YP_004382262.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01] gi|328919679|gb|AEB60510.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01] Length = 96 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 7/72 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A K A L +KI++TA P GKAN + A LAK ++KS + + Sbjct: 17 LQPKASKDEFAGLH-------GERLKIRLTAPPVDGKANAHLQAFLAKAFGVAKSQVILE 69 Query: 68 SKQSSPLKIIYI 79 S + + K + I Sbjct: 70 SGELNRQKRLRI 81 >gi|149200997|ref|ZP_01877972.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035] gi|149145330|gb|EDM33356.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035] Length = 84 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 8/78 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P A ++ I + + +++ VT P+ GKAN A+ +LAK L + KS Sbjct: 15 TLTLRVTPKAARNRIVAEDD--------VLRVYVTTVPEDGKANAAVQKLLAKALGVPKS 66 Query: 63 SLRMLSKQSSPLKIIYID 80 L +L +S K+ +D Sbjct: 67 RLTLLRGHTSRDKVFQVD 84 >gi|121997739|ref|YP_001002526.1| hypothetical protein Hhal_0948 [Halorhodospira halophila SL1] gi|121589144|gb|ABM61724.1| protein of unknown function DUF167 [Halorhodospira halophila SL1] Length = 101 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 40/77 (51%), Gaps = 8/77 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P AK+ + + +++++ P GKAN A+ +LA++ ++KS+ Sbjct: 15 VHVRVTPRAKRESLD--------VEGERLRVRLNTPPVDGKANTALRKLLARQFGVAKSA 66 Query: 64 LRMLSKQSSPLKIIYID 80 + +L + S K + I Sbjct: 67 VSLLRGERSRDKTVRIT 83 >gi|83945079|ref|ZP_00957445.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii HTCC2633] gi|83851861|gb|EAP89716.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii HTCC2633] Length = 101 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 2/77 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VR+ P A ++G D I + +V A P KG AN + A+ AK L + KS Sbjct: 6 RLFVRVQPRASRAGFDGARAGTDGR--IRLAARVRAAPDKGAANTELCALTAKTLGVPKS 63 Query: 63 SLRMLSKQSSPLKIIYI 79 ++ +++ + K + + Sbjct: 64 TVSVIAGATQREKTLLV 80 >gi|257062952|ref|YP_003142624.1| hypothetical protein Shel_02050 [Slackia heliotrinireducens DSM 20476] gi|256790605|gb|ACV21275.1| uncharacterized conserved protein [Slackia heliotrinireducens DSM 20476] Length = 106 Score = 75.8 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 48/85 (56%), Gaps = 2/85 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P A+++ +A + D + + ++++VT P+ GKANKA+ LAK + +SKS Sbjct: 16 IPIHATPKAQRNAVAGV--KADDTGRLEVQVRVTVAPEGGKANKAVCETLAKAIGVSKSK 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88 + ++ ++S K+ ++ +I Sbjct: 74 VSIVRGETSRHKMAQVEAPSADIEA 98 >gi|77461543|ref|YP_351050.1| hypothetical protein Pfl01_5322 [Pseudomonas fluorescens Pf0-1] gi|123602932|sp|Q3K595|Y5322_PSEPF RecName: Full=UPF0235 protein Pfl01_5322 gi|77385546|gb|ABA77059.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1] Length = 96 Score = 75.4 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 8/82 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A+ L +KI++TA P +GKAN ++ LAK +SKS + +L Sbjct: 17 LQPAARSDDFCGLH-------GDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQVSLL 69 Query: 68 SKQSSPLKIIYIDKDCKEITEL 89 S + + K + I K++ +L Sbjct: 70 SGELNRQKRVRI-GAPKKLPDL 90 >gi|170290086|ref|YP_001736902.1| hypothetical protein Kcr_0466 [Candidatus Korarchaeum cryptofilum OPF8] gi|170174166|gb|ACB07219.1| protein of unknown function DUF167 [Candidatus Korarchaeum cryptofilum OPF8] Length = 72 Score = 75.4 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 26/79 (32%), Positives = 47/79 (59%), Gaps = 8/79 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V V ++PNA ++G+ + H+++ V A P KGKAN+A++ +LA+ + KS Sbjct: 2 RVSVLVVPNAGRNGVV--------EEGDHLRVYVRAPPVKGKANEAVIEVLAEFFGVKKS 53 Query: 63 SLRMLSKQSSPLKIIYIDK 81 +R++S + S K++ I K Sbjct: 54 DIRIISGERSREKVVEIRK 72 >gi|116623504|ref|YP_825660.1| hypothetical protein Acid_4414 [Candidatus Solibacter usitatus Ellin6076] gi|116226666|gb|ABJ85375.1| protein of unknown function DUF167 [Candidatus Solibacter usitatus Ellin6076] Length = 88 Score = 75.4 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 22/94 (23%), Positives = 45/94 (47%), Gaps = 8/94 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + VR+ P A++S I K+ + A P GKAN + LA + Sbjct: 1 MARLTVRVHPRARRSEITG-------RLGDAWKLALAAPPVDGKANDECVRFLAGWAGVP 53 Query: 61 KSSLRMLSKQSSPLKIIYIDKDC-KEITELLQNN 93 +S +R+++ +S +K++ I+ +++ L+ + Sbjct: 54 RSRVRIVTGLTSRIKVVEIEGVPQEDLERRLKAS 87 >gi|71909498|ref|YP_287085.1| hypothetical protein Daro_3887 [Dechloromonas aromatica RCB] gi|123626353|sp|Q478W6|Y3887_DECAR RecName: Full=UPF0235 protein Daro_3887 gi|71849119|gb|AAZ48615.1| Conserved hypothetical protein 251 [Dechloromonas aromatica RCB] Length = 97 Score = 75.4 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 26/79 (32%), Positives = 43/79 (54%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P AKKS A L +KI++ A P GKAN+A++ +A L L+KS Sbjct: 14 TLTLHIQPGAKKSEFAGLH-------GDALKIRLAAPPVDGKANEALIRFIADALGLAKS 66 Query: 63 SLRMLSKQSSPLKIIYIDK 81 ++ + S Q+S K++ I Sbjct: 67 AVHLKSGQTSRRKVLEILG 85 >gi|330812367|ref|YP_004356829.1| hypothetical protein PSEBR_a5324 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327380475|gb|AEA71825.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 97 Score = 75.4 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 27/82 (32%), Positives = 43/82 (52%), Gaps = 8/82 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A+ A L +KI++TA P +GKAN ++A LAK +SKS + ++ Sbjct: 17 LQPAARSDDFAGLH-------GDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQVSLV 69 Query: 68 SKQSSPLKIIYIDKDCKEITEL 89 S + + K + I K++ EL Sbjct: 70 SGELNRQKRVRIH-SPKKLPEL 90 >gi|114566491|ref|YP_753645.1| hypothetical protein Swol_0959 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|122318444|sp|Q0AYD0|Y959_SYNWW RecName: Full=UPF0235 protein Swol_0959 gi|114337426|gb|ABI68274.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 102 Score = 75.0 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 7/83 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V++ P + ++ I +KIKV A P +G AN+A+ LA+ L K Sbjct: 16 RFGVKVQPRSSRNQIVGEHEGD-------LKIKVMAPPVEGAANQALQKFLAELFKLPKK 68 Query: 63 SLRMLSKQSSPLKIIYIDKDCKE 85 +R++ ++S KI+ I E Sbjct: 69 DIRIVRGETSRHKIVEIRGIEAE 91 >gi|258514375|ref|YP_003190597.1| hypothetical protein Dtox_1088 [Desulfotomaculum acetoxidans DSM 771] gi|257778080|gb|ACV61974.1| protein of unknown function DUF167 [Desulfotomaculum acetoxidans DSM 771] Length = 98 Score = 75.0 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 44/81 (54%), Gaps = 7/81 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A K +A L +KI++TA P +G+AN+A+ LAK L ++++ + Sbjct: 14 KVRVQPRASKDQVAGLWEDA-------VKIRLTAPPVEGEANRALCDFLAKHLGVTRAQV 66 Query: 65 RMLSKQSSPLKIIYIDKDCKE 85 +++ Q+ K++ + E Sbjct: 67 DLVTGQTGRNKLVRVSGITAE 87 >gi|320161245|ref|YP_004174469.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1] gi|319995098|dbj|BAJ63869.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1] Length = 105 Score = 75.0 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 47/78 (60%), Gaps = 6/78 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A K+ I EI D +KI++TA P +GKAN+A++ L++ L + + Sbjct: 17 AAITVRVTPRASKNEI--YEILDDG----TVKIRLTAPPVEGKANEALIDFLSEVLDVPR 70 Query: 62 SSLRMLSKQSSPLKIIYI 79 +SL +++ ++ KI+ + Sbjct: 71 TSLEIVAGETGRDKIVTV 88 >gi|119899753|ref|YP_934966.1| hypothetical protein azo3464 [Azoarcus sp. BH72] gi|166232594|sp|A1KB74|Y3464_AZOSB RecName: Full=UPF0235 protein azo3464 gi|119672166|emb|CAL96080.1| conserved hypothetical protein [Azoarcus sp. BH72] Length = 98 Score = 75.0 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 7/90 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P A+++G A L MKI++ A P GKAN A+ A LA + KS Sbjct: 14 TLTLHIQPGARQTGFAGLH-------GEAMKIRLAAPPVDGKANAALCAFLADFCEVPKS 66 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ ++S ++S K + ++ + L+ Sbjct: 67 AVTLVSGETSRAKRVRVETKTPGLAGRLRA 96 >gi|117919640|ref|YP_868832.1| hypothetical protein Shewana3_1191 [Shewanella sp. ANA-3] gi|166228912|sp|A0KUF7|Y1191_SHESA RecName: Full=UPF0235 protein Shewana3_1191 gi|117611972|gb|ABK47426.1| conserved hypothetical protein [Shewanella sp. ANA-3] Length = 96 Score = 74.6 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQ 91 + +L + K + I EI LL+ Sbjct: 66 VHILKGELGRHKQVRISPPKNVPAEIATLLE 96 >gi|330831217|ref|YP_004394169.1| hypothetical protein B565_3517 [Aeromonas veronii B565] gi|328806353|gb|AEB51552.1| hypothetical protein B565_3517 [Aeromonas veronii B565] Length = 100 Score = 74.6 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 43/92 (46%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 13 LHLVIQPKASRDQIIGLH-------GEELKVAITAPPVDGQANSHLIKFLAKQFKVAKGQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQN 92 + ++ + K + ID +E++ LL + Sbjct: 66 ITIVRGELGRHKTVAIDSPKQLPQEVSALLND 97 >gi|257095142|ref|YP_003168783.1| hypothetical protein CAP2UW1_3597 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257047666|gb|ACV36854.1| protein of unknown function DUF167 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 96 Score = 74.6 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 7/83 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V + L P AK + IA +K+++TA P G+AN A++ LA++L LS+S Sbjct: 13 TVTIHLQPGAKANEIAG-------RHGDALKVRITAPPVDGRANAALVDFLAQRLGLSRS 65 Query: 63 SLRMLSKQSSPLKIIYIDKDCKE 85 ++ + S +S K++ I E Sbjct: 66 AVELKSGLTSRRKVLRISGASAE 88 >gi|220909519|ref|YP_002484830.1| hypothetical protein Cyan7425_4156 [Cyanothece sp. PCC 7425] gi|219866130|gb|ACL46469.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7425] Length = 89 Score = 74.6 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 8/94 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + V+++P++ + + +K+KV A P+KGKAN A++A+LA L + Sbjct: 1 MAKLKVKVVPSSSRDLVVGW-------LGEALKVKVKAPPEKGKANAAVIALLATHLGID 53 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 ++ + +LS +S K++ I+ D +I L + Sbjct: 54 QTCIEVLSGHTSAAKVLSIEGLDQTQIRAALSDV 87 >gi|116751493|ref|YP_848180.1| hypothetical protein Sfum_4080 [Syntrophobacter fumaroxidans MPOB] gi|116700557|gb|ABK19745.1| protein of unknown function DUF167 [Syntrophobacter fumaroxidans MPOB] Length = 105 Score = 74.2 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V L P A K+ A + +KI++TA P +G+ANK + LA +S+S Sbjct: 19 LDVYLQPRASKNEWAGMHQGC-------LKIRLTAPPVEGEANKECVKFLAGAFGVSRSD 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ S K I I EI + Sbjct: 72 VEIIRGHKSRRKTILIRNSTPEILRAV 98 >gi|113969534|ref|YP_733327.1| hypothetical protein Shewmr4_1190 [Shewanella sp. MR-4] gi|123130613|sp|Q0HKZ7|Y1190_SHESM RecName: Full=UPF0235 protein Shewmr4_1190 gi|113884218|gb|ABI38270.1| conserved hypothetical protein [Shewanella sp. MR-4] Length = 96 Score = 74.2 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQ 91 + +L + K + I EI LL+ Sbjct: 66 VHILKGELGRHKQVRISAPKNVPAEIATLLE 96 >gi|171914386|ref|ZP_02929856.1| hypothetical protein VspiD_24445 [Verrucomicrobium spinosum DSM 4136] Length = 92 Score = 74.2 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 2/84 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 N+ ++ PNA++S I D + +K+ A +GKANK ++ LA++L +K Sbjct: 6 VNLACKVTPNARRSEIVGW--GADEQGRGVLLVKLAAPALEGKANKELVRFLAEQLGCAK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 + +L +S K++ + E Sbjct: 64 GEVSLLRGDASRTKLLRVPGKAYE 87 >gi|146293784|ref|YP_001184208.1| hypothetical protein Sputcn32_2690 [Shewanella putrefaciens CN-32] gi|166228432|sp|A4Y8X6|Y2690_SHEPC RecName: Full=UPF0235 protein Sputcn32_2690 gi|145565474|gb|ABP76409.1| protein of unknown function DUF167 [Shewanella putrefaciens CN-32] gi|319427156|gb|ADV55230.1| protein of unknown function DUF167 [Shewanella putrefaciens 200] Length = 96 Score = 74.2 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNVYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + ++ + K I I E++ELL+ Sbjct: 66 VYIIKGELGRHKQIRIVTPKLIPPEVSELLE 96 >gi|145300496|ref|YP_001143337.1| hypothetical protein ASA_3628 [Aeromonas salmonicida subsp. salmonicida A449] gi|166232617|sp|A4SRR7|Y3628_AERS4 RecName: Full=UPF0235 protein ASA_3628 gi|142853268|gb|ABO91589.1| conserved hypothetical protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 99 Score = 74.2 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 13 LHLMIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQ 65 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQN 92 +R++ + K + I+ EI LL+ Sbjct: 66 VRIVRGELGRHKTVAIESPRQIPAEIHALLET 97 >gi|134298585|ref|YP_001112081.1| hypothetical protein Dred_0717 [Desulfotomaculum reducens MI-1] gi|189040591|sp|A4J2F3|Y717_DESRM RecName: Full=UPF0235 protein Dred_0717 gi|134051285|gb|ABO49256.1| protein of unknown function DUF167 [Desulfotomaculum reducens MI-1] Length = 94 Score = 73.9 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 41/87 (47%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P A K+ +A +K+++TA P G AN+A + ++KS Sbjct: 13 VKVRVQPRASKNSLAG-------EMEGALKVRLTAPPVDGAANEACCKFFGELFGVAKSK 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + +++ + K+++I ++ + Sbjct: 66 VEIIAGHTGRNKLVHIQGVTEKQARFI 92 >gi|237747224|ref|ZP_04577704.1| predicted protein [Oxalobacter formigenes HOxBLS] gi|229378575|gb|EEO28666.1| predicted protein [Oxalobacter formigenes HOxBLS] Length = 100 Score = 73.9 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 28/92 (30%), Positives = 53/92 (57%), Gaps = 10/92 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++IPNA+KS I S E ++I++ A P GKAN+A++ +LAKKL + + Sbjct: 14 RIAVQVIPNARKSEIVSSE-------GETLRIRLQAQPVDGKANEALVQLLAKKLRVPRK 66 Query: 63 SLRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + ++ K++ + D+ ++I + LQ Sbjct: 67 QVSITHGLANKRKLLEVIVSDRSQEDIVKQLQ 98 >gi|154705806|ref|YP_001424486.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111] gi|154355092|gb|ABS76554.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111] Length = 92 Score = 73.9 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 46/77 (59%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P AK++ I+ H+KI++ A P +GKANKA++ LA++L L+ SS Sbjct: 7 LTVYIQPGAKQTQISG-------KHGEHIKIRLQAPPTEGKANKALIDFLAQRLKLNPSS 59 Query: 64 LRMLSKQSSPLKIIYID 80 + ++ + + LK I I+ Sbjct: 60 ITIIRGEKARLKTIAIE 76 >gi|240949737|ref|ZP_04754069.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305] gi|240295769|gb|EER46456.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305] Length = 100 Score = 73.9 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 8/90 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +TA P G AN +L L+K + KS Sbjct: 15 RLRIFLQPKASRDQIVGLHDEE-------LKIAITAPPVDGAANAHLLKFLSKLFKVPKS 67 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ + + K I+I + K+I + ++N Sbjct: 68 SIALEKGELQRHKQIFIP-EPKQIPQEIEN 96 >gi|291286027|ref|YP_003502843.1| hypothetical protein Dacet_0081 [Denitrovibrio acetiphilus DSM 12809] gi|290883187|gb|ADD66887.1| protein of unknown function DUF167 [Denitrovibrio acetiphilus DSM 12809] Length = 84 Score = 73.9 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 51/90 (56%), Gaps = 7/90 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V + P AKK+ ++ + D +KI+V A P +G AN+ ++ L+K+L +SKS Sbjct: 2 KLSVYVQPGAKKTELSGMH---DGK----IKIRVCAPPVEGAANEVLVKFLSKQLKISKS 54 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++++S + S KI+ I+ D ++ L Sbjct: 55 GIKIISGEKSRHKIVEINMDTLDVMNCLSK 84 >gi|294676098|ref|YP_003576713.1| hypothetical protein RCAP_rcc00541 [Rhodobacter capsulatus SB 1003] gi|294474918|gb|ADE84306.1| protein of unknown function DUF167 [Rhodobacter capsulatus SB 1003] Length = 83 Score = 73.9 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I E ++ VT P+ GKAN A++ +LAK L ++K Sbjct: 13 AEIALRVTPKASRNEIVVAEDG--------LRAYVTVVPEGGKANAAVVKLLAKSLGVAK 64 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ +++ K+ +D Sbjct: 65 SRLTLIRGETARDKVFRLD 83 >gi|78222380|ref|YP_384127.1| hypothetical protein Gmet_1164 [Geobacter metallireducens GS-15] gi|78193635|gb|ABB31402.1| protein of unknown function DUF167 [Geobacter metallireducens GS-15] Length = 102 Score = 73.9 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 40/80 (50%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V + P A K+GI ++ +K+++T+ P +G+AN+ LAK L + K Sbjct: 19 VTFSVHVQPRASKNGICGIQ-------GDAIKLRLTSPPVEGEANRLCTEYLAKLLKVPK 71 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S++ +++ S K I + Sbjct: 72 SAVTIIAGDKSRHKTIRVSG 91 >gi|257465218|ref|ZP_05629589.1| hypothetical protein AM202_01815 [Actinobacillus minor 202] gi|257450878|gb|EEV24921.1| hypothetical protein AM202_01815 [Actinobacillus minor 202] Length = 100 Score = 73.9 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 43/90 (47%), Gaps = 8/90 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +TA P G AN +L L+K + KS Sbjct: 15 RLRIFLQPKASRDQIVGLHDEE-------LKIAITAPPVDGAANAHLLKFLSKLFKVPKS 67 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ + + K I+I + K+I + ++N Sbjct: 68 SIALEKGELQRHKQIFIP-EPKQIPQEIEN 96 >gi|299139441|ref|ZP_07032616.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8] gi|298598710|gb|EFI54873.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8] Length = 99 Score = 73.5 bits (179), Expect = 8e-12, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 44/80 (55%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C++ VR+ P AK++ + D S +KI +T P G+AN A++A L+ +L + + Sbjct: 12 CSLPVRVHPGAKQNAVTGTH---DGS----LKISLTTPPTDGRANTALIAFLSDRLNIPR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 + + +L+ +S K + I Sbjct: 65 AHIELLTGATSRSKTLRIAG 84 >gi|56476391|ref|YP_157980.1| hypothetical protein ebA1762 [Aromatoleum aromaticum EbN1] gi|81358142|sp|Q5P6I2|Y954_AZOSE RecName: Full=UPF0235 protein AZOSEA09540 gi|56312434|emb|CAI07079.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1] Length = 97 Score = 73.5 bits (179), Expect = 9e-12, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AKK+ MK+++ A P GKAN A+ LA + +S+ Sbjct: 14 LSLHVQPGAKKTEFVGPH-------GEAMKLRLAAPPVDGKANAALTVFLAAFCGVGRSA 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +LS ++S K + I+ E L+ Sbjct: 67 VSLLSGETSRAKRVRIEGAGSEALARLRA 95 >gi|237749382|ref|ZP_04579862.1| predicted protein [Oxalobacter formigenes OXCC13] gi|229380744|gb|EEO30835.1| predicted protein [Oxalobacter formigenes OXCC13] Length = 100 Score = 73.5 bits (179), Expect = 9e-12, Method: Composition-based stats. Identities = 29/91 (31%), Positives = 50/91 (54%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ PNAKK+ I S SD ++I++ A P GKAN+A++ +AKKL K Sbjct: 14 RIAVQVSPNAKKTEIVS-------SDGEALRIRLQAPPVDGKANEALVQFIAKKLRTPKR 66 Query: 63 SLRMLSKQSSPLKIIYI---DKDCKEITELL 90 ++ + S+ K++ I D +E+ + L Sbjct: 67 NVSITHGLSAKHKLLEIGLPDIPEEELEKQL 97 >gi|114763599|ref|ZP_01443004.1| hypothetical protein 1100011001330_R2601_13754 [Pelagibaca bermudensis HTCC2601] gi|114543879|gb|EAU46891.1| hypothetical protein R2601_13754 [Roseovarius sp. HTCC2601] Length = 85 Score = 73.5 bits (179), Expect = 9e-12, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I + +++ VT P+ GKAN A+ +LAK L L K Sbjct: 14 ATLELRVTPKASRNEIR--------EEGGTLRVYVTTVPEDGKANAAVQKLLAKALGLPK 65 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ +S K I+ Sbjct: 66 SRLVLVRGATSRDKAFRIE 84 >gi|332107965|gb|EGJ09189.1| hypothetical protein RBXJA2T_02622 [Rubrivivax benzoatilyticus JA2] Length = 103 Score = 73.5 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 46/91 (50%), Gaps = 7/91 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V ++PNA+++G L +++++ A P GKAN+ +LA LA +L L K Sbjct: 17 RLRVAVVPNARRTGADGLHDGA-------LRVRLNAPPVDGKANETLLAWLADELDLPKR 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 ++R+ Q+ K I +D + + L Sbjct: 70 AVRLTHGQTGRRKTIELDAAPEAVAAWLARV 100 >gi|152979630|ref|YP_001345259.1| hypothetical protein Asuc_1977 [Actinobacillus succinogenes 130Z] gi|171704390|sp|A6VQS7|Y1977_ACTSZ RecName: Full=UPF0235 protein Asuc_1977 gi|150841353|gb|ABR75324.1| protein of unknown function DUF167 [Actinobacillus succinogenes 130Z] Length = 98 Score = 73.5 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 10/93 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI +TA P G AN ++ L+K + KS Sbjct: 12 RLRIMLQPKASKDAIIGLHDEE-------LKISITAPPVDGAANAHLIKYLSKAFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDK---DCKEITELLQN 92 ++++ + + K ++I + + +LL N Sbjct: 65 AVQLEKGELNRHKQVFIPAPKIIPEAVRQLLDN 97 >gi|24374867|ref|NP_718910.1| hypothetical protein SO_3356 [Shewanella oneidensis MR-1] gi|29839709|sp|Q8EBY9|Y3356_SHEON RecName: Full=UPF0235 protein SO_3356 gi|24349562|gb|AAN56354.1|AE015772_14 conserved hypothetical protein TIGR00251 [Shewanella oneidensis MR-1] Length = 96 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + +L + K + I+ EI+ LL+ Sbjct: 66 VHILKGELGRHKQVRINAPKSVPAEISALLE 96 >gi|114046767|ref|YP_737317.1| hypothetical protein Shewmr7_1261 [Shewanella sp. MR-7] gi|123131606|sp|Q0HX95|Y1261_SHESR RecName: Full=UPF0235 protein Shewmr7_1261 gi|113888209|gb|ABI42260.1| conserved hypothetical protein [Shewanella sp. MR-7] Length = 96 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQ 91 + +L + K++ I EI LL+ Sbjct: 66 VHILKGELGRHKLVRISAPKNVPAEIATLLE 96 >gi|302342363|ref|YP_003806892.1| hypothetical protein Deba_0928 [Desulfarculus baarsii DSM 2075] gi|301638976|gb|ADK84298.1| protein of unknown function DUF167 [Desulfarculus baarsii DSM 2075] Length = 95 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 40/79 (50%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VR+ P A + +A E +K+++ A P G+AN+A+L ++AK L+L + Sbjct: 12 SFAVRVSPRASRDQLAGEE-------GGALKVRLCAPPVDGQANEALLRLVAKALSLPRR 64 Query: 63 SLRMLSKQSSPLKIIYIDK 81 + + S S K + + Sbjct: 65 DVSLASGPRSRQKRLLVKG 83 >gi|154251824|ref|YP_001412648.1| hypothetical protein Plav_1371 [Parvibaculum lavamentivorans DS-1] gi|154155774|gb|ABS62991.1| protein of unknown function DUF167 [Parvibaculum lavamentivorans DS-1] Length = 108 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ +R+ P I L D S ++++V+A +KGKAN A+L +LA+ L L + Sbjct: 10 VSLRLRVTPRGGADRIDGL--AADASGEPFLRVRVSAVAEKGKANDAVLKLLARALRLPR 67 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S+ + S ++ K + I D + + L Sbjct: 68 SAFAVASGEAGRTKSVTISGDTERLMADL 96 >gi|120598144|ref|YP_962718.1| hypothetical protein Sputw3181_1321 [Shewanella sp. W3-18-1] gi|166200340|sp|A1RHL9|Y1321_SHESW RecName: Full=UPF0235 protein Sputw3181_1321 gi|120558237|gb|ABM24164.1| protein of unknown function DUF167 [Shewanella sp. W3-18-1] Length = 96 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + ++ + K I I E++ELL+ Sbjct: 66 VYIIKGELGRHKQIRIVTPKLIPPEVSELLE 96 >gi|84685306|ref|ZP_01013204.1| hypothetical protein 1099457000258_RB2654_10573 [Maritimibacter alkaliphilus HTCC2654] gi|84666463|gb|EAQ12935.1| hypothetical protein RB2654_10573 [Rhodobacterales bacterium HTCC2654] Length = 85 Score = 73.1 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 44/77 (57%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A ++ ++ + +D H+K+ VT P+ GKA A++ +LA L ++KS Sbjct: 16 LTVRVTPKASRNAVS---VDEDG----HLKVSVTTVPEDGKATAAVVKLLAHALGVAKSD 68 Query: 64 LRMLSKQSSPLKIIYID 80 L ++ +S K+ ID Sbjct: 69 LTLVRGATSRDKVFRID 85 >gi|119775613|ref|YP_928353.1| hypothetical protein Sama_2480 [Shewanella amazonensis SB2B] gi|189039696|sp|A1S8H7|Y2480_SHEAM RecName: Full=UPF0235 protein Sama_2480 gi|119768113|gb|ABM00684.1| conserved hypothetical protein [Shewanella amazonensis SB2B] Length = 95 Score = 72.7 bits (177), Expect = 1e-11, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P GKAN + +LAK + K Sbjct: 13 LALYVQPKASRDELVGLH-------GEELKLAITAPPVDGKANAHICKLLAKAFKVPKGK 65 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELL 90 + + + K++ I + + L Sbjct: 66 VSIERGELGRHKLVRIQAPEIIPDDFAQFL 95 >gi|332039865|gb|EGI76260.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624] Length = 115 Score = 72.7 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 43/76 (56%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + ++PNA ++ I L D + +K+++ A P GKAN+A+ LAK L++ SS Sbjct: 25 VDLHVMPNASRTQIQGL---FDGA----LKVRLQAPPVDGKANEALRVWLAKTLSIPNSS 77 Query: 64 LRMLSKQSSPLKIIYI 79 + + ++ K +++ Sbjct: 78 VTLQHGATARRKQLHV 93 >gi|323702139|ref|ZP_08113806.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans DSM 574] gi|323532826|gb|EGB22698.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans DSM 574] Length = 95 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A K+ +A +K+++TA P G AN+A LA+ ++KS+ Sbjct: 13 LRVRVQPRAAKNSLAG-------EMEGALKVRLTAPPVDGAANEACCRFLAEVFGVAKSN 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++S + K++ + + + Sbjct: 66 VEIISGHTGRNKVVRVAGIDEARARRV 92 >gi|117621083|ref|YP_858117.1| hypothetical protein AHA_3661 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|166232625|sp|A0KPB6|Y3661_AERHH RecName: Full=UPF0235 protein AHA_3661 gi|117562490|gb|ABK39438.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 99 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 13 MHLMIQPKASRDQIVGLH-------GEELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQ 65 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQN 92 +R++ + K + I+ E++ LL N Sbjct: 66 VRIVRGELGRHKTVAIEAPRQIPAEVSALLDN 97 >gi|126175230|ref|YP_001051379.1| hypothetical protein Sbal_3028 [Shewanella baltica OS155] gi|153001556|ref|YP_001367237.1| hypothetical protein Shew185_3043 [Shewanella baltica OS185] gi|160876292|ref|YP_001555608.1| hypothetical protein Sbal195_3186 [Shewanella baltica OS195] gi|304410074|ref|ZP_07391693.1| protein of unknown function DUF167 [Shewanella baltica OS183] gi|307302214|ref|ZP_07581972.1| protein of unknown function DUF167 [Shewanella baltica BA175] gi|166229359|sp|A3D6Z7|Y3028_SHEB5 RecName: Full=UPF0235 protein Sbal_3028 gi|166229364|sp|A6WQT5|Y3043_SHEB8 RecName: Full=UPF0235 protein Shew185_3043 gi|189039841|sp|A9KXP6|Y3186_SHEB9 RecName: Full=UPF0235 protein Sbal195_3186 gi|125998435|gb|ABN62510.1| protein of unknown function DUF167 [Shewanella baltica OS155] gi|151366174|gb|ABS09174.1| protein of unknown function DUF167 [Shewanella baltica OS185] gi|160861814|gb|ABX50348.1| protein of unknown function DUF167 [Shewanella baltica OS195] gi|304351483|gb|EFM15882.1| protein of unknown function DUF167 [Shewanella baltica OS183] gi|306914252|gb|EFN44673.1| protein of unknown function DUF167 [Shewanella baltica BA175] gi|315268481|gb|ADT95334.1| protein of unknown function DUF167 [Shewanella baltica OS678] Length = 99 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYI 79 + ++ + K I + Sbjct: 66 IHIMKGELGRHKQIRV 81 >gi|147678156|ref|YP_001212371.1| hypothetical protein PTH_1821 [Pelotomaculum thermopropionicum SI] gi|189039033|sp|A5D180|Y1821_PELTS RecName: Full=UPF0235 protein PTH_1821 gi|146274253|dbj|BAF60002.1| uncharacterized conserved protein [Pelotomaculum thermopropionicum SI] Length = 95 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 43/86 (50%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A ++ +A L +KI++TA P G+AN+A A LA L+L S Sbjct: 13 LKVRVQPRAARNQVAGLYEDA-------LKIRLTAPPVDGEANEACRAFLADSLSLPPSK 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + ++S +S K++ I E Sbjct: 66 VEIVSGHASRTKVVKIAGVGAEKVRR 91 >gi|170725690|ref|YP_001759716.1| hypothetical protein Swoo_1329 [Shewanella woodyi ATCC 51908] gi|226695928|sp|B1KIX3|Y1329_SHEWM RecName: Full=UPF0235 protein Swoo_1329 gi|169811037|gb|ACA85621.1| protein of unknown function DUF167 [Shewanella woodyi ATCC 51908] Length = 95 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +KI +TA P GKAN ++ L+K + K Sbjct: 13 LNLYIQPKASRDQIVGVH-------GEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGD 65 Query: 64 LRMLSKQSSPLKIIYI 79 + +L + K + + Sbjct: 66 INILKGEQGRHKQVKV 81 >gi|217972515|ref|YP_002357266.1| hypothetical protein Sbal223_1335 [Shewanella baltica OS223] gi|254800042|sp|B8E927|Y1335_SHEB2 RecName: Full=UPF0235 protein Sbal223_1335 gi|217497650|gb|ACK45843.1| protein of unknown function DUF167 [Shewanella baltica OS223] Length = 99 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYI 79 + ++ + K I + Sbjct: 66 IHIMKGELGRHKQIRV 81 >gi|157960952|ref|YP_001500986.1| hypothetical protein Spea_1124 [Shewanella pealeana ATCC 700345] gi|157845952|gb|ABV86451.1| protein of unknown function DUF167 [Shewanella pealeana ATCC 700345] Length = 90 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + L+K + K Sbjct: 8 LQLYIQPKASRDQIVGLH-------GEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGD 60 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELL 90 + ++ Q K + I E+ LL Sbjct: 61 IDIIKGQMGRHKQVRITAPKLIPAEVQNLL 90 >gi|294139811|ref|YP_003555789.1| hypothetical protein SVI_1040 [Shewanella violacea DSS12] gi|293326280|dbj|BAJ01011.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 100 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +KI +TA P GKAN + L+K + K Sbjct: 13 LNLYIQPKASRDKIIGVH-------GNELKIAITAPPVDGKANAHLTKFLSKAFKVPKGD 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELL 90 + + + K + I ++I +LL Sbjct: 66 IIIHKGELGRHKQVEILTPRVIPEQIADLL 95 >gi|293394481|ref|ZP_06638777.1| conserved hypothetical protein [Serratia odorifera DSM 4582] gi|291422946|gb|EFE96179.1| conserved hypothetical protein [Serratia odorifera DSM 4582] Length = 97 Score = 72.3 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 36/76 (47%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P A + I L +K+ +TA P G+AN +L +AK+ ++KS+ Sbjct: 13 VRLYIQPKASRDQIIGLH-------GDEIKVAITAPPVDGQANAHLLKFIAKQFKVAKSN 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VTIEKGELGRHKQLRI 81 >gi|42522071|ref|NP_967451.1| hypothetical protein Bd0463 [Bdellovibrio bacteriovorus HD100] gi|39574602|emb|CAE78444.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 94 Score = 71.9 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 43/91 (47%), Gaps = 8/91 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P + K+ + + +KIK+TA P GKAN+ ++ L+ + K Sbjct: 9 VRLHLFIQPKSSKNEVVGPHNGE-------IKIKLTAPPVDGKANECLIEFLSDLFDIPK 61 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + ++ ++ K++ + D ++ E L+ Sbjct: 62 RDVHLIKGETGRHKVVELAGLDVEKTREALR 92 >gi|52424377|ref|YP_087514.1| hypothetical protein MS0322 [Mannheimia succiniciproducens MBEL55E] gi|81387574|sp|Q65VT1|Y322_MANSM RecName: Full=UPF0235 protein MS0322 gi|52306429|gb|AAU36929.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 95 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I + + +KI +TA P G AN +L L+K + KS Sbjct: 12 RLRIFLQPKASRDKIIGIHDDE-------LKIAITAPPVDGAANAHLLKYLSKAFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 ++ + + + K ++I + K I E LQ Sbjct: 65 AIILEKGELNRHKQLFIP-EPKLIPEELQ 92 >gi|281355950|ref|ZP_06242443.1| protein of unknown function DUF167 [Victivallis vadensis ATCC BAA-548] gi|281317319|gb|EFB01340.1| protein of unknown function DUF167 [Victivallis vadensis ATCC BAA-548] Length = 100 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 45/92 (48%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C V +R+ P A ++ + + +KI + A P GKAN+ + + A+ L K Sbjct: 16 CLVSLRVQPGASRNAVVGMY-------GDAVKIALQAPPVDGKANQLLCRLFAEWSGLPK 68 Query: 62 SSLRMLSKQSSPLKIIYIDKD-CKEITELLQN 92 S++ + S Q+ K++ + +++ +L+ Sbjct: 69 SAVELRSGQTGRSKVLELSGITAEQLKAILER 100 >gi|126733001|ref|ZP_01748760.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37] gi|126706530|gb|EBA05608.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37] Length = 84 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 8/78 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + +++ VT P+ GKA A++ +LAK L + K Sbjct: 14 ATLAVRVTPKASRNAVE--------RTDDALRVYVTTVPEGGKATAAVVKLLAKALGVPK 65 Query: 62 SSLRMLSKQSSPLKIIYI 79 S L ++ ++S K+ + Sbjct: 66 SRLELVRGETSRDKVFRV 83 >gi|308050656|ref|YP_003914222.1| hypothetical protein Fbal_2946 [Ferrimonas balearica DSM 9799] gi|307632846|gb|ADN77148.1| protein of unknown function DUF167 [Ferrimonas balearica DSM 9799] Length = 96 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L K+ +TA P GKAN ++ LAK+ ++K Sbjct: 14 LKLYIQPKASRDQLVGLH-------GEEFKVAITAPPVDGKANAHLVKFLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELL 90 + ++ + K + I + +LL Sbjct: 67 ISIVKGELGRHKQLKIQSPTVIPDPLAQLL 96 >gi|311278130|ref|YP_003940361.1| hypothetical protein Entcl_0802 [Enterobacter cloacae SCF1] gi|308747325|gb|ADO47077.1| protein of unknown function DUF167 [Enterobacter cloacae SCF1] Length = 96 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSLVGLH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VIIEKGELGRHKQVRI 81 >gi|95930347|ref|ZP_01313084.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684] gi|95133599|gb|EAT15261.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684] Length = 103 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 43/90 (47%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A K+ + L+ +K+++T+ P +G ANK AK L +SKSS Sbjct: 19 IALFVQPRASKNSLCGLQ-------GEELKVRLTSPPVEGAANKLCCTFFAKLLGVSKSS 71 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + ++ S K I ++ E+ + L Sbjct: 72 VTLIRGDKSRHKQIVVEGVSLDEVKQRLAK 101 >gi|320352454|ref|YP_004193793.1| hypothetical protein Despr_0318 [Desulfobulbus propionicus DSM 2032] gi|320120956|gb|ADW16502.1| protein of unknown function DUF167 [Desulfobulbus propionicus DSM 2032] Length = 110 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 44/87 (50%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P A + +A L+ +K++VT P GKAN+A++A LAK L KSS Sbjct: 21 LRLQVQPRAAANHLAGLQ-------GDMLKLRVTTPPVDGKANQAVVAYLAKLFHLPKSS 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S S K + I ++ + Sbjct: 74 VVLKSGHQSRGKTVVIASGHEQEVRAV 100 >gi|153835271|ref|ZP_01987938.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|156975836|ref|YP_001446743.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116] gi|269960447|ref|ZP_06174820.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|166232611|sp|A7MZ92|Y3581_VIBHB RecName: Full=UPF0235 protein VIBHAR_03581 gi|148868246|gb|EDL67386.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|156527430|gb|ABU72516.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116] gi|269834874|gb|EEZ88960.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 96 Score = 71.5 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I+ Sbjct: 67 VHIEKGELGRHKQIRIE 83 >gi|254172671|ref|ZP_04879346.1| conserved hypothetical protein TIGR00251 [Thermococcus sp. AM4] gi|214033600|gb|EEB74427.1| conserved hypothetical protein TIGR00251 [Thermococcus sp. AM4] Length = 113 Score = 71.5 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P AKK+ I ++ + +K++V A P GKANK ++ L+K L + Sbjct: 33 LFVYVQPKAKKNEIEGIDEWR-----GRLKVRVKAPPVGGKANKELVKFLSKLLG---AE 84 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ ++S K + + +E+ L Sbjct: 85 VELVRGETSREKDLLVRLSAEEVRRKL 111 >gi|50122551|ref|YP_051718.1| hypothetical protein ECA3630 [Pectobacterium atrosepticum SCRI1043] gi|81644033|sp|Q6D118|Y3630_ERWCT RecName: Full=UPF0235 protein ECA3630 gi|49613077|emb|CAG76528.1| conserved hypothetical protein [Pectobacterium atrosepticum SCRI1043] Length = 96 Score = 71.5 bits (174), Expect = 3e-11, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQ 91 + + + K I I ++ E ++ Sbjct: 66 VVIEKGELGRHKQIRITHPQHIPADVAEFIE 96 >gi|327540387|gb|EGF26973.1| protein containing DUF167 [Rhodopirellula baltica WH47] Length = 87 Score = 71.5 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 44/75 (58%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 +R+ P AKK+ + L +K+ V P+ GKANKA++A LAK L +SK + Sbjct: 1 MRVTPKAKKASVGGLHDGA-------LKVSVHMVPEDGKANKAVIASLAKWLRVSKGRVA 53 Query: 66 MLSKQSSPLKIIYID 80 +++ ++S LK I ++ Sbjct: 54 IVAGETSRLKTIVVE 68 >gi|320539543|ref|ZP_08039210.1| putative conserved protein [Serratia symbiotica str. Tucson] gi|320030396|gb|EFW12408.1| putative conserved protein [Serratia symbiotica str. Tucson] Length = 97 Score = 71.5 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 37/76 (48%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ +++S+ Sbjct: 13 IRLYIQPKASRDKIIGLH-------GDEVKVAITAPPVDGQANAHLIKFLAKQFKVARSN 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K ++I Sbjct: 66 VTIEKGELGRHKQLHI 81 >gi|322418850|ref|YP_004198073.1| hypothetical protein GM18_1329 [Geobacter sp. M18] gi|320125237|gb|ADW12797.1| protein of unknown function DUF167 [Geobacter sp. M18] Length = 104 Score = 71.5 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 45/87 (51%), Gaps = 8/87 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P A +S I PKD ++I++T+ P ANK + ++AK L L+KS + Sbjct: 19 VHVQPRASRSEICG---PKDGE----LRIRLTSPPVDDAANKQCVELIAKSLGLAKSKVS 71 Query: 66 MLSKQSSPLKIIYIDK-DCKEITELLQ 91 + S S K++ ++ D ++ L + Sbjct: 72 IKSGAKSRHKVVRVEGVDQDDLLRLFK 98 >gi|157374368|ref|YP_001472968.1| hypothetical protein Ssed_1229 [Shewanella sediminis HAW-EB3] gi|189038638|sp|A8FSL7|Y1229_SHESH RecName: Full=UPF0235 protein Ssed_1229 gi|157316742|gb|ABV35840.1| protein of unknown function DUF167 [Shewanella sediminis HAW-EB3] Length = 95 Score = 71.5 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +KI +TA P GKAN ++ L+K + K Sbjct: 13 LNLYIQPKASRDQIVGVH-------GEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGD 65 Query: 64 LRMLSKQSSPLKIIYI 79 + +L Q K I I Sbjct: 66 IVILKGQLGRHKQIKI 81 >gi|301154876|emb|CBW14339.1| conserved protein [Haemophilus parainfluenzae T3T1] Length = 95 Score = 71.5 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLKIILQPKASKDQIVGLHDDE-------LKITITAPPVDGQANAHLLKFLSKAFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDK---DCKEITELL 90 S+ + + + K ++I EI LL Sbjct: 65 SIVLEKGELNRHKQVWIPSPKLIPSEIQNLL 95 >gi|258593710|emb|CBE70051.1| conserved hypothetical protein [NC10 bacterium 'Dutch sediment'] Length = 102 Score = 71.5 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 7/92 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A + I D ++++V A P +G+AN A L +LAK L L Sbjct: 11 ASFRVRLQPKASREAI-------DGEVDGVLRLRVNAPPVEGQANDACLRLLAKTLDLPI 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S L +++ Q + +K I + ++ NN Sbjct: 64 SRLGIVAGQQARVKTIRVTDASADLLRTALNN 95 >gi|261822847|ref|YP_003260953.1| hypothetical protein Pecwa_3610 [Pectobacterium wasabiae WPP163] gi|261606860|gb|ACX89346.1| protein of unknown function DUF167 [Pectobacterium wasabiae WPP163] Length = 96 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQ 91 + + + K I I +I + ++ Sbjct: 66 VVIEKGELGRHKQIRITHPQHIPADIADFIE 96 >gi|325577506|ref|ZP_08147868.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae ATCC 33392] gi|325160610|gb|EGC72734.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae ATCC 33392] Length = 95 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLKIILQPKASKDQIVGLHDDE-------LKITITAPPVDGQANAHLLKFLSKTFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDK---DCKEITELL 90 S+ + + + K ++I EI LL Sbjct: 65 SIVLEKGELNRHKQVWIPSPKLIPSEIQNLL 95 >gi|271501907|ref|YP_003334933.1| hypothetical protein Dd586_3394 [Dickeya dadantii Ech586] gi|270345462|gb|ACZ78227.1| protein of unknown function DUF167 [Dickeya dadantii Ech586] Length = 96 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGM 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VTIEKGELGRHKQIRI 81 >gi|226942094|ref|YP_002797168.1| hypothetical protein LHK_03181 [Laribacter hongkongensis HLHK9] gi|254801627|sp|C1D6C4|Y3181_LARHH RecName: Full=UPF0235 protein LHK_03181 gi|226717021|gb|ACO76159.1| Uncharacterized conserved protein [Laribacter hongkongensis HLHK9] Length = 97 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A+++ +A L +KI++ A P GKAN +LA LA+ L +S+ Sbjct: 11 VRLTLHVQPGARRTEVAGLH-------GDALKIRLAAPPVDGKANACLLAFLARGLGVSR 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S++ + S S K++ I E L Sbjct: 64 SAVTLKSGDCSRHKVVDIRGITPEAAAGL 92 >gi|227325979|ref|ZP_03830003.1| hypothetical protein PcarcW_01116 [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 96 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I Sbjct: 66 VVIEKGELGRHKQIRIT 82 >gi|163803734|ref|ZP_02197592.1| hypothetical protein 1103602000580_AND4_13653 [Vibrio sp. AND4] gi|159172453|gb|EDP57321.1| hypothetical protein AND4_13653 [Vibrio sp. AND4] Length = 96 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 8/83 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + + + K I I K ++I Sbjct: 67 VHIEKGELGRHKQIRI-KSPEQI 88 >gi|227112386|ref|ZP_03826042.1| hypothetical protein PcarbP_05447 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] gi|253689815|ref|YP_003019005.1| hypothetical protein PC1_3453 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|259646966|sp|C6DE33|Y3453_PECCP RecName: Full=UPF0235 protein PC1_3453 gi|251756393|gb|ACT14469.1| protein of unknown function DUF167 [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 96 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I Sbjct: 66 VVIEKGELGRHKQIRIT 82 >gi|91228690|ref|ZP_01262604.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01] gi|91187761|gb|EAS74079.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01] Length = 96 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I+ Sbjct: 67 VHIEKGELGRHKQIRIE 83 >gi|262393226|ref|YP_003285080.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25] gi|262336820|gb|ACY50615.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25] Length = 96 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I+ Sbjct: 67 VHIEKGELGRHKQIRIE 83 >gi|212636425|ref|YP_002312950.1| cytosolic protein [Shewanella piezotolerans WP3] gi|212557909|gb|ACJ30363.1| Cytosolic protein, putative [Shewanella piezotolerans WP3] Length = 90 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 36/90 (40%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + L+K + K Sbjct: 8 LQLYVQPKASRDQIVGLH-------GNELKIAITAPPIDGKANIHLAKYLSKAFKVPKGD 60 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELL 90 + +L + K + I EI LL Sbjct: 61 IDILKGLTGRHKQVLISSPKVIPPEIQNLL 90 >gi|328474077|gb|EGF44882.1| hypothetical protein VP10329_15255 [Vibrio parahaemolyticus 10329] Length = 96 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I+ Sbjct: 67 VHIEKGELGRHKQIRIE 83 >gi|294634379|ref|ZP_06712916.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685] gi|291092187|gb|EFE24748.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685] Length = 96 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LRLYIQPKASRDQIVGLH-------GEELKVAITAPPVDGQANAHLIKFIAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 ITIEKGELGRHKQLRI 81 >gi|157372267|ref|YP_001480256.1| hypothetical protein Spro_4033 [Serratia proteamaculans 568] gi|166979954|sp|A8GJ38|Y4033_SERP5 RecName: Full=UPF0235 protein Spro_4033 gi|157324031|gb|ABV43128.1| protein of unknown function DUF167 [Serratia proteamaculans 568] Length = 96 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K + Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFKVAKGN 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VTIEKGELGRHKQLRI 81 >gi|300115523|ref|YP_003762098.1| hypothetical protein Nwat_3056 [Nitrosococcus watsonii C-113] gi|299541460|gb|ADJ29777.1| protein of unknown function DUF167 [Nitrosococcus watsonii C-113] Length = 102 Score = 70.8 bits (172), Expect = 5e-11, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +RL P A+ + +KI++TA P +GKAN +L L K +S++ Sbjct: 15 VQIRLQPRARGDEVIGPH-------GNRLKIRITAPPVEGKANTQLLRFLVKTFQVSRNQ 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + +LS +S K + I+K K + Sbjct: 68 VYLLSGTASRDKRVRIEKPAKLL 90 >gi|296133363|ref|YP_003640610.1| protein of unknown function DUF167 [Thermincola sp. JR] gi|296031941|gb|ADG82709.1| protein of unknown function DUF167 [Thermincola potens JR] Length = 106 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 45/87 (51%), Gaps = 7/87 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++ P A K+ + ++ +K+K+TA P +G AN+A + A+ +++KS Sbjct: 12 TFKIKVQPKASKNELKGVQ-------GDSLKVKLTAPPVEGAANEACIRFFAELFSVAKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITEL 89 + +++ +S K++ + KE E Sbjct: 65 QVEIITGHTSRTKLLKVKGLTKEEAEK 91 >gi|109896736|ref|YP_659991.1| hypothetical protein Patl_0407 [Pseudoalteromonas atlantica T6c] gi|109699017|gb|ABG38937.1| conserved hypothetical protein [Pseudoalteromonas atlantica T6c] Length = 90 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P A + + L +K+ +TA P GKAN ++ LAK+ ++KS Sbjct: 3 LRIYTQPKASRDEVVGLH-------GDELKVAITAPPVDGKANTHLIKYLAKQCGVAKSK 55 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + Q + K + I K Sbjct: 56 VVITKGQLNRHKTVLISK 73 >gi|260913359|ref|ZP_05919840.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] gi|260632590|gb|EEX50760.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] Length = 100 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 36/79 (45%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKASKDQIVGLHDDE-------LKITITAPPIDGQANAHLLKFLSKTFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDK 81 S+ + + + K I + Sbjct: 65 SIVLEKGELNRHKQILVPN 83 >gi|251788394|ref|YP_003003115.1| hypothetical protein Dd1591_0757 [Dickeya zeae Ech1591] gi|247537015|gb|ACT05636.1| protein of unknown function DUF167 [Dickeya zeae Ech1591] Length = 100 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 17 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGM 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VTIEKGELGRHKQIRI 85 >gi|167623103|ref|YP_001673397.1| hypothetical protein Shal_1169 [Shewanella halifaxensis HAW-EB4] gi|167353125|gb|ABZ75738.1| protein of unknown function DUF167 [Shewanella halifaxensis HAW-EB4] Length = 90 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 35/90 (38%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + L+K + K Sbjct: 8 LQLYIQPKASRDQIVGLH-------GEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGD 60 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELL 90 + + Q K + I E+ LL Sbjct: 61 IEITKGQMGRHKQVKITAPKLIPAEVQNLL 90 >gi|87311293|ref|ZP_01093415.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM 3645] gi|87286033|gb|EAQ77945.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM 3645] Length = 100 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 27/78 (34%), Positives = 44/78 (56%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR +P +KK+ I + +K+ VTA P+ GKANKA++ +LAKKL L K Sbjct: 11 VILPVRALPGSKKNEIRGEQQGA-------LKVSVTAAPEDGKANKAIVELLAKKLVLRK 63 Query: 62 SSLRMLSKQSSPLKIIYI 79 S L +++ + K + + Sbjct: 64 SQLEIIAGHTHRQKRVLV 81 >gi|323493577|ref|ZP_08098698.1| hypothetical protein VIBR0546_04984 [Vibrio brasiliensis LMG 20546] gi|323312100|gb|EGA65243.1| hypothetical protein VIBR0546_04984 [Vibrio brasiliensis LMG 20546] Length = 96 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I +K+ +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVG-------QHGEELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K + I Sbjct: 67 ITIEKGELGRHKQVRIS 83 >gi|242238224|ref|YP_002986405.1| hypothetical protein Dd703_0772 [Dickeya dadantii Ech703] gi|242130281|gb|ACS84583.1| protein of unknown function DUF167 [Dickeya dadantii Ech703] Length = 99 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH------GNDEVKVAITAPPVDGQANAHLIQFMAKQFRVAKSR 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 67 VTIEKGELGRHKQLRI 82 >gi|269837719|ref|YP_003319947.1| hypothetical protein Sthe_1691 [Sphaerobacter thermophilus DSM 20745] gi|269786982|gb|ACZ39125.1| protein of unknown function DUF167 [Sphaerobacter thermophilus DSM 20745] Length = 102 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 43/88 (48%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P A ++ + + D + +++++ A P +G AN+A+ LA L L K Sbjct: 16 VTVRVTPRASRTQVDGV---ADGA----LRVRLAAPPVEGAANRALTEFLANLLRLPKRD 68 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + +++ K + + +++E L Sbjct: 69 VELVAGARGRQKTVLLRGLTPADVSERL 96 >gi|77918231|ref|YP_356046.1| hypothetical protein Pcar_0617 [Pelobacter carbinolicus DSM 2380] gi|123574815|sp|Q3A6Y1|Y617_PELCD RecName: Full=UPF0235 protein Pcar_0617 gi|77544314|gb|ABA87876.1| conserved hypothetical protein TIGR00251 [Pelobacter carbinolicus DSM 2380] Length = 95 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A ++ +A L+ +KI++T+ P +G ANK LAK L ++KS Sbjct: 14 LSVHVQPRASRNELAGLQ-------GESLKIRLTSPPVEGAANKLCREFLAKLLGVAKSR 66 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++S S K + I+ Sbjct: 67 VTLVSGDKSRHKRLLIEG 84 >gi|15603178|ref|NP_246251.1| hypothetical protein PM1313 [Pasteurella multocida subsp. multocida str. Pm70] gi|29839744|sp|Q9CLC6|Y1313_PASMU RecName: Full=UPF0235 protein PM1313 gi|12721676|gb|AAK03397.1| unknown [Pasteurella multocida subsp. multocida str. Pm70] Length = 99 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI +TA P G+AN +L L+K + KS Sbjct: 15 RLRIFLQPKASKDQIVGLHDNE-------LKITITAPPIDGQANAHLLKFLSKTFKVPKS 67 Query: 63 SLRMLSKQSSPLKIIYIDK 81 S+ + + + K I I Sbjct: 68 SIVLEKGELNRHKQILIPN 86 >gi|126657163|ref|ZP_01728329.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110] gi|126621434|gb|EAZ92145.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110] Length = 75 Score = 70.8 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + V++ PNAK+ I + H KI V + P GKAN+ ++ +LAK + Sbjct: 2 LLKLQVKVKPNAKQQKI-------EEMADNHFKIAVKSPPTDGKANQELITLLAKHFNVP 54 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS + + S SS K++ ID Sbjct: 55 KSHILIKSGVSSRNKLVEID 74 >gi|322834241|ref|YP_004214268.1| hypothetical protein Rahaq_3549 [Rahnella sp. Y9602] gi|321169442|gb|ADW75141.1| protein of unknown function DUF167 [Rahnella sp. Y9602] Length = 101 Score = 70.8 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 35/77 (45%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 17 LRLVIQPKASRDSLVGLH-------GDELKVAITAPPVDGQANTHLVKFLAKQFKVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K + I Sbjct: 70 VSIEKGELGRHKQVRIT 86 >gi|118578470|ref|YP_899720.1| hypothetical protein Ppro_0022 [Pelobacter propionicus DSM 2379] gi|118501180|gb|ABK97662.1| protein of unknown function DUF167 [Pelobacter propionicus DSM 2379] Length = 103 Score = 70.8 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A K+ I L +K+++T+ P G ANK LA L + KS+ Sbjct: 18 LNLYIQPRASKNEICGLVDNS-------LKLRLTSPPVDGAANKLCREFLADLLHVPKSA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + ++S ++S K + I ++ Sbjct: 71 VEIISGETSRHKRVRIATADSDLI 94 >gi|37523404|ref|NP_926781.1| hypothetical protein glr3835 [Gloeobacter violaceus PCC 7421] gi|47117439|sp|Q7NEP3|Y3835_GLOVI RecName: Full=UPF0235 protein glr3835 gi|35214408|dbj|BAC91776.1| glr3835 [Gloeobacter violaceus PCC 7421] Length = 111 Score = 70.8 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 36/90 (40%), Gaps = 7/90 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V P A S + + K+++ A P +GKAN +A++A + + Sbjct: 27 TLTVWAQPRASCSEVVGWQQNA-------FKVRLAAPPVEGKANAECVALIAAFFGVPRR 79 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++ Q K I I+ + LQ Sbjct: 80 QVSLVQGQQGRHKKIRIEAPADLLLVALQK 109 >gi|225873890|ref|YP_002755349.1| hypothetical protein ACP_2305 [Acidobacterium capsulatum ATCC 51196] gi|225793057|gb|ACO33147.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC 51196] Length = 112 Score = 70.8 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 3/78 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A +S + + +++ + A P G+AN+ +L LA++L L SS Sbjct: 16 LAVRVTPRASRSSFQGV---LEKEGQTMLRVALHAPPIDGRANEELLDFLARQLDLPGSS 72 Query: 64 LRMLSKQSSPLKIIYIDK 81 L ++ S K++ + Sbjct: 73 LEIIRGLQSREKLVRMTG 90 >gi|167854882|ref|ZP_02477658.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755] gi|219871635|ref|YP_002476010.1| hypothetical protein HAPS_1504 [Haemophilus parasuis SH0165] gi|254800538|sp|B8F6W0|Y1504_HAEPS RecName: Full=UPF0235 protein HAPS_1504 gi|167853949|gb|EDS25187.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755] gi|219691839|gb|ACL33062.1| conserved hypothetical protein [Haemophilus parasuis SH0165] Length = 97 Score = 70.8 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +TA P G+AN +L L+K + KS Sbjct: 14 RLRIFLQPKASRDQIVGLHDNE-------LKIAITAPPIDGQANAHLLKYLSKLFKVPKS 66 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ + + K I++ + K I + ++ Sbjct: 67 SIVLEKGELQRHKQIFVP-EPKLIPKEIE 94 >gi|218710632|ref|YP_002418253.1| hypothetical protein VS_2686 [Vibrio splendidus LGP32] gi|218323651|emb|CAV19947.1| Hypothetical protein VS_2686 [Vibrio splendidus LGP32] Length = 96 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 67 ITIEKGELGRHKQVRI 82 >gi|254497376|ref|ZP_05110179.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254353429|gb|EET12161.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 91 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 21/86 (24%), Positives = 40/86 (46%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AK + IA +KI++ A P +G+AN+A+L +A+ A+ Sbjct: 12 INLYIQPGAKHTEIAGFH-------GEALKIRLHAPPIEGRANEALLKFIAQIFAVPTRQ 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + S LK + I C + + Sbjct: 65 VVLKRGDKSRLKTLIITGSCIDPANI 90 >gi|257056705|ref|YP_003134537.1| hypothetical protein Svir_27290 [Saccharomonospora viridis DSM 43017] gi|256586577|gb|ACU97710.1| uncharacterized conserved protein [Saccharomonospora viridis DSM 43017] Length = 118 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 3/88 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +R+ P AK+ + + D + + + V A GKAN+A+ +LA+ L++ Sbjct: 20 RFAIRVKPGAKRDAVGGI---WDGALGEALVVSVRAPAVDGKANEAVCRVLAEALSVRAR 76 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 L ++ + K++ + E L Sbjct: 77 DLTVVKGHRARDKLVELRDPPPGCAERL 104 >gi|269965742|ref|ZP_06179839.1| conserved hypothetical protein [Vibrio alginolyticus 40B] gi|269829610|gb|EEZ83847.1| conserved hypothetical protein [Vibrio alginolyticus 40B] Length = 96 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K + I+ Sbjct: 67 VHIEKGELGRHKQVRIE 83 >gi|270263061|ref|ZP_06191331.1| threonine dehydratase [Serratia odorifera 4Rx13] gi|270042749|gb|EFA15843.1| threonine dehydratase [Serratia odorifera 4Rx13] Length = 96 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS+ Sbjct: 13 IRLYIQPKASRDQIIGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFKVAKSN 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VTIEKGELGRHKQLRI 81 >gi|330431471|gb|AEC16530.1| hypothetical protein UMN179_00494 [Gallibacterium anatis UMN179] Length = 95 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 10/87 (11%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A K I L +KI +TA P GKAN +L L+K+ ++K+ + + Sbjct: 16 LQPKAGKDQIVGLY-------GDELKITITAPPIDGKANAHLLKFLSKQFKVAKTQIELR 68 Query: 68 SKQSSPLKIIYI---DKDCKEITELLQ 91 + S K ++I ++ + I +LL+ Sbjct: 69 KGELSRHKQVFIPSPEQIPQPILDLLE 95 >gi|238795055|ref|ZP_04638648.1| hypothetical protein yinte0001_4160 [Yersinia intermedia ATCC 29909] gi|238725603|gb|EEQ17164.1| hypothetical protein yinte0001_4160 [Yersinia intermedia ATCC 29909] Length = 90 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLIKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K I I + E+ LL+ Sbjct: 60 VVIEKGELGRHKQIKIVNPQQIPPEVAALLE 90 >gi|262273743|ref|ZP_06051556.1| hypothetical protein VHA_000718 [Grimontia hollisae CIP 101886] gi|262222158|gb|EEY73470.1| hypothetical protein VHA_000718 [Grimontia hollisae CIP 101886] Length = 101 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 32/76 (42%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + +K+ +TA P GKAN + L+K+ ++KS Sbjct: 16 LRLYIQPKASRDQWMG-------QHGEEIKLAITAPPVDGKANAHLTKFLSKQFKVAKSQ 68 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 69 IDIEKGELGRHKQVRI 84 >gi|94971231|ref|YP_593279.1| hypothetical protein Acid345_4205 [Candidatus Koribacter versatilis Ellin345] gi|166227235|sp|Q1IIU5|Y4205_ACIBL RecName: Full=UPF0235 protein Acid345_4205 gi|94553281|gb|ABF43205.1| protein of unknown function DUF167 [Candidatus Koribacter versatilis Ellin345] Length = 96 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 44/84 (52%), Gaps = 7/84 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P AKK+ I +K+ VT P G+AN+A++ +A L +++ Sbjct: 11 VSFAVRLQPKAKKTAIIG-------ELNGALKLGVTDPPIDGRANEALIRFVAGLLKVTR 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 SS+ + + +SS K+I I+ E Sbjct: 64 SSVTIAAGESSRNKVIRIEGVTAE 87 >gi|59711034|ref|YP_203810.1| hypothetical protein VF_0427 [Vibrio fischeri ES114] gi|59479135|gb|AAW84922.1| conserved protein [Vibrio fischeri ES114] Length = 95 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + I + +KI +TA P GKAN ++ +K ++K Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGK 64 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELL 90 + + + + K + I + EI +LL Sbjct: 65 ITVEKGELNRHKQVRIHSPELIPNEIQQLL 94 >gi|328952006|ref|YP_004369340.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109] gi|328452330|gb|AEB08159.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109] Length = 101 Score = 70.4 bits (171), Expect = 8e-11, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + ++P A + I +KI++ A P+KG ANK +L LAK L L K+ Sbjct: 13 LRIHVVPGAASNQIMGPH-------GDRLKIRIAAAPEKGAANKELLNYLAKCLGLPKNR 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 L + S +K++ + E+ E LQ Sbjct: 66 LHLKSGAQDRVKVVEVVGLAPEVQERLQA 94 >gi|84394062|ref|ZP_00992798.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01] gi|84375304|gb|EAP92215.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01] Length = 96 Score = 70.4 bits (171), Expect = 9e-11, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYI 79 +++ + K + I Sbjct: 67 IKIEKGELGRHKQVRI 82 >gi|30248412|ref|NP_840482.1| hypothetical protein NE0395 [Nitrosomonas europaea ATCC 19718] gi|47117515|sp|Q82X93|Y395_NITEU RecName: Full=UPF0235 protein NE0395 gi|30138298|emb|CAD84306.1| DUF167 [Nitrosomonas europaea ATCC 19718] Length = 100 Score = 70.4 bits (171), Expect = 9e-11, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 43/96 (44%), Gaps = 10/96 (10%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + P A+++ + +KIK+ A P GKAN+A+ LAK+ + Sbjct: 12 LLILKLYVQPGARQTEAVGI-------CGEELKIKLAALPVDGKANRALTEFLAKRFNVP 64 Query: 61 KSSLRMLSKQSSPLKIIYI---DKDCKEITELLQNN 93 + ++ + + S K++ + + + ++ Sbjct: 65 RKNITLKRGEQSRHKVVEVCQSSNGPEVLFSEMRAE 100 >gi|123443630|ref|YP_001007602.1| hypothetical protein YE3436 [Yersinia enterocolitica subsp. enterocolitica 8081] gi|166232591|sp|A1JPU6|Y3436_YERE8 RecName: Full=UPF0235 protein YE3436 gi|122090591|emb|CAL13460.1| conserved hypothetical protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 96 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K I I + E+ LL+ Sbjct: 66 VIIEKGELGRHKQIKIVNPQQIPPEVATLLE 96 >gi|197335333|ref|YP_002155183.1| hypothetical protein VFMJ11_0427 [Vibrio fischeri MJ11] gi|197316823|gb|ACH66270.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 95 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + I + +KI +TA P GKAN ++ +K ++K Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGK 64 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELL 90 + + + + K + I + EI +LL Sbjct: 65 ITVEKGELNRHKQVRIHSPEIIPNEIQQLL 94 >gi|119384906|ref|YP_915962.1| hypothetical protein Pden_2174 [Paracoccus denitrificans PD1222] gi|189039514|sp|A1B422|Y2174_PARDP RecName: Full=UPF0235 protein Pden_2174 gi|119374673|gb|ABL70266.1| protein of unknown function DUF167 [Paracoccus denitrificans PD1222] Length = 82 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + D +++ VT P+ GKAN A++ +LAK L ++K Sbjct: 12 AEIAVRVTPRASRNAVI--------LDGEAIRVTVTTVPEDGKANAAVVKLLAKALGVAK 63 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ ++ K+ ID Sbjct: 64 SRLVLVRGATARDKLFRID 82 >gi|325294222|ref|YP_004280736.1| yggU [Desulfurobacterium thermolithotrophum DSM 11699] gi|325064670|gb|ADY72677.1| UPF0235 protein yggU [Desulfurobacterium thermolithotrophum DSM 11699] Length = 101 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 46/78 (58%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P A ++ I +E + +KIKVT P+ GKAN+ ++ +L+K L + K Sbjct: 13 IEVKVQPKASRNKIEKVEEGR-------LKIKVTVPPEGGKANQKIIELLSKALKVPKRD 65 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++ ++S +K++ I+ Sbjct: 66 IDIVKGETSRIKVVRIEG 83 >gi|288942590|ref|YP_003444830.1| hypothetical protein Alvin_2893 [Allochromatium vinosum DSM 180] gi|288897962|gb|ADC63798.1| protein of unknown function DUF167 [Allochromatium vinosum DSM 180] Length = 100 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 5/76 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P A K A + D + ++++ A P +GKAN+A+ +A ++ S Sbjct: 13 LRLRVQPRAPKDAFAEPDPSGD-----YYRVRLKAPPVEGKANQALRRFVADAFEVTLSQ 67 Query: 64 LRMLSKQSSPLKIIYI 79 + +LS + + K + I Sbjct: 68 VEILSGEQARYKRLRI 83 >gi|28899393|ref|NP_798998.1| hypothetical protein VP2619 [Vibrio parahaemolyticus RIMD 2210633] gi|260366264|ref|ZP_05778723.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|260878919|ref|ZP_05891274.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|260898280|ref|ZP_05906776.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|33301894|sp|Q87LJ3|Y2619_VIBPA RecName: Full=UPF0235 protein VP2619 gi|28807629|dbj|BAC60882.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085856|gb|EFO35551.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308090459|gb|EFO40154.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308113515|gb|EFO51055.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 96 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I+ Sbjct: 67 VHIEKGELGRHKQIRIE 83 >gi|161723222|ref|NP_219898.2| hypothetical protein CT388 [Chlamydia trachomatis D/UW-3/CX] gi|162019865|ref|YP_328205.2| hypothetical protein CTA_0423 [Chlamydia trachomatis A/HAR-13] Length = 100 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 48/83 (57%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ A+++ + LE ++++VT P+KGKAN A++A+LA L++ KS Sbjct: 8 LEVRVTTKARENRVVCLED-------GILRVRVTEVPEKGKANDAVVALLANFLSIPKSD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + +++ ++S K + + + K Sbjct: 61 VTLIAGEASRRKKVLLPRSIKAF 83 >gi|146276431|ref|YP_001166590.1| hypothetical protein Rsph17025_0379 [Rhodobacter sphaeroides ATCC 17025] gi|145554672|gb|ABP69285.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC 17025] Length = 84 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 38/76 (50%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A + I E +++ VT P+ GKAN+A+ +LAK L ++KS L Sbjct: 17 AVRVTPRASRERIEVQE--------GTVRVHVTCVPEDGKANRAVTEVLAKALGVAKSRL 68 Query: 65 RMLSKQSSPLKIIYID 80 ++ + K +D Sbjct: 69 TLVRGATGRDKTFRLD 84 >gi|332162814|ref|YP_004299391.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|318604345|emb|CBY25843.1| upf0235 protein VC0458 [Yersinia enterocolitica subsp. palearctica Y11] gi|325667044|gb|ADZ43688.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330859004|emb|CBX69362.1| UPF0235 protein YE3436 [Yersinia enterocolitica W22703] Length = 96 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K I I + E+ LL+ Sbjct: 66 VIIEKGELGRHKQIKIVNPQQIPPEVAALLE 96 >gi|154244464|ref|YP_001415422.1| hypothetical protein Xaut_0507 [Xanthobacter autotrophicus Py2] gi|154158549|gb|ABS65765.1| protein of unknown function DUF167 [Xanthobacter autotrophicus Py2] Length = 110 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 2/84 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V VR P + + + D +KI+V P+ G A A+ +LA+ ++ Sbjct: 11 VEVTVRATPRGGRDALDGVAELSDGRAV--LKIRVKVAPEDGAATAAVARVLAQAAGVAA 68 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 S +R+ S ++ +K I D + Sbjct: 69 SQVRLASGATARVKTFRIAGDAAK 92 >gi|283788525|ref|YP_003368390.1| hypothetical protein ROD_50331 [Citrobacter rodentium ICC168] gi|282951979|emb|CBG91706.1| conserved hypothetical protein [Citrobacter rodentium ICC168] Length = 96 Score = 70.0 bits (170), Expect = 9e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VAIEKGELGRHKQVKI 81 >gi|240103723|ref|YP_002960032.1| hypothetical protein TGAM_1666 [Thermococcus gammatolerans EJ3] gi|239911277|gb|ACS34168.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3] Length = 124 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 46/87 (52%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 ++V + P AKK+ I ++ + +K+KV A P GKANK ++ L+K L + Sbjct: 44 LLVYVQPKAKKNEIEGIDEWR-----GRLKVKVKAPPVGGKANKELVKFLSKVLG---AE 95 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ ++S K + + +++ + L Sbjct: 96 VELVRGETSREKDLLVRMSAEDVKKRL 122 >gi|85060008|ref|YP_455710.1| hypothetical protein SG2030 [Sodalis glossinidius str. 'morsitans'] gi|123518874|sp|Q2NRC0|Y2030_SODGM RecName: Full=UPF0235 protein SG2030 gi|84780528|dbj|BAE75305.1| conserved hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 101 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 42/90 (46%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + IA + +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 17 LRLYIQPRASRDHIAG-------AHGDEIKVAITAPPVDGQANSHLIRFLAKEFGVAKSR 69 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K + ID+ + I LL Sbjct: 70 VILEKGELGRHKQLRIDQPRQLPEVIARLL 99 >gi|148980491|ref|ZP_01816088.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3] gi|145961216|gb|EDK26530.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3] Length = 96 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYI 79 +++ + K + I Sbjct: 67 IKIEKGELGRHKQVRI 82 >gi|315633577|ref|ZP_07888867.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] gi|315477619|gb|EFU68361.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] Length = 97 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKDQIVGLHDDE-------LKISITAPPVDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYID 80 S+ + + + K + I Sbjct: 65 SIVLEKGELNRHKQVLIP 82 >gi|90412037|ref|ZP_01220044.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK] gi|90327015|gb|EAS43394.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK] Length = 97 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++KS Sbjct: 15 IRLYIQPKASRDQIVGLH-------GEELKIAITAPPVDGKANAHLSKFLAKQFRVAKSQ 67 Query: 64 LRMLSKQSSPLKIIYID 80 + + K + I+ Sbjct: 68 VLIEKGMQGRHKQVRIE 84 >gi|152971905|ref|YP_001337014.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] gi|206579395|ref|YP_002236595.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae 342] gi|238896484|ref|YP_002921222.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae NTUH-K2044] gi|262042605|ref|ZP_06015761.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|329998593|ref|ZP_08303177.1| TIGR00251 family protein [Klebsiella sp. MS 92-3] gi|166990767|sp|A6TDW3|Y3323_KLEP7 RecName: Full=UPF0235 protein KPN78578_33230 gi|226708043|sp|B5XU96|Y722_KLEP3 RecName: Full=UPF0235 protein KPK_0722 gi|150956754|gb|ABR78784.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] gi|206568453|gb|ACI10229.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae 342] gi|238548804|dbj|BAH65155.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] gi|259040039|gb|EEW41154.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|328538612|gb|EGF64712.1| TIGR00251 family protein [Klebsiella sp. MS 92-3] Length = 96 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGVH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VLIEKGELGRHKQVKI 81 >gi|307132433|ref|YP_003884449.1| hypothetical protein Dda3937_02598 [Dickeya dadantii 3937] gi|306529962|gb|ADM99892.1| conserved protein [Dickeya dadantii 3937] Length = 100 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LA++ ++K Sbjct: 17 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLARQFRVAKGM 69 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELL 90 + + + K I I ++ EL+ Sbjct: 70 VTIEKGELGRHKQIRIVNPQAIPADVAELI 99 >gi|218885259|ref|YP_002434580.1| hypothetical protein DvMF_0151 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218756213|gb|ACL07112.1| protein of unknown function DUF167 [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 106 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 43/81 (53%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+VR +P A+KS D +K+++ A KANKA+ +A L + ++ Sbjct: 22 VLVRAVPGARKSACEG---TADGR----LKVRLAAPAVDNKANKALEEFVASALGMRRNR 74 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 +R++S +S LK + ++ D + Sbjct: 75 VRLVSGHTSRLKKLIVESDVE 95 >gi|86146416|ref|ZP_01064740.1| hypothetical protein MED222_22586 [Vibrio sp. MED222] gi|85835895|gb|EAQ54029.1| hypothetical protein MED222_22586 [Vibrio sp. MED222] Length = 96 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYI 79 +++ + K + I Sbjct: 67 IKIEKGELGRHKQVRI 82 >gi|332172191|gb|AEE21445.1| protein of unknown function DUF167 [Glaciecola agarilytica 4H-3-7+YE-5] Length = 98 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +KI +TA P GKAN + LAK+ ++KS Sbjct: 14 LRIYIQPKAARDEIVGMH-------GDALKIAITAPPVDGKANAHLCKYLAKQCGVAKSK 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + Q + K + + Sbjct: 67 VAITKGQLNRHKTVVV 82 >gi|238786243|ref|ZP_04630189.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC 43970] gi|238798802|ref|ZP_04642272.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC 43969] gi|238712858|gb|EEQ04924.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC 43970] gi|238717373|gb|EEQ09219.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC 43969] Length = 90 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 60 VIIEKGELGRHKQIKI 75 >gi|238918244|ref|YP_002931758.1| hypothetical protein NT01EI_0281 [Edwardsiella ictaluri 93-146] gi|259646912|sp|C5BCR7|Y281_EDWI9 RecName: Full=UPF0235 protein NT01EI_0281 gi|238867812|gb|ACR67523.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146] Length = 96 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 35/77 (45%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN +L +AK+ ++KS Sbjct: 13 LRLYIQPKASRDLIIGLH-------GDELKVAITAPPVDGQANAHLLKFIAKQFRVAKSR 65 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K + I Sbjct: 66 ITLEKGELGRHKQLRIS 82 >gi|33151941|ref|NP_873294.1| hypothetical protein HD0778 [Haemophilus ducreyi 35000HP] gi|47117473|sp|Q7VN15|Y778_HAEDU RecName: Full=UPF0235 protein HD_0778 gi|33148163|gb|AAP95683.1| conserved hypothetical protein [Haemophilus ducreyi 35000HP] Length = 97 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +K+ +TA P G AN +L L+K + KS Sbjct: 14 RLRIFLQPKASRDQIVGLHDNE-------LKVAITAPPVDGAANAYLLKYLSKLFKVPKS 66 Query: 63 SLRMLSKQSSPLKIIYIDKD---CKEITELL 90 S+ + + K +++ KEI + L Sbjct: 67 SIVLEKGELQRHKQLFVPAPKLLPKEIEQWL 97 >gi|288933577|ref|YP_003437636.1| hypothetical protein Kvar_0694 [Klebsiella variicola At-22] gi|290511356|ref|ZP_06550725.1| conserved hypothetical protein [Klebsiella sp. 1_1_55] gi|288888306|gb|ADC56624.1| protein of unknown function DUF167 [Klebsiella variicola At-22] gi|289776349|gb|EFD84348.1| conserved hypothetical protein [Klebsiella sp. 1_1_55] Length = 96 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGVH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VLIEKGELGRHKQVKI 81 >gi|190150700|ref|YP_001969225.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307264052|ref|ZP_07545650.1| hypothetical protein appser13_14550 [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|226734158|sp|B3GYF9|Y1431_ACTP7 RecName: Full=UPF0235 protein APP7_1431 gi|189915831|gb|ACE62083.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306870598|gb|EFN02344.1| hypothetical protein appser13_14550 [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 97 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +TA P G AN +L L+K + KS Sbjct: 14 RLRIFLQPKASRDQIVGLHDSE-------LKIAITAPPVDGAANAHLLKYLSKLFKVPKS 66 Query: 63 SLRMLSKQSSPLKIIYIDK---DCKEITELL 90 S+ + + K +++ + KEI LL Sbjct: 67 SIVLEKGELQRHKQLFVPEPKLIPKEIEALL 97 >gi|261342370|ref|ZP_05970228.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316] gi|288315005|gb|EFC53943.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316] Length = 98 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKI 81 >gi|225164758|ref|ZP_03726990.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] gi|224800632|gb|EEG18996.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] Length = 111 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 40/80 (50%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + ++ IPNA ++ IA +K+KV+A +G+AN+ + LA+ L + + Sbjct: 20 CILSIKAIPNASRNAIAGW-------LGDALKVKVSAPALEGRANEQLCDFLAETLGIPR 72 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 ++ + + S K I I Sbjct: 73 RAVTVAGGEKSRQKRIQIIG 92 >gi|258620721|ref|ZP_05715756.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258625606|ref|ZP_05720488.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582108|gb|EEW06975.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258586919|gb|EEW11633.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 97 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 41/92 (44%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKFLAKQCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQN 92 + + + K + I + EI LL++ Sbjct: 66 VVIEKGELGRHKQVRIQQPSQIPPEIAALLES 97 >gi|310822209|ref|YP_003954567.1| hypothetical protein STAUR_4962 [Stigmatella aurantiaca DW4/3-1] gi|309395281|gb|ADO72740.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 98 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 39/80 (48%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P A ++ + +K+++ A P G+AN A++ LAK+L L + Sbjct: 12 VELAVLVQPRASRTRVVGEHDGM-------LKLQLAAPPVDGEANAALVEFLAKRLGLPR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 + +++ ++ K +++ Sbjct: 65 RQVTLVAGDAARRKRVFLAG 84 >gi|54310243|ref|YP_131263.1| hypothetical protein PBPRA3146 [Photobacterium profundum SS9] gi|46914684|emb|CAG21461.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 101 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 33/77 (42%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 19 IRLYIQPKASRDQIVGLH-------GEELKIAITAPPVDGKANAHLSKFLAKQFRVAKGQ 71 Query: 64 LRMLSKQSSPLKIIYID 80 + + K + I+ Sbjct: 72 VLIEKGMQGRHKQVRIE 88 >gi|166154599|ref|YP_001654717.1| hypothetical protein CTL0644 [Chlamydia trachomatis 434/Bu] gi|166155474|ref|YP_001653729.1| hypothetical protein CTLon_0641 [Chlamydia trachomatis L2b/UCH-1/proctitis] gi|237802813|ref|YP_002888007.1| hypothetical protein JALI_3871 [Chlamydia trachomatis B/Jali20/OT] gi|237804735|ref|YP_002888889.1| hypothetical protein CTB_3871 [Chlamydia trachomatis B/TZ1A828/OT] gi|255311194|ref|ZP_05353764.1| hypothetical protein Ctra62_02010 [Chlamydia trachomatis 6276] gi|255317495|ref|ZP_05358741.1| hypothetical protein Ctra6_02000 [Chlamydia trachomatis 6276s] gi|255348753|ref|ZP_05380760.1| hypothetical protein Ctra70_02050 [Chlamydia trachomatis 70] gi|255503293|ref|ZP_05381683.1| hypothetical protein Ctra7_02060 [Chlamydia trachomatis 70s] gi|255506972|ref|ZP_05382611.1| hypothetical protein CtraD_02040 [Chlamydia trachomatis D(s)2923] gi|301335866|ref|ZP_07224110.1| hypothetical protein CtraL_03535 [Chlamydia trachomatis L2tet1] gi|29839454|sp|O84393|Y388_CHLTR RecName: Full=UPF0235 protein CT_388 gi|123606912|sp|Q3KLW5|Y423_CHLTA RecName: Full=UPF0235 protein CTA_0423 gi|226707987|sp|B0BC23|Y641_CHLTB RecName: Full=UPF0235 protein CTLon_0641 gi|226707989|sp|B0B7V8|Y644_CHLT2 RecName: Full=UPF0235 protein CTL0644 gi|3328814|gb|AAC67985.1| hypothetical protein CT_388 [Chlamydia trachomatis D/UW-3/CX] gi|76167649|gb|AAX50657.1| hypothetical cytosolic protein [Chlamydia trachomatis A/HAR-13] gi|165930587|emb|CAP04084.1| conserved hypothetical protein [Chlamydia trachomatis 434/Bu] gi|165931462|emb|CAP07038.1| conserved hypothetical protein [Chlamydia trachomatis L2b/UCH-1/proctitis] gi|231273035|emb|CAX09948.1| conserved hypothetical protein [Chlamydia trachomatis B/TZ1A828/OT] gi|231274047|emb|CAX10841.1| conserved hypothetical protein [Chlamydia trachomatis B/Jali20/OT] gi|289525430|emb|CBJ14907.1| conserved hypothetical protein [Chlamydia trachomatis Sweden2] gi|296434982|gb|ADH17160.1| hypothetical protein E150_02035 [Chlamydia trachomatis E/150] gi|296435909|gb|ADH18083.1| hypothetical protein G9768_02005 [Chlamydia trachomatis G/9768] gi|296436835|gb|ADH19005.1| hypothetical protein G11222_02005 [Chlamydia trachomatis G/11222] gi|296437769|gb|ADH19930.1| hypothetical protein G11074_02005 [Chlamydia trachomatis G/11074] gi|296438702|gb|ADH20855.1| hypothetical protein E11023_02020 [Chlamydia trachomatis E/11023] gi|297140269|gb|ADH97027.1| hypothetical protein CTG9301_02010 [Chlamydia trachomatis G/9301] gi|297748518|gb|ADI51064.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-EC] gi|297749398|gb|ADI52076.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-LC] Length = 115 Score = 69.6 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 48/83 (57%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ A+++ + LE ++++VT P+KGKAN A++A+LA L++ KS Sbjct: 23 LEVRVTTKARENRVVCLED-------GILRVRVTEVPEKGKANDAVVALLANFLSIPKSD 75 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + +++ ++S K + + + K Sbjct: 76 VTLIAGEASRRKKVLLPRSIKAF 98 >gi|269137631|ref|YP_003294331.1| hypothetical protein ETAE_0273 [Edwardsiella tarda EIB202] gi|267983291|gb|ACY83120.1| hypothetical protein ETAE_0273 [Edwardsiella tarda EIB202] gi|304557696|gb|ADM40360.1| hypothetical protein ETAF_0236 [Edwardsiella tarda FL6-60] Length = 96 Score = 69.6 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 35/77 (45%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN +L +AK+ ++KS Sbjct: 13 LRLYIQPKASRDLIIGLH-------GDELKVAITAPPVDGQANAHLLKFIAKQFRVAKSR 65 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K + I Sbjct: 66 ITLEKGELGRHKQLRIS 82 >gi|238763264|ref|ZP_04624229.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC 33638] gi|238698537|gb|EEP91289.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC 33638] Length = 90 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K I I + E+ LL+ Sbjct: 60 VIIEKGELGRHKQIKIVNPQQIPPEVAALLE 90 >gi|295097492|emb|CBK86582.1| conserved hypothetical protein TIGR00251 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 98 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKI 81 >gi|15835282|ref|NP_297041.1| hypothetical protein TC0667 [Chlamydia muridarum Nigg] gi|270285456|ref|ZP_06194850.1| hypothetical protein CmurN_03383 [Chlamydia muridarum Nigg] gi|270289467|ref|ZP_06195769.1| hypothetical protein CmurW_03473 [Chlamydia muridarum Weiss] gi|301336853|ref|ZP_07225055.1| hypothetical protein CmurM_03440 [Chlamydia muridarum MopnTet14] gi|29839670|sp|Q9PK06|Y667_CHLMU RecName: Full=UPF0235 protein TC_0667 gi|7190702|gb|AAF39489.1| conserved hypothetical protein [Chlamydia muridarum Nigg] Length = 100 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 50/83 (60%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ A+++ + SLE ++++VT P++GKAN A++A+LAK L++ K+ Sbjct: 8 LEIRVTTKARENKVVSLED-------GILRVRVTEAPERGKANDAVVALLAKFLSIPKND 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + +++ ++S K + + + K Sbjct: 61 VTLIAGEASRRKKVLLPRAIKAF 83 >gi|238759331|ref|ZP_04620497.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236] gi|238702492|gb|EEP95043.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236] Length = 90 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 60 VILEKGELGRHKQIKI 75 >gi|146312998|ref|YP_001178072.1| hypothetical protein Ent638_3359 [Enterobacter sp. 638] gi|166990826|sp|A4WE91|Y3359_ENT38 RecName: Full=UPF0235 protein Ent638_3359 gi|145319874|gb|ABP62021.1| protein of unknown function DUF167 [Enterobacter sp. 638] Length = 96 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKI 81 >gi|261250161|ref|ZP_05942737.1| hypothetical protein VIA_000181 [Vibrio orientalis CIP 102891] gi|260939277|gb|EEX95263.1| hypothetical protein VIA_000181 [Vibrio orientalis CIP 102891] Length = 96 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVG-------QHGEELKIAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 67 ITIEKGELGRHKQVRI 82 >gi|261867124|ref|YP_003255046.1| hypothetical protein D11S_0417 [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412456|gb|ACX81827.1| hypothetical protein D11S_0417 [Aggregatibacter actinomycetemcomitans D11S-1] Length = 97 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKDQIVGLHDDE-------LKISITAPPVDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYID 80 S+ + + + K + I Sbjct: 65 SIVLEKGELNRHKQVLIP 82 >gi|167648715|ref|YP_001686378.1| hypothetical protein Caul_4760 [Caulobacter sp. K31] gi|167351145|gb|ABZ73880.1| protein of unknown function DUF167 [Caulobacter sp. K31] Length = 96 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 46/89 (51%), Gaps = 3/89 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL P + I + D ++K++V A P +G AN A+LA LAK L L KS Sbjct: 2 RLAVRLTPRGGREAIDGWAV--DGDGRPYLKVRVAAPPVEGAANAALLAFLAKALGLPKS 59 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELL 90 +L + S + LK+I I D + +L Sbjct: 60 ALTLASGAGARLKLIEIAGCDPLSLERVL 88 >gi|186683679|ref|YP_001866875.1| hypothetical protein Npun_R3528 [Nostoc punctiforme PCC 73102] gi|226703856|sp|B2J1T0|Y3528_NOSP7 RecName: Full=UPF0235 protein Npun_R3528 gi|186466131|gb|ACC81932.1| protein of unknown function DUF167 [Nostoc punctiforme PCC 73102] Length = 75 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 30/79 (37%), Positives = 48/79 (60%), Gaps = 7/79 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PN+K+ I E D S T+++K + P GKAN+ ++ +LAKK ++KS + Sbjct: 4 KVKVKPNSKQQKI---EEQPDGSLTVYLK----SPPVDGKANEELIKLLAKKFDVAKSDI 56 Query: 65 RMLSKQSSPLKIIYIDKDC 83 R+ S SS K+I ID+D Sbjct: 57 RIKSGLSSRQKLIEIDRDV 75 >gi|301064653|ref|ZP_07205047.1| conserved hypothetical protein TIGR00251 [delta proteobacterium NaphS2] gi|300441273|gb|EFK05644.1| conserved hypothetical protein TIGR00251 [delta proteobacterium NaphS2] Length = 95 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 41/78 (52%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+++P + ++ I +IK+TA P +GKANKA++ LAKK K Sbjct: 12 IRVKVLPRSSRTEIVG-------KTDGIYRIKLTAPPVEGKANKALINFLAKKTGSPKQK 64 Query: 64 LRMLSKQSSPLKIIYIDK 81 +R++ + S K I I+ Sbjct: 65 IRIVKGEQSRNKTIRIEN 82 >gi|238752326|ref|ZP_04613805.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380] gi|238709487|gb|EEQ01726.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380] Length = 90 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFKVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + E+ LL+ Sbjct: 60 VIIEKGELGRHKQLKIVNPQQIPPEVAALLE 90 >gi|27364893|ref|NP_760421.1| hypothetical protein VV1_1522 [Vibrio vulnificus CMCP6] gi|29839706|sp|Q8DCB7|Y1522_VIBVU RecName: Full=UPF0235 protein VV1_1522 gi|27361038|gb|AAO09948.1| UPF0235 protein [Vibrio vulnificus CMCP6] Length = 96 Score = 69.2 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + +L K ++KS Sbjct: 14 LRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAKSL 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + + Sbjct: 67 VTIEKGELGRHKQVRV 82 >gi|62129245|gb|AAX66948.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] Length = 100 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 70 IVIEKGELGRHKQVKI 85 >gi|148555779|ref|YP_001263361.1| hypothetical protein Swit_2871 [Sphingomonas wittichii RW1] gi|148500969|gb|ABQ69223.1| protein of unknown function DUF167 [Sphingomonas wittichii RW1] Length = 74 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 36/70 (51%) Query: 24 KDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDC 83 D + IK+ A P G AN+A++ ++AK L ++K + + S +S LK +++ D Sbjct: 3 ADADGRRWLSIKLAAAPSDGAANEALVRLVAKALGVAKRDVTLASGATSRLKRLHVSGDP 62 Query: 84 KEITELLQNN 93 + LQ Sbjct: 63 AALAAALQRV 72 >gi|224584895|ref|YP_002638694.1| hypothetical protein SPC_3167 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] gi|224469423|gb|ACN47253.1| hypothetical protein SPC_3167 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] Length = 93 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 10 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 62 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 63 IVIEKGELGRHKQVKI 78 >gi|293390704|ref|ZP_06635038.1| hypothetical protein D7S_0840 [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951238|gb|EFE01357.1| hypothetical protein D7S_0840 [Aggregatibacter actinomycetemcomitans D7S-1] Length = 97 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K+ I L + +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKNQIVGLHDDE-------LKISITAPPVDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYID 80 S+ + + + K + I Sbjct: 65 SIVLEKGELNRHKQVLIP 82 >gi|221632637|ref|YP_002521858.1| hypothetical protein trd_0618 [Thermomicrobium roseum DSM 5159] gi|221157137|gb|ACM06264.1| DUF167 [Thermomicrobium roseum DSM 5159] Length = 102 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 28/86 (32%), Positives = 49/86 (56%), Gaps = 8/86 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P A ++ +A S + ++VTA P+ G+AN+A+L +LA+ L L + S+R Sbjct: 17 VQVQPRAPRAEVAG-------SRRDALLVRVTAPPRDGEANEAVLRLLAETLHLPRGSIR 69 Query: 66 MLSKQSSPLKIIYIDK-DCKEITELL 90 +++ + K I ID KE+ E L Sbjct: 70 IIAGTAQRRKRIRIDGLTSKELLERL 95 >gi|94496543|ref|ZP_01303119.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58] gi|94423903|gb|EAT08928.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58] Length = 107 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 2/78 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P + + GI + +D + +V A P+KG+AN A++A+LA+ L KS+ Sbjct: 13 LAIRLTPGSARQGIGGV--WRDDRAAPWLTARVRAVPEKGRANTALIALLAQALDWPKSA 70 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + S S+ LK + I Sbjct: 71 IMLESGDSNRLKRLRIIG 88 >gi|222054893|ref|YP_002537255.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] gi|221564182|gb|ACM20154.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] Length = 102 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 7/76 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P A ++ I ++ +++++TA P ANK + +LAK L ++KS L Sbjct: 23 VHVQPRASRNEICGVQ-------GDELRLRLTAPPVDDAANKLCIELLAKALGVAKSHLA 75 Query: 66 MLSKQSSPLKIIYIDK 81 + S S K I + Sbjct: 76 LTSGAKSRHKTIRAEG 91 >gi|162418314|ref|YP_001604771.1| hypothetical protein YpAngola_A0141 [Yersinia pestis Angola] gi|165924856|ref|ZP_02220688.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. F1991016] gi|165937260|ref|ZP_02225824.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. IP275] gi|166010278|ref|ZP_02231176.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. E1979001] gi|166212808|ref|ZP_02238843.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. B42003004] gi|167422007|ref|ZP_02313760.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167426697|ref|ZP_02318450.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Mediaevalis str. K1973002] gi|186896650|ref|YP_001873762.1| hypothetical protein YPTS_3350 [Yersinia pseudotuberculosis PB1/+] gi|294502893|ref|YP_003566955.1| hypothetical protein YPZ3_0783 [Yersinia pestis Z176003] gi|21960273|gb|AAM86880.1|AE013934_3 hypothetical protein y3330 [Yersinia pestis KIM 10] gi|45438104|gb|AAS63652.1| conserved hypothetical protein [Yersinia pestis biovar Microtus str. 91001] gi|162351129|gb|ABX85077.1| conserved hypothetical protein TIGR00251 [Yersinia pestis Angola] gi|165914734|gb|EDR33347.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. IP275] gi|165923056|gb|EDR40207.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. F1991016] gi|165990764|gb|EDR43065.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. E1979001] gi|166206100|gb|EDR50580.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. B42003004] gi|166960144|gb|EDR56165.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167054300|gb|EDR64119.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Mediaevalis str. K1973002] gi|186699676|gb|ACC90305.1| protein of unknown function DUF167 [Yersinia pseudotuberculosis PB1/+] gi|262360928|gb|ACY57649.1| hypothetical protein YPD4_0740 [Yersinia pestis D106004] gi|294353352|gb|ADE63693.1| hypothetical protein YPZ3_0783 [Yersinia pestis Z176003] Length = 100 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 17 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLVKFIAKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I + Sbjct: 70 VIIEKGELGRHKQIKV 85 >gi|67482694|ref|XP_656664.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56473879|gb|EAL51278.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 118 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 40/80 (50%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + PNAK S I +E +K+ + + P GKAN ++A +A + K Sbjct: 33 VIIEVEIKPNAKTSEIQGVED-------GLLKVSINSPPVDGKANTEVIAFMASTFGIKK 85 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S+++++ Q+S K + + Sbjct: 86 SNVKLIKGQTSHHKTLQFEN 105 >gi|193212041|ref|YP_001997994.1| hypothetical protein Cpar_0370 [Chlorobaculum parvum NCIB 8327] gi|226705834|sp|B3QL06|Y370_CHLP8 RecName: Full=UPF0235 protein Cpar_0370 gi|193085518|gb|ACF10794.1| protein of unknown function DUF167 [Chlorobaculum parvum NCIB 8327] Length = 105 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P + K+GIA +KI + + P ANK +LAK L + +S+ Sbjct: 13 LSVRVQPRSSKTGIAG-------RYGDQVKICLKSAPVDNAANKECCQLLAKTLGVPRSN 65 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + +++ Q+S K++ ++ E+ + L Sbjct: 66 VSVMNGQTSRSKVLKVEGMTPSELRKAL 93 >gi|90407095|ref|ZP_01215284.1| hypothetical protein PCNPT3_02605 [Psychromonas sp. CNPT3] gi|90311817|gb|EAS39913.1| hypothetical protein PCNPT3_02605 [Psychromonas sp. CNPT3] Length = 96 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 8/93 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + L +KI +TA P G+ANK ++ L+K+ + K Sbjct: 12 LRLVLQPKASRDAFIGL-------LGDELKITITAPPVDGQANKHLIKFLSKQFKVPKRD 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 + + + K+I I K K+I + ++ + Sbjct: 65 ITVEKGLLNRHKLIRI-KSPKKIPDFFTRSEHV 96 >gi|161486600|ref|NP_935670.2| hypothetical protein VV2877 [Vibrio vulnificus YJ016] gi|47117406|sp|Q7MHJ2|Y2877_VIBVY RecName: Full=UPF0235 protein VV2877 Length = 96 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + +L K ++KS Sbjct: 14 LRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAKSL 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + + Sbjct: 67 VTIEKGELGRHKQVRV 82 >gi|37199811|dbj|BAC95641.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 101 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + +L K ++KS Sbjct: 19 LRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAKSL 71 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + + Sbjct: 72 VTIEKGELGRHKQVRV 87 >gi|325108674|ref|YP_004269742.1| hypothetical protein Plabr_2117 [Planctomyces brasiliensis DSM 5305] gi|324968942|gb|ADY59720.1| UPF0235 protein yggU [Planctomyces brasiliensis DSM 5305] Length = 109 Score = 68.8 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P AK++ + + +K+ VT ++GKAN+ +L +LAK L L KS Sbjct: 22 LPLRVTPGAKRNAVGGVHDGA-------LKVAVTQIAERGKANQQVLKILAKALGLKKSQ 74 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 L ++S ++S K I D E+ + + N Sbjct: 75 LTLVSGETSRNKRIACRDVSAAELLQRISN 104 >gi|258545474|ref|ZP_05705708.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826] gi|258519174|gb|EEV88033.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826] Length = 97 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ A + + + KI +TA P+ GKANK + A LAK A++K + Sbjct: 13 LAVKITARASRDQCQGIHDER-------YKIAITAPPEDGKANKHLTAWLAKTFAVAKKN 65 Query: 64 LRMLSKQSSPLKIIYID 80 + + S SPLK + I Sbjct: 66 VALQSGAFSPLKTLRIT 82 >gi|298528115|ref|ZP_07015519.1| protein of unknown function DUF167 [Desulfonatronospira thiodismutans ASO3-1] gi|298511767|gb|EFI35669.1| protein of unknown function DUF167 [Desulfonatronospira thiodismutans ASO3-1] Length = 119 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 7/88 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V L P A + + + + +KI V A P GKANKA+ L++ L + K Sbjct: 36 LRVVLKPGADRDEVLGIHAGR-------LKISVKAPPVDGKANKALCIFLSRSLGIRKKQ 88 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S K + + + L+ Sbjct: 89 VWIQRGLQSRNKDLIVSGVAGTVFNALK 116 >gi|163783198|ref|ZP_02178192.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1] gi|159881532|gb|EDP75042.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1] Length = 73 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 49/78 (62%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ P++K+ G+ + + ++++V+A P++GKAN+ ++ +LAK + K Sbjct: 2 KLKVKVKPSSKREGVREVSPGE-------LEVRVSAPPERGKANERLIELLAKHYGVRKG 54 Query: 63 SLRMLSKQSSPLKIIYID 80 ++R+L ++S K++ ID Sbjct: 55 AVRILRGETSREKVVEID 72 >gi|296104617|ref|YP_003614763.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295059076|gb|ADF63814.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 95 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDNIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKI 81 >gi|162139546|ref|YP_218029.2| hypothetical protein SC3042 [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|168242905|ref|ZP_02667837.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194448304|ref|YP_002047090.1| hypothetical protein SeHA_C3341 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197249095|ref|YP_002148016.1| hypothetical protein SeAg_B3265 [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|226730824|sp|B5F5M7|YGGU_SALA4 RecName: Full=UPF0235 protein yggU gi|226730828|sp|B4THI7|YGGU_SALHS RecName: Full=UPF0235 protein yggU gi|194406608|gb|ACF66827.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197212798|gb|ACH50195.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|205338138|gb|EDZ24902.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|322716095|gb|EFZ07666.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 96 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|163737739|ref|ZP_02145156.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis BS107] gi|163744042|ref|ZP_02151409.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10] gi|161382658|gb|EDQ07060.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10] gi|161389265|gb|EDQ13617.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis BS107] Length = 98 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 40/76 (52%), Gaps = 1/76 (1%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ I + + +KI VTA P+ GKA A+ A+LA+ + ++ Sbjct: 20 AEIPVRVTPKASRNAILPIPQAESGQGVS-LKITVTAAPENGKATAAVQALLARAMRIAP 78 Query: 62 SSLRMLSKQSSPLKII 77 S L +L +S K+ Sbjct: 79 SDLELLRGATSRDKVF 94 >gi|170023077|ref|YP_001719582.1| hypothetical protein YPK_0828 [Yersinia pseudotuberculosis YPIII] gi|226708089|sp|B1JNN7|Y828_YERPY RecName: Full=UPF0235 protein YPK_0828 gi|169749611|gb|ACA67129.1| protein of unknown function DUF167 [Yersinia pseudotuberculosis YPIII] Length = 96 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I + Sbjct: 66 VIIEKGELGRHKQIKV 81 >gi|329893887|ref|ZP_08269938.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium IMCC3088] gi|328923406|gb|EGG30722.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium IMCC3088] Length = 83 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 4/80 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P AK++ I +D +++ VT P+ GKAN+A++ LA + ++ Sbjct: 6 VSVRVTPKAKQAQI----KVEDIEGQSVLRVYVTVAPEDGKANRAVMRALADYFDVPRTR 61 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 +++LS K ID D Sbjct: 62 IKLLSGAKQRDKRFSIDADV 81 >gi|148263238|ref|YP_001229944.1| hypothetical protein Gura_1167 [Geobacter uraniireducens Rf4] gi|146396738|gb|ABQ25371.1| protein of unknown function DUF167 [Geobacter uraniireducens Rf4] Length = 102 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 40/80 (50%), Gaps = 7/80 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P A ++ I ++ +K+++TA P + ANK + +LAK L ++KS + Sbjct: 23 VHVQPRASRNEICGVQ-------GDELKLRLTAPPVEDAANKLCVELLAKALKVAKSRVT 75 Query: 66 MLSKQSSPLKIIYIDKDCKE 85 + + S K + ++ E Sbjct: 76 ITAGAKSRHKTVKVEGITTE 95 >gi|170766160|ref|ZP_02900971.1| conserved hypothetical protein [Escherichia albertii TW07627] gi|170125306|gb|EDS94237.1| conserved hypothetical protein [Escherichia albertii TW07627] Length = 96 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIRI 81 >gi|307824763|ref|ZP_07654986.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] gi|307734121|gb|EFO04975.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] Length = 88 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 8/81 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 + + P A K A L +K++V A P GKAN+ ++A +A + +SKS+ + Sbjct: 3 LHVQPKASKDEWAGLH-------GERLKLRVKAAPVDGKANQHLIAFIADEFGVSKSACK 55 Query: 66 MLSKQSSPLKIIYIDKDCKEI 86 +++ +S K I I K++ Sbjct: 56 LITGESGREKRIAI-NSPKKL 75 >gi|331648708|ref|ZP_08349796.1| conserved hypothetical protein [Escherichia coli M605] gi|331042455|gb|EGI14597.1| conserved hypothetical protein [Escherichia coli M605] Length = 100 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VVIEKGELGRHKQIKI 85 >gi|323498669|ref|ZP_08103660.1| hypothetical protein VISI1226_18936 [Vibrio sinaloensis DSM 21326] gi|323316269|gb|EGA69289.1| hypothetical protein VISI1226_18936 [Vibrio sinaloensis DSM 21326] Length = 96 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K Sbjct: 14 IRLYIQPKASRDKIVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELL 90 + + + K ++I + EI +L Sbjct: 67 IDIEKGELGRHKQLWICSPAQIPTEIEAIL 96 >gi|156932605|ref|YP_001436521.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC BAA-894] gi|260599282|ref|YP_003211853.1| hypothetical protein CTU_34900 [Cronobacter turicensis z3032] gi|166229055|sp|A7MP89|Y387_ENTS8 RecName: Full=UPF0235 protein ESA_00387 gi|156530859|gb|ABU75685.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC BAA-894] gi|260218459|emb|CBA33595.1| UPF0235 protein ESA_00387 [Cronobacter turicensis z3032] Length = 96 Score = 68.5 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIIGLH-------GDELKVAITAPPVDGQANAHLVKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VVIEKGELGRHKQVKI 81 >gi|237729884|ref|ZP_04560365.1| conserved hypothetical protein [Citrobacter sp. 30_2] gi|283835286|ref|ZP_06355027.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220] gi|226908490|gb|EEH94408.1| conserved hypothetical protein [Citrobacter sp. 30_2] gi|291068444|gb|EFE06553.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220] Length = 96 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VVIEKGELGRHKQVKI 81 >gi|119493536|ref|ZP_01624202.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106] gi|119452653|gb|EAW33834.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106] Length = 81 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 45/83 (54%), Gaps = 7/83 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + V++ PN+K+ + E + + IH+K + P GKAN+ ++ +LAK+ + Sbjct: 1 MVVLHVKVKPNSKQQSMVKNE---EGTYIIHLK----SPPIDGKANQELIKILAKQFNIP 53 Query: 61 KSSLRMLSKQSSPLKIIYIDKDC 83 KS + + S SS K+I + C Sbjct: 54 KSQVSIKSGLSSKNKLIELPDSC 76 >gi|242279866|ref|YP_002991995.1| hypothetical protein Desal_2400 [Desulfovibrio salexigens DSM 2638] gi|242122760|gb|ACS80456.1| protein of unknown function DUF167 [Desulfovibrio salexigens DSM 2638] Length = 106 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V V + P AK GI +++++ A KANKA+ A +A +L L K Sbjct: 21 RVSVWVQPGAKNEGITGEYQDS-------VRVRINAPAVDNKANKALAAFVATRLGLKKR 73 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 ++ + S S+ K++ ++ D + Sbjct: 74 NISIASGHSNRKKVLLVESDVE 95 >gi|108806318|ref|YP_650234.1| hypothetical protein YPA_0321 [Yersinia pestis Antiqua] gi|108813301|ref|YP_649068.1| hypothetical protein YPN_3141 [Yersinia pestis Nepal516] gi|145597878|ref|YP_001161954.1| hypothetical protein YPDSF_0571 [Yersinia pestis Pestoides F] gi|149367047|ref|ZP_01889080.1| hypothetical protein YPE_2325 [Yersinia pestis CA88-4125] gi|161484758|ref|NP_670629.2| hypothetical protein y3330 [Yersinia pestis KIM 10] gi|161511310|ref|NP_994775.2| hypothetical protein YP_3498 [Yersinia pestis biovar Microtus str. 91001] gi|167399786|ref|ZP_02305304.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167468163|ref|ZP_02332867.1| hypothetical protein YpesF_09749 [Yersinia pestis FV-1] gi|218928116|ref|YP_002345991.1| hypothetical protein YPO0944 [Yersinia pestis CO92] gi|229837637|ref|ZP_04457799.1| conserved protein [Yersinia pestis Pestoides A] gi|229840863|ref|ZP_04461022.1| conserved protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842576|ref|ZP_04462731.1| conserved protein [Yersinia pestis biovar Orientalis str. India 195] gi|229903764|ref|ZP_04518877.1| conserved protein [Yersinia pestis Nepal516] gi|270487542|ref|ZP_06204616.1| conserved hypothetical protein TIGR00251 [Yersinia pestis KIM D27] gi|29839591|sp|Q8ZHF5|Y944_YERPE RecName: Full=UPF0235 protein YPO0944/y3330/YP_3498 gi|122383790|sp|Q1CB83|Y321_YERPA RecName: Full=UPF0235 protein YPA_0321 gi|122384257|sp|Q1CEW2|Y3141_YERPN RecName: Full=UPF0235 protein YPN_3141 gi|166229066|sp|A4TI69|Y571_YERPP RecName: Full=UPF0235 protein YPDSF_0571 gi|108776949|gb|ABG19468.1| hypothetical protein YPN_3141 [Yersinia pestis Nepal516] gi|108778231|gb|ABG12289.1| hypothetical protein YPA_0321 [Yersinia pestis Antiqua] gi|115346727|emb|CAL19610.1| conserved hypothetical protein [Yersinia pestis CO92] gi|145209574|gb|ABP38981.1| hypothetical protein YPDSF_0571 [Yersinia pestis Pestoides F] gi|149290661|gb|EDM40737.1| hypothetical protein YPE_2325 [Yersinia pestis CA88-4125] gi|167050494|gb|EDR61902.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. UG05-0454] gi|229679534|gb|EEO75637.1| conserved protein [Yersinia pestis Nepal516] gi|229690886|gb|EEO82940.1| conserved protein [Yersinia pestis biovar Orientalis str. India 195] gi|229697229|gb|EEO87276.1| conserved protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229704325|gb|EEO91336.1| conserved protein [Yersinia pestis Pestoides A] gi|270336046|gb|EFA46823.1| conserved hypothetical protein TIGR00251 [Yersinia pestis KIM D27] gi|320013974|gb|ADV97545.1| conserved protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 96 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLVKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I + Sbjct: 66 VIIEKGELGRHKQIKV 81 >gi|320155276|ref|YP_004187655.1| osmotic shock response integral membrane protein YggT [Vibrio vulnificus MO6-24/O] gi|319930588|gb|ADV85452.1| integral membrane protein YggT, involved in response to extracytoplasmic stress (osmotic shock) [Vibrio vulnificus MO6-24/O] Length = 96 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + +L K ++KS Sbjct: 14 LRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAKSL 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + + Sbjct: 67 VTIEKGELGRHKQVRV 82 >gi|56415040|ref|YP_152115.1| hypothetical protein SPA2965 [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|168819868|ref|ZP_02831868.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|194446740|ref|YP_002042361.1| hypothetical protein SNSL254_A3349 [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|197363969|ref|YP_002143606.1| hypothetical protein SSPA2764 [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|204928324|ref|ZP_03219524.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433] gi|238909900|ref|ZP_04653737.1| hypothetical protein SentesTe_02030 [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|81361856|sp|Q5PML0|YGGU_SALPA RecName: Full=UPF0235 protein yggU gi|226730829|sp|B4T5K9|YGGU_SALNS RecName: Full=UPF0235 protein yggU gi|226730830|sp|B5BFQ9|YGGU_SALPK RecName: Full=UPF0235 protein yggU gi|56129297|gb|AAV78803.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|194405403|gb|ACF65625.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|197095446|emb|CAR61005.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|204322646|gb|EDZ07843.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433] gi|205343351|gb|EDZ30115.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320087534|emb|CBY97299.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] gi|322613499|gb|EFY10440.1| hypothetical protein SEEM315_07205 [Salmonella enterica subsp. enterica serovar Montevideo str. 315996572] gi|322621091|gb|EFY17949.1| hypothetical protein SEEM971_19929 [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-1] gi|322624155|gb|EFY20989.1| hypothetical protein SEEM973_20050 [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-3] gi|322628106|gb|EFY24895.1| hypothetical protein SEEM974_21245 [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-4] gi|322633225|gb|EFY29967.1| hypothetical protein SEEM201_12335 [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-1] gi|322636197|gb|EFY32905.1| hypothetical protein SEEM202_13733 [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-2] gi|322639535|gb|EFY36223.1| hypothetical protein SEEM954_11862 [Salmonella enterica subsp. enterica serovar Montevideo str. 531954] gi|322647532|gb|EFY44021.1| hypothetical protein SEEM054_10407 [Salmonella enterica subsp. enterica serovar Montevideo str. NC_MB110209-0054] gi|322648716|gb|EFY45163.1| hypothetical protein SEEM675_04396 [Salmonella enterica subsp. enterica serovar Montevideo str. OH_2009072675] gi|322653771|gb|EFY50097.1| hypothetical protein SEEM965_22046 [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] gi|322657877|gb|EFY54145.1| hypothetical protein SEEM19N_18161 [Salmonella enterica subsp. enterica serovar Montevideo str. 19N] gi|322663980|gb|EFY60179.1| hypothetical protein SEEM801_04461 [Salmonella enterica subsp. enterica serovar Montevideo str. 81038-01] gi|322669009|gb|EFY65160.1| hypothetical protein SEEM507_10116 [Salmonella enterica subsp. enterica serovar Montevideo str. MD_MDA09249507] gi|322672997|gb|EFY69104.1| hypothetical protein SEEM877_00665 [Salmonella enterica subsp. enterica serovar Montevideo str. 414877] gi|322678012|gb|EFY74075.1| hypothetical protein SEEM867_02147 [Salmonella enterica subsp. enterica serovar Montevideo str. 366867] gi|322681188|gb|EFY77221.1| hypothetical protein SEEM180_20639 [Salmonella enterica subsp. enterica serovar Montevideo str. 413180] gi|322687882|gb|EFY83849.1| hypothetical protein SEEM600_15746 [Salmonella enterica subsp. enterica serovar Montevideo str. 446600] gi|323194922|gb|EFZ80109.1| hypothetical protein SEEM581_15857 [Salmonella enterica subsp. enterica serovar Montevideo str. 609458-1] gi|323199626|gb|EFZ84716.1| hypothetical protein SEEM501_16285 [Salmonella enterica subsp. enterica serovar Montevideo str. 556150-1] gi|323202627|gb|EFZ87667.1| hypothetical protein SEEM460_18160 [Salmonella enterica subsp. enterica serovar Montevideo str. 609460] gi|323207886|gb|EFZ92832.1| hypothetical protein SEEM020_18480 [Salmonella enterica subsp. enterica serovar Montevideo str. 507440-20] gi|323212562|gb|EFZ97379.1| hypothetical protein SEEM6152_10068 [Salmonella enterica subsp. enterica serovar Montevideo str. 556152] gi|323214955|gb|EFZ99703.1| hypothetical protein SEEM0077_03334 [Salmonella enterica subsp. enterica serovar Montevideo str. MB101509-0077] gi|323222685|gb|EGA07050.1| hypothetical protein SEEM0047_09590 [Salmonella enterica subsp. enterica serovar Montevideo str. MB102109-0047] gi|323225428|gb|EGA09660.1| hypothetical protein SEEM0055_15033 [Salmonella enterica subsp. enterica serovar Montevideo str. MB110209-0055] gi|323230557|gb|EGA14675.1| hypothetical protein SEEM0052_19874 [Salmonella enterica subsp. enterica serovar Montevideo str. MB111609-0052] gi|323235092|gb|EGA19178.1| hypothetical protein SEEM3312_06118 [Salmonella enterica subsp. enterica serovar Montevideo str. 2009083312] gi|323239131|gb|EGA23181.1| hypothetical protein SEEM5258_05995 [Salmonella enterica subsp. enterica serovar Montevideo str. 2009085258] gi|323244511|gb|EGA28517.1| hypothetical protein SEEM1156_01212 [Salmonella enterica subsp. enterica serovar Montevideo str. 315731156] gi|323247126|gb|EGA31092.1| hypothetical protein SEEM9199_21270 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2009159199] gi|323253391|gb|EGA37220.1| hypothetical protein SEEM8282_09961 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008282] gi|323256302|gb|EGA40038.1| hypothetical protein SEEM8283_11916 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008283] gi|323262522|gb|EGA46078.1| hypothetical protein SEEM8284_02121 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008284] gi|323267382|gb|EGA50866.1| hypothetical protein SEEM8285_21407 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008285] gi|323269214|gb|EGA52669.1| hypothetical protein SEEM8287_10997 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008287] gi|323669738|emb|CBJ94862.1| conserved hypothetical protein [Salmonella bongori] gi|327412908|emb|CAX67922.1| conserved hypothetical protein [Salmonella bongori] Length = 96 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|161506346|ref|YP_001573458.1| hypothetical protein SARI_04543 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|189030103|sp|A9MQS2|YGGU_SALAR RecName: Full=UPF0235 protein yggU gi|160867693|gb|ABX24316.1| hypothetical protein SARI_04543 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 96 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDCIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VAIEKGELGRHKQVKI 81 >gi|225850940|ref|YP_002731174.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1] gi|259646383|sp|C0QR78|Y1406_PERMH RecName: Full=UPF0235 protein PERMA_1406 gi|225645391|gb|ACO03577.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1] Length = 73 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 43/76 (56%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V++ PNAKK I ++ + +I+VT P+KGKAN ++ +L+K L + KS Sbjct: 3 VKVKVKPNAKKEEIREIQKD-------YFEIRVTVPPEKGKANSRVIELLSKHLKIPKSR 55 Query: 64 LRMLSKQSSPLKIIYI 79 +++ + S KI I Sbjct: 56 IKLKKGEKSREKIFEI 71 >gi|221640262|ref|YP_002526524.1| hypothetical protein RSKD131_2163 [Rhodobacter sphaeroides KD131] gi|221161043|gb|ACM02023.1| Hypothetical Protein RSKD131_2163 [Rhodobacter sphaeroides KD131] Length = 85 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 39/76 (51%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A ++ + E +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 18 AVRVTPRAARAKVDLQE--------GVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69 Query: 65 RMLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 70 TLVRGATSRDKTFRLD 85 >gi|75910098|ref|YP_324394.1| hypothetical protein Ava_3894 [Anabaena variabilis ATCC 29413] gi|123608489|sp|Q3M687|Y3894_ANAVT RecName: Full=UPF0235 protein Ava_3894 gi|75703823|gb|ABA23499.1| Protein of unknown function DUF167 [Anabaena variabilis ATCC 29413] Length = 75 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 30/78 (38%), Positives = 46/78 (58%), Gaps = 7/78 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PN+K+ IA D S T+H+K + P GKAN+ ++ +LA+K A+ KS + Sbjct: 4 KVKVKPNSKQQKIA---EQDDGSLTVHLK----SPPVDGKANEELIKLLAEKFAVPKSHI 56 Query: 65 RMLSKQSSPLKIIYIDKD 82 + S SS K+I ID D Sbjct: 57 TIKSGLSSRQKLIEIDTD 74 >gi|153874132|ref|ZP_02002465.1| conserved hypothetical protein [Beggiatoa sp. PS] gi|152069404|gb|EDN67535.1| conserved hypothetical protein [Beggiatoa sp. PS] Length = 91 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 18/83 (21%), Positives = 41/83 (49%), Gaps = 8/83 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P + ++ + + +K+K+ A P GKAN + + +K+ ++KS Sbjct: 13 LSVHVQPRSSQTSVVGVH-------GDRLKVKIMAAPVDGKANAEVCKLFSKQFGVAKSK 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + + + +S K I I K +++ Sbjct: 66 IIIENGHTSRDKRICI-KSPQKL 87 >gi|187732088|ref|YP_001881726.1| hypothetical protein SbBS512_E3385 [Shigella boydii CDC 3083-94] gi|188496122|ref|ZP_03003392.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638] gi|331643646|ref|ZP_08344777.1| conserved hypothetical protein [Escherichia coli H736] gi|882482|gb|AAA69120.1| ORF_o100 [Escherichia coli str. K-12 substr. MG1655] gi|81246835|gb|ABB67543.1| conserved hypothetical protein [Shigella boydii Sb227] gi|187429080|gb|ACD08354.1| conserved hypothetical protein [Shigella boydii CDC 3083-94] gi|188491321|gb|EDU66424.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638] gi|331037117|gb|EGI09341.1| conserved hypothetical protein [Escherichia coli H736] Length = 100 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VVIEKGELGRHKQIKI 85 >gi|51597526|ref|YP_071717.1| hypothetical protein YPTB3216 [Yersinia pseudotuberculosis IP 32953] gi|153948675|ref|YP_001399811.1| hypothetical protein YpsIP31758_0827 [Yersinia pseudotuberculosis IP 31758] gi|81638596|sp|Q666N2|Y3216_YERPS RecName: Full=UPF0235 protein YPTB3216 gi|167016819|sp|A7FEY3|Y827_YERP3 RecName: Full=UPF0235 protein YpsIP31758_0827 gi|51590808|emb|CAH22454.1| Conserved hypothetical protein [Yersinia pseudotuberculosis IP 32953] gi|152960170|gb|ABS47631.1| conserved hypothetical protein TIGR00251 [Yersinia pseudotuberculosis IP 31758] Length = 96 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I + Sbjct: 66 VIIEKGELGRHKQIKV 81 >gi|152991960|ref|YP_001357681.1| hypothetical protein SUN_0364 [Sulfurovum sp. NBC37-1] gi|151423821|dbj|BAF71324.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1] Length = 95 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 42/91 (46%), Gaps = 11/91 (12%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ ++ P A ++ D +KI++ A +G ANK ++ LAK + K Sbjct: 10 VSMRIKAQPAASRNEF------CDIYGEDAIKIRIKAPAVEGAANKELMKFLAKSFKVPK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S + S Q+S +KI+ +TE QN Sbjct: 64 SDIIFKSGQNSKIKIVEFP-----LTEKFQN 89 >gi|157148503|ref|YP_001455822.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895] gi|166227247|sp|A8APH3|Y4329_CITK8 RecName: Full=UPF0235 protein CKO_04329 gi|157085708|gb|ABV15386.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895] Length = 96 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VVIEKGELGRHKQVKI 81 >gi|238787374|ref|ZP_04631173.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC 33641] gi|238724636|gb|EEQ16277.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC 33641] Length = 90 Score = 68.1 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + E+ LL+ Sbjct: 60 VIIEKGELGRHKQLKIVNPQQIPPEVAALLK 90 >gi|238754614|ref|ZP_04615968.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473] gi|238707245|gb|EEP99608.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473] Length = 93 Score = 67.7 bits (164), Expect = 4e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 10 IRLYIQPKASRDQIIGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSH 62 Query: 64 LRMLSKQSSPLKIIYI 79 + + K I I Sbjct: 63 VIIEKGDLGRHKQIKI 78 >gi|88812461|ref|ZP_01127710.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231] gi|88790247|gb|EAR21365.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231] Length = 99 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A + L+I D +++++TA P +GKAN+ + L L +++S Sbjct: 13 LTVRVQPRAARDE---LKIDADGR----LRLRITAPPVEGKANEHLRHFLGHALGVARSQ 65 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + + +S K I + Sbjct: 66 VSVATGATSRNKRIVVQN 83 >gi|126463218|ref|YP_001044332.1| hypothetical protein Rsph17029_2458 [Rhodobacter sphaeroides ATCC 17029] gi|126104882|gb|ABN77560.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC 17029] Length = 85 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 39/76 (51%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A ++ + E +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 18 AVRVTPRAARAKVDLQE--------GVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69 Query: 65 RMLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 70 TLVRGATSRDKTFRLD 85 >gi|281179962|dbj|BAI56292.1| conserved hypothetical protein [Escherichia coli SE15] gi|330908988|gb|EGH37502.1| hypothetical protein ECAA86_03160 [Escherichia coli AA86] Length = 96 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|261210035|ref|ZP_05924333.1| hypothetical protein VCJ_000277 [Vibrio sp. RC341] gi|260840800|gb|EEX67342.1| hypothetical protein VCJ_000277 [Vibrio sp. RC341] Length = 96 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKQCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|91212335|ref|YP_542321.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89] gi|110806861|ref|YP_690381.1| hypothetical protein SFV_3007 [Shigella flexneri 5 str. 8401] gi|157155173|ref|YP_001464306.1| hypothetical protein EcE24377A_3297 [Escherichia coli E24377A] gi|157162414|ref|YP_001459732.1| hypothetical protein EcHS_A3113 [Escherichia coli HS] gi|237706394|ref|ZP_04536875.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|254038003|ref|ZP_04872061.1| conserved hypothetical protein [Escherichia sp. 1_1_43] gi|331654465|ref|ZP_08355465.1| conserved hypothetical protein [Escherichia coli M718] gi|331674433|ref|ZP_08375193.1| conserved hypothetical protein [Escherichia coli TA280] gi|331678947|ref|ZP_08379621.1| conserved hypothetical protein [Escherichia coli H591] gi|26109782|gb|AAN81987.1|AE016766_75 Hypothetical protein yggU [Escherichia coli CFT073] gi|24053355|gb|AAN44425.1| orf, conserved hypothetical protein [Shigella flexneri 2a str. 301] gi|30042526|gb|AAP18250.1| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T] gi|91073909|gb|ABE08790.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89] gi|110616409|gb|ABF05076.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401] gi|157068094|gb|ABV07349.1| conserved hypothetical protein TIGR00251 [Escherichia coli HS] gi|157077203|gb|ABV16911.1| conserved hypothetical protein TIGR00251 [Escherichia coli E24377A] gi|195183146|dbj|BAG66691.1| predicted protein [Escherichia coli O111:H-] gi|226839627|gb|EEH71648.1| conserved hypothetical protein [Escherichia sp. 1_1_43] gi|226899434|gb|EEH85693.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|294491613|gb|ADE90369.1| conserved hypothetical protein TIGR00251 [Escherichia coli IHE3034] gi|331047847|gb|EGI19924.1| conserved hypothetical protein [Escherichia coli M718] gi|331068527|gb|EGI39922.1| conserved hypothetical protein [Escherichia coli TA280] gi|331073777|gb|EGI45098.1| conserved hypothetical protein [Escherichia coli H591] gi|332102705|gb|EGJ06051.1| conserved hypothetical protein [Shigella sp. D9] Length = 100 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VVIEKGELGRHKQIKI 85 >gi|297617010|ref|YP_003702169.1| hypothetical protein Slip_0824 [Syntrophothermus lipocalidus DSM 12680] gi|297144847|gb|ADI01604.1| protein of unknown function DUF167 [Syntrophothermus lipocalidus DSM 12680] Length = 96 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 44/79 (55%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 VR++P A K+ + +KIK+TA P +G+AN+A+++ LAK +++ Sbjct: 11 RFEVRVLPRASKNEVIG-------EVEGAVKIKLTAAPLEGEANQALISFLAKISGVARK 63 Query: 63 SLRMLSKQSSPLKIIYIDK 81 ++ ++ ++S K++ I Sbjct: 64 NVTIIKGETSRHKLVEITG 82 >gi|12517499|gb|AAG58084.1|AE005525_10 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933] gi|13363301|dbj|BAB37252.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai] gi|209760028|gb|ACI78326.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760030|gb|ACI78327.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760032|gb|ACI78328.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760034|gb|ACI78329.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760036|gb|ACI78330.1| hypothetical protein ECs3829 [Escherichia coli] Length = 100 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VVIEKGELGRHKQIKI 85 >gi|317493545|ref|ZP_07951966.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316918488|gb|EFV39826.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 99 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + +AK+ ++KS Sbjct: 15 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLQKFIAKQFRVAKSQ 67 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K + I Sbjct: 68 VVIEKGELGRHKQVRIS 84 >gi|215488251|ref|YP_002330682.1| hypothetical protein E2348C_3206 [Escherichia coli O127:H6 str. E2348/69] gi|312964784|ref|ZP_07779024.1| conserved hypothetical protein [Escherichia coli 2362-75] gi|331684580|ref|ZP_08385172.1| conserved hypothetical protein [Escherichia coli H299] gi|254814154|sp|B7UI00|YGGU_ECO27 RecName: Full=UPF0235 protein yggU gi|215266323|emb|CAS10754.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] gi|312290340|gb|EFR18220.1| conserved hypothetical protein [Escherichia coli 2362-75] gi|331078195|gb|EGI49401.1| conserved hypothetical protein [Escherichia coli H299] Length = 96 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELL 90 + + + K I I + EI LL Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEIAALL 95 >gi|91787002|ref|YP_547954.1| hypothetical protein Bpro_1104 [Polaromonas sp. JS666] gi|91696227|gb|ABE43056.1| protein of unknown function DUF167 [Polaromonas sp. JS666] Length = 113 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 18/74 (24%), Positives = 39/74 (52%), Gaps = 3/74 (4%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V ++PNA K+ L + +++++ P GKAN+A++ LA L + +++ Sbjct: 21 VDVHVVPNAAKTQPVGLHG---EPGQLALRLRLQVPPVDGKANQALIRWLAHSLGVPQNA 77 Query: 64 LRMLSKQSSPLKII 77 + + ++S K + Sbjct: 78 ITPVRGETSRRKQL 91 >gi|28373966|pdb|1N91|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia Coli. Northeast Structural Genomics Consortium Target Er14. gi|60594359|pdb|1YH5|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia Coli. Northeast Structural Genomics Consortium Target Er14 Length = 108 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VVIEKGELGRHKQIKI 85 >gi|218782636|ref|YP_002433954.1| hypothetical protein Dalk_4809 [Desulfatibacillum alkenivorans AK-01] gi|218764020|gb|ACL06486.1| protein of unknown function DUF167 [Desulfatibacillum alkenivorans AK-01] Length = 106 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 26/88 (29%), Positives = 45/88 (51%), Gaps = 11/88 (12%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 ++++P + + + L+ +KIK+ A P G ANK + LAK L L KSS++ Sbjct: 17 IKVLPRSSVNAVVGLQDGA-------LKIKLKAPPVGGAANKMCIQFLAKTLKLPKSSIK 69 Query: 66 MLSKQSSPLKIIYI----DKDCKEITEL 89 +LS ++ K I + + KE+ L Sbjct: 70 ILSGETGRSKQIMVRPREEGSKKELARL 97 >gi|53728760|ref|ZP_00135410.2| COG1872: Uncharacterized conserved protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|126208844|ref|YP_001054069.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae L20] gi|303250781|ref|ZP_07336976.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|303252447|ref|ZP_07338612.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307246303|ref|ZP_07528382.1| hypothetical protein appser1_15050 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|307248416|ref|ZP_07530437.1| hypothetical protein appser2_13900 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|307250644|ref|ZP_07532582.1| hypothetical protein appser4_14180 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|307253024|ref|ZP_07534909.1| hypothetical protein appser6_15320 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307255287|ref|ZP_07537100.1| hypothetical protein appser9_15200 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307257450|ref|ZP_07539217.1| hypothetical protein appser10_14450 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|307259722|ref|ZP_07541443.1| hypothetical protein appser11_15170 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|166200384|sp|A3N228|Y1380_ACTP2 RecName: Full=UPF0235 protein APL_1380 gi|126097636|gb|ABN74464.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae serovar 5b str. L20] gi|302648720|gb|EFL78911.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302650386|gb|EFL80547.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306852773|gb|EFM84999.1| hypothetical protein appser1_15050 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306855058|gb|EFM87240.1| hypothetical protein appser2_13900 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306857316|gb|EFM89434.1| hypothetical protein appser4_14180 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306859482|gb|EFM91510.1| hypothetical protein appser6_15320 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306861736|gb|EFM93717.1| hypothetical protein appser9_15200 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306864030|gb|EFM95946.1| hypothetical protein appser10_14450 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306866190|gb|EFM98057.1| hypothetical protein appser11_15170 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 97 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +TA P G AN +L L+K + KS Sbjct: 14 RLRIFLQPKASRDQIVGLHDNE-------LKIAITALPVDGAANAHLLKYLSKLFKVPKS 66 Query: 63 SLRMLSKQSSPLKIIYIDK---DCKEITELL 90 S+ + + K +++ + KEI LL Sbjct: 67 SIVLEKGELQRHKQLFVPEPKLIPKEIEALL 97 >gi|307261870|ref|ZP_07543532.1| hypothetical protein appser12_14270 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306868417|gb|EFN00232.1| hypothetical protein appser12_14270 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 97 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +TA P G AN +L L+K + KS Sbjct: 14 RLRIFLQPKASRDQIVGLHDNE-------LKIAITALPVDGAANAHLLKYLSKLFKVPKS 66 Query: 63 SLRMLSKQSSPLKIIYIDK---DCKEITELL 90 S+ + + K +++ + KEI LL Sbjct: 67 SIVLEKGELQRHKQLFVPEPKLIPKEIEALL 97 >gi|289164111|ref|YP_003454249.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150] gi|288857284|emb|CBJ11111.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150] Length = 91 Score = 67.7 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AKKS I + +KI++ A P +G+ANK +L +A+ + S Sbjct: 12 LYLYVQPGAKKSEIVGMH-------EGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQ 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + S K++ + + Sbjct: 65 VVLKRGDKSRHKVLLVKNSLVD 86 >gi|260773605|ref|ZP_05882521.1| hypothetical protein VIB_002079 [Vibrio metschnikovii CIP 69.14] gi|260612744|gb|EEX37947.1| hypothetical protein VIB_002079 [Vibrio metschnikovii CIP 69.14] Length = 97 Score = 67.7 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + +KI +TA P GKAN + LAK +SKS+ Sbjct: 13 LRIYVQPKASRDSLVG-------QHGDELKIAITAPPVDGKANAHLSRYLAKLCKVSKSA 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + Q K + I Sbjct: 66 VEIEKGQLGRHKQVRI 81 >gi|303245495|ref|ZP_07331779.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans JJ] gi|302493344|gb|EFL53206.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans JJ] Length = 113 Score = 67.7 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 41/82 (50%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V + P + +A L + +++++ A +G+AN A+ A LA+ + Sbjct: 27 TLRVAVAPGGSRDALAGLAEDR-------LRVRLRAKAVEGQANAALTAFLAECFGVRPR 79 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 +R++S + S KI+ I+ + + Sbjct: 80 QVRIVSGEKSRKKIVRINAESE 101 >gi|89109730|ref|AP_003510.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|90111518|ref|NP_417428.2| conserved protein, UPF0235 family [Escherichia coli str. K-12 substr. MG1655] gi|161984863|ref|YP_409371.2| hypothetical protein SBO_3037 [Shigella boydii Sb227] gi|170082505|ref|YP_001731825.1| hypothetical protein ECDH10B_3128 [Escherichia coli str. K-12 substr. DH10B] gi|238902075|ref|YP_002927871.1| hypothetical protein BWG_2675 [Escherichia coli BW2952] gi|253772209|ref|YP_003035040.1| hypothetical protein ECBD_0787 [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|300947684|ref|ZP_07161853.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 116-1] gi|300954200|ref|ZP_07166665.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1] gi|301643693|ref|ZP_07243732.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 146-1] gi|307139638|ref|ZP_07498994.1| hypothetical protein EcolH7_16123 [Escherichia coli H736] gi|6920084|sp|P52060|YGGU_ECOLI RecName: Full=UPF0235 protein yggU gi|226730819|sp|B1XFB3|YGGU_ECODH RecName: Full=UPF0235 protein yggU gi|259710253|sp|C5A0M2|YGGU_ECOBW RecName: Full=UPF0235 protein yggU gi|85675763|dbj|BAE77016.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|87082189|gb|AAC75990.2| conserved protein, UPF0235 family [Escherichia coli str. K-12 substr. MG1655] gi|169890340|gb|ACB04047.1| conserved protein [Escherichia coli str. K-12 substr. DH10B] gi|238860521|gb|ACR62519.1| conserved protein [Escherichia coli BW2952] gi|253323253|gb|ACT27855.1| protein of unknown function DUF167 [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|260448004|gb|ACX38426.1| protein of unknown function DUF167 [Escherichia coli DH1] gi|284922896|emb|CBG35985.1| conserved hypothetical protein [Escherichia coli 042] gi|300318784|gb|EFJ68568.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1] gi|300452730|gb|EFK16350.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 116-1] gi|301077895|gb|EFK92701.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 146-1] gi|309703308|emb|CBJ02644.1| conserved hypothetical protein [Escherichia coli ETEC H10407] gi|315137550|dbj|BAJ44709.1| hypothetical protein ECDH1ME8569_2853 [Escherichia coli DH1] gi|315614885|gb|EFU95523.1| conserved hypothetical protein [Escherichia coli 3431] gi|320174051|gb|EFW49221.1| hypothetical protein SDB_03448 [Shigella dysenteriae CDC 74-1112] gi|320184300|gb|EFW59112.1| hypothetical protein SGF_03464 [Shigella flexneri CDC 796-83] gi|323173833|gb|EFZ59462.1| hypothetical protein ECLT68_2152 [Escherichia coli LT-68] gi|323936044|gb|EGB32339.1| yggU [Escherichia coli E1520] gi|332091357|gb|EGI96445.1| hypothetical protein SB359474_3531 [Shigella boydii 3594-74] Length = 96 Score = 67.7 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|313673537|ref|YP_004051648.1| hypothetical protein Calni_1577 [Calditerrivibrio nitroreducens DSM 19672] gi|312940293|gb|ADR19485.1| protein of unknown function DUF167 [Calditerrivibrio nitroreducens DSM 19672] Length = 86 Score = 67.7 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 9/92 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P AKK+ D +KIK+ + P GKAN+ +++ +++ L LSK Sbjct: 2 RIKIYVQPGAKKTA-------YDGEFNGCIKIKIKSPPTDGKANEELISFISQSLNLSKK 54 Query: 63 SLRMLSKQSSPLKIIYIDK--DCKEITELLQN 92 + ++S + S KII + + D + I E L++ Sbjct: 55 EVGIISGEKSRYKIIEVPENYDMEFIKEKLKD 86 >gi|270308462|ref|YP_003330520.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS] gi|270154354|gb|ACZ62192.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS] Length = 97 Score = 67.7 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 41/77 (53%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +++ P+++++ ++ E +K+++ A P+KGKANK ++ L++ L K+ Sbjct: 9 RVNLKIFPSSQRNELSGYEN-------GLLKLRIAAQPEKGKANKELIDYLSELLDTPKA 61 Query: 63 SLRMLSKQSSPLKIIYI 79 + + + K++ Sbjct: 62 EIEICRGHTGRNKVLVF 78 >gi|224826207|ref|ZP_03699310.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002] gi|224601844|gb|EEG08024.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002] Length = 108 Score = 67.3 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 7/91 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + P AKK+ +A +K+++ A P +GKAN +LA LA++ + Sbjct: 22 VIRLTLHVQPGAKKTDLAG-------EHGGALKLRLAAPPVEGKANAMLLAWLAERFEVP 74 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 K + +LS S KI+ I E L + Sbjct: 75 KRDVVLLSGDKSRHKIVEIKLGLDEAAVLAR 105 >gi|313223261|emb|CBY43439.1| unnamed protein product [Oikopleura dioica] Length = 180 Score = 67.3 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 28/86 (32%), Positives = 48/86 (55%), Gaps = 2/86 (2%) Query: 8 LIPNAKKSGIASLEIPKDT-SDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRM 66 + P +K I+ + DT S + ++V+A P KGKANKA+L LA KL + S L + Sbjct: 85 VTPKSKTPSISVDDSINDTLSVVHVVNVRVSAPPDKGKANKAVLKSLADKLGVKPSKLSI 144 Query: 67 LSKQSSPLKIIYIDKDCKEITELLQN 92 S +S K++ ++ D +++ +L Sbjct: 145 QSGTTSRSKVVLLETD-ADLSTILAQ 169 >gi|293412313|ref|ZP_06655036.1| hypothetical protein ECEG_02320 [Escherichia coli B354] gi|291469084|gb|EFF11575.1| hypothetical protein ECEG_02320 [Escherichia coli B354] Length = 96 Score = 67.3 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|310814754|ref|YP_003962718.1| hypothetical protein EIO_0232 [Ketogulonicigenium vulgare Y25] gi|308753489|gb|ADO41418.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 82 Score = 67.3 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A ++ I +D S +++ VT P+ GKAN A+ +LAK L + KS Sbjct: 13 IAVRVTPKASRARI-----LRDESGV--LRVYVTVVPEDGKANAAVTELLAKALRIPKSK 65 Query: 64 LRMLSKQSSPLKIIYI 79 L + S ++ K+ + Sbjct: 66 LILKSGATARDKVFRL 81 >gi|260767468|ref|ZP_05876405.1| hypothetical protein VFA_000519 [Vibrio furnissii CIP 102972] gi|260617580|gb|EEX42762.1| hypothetical protein VFA_000519 [Vibrio furnissii CIP 102972] gi|315181248|gb|ADT88162.1| hypothetical protein vfu_A03054 [Vibrio furnissii NCTC 11218] Length = 96 Score = 67.3 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQCKVAKGL 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 IDIEKGELGRHKQIRI 81 >gi|110643102|ref|YP_670832.1| hypothetical protein ECP_2947 [Escherichia coli 536] gi|117625180|ref|YP_854168.1| hypothetical protein APECO1_3568 [Escherichia coli APEC O1] gi|161485880|ref|NP_708718.2| hypothetical protein SF2944 [Shigella flexneri 2a str. 301] gi|161486124|ref|NP_755414.2| hypothetical protein c3539 [Escherichia coli CFT073] gi|161486442|ref|NP_838440.2| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T] gi|170018806|ref|YP_001723760.1| hypothetical protein EcolC_0761 [Escherichia coli ATCC 8739] gi|170681809|ref|YP_001745114.1| hypothetical protein EcSMS35_3095 [Escherichia coli SMS-3-5] gi|191167930|ref|ZP_03029733.1| conserved hypothetical protein [Escherichia coli B7A] gi|191171828|ref|ZP_03033374.1| conserved hypothetical protein [Escherichia coli F11] gi|193063515|ref|ZP_03044604.1| conserved hypothetical protein [Escherichia coli E22] gi|193067470|ref|ZP_03048438.1| conserved hypothetical protein [Escherichia coli E110019] gi|194426264|ref|ZP_03058819.1| conserved hypothetical protein [Escherichia coli B171] gi|194431742|ref|ZP_03064033.1| conserved hypothetical protein [Shigella dysenteriae 1012] gi|194436768|ref|ZP_03068868.1| conserved hypothetical protein [Escherichia coli 101-1] gi|209920412|ref|YP_002294496.1| hypothetical protein ECSE_3221 [Escherichia coli SE11] gi|218550200|ref|YP_002383991.1| hypothetical protein EFER_2892 [Escherichia fergusonii ATCC 35469] gi|218555512|ref|YP_002388425.1| hypothetical protein ECIAI1_3086 [Escherichia coli IAI1] gi|218559944|ref|YP_002392857.1| hypothetical protein ECS88_3235 [Escherichia coli S88] gi|218691077|ref|YP_002399289.1| hypothetical protein ECED1_3416 [Escherichia coli ED1a] gi|218696551|ref|YP_002404218.1| hypothetical protein EC55989_3246 [Escherichia coli 55989] gi|218701663|ref|YP_002409292.1| hypothetical protein ECIAI39_3371 [Escherichia coli IAI39] gi|218706468|ref|YP_002413987.1| hypothetical protein ECUMN_3305 [Escherichia coli UMN026] gi|227888508|ref|ZP_04006313.1| protein of hypothetical function DUF167 [Escherichia coli 83972] gi|254162863|ref|YP_003045971.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606] gi|256019245|ref|ZP_05433110.1| hypothetical protein ShiD9_10034 [Shigella sp. D9] gi|260845623|ref|YP_003223401.1| hypothetical protein ECO103_3533 [Escherichia coli O103:H2 str. 12009] gi|260857086|ref|YP_003230977.1| hypothetical protein ECO26_4052 [Escherichia coli O26:H11 str. 11368] gi|260869640|ref|YP_003236042.1| hypothetical protein ECO111_3701 [Escherichia coli O111:H- str. 11128] gi|293406460|ref|ZP_06650386.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412] gi|293416214|ref|ZP_06658854.1| hypothetical protein ECDG_03817 [Escherichia coli B185] gi|293449283|ref|ZP_06663704.1| hypothetical protein ECCG_02314 [Escherichia coli B088] gi|297520255|ref|ZP_06938641.1| hypothetical protein EcolOP_21662 [Escherichia coli OP50] gi|298382197|ref|ZP_06991794.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302] gi|300815577|ref|ZP_07095801.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 107-1] gi|300824812|ref|ZP_07104916.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 119-7] gi|300900230|ref|ZP_07118414.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 198-1] gi|300906483|ref|ZP_07124178.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1] gi|300921295|ref|ZP_07137664.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 115-1] gi|300928104|ref|ZP_07143649.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 187-1] gi|300940765|ref|ZP_07155311.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 21-1] gi|300980105|ref|ZP_07174848.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 45-1] gi|300995466|ref|ZP_07181114.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1] gi|301027296|ref|ZP_07190642.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 69-1] gi|301027722|ref|ZP_07191032.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 196-1] gi|301049252|ref|ZP_07196225.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 185-1] gi|301306562|ref|ZP_07212624.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1] gi|301328107|ref|ZP_07221248.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 78-1] gi|306812143|ref|ZP_07446341.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101] gi|309794042|ref|ZP_07688467.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 145-7] gi|312972805|ref|ZP_07786978.1| conserved hypothetical protein [Escherichia coli 1827-70] gi|331659088|ref|ZP_08360030.1| conserved hypothetical protein [Escherichia coli TA206] gi|331669699|ref|ZP_08370545.1| conserved hypothetical protein [Escherichia coli TA271] gi|29839713|sp|Q8FE28|YGGU_ECOL6 RecName: Full=UPF0235 protein yggU gi|47117526|sp|Q83JS1|YGGU_SHIFL RecName: Full=UPF0235 protein yggU gi|123343643|sp|Q0TDP8|YGGU_ECOL5 RecName: Full=UPF0235 protein yggU gi|166227348|sp|A1AFD9|YGGU_ECOK1 RecName: Full=UPF0235 protein yggU gi|189030102|sp|B1IT54|YGGU_ECOLC RecName: Full=UPF0235 protein yggU gi|226730815|sp|B7MME1|YGGU_ECO45 RecName: Full=UPF0235 protein yggU gi|226730817|sp|B7NI16|YGGU_ECO7I RecName: Full=UPF0235 protein yggU gi|226730818|sp|B7LYY3|YGGU_ECO8A RecName: Full=UPF0235 protein yggU gi|226730820|sp|B7N7K6|YGGU_ECOLU RecName: Full=UPF0235 protein yggU gi|226730821|sp|B6I789|YGGU_ECOSE RecName: Full=UPF0235 protein yggU gi|226730822|sp|B1LDG2|YGGU_ECOSM RecName: Full=UPF0235 protein yggU gi|226730823|sp|B7LPS0|YGGU_ESCF3 RecName: Full=UPF0235 protein yggU gi|254814155|sp|B7LFL6|YGGU_ECO55 RecName: Full=UPF0235 protein yggU gi|254814156|sp|B7MZQ2|YGGU_ECO81 RecName: Full=UPF0235 protein yggU gi|110344694|gb|ABG70931.1| hypothetical protein YggU [Escherichia coli 536] gi|115514304|gb|ABJ02379.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|169753734|gb|ACA76433.1| protein of unknown function DUF167 [Escherichia coli ATCC 8739] gi|170519527|gb|ACB17705.1| conserved hypothetical protein [Escherichia coli SMS-3-5] gi|190902015|gb|EDV61761.1| conserved hypothetical protein [Escherichia coli B7A] gi|190907863|gb|EDV67456.1| conserved hypothetical protein [Escherichia coli F11] gi|192930792|gb|EDV83397.1| conserved hypothetical protein [Escherichia coli E22] gi|192959427|gb|EDV89862.1| conserved hypothetical protein [Escherichia coli E110019] gi|194415572|gb|EDX31839.1| conserved hypothetical protein [Escherichia coli B171] gi|194420098|gb|EDX36176.1| conserved hypothetical protein [Shigella dysenteriae 1012] gi|194424250|gb|EDX40237.1| conserved hypothetical protein [Escherichia coli 101-1] gi|209913671|dbj|BAG78745.1| conserved hypothetical protein [Escherichia coli SE11] gi|218353283|emb|CAU99245.1| conserved hypothetical protein [Escherichia coli 55989] gi|218357741|emb|CAQ90385.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469] gi|218362280|emb|CAQ99901.1| conserved hypothetical protein [Escherichia coli IAI1] gi|218366713|emb|CAR04470.1| conserved hypothetical protein [Escherichia coli S88] gi|218371649|emb|CAR19488.1| conserved hypothetical protein [Escherichia coli IAI39] gi|218428641|emb|CAR09570.2| conserved hypothetical protein [Escherichia coli ED1a] gi|218433565|emb|CAR14468.1| conserved hypothetical protein [Escherichia coli UMN026] gi|222034648|emb|CAP77390.1| UPF0235 protein yggU [Escherichia coli LF82] gi|227834777|gb|EEJ45243.1| protein of hypothetical function DUF167 [Escherichia coli 83972] gi|242378479|emb|CAQ33263.1| conserved protein [Escherichia coli BL21(DE3)] gi|253974764|gb|ACT40435.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606] gi|253978930|gb|ACT44600.1| hypothetical protein ECD_02783 [Escherichia coli BL21(DE3)] gi|257755735|dbj|BAI27237.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257760770|dbj|BAI32267.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009] gi|257765996|dbj|BAI37491.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|291322373|gb|EFE61802.1| hypothetical protein ECCG_02314 [Escherichia coli B088] gi|291426466|gb|EFE99498.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412] gi|291432403|gb|EFF05385.1| hypothetical protein ECDG_03817 [Escherichia coli B185] gi|298277337|gb|EFI18853.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302] gi|299879158|gb|EFI87369.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 196-1] gi|300298948|gb|EFJ55333.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 185-1] gi|300304828|gb|EFJ59348.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1] gi|300356246|gb|EFJ72116.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 198-1] gi|300395102|gb|EFJ78640.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 69-1] gi|300401726|gb|EFJ85264.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1] gi|300409362|gb|EFJ92900.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 45-1] gi|300411757|gb|EFJ95067.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 115-1] gi|300454465|gb|EFK17958.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 21-1] gi|300463870|gb|EFK27363.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 187-1] gi|300522719|gb|EFK43788.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 119-7] gi|300531506|gb|EFK52568.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 107-1] gi|300838180|gb|EFK65940.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1] gi|300845411|gb|EFK73171.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 78-1] gi|305854181|gb|EFM54619.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101] gi|307554935|gb|ADN47710.1| conserved hypothetical protein [Escherichia coli ABU 83972] gi|307625473|gb|ADN69777.1| hypothetical protein UM146_01750 [Escherichia coli UM146] gi|308122449|gb|EFO59711.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 145-7] gi|310332747|gb|EFP99960.1| conserved hypothetical protein [Escherichia coli 1827-70] gi|312947484|gb|ADR28311.1| hypothetical protein NRG857_14505 [Escherichia coli O83:H1 str. NRG 857C] gi|313648003|gb|EFS12449.1| hypothetical protein SF2457T_3601 [Shigella flexneri 2a str. 2457T] gi|315256845|gb|EFU36813.1| putative cytoplasmic protein [Escherichia coli MS 85-1] gi|315289499|gb|EFU48894.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 110-3] gi|315293933|gb|EFU53285.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 153-1] gi|315295642|gb|EFU54965.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 16-3] gi|320181042|gb|EFW55963.1| hypothetical protein SGB_01866 [Shigella boydii ATCC 9905] gi|320195072|gb|EFW69701.1| hypothetical protein EcoM_02835 [Escherichia coli WV_060327] gi|320202617|gb|EFW77187.1| hypothetical protein ECoL_00280 [Escherichia coli EC4100B] gi|323154662|gb|EFZ40861.1| hypothetical protein ECEPECA14_3462 [Escherichia coli EPECa14] gi|323162601|gb|EFZ48448.1| hypothetical protein ECE128010_1205 [Escherichia coli E128010] gi|323180413|gb|EFZ65965.1| hypothetical protein ECOK1180_1095 [Escherichia coli 1180] gi|323183524|gb|EFZ68921.1| hypothetical protein ECOK1357_3303 [Escherichia coli 1357] gi|323188653|gb|EFZ73938.1| hypothetical protein ECRN5871_3073 [Escherichia coli RN587/1] gi|323941961|gb|EGB38140.1| hypothetical protein ERDG_01736 [Escherichia coli E482] gi|323946549|gb|EGB42572.1| yggU [Escherichia coli H120] gi|323951609|gb|EGB47484.1| hypothetical protein ERKG_02252 [Escherichia coli H252] gi|323957323|gb|EGB53045.1| hypothetical protein ERLG_01390 [Escherichia coli H263] gi|323960754|gb|EGB56375.1| hypothetical protein ERGG_02689 [Escherichia coli H489] gi|323966470|gb|EGB61903.1| hypothetical protein ERJG_02059 [Escherichia coli M863] gi|323971760|gb|EGB66987.1| hypothetical protein ERHG_02238 [Escherichia coli TA007] gi|323978752|gb|EGB73833.1| hypothetical protein ERFG_00343 [Escherichia coli TW10509] gi|324005509|gb|EGB74728.1| hypothetical protein HMPREF9532_04861 [Escherichia coli MS 57-2] gi|324011796|gb|EGB81015.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 60-1] gi|324017226|gb|EGB86445.1| hypothetical protein HMPREF9542_04137 [Escherichia coli MS 117-3] gi|324115032|gb|EGC08997.1| hypothetical protein ERIG_00360 [Escherichia fergusonii B253] gi|324119748|gb|EGC13628.1| hypothetical protein ERBG_00336 [Escherichia coli E1167] gi|325498510|gb|EGC96369.1| hypothetical protein ECD227_2607 [Escherichia fergusonii ECD227] gi|327251721|gb|EGE63407.1| hypothetical protein ECSTEC7V_3572 [Escherichia coli STEC_7v] gi|331053670|gb|EGI25699.1| conserved hypothetical protein [Escherichia coli TA206] gi|331063367|gb|EGI35280.1| conserved hypothetical protein [Escherichia coli TA271] gi|332086882|gb|EGI92018.1| hypothetical protein SB521682_3508 [Shigella boydii 5216-82] gi|332087620|gb|EGI92747.1| hypothetical protein SD15574_3393 [Shigella dysenteriae 155-74] Length = 96 Score = 67.3 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|294010668|ref|YP_003544128.1| hypothetical protein SJA_C1-06820 [Sphingobium japonicum UT26S] gi|292673998|dbj|BAI95516.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 103 Score = 67.3 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 44/84 (52%), Gaps = 2/84 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + + + D + +V A P++GKAN A++A+LAK+L + + Sbjct: 13 LSVRLTPGAAREEVGGV--WTDDKGANWLSARVRAVPERGKANAALIALLAKRLDWPRGA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + + S ++ LK + I + + Sbjct: 71 ISLESGDANRLKRLRIKGGGEALA 94 >gi|161367529|ref|NP_289525.2| hypothetical protein Z4298 [Escherichia coli O157:H7 str. EDL933] gi|162139760|ref|NP_311856.2| hypothetical protein ECs3829 [Escherichia coli O157:H7 str. Sakai] gi|168747555|ref|ZP_02772577.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4113] gi|168753905|ref|ZP_02778912.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4401] gi|168760095|ref|ZP_02785102.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4501] gi|168766960|ref|ZP_02791967.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4486] gi|168773408|ref|ZP_02798415.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4196] gi|168781812|ref|ZP_02806819.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4076] gi|168785811|ref|ZP_02810818.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC869] gi|168797528|ref|ZP_02822535.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC508] gi|195937091|ref|ZP_03082473.1| hypothetical protein EscherichcoliO157_11661 [Escherichia coli O157:H7 str. EC4024] gi|208807147|ref|ZP_03249484.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4206] gi|208813482|ref|ZP_03254811.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4045] gi|208818265|ref|ZP_03258585.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4042] gi|209399373|ref|YP_002272433.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4115] gi|217327282|ref|ZP_03443365.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. TW14588] gi|254794905|ref|YP_003079742.1| hypothetical protein ECSP_3924 [Escherichia coli O157:H7 str. TW14359] gi|261226265|ref|ZP_05940546.1| hypothetical protein EscherichiacoliO157_16963 [Escherichia coli O157:H7 str. FRIK2000] gi|261256477|ref|ZP_05949010.1| hypothetical protein EscherichiacoliO157EcO_11656 [Escherichia coli O157:H7 str. FRIK966] gi|291284274|ref|YP_003501092.1| hypothetical protein G2583_3612 [Escherichia coli O55:H7 str. CB9615] gi|29839727|sp|Q8XCU6|YGGU_ECO57 RecName: Full=UPF0235 protein yggU gi|226730816|sp|B5YQF1|YGGU_ECO5E RecName: Full=UPF0235 protein yggU gi|187770933|gb|EDU34777.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4196] gi|188017898|gb|EDU56020.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4113] gi|189000572|gb|EDU69558.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4076] gi|189358605|gb|EDU77024.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4401] gi|189363683|gb|EDU82102.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4486] gi|189369349|gb|EDU87765.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4501] gi|189374116|gb|EDU92532.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC869] gi|189379790|gb|EDU98206.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC508] gi|208726948|gb|EDZ76549.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4206] gi|208734759|gb|EDZ83446.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4045] gi|208738388|gb|EDZ86070.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4042] gi|209160773|gb|ACI38206.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4115] gi|217319649|gb|EEC28074.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. TW14588] gi|254594305|gb|ACT73666.1| conserved protein [Escherichia coli O157:H7 str. TW14359] gi|290764147|gb|ADD58108.1| conserved hypothetical protein [Escherichia coli O55:H7 str. CB9615] gi|320189302|gb|EFW63961.1| hypothetical protein ECoD_04306 [Escherichia coli O157:H7 str. EC1212] gi|320640599|gb|EFX10138.1| hypothetical protein ECO5101_04264 [Escherichia coli O157:H7 str. G5101] gi|320645846|gb|EFX14831.1| hypothetical protein ECO9389_23751 [Escherichia coli O157:H- str. 493-89] gi|320651146|gb|EFX19586.1| hypothetical protein ECO2687_11738 [Escherichia coli O157:H- str. H 2687] gi|320656642|gb|EFX24538.1| hypothetical protein ECO7815_01895 [Escherichia coli O55:H7 str. 3256-97 TW 07815] gi|320662161|gb|EFX29562.1| hypothetical protein ECO5905_09833 [Escherichia coli O55:H7 str. USDA 5905] gi|320667236|gb|EFX34199.1| hypothetical protein ECOSU61_08919 [Escherichia coli O157:H7 str. LSU-61] gi|326338959|gb|EGD62774.1| hypothetical protein ECoA_04497 [Escherichia coli O157:H7 str. 1044] gi|326343159|gb|EGD66927.1| hypothetical protein ECF_02672 [Escherichia coli O157:H7 str. 1125] Length = 96 Score = 67.3 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|81242423|gb|ABB63133.1| conserved hypothetical protein [Shigella dysenteriae Sd197] Length = 100 Score = 67.3 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANGHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VVIEKGELGRHKQIKI 85 >gi|322513549|ref|ZP_08066649.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC 25976] gi|322120620|gb|EFX92514.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC 25976] Length = 99 Score = 67.3 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +T P G AN +L L+K + KS Sbjct: 14 RLRIFLQPKASRDQIVGLHDNE-------LKIAITTPPVDGAANAHLLKYLSKLFKVPKS 66 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ + + K +++ + K + + ++ Sbjct: 67 SIVLEKGELQRHKQLFVP-EPKLLPKEIE 94 >gi|113461793|ref|YP_719862.1| hypothetical protein HS_1657 [Haemophilus somnus 129PT] gi|170718106|ref|YP_001785139.1| hypothetical protein HSM_1819 [Haemophilus somnus 2336] gi|123132241|sp|Q0I525|Y1657_HAES1 RecName: Full=UPF0235 protein HS_1657 gi|226696075|sp|B0UWD6|Y1819_HAES2 RecName: Full=UPF0235 protein HSM_1819 gi|112823836|gb|ABI25925.1| conserved hypothetical protein [Haemophilus somnus 129PT] gi|168826235|gb|ACA31606.1| protein of unknown function DUF167 [Haemophilus somnus 2336] Length = 99 Score = 67.3 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 35/78 (44%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K + L +KI +TA P G+AN +L L+K ++KS Sbjct: 12 RLRIFLQPKASKDHLIGLYDNA-------LKISITAPPIDGQANAHLLKFLSKTFKVAKS 64 Query: 63 SLRMLSKQSSPLKIIYID 80 + + + S K I I Sbjct: 65 QIILEKGELSRHKQILIP 82 >gi|297580597|ref|ZP_06942523.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535013|gb|EFH73848.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 96 Score = 67.3 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EIT L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96 >gi|262403916|ref|ZP_06080473.1| hypothetical protein VOA_001904 [Vibrio sp. RC586] gi|262349878|gb|EEY99014.1| hypothetical protein VOA_001904 [Vibrio sp. RC586] Length = 96 Score = 67.3 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKQCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPPEIAALIE 96 >gi|16766403|ref|NP_462018.1| hypothetical protein STM3102 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167990371|ref|ZP_02571471.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262086|ref|ZP_03162160.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|29839738|sp|Q8ZM46|YGGU_SALTY RecName: Full=UPF0235 protein yggU gi|16421655|gb|AAL21977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197240341|gb|EDY22961.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205331149|gb|EDZ17913.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261248233|emb|CBG26070.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267995267|gb|ACY90152.1| hypothetical protein STM14_3746 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301159657|emb|CBW19176.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312914124|dbj|BAJ38098.1| hypothetical protein STMDT12_C31550 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321225775|gb|EFX50829.1| UPF0235 protein VC [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323131458|gb|ADX18888.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] Length = 96 Score = 66.9 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P G+AN + L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKIAITAPPVDGQANSHLTKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|291278644|ref|YP_003495479.1| hypothetical protein DEFDS_0212 [Deferribacter desulfuricans SSM1] gi|290753346|dbj|BAI79723.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1] Length = 82 Score = 66.9 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 27/81 (33%), Positives = 43/81 (53%), Gaps = 7/81 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AKK+ +A KIKV + P G ANK ++ LAKKL +SK Sbjct: 3 VRITFYIQPGAKKTEVAG-------EFNNMTKIKVASPPVDGAANKELIKFLAKKLGVSK 55 Query: 62 SSLRMLSKQSSPLKIIYIDKD 82 SS++++S + S +K + +D Sbjct: 56 SSVKIVSGEKSRIKTVEFLED 76 >gi|121535253|ref|ZP_01667067.1| protein of unknown function DUF167 [Thermosinus carboxydivorans Nor1] gi|121306138|gb|EAX47066.1| protein of unknown function DUF167 [Thermosinus carboxydivorans Nor1] Length = 100 Score = 66.9 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 45/89 (50%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +++ P A ++ + L +K+ V + P +G+AN+A LA A ++K+ Sbjct: 16 SFKIKVQPRASRNAVIGL-------AGDSLKVCVASPPVEGEANQACLAFFAALFGVAKT 68 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++S Q S K+I I D ++ +L Sbjct: 69 RIVLVSGQKSRSKVIKIMGIDMEQFKTVL 97 >gi|300925051|ref|ZP_07140969.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1] gi|300418797|gb|EFK02108.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1] Length = 96 Score = 66.9 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L + +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGL-------NGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|88859056|ref|ZP_01133697.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2] gi|88819282|gb|EAR29096.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2] Length = 101 Score = 66.9 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P A + L +K+ +TA P G+AN ++ LAK+ ++K+ Sbjct: 12 TLRLYVQPKASQDKFIGLH-------GNELKVAITAPPVDGQANSHLIKFLAKQCKVAKN 64 Query: 63 SLRMLSKQSSPLKIIYIDK 81 + + K + I K Sbjct: 65 QVCIKKGLQGRHKEVQISK 83 >gi|149192342|ref|ZP_01870547.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1] gi|148833820|gb|EDL50852.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1] Length = 95 Score = 66.9 bits (162), Expect = 8e-10, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + K+ +TA P GKAN + LAK+ ++K Sbjct: 13 LRIYVQPKASRDKLVG-------EHGEEFKVAITAPPVDGKANAHLSKYLAKQFKVAKGQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VLIEKGELGRHKQLRI 81 >gi|16761877|ref|NP_457494.1| hypothetical protein STY3255 [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|29143364|ref|NP_806706.1| hypothetical protein t3014 [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|161616066|ref|YP_001590031.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|167552006|ref|ZP_02345759.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|168234347|ref|ZP_02659405.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|168236170|ref|ZP_02661228.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|168264452|ref|ZP_02686425.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|168463711|ref|ZP_02697628.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|194470418|ref|ZP_03076402.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194735973|ref|YP_002116050.1| hypothetical protein SeSA_A3276 [Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633] gi|198244387|ref|YP_002217077.1| hypothetical protein SeD_A3445 [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|200388001|ref|ZP_03214613.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Virchow str. SL491] gi|205354025|ref|YP_002227826.1| hypothetical protein SG2996 [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|207858363|ref|YP_002245014.1| hypothetical protein SEN2945 [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|213027996|ref|ZP_03342443.1| hypothetical protein Salmonelentericaenterica_39070 [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] gi|213051778|ref|ZP_03344656.1| hypothetical protein Salmoneentericaenterica_01893 [Salmonella enterica subsp. enterica serovar Typhi str. E00-7866] gi|213422843|ref|ZP_03355881.1| hypothetical protein Salmonentericaenterica_35767 [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] gi|213424105|ref|ZP_03356998.1| hypothetical protein SentesTyphi_00030 [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] gi|213580619|ref|ZP_03362445.1| hypothetical protein SentesTyph_05132 [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] gi|213609099|ref|ZP_03368925.1| hypothetical protein SentesTyp_00567 [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] gi|213648208|ref|ZP_03378261.1| hypothetical protein SentesTy_13504 [Salmonella enterica subsp. enterica serovar Typhi str. J185] gi|213850158|ref|ZP_03381056.1| hypothetical protein SentesT_00598 [Salmonella enterica subsp. enterica serovar Typhi str. M223] gi|289825938|ref|ZP_06545097.1| hypothetical protein Salmonellentericaenterica_11175 [Salmonella enterica subsp. enterica serovar Typhi str. E98-3139] gi|29839732|sp|Q8Z3U7|YGGU_SALTI RecName: Full=UPF0235 protein yggU gi|189030121|sp|A9N4P8|YGGU_SALPB RecName: Full=UPF0235 protein yggU gi|226730825|sp|B5FUW5|YGGU_SALDC RecName: Full=UPF0235 protein yggU gi|226730826|sp|B5QY78|YGGU_SALEP RecName: Full=UPF0235 protein yggU gi|226730827|sp|B5RE62|YGGU_SALG2 RecName: Full=UPF0235 protein yggU gi|226730831|sp|B4TV71|YGGU_SALSV RecName: Full=UPF0235 protein yggU gi|25370114|pir||AF0878 conserved hypothetical protein STY3255 [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16504179|emb|CAD02926.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi] gi|29138998|gb|AAO70566.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|161365430|gb|ABX69198.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|194456782|gb|EDX45621.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194711475|gb|ACF90696.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633] gi|195633251|gb|EDX51665.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|197290920|gb|EDY30274.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|197938903|gb|ACH76236.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|199605099|gb|EDZ03644.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Virchow str. SL491] gi|205273806|emb|CAR38801.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205323332|gb|EDZ11171.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205331696|gb|EDZ18460.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|205347059|gb|EDZ33690.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|206710166|emb|CAR34522.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|326624849|gb|EGE31194.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Dublin str. 3246] gi|326629138|gb|EGE35481.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 96 Score = 66.9 bits (162), Expect = 9e-10, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|300866080|ref|ZP_07110810.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] gi|300335941|emb|CBN55968.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] Length = 78 Score = 66.9 bits (162), Expect = 9e-10, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + +++ PN+K+ I + D S KI + + P GKAN+ ++ +LAKK + Sbjct: 1 MAILTIKVKPNSKQQNI---QQEPDGS----FKISLKSPPIDGKANEELIKLLAKKFGIP 53 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS + + S SS K++ + Sbjct: 54 KSQITIKSGLSSKNKLVELP 73 >gi|90580282|ref|ZP_01236089.1| hypothetical protein VAS14_20161 [Vibrio angustum S14] gi|90438584|gb|EAS63768.1| hypothetical protein VAS14_20161 [Vibrio angustum S14] Length = 97 Score = 66.9 bits (162), Expect = 9e-10, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN ++ LAK+ ++K Sbjct: 15 IRLYIQPKASRDQIVGLH-------GNEVKIAITAPPVDGKANAHLVKYLAKQFKVAKGL 67 Query: 64 LRMLSKQSSPLKIIYID 80 + + K I I+ Sbjct: 68 IHVEKGLQGRHKQIRIE 84 >gi|161950048|ref|YP_404624.2| hypothetical protein SDY_3119 [Shigella dysenteriae Sd197] gi|309785219|ref|ZP_07679850.1| conserved hypothetical protein [Shigella dysenteriae 1617] gi|308926339|gb|EFP71815.1| conserved hypothetical protein [Shigella dysenteriae 1617] Length = 96 Score = 66.9 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANGHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|89074110|ref|ZP_01160609.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34] gi|89050046|gb|EAR55572.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34] Length = 97 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN ++ L+K+ ++K Sbjct: 15 IRLYIQPKASRDQIVGLH-------GNEVKIAITAPPVDGKANAHLVKYLSKQFKVAKGL 67 Query: 64 LRMLSKQSSPLKIIYID 80 + + K I I+ Sbjct: 68 IHLEKGLQGRHKQIRIE 84 >gi|307310426|ref|ZP_07590074.1| protein of unknown function DUF167 [Escherichia coli W] gi|306909321|gb|EFN39816.1| protein of unknown function DUF167 [Escherichia coli W] gi|315062259|gb|ADT76586.1| conserved hypothetical protein [Escherichia coli W] gi|323377157|gb|ADX49425.1| protein of unknown function DUF167 [Escherichia coli KO11] Length = 96 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSYLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|262172428|ref|ZP_06040106.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451] gi|261893504|gb|EEY39490.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451] Length = 97 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 41/92 (44%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQN 92 + + + K + I ++ EI LL++ Sbjct: 66 VVIEKGELGRHKQVRIQQPNQIPPEIAALLES 97 >gi|289811236|ref|ZP_06541865.1| hypothetical protein Salmonellaentericaenterica_45672 [Salmonella enterica subsp. enterica serovar Typhi str. AG3] Length = 94 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + L K+ ++KS Sbjct: 11 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQ 63 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 64 IVIEKGELGRHKQVKI 79 >gi|94266431|ref|ZP_01290126.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1] gi|93452973|gb|EAT03472.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1] Length = 121 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 45/88 (51%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR P A ++ +A + ++I+V A P GKAN+A+L LA + L +++ Sbjct: 38 LRVRAQPGAARTEVAG-------TYGARLRIRVAAPPVDGKANRALLTFLASRCGLVRNA 90 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ Q K+ ++ +++T L Sbjct: 91 VTLVGGQRGRDKLFRLEGIGPEQLTTCL 118 >gi|284039659|ref|YP_003389589.1| hypothetical protein Slin_4812 [Spirosoma linguale DSM 74] gi|283818952|gb|ADB40790.1| protein of unknown function DUF167 [Spirosoma linguale DSM 74] Length = 86 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 45/88 (51%), Gaps = 7/88 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + ++ P +K ++ D + + K+ A Q GKAN ++ LAK+L + KS Sbjct: 2 TLHLKAKPGSKID-----QLFYDAAGQ--LNAKIRAPAQDGKANAYLIEFLAKQLGIPKS 54 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 + +++ ++P K I +D + +T+ L Sbjct: 55 GVSIVAGFTNPHKRIEVDVPEEVLTDFL 82 >gi|167386103|ref|XP_001737619.1| hypothetical protein [Entamoeba dispar SAW760] gi|165899553|gb|EDR26129.1| hypothetical protein, conserved [Entamoeba dispar SAW760] Length = 118 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 39/80 (48%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + PNAK S + +E +K+ + A P GKAN ++A +A + K Sbjct: 33 VIIEIEIKPNAKTSELQGVED-------GILKVAIDAPPIDGKANTEVIAFMASTFGIKK 85 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S++ ++ Q+S K + + Sbjct: 86 SNVSLIKGQTSHHKTLQFEN 105 >gi|77464376|ref|YP_353880.1| hypothetical protein RSP_0800 [Rhodobacter sphaeroides 2.4.1] gi|77388794|gb|ABA79979.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 85 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 39/76 (51%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A ++ + E +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 18 AVRVTPRAARAKVELQE--------GVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69 Query: 65 RMLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 70 TLVRGATSRDKTFRLD 85 >gi|330446896|ref|ZP_08310547.1| conserved hypothetical protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491087|dbj|GAA05044.1| conserved hypothetical protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 97 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P A + I L +KI +TA P GKAN ++ L+K+ ++K Sbjct: 15 VRLYIQPKASRDQIVGLH-------GDEIKIAITAPPVDGKANAHLVKYLSKQFKVAKGL 67 Query: 64 LRMLSKQSSPLKIIYID 80 + + K + I+ Sbjct: 68 IHVEKGLQGRHKQVRIE 84 >gi|291615210|ref|YP_003525367.1| hypothetical protein Slit_2755 [Sideroxydans lithotrophicus ES-1] gi|291585322|gb|ADE12980.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus ES-1] Length = 94 Score = 66.1 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 41/88 (46%), Gaps = 7/88 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P AK++ +A L +KI++ A P +G+AN+A+L +A+ + Sbjct: 13 TLTLHIQPGAKRTEVAGLH-------GAALKIRLAAPPIEGRANEALLKFIAESFGVPLR 65 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S K++ + E LL Sbjct: 66 QVELKQGGQSRHKVVAVTASKIEPESLL 93 >gi|17230688|ref|NP_487236.1| hypothetical protein asl3196 [Nostoc sp. PCC 7120] gi|29839730|sp|Q8YS95|Y3196_ANASP RecName: Full=UPF0235 protein asl3196 gi|17132291|dbj|BAB74895.1| asl3196 [Nostoc sp. PCC 7120] Length = 75 Score = 66.1 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 28/77 (36%), Positives = 45/77 (58%), Gaps = 7/77 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K+ IA D S T+H+K + P GKAN+ ++ +LA+K + KS + Sbjct: 5 VKVKPNSKQQKIA---EQDDGSLTVHLK----SPPVDGKANEELIKLLAEKFDVPKSHIT 57 Query: 66 MLSKQSSPLKIIYIDKD 82 + S SS K+I I+ D Sbjct: 58 IKSGLSSKQKLIEIETD 74 >gi|239906817|ref|YP_002953558.1| hypothetical protein DMR_21810 [Desulfovibrio magneticus RS-1] gi|239796683|dbj|BAH75672.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 122 Score = 66.1 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 40/82 (48%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V + P K +A L + +++++ A +G+AN A+ +A+ L + Sbjct: 36 TLRVAVTPGGAKDALAGLAEDR-------LRVRLRAKAVEGQANAALTDFVARCLGVKPR 88 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 +R++S + S K + I+ + + Sbjct: 89 QVRIISGEKSRKKTLRIETESE 110 >gi|331664536|ref|ZP_08365442.1| conserved hypothetical protein [Escherichia coli TA143] gi|331058467|gb|EGI30448.1| conserved hypothetical protein [Escherichia coli TA143] Length = 96 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKHIKI 81 >gi|194334667|ref|YP_002016527.1| hypothetical protein Paes_1868 [Prosthecochloris aestuarii DSM 271] gi|226696156|sp|B4S4I4|Y1868_PROA2 RecName: Full=UPF0235 protein Paes_1868 gi|194312485|gb|ACF46880.1| protein of unknown function DUF167 [Prosthecochloris aestuarii DSM 271] Length = 104 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 35/77 (45%), Gaps = 7/77 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V+ P + +S I D +KI + A P AN LAK L ++ S + Sbjct: 14 FVKAQPRSSRSAIIG---EYDGK----IKISLKAAPVDDAANVECCRFLAKSLGVASSRV 66 Query: 65 RMLSKQSSPLKIIYIDK 81 R+LS SS +K + ID Sbjct: 67 RILSGHSSRIKRLTIDG 83 >gi|147674564|ref|YP_001215984.1| hypothetical protein VC0395_A0010 [Vibrio cholerae O395] gi|153216287|ref|ZP_01950380.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|262167148|ref|ZP_06034862.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27] gi|172047756|sp|A5F9H9|Y1210_VIBC3 RecName: Full=UPF0235 protein VC0395_A0010/VC395_0502 gi|124114376|gb|EAY33196.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|146316447|gb|ABQ20986.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227012311|gb|ACP08521.1| conserved hypothetical protein [Vibrio cholerae O395] gi|262024448|gb|EEY43135.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27] Length = 96 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EIT L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96 >gi|127512067|ref|YP_001093264.1| hypothetical protein Shew_1134 [Shewanella loihica PV-4] gi|166228889|sp|A3QC07|Y1134_SHELP RecName: Full=UPF0235 protein Shew_1134 gi|126637362|gb|ABO23005.1| protein of unknown function DUF167 [Shewanella loihica PV-4] Length = 96 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 32/76 (42%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I +K+ +TA P GKAN + LAK+ +K + Sbjct: 13 IRLYIQPKASRDQIVGPH-------GDELKVAITAPPIDGKANAHLCKFLAKQFKTAKGN 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I + Sbjct: 66 ILIEKGELGRHKQIRV 81 >gi|89898566|ref|YP_515676.1| hypothetical protein CF0759 [Chlamydophila felis Fe/C-56] gi|89331938|dbj|BAE81531.1| conserved hypothetical protein [Chlamydophila felis Fe/C-56] Length = 65 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 23/56 (41%), Positives = 38/56 (67%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEIT 87 +KI+VT P+KG+AN+A++A+LAK L+L K + ++S SS K + + K + I Sbjct: 2 LKIRVTEAPEKGRANEAVIALLAKTLSLPKRDVTLISGDSSRKKRLLLPKAAESII 57 >gi|223937408|ref|ZP_03629313.1| protein of unknown function DUF167 [bacterium Ellin514] gi|223893959|gb|EEF60415.1| protein of unknown function DUF167 [bacterium Ellin514] Length = 100 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++L P A + I ++IKVTA P AN+A+L +LA L + Sbjct: 16 LSIKLQPRASANQIG-------EPLGNELRIKVTAPPVDAAANEALLRLLADILKCPRGK 68 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94 + ++ +S K+I + + + L ++D Sbjct: 69 VELVRGHTSRHKVIKLHGLEANAVLTRLISHD 100 >gi|209694133|ref|YP_002262061.1| hypothetical protein VSAL_I0540 [Aliivibrio salmonicida LFI1238] gi|208008084|emb|CAQ78225.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238] Length = 83 Score = 66.1 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 31/73 (42%), Gaps = 7/73 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + I + +K+ +TA P GKAN ++ +K ++K Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKVAITAPPVDGKANAHLIKYFSKLFKVAKGK 64 Query: 64 LRMLSKQSSPLKI 76 + + + + K Sbjct: 65 ITVEKGELNRHKQ 77 >gi|225849498|ref|YP_002729663.1| hypothetical protein SULAZ_1705 [Sulfurihydrogenibium azorense Az-Fu1] gi|225644124|gb|ACN99174.1| conserved domain protein [Sulfurihydrogenibium azorense Az-Fu1] Length = 72 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ P + K+ + +E +++ T P+KGKAN ++ ++++ L + KS Sbjct: 2 RIKVKVKPGSSKNEVKKIE-------ENFYEVRCTTIPEKGKANDKVIELMSEYLDVPKS 54 Query: 63 SLRMLSKQSSPLKIIYID 80 ++++ SS K I I+ Sbjct: 55 RIKIIKGHSSREKEIEIE 72 >gi|229530300|ref|ZP_04419688.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)] gi|229332073|gb|EEN97561.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)] gi|327483312|gb|AEA77719.1| UPF0235 protein [Vibrio cholerae LMA3894-4] Length = 96 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EIT L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96 >gi|254508600|ref|ZP_05120716.1| conserved hypothetical protein TIGR00251 [Vibrio parahaemolyticus 16] gi|219548451|gb|EED25460.1| conserved hypothetical protein TIGR00251 [Vibrio parahaemolyticus 16] Length = 96 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 39/90 (43%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKLVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELL 90 +++ + K + + + EI +L Sbjct: 67 IKIEKGELGRHKQLRVNSPTQIPAEIKAIL 96 >gi|121728574|ref|ZP_01681595.1| conserved hypothetical protein [Vibrio cholerae V52] gi|153802603|ref|ZP_01957189.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|229512520|ref|ZP_04401991.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21] gi|254291169|ref|ZP_04961965.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|121629130|gb|EAX61573.1| conserved hypothetical protein [Vibrio cholerae V52] gi|124121866|gb|EAY40609.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|150422863|gb|EDN14814.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229350413|gb|EEO15362.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21] Length = 96 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|330795787|ref|XP_003285952.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum] gi|325084041|gb|EGC37478.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum] Length = 110 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 47/86 (54%), Gaps = 7/86 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + PN+K++ I S E + ++++ P G+ANK ++ L+K+L L KS Sbjct: 25 KLNINVHPNSKENQIISFENE-------ILSLRISEPPIDGQANKGVVEFLSKELGLRKS 77 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITE 88 ++++ S K I ID + + IT+ Sbjct: 78 NIQVSKGSKSRNKSIEIDLESESITK 103 >gi|262191069|ref|ZP_06049276.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93] gi|262033045|gb|EEY51576.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93] Length = 96 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIIGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|46446134|ref|YP_007499.1| hypothetical protein pc0500 [Candidatus Protochlamydia amoebophila UWE25] gi|46399775|emb|CAF23224.1| conserved hypothetical protein [Candidatus Protochlamydia amoebophila UWE25] Length = 92 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 43/87 (49%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++IP A S E +KI++ A P KG+AN ++ L+ + KSS Sbjct: 11 IKIKVIPLASFSEKVGWE-------GDELKIRLAAIPDKGQANTELIRFLSSLFKIRKSS 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 ++++ Q+S K I I E ++L Sbjct: 64 IQLIQGQTSRHKKICIQDISLERLQVL 90 >gi|21674646|ref|NP_662711.1| hypothetical protein CT1832 [Chlorobium tepidum TLS] gi|29839718|sp|Q8KBF5|Y1832_CHLTE RecName: Full=UPF0235 protein CT1832 gi|21647849|gb|AAM73053.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 105 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P + KSG+A + +KI + + P ANK +LAK L + +SS Sbjct: 13 LSVRVQPRSSKSGVAGMY-------GEQLKICLKSAPVDNAANKECCELLAKALGVPRSS 65 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++ SS K++ ++ Sbjct: 66 VSVMKGASSRSKVLKVEG 83 >gi|153828377|ref|ZP_01981044.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|229519844|ref|ZP_04409278.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80] gi|148876086|gb|EDL74221.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|229343132|gb|EEO08116.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80] Length = 96 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|73856987|gb|AAZ89694.1| conserved hypothetical protein [Shigella sonnei Ss046] Length = 100 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+ N ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQGNSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 70 VVIEKGELGRHKQIKI 85 >gi|270157502|ref|ZP_06186159.1| conserved hypothetical protein [Legionella longbeachae D-4968] gi|269989527|gb|EEZ95781.1| conserved hypothetical protein [Legionella longbeachae D-4968] Length = 80 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AKKS I + +KI++ A P +G+ANK +L +A+ + S Sbjct: 1 MYLYVQPGAKKSEIVGMH-------EGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQ 53 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + S K++ + + Sbjct: 54 VVLKRGDKSRHKVLLVKNSLVD 75 >gi|152991249|ref|YP_001356971.1| hypothetical protein NIS_1507 [Nitratiruptor sp. SB155-2] gi|151423110|dbj|BAF70614.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2] Length = 95 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PNA K+ IA + +KI + A +G ANK ++ L+K ++KS Sbjct: 12 MFIKAQPNASKNKIAGI-------LGDSLKIAIKAPAVEGAANKELVKFLSKTFKVAKSD 64 Query: 64 LRMLSKQSSPLKIIYID 80 + S ++S K I + Sbjct: 65 IVFASGETSKRKHIVMP 81 >gi|189219304|ref|YP_001939945.1| hypothetical protein Minf_1293 [Methylacidiphilum infernorum V4] gi|189186162|gb|ACD83347.1| Uncharacterized conserved protein [Methylacidiphilum infernorum V4] Length = 97 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ NAKK+ I S +KI+++A P +GKAN A+L+ L+ +L + K Sbjct: 4 ARLFVKVQANAKKTEICG-------SYADALKIRLSAPPVEGKANDALLSFLSLRLCVPK 56 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 +R+ + + K + I+ Sbjct: 57 RLIRIEKGEKNSKKTVVIEG 76 >gi|83648992|ref|YP_437427.1| hypothetical protein HCH_06357 [Hahella chejuensis KCTC 2396] gi|83637035|gb|ABC33002.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396] Length = 102 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 8/86 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P AKK I +KIK++A P G+AN+ ++ LAK + + +++L Sbjct: 24 LQPGAKKDEIVGTH-------GDALKIKISAPPIDGRANQQLVRFLAKLCRVKQQDVQIL 76 Query: 68 SKQSSPLKIIYIDKDCKEITELLQNN 93 + +SS K I + +I +LL+ Sbjct: 77 AGESSRQKRIRVQN-LTDIPKLLKPY 101 >gi|115380361|ref|ZP_01467361.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] gi|115362627|gb|EAU61862.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] Length = 83 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 15/74 (20%), Positives = 37/74 (50%), Gaps = 7/74 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P A ++ + +K+++ A P G+AN A++ LAK+L L + + ++ Sbjct: 3 VQPRASRTRVVGEHDGM-------LKLQLAAPPVDGEANAALVEFLAKRLGLPRRQVTLV 55 Query: 68 SKQSSPLKIIYIDK 81 + ++ K +++ Sbjct: 56 AGDAARRKRVFLAG 69 >gi|256024538|ref|ZP_05438403.1| hypothetical protein E4_14269 [Escherichia sp. 4_1_40B] Length = 96 Score = 65.4 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKLFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|114332111|ref|YP_748333.1| hypothetical protein Neut_2146 [Nitrosomonas eutropha C91] gi|122313207|sp|Q0AE64|Y2146_NITEC RecName: Full=UPF0235 protein Neut_2146 gi|114309125|gb|ABI60368.1| protein of unknown function DUF167 [Nitrosomonas eutropha C91] Length = 99 Score = 65.4 bits (158), Expect = 2e-09, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A+++ + +KIK+ A P GKAN+A+ LAK+ + Sbjct: 14 LKLYIQPGARQTEAIGVH-------GEELKIKLAAPPMDGKANRALAVFLAKRFNVPLKH 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + S K++ I Sbjct: 67 ITLKWGAQSRHKVVEI 82 >gi|165976805|ref|YP_001652398.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|226734144|sp|B0BQW9|Y1398_ACTPJ RecName: Full=UPF0235 protein APJL_1398 gi|165876906|gb|ABY69954.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 98 Score = 65.4 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 11/92 (11%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I L + +KI +TA P G AN +L L+K + KS Sbjct: 14 RLRIFLQPKASRDQIVGLHDSE-------LKIAITAPPVDGAANAHLLKYLSKLFKVPKS 66 Query: 63 SLRMLSKQSSPL-KIIYIDK---DCKEITELL 90 S+ + + K +++ + KEI LL Sbjct: 67 SIVLEKGELQRHNKQLFVPEPKLIPKEIEALL 98 >gi|161986454|ref|YP_311929.2| hypothetical protein SSON_3107 [Shigella sonnei Ss046] gi|323167995|gb|EFZ53684.1| hypothetical protein SS53G_1711 [Shigella sonnei 53G] Length = 96 Score = 65.4 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+ N ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQGNSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K I I Sbjct: 66 VVIEKGELGRHKQIKI 81 >gi|251793930|ref|YP_003008662.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus NJ8700] gi|247535329|gb|ACS98575.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus NJ8700] Length = 97 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 36/76 (47%), Gaps = 7/76 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L + +KI++TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKDHIVGLHDDE-------LKIRITAPPIDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIY 78 S+ + + + K + Sbjct: 65 SIVLEKGELNCHKQVL 80 >gi|149177442|ref|ZP_01856046.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797] gi|148843775|gb|EDL58134.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797] Length = 104 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 27/76 (35%), Positives = 42/76 (55%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR P + K+GI + D +K+ VT P+KGKANKA+L +L L L +S Sbjct: 16 LPVRAQPRSSKNGIEGVH---DGR----LKVCVTQVPEKGKANKALLKVLQTALKLKRSQ 68 Query: 64 LRMLSKQSSPLKIIYI 79 + + +++ LKI I Sbjct: 69 IELYKGETAALKIFRI 84 >gi|163757393|ref|ZP_02164482.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43] gi|162284895|gb|EDQ35177.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43] Length = 63 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 24/59 (40%), Positives = 39/59 (66%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 MKI+V A + KAN+A+ A LAK+L L+KS +R++S +S K + ++ D +E+ L Sbjct: 1 MKIRVRAVAENNKANRALEAFLAKRLKLAKSRVRVISGANSRTKTVRLEGDPQELAAKL 59 >gi|121590704|ref|ZP_01678036.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|153819156|ref|ZP_01971823.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|153826562|ref|ZP_01979229.1| conserved hypothetical protein [Vibrio cholerae MZO-2] gi|227080668|ref|YP_002809219.1| hypothetical protein VCM66_0443 [Vibrio cholerae M66-2] gi|229507096|ref|ZP_04396602.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286] gi|298501011|ref|ZP_07010812.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|254803911|sp|C3LRX5|Y443_VIBCM RecName: Full=UPF0235 protein VCM66_0443 gi|121547435|gb|EAX57544.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|126510301|gb|EAZ72895.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|149739654|gb|EDM53868.1| conserved hypothetical protein [Vibrio cholerae MZO-2] gi|227008556|gb|ACP04768.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|229355841|gb|EEO20761.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286] gi|297540259|gb|EFH76319.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 96 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVVEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|145590207|ref|YP_001156804.1| hypothetical protein Pnuc_2029 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145048613|gb|ABP35240.1| protein of unknown function DUF167 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 98 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 9/91 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + P AK + + L D +KI + A + KAN+ +L L+K+L + + Sbjct: 14 TLNLHCQPGAKVTKVVGLH---DG----CLKISLQAPAIENKANELLLGWLSKQLKIPQK 66 Query: 63 SLRMLSKQSSPLKIIYIDKD--CKEITELLQ 91 ++ +S Q+S +K + I ++IT++LQ Sbjct: 67 QIQFISGQNSRIKRVEIWGSITPEQITQILQ 97 >gi|15640485|ref|NP_230112.1| hypothetical protein VC0458 [Vibrio cholerae O1 biovar El Tor str. N16961] gi|153823175|ref|ZP_01975842.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229509068|ref|ZP_04398556.1| hypothetical protein VCE_000471 [Vibrio cholerae B33] gi|229519736|ref|ZP_04409179.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9] gi|229606248|ref|YP_002876896.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236] gi|254850689|ref|ZP_05240039.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255744295|ref|ZP_05418248.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101] gi|262147280|ref|ZP_06028079.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1] gi|29839647|sp|Q9KUQ7|Y458_VIBCH RecName: Full=UPF0235 protein VC_0458 gi|9654883|gb|AAF93631.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|126519301|gb|EAZ76524.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229344425|gb|EEO09400.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9] gi|229353993|gb|EEO18927.1| hypothetical protein VCE_000471 [Vibrio cholerae B33] gi|229368903|gb|ACQ59326.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236] gi|254846394|gb|EET24808.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255738235|gb|EET93627.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101] gi|262031274|gb|EEY49889.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1] Length = 96 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVVEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|302381263|ref|YP_003817086.1| hypothetical protein Bresu_0148 [Brevundimonas subvibrioides ATCC 15264] gi|302191891|gb|ADK99462.1| protein of unknown function DUF167 [Brevundimonas subvibrioides ATCC 15264] Length = 92 Score = 65.0 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 25/81 (30%), Positives = 43/81 (53%), Gaps = 2/81 (2%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + VRL P A I + D +K++V A P +G+AN A++ +LAK L + Sbjct: 1 MARLPVRLTPGASTDRIDGWD--ADPEGRPVLKVRVRARPVEGEANAALILLLAKALGVP 58 Query: 61 KSSLRMLSKQSSPLKIIYIDK 81 +S++ + S LK+I ++ Sbjct: 59 RSTVSLARGGQSRLKMIEVEG 79 >gi|289547963|ref|YP_003472951.1| hypothetical protein Thal_0188 [Thermocrinis albus DSM 14484] gi|289181580|gb|ADC88824.1| protein of unknown function DUF167 [Thermocrinis albus DSM 14484] Length = 77 Score = 65.0 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + V+ P A K + L ++ V P+ GKAN+ +L +L+K L + Sbjct: 3 MIVIHVKAKPKASKEYVKEL-------SPNFYEVAVKEPPEDGKANERILELLSKHLKVP 55 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS +++L SS +K+ I Sbjct: 56 KSRIKLLRGTSSRIKVFCIS 75 >gi|82701713|ref|YP_411279.1| hypothetical protein Nmul_A0579 [Nitrosospira multiformis ATCC 25196] gi|82409778|gb|ABB73887.1| Protein of unknown function DUF167 [Nitrosospira multiformis ATCC 25196] Length = 105 Score = 65.0 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 24/92 (26%), Positives = 41/92 (44%), Gaps = 7/92 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A+++ + S +KIK+ A P +G AN A+LA LA + + Sbjct: 21 LTLHIQPGARRTEVVG-------SHGDALKIKLAAPPVEGAANVALLAFLAGVFGVPQRQ 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + S KI+ ID LL+ S Sbjct: 74 VILRQGARSRRKIVEIDGTACGADTLLKQTAS 105 >gi|51244641|ref|YP_064525.1| hypothetical protein DP0789 [Desulfotalea psychrophila LSv54] gi|50875678|emb|CAG35518.1| hypothetical protein DP0789 [Desulfotalea psychrophila LSv54] Length = 95 Score = 65.0 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++V P A K+ + L +KI + P GKANK ++ L++ L K Sbjct: 12 VILLVYTQPRASKTKVVGLHDGM-------LKIACCSPPVDGKANKELIVFLSRLLDCRK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 + +L QSS K + E+ + L Sbjct: 65 CDIELLRGQSSRRKQFVLTGVDAELLDKL 93 >gi|119946664|ref|YP_944344.1| hypothetical protein Ping_3043 [Psychromonas ingrahamii 37] gi|166229363|sp|A1SZ30|Y3043_PSYIN RecName: Full=UPF0235 protein Ping_3043 gi|119865268|gb|ABM04745.1| hypothetical protein DUF167 [Psychromonas ingrahamii 37] Length = 98 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P + + L +KI +TA P GKAN ++ L+K+ ++K + Sbjct: 15 LRLVLQPKSSRDQFIGL-------LGDELKIAITAPPVDGKANAHLIKFLSKQFKVAKGA 67 Query: 64 LRMLSKQSSPLKIIYI 79 + + S K + + Sbjct: 68 IIIEKGLLSRHKRVRV 83 >gi|256379751|ref|YP_003103411.1| hypothetical protein Amir_5752 [Actinosynnema mirum DSM 43827] gi|255924054|gb|ACU39565.1| protein of unknown function DUF167 [Actinosynnema mirum DSM 43827] Length = 90 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 6/84 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M VR+ P +K+ + D + + V A +GKAN+A+ LAK + Sbjct: 1 MLKFAVRVKPGSKRDAVGG------RWDERALVVSVAAPAVEGKANEAVRRALAKAFGVR 54 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCK 84 + + ++S + K++ ID Sbjct: 55 RQDVEIVSGERGRDKVVVIDPAPD 78 >gi|307152217|ref|YP_003887601.1| hypothetical protein Cyan7822_2348 [Cyanothece sp. PCC 7822] gi|306982445|gb|ADN14326.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7822] Length = 73 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ PNAK+ I LE + I + + P GKAN+ ++ +LAKK +SKS Sbjct: 2 KIQVKVKPNAKQQKIEELEDGS-------LVISLKSPPVDGKANEELIKLLAKKYQVSKS 54 Query: 63 SLRMLSKQSSPLKIIYI 79 + + S SS K+I I Sbjct: 55 QISIQSGLSSRNKLIEI 71 >gi|46579785|ref|YP_010593.1| hypothetical protein DVU1374 [Desulfovibrio vulgaris str. Hildenborough] gi|46449200|gb|AAS95852.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233576|gb|ADP86430.1| protein of unknown function DUF167 [Desulfovibrio vulgaris RCH1] Length = 106 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 ++V + P AKK G+A + D ++++++A KAN+ + +A L + + Sbjct: 18 LLVWVQPGAKKDGLAGV---ADGR----LRVRLSAPAVDNKANRGLERYMASLLGVRPAR 70 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + S Q+S K + I+ D + Sbjct: 71 VSVASGQTSRRKRVVIESDAE 91 >gi|300087416|ref|YP_003757938.1| hypothetical protein Dehly_0296 [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299527149|gb|ADJ25617.1| protein of unknown function DUF167 [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 96 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 49/92 (53%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P + ++ I + ++++VTA P+ GKAN+A+ +LA++L L KS Sbjct: 12 IALKVQPGSGRNEIT-------DTAAEIIRVRVTAAPEHGKANRAVAELLAERLGLPKSR 64 Query: 64 LRMLSKQSSPLKIIYIDKDCK-EITELLQNND 94 + ++ +S K+ + + E+ E L D Sbjct: 65 VTIVRGLTSRRKVAAVAGLSEAEVREKLGKTD 96 >gi|120602738|ref|YP_967138.1| hypothetical protein Dvul_1694 [Desulfovibrio vulgaris DP4] gi|120562967|gb|ABM28711.1| protein of unknown function DUF167 [Desulfovibrio vulgaris DP4] Length = 108 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 42/81 (51%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 ++V + P AKK G+A + D ++++++A KAN+ + +A L + + Sbjct: 20 LLVWVQPGAKKDGLAGV---ADGR----LRVRLSAPAVDNKANRGLERYMASLLGVRPAR 72 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + S Q+S K + I+ D + Sbjct: 73 VSVASGQTSRRKRVVIESDAE 93 >gi|296269396|ref|YP_003652028.1| hypothetical protein Tbis_1417 [Thermobispora bispora DSM 43833] gi|296092183|gb|ADG88135.1| protein of unknown function DUF167 [Thermobispora bispora DSM 43833] Length = 89 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 39/90 (43%), Gaps = 6/90 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P A + + T + ++V A G+A +A L +A + + Sbjct: 2 RVSIRVRPGASREYVGG------TYGDGAIVVRVCAPAVDGRATEAALKAVASAFGVRRG 55 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R++S ++ K++ I D +E+ Sbjct: 56 DVRLVSGATARDKVVEIAGDEEELARRAAR 85 >gi|78188297|ref|YP_378635.1| hypothetical protein Cag_0319 [Chlorobium chlorochromatii CaD3] gi|123580414|sp|Q3ATT3|Y319_CHLCH RecName: Full=UPF0235 protein Cag_0319 gi|78170496|gb|ABB27592.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3] Length = 99 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 36/87 (41%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR P + KS ++ +K+ + + P AN+ +LA+ + S Sbjct: 13 IAVRAQPRSSKSMVSG-------EWNGALKVHLQSPPVDDAANEECCRLLARLFQVPPSR 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + +++ SS K + ++ + L Sbjct: 66 VHLVAGHSSRNKRVMVEGVSAAMATEL 92 >gi|317153487|ref|YP_004121535.1| hypothetical protein Daes_1777 [Desulfovibrio aespoeensis Aspo-2] gi|316943738|gb|ADU62789.1| protein of unknown function DUF167 [Desulfovibrio aespoeensis Aspo-2] Length = 102 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 43/82 (52%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V + P A+KS +A + +KI++ A KANKA++A +A L + KS Sbjct: 15 RIAVWVQPGARKSEVAGVYQQC-------VKIRLCAPAVDNKANKALVAFVASVLNVKKS 67 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 + + S Q++ K++ ++ + Sbjct: 68 QVVIESGQTTRKKLLALNTVAE 89 >gi|91205448|ref|YP_537803.1| hypothetical protein RBE_0633 [Rickettsia bellii RML369-C] gi|157827193|ref|YP_001496257.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389] gi|122425693|sp|Q1RIV0|Y633_RICBR RecName: Full=UPF0235 protein RBE_0633 gi|226706174|sp|A8GWJ3|Y4385_RICB8 RecName: Full=UPF0235 protein A1I_04385 gi|91068992|gb|ABE04714.1| unknown [Rickettsia bellii RML369-C] gi|157802497|gb|ABV79220.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389] Length = 104 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ +++ AK + I I D +H+K+ + A Q+GKAN+ ++ LAK+ L + Sbjct: 13 ASLNIKVKAAAKSNDIKEFIIIND---VLHLKLSIKAHAQQGKANEEIINFLAKEWQLLR 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S+L + ++ LK I I +E L+ Sbjct: 70 SNLEITKGHTNSLKTILIKNIDEEYLNLI 98 >gi|171057075|ref|YP_001789424.1| hypothetical protein Lcho_0384 [Leptothrix cholodnii SP-6] gi|170774520|gb|ACB32659.1| protein of unknown function DUF167 [Leptothrix cholodnii SP-6] Length = 125 Score = 64.6 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V ++PNA+++ + L +++++ A P G AN A+ LA +L +SK Sbjct: 35 IDVAVVPNARRTEVVGLHDQA-------LRLRLAAPPVDGAANDALQRWLADELGVSKQQ 87 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + +L S K + + ++ Sbjct: 88 VSLLRGASGRRKRLRVQVPPAQM 110 >gi|56695839|ref|YP_166190.1| hypothetical protein SPO0937 [Ruegeria pomeroyi DSS-3] gi|56677576|gb|AAV94242.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 92 Score = 64.2 bits (155), Expect = 5e-09, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 38/74 (51%), Gaps = 8/74 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P A + + T ++I VTA P+ GKAN+A+ +LA+ + ++ S Sbjct: 22 IALRVTPKAARDSV--------TLAGEGLRITVTAPPEDGKANEAVRKLLARAMGVAPSR 73 Query: 64 LRMLSKQSSPLKII 77 L + Q++ K Sbjct: 74 LTLRRGQTARDKTF 87 >gi|302879874|ref|YP_003848438.1| hypothetical protein Galf_2678 [Gallionella capsiferriformans ES-2] gi|302582663|gb|ADL56674.1| protein of unknown function DUF167 [Gallionella capsiferriformans ES-2] Length = 96 Score = 64.2 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 7/92 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + P AK+S I L +K+K+ A P G+AN+A+L +A+ + Sbjct: 11 VITLTLHVQPGAKRSEICGLH-------GEALKLKLAAPPIDGRANEALLKYIAELFRVP 63 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S K++ + + LL + Sbjct: 64 VRQVELRQGAQSRHKVVAVTDSAIQPESLLTD 95 >gi|85859583|ref|YP_461785.1| putative cytoplasmic protein [Syntrophus aciditrophicus SB] gi|85722674|gb|ABC77617.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB] Length = 118 Score = 64.2 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 7/80 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V ++P + K +A + ++IK+TA P GKAN L LA L + K + Sbjct: 17 VHVLPRSAKCALAG-------AQEGALRIKLTAPPVDGKANDECLEFLAGILGVKKGQMD 69 Query: 66 MLSKQSSPLKIIYIDKDCKE 85 ++S +S KI+ I +E Sbjct: 70 IISGHTSRRKIVQIMNVPRE 89 >gi|73748976|ref|YP_308215.1| hypothetical protein cbdb_A1230 [Dehalococcoides sp. CBDB1] gi|147669743|ref|YP_001214561.1| hypothetical protein DehaBAV1_1103 [Dehalococcoides sp. BAV1] gi|289432972|ref|YP_003462845.1| hypothetical protein DehalGT_1029 [Dehalococcoides sp. GT] gi|123619917|sp|Q3ZYH5|Y1230_DEHSC RecName: Full=UPF0235 protein cbdbA1230 gi|189038739|sp|A5FQ39|Y1103_DEHSB RecName: Full=UPF0235 protein DehaBAV1_1103 gi|73660692|emb|CAI83299.1| conserved hypothetical protein [Dehalococcoides sp. CBDB1] gi|146270691|gb|ABQ17683.1| protein of unknown function DUF167 [Dehalococcoides sp. BAV1] gi|288946692|gb|ADC74389.1| protein of unknown function DUF167 [Dehalococcoides sp. GT] Length = 97 Score = 64.2 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 41/77 (53%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +++IP+A+K+ +A E +K+K+ A P+KGKANK ++ L+ L K+ Sbjct: 9 KVNLKIIPSARKNELAGYEN-------GLLKLKIAAQPEKGKANKELIDYLSDLLDTPKA 61 Query: 63 SLRMLSKQSSPLKIIYI 79 + + + K++ Sbjct: 62 EIEICHGHTGRNKVLAF 78 >gi|197117881|ref|YP_002138308.1| hypothetical protein Gbem_1494 [Geobacter bemidjiensis Bem] gi|197087241|gb|ACH38512.1| protein of unknown function DUF167 [Geobacter bemidjiensis Bem] Length = 99 Score = 63.8 bits (154), Expect = 6e-09, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 44/87 (50%), Gaps = 8/87 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P A +S I + +++++T+ P + ANK + ++AK L ++KS + Sbjct: 18 VHVQPRASRSEICG-------AKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKVS 70 Query: 66 MLSKQSSPLKIIYIDK-DCKEITELLQ 91 + S S K++ ++ D + LL+ Sbjct: 71 IKSGAKSRHKVVKVEGVDHDALLSLLK 97 >gi|218246502|ref|YP_002371873.1| hypothetical protein PCC8801_1666 [Cyanothece sp. PCC 8801] gi|257059535|ref|YP_003137423.1| hypothetical protein Cyan8802_1684 [Cyanothece sp. PCC 8802] gi|218166980|gb|ACK65717.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8801] gi|256589701|gb|ACV00588.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8802] Length = 72 Score = 63.8 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 45/75 (60%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAKK I + +D S +++K + P +GKAN+ ++ +LAKK +S+S + Sbjct: 5 VKVKPNAKKQKI---QEEEDGSLVVYLK----SPPIEGKANQELIKLLAKKFGVSQSQIS 57 Query: 66 MLSKQSSPLKIIYID 80 + S SS K + I+ Sbjct: 58 IKSGLSSRNKWVEIE 72 >gi|302038855|ref|YP_003799177.1| hypothetical protein NIDE3568 [Candidatus Nitrospira defluvii] gi|300606919|emb|CBK43252.1| conserved protein of unknown function DUF167 [Candidatus Nitrospira defluvii] Length = 105 Score = 63.8 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 35/88 (39%), Gaps = 7/88 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P A +S A L +KI++ A P G AN + LA+ + Sbjct: 20 VTISVHVQPKASRSECAGLH-------GHAVKIRIAAPPADGAANAELCRFLARCCEVPL 72 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89 S++ +LS S K + + E Sbjct: 73 SAVHILSGAGSRQKRVLVKGRTAEQVRA 100 >gi|114568606|ref|YP_755286.1| hypothetical protein Mmar10_0052 [Maricaulis maris MCS10] gi|114339068|gb|ABI64348.1| protein of unknown function DUF167 [Maricaulis maris MCS10] Length = 108 Score = 63.8 bits (154), Expect = 7e-09, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 49/89 (55%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VRL P A + +A D ++ ++ +V A P+KGKAN A++A+LA++L + K Sbjct: 11 VQVRLAPGASRDCLAG--SQADAAERHYLVARVRAIPEKGKANAALIALLARQLGIPKRD 68 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++ +S +K + I + E ++ Sbjct: 69 IDVIRGATSRMKTVRISANGPEQDRIVAQ 97 >gi|283850632|ref|ZP_06367919.1| protein of unknown function DUF167 [Desulfovibrio sp. FW1012B] gi|283573875|gb|EFC21848.1| protein of unknown function DUF167 [Desulfovibrio sp. FW1012B] Length = 125 Score = 63.8 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 35/82 (42%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V + P + +A L +++++ A +G+AN + LA L L Sbjct: 39 TLRVVVTPGGSRDTLAGL-------AEGRLRVRLRAKAVEGQANAGLTVFLAGCLGLRPR 91 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 + ++S + S K + I + + Sbjct: 92 QVAIVSGEKSRKKTLRISAESE 113 >gi|260775569|ref|ZP_05884466.1| hypothetical protein VIC_000947 [Vibrio coralliilyticus ATCC BAA-450] gi|260608750|gb|EEX34915.1| hypothetical protein VIC_000947 [Vibrio coralliilyticus ATCC BAA-450] Length = 96 Score = 63.5 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +KI +TA P GKAN + L+K+ ++K Sbjct: 14 LRLYIQPKASRDKLIGLH-------GDEIKIAITAPPVDGKANAHLSKYLSKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 67 ITIEKGELGRHKQVRI 82 >gi|253701327|ref|YP_003022516.1| hypothetical protein GM21_2722 [Geobacter sp. M21] gi|251776177|gb|ACT18758.1| protein of unknown function DUF167 [Geobacter sp. M21] Length = 99 Score = 63.5 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 44/87 (50%), Gaps = 8/87 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + P A +S I + +++++T+ P + ANK + ++AK L ++KS + Sbjct: 18 VHVQPRASRSEICG-------AKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKVS 70 Query: 66 MLSKQSSPLKIIYIDK-DCKEITELLQ 91 + S S K++ ++ D + LL+ Sbjct: 71 IRSGAKSRHKVVKVEGVDHDALLSLLK 97 >gi|229525149|ref|ZP_04414554.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis VL426] gi|229338730|gb|EEO03747.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis VL426] Length = 96 Score = 63.5 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I EI+ L++ Sbjct: 66 VVVEKGELGRHKQVRILQPSLIPAEISALIE 96 >gi|22299537|ref|NP_682784.1| hypothetical protein tsr1994 [Thermosynechococcus elongatus BP-1] gi|29839707|sp|Q8DHG5|Y1994_THEEB RecName: Full=UPF0235 protein tsr1994 gi|22295720|dbj|BAC09546.1| tsr1994 [Thermosynechococcus elongatus BP-1] Length = 74 Score = 63.5 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 7/79 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M V + PNA+++ ++ + + + V A G+AN+ ++A+LA + Sbjct: 1 MGKKHVIVKPNARQASVS-------ITPAGQLLVTVRAPASDGQANQELIALLAAYFGVP 53 Query: 61 KSSLRMLSKQSSPLKIIYI 79 KS ++++ +S K+I + Sbjct: 54 KSRIQLVKGHTSRHKVIEL 72 >gi|209525443|ref|ZP_03273983.1| protein of unknown function DUF167 [Arthrospira maxima CS-328] gi|209494123|gb|EDZ94438.1| protein of unknown function DUF167 [Arthrospira maxima CS-328] Length = 73 Score = 63.5 bits (153), Expect = 9e-09, Method: Composition-based stats. Identities = 25/80 (31%), Positives = 45/80 (56%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M +++ PN+ + I D S T+++K + P GKAN+ ++ +LAKKL + Sbjct: 1 MAFFNIQVKPNSPQQLI---RKEADGSLTVYLK----SPPVDGKANQELIKLLAKKLDVP 53 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS++++ S SS K++ I Sbjct: 54 KSNIKIKSGLSSRRKLVEIS 73 >gi|114778113|ref|ZP_01453000.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1] gi|114551531|gb|EAU54085.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1] Length = 101 Score = 63.5 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 41/87 (47%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + P A+K + + +KI V Q GKAN+A++ +A L LS++ Sbjct: 16 VNIHAQPGARKPALRGMH-------GDALKIAVAEAAQDGKANEAIVRFIADALNLSRAD 68 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S +S K +++ D E+ L Sbjct: 69 VDVASGHTSRRKRLFLHGDGSELRARL 95 >gi|145224153|ref|YP_001134831.1| hypothetical protein Mflv_3569 [Mycobacterium gilvum PYR-GCK] gi|315444488|ref|YP_004077367.1| hypothetical protein Mspyr1_29120 [Mycobacterium sp. Spyr1] gi|189040164|sp|A4T9S3|Y3569_MYCGI RecName: Full=UPF0235 protein Mflv_3569 gi|145216639|gb|ABP46043.1| protein of unknown function DUF167 [Mycobacterium gilvum PYR-GCK] gi|315262791|gb|ADT99532.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1] Length = 75 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 40/76 (52%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P ++K + +E +D + + + V GKAN+A+ +LA+ L + +S Sbjct: 5 ISVRVKPGSRKGPL--VEAGEDGA----LTLYVQERAVDGKANEAVTKLLAEHLGVPRSR 58 Query: 64 LRMLSKQSSPLKIIYI 79 + ++S +S K I Sbjct: 59 IELVSGATSRHKRFRI 74 >gi|320108335|ref|YP_004183925.1| hypothetical protein AciPR4_3175 [Terriglobus saanensis SP1PR4] gi|319926856|gb|ADV83931.1| protein of unknown function DUF167 [Terriglobus saanensis SP1PR4] Length = 104 Score = 63.1 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 40/80 (50%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 VR+ P AK+SG+ + +KI + A GKAN+A++ +A L + + Sbjct: 16 VTFAVRVQPGAKRSGVVGIY-------GEAVKIALVAPAVDGKANEALVRFVATLLDVPR 68 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S+ +LS SS K++ + Sbjct: 69 MSVEILSGVSSRSKVVKVLG 88 >gi|258405784|ref|YP_003198526.1| hypothetical protein Dret_1664 [Desulfohalobium retbaense DSM 5692] gi|257798011|gb|ACV68948.1| protein of unknown function DUF167 [Desulfohalobium retbaense DSM 5692] Length = 105 Score = 62.7 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P AKK+ + + +KIK+ A P KAN+A+ +A +L L + Sbjct: 23 RLRLWVQPKAKKTAVVGVYQDC-------LKIKLQAPPVDNKANQAVCRFVAARLGLRPA 75 Query: 63 SLRMLSKQSSPLKIIYI 79 + + S Q+S K I I Sbjct: 76 DVDLGSGQASRKKTIVI 92 >gi|238899239|ref|YP_002924922.1| hypothetical protein HDEF_2231 [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229467000|gb|ACQ68774.1| conserved hypothetical protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 103 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 10/96 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P ++ I + + +KI +TA P KANK ++ L+KK ++KS Sbjct: 15 LNVYIQPQSRFDNIVGIHHNE-------IKINLTALPVDNKANKHLIHFLSKKCKVAKSH 67 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQNNDSL 96 + + Q S K + I EI ++L + + L Sbjct: 68 IMIEKGQLSRHKQVRIVEPKVIPVEIKQILADRNKL 103 >gi|57235102|ref|YP_182004.1| hypothetical protein DET1292 [Dehalococcoides ethenogenes 195] gi|123618390|sp|Q3Z6Z5|Y1292_DEHE1 RecName: Full=UPF0235 protein DET1292 gi|57225550|gb|AAW40607.1| conserved hypothetical protein [Dehalococcoides ethenogenes 195] Length = 97 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 42/77 (54%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V ++++P+A+++ + E +KIK+ A P+KGKANKA++ L++ L KS Sbjct: 9 RVNLKILPSAQRNELTGYEN-------GLLKIKIAAQPEKGKANKALVDYLSELLDTPKS 61 Query: 63 SLRMLSKQSSPLKIIYI 79 + + S K++ Sbjct: 62 EIEICRGLSGRNKVVAF 78 >gi|15841462|ref|NP_336499.1| hypothetical protein MT2038 [Mycobacterium tuberculosis CDC1551] gi|148661795|ref|YP_001283318.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra] gi|167970491|ref|ZP_02552768.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra] gi|215404212|ref|ZP_03416393.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987] gi|215411676|ref|ZP_03420472.1| PE-PGRS family protein [Mycobacterium tuberculosis 94_M4241A] gi|215427341|ref|ZP_03425260.1| PE-PGRS family protein [Mycobacterium tuberculosis T92] gi|215430902|ref|ZP_03428821.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054] gi|215446194|ref|ZP_03432946.1| PE-PGRS family protein [Mycobacterium tuberculosis T85] gi|218753699|ref|ZP_03532495.1| PE-PGRS family protein [Mycobacterium tuberculosis GM 1503] gi|218753702|ref|ZP_03532498.1| PE-PGRS family protein [Mycobacterium tuberculosis GM 1503] gi|219557946|ref|ZP_03537022.1| PE-PGRS family protein [Mycobacterium tuberculosis T17] gi|253798966|ref|YP_003031967.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 1435] gi|254232156|ref|ZP_04925483.1| hypothetical protein TBCG_01934 [Mycobacterium tuberculosis C] gi|254364802|ref|ZP_04980848.1| hypothetical protein TBHG_01941 [Mycobacterium tuberculosis str. Haarlem] gi|254551007|ref|ZP_05141454.1| PE-PGRS family protein [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186958|ref|ZP_05764432.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A] gi|260201085|ref|ZP_05768576.1| PE-PGRS family protein [Mycobacterium tuberculosis T46] gi|260205263|ref|ZP_05772754.1| PE-PGRS family protein [Mycobacterium tuberculosis K85] gi|289443475|ref|ZP_06433219.1| PE-PGRS family protein [Mycobacterium tuberculosis T46] gi|289447602|ref|ZP_06437346.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A] gi|289570082|ref|ZP_06450309.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289574658|ref|ZP_06454885.1| PE-PGRS family protein [Mycobacterium tuberculosis K85] gi|289750564|ref|ZP_06509942.1| PE-PGRS family protein [Mycobacterium tuberculosis T92] gi|289754087|ref|ZP_06513465.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054] gi|289758097|ref|ZP_06517475.1| PE-PGRS family protein [Mycobacterium tuberculosis T85] gi|294996916|ref|ZP_06802607.1| PE-PGRS family protein [Mycobacterium tuberculosis 210] gi|297634555|ref|ZP_06952335.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 4207] gi|297731543|ref|ZP_06960661.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN R506] gi|306776218|ref|ZP_07414555.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu001] gi|306779999|ref|ZP_07418336.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu002] gi|306784749|ref|ZP_07423071.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu003] gi|306789106|ref|ZP_07427428.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu004] gi|306793440|ref|ZP_07431742.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu005] gi|306797824|ref|ZP_07436126.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu006] gi|306803704|ref|ZP_07440372.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu008] gi|306808278|ref|ZP_07444946.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu007] gi|306968102|ref|ZP_07480763.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu009] gi|306972327|ref|ZP_07484988.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu010] gi|307080037|ref|ZP_07489207.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu011] gi|313658876|ref|ZP_07815756.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN V2475] gi|166227505|sp|A5U406|Y1997_MYCTA RecName: Full=UPF0235 protein MRA_1997 gi|13881702|gb|AAK46313.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] gi|124601215|gb|EAY60225.1| hypothetical protein TBCG_01934 [Mycobacterium tuberculosis C] gi|134150316|gb|EBA42361.1| hypothetical protein TBHG_01941 [Mycobacterium tuberculosis str. Haarlem] gi|148505947|gb|ABQ73756.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra] gi|253320469|gb|ACT25072.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 1435] gi|289416394|gb|EFD13634.1| PE-PGRS family protein [Mycobacterium tuberculosis T46] gi|289420560|gb|EFD17761.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A] gi|289539089|gb|EFD43667.1| PE-PGRS family protein [Mycobacterium tuberculosis K85] gi|289543836|gb|EFD47484.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289691151|gb|EFD58580.1| PE-PGRS family protein [Mycobacterium tuberculosis T92] gi|289694674|gb|EFD62103.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054] gi|289713661|gb|EFD77673.1| PE-PGRS family protein [Mycobacterium tuberculosis T85] gi|308215330|gb|EFO74729.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu001] gi|308327103|gb|EFP15954.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu002] gi|308330482|gb|EFP19333.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu003] gi|308334316|gb|EFP23167.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu004] gi|308338117|gb|EFP26968.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu005] gi|308341810|gb|EFP30661.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu006] gi|308345297|gb|EFP34148.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu007] gi|308349599|gb|EFP38450.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu008] gi|308354227|gb|EFP43078.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu009] gi|308358205|gb|EFP47056.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu010] gi|308362136|gb|EFP50987.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu011] gi|323719473|gb|EGB28600.1| hypothetical protein TMMG_01247 [Mycobacterium tuberculosis CDC1551A] gi|326903595|gb|EGE50528.1| hypothetical protein TBPG_01471 [Mycobacterium tuberculosis W-148] gi|328458721|gb|AEB04144.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 4207] Length = 76 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 6/79 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +V+VR+ P + K + + + I V GKAN A+ +LA L L KS Sbjct: 4 SVVVRVKPGSHKGPLV------EVGPNGELIIYVREPAIDGKANDAVTRLLAAHLQLPKS 57 Query: 63 SLRMLSKQSSPLKIIYIDK 81 ++++S +S K + + Sbjct: 58 RVKLVSGATSRFKRFRLSR 76 >gi|323697733|ref|ZP_08109645.1| protein of unknown function DUF167 [Desulfovibrio sp. ND132] gi|323457665|gb|EGB13530.1| protein of unknown function DUF167 [Desulfovibrio desulfuricans ND132] Length = 98 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 8/93 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P A+K +A +K+++ A KANK ++A +A+ L L KS Sbjct: 12 LDVWAQPGARKDEVAG-------EYQGCLKVRLRAPAVDNKANKGLVAYIARLLQLKKSQ 64 Query: 64 LRMLSKQSSPLKIIYIDKDCK-EITELLQNNDS 95 + ++S +S K + ++ + + LL N Sbjct: 65 VEIVSGHASRRKHLALNTAGEPDWGSLLSGNAP 97 >gi|170029973|ref|XP_001842865.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167865325|gb|EDS28708.1| conserved hypothetical protein [Culex quinquefasciatus] Length = 157 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 7/88 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K +GI +E +++ A P G+AN ++ LAK L L KS Sbjct: 67 IKILAKPGSKFNGITGIEDEGVG-------VQIAAPPIDGEANTELVKYLAKLLDLRKSD 119 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S K I ++K C+ ++L+ Sbjct: 120 VSLDRGSKSRQKTIVLEKGCRTPDQVLE 147 >gi|157123809|ref|XP_001653923.1| hypothetical protein AaeL_AAEL009675 [Aedes aegypti] gi|108874207|gb|EAT38432.1| conserved hypothetical protein [Aedes aegypti] Length = 149 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P AK +GI + + +++ A P G+AN ++ L+K L L KS Sbjct: 60 IKIQAKPGAKTNGITDI-------GEEGVGVQIAAPPVDGEANTELVKYLSKLLELRKSD 112 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S K I ++K C+ ++L Sbjct: 113 VSLDRGSKSRQKTIVLEKGCRMPEQVL 139 >gi|296165029|ref|ZP_06847584.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295899677|gb|EFG79128.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 77 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 36/78 (46%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P ++K + + + I V GKAN A+ +LA L L +S Sbjct: 6 VAVRVKPGSRKGPLV------EEGPNGELTIYVRERAVDGKANDAVTGLLAAHLELPRSR 59 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++S ++S LK + Sbjct: 60 VELISGRTSRLKRFRVSG 77 >gi|254466397|ref|ZP_05079808.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687305|gb|EDZ47787.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 105 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 8/74 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR P A ++ I + E +KI VTA P+ GKAN+A+ +LA + + Sbjct: 32 AEIAVRATPKAARNAIVAAE--------GVLKISVTAVPENGKANEAIRRLLAAAMGTAA 83 Query: 62 SSLRMLSKQSSPLK 75 S L + +S K Sbjct: 84 SRLELRRGAASRDK 97 >gi|66813694|ref|XP_641026.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4] gi|74855711|sp|Q54UW1|U235_DICDI RecName: Full=UPF0235 protein gi|60469052|gb|EAL67049.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4] Length = 124 Score = 61.9 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 7/88 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + V + PN+K+S I S E + ++++ P GKAN ++ L+K+L + Sbjct: 31 IIKINVNVHPNSKESSIVSFEDQ-------ILSLRISEPPIDGKANIGVIEFLSKELNIR 83 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITE 88 KS++ + S K + ID + IT+ Sbjct: 84 KSNIEVGKGSKSRNKSVEIDISSENITK 111 >gi|262197893|ref|YP_003269102.1| hypothetical protein Hoch_4719 [Haliangium ochraceum DSM 14365] gi|262081240|gb|ACY17209.1| protein of unknown function DUF167 [Haliangium ochraceum DSM 14365] Length = 101 Score = 61.9 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 20/53 (37%), Positives = 30/53 (56%) Query: 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +K+ VTA P G AN A+LA+LAK L + L ++ QSS K +++ Sbjct: 29 GERLKVAVTAPPADGAANDAVLALLAKHLGRPRRDLCLVGGQSSRRKTVHVAG 81 >gi|325982093|ref|YP_004294495.1| hypothetical protein NAL212_1446 [Nitrosomonas sp. AL212] gi|325531612|gb|ADZ26333.1| protein of unknown function DUF167 [Nitrosomonas sp. AL212] Length = 113 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 28/64 (43%) Query: 22 IPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +KI++ A P +GKAN ++ LA + + +R+ S K+I I + Sbjct: 28 TKAAGLLGGALKIRLAAAPVEGKANSTLIKFLAAQFDVPIGQVRLKQGGKSRHKVIVIHR 87 Query: 82 DCKE 85 + Sbjct: 88 SVHD 91 >gi|78357213|ref|YP_388662.1| hypothetical protein Dde_2170 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219618|gb|ABB38967.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 118 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 43/80 (53%), Gaps = 7/80 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V P AK SGIA L D ++I+++A KANK ++ +A+ + ++ Sbjct: 16 RLKVWAQPGAKHSGIAGL---YDGR----VRIRLSAPAVDNKANKELIRFVAQLCGVKQN 68 Query: 63 SLRMLSKQSSPLKIIYIDKD 82 +R+ S SS K++ I++D Sbjct: 69 RVRLESGVSSRKKVLLIERD 88 >gi|218437671|ref|YP_002376000.1| hypothetical protein PCC7424_0673 [Cyanothece sp. PCC 7424] gi|226708012|sp|B7KEV7|Y673_CYAP7 RecName: Full=UPF0235 protein PCC7424_0673 gi|218170399|gb|ACK69132.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7424] Length = 73 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 44/77 (57%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ PNAK I E +D S + I + + P +GKAN+ ++ +LA+K ++KS Sbjct: 2 KIQVKVKPNAKHQKI---EEAEDGS----LIISLKSPPVEGKANQELIKLLAQKYRVTKS 54 Query: 63 SLRMLSKQSSPLKIIYI 79 + + S SS K+I I Sbjct: 55 QISIQSGLSSRNKLIEI 71 >gi|260892905|ref|YP_003239002.1| protein of unknown function DUF167 [Ammonifex degensii KC4] gi|260865046|gb|ACX52152.1| protein of unknown function DUF167 [Ammonifex degensii KC4] Length = 103 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 41/89 (46%), Gaps = 8/89 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + + +++++TA P +GKAN+ +L L++ L + Sbjct: 20 IHLRVTPRSS--------TLALEAGEGFLRVRLTAPPVEGKANELLLEFLSRVLDIPARR 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 L+++ K++ +D + E ++ Sbjct: 72 LQLVKGLKGREKVVLVDMPLPLVAEKIEK 100 >gi|119509193|ref|ZP_01628343.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414] gi|119466035|gb|EAW46922.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414] Length = 52 Score = 61.9 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 20/51 (39%), Positives = 33/51 (64%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD 82 M +++ + P GKAN+ ++ +LA+K + KS +R+ S SS K+I ID D Sbjct: 1 MIVRLKSPPVDGKANEELIKLLAEKFHVPKSHIRIKSGVSSRQKLIEIDTD 51 >gi|317050344|ref|YP_004111460.1| hypothetical protein Selin_0146 [Desulfurispirillum indicum S5] gi|316945428|gb|ADU64904.1| protein of unknown function DUF167 [Desulfurispirillum indicum S5] Length = 92 Score = 61.5 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 44/89 (49%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +++ P + ++ + D +K+++ A P G AN+ ++ ++ L++ KS Sbjct: 10 TLQLKVQPRSSRTELL---READGQ----LKLRLNAPPVDGAANQQVIEFFSRLLSIPKS 62 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ Q S K+I + + + ++L Sbjct: 63 RISIVQGQQSRRKVIALLGVEPAILEKVL 91 >gi|153003993|ref|YP_001378318.1| hypothetical protein Anae109_1126 [Anaeromyxobacter sp. Fw109-5] gi|166977708|sp|A7H9D8|Y1126_ANADF RecName: Full=UPF0235 protein Anae109_1126 gi|152027566|gb|ABS25334.1| protein of unknown function DUF167 [Anaeromyxobacter sp. Fw109-5] Length = 95 Score = 61.5 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 35/77 (45%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A ++ + +KI++ A P G AN A++ LA+ L + K Sbjct: 13 LELLVQPRASRTRVLG-------EHGGRLKIQLAAPPVDGAANAALVEFLAEALEVRKQD 65 Query: 64 LRMLSKQSSPLKIIYID 80 + ++ ++ K + + Sbjct: 66 VVLVRGETGRRKAVRVT 82 >gi|256828220|ref|YP_003156948.1| protein of unknown function DUF167 [Desulfomicrobium baculatum DSM 4028] gi|256577396|gb|ACU88532.1| protein of unknown function DUF167 [Desulfomicrobium baculatum DSM 4028] Length = 106 Score = 61.5 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P A+K+ +A L +KI+V A KAN A+ ++K L + S Sbjct: 18 RLGIWVQPGARKTEVAGLH-------GDFLKIRVQARAVDNKANSALTVFVSKILGIKAS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 + + S +S K + +D + + Sbjct: 71 QVVIESGHASRQKNLLLDVEEE 92 >gi|307102424|gb|EFN50700.1| hypothetical protein CHLNCDRAFT_142618 [Chlorella variabilis] Length = 127 Score = 61.1 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 40/92 (43%), Gaps = 9/92 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + V P +K + T +++ V A P G+AN A++ +A+ L L + Sbjct: 43 CRIGVHAKPGSKVCSV--------TLGPDALEVAVDAKPVDGEANAALIEFVAEVLGLKR 94 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + S +S K++ + D + L+ Sbjct: 95 RDVTLASGTTSRHKVLAVAGIDAHAALQRLRQ 126 >gi|57640703|ref|YP_183181.1| hypothetical protein TK0768 [Thermococcus kodakarensis KOD1] gi|73921163|sp|Q5JHB2|Y768_PYRKO RecName: Full=UPF0235 protein TK0768 gi|57159027|dbj|BAD84957.1| hypothetical protein, conserved, YggU family [Thermococcus kodakarensis KOD1] Length = 94 Score = 61.1 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 46/88 (52%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ + P AKK+ I ++ + +K+++ A P +GKANK ++ +K L + Sbjct: 13 IMIYVQPKAKKNAIEGVDGWR-----GRLKVRIAAPPVEGKANKEVVKFFSKLLG---AE 64 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ ++S K + + +E+ + L Sbjct: 65 VNIVRGETSREKDLLVKGLSVEEVRKKL 92 >gi|332178747|gb|AEE14436.1| UPF0235 protein yggU [Thermodesulfobium narugense DSM 14796] Length = 86 Score = 61.1 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 8/80 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +++ PNAKK I KD + KV+A P+ GKAN+ ++ ++++ + Sbjct: 4 KIELKVTPNAKKESIE----IKDGK----IYCKVSAPPEDGKANRRVIELISEFFDCKRK 55 Query: 63 SLRMLSKQSSPLKIIYIDKD 82 + + S + S KI+ I + Sbjct: 56 DVEIFSGEKSKNKILLIKSE 75 >gi|91202864|emb|CAJ72503.1| similar to hypothetical protein YggU [Candidatus Kuenenia stuttgartiensis] Length = 97 Score = 61.1 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V++ + K I + +K+ V+A P+KGKANKA++ +LA+ ++ SS Sbjct: 13 VFVKVQAGSGKDRIVG-------NLGGRLKLAVSAAPEKGKANKAVVELLAETFHINSSS 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + ++S ++S K I I+ E L N + Sbjct: 66 IHIISGKTSRDKKIMIEGVTPESINTLLNFN 96 >gi|38505546|ref|NP_942167.1| hypothetical protein ssr5011 [Synechocystis sp. PCC 6803] gi|38423570|dbj|BAD01781.1| ssr5011 [Synechocystis sp. PCC 6803] Length = 73 Score = 61.1 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 25/74 (33%), Positives = 42/74 (56%), Gaps = 7/74 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PNAK+S + D S + I V + P GKAN+ ++ +LAK+ +S+ S++ Sbjct: 5 VKVKPNAKQSKVV---YGDDGS----LIIHVKSPPVDGKANQELIKLLAKEFNVSQQSIK 57 Query: 66 MLSKQSSPLKIIYI 79 + S S KI+ I Sbjct: 58 IKSGAGSRQKIVEI 71 >gi|255022260|ref|ZP_05294254.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC 51756] gi|254968316|gb|EET25884.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC 51756] Length = 115 Score = 61.1 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P A+ + +KI++ A G AN A+L+ LA++L L Sbjct: 13 GTLTVQVQPGARDDCVVGYH-------GDALKIRLRARAVDGAANAALLSFLARRLDLGP 65 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + S K++ + Sbjct: 66 GQVVLRHGTHSRRKVLVLS 84 >gi|289746071|ref|ZP_06505449.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987] gi|308403079|ref|ZP_07493725.2| PE-PGRS family protein [Mycobacterium tuberculosis SUMu012] gi|289686599|gb|EFD54087.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987] gi|308365791|gb|EFP54642.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu012] Length = 72 Score = 61.1 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 36/78 (46%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 ++VR+ P + K + + + I V GKAN A+ +LA L L KS Sbjct: 1 MVVRVKPGSHKGPLV------EVGPNGELIIYVREPAIDGKANDAVTRLLAAHLQLPKSR 54 Query: 64 LRMLSKQSSPLKIIYIDK 81 ++++S +S K + + Sbjct: 55 VKLVSGATSRFKRFRLSR 72 >gi|188996205|ref|YP_001930456.1| protein of unknown function DUF167 [Sulfurihydrogenibium sp. YO3AOP1] gi|259646874|sp|B2V7H9|Y257_SULSY RecName: Full=UPF0235 protein SYO3AOP1_0257 gi|188931272|gb|ACD65902.1| protein of unknown function DUF167 [Sulfurihydrogenibium sp. YO3AOP1] Length = 73 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 40/77 (51%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ P K+ + ++ +++ T P+KGKAN+ ++ +L+ + KS Sbjct: 2 RIKVKVKPGTSKNEVKKID-------ENLYEVRTTTIPEKGKANEKVVELLSDFFDVPKS 54 Query: 63 SLRMLSKQSSPLKIIYI 79 ++++ Q+S K + + Sbjct: 55 KIKIVKGQTSREKEVEV 71 >gi|260429009|ref|ZP_05782986.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260419632|gb|EEX12885.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 85 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I D +++ VT P+ GKAN A+ +LAK L + K Sbjct: 14 ATLALRVTPRASRNEIR--------EDGDQLRVLVTTVPEDGKANAAVAKLLAKALGVPK 65 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ +S K+ I+ Sbjct: 66 SRLTLIQGATSRDKVFRIE 84 >gi|189183832|ref|YP_001937617.1| hypothetical protein OTT_0925 [Orientia tsutsugamushi str. Ikeda] gi|189180603|dbj|BAG40383.1| hypothetical protein OTT_0925 [Orientia tsutsugamushi str. Ikeda] Length = 112 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 47/89 (52%), Gaps = 3/89 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ AK + I L + S + I + P+ KAN+A++ L++ L + +S+ Sbjct: 20 INLKVKAGAKINKIIGLYYINNKS---FLYISINTIPENNKANQAIIKFLSQWLEVCRSN 76 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++++ S LK+I + I+ L+++ Sbjct: 77 IKIVYGLHSNLKVISVINTNANISNLIRS 105 >gi|224373471|ref|YP_002607843.1| hypothetical protein NAMH_1451 [Nautilia profundicola AmH] gi|223589921|gb|ACM93657.1| conserved hypothetical protein [Nautilia profundicola AmH] Length = 94 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 43/91 (47%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PN+ K+ IA +K+ + A +G ANK ++ + K+ + KS Sbjct: 11 IKIKAQPNSSKNKIAG-------KYGESLKVNIKAPAVEGAANKELIKFIGKEFKIPKSE 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + ++S K++ + +I + L+ + Sbjct: 64 IAI-KGETSKQKVLIVP-FRDDIIKKLEEIN 92 >gi|118469743|ref|YP_888136.1| hypothetical protein MSMEG_3845 [Mycobacterium smegmatis str. MC2 155] gi|226706061|sp|A0QYZ8|Y3845_MYCS2 RecName: Full=UPF0235 protein MSMEG_3845 gi|118171030|gb|ABK71926.1| conserved domain protein [Mycobacterium smegmatis str. MC2 155] Length = 75 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 36/78 (46%), Gaps = 6/78 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V+VR+ P ++K + +E D + I V GKAN A+ +LA L + S Sbjct: 4 TVVVRVKPGSRKGPL--VETADDG----TLTIYVQERAVDGKANAAVTKLLAAHLGVPPS 57 Query: 63 SLRMLSKQSSPLKIIYID 80 + + S ++ LK I Sbjct: 58 RVELASGATARLKRFRIS 75 >gi|296109159|ref|YP_003616108.1| protein of unknown function DUF167 [Methanocaldococcus infernus ME] gi|295433973|gb|ADG13144.1| protein of unknown function DUF167 [Methanocaldococcus infernus ME] Length = 92 Score = 60.8 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 44/83 (53%), Gaps = 9/83 (10%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + PNAKK+ I + ++ +++KV A P +GKANK ++ +K + ++ Sbjct: 15 VTPNAKKTEIVGRDEWRN-----RLEVKVKAPPVEGKANKEIIKFFSKLFG----DVEIV 65 Query: 68 SKQSSPLKIIYIDKDCKEITELL 90 + + S K I I K KE+ E+L Sbjct: 66 AGEKSSKKTILIRKPLKEVEEIL 88 >gi|195160833|ref|XP_002021278.1| GL25246 [Drosophila persimilis] gi|198465041|ref|XP_001353470.2| GA13391 [Drosophila pseudoobscura pseudoobscura] gi|194118391|gb|EDW40434.1| GL25246 [Drosophila persimilis] gi|198149990|gb|EAL30981.2| GA13391 [Drosophila pseudoobscura pseudoobscura] Length = 125 Score = 60.4 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 45/93 (48%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK +GI ++++ +++ A P +G+AN ++ L+K L L KS Sbjct: 37 IKILAKPGAKHNGITNIDLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 89 Query: 64 LRMLSKQSSPLKIIYIDKDC---KEITELLQNN 93 + + S KII I K + I +LL+ Sbjct: 90 VSLDKGSRSRNKIILISKGASTVESIQQLLRKE 122 >gi|54295557|ref|YP_127972.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens] gi|53755389|emb|CAH16885.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens] Length = 95 Score = 60.4 bits (145), Expect = 7e-08, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 40/79 (50%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNAKKS + ++ + I + A PQ+G+AN +L +++ + K Sbjct: 10 VEIAIYAKPNAKKSKLMAISDDS-------LHIALHAKPQEGEANNELLFFISQFFKIPK 62 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + ++ +SS K+I + Sbjct: 63 TQIELIKGKSSRHKLIRLP 81 >gi|262201697|ref|YP_003272905.1| hypothetical protein Gbro_1753 [Gordonia bronchialis DSM 43247] gi|262085044|gb|ACY21012.1| protein of unknown function DUF167 [Gordonia bronchialis DSM 43247] Length = 75 Score = 60.4 bits (145), Expect = 8e-08, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 41/76 (53%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+V + PN++K + +E D + TI+ V +G+ANKA+ +LAK L + KS Sbjct: 5 VVVTVKPNSRKGPL--VETGPDGTVTIY----VREPATEGRANKAVAELLAKHLGVPKSK 58 Query: 64 LRMLSKQSSPLKIIYI 79 + ++ ++ K + Sbjct: 59 VALVGGATARTKRFRV 74 >gi|261855187|ref|YP_003262470.1| hypothetical protein Hneap_0569 [Halothiobacillus neapolitanus c2] gi|261835656|gb|ACX95423.1| protein of unknown function DUF167 [Halothiobacillus neapolitanus c2] Length = 122 Score = 60.4 bits (145), Expect = 8e-08, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 39/76 (51%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P A ++ + +K+ +T P GKAN+A++ +AK +++K+ Sbjct: 19 LKLKVQPRASRTAWGEVI-------GGRIKLYLTTPPIDGKANQAVIDFIAKTFSVAKNR 71 Query: 64 LRMLSKQSSPLKIIYI 79 + + ++S K I + Sbjct: 72 VEIRRGETSRSKDIAL 87 >gi|220916321|ref|YP_002491625.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans 2CP-1] gi|254799985|sp|B8JFX1|Y1215_ANAD2 RecName: Full=UPF0235 protein A2cp1_1215 gi|219954175|gb|ACL64559.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans 2CP-1] Length = 95 Score = 60.4 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A ++ D +KI++ A P G AN A++ LA L + ++ Sbjct: 13 LEILVQPRASRTRAVG---EHDGR----LKIQLAAPPVDGAANAALVEFLAAALGVRRAD 65 Query: 64 LRMLSKQSSPLKIIYI 79 + +L ++ K + + Sbjct: 66 VELLRGETGRRKTVRV 81 >gi|237756139|ref|ZP_04584711.1| conserved domain protein [Sulfurihydrogenibium yellowstonense SS-5] gi|237691703|gb|EEP60739.1| conserved domain protein [Sulfurihydrogenibium yellowstonense SS-5] Length = 73 Score = 60.4 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 41/77 (53%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ P A ++ + ++ +++ T P+KGKAN+ ++ +L+ + KS Sbjct: 2 RIKVKVKPGASENEVKKID-------EYLYEVRTTTIPEKGKANEKVIELLSDFFDVPKS 54 Query: 63 SLRMLSKQSSPLKIIYI 79 ++++ Q+S K + + Sbjct: 55 KIKIIKGQASREKEVEV 71 >gi|194748771|ref|XP_001956818.1| GF24383 [Drosophila ananassae] gi|190624100|gb|EDV39624.1| GF24383 [Drosophila ananassae] Length = 126 Score = 60.4 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 40/88 (45%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AK++GI + +++ A P +G+AN ++ L+K L L KS + + Sbjct: 44 KPGAKQNGITGISTEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDK 96 Query: 69 KQSSPLKIIYIDKDC---KEITELLQNN 93 S KII I K + +LL+ Sbjct: 97 GSRSRNKIILITKGAITTEAAEQLLRKE 124 >gi|99080537|ref|YP_612691.1| hypothetical protein TM1040_0696 [Ruegeria sp. TM1040] gi|99036817|gb|ABF63429.1| protein of unknown function DUF167 [Ruegeria sp. TM1040] Length = 92 Score = 60.4 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + +K+ VT P+ GKAN ++A+L++ L +S Sbjct: 20 AQIAVRVTPKAAQNAVL--------RKRDEIKVLVTTVPEGGKANADVVALLSRALGVSP 71 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L +L +S K+ + Sbjct: 72 SRLTLLRGATSRDKVFLVT 90 >gi|78777110|ref|YP_393425.1| hypothetical protein Suden_0912 [Sulfurimonas denitrificans DSM 1251] gi|78497650|gb|ABB44190.1| Protein of unknown function DUF167 [Sulfurimonas denitrificans DSM 1251] Length = 101 Score = 60.4 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 8/83 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+AK+ I + + +K+ V A P+ GKA M+ LAK+ +S SS+ ++ Sbjct: 25 TPSAKRDVIGKVRANQ-------LKVSVRAQPEGGKATDYMVGFLAKEFGVSVSSIEVVY 77 Query: 69 KQSSPLKIIYIDKDCKEITELLQ 91 + S K + I K K++ ++ Sbjct: 78 GRESIHKQLRI-KAPKKLPNSIE 99 >gi|54298708|ref|YP_125077.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris] gi|53752493|emb|CAH13925.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris] Length = 95 Score = 60.0 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 41/79 (51%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNAKK+ + ++ + + I + A PQ+G+AN +L +++ + K Sbjct: 10 VEIAIYAKPNAKKTKLMAISDDR-------LHIALHAKPQEGEANNELLFFISQFFKIPK 62 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + ++ +SS K+I + Sbjct: 63 TQIELIKGKSSRHKLIRLP 81 >gi|52842921|ref|YP_096720.1| hypothetical protein lpg2716 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630032|gb|AAU28773.1| hypothetical protein lpg2716 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 95 Score = 60.0 bits (144), Expect = 9e-08, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 41/79 (51%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNAKKS + ++ + + I + A PQ+G+AN +L +++ + K Sbjct: 10 VEIAIYAKPNAKKSKLMAISDDR-------LHIALHAKPQEGEANNELLFFISQFFKIPK 62 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + ++ +SS K+I + Sbjct: 63 TQIELIKGKSSRHKLIRLP 81 >gi|24656569|ref|NP_647784.1| CG14966 [Drosophila melanogaster] gi|7292328|gb|AAF47735.1| CG14966 [Drosophila melanogaster] gi|289526411|gb|ADD01328.1| RE68649p [Drosophila melanogaster] Length = 140 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 25/88 (28%), Positives = 41/88 (46%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AK++GI + +++ A P +G+AN ++ L+K L L KS + + Sbjct: 57 KPGAKQNGITGIGFEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDK 109 Query: 69 KQSSPLKIIYIDKD---CKEITELLQNN 93 S KII I K + I +LL+ Sbjct: 110 GSRSRNKIIMITKGVSTVEAIEQLLRKE 137 >gi|54022885|ref|YP_117127.1| hypothetical protein nfa9180 [Nocardia farcinica IFM 10152] gi|54014393|dbj|BAD55763.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 72 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 6/73 (8%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + PN++K + +E D + + V A +GKANKA + +LA + S++R+ Sbjct: 5 IKPNSRKGPL--VETAADG----TLTLYVRAPAVEGKANKAAIDLLAAHYGVPTSAVRLT 58 Query: 68 SKQSSPLKIIYID 80 + +S K ID Sbjct: 59 AGATSRHKRFDID 71 >gi|159038970|ref|YP_001538223.1| hypothetical protein Sare_3430 [Salinispora arenicola CNS-205] gi|157917805|gb|ABV99232.1| protein of unknown function DUF167 [Salinispora arenicola CNS-205] Length = 107 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 3/90 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VR+ P + + + + + VTA P G+AN+A LA L + + Sbjct: 17 TVAVRVKPGSSRIRVGG---RYVGPHGPALIVAVTAPPVDGRANEAARRALADALGVRSA 73 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ + + + K +D +IT L Sbjct: 74 AVSLEAGATGRNKTFRVDGVAADITRTLHR 103 >gi|78186226|ref|YP_374269.1| hypothetical protein Plut_0338 [Chlorobium luteolum DSM 273] gi|123583481|sp|Q3B605|Y338_PELLD RecName: Full=UPF0235 protein Plut_0338 gi|78166128|gb|ABB23226.1| conserved hypothetical protein [Chlorobium luteolum DSM 273] Length = 101 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 7/91 (7%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P + KS ++ +K+ + A P AN+ + +K ++ S + Sbjct: 15 VRVQPRSSKSAVSG-------PYGNALKVTLKAAPVDDAANRECCRLFSKLFSIPDSRVS 67 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 ++S +S K + ++ E + N S+ Sbjct: 68 IVSGAASRTKSVMLEGLSAEEARSILRNSSI 98 >gi|315230060|ref|YP_004070496.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP] gi|315183088|gb|ADT83273.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP] Length = 92 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 45/90 (50%), Gaps = 9/90 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++V + PN K++ I ++ K +K+KV+A P GKANK + L+K L Sbjct: 9 VILLVHVQPNTKRNSIEGVDKWK-----GRIKVKVSAPPVGGKANKELTKFLSKLLG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELL 90 + +L ++S K + I +E+ E L Sbjct: 61 KEVVILRGETSREKDLLIKGATIEEVKEKL 90 >gi|126178968|ref|YP_001046933.1| hypothetical protein Memar_1018 [Methanoculleus marisnigri JR1] gi|125861762|gb|ABN56951.1| protein of unknown function DUF167 [Methanoculleus marisnigri JR1] Length = 105 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 35/80 (43%), Gaps = 4/80 (5%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + AK+S +K ++ A GKAN+A+ +LA+ + + Sbjct: 15 VTITLDVTAGAKRSSF----PAGYNEWRKSIKCQIAAPAVGGKANRAITDLLAETFGVPR 70 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 + + +++ +S K + I Sbjct: 71 ADVSIITGHTSSSKTVAIAG 90 >gi|328866439|gb|EGG14823.1| hypothetical protein DFA_10696 [Dictyostelium fasciculatum] Length = 152 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 6/88 (6%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + V + PNAK+S I +D + ++++ P GKAN ++ L+ +L L Sbjct: 56 IVRLNVNVHPNAKQSTIVQF------TDDGCLDLRISQPPIDGKANDEVIDFLSDELKLK 109 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITE 88 K + + S K+I ID +T+ Sbjct: 110 KRFITVDKGLKSRNKVIAIDLSESSLTK 137 >gi|329902481|ref|ZP_08273126.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium IMCC9480] gi|327548773|gb|EGF33410.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium IMCC9480] Length = 105 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ NAKK+ + + +KIK+ A P +GKAN A++ +A +L + + Sbjct: 16 VRLAVQVAANAKKTEVIGVADD-------VLKIKLHAQPIEGKANDALVRFVAGQLHVPR 68 Query: 62 SSLRMLSKQSSPLKIIYI 79 +++ + +S K++ + Sbjct: 69 TTVSVTHGLTSKRKLLLV 86 >gi|189485621|ref|YP_001956562.1| hypothetical protein TGRD_618 [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|254806541|sp|B1GYK8|Y618_UNCTG RecName: Full=UPF0235 protein TGRD_618 gi|170287580|dbj|BAG14101.1| conserved hypothetical protein [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 87 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 40/82 (48%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+IPN+K++ + S +++K+TA +G+AN+ + L+ + +S Sbjct: 3 IKVRVIPNSKRNEVVS-------RVGSILRVKITAPAIEGRANEELCDFLSDFFDVKRSM 55 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + + K I + +E Sbjct: 56 IFLRKGERGREKTIEVLGRLEE 77 >gi|87198097|ref|YP_495354.1| hypothetical protein Saro_0071 [Novosphingobium aromaticivorans DSM 12444] gi|87133778|gb|ABD24520.1| protein of unknown function DUF167 [Novosphingobium aromaticivorans DSM 12444] Length = 95 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 8/82 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P AK GI D + +KV A P+ GKA A+ +LA+ L L+ Sbjct: 22 GRLAVRVTPGAKSEGIE--------IDGGRVLVKVRAKPEDGKATAAVQELLARALGLAP 73 Query: 62 SSLRMLSKQSSPLKIIYIDKDC 83 S + ML +S K+ I ++ Sbjct: 74 SKVEMLRGATSREKLFRIPREA 95 >gi|254409972|ref|ZP_05023752.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] gi|196183008|gb|EDX77992.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] Length = 74 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 44/77 (57%), Gaps = 7/77 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ PN+K I E D +K+ + + P GKANK ++ +LA+K ++KS ++ Sbjct: 5 VKVKPNSKTQSI---EEMADG----TLKVNLKSPPVDGKANKELIELLAEKFNVTKSQVQ 57 Query: 66 MLSKQSSPLKIIYIDKD 82 + S SS +K+I I D Sbjct: 58 IKSGLSSKIKLIEIVAD 74 >gi|269127125|ref|YP_003300495.1| hypothetical protein Tcur_2915 [Thermomonospora curvata DSM 43183] gi|268312083|gb|ACY98457.1| protein of unknown function DUF167 [Thermomonospora curvata DSM 43183] Length = 91 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 45/89 (50%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A ++ + + + +KV A +GKA +A L +A L + + Sbjct: 4 VRVAIRVGPGASRTKVGG-------AHGEALVVKVAARAVEGKATEAALRAVADALGVRR 56 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 +R++S +S K+I +D D + +TE + Sbjct: 57 RDVRLISGATSREKLIEVDGDERALTEKI 85 >gi|259089201|ref|NP_001158638.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss] gi|225705490|gb|ACO08591.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss] Length = 182 Score = 59.6 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 42/97 (43%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + V P +K++ I + I + + A P G+AN ++ L+K L L Sbjct: 92 VVTISVHAKPGSKQNAITDVSIEAVG-------VAIAAPPTGGEANAELVRYLSKVLELK 144 Query: 61 KSSLRMLSKQSSPLKIIYIDKD--CKEITELLQNNDS 95 +S + + S KII + +++ + L+ S Sbjct: 145 RSEVVLDKGSRSREKIIKVTGSLTPEQVLDRLKQEAS 181 >gi|197121557|ref|YP_002133508.1| hypothetical protein AnaeK_1146 [Anaeromyxobacter sp. K] gi|226696231|sp|B4UGV4|Y1146_ANASK RecName: Full=UPF0235 protein AnaeK_1146 gi|196171406|gb|ACG72379.1| protein of unknown function DUF167 [Anaeromyxobacter sp. K] Length = 95 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A ++ D +KI++ A P G AN A++ LA L + ++ Sbjct: 13 LELLVQPRASRTRAVG---EHDGR----LKIQLAAPPVDGAANAALVEFLAVALGVRRAD 65 Query: 64 LRMLSKQSSPLKIIYI 79 + +L ++ K + + Sbjct: 66 VALLRGETGRRKTVRV 81 >gi|145595719|ref|YP_001160016.1| hypothetical protein Strop_3204 [Salinispora tropica CNB-440] gi|145305056|gb|ABP55638.1| protein of unknown function DUF167 [Salinispora tropica CNB-440] Length = 106 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 3/90 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VR+ P + +S + + I VTA P G+A +A LA+ + ++ Sbjct: 16 TVAVRIKPGSSRSRVGG---RYMGPYGPALVIAVTAPPVDGRATEAARRALAEAFGVRRA 72 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ + + +S KI Y+ EIT L Sbjct: 73 AVSLGAGAASRNKIFYVGGSGVEITRTLHR 102 >gi|120403830|ref|YP_953659.1| hypothetical protein Mvan_2846 [Mycobacterium vanbaalenii PYR-1] gi|166200356|sp|A1T903|Y2846_MYCVP RecName: Full=UPF0235 protein Mvan_2846 gi|119956648|gb|ABM13653.1| protein of unknown function DUF167 [Mycobacterium vanbaalenii PYR-1] Length = 75 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 36/76 (47%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P +KK + +E D + + V GKAN A++ +LA+ + +S Sbjct: 5 VSVRVKPGSKKGPL--VETGPDGE----LTVYVRERAVDGKANAAVIRVLAEHFGVPRSL 58 Query: 64 LRMLSKQSSPLKIIYI 79 + + SS +K I Sbjct: 59 VELTGGASSRIKRFRI 74 >gi|158520306|ref|YP_001528176.1| hypothetical protein Dole_0289 [Desulfococcus oleovorans Hxd3] gi|226701656|sp|A8ZS81|Y289_DESOH RecName: Full=UPF0235 protein Dole_0289 gi|158509132|gb|ABW66099.1| protein of unknown function DUF167 [Desulfococcus oleovorans Hxd3] Length = 103 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 38/82 (46%), Gaps = 7/82 (8%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P + K+ + +KIK+TA P G+AN+ +A L++ L + K+S+ + Sbjct: 17 VAPRSSKNMVVGAHDNA-------LKIKITAPPVDGRANEMCVAFLSRLLGIPKTSITIA 69 Query: 68 SKQSSPLKIIYIDKDCKEITEL 89 + +S K + + + Sbjct: 70 AGAASKRKEVCLALSPNAAGKQ 91 >gi|219852331|ref|YP_002466763.1| protein of unknown function DUF167 [Methanosphaerula palustris E1-9c] gi|219546590|gb|ACL17040.1| protein of unknown function DUF167 [Methanosphaerula palustris E1-9c] Length = 106 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 16/47 (34%), Positives = 30/47 (63%) Query: 35 KVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +T P +GKAN+A++A+L++ L + +S + +LS +S K + I Sbjct: 44 SITTPPVEGKANRAIVALLSRTLTIPQSGISILSGATSSQKRVLIQG 90 >gi|328780498|ref|XP_392249.2| PREDICTED: UPF0235 protein C15orf40 homolog [Apis mellifera] Length = 144 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P AK + I + + ++A P +G+AN ++ LA L + K Sbjct: 56 VTIKIQAKPGAKHNNITDISEDAVG-------VAISAPPVEGEANTELVKYLASVLGMRK 108 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S + + S KI+ + +++ E L+ Sbjct: 109 SDVTLDRGSKSRQKIVVVSGISVEKVLEKLKGE 141 >gi|308270456|emb|CBX27068.1| UPF0235 protein PTH_1821 [uncultured Desulfobacterium sp.] Length = 106 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 42/82 (51%), Gaps = 7/82 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V ++P + K+ IA L +KIK+TA P G AN + LAK L++S S++ Sbjct: 14 KVYILPRSSKNMIAGL-------FGDALKIKLTAAPVDGSANNMCIKYLAKILSVSASNI 66 Query: 65 RMLSKQSSPLKIIYIDKDCKEI 86 ++S + K I + + K + Sbjct: 67 EIVSGHTGKTKYILLKNNEKTL 88 >gi|288931799|ref|YP_003435859.1| hypothetical protein Ferp_1433 [Ferroglobus placidus DSM 10642] gi|288894047|gb|ADC65584.1| protein of unknown function DUF167 [Ferroglobus placidus DSM 10642] Length = 94 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 40/80 (50%), Gaps = 8/80 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P++KK+ IA + K + +KV A P +GKAN+ + L + Sbjct: 12 VIITVHVTPSSKKNEIAGYDPWK-----KALSVKVKAPPVEGKANRELEKFLKEYFG--- 63 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 +++++S + S +K + I Sbjct: 64 KNVKLVSGEKSRVKKVLIVG 83 >gi|206889548|ref|YP_002248254.1| hypothetical protein THEYE_A0407 [Thermodesulfovibrio yellowstonii DSM 11347] gi|206741486|gb|ACI20543.1| conserved hypothetical protein [Thermodesulfovibrio yellowstonii DSM 11347] Length = 87 Score = 59.2 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + AK +GI +E +K+++ A P G ANK ++ ML++ L + KS Sbjct: 14 LKVLVKTGAKITGIGGIE-------GNTLKLRLAAQPHDGLANKELIEMLSEILNIPKSR 66 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + ++ ++S KII + + Sbjct: 67 IEIIKGKTSKHKIIKLKGE 85 >gi|194866006|ref|XP_001971712.1| GG14280 [Drosophila erecta] gi|190653495|gb|EDV50738.1| GG14280 [Drosophila erecta] Length = 127 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 26/88 (29%), Positives = 42/88 (47%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AK++GI + + +++ A P +G+AN ++ L+K L L KS + + Sbjct: 45 KPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDK 97 Query: 69 KQSSPLKIIYIDKD---CKEITELLQNN 93 S KII I K + I ELL+ Sbjct: 98 GSRSRNKIIMITKGVSTVEAIEELLRKE 125 >gi|126465433|ref|YP_001040542.1| hypothetical protein Smar_0527 [Staphylothermus marinus F1] gi|126014256|gb|ABN69634.1| protein of unknown function DUF167 [Staphylothermus marinus F1] Length = 110 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 42/90 (46%), Gaps = 9/90 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + PN+ + + + + T P+KG+AN A++ L+KK+ L Sbjct: 22 VIIPIYVKPNSDRDALV--------LEGDELVFYTTEIPEKGRANAALIRFLSKKVGLPH 73 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELL 90 S + ++ + K I + D D +++ E L Sbjct: 74 SKIDIIYGARTRSKKILVRDIDTEKLAEKL 103 >gi|212223789|ref|YP_002307025.1| hypothetical protein TON_0641 [Thermococcus onnurineus NA1] gi|226707988|sp|B6YUU2|Y641_THEON RecName: Full=UPF0235 protein TON_0641 gi|212008746|gb|ACJ16128.1| hypothetical protein, conserved [Thermococcus onnurineus NA1] Length = 94 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 45/88 (51%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ + P AKK+ I ++ + +K+K+ A P +GKANK ++ +K L + Sbjct: 13 ILLYVQPKAKKNEIEGVDEWR-----GRLKVKIKAPPVEGKANKEVVRFFSKMLG---TE 64 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ +S K + + KE+ + L Sbjct: 65 VEIIRGGTSREKDLLVKGFSSKEVLKKL 92 >gi|300175116|emb|CBK20427.2| unnamed protein product [Blastocystis hominis] Length = 152 Score = 58.8 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 13/90 (14%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PN KK+GI E ++I++++ P + KANK + +++ + KS + ++ Sbjct: 29 KPNCKKTGIEWEE--------EQLQIRLSSPPTENKANKEVCEVVSDIADIPKSQVSLIR 80 Query: 69 KQSSPLKIIYI-----DKDCKEITELLQNN 93 S K + + + + + + LQ Sbjct: 81 GGKSRDKELMLNGVSSEGLIQSLLQALQQY 110 >gi|196231015|ref|ZP_03129875.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428] gi|196224845|gb|EDY19355.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428] Length = 93 Score = 58.8 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R++PNA++S + + +K+KV A GKAN+A+ LA+ L + Sbjct: 5 AILRLRIVPNARRSEVVGVH-------GDAVKVKVQAPAMDGKANEALRDFLAEVLTVPA 57 Query: 62 SSLRMLSKQSSPLKIIYI 79 ++ +++ + S K++ I Sbjct: 58 RAVEIVAGEKSRDKVVAI 75 >gi|189347519|ref|YP_001944048.1| hypothetical protein Clim_2040 [Chlorobium limicola DSM 245] gi|189341666|gb|ACD91069.1| protein of unknown function DUF167 [Chlorobium limicola DSM 245] Length = 101 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 39/89 (43%), Gaps = 8/89 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V+ P + KS I +K+ + A P AN+ A+ AK S L Sbjct: 14 VKAQPRSSKSRITGAYDRG-------VKVTLKAAPVDDAANEECCALFAKVFGFPVSRLC 66 Query: 66 MLSKQSSPLKIIYIDK-DCKEITELLQNN 93 ++S +SS K + ++ +E++ LL+ Sbjct: 67 IVSGRSSRNKTLRVEGTSAEEVSRLLRPY 95 >gi|260432898|ref|ZP_05786869.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260416726|gb|EEX09985.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 91 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 8/77 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A + I + E + I VTA + GKAN A+ +LAK + ++ S+ Sbjct: 22 IQVRVTPKAARDRIQADESS--------VHIAVTAPAEGGKANLAVARILAKAMGIAPSA 73 Query: 64 LRMLSKQSSPLKIIYID 80 L + Q++ K+ + Sbjct: 74 LILKQGQTARNKLFVYE 90 >gi|195491333|ref|XP_002093518.1| GE20708 [Drosophila yakuba] gi|194179619|gb|EDW93230.1| GE20708 [Drosophila yakuba] Length = 128 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 25/88 (28%), Positives = 42/88 (47%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AK++GI + + +++ A P +G+AN ++ L+K L L KS + + Sbjct: 45 KPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDK 97 Query: 69 KQSSPLKIIYIDKD---CKEITELLQNN 93 S KII I K + I +LL+ Sbjct: 98 GSRSRNKIIMITKGVSTVEAIEQLLRKE 125 >gi|320101820|ref|YP_004177411.1| hypothetical protein Isop_0265 [Isosphaera pallida ATCC 43644] gi|319749102|gb|ADV60862.1| protein of unknown function DUF167 [Isosphaera pallida ATCC 43644] Length = 131 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 43/90 (47%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P +++ G+ A P+KGKAN A+L +LA L ++KS Sbjct: 34 LAVMARPRSRRPGVVGTWCGAVVVAIA-------AAPEKGKANAAILEILADLLGIAKSR 86 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 L ++S ++ K++ I + ++ + + + Sbjct: 87 LDLVSGATARSKVVRIAGLEPDQVRQAISD 116 >gi|318041795|ref|ZP_07973751.1| hypothetical protein SCB01_08799 [Synechococcus sp. CB0101] Length = 97 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 39/86 (45%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +R+ P A + + L + I + A P G AN+A+L ++A++L +S ++ Sbjct: 12 VAIRVQPRASRERVLGLR-------GEAIAIALKAPPVDGAANEALLKLIARQLKVSAAA 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + ++ S K I + + Sbjct: 65 VELVRGASGRSKWIRVAGWSADQVRA 90 >gi|240170218|ref|ZP_04748877.1| PE-PGRS family protein [Mycobacterium kansasii ATCC 12478] Length = 76 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+V++ P ++G +T + + I V GKAN+A+ +LA L ++ Sbjct: 5 VVVKVKP-GSRNG-----PRVETVSGVELTIYVPEPAVGGKANEAVARLLAAHFHLPRTR 58 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++S S LK ID+ Sbjct: 59 VELVSGARSRLKRFRIDR 76 >gi|149195332|ref|ZP_01872419.1| hypothetical protein CMTB2_08545 [Caminibacter mediatlanticus TB-2] gi|149134524|gb|EDM23013.1| hypothetical protein CMTB2_08545 [Caminibacter mediatlanticus TB-2] Length = 94 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 9/90 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+ PN+ K+ IA L +KI + A +G ANK ++ L+K +SK+ Sbjct: 12 LNVKAQPNSSKNKIAGLY------GEDAIKINIKAPAVEGAANKELIKFLSKMFKVSKND 65 Query: 64 LRMLSKQSSPLKIIY--IDKDCKEITELLQ 91 + ++ ++S K I I++ KE E L+ Sbjct: 66 I-IIKGETSKKKQIIMPINEKVKEFIEGLK 94 >gi|195428527|ref|XP_002062324.1| GK17477 [Drosophila willistoni] gi|194158409|gb|EDW73310.1| GK17477 [Drosophila willistoni] Length = 127 Score = 58.4 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 9/92 (9%) Query: 2 CNVIVRL--IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 N+ +++ P AK++GI + + +++ A P +G+AN ++ L+K L L Sbjct: 35 GNIAIKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKYLSKVLGL 87 Query: 60 SKSSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 KS + + S KII + K + L+ Sbjct: 88 RKSDVSLDKGSRSRNKIILVSKGVSTVEACLE 119 >gi|254419593|ref|ZP_05033317.1| conserved hypothetical protein [Brevundimonas sp. BAL3] gi|196185770|gb|EDX80746.1| conserved hypothetical protein [Brevundimonas sp. BAL3] Length = 69 Score = 58.1 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 18/50 (36%), Positives = 32/50 (64%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +K++V A P G+AN A++ LAK L +S+SS+ + S LK++ ++ Sbjct: 2 LKVRVRARPVDGEANAALVKFLAKALGVSRSSVVLERGGQSRLKMVSVEG 51 >gi|195587403|ref|XP_002083454.1| GD13347 [Drosophila simulans] gi|194195463|gb|EDX09039.1| GD13347 [Drosophila simulans] Length = 128 Score = 58.1 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 42/88 (47%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AK++GI + + +++ A P +G+AN ++ L+K L L KS + + Sbjct: 45 KPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDK 97 Query: 69 KQSSPLKIIYIDKDC---KEITELLQNN 93 S KII I K + I ++L+ Sbjct: 98 GSRSRNKIIMITKGASTVEAIEQMLRKE 125 >gi|288818090|ref|YP_003432438.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6] gi|288787490|dbj|BAI69237.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6] Length = 73 Score = 58.1 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 39/77 (50%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR P AK+ + + ++ V PQ+GKAN+ + +L+ L + KS Sbjct: 3 LEVRAKPKAKREYVKKI-------TESVYEVAVKEPPQEGKANERIAVLLSYHLGIPKSR 55 Query: 64 LRMLSKQSSPLKIIYID 80 +++L +S +K+ +D Sbjct: 56 IKLLKGHTSKIKLFQVD 72 >gi|254475305|ref|ZP_05088691.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214029548|gb|EEB70383.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 96 Score = 58.1 bits (139), Expect = 4e-07, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 3/73 (4%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++VR+ P A + + + S + +KI T P+ GKA +A+ +LA + ++ Sbjct: 21 DILVRVTPKAARDSV---QRTSGDSGELVLKITTTTAPENGKATEAVRKLLATAMRVAPR 77 Query: 63 SLRMLSKQSSPLK 75 L +L +S K Sbjct: 78 DLVLLRGATSREK 90 >gi|86136531|ref|ZP_01055110.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193] gi|85827405|gb|EAQ47601.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193] Length = 91 Score = 58.1 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 8/76 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A + I+ E +K+ VT+ P+ GKA +A+ ++LAK + ++ Sbjct: 20 AKIAVRVTPKAASNSISVSEAG--------LKVTVTSVPENGKATEAVRSLLAKAMGVAA 71 Query: 62 SSLRMLSKQSSPLKII 77 S L + +S K+ Sbjct: 72 SKLDLSQGATSRNKVF 87 >gi|189500987|ref|YP_001960457.1| hypothetical protein Cphamn1_2066 [Chlorobium phaeobacteroides BS1] gi|226701149|sp|B3EMY7|Y2066_CHLPB RecName: Full=UPF0235 protein Cphamn1_2066 gi|189496428|gb|ACE04976.1| protein of unknown function DUF167 [Chlorobium phaeobacteroides BS1] Length = 101 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 37/76 (48%), Gaps = 7/76 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 ++ P + KS I D S +K+ + A P G+AN +LA+ L +++SS+ Sbjct: 15 IKAQPRSSKSMITG---EYDGS----IKVNLKAPPVDGEANLECCRLLARTLGVARSSVE 67 Query: 66 MLSKQSSPLKIIYIDK 81 ++S +K + + Sbjct: 68 IVSGTRGKMKRVKVFG 83 >gi|319956454|ref|YP_004167717.1| hypothetical protein Nitsa_0701 [Nitratifractor salsuginis DSM 16511] gi|319418858|gb|ADV45968.1| protein of unknown function DUF167 [Nitratifractor salsuginis DSM 16511] Length = 98 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 46/92 (50%), Gaps = 7/92 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + ++ ++ P A +S A L +K+++ A +G ANK ++ L+K + Sbjct: 9 LASLKIKAQPGASRSEFAGLY------GDEAIKVRIAAAAVEGAANKELVKFLSKAFKVP 62 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 KSS+R S +++ +K++ ++ E L+ Sbjct: 63 KSSIRFKSGETAKIKVVEFP-YSEKFKEFLEK 93 >gi|162456439|ref|YP_001618806.1| hypothetical protein sce8156 [Sorangium cellulosum 'So ce 56'] gi|161167021|emb|CAN98326.1| hypothetical protein sce8156 [Sorangium cellulosum 'So ce 56'] Length = 139 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 38/80 (47%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P + +S I + + + +TA P +G AN ++ +L++ L + K Sbjct: 52 VRISVQVRPKSSRSAIVGVREGA-------LDVSLTAPPVEGAANAELVKLLSRALDVRK 104 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S +++ S K++ + Sbjct: 105 SDVQIALGASGRSKVVAVRG 124 >gi|194337426|ref|YP_002019220.1| protein of unknown function DUF167 [Pelodictyon phaeoclathratiforme BU-1] gi|226701435|sp|B4SES7|Y2415_PELPB RecName: Full=UPF0235 protein Ppha_2415 gi|194309903|gb|ACF44603.1| protein of unknown function DUF167 [Pelodictyon phaeoclathratiforme BU-1] Length = 97 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 +V+ P + KS + L +K+ + A P AN+ + +K + S + Sbjct: 14 VVKAQPRSSKSRVCGLYNGG-------LKVNLKAAPVDDAANRECCELFSKLFRIPPSRV 66 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90 +LS QSS K + ++ + L+ Sbjct: 67 HILSGQSSRTKTVMVEGISSKAAALV 92 >gi|84996333|ref|XP_952888.1| proton translocating inorganic pyrophosphatase [Theileria annulata strain Ankara] gi|65303885|emb|CAI76264.1| proton translocating inorganic pyrophosphatase, putative [Theileria annulata] Length = 1204 Score = 57.7 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 46/109 (42%), Gaps = 26/109 (23%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK------- 56 + V + P ++++ I + +++ A P++G+ NKA++ ++K Sbjct: 1098 LKVNVKPGSRQTQIIG-------ESEGRLSVQIAAPPREGECNKALIEFISKTRNFYYFL 1150 Query: 57 -----------LALSKSSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + + K ++ +L S KI+ I D + + LL N+ Sbjct: 1151 AFNTFNVLIFLVGVKKGNVTLLHGHKSRDKILSITGIDIQTASNLLINS 1199 >gi|86157514|ref|YP_464299.1| hypothetical protein Adeh_1087 [Anaeromyxobacter dehalogenans 2CP-C] gi|123499918|sp|Q2IPY3|Y1087_ANADE RecName: Full=UPF0235 protein Adeh_1087 gi|85774025|gb|ABC80862.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans 2CP-C] Length = 95 Score = 57.7 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A ++ D +KI++ A P G AN A++ LA L + ++ Sbjct: 13 LEILVQPRASRTRAVG---EHDGR----LKIQLAAPPVDGAANAALVEFLAVALGVRRAD 65 Query: 64 LRMLSKQSSPLKIIYI 79 + +L ++ K + + Sbjct: 66 VALLRGEAGRRKTVRV 81 >gi|327399372|ref|YP_004340241.1| hypothetical protein Hipma_1220 [Hippea maritima DSM 10411] gi|327182001|gb|AEA34182.1| UPF0235 protein yggU [Hippea maritima DSM 10411] Length = 84 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 8/77 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ PN+K D + +K+ P +GKANKA++ LAK+L ++KS Sbjct: 3 LEVKVKPNSKVEQFD--------FDKGVLTLKIKEKPVEGKANKAVVDKLAKRLKVAKSC 54 Query: 64 LRMLSKQSSPLKIIYID 80 + ++ + S K++ ID Sbjct: 55 IEIVKGEKSRSKLVRID 71 >gi|189501744|ref|YP_001957461.1| hypothetical protein Aasi_0294 [Candidatus Amoebophilus asiaticus 5a2] gi|259646928|sp|B3ER80|Y294_AMOA5 RecName: Full=UPF0235 protein Aasi_0294 gi|189497185|gb|ACE05732.1| hypothetical protein Aasi_0294 [Candidatus Amoebophilus asiaticus 5a2] Length = 97 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 42/77 (54%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+ IP +K S I + I++T+ P+ GKAN+ ++ ++AK L L +++ Sbjct: 5 IEVKAIPKSKISAIT-------IDKLGRLCIRITSAPENGKANREIIKLIAKTLKLPQAN 57 Query: 64 LRMLSKQSSPLKIIYID 80 + +++ + LK I I Sbjct: 58 VEIIAGLTIKLKRIRIT 74 >gi|195953751|ref|YP_002122041.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1] gi|226734129|sp|B4U5M3|Y1378_HYDS0 RecName: Full=UPF0235 protein HY04AAS1_1378 gi|195933363|gb|ACG58063.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1] Length = 73 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 43/76 (56%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ PNAK + LE +KI + + P GKAN+ ++ +L++ L +SKS Sbjct: 3 LRVKVKPNAKTVSVEQLEDKS-------LKISIKSPPVNGKANEELIKVLSEFLKVSKSK 55 Query: 64 LRMLSKQSSPLKIIYI 79 + + + +SS K++ I Sbjct: 56 INIKAGKSSREKLVEI 71 >gi|156548436|ref|XP_001604898.1| PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] Length = 122 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+ P AK++ I + I ++A PQ+G+AN ++ LA L + K Sbjct: 33 VTIKVQAKPGAKQNNITDF-------SEETVGIAISAPPQEGEANAELVKYLASILNVRK 85 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S + + S K + + +++TE L+ Sbjct: 86 SDVTLDRGSRSRQKKVIVTGSSVEKVTEKLKAE 118 >gi|88602392|ref|YP_502570.1| hypothetical protein Mhun_1102 [Methanospirillum hungatei JF-1] gi|88187854|gb|ABD40851.1| conserved hypothetical protein [Methanospirillum hungatei JF-1] Length = 117 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 18/53 (33%), Positives = 29/53 (54%) Query: 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 I V A P +GKANKA++ ++A L ++ +LS Q+S +K + I Sbjct: 53 RKAFGIAVKAPPVEGKANKAIMELIAGYFHLPVHAVTILSGQTSSVKKVRIHG 105 >gi|149057384|gb|EDM08707.1| similar to RIKEN cDNA 3110040N11, isoform CRA_d [Rattus norvegicus] Length = 181 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 91 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 143 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +E+ E L+ Sbjct: 144 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTE 177 >gi|259418717|ref|ZP_05742634.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259344939|gb|EEW56793.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 92 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A + + +K+ VT P+ GKA + A+LA+ L ++ Sbjct: 20 AEITVRVTPKAAYNAVL--------RQGDVIKVMVTTVPEDGKATADVAALLARALGVAP 71 Query: 62 SSLRMLSKQSSPLKIIYID 80 S + + +S K + Sbjct: 72 SQITLRRGATSRDKTFVLT 90 >gi|195337079|ref|XP_002035160.1| GM14073 [Drosophila sechellia] gi|194128253|gb|EDW50296.1| GM14073 [Drosophila sechellia] Length = 128 Score = 57.3 bits (137), Expect = 7e-07, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AK++GI + + +++ A P +G+AN ++ L+K L L KS + + Sbjct: 45 KPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSDVSLDK 97 Query: 69 KQSSPLKIIYIDKD---CKEITELLQNN 93 S K I I K + I ++L+ Sbjct: 98 GSRSRNKKIMITKGVSTVEAIEQMLRKE 125 >gi|24214436|ref|NP_711917.1| hypothetical protein LA_1736 [Leptospira interrogans serovar Lai str. 56601] gi|45657916|ref|YP_002002.1| hypothetical protein LIC12068 [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|29839712|sp|Q8F5E6|Y1736_LEPIN RecName: Full=UPF0235 protein LA_1736 gi|73921077|sp|Q72QP5|Y2068_LEPIC RecName: Full=UPF0235 protein LIC_12068 gi|24195381|gb|AAN48935.1|AE011350_4 conserved hypothetical protein [Leptospira interrogans serovar Lai str. 56601] gi|45601157|gb|AAS70639.1| conserved hypothetical protein [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 73 Score = 56.9 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V + PN+KK +D + I V +GKAN+A++ ++K++ + KS +R Sbjct: 5 VYVKPNSKK---VFFRKEEDG----VLTIAVREPALEGKANEAVIESISKEMKVPKSKIR 57 Query: 66 MLSKQSSPLKIIYID 80 +LS Q + KII ID Sbjct: 58 ILSGQKNKKKIIEID 72 >gi|296134781|ref|YP_003642023.1| protein of unknown function DUF167 [Thiomonas intermedia K12] gi|294338737|emb|CAZ87069.1| conserved hypothetical protein [Thiomonas sp. 3As] gi|295794903|gb|ADG29693.1| protein of unknown function DUF167 [Thiomonas intermedia K12] Length = 106 Score = 56.9 bits (136), Expect = 8e-07, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 8/88 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A I + +K+ VTA P+ G+A M+ LAK+ + S++ ++ Sbjct: 26 KPSAGVDAIGKPK-------GPQLKVSVTAAPRAGRATDHMVRFLAKEFGVPTSAIEVVF 78 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDSL 96 + + K + I +++ + Q L Sbjct: 79 GRMNVNKQLRIHA-PQKLPAVFQQTSLL 105 >gi|6503188|gb|AAF14630.1|AF200362_6 unknown [Haemophilus ducreyi] Length = 62 Score = 56.9 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 18/62 (29%), Positives = 31/62 (50%), Gaps = 3/62 (4%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD---CKEITE 88 +K+ +TA P G AN +L L+K + KSS+ + + K +++ KEI + Sbjct: 1 LKVAITAPPVDGAANAYLLKYLSKLFKVPKSSIVLEKGELQRHKQLFVPAPKLLPKEIEQ 60 Query: 89 LL 90 L Sbjct: 61 WL 62 >gi|169629033|ref|YP_001702682.1| hypothetical protein MAB_1946 [Mycobacterium abscessus ATCC 19977] gi|169241000|emb|CAM62028.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 76 Score = 56.9 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 43/77 (55%), Gaps = 5/77 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +V +I + G ++E+ D + + + V GKANKA +A+LA+ L + KS Sbjct: 4 RTVVCVIKPGSRKG-PAVEVADDGA----LTLFVREPAIDGKANKAAIALLAEYLDVPKS 58 Query: 63 SLRMLSKQSSPLKIIYI 79 ++R+++ Q+S LK + Sbjct: 59 TVRLVAGQTSRLKRFSV 75 >gi|296284807|ref|ZP_06862805.1| hypothetical protein CbatJ_14361 [Citromicrobium bathyomarinum JL354] Length = 92 Score = 56.9 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 39/76 (51%), Gaps = 8/76 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A+ +A ++ +++KV A PQ G AN A+ ++AK L + K Sbjct: 21 GELRVRVTPGARTESLAIVDGG--------VQVKVRAKPQDGAANVAVAELVAKALGIPK 72 Query: 62 SSLRMLSKQSSPLKII 77 S +L +S K++ Sbjct: 73 SRCTLLRGATSREKVL 88 >gi|222055825|ref|YP_002538187.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] gi|221565114|gb|ACM21086.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] Length = 99 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 40/78 (51%), Gaps = 8/78 (10%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I ++ + + VTA P+ G+A M+ LA++ +S S ++++S Sbjct: 20 TPNAKRDAIGKVK-------GHQLCVSVTAVPRAGRATDHMVRFLAEEFGVSVSDIQVVS 72 Query: 69 KQSSPLKIIYIDKDCKEI 86 + + K + I K K + Sbjct: 73 GRMNVNKQLRI-KAPKRL 89 >gi|241835850|ref|XP_002415074.1| conserved hypothetical protein [Ixodes scapularis] gi|215509286|gb|EEC18739.1| conserved hypothetical protein [Ixodes scapularis] Length = 104 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 38/93 (40%), Gaps = 7/93 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ S +I D +++ A P G+AN ++ LAK L L K Sbjct: 13 GTVAIRVHAKPGASESRITDIGTDGVG-----VQIAAPPMDGEANAELVRFLAKVLNLRK 67 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 S + + S K++ I EI LLQ Sbjct: 68 SDVSLEKGSRSKDKVVMIASPATAAEILSLLQQ 100 >gi|293348273|ref|XP_002726816.1| PREDICTED: hypothetical protein [Rattus norvegicus] Length = 178 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 88 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 140 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +E+ E L+ Sbjct: 141 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTE 174 >gi|293360116|ref|XP_002729708.1| PREDICTED: hypothetical protein [Rattus norvegicus] Length = 304 Score = 56.1 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 214 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 266 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +E+ E L+ Sbjct: 267 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTE 300 >gi|242398935|ref|YP_002994359.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739] gi|242265328|gb|ACS90010.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739] Length = 94 Score = 56.1 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 43/91 (47%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I ++ + +K+K+ A P +GKANK ++ +K L Sbjct: 9 VILQIYVQPKANTNEIEGVDEWR-----GRLKVKIKAPPVEGKANKEVVKFFSKLLG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + ++ ++S K + I +E+ + L+ Sbjct: 61 AEASLIKGETSREKDLLIRGISIEEVKKKLK 91 >gi|260800225|ref|XP_002595035.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae] gi|229280275|gb|EEN51046.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae] Length = 98 Score = 56.1 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 38/89 (42%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + P AK + I + +++TA P +G+AN + LA L + KS+ Sbjct: 12 VAIHAKPGAKANAITDVTTETVG-------VQITAPPMEGEANAELCRYLAGVLEVKKSA 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S K + +D I +LQ Sbjct: 65 VSLERGAKSREKTVRVDTPGTTIDAVLQR 93 >gi|145543083|ref|XP_001457228.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124425043|emb|CAK89831.1| unnamed protein product [Paramecium tetraurelia] Length = 106 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 42/89 (47%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ PN+K S I + + + + A P+ G+AN + +A+ L + K++ Sbjct: 20 LVINAKPNSKVSQITGISDEA-------VDVNIAAPPKDGEANAELCDFVAQTLGVKKTA 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +++ K+I I+ K+I E + Sbjct: 73 IQVQKGGKGRNKLIKIESKFKDINEFYEK 101 >gi|294916777|ref|XP_002778391.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] gi|239886749|gb|EER10186.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] Length = 132 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 20/95 (21%), Positives = 47/95 (49%), Gaps = 9/95 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK-LALS 60 + +R P AK S + ++ + +++ A + G+AN+ +L+ L+K+ L + Sbjct: 41 ARIAIRAKPGAKVSCLTGIDAEG------ALGVQLNAPARDGEANEELLSFLSKEVLGVK 94 Query: 61 KSSLRMLSKQSSPLKIIYIDK--DCKEITELLQNN 93 K + ++ S K++ I ++++ LL+N Sbjct: 95 KKDVALVQGSKSREKVVEIADVLTVEDVSRLLRNE 129 >gi|145538297|ref|XP_001454854.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124422631|emb|CAK87457.1| unnamed protein product [Paramecium tetraurelia] Length = 106 Score = 56.1 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 42/89 (47%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ PN+K S I + + I + A P+ G+AN + +A+ L + K++ Sbjct: 20 LVINAKPNSKVSQITGISDEA-------VDINIAAPPKDGEANAELCDFVAQTLGVKKTA 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +++ K++ I+ K+I + + Sbjct: 73 IQVNKGGKGRNKLVSIESKFKDINDFFEK 101 >gi|261416137|ref|YP_003249820.1| protein of unknown function DUF167 [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372593|gb|ACX75338.1| protein of unknown function DUF167 [Fibrobacter succinogenes subsp. succinogenes S85] gi|302328014|gb|ADL27215.1| conserved hypothetical protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 74 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 33/79 (41%), Gaps = 7/79 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +++ +K+ + K++V A P G AN A+ ++A + K Sbjct: 2 RINIKVHARSKRESVT-------PQPDGSYKVEVKAPPVDGAANAAICELIADYFHVHKR 54 Query: 63 SLRMLSKQSSPLKIIYIDK 81 + ++ ++ K+I I Sbjct: 55 DVSVVMGSTNNKKVIEILG 73 >gi|118794573|ref|XP_321597.3| AGAP001528-PA [Anopheles gambiae str. PEST] gi|116116359|gb|EAA01322.3| AGAP001528-PA [Anopheles gambiae str. PEST] Length = 147 Score = 55.7 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 44/101 (43%), Gaps = 14/101 (13%) Query: 2 CNVIVRL--IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 N+IV++ P AK SGI + + A P G+AN ++ L+K L L Sbjct: 51 GNLIVKILAKPGAKTSGITDVSEEGIGCQ-------IAAPPIDGEANTELIKYLSKLLDL 103 Query: 60 SKSSLRMLSKQSSPLKIIYIDK-----DCKEITELLQNNDS 95 KS + + S K I +DK +++ + +N S Sbjct: 104 RKSDISLDRGSKSRQKTIVLDKAGCRHSPEQLLVIFRNEAS 144 >gi|195127445|ref|XP_002008179.1| GI11963 [Drosophila mojavensis] gi|193919788|gb|EDW18655.1| GI11963 [Drosophila mojavensis] Length = 125 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 12/97 (12%) Query: 2 CNVIVRL--IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 N+ +++ P AK++GI + + +++ A P +G+AN ++ L+K L L Sbjct: 33 GNIAIKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGL 85 Query: 60 SKSSLRMLSKQSSPLKIIYIDKD---CKEITELLQNN 93 KS + + S KII + K + I + L+ Sbjct: 86 RKSDVSLDKGSRSRNKIILVCKGVTTVEAIEQSLRKE 122 >gi|195376389|ref|XP_002046979.1| GJ12187 [Drosophila virilis] gi|194154137|gb|EDW69321.1| GJ12187 [Drosophila virilis] Length = 124 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 36 IKILAKPGAKQNGITDIGLDGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 88 Query: 64 LRMLSKQSSPLKIIYIDKDC---KEITELLQNN 93 + + S KI+ + K + I +LL+ Sbjct: 89 VSLDKGSRSRNKIVLVTKGASTVEAIEQLLRKE 121 >gi|322792209|gb|EFZ16226.1| hypothetical protein SINV_80163 [Solenopsis invicta] Length = 122 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 34/82 (41%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P AK + I + I ++A P +G+AN ++ LA + KS Sbjct: 36 IKIQAKPGAKCNNITDISDEAVG-------IAISAPPTEGEANAELVKYLASTFGVRKSD 88 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + S K++ + + Sbjct: 89 VTLDRGSRSRQKVVVVSGITTD 110 >gi|222055811|ref|YP_002538173.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] gi|221565100|gb|ACM21072.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] Length = 98 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 8/78 (10%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I ++ + + VTA P+ G+A M+ LA++ +S S ++++ Sbjct: 19 TPNAKRDAIGKVK-------GHQLCVSVTAVPRAGRATDHMVRFLAEEFGVSVSDIQVVF 71 Query: 69 KQSSPLKIIYIDKDCKEI 86 + + K + I K K + Sbjct: 72 GRMNVNKQLRI-KAPKRL 88 >gi|332157902|ref|YP_004423181.1| hypothetical protein PNA2_0260 [Pyrococcus sp. NA2] gi|331033365|gb|AEC51177.1| hypothetical protein PNA2_0260 [Pyrococcus sp. NA2] Length = 92 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 43/88 (48%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + PNAKK+ I ++ K ++I V A P KGKAN+ ++ L L + Sbjct: 11 IYVLVKPNAKKTEIEGVDTWK-----KRIRISVKAPPVKGKANRELVNFLQ---GLLNAE 62 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ ++S K + I +E+ L Sbjct: 63 VILVRGETSREKELLIKGLKVEEVKRKL 90 >gi|149456878|ref|XP_001519576.1| PREDICTED: similar to chromosome 15 open reading frame 40, partial [Ornithorhynchus anatinus] Length = 119 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 39/98 (39%), Gaps = 9/98 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P AK++ + + + + + A P +G+AN + LAK L L K Sbjct: 29 VTIAVHAKPGAKQNAVTDVSVEAVG-------VAIAAPPSEGEANAELCRYLAKILELRK 81 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97 S + + S K+I I EI L+ T Sbjct: 82 SDVVLDRGGKSREKVIKILSSTTPDEILAKLKKQTETT 119 >gi|110596999|ref|ZP_01385289.1| Protein of unknown function DUF167 [Chlorobium ferrooxidans DSM 13031] gi|110341686|gb|EAT60146.1| Protein of unknown function DUF167 [Chlorobium ferrooxidans DSM 13031] Length = 97 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 36/85 (42%), Gaps = 7/85 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 +V+ P + KS + L +K+ + A P AN+ + +K + S + Sbjct: 14 VVKAQPRSSKSRVCGLYNGG-------LKVSLKAAPVDDAANRECCDLFSKVFHIPPSRV 66 Query: 65 RMLSKQSSPLKIIYIDKDCKEITEL 89 +++ +SS K + +D E L Sbjct: 67 HIIAGKSSRTKTVMLDGVTVEAAAL 91 >gi|118353243|ref|XP_001009893.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila] gi|89291659|gb|EAR89647.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila SB210] Length = 127 Score = 55.4 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 22/94 (23%), Positives = 46/94 (48%), Gaps = 11/94 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + PN+K S I+ + + I + A P+ G+AN ++ +++ L + KSS Sbjct: 39 ISIHAKPNSKISQISGISDEG-------VDINIAAPPKDGEANAELIDYISQVLGVKKSS 91 Query: 64 LRMLSKQSSPLKIIYIDK----DCKEITELLQNN 93 L + S K++ I D +E+ + L+++ Sbjct: 92 LSLDKGGKSRNKLMEISDSGYADVEELYQALKDS 125 >gi|117927694|ref|YP_872245.1| hypothetical protein Acel_0486 [Acidothermus cellulolyticus 11B] gi|117648157|gb|ABK52259.1| protein of unknown function DUF167 [Acidothermus cellulolyticus 11B] Length = 85 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 8/91 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++ + P + ++ + S + + V + G+A A LA +A+ + KS Sbjct: 2 RLVIHVRPGSSRATVGG-------SHNGALIVAVREPAEHGRATDAALAAVAQAFGVPKS 54 Query: 63 SLRMLSKQSSPLKII-YIDKDCKEITELLQN 92 +R++S +S KII ID D + ELL Sbjct: 55 QVRLVSGATSRRKIIDVIDGDPVRLAELLAR 85 >gi|327401804|ref|YP_004342643.1| hypothetical protein Arcve_1935 [Archaeoglobus veneficus SNP6] gi|327317312|gb|AEA47928.1| UPF0235 protein yggU [Archaeoglobus veneficus SNP6] Length = 101 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 45/91 (49%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + PN+K+S + + + + ++V A P+ GKAN ++ + +K + Sbjct: 16 ISIEVTPNSKQSCVYGYNEWRKS-----IAVRVKAPPKGGKANAEIVELFSKIF---RKK 67 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + ++ +S K+I++ +E+ LL + Sbjct: 68 VEIVKGHTSSQKVIFVHSASPQEVESLLVKS 98 >gi|328717531|ref|XP_003246233.1| PREDICTED: UPF0235 protein C15orf40 homolog [Acyrthosiphon pisum] Length = 136 Score = 55.0 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 24/96 (25%), Positives = 43/96 (44%), Gaps = 14/96 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK + I +I D + +++ A P G+AN ++ L+K L L KS Sbjct: 47 IKINAKPGAKNNNIT--DISSDG-----IGVQINAPPTDGEANAELIKYLSKVLGLRKSD 99 Query: 64 LRMLSKQSSPLKIIY-------IDKDCKEITELLQN 92 L + S KI+ I+ ++I E + + Sbjct: 100 LSLDRGSRSRNKILIVHNTSLGIEGITEKIKEEIND 135 >gi|157743130|gb|AAI49509.1| C21H15orf40 protein [Bos taurus] Length = 132 Score = 55.0 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 41/93 (44%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 42 VSIAIHAKPGSKQNAVTDVTTEA-------VSVAIAAPPTEGEANAELCRYLSKVLELRK 94 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 S + + S K++ + +EI E L+ Sbjct: 95 SDVVLDKGGKSREKVVKLLASTPPEEILEKLKK 127 >gi|312375313|gb|EFR22710.1| hypothetical protein AND_14314 [Anopheles darlingi] Length = 155 Score = 55.0 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 17/47 (36%), Positives = 25/47 (53%) Query: 36 VTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD 82 + A P G+AN ++ L+K L L KS + + S K I +DKD Sbjct: 88 LAAPPIDGEANTELIRYLSKLLELRKSDISLDRGSKSRQKTIVLDKD 134 >gi|195014316|ref|XP_001984001.1| GH15254 [Drosophila grimshawi] gi|193897483|gb|EDV96349.1| GH15254 [Drosophila grimshawi] Length = 126 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 10/96 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 38 IKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 90 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQNNDSL 96 + + S K+I I K + I +LL+ L Sbjct: 91 VSLDKGSRSKNKLILITKGVSTVEAIEQLLRKESEL 126 >gi|126306449|ref|XP_001373757.1| PREDICTED: similar to chromosome 15 open reading frame 40, [Monodelphis domestica] Length = 146 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 40/92 (43%), Gaps = 9/92 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + P +K++ I + + + + A P +G+AN + L+K L L KS Sbjct: 57 TIAIHAKPGSKQNAITDVTTEN-------VSVAIAAPPSEGEANTELCRYLSKVLELRKS 109 Query: 63 SLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 + + S K++ I +EI E L+ Sbjct: 110 DVILDKGGKSREKVVKILASTTPEEILEKLKR 141 >gi|268326229|emb|CBH39817.1| conserved hypothetical protein, DUF167 family [uncultured archaeon] Length = 70 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 12/80 (15%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + +++IPN+K I E M I+V P KGKANKA++ +L++ Sbjct: 1 MKRIAIKVIPNSKTEEIIYAEP---------MIIRVKEPPTKGKANKAVVMLLSRYFN-- 49 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + +R++S S KI+ ++ Sbjct: 50 -ADVRIVSGAKSRRKIVEVE 68 >gi|33359419|ref|NP_877861.1| hypothetical protein PH1669.1n [Pyrococcus horikoshii OT3] Length = 95 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 48/89 (53%), Gaps = 9/89 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + PN+K++ I ++ K+ ++I + A P KG+ANK ++ L+K L + Sbjct: 14 IQVIVRPNSKENKIEGVDNWKN-----RIRISIKAPPVKGEANKELIKFLSKILG---AK 65 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + ++ ++S K + + +E+ + L+ Sbjct: 66 VEIIRGETSREKDLLVKGIKLEEVKKRLK 94 >gi|149185290|ref|ZP_01863607.1| hypothetical protein ED21_19592 [Erythrobacter sp. SD-21] gi|148831401|gb|EDL49835.1| hypothetical protein ED21_19592 [Erythrobacter sp. SD-21] Length = 93 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 35/78 (44%), Gaps = 8/78 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A+ + T + + KV A PQ G AN A+ ++A + Sbjct: 22 GRVAIRVTPGARTESL--------TLEGGVLAAKVRAKPQDGAANDAVRKLIAAAYRVPP 73 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + +L +S K++ I Sbjct: 74 TRVELLRGATSREKLLRI 91 >gi|322697515|gb|EFY89294.1| DUF167 domain protein [Metarhizium acridum CQMa 102] Length = 122 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 44/104 (42%), Gaps = 12/104 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ K + D +++ VTA P+ G+ANKA++ L+ + + K Sbjct: 22 VQLQLRVKAGTSKDR-EGILAVTD----RAIELCVTAQPRHGEANKAVVQALSNAIGIPK 76 Query: 62 SSLRMLSKQSSPLKIIYI-------DKDCKEITELLQNNDSLTL 98 S R +S S K++ I + + LL+ TL Sbjct: 77 SRFRFVSGLKSRDKVVAIGDIQGDGPDYTETVLRLLREASYRTL 120 >gi|332025796|gb|EGI65953.1| UPF0235 protein C15orf40-like protein [Acromyrmex echinatior] Length = 139 Score = 54.6 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 34/82 (41%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P AK + + + I ++A P +G+AN ++ LA + KS Sbjct: 53 IKIQAKPGAKCNNVTDISDEAIG-------IAISAPPTEGEANAELVKYLASIFGVRKSD 105 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + S K++ + + Sbjct: 106 VSLDRGSRSRQKVVIVSGISTD 127 >gi|301789543|ref|XP_002930186.1| PREDICTED: UPF0235 protein C15orf40 homolog [Ailuropoda melanoleuca] Length = 214 Score = 54.2 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 124 VTIAIHAKPGSKQNAVTDVTAEA-------VSVAIAAPPSEGEANAELCRYLSKVLELRK 176 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 S + + S K++ + +EI E L+ Sbjct: 177 SDVVLDKGGKSREKVVKLLASTTTEEILEKLKQ 209 >gi|315122788|ref|YP_004063277.1| hypothetical protein CKC_05215 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496190|gb|ADR52789.1| hypothetical protein CKC_05215 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 50 Score = 54.2 bits (129), Expect = 5e-06, Method: Composition-based stats. Identities = 33/50 (66%), Positives = 43/50 (86%) Query: 49 MLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 ML +LA++L+L+KSSL+MLSK SSP+K IYIDKDCKEI EL + N+ +TL Sbjct: 1 MLTILAERLSLNKSSLKMLSKHSSPIKKIYIDKDCKEIIELFKRNNPVTL 50 >gi|171679393|ref|XP_001904643.1| hypothetical protein [Podospora anserina S mat+] gi|170939322|emb|CAP64550.1| unnamed protein product [Podospora anserina S mat+] Length = 119 Score = 54.2 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 38/74 (51%), Gaps = 5/74 (6%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P A K+ + +D ++I V A ++G+ANKA++ +L++ L L KS+L + Sbjct: 26 VKPGASKNREGVASVGEDA-----VEICVAAQAREGEANKAVIKVLSEVLDLPKSNLEIT 80 Query: 68 SKQSSPLKIIYIDK 81 S K + + Sbjct: 81 QGHKSRNKTVAVIG 94 >gi|126273662|ref|XP_001365632.1| PREDICTED: similar to chromosome 15 open reading frame 40, [Monodelphis domestica] Length = 146 Score = 54.2 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 40/92 (43%), Gaps = 9/92 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + P +K++ I + + + + A P +G+AN + L+K L L KS Sbjct: 57 TIAIHAKPGSKQNAITDVTTEN-------VSVAIAAPPSEGEANTELCRYLSKVLELRKS 109 Query: 63 SLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 + + S K++ I +EI E L+ Sbjct: 110 DVILDKGGKSREKVVKILASTTPEEILEKLKR 141 >gi|258597870|ref|XP_001348716.2| conserved protein, unknown function [Plasmodium falciparum 3D7] gi|255528895|gb|AAN37155.2| conserved protein, unknown function [Plasmodium falciparum 3D7] Length = 147 Score = 53.8 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 40/91 (43%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PNAKK+ I D + I + P ++N A++ + L L K Sbjct: 61 ITLRVKPNAKKTSI------YFNQDKEVLNINIQEQPVNNQSNVAIIGYFSDILNLKKRD 114 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + ++S S K++ + ++ ++ N Sbjct: 115 ISIVSGLKSRDKVLMVSNISLDDLNNKIEEN 145 >gi|297697323|ref|XP_002825813.1| PREDICTED: UPF0235 protein C15orf40-like [Pongo abelii] Length = 242 Score = 53.8 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 9/98 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 152 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 204 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97 S + + S K++ + +EI E L+ + T Sbjct: 205 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKEATKT 242 >gi|256394873|ref|YP_003116437.1| hypothetical protein Caci_5738 [Catenulispora acidiphila DSM 44928] gi|256361099|gb|ACU74596.1| protein of unknown function DUF167 [Catenulispora acidiphila DSM 44928] Length = 92 Score = 53.8 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 44/91 (48%), Gaps = 7/91 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P + ++ + + + VTA G A +A L +A+ L + + Sbjct: 5 RIPIRVKPGSSRTKVGGRH------GERSLIVAVTAKAVDGAATEAALRAVAEALGMPRR 58 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 ++++++ +S K++ + D + EL+++ Sbjct: 59 AVQLITGATSRDKVLGVSSEDPDTVRELVRD 89 >gi|296126259|ref|YP_003633511.1| hypothetical protein Bmur_1218 [Brachyspira murdochii DSM 12563] gi|296018075|gb|ADG71312.1| protein of unknown function DUF167 [Brachyspira murdochii DSM 12563] Length = 84 Score = 53.8 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 44/91 (48%), Gaps = 8/91 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 N+ V++ AK + K + + +++ A GKANKA++ LA +L + K Sbjct: 2 NIEVKVTAGAKSNSF------KFENGAYY--VRIMAKAIDGKANKAIIEFLADELNIKKK 53 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + +L + S KII I+ + ++ E N Sbjct: 54 DIDILKGEKSSKKIIAINIEENKLKEYFSKN 84 >gi|15678665|ref|NP_275780.1| hypothetical protein MTH637 [Methanothermobacter thermautotrophicus str. Delta H] gi|29839449|sp|O26734|Y637_METTH RecName: Full=UPF0235 protein MTH_637 gi|2621719|gb|AAB85143.1| conserved protein [Methanothermobacter thermautotrophicus str. Delta H] Length = 104 Score = 53.8 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 42/92 (45%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P + K GI S + +++K+ + PQKGKAN+ ++ ++ Sbjct: 16 VNIEVSPASGKFGIPSYNEWR-----KRIEVKIHSPPQKGKANREIIKEFSETFG---RD 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + ++S Q S K I I +++ L + Sbjct: 68 VEIVSGQKSRQKTIRIQGMGRDLFLKLVSEKF 99 >gi|148284889|ref|YP_001248979.1| hypothetical protein OTBS_1640 [Orientia tsutsugamushi str. Boryong] gi|146740328|emb|CAM80735.1| conserved hypothetical protein [Orientia tsutsugamushi str. Boryong] Length = 112 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 44/89 (49%), Gaps = 3/89 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ AK + I L + S + I + P+ KAN+ ++ L++ L +S+ Sbjct: 20 INLKVKAGAKINKIIGLYHINNKS---FLYISINTIPENNKANQLIIKFLSQWLETGRSN 76 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++++ S LK+I + I+ L+ + Sbjct: 77 IKIVYGLHSNLKVISVMNTNGNISNLIIS 105 >gi|309361190|emb|CAP30075.2| hypothetical protein CBG_10764 [Caenorhabditis briggsae AF16] Length = 266 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 39/77 (50%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKSG+ ++ + + + + A P++G AN+ +++ L L L K+ Sbjct: 42 LRIHAKPGAKKSGVVAI-------NESEIDVAIGAAPREGAANEELVSYLMSALGLRKNE 94 Query: 64 LRMLSKQSSPLKIIYID 80 L+ S K++ I+ Sbjct: 95 LQFDKGAKSRSKVVLIE 111 >gi|268567307|ref|XP_002639944.1| Hypothetical protein CBG10764 [Caenorhabditis briggsae] Length = 258 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 39/77 (50%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKSG+ ++ + + + + A P++G AN+ +++ L L L K+ Sbjct: 35 LRIHAKPGAKKSGVVAI-------NESEIDVAIGAAPREGAANEELVSYLMSALGLRKNE 87 Query: 64 LRMLSKQSSPLKIIYID 80 L+ S K++ I+ Sbjct: 88 LQFDKGAKSRSKVVLIE 104 >gi|253701118|ref|YP_003022307.1| hypothetical protein GM21_2508 [Geobacter sp. M21] gi|251775968|gb|ACT18549.1| protein of unknown function DUF167 [Geobacter sp. M21] Length = 101 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 8/88 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + + I VTA P+ G+A M+ LA++ +S S ++++ Sbjct: 21 TPNAKRDAIGKPK-------GHQLCISVTAVPRAGRATDHMVRFLAEEFEVSVSDIQVVF 73 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDSL 96 + + K + I K K + ++ D L Sbjct: 74 GRMNVNKQLRI-KAPKRLPSVIGQQDLL 100 >gi|7508740|pir||T26031 hypothetical protein W01A8.2 - Caenorhabditis elegans Length = 263 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKS + ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 39 LHIHAKPGAKKSCVVAI-------GDSEVDVAIGAAPREGAANEELISYLMSALGLRKNE 91 Query: 64 LRMLSKQSSPLKIIYID 80 L+ S K++ ID Sbjct: 92 LQFDKGAKSRSKVVLID 108 >gi|148264159|ref|YP_001230865.1| hypothetical protein Gura_2104 [Geobacter uraniireducens Rf4] gi|146397659|gb|ABQ26292.1| protein of unknown function DUF167 [Geobacter uraniireducens Rf4] Length = 99 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 38/78 (48%), Gaps = 8/78 (10%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I ++ + + VTA P+ G+A M+ LA + +S S ++++ Sbjct: 20 TPNAKRDAIGKVK-------GHQLCVSVTAVPRAGRATDHMVRFLADEFGVSVSDIQVVF 72 Query: 69 KQSSPLKIIYIDKDCKEI 86 + + K + I K K + Sbjct: 73 GRMNVNKQLRI-KAPKRL 89 >gi|322708042|gb|EFY99619.1| DUF167 domain protein [Metarhizium anisopliae ARSEF 23] Length = 126 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 24/104 (23%), Positives = 44/104 (42%), Gaps = 12/104 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ K + D +++ V A P+ G+ANKA++ L+ + + K Sbjct: 26 VQLQLRVKAGTSKDR-EGILAVTD----RAIELCVAAQPRHGEANKAVVQALSNAIGIPK 80 Query: 62 SSLRMLSKQSSPLKIIYI-------DKDCKEITELLQNNDSLTL 98 S R +S S K++ I + + I LL+ TL Sbjct: 81 SRFRFVSGIKSRDKVVAIGDIQGDGPEYTETILRLLREASHRTL 124 >gi|320101055|ref|YP_004176647.1| hypothetical protein Desmu_0861 [Desulfurococcus mucosus DSM 2162] gi|319753407|gb|ADV65165.1| protein of unknown function DUF167 [Desulfurococcus mucosus DSM 2162] Length = 138 Score = 53.8 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 40/90 (44%), Gaps = 9/90 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + + T + + + P++G+AN A++ +++L + S Sbjct: 52 LSIRVKPGESEDFL--------TVEGDELVFYTSEPPERGRANAALVKFFSRELKIPVSR 103 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + ++ S LK + D + E+ + L Sbjct: 104 IDIVYGHRSTLKKLVFYDVNMDELADKLAK 133 >gi|14520717|ref|NP_126192.1| hypothetical protein PAB7122 [Pyrococcus abyssi GE5] gi|29839688|sp|Q9V1C6|Y501_PYRAB RecName: Full=UPF0235 protein PYRAB05010 gi|5457933|emb|CAB49423.1| Hypothetical protein [Pyrococcus abyssi GE5] Length = 92 Score = 53.8 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 45/91 (49%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + PNA+++ I ++ + +K+ + A P KGKAN+ ++ L+ Sbjct: 9 VILRVIVKPNARENSIEGIDEWR-----GRIKVNIKAQPVKGKANRELIKFLSNLFG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + + +L ++S K + + + +E+ L+ Sbjct: 61 AEVEILKGETSREKDVLVRGVNLEEVKRRLK 91 >gi|20150521|pdb|1JRM|A Chain A, Nmr Structure Of Mth0637. Ontario Centre For Structural Proteomics Target Mth0637_1_104; Northeast Structural Genomics Target Tt135 Length = 104 Score = 53.4 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 42/92 (45%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P + K GI S + +++K+ + PQKGKAN+ ++ ++ Sbjct: 16 VNIEVSPASGKFGIPSYNEWR-----KRIEVKIHSPPQKGKANREIIKEFSETFG---RD 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + ++S Q S K I I +++ L + Sbjct: 68 VEIVSGQKSRQKTIRIQGMGRDLFLKLVSEKF 99 >gi|116328001|ref|YP_797721.1| hypothetical protein LBL_1291 [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116330879|ref|YP_800597.1| hypothetical protein LBJ_1240 [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|122281367|sp|Q04TD1|Y1240_LEPBJ RecName: Full=UPF0235 protein LBJ_1240 gi|122284207|sp|Q052F7|Y1291_LEPBL RecName: Full=UPF0235 protein LBL_1291 gi|116120745|gb|ABJ78788.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116124568|gb|ABJ75839.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 73 Score = 53.4 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 40/75 (53%), Gaps = 7/75 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ PN+KK +D S + I V +GKAN+A++ +++++ + K +R Sbjct: 5 VRVKPNSKK---IFFRKEEDGS----VTIAVREPALEGKANEAVIETISREMKIPKRKIR 57 Query: 66 MLSKQSSPLKIIYID 80 ++S + K I ID Sbjct: 58 IVSGEKGKKKTIEID 72 >gi|78221372|ref|YP_383119.1| hypothetical protein Gmet_0145 [Geobacter metallireducens GS-15] gi|78192627|gb|ABB30394.1| protein of unknown function DUF167 [Geobacter metallireducens GS-15] Length = 100 Score = 53.4 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 8/87 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I ++ + + VTA P+ G+A M+ LA + +S ++++ Sbjct: 20 TPNAKRDAIGKVK-------GHQLCVSVTAIPRAGRATDHMVRFLADEFGVSVGDIQVVF 72 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDS 95 + + K + I K K + L+ + Sbjct: 73 GRMNVNKQLRI-KAPKRLPPLIGQQEF 98 >gi|297526283|ref|YP_003668307.1| protein of unknown function DUF167 [Staphylothermus hellenicus DSM 12710] gi|297255199|gb|ADI31408.1| protein of unknown function DUF167 [Staphylothermus hellenicus DSM 12710] Length = 110 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 41/90 (45%), Gaps = 9/90 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + PN+ + + + + T P+KG+AN A++ L++ + L Sbjct: 22 VIIPIYVKPNSDRDALV--------LEGDELVFYTTEIPEKGRANAALIRFLSRNIRLPH 73 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELL 90 + + ++ + K + + D + +++ E L Sbjct: 74 NKIDIIYGARTRSKKVLVRDMEAEKLAEKL 103 >gi|198282397|ref|YP_002218718.1| hypothetical protein Lferr_0253 [Acidithiobacillus ferrooxidans ATCC 53993] gi|198246918|gb|ACH82511.1| protein of unknown function DUF167 [Acidithiobacillus ferrooxidans ATCC 53993] Length = 113 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 34/72 (47%), Gaps = 7/72 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A + I + +K+ VTA P+ G+A M+ LA + ++ S++ ++ Sbjct: 32 KPSAGRDAIGKPK-------GAQIKVSVTAEPRNGRATDHMVRFLAGEFGVAPSAIEVVF 84 Query: 69 KQSSPLKIIYID 80 + + K + I Sbjct: 85 GRMNVNKQLRIK 96 >gi|47213227|emb|CAF89748.1| unnamed protein product [Tetraodon nigroviridis] Length = 120 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 39/93 (41%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P +K S + ++ +++ + A P G+AN ++ LA+ L L K Sbjct: 32 VTITVHAKPGSKHSRVTAVSTEA-------VEVAIAAPPVDGEANVELVRFLAEVLELKK 84 Query: 62 SSLRMLSKQSSPLKIIYIDK--DCKEITELLQN 92 L + S K + +D +E+ L+ Sbjct: 85 GHLHLDKGSRSRDKQVRVDSPLSPEEVLRRLRQ 117 >gi|18978137|ref|NP_579494.1| hypothetical protein PF1765 [Pyrococcus furiosus DSM 3638] gi|29839724|sp|Q8U052|Y1765_PYRFU RecName: Full=UPF0235 protein PF1765 gi|18893938|gb|AAL81889.1| hypothetical protein PF1765 [Pyrococcus furiosus DSM 3638] Length = 92 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + PNA+++ I ++ +K+ V A P KGKANK ++ K Sbjct: 9 VILSVIVAPNARETKIVGID-----GTRGRVKVNVAAPPVKGKANKELMKFFKKLFG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELLQ 91 + + ++ ++S K + I KE+ E L+ Sbjct: 61 AEVVIVRGETSREKDLLIKGITKKEVIEKLE 91 >gi|307611594|emb|CBX01276.1| hypothetical protein LPW_29741 [Legionella pneumophila 130b] Length = 70 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 13/49 (26%), Positives = 29/49 (59%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 + I + A PQ+G+AN +L +++ + K+ + ++ +SS K+I + Sbjct: 8 LHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLIRLP 56 >gi|197118100|ref|YP_002138527.1| hypothetical protein Gbem_1715 [Geobacter bemidjiensis Bem] gi|197087460|gb|ACH38731.1| protein of unknown function DUF167 [Geobacter bemidjiensis Bem] Length = 101 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 8/86 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + + I VTA P+ G+A M+ LA++ ++ S ++++ Sbjct: 21 TPNAKRDAIGKPK-------GHQLCISVTAVPRAGRATDHMVRFLAEEFEVAVSDIQVVF 73 Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94 + + K + I K K + ++ D Sbjct: 74 GRMNVNKQLRI-KAPKRLPSVIGQQD 98 >gi|119946689|ref|YP_944369.1| hypothetical protein Ping_3071 [Psychromonas ingrahamii 37] gi|119865293|gb|ABM04770.1| hypothetical protein DUF167 [Psychromonas ingrahamii 37] Length = 111 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 10/90 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AKK + + I ++++VTATP G+A M+ LAK+ ++S + ++ Sbjct: 28 TPGAKKDAVGK-------AQGIQLRVRVTATPVAGRATDHMVRFLAKEFSVSPDDITVVF 80 Query: 69 KQSSPLKIIYI---DKDCKEITELLQNNDS 95 + + K + I K + + L DS Sbjct: 81 GRLNINKQLRIKAPKKMPSVVAKALSKKDS 110 >gi|145253637|ref|XP_001398331.1| yggU family protein [Aspergillus niger CBS 513.88] gi|134083900|emb|CAK48804.1| unnamed protein product [Aspergillus niger] Length = 126 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 21/75 (28%), Positives = 38/75 (50%), Gaps = 9/75 (12%) Query: 8 LIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 + PNA + GI ++ T + + V A P+KG+AN A+ +LA+ + KS++ Sbjct: 32 VKPNASSNREGITAI-------GTDRVDVCVAAVPRKGEANAAVSRVLAQIFQVPKSNVE 84 Query: 66 MLSKQSSPLKIIYID 80 ++ S K + I Sbjct: 85 VIRGLKSREKTLAIS 99 >gi|307176568|gb|EFN66055.1| UPF0235 protein C15orf40-like protein [Camponotus floridanus] Length = 122 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 34/82 (41%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P AK + I + I ++A P +G+AN ++ LA L KS+ Sbjct: 36 IKIQAKPGAKCNNITDISDEAVG-------IAISAPPMEGEANAELVKYLASIFELRKSN 88 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + S K + + + Sbjct: 89 VSLDRGSRSRQKTVTVSGITTD 110 >gi|301166027|emb|CBW25601.1| conserved hypothetical protein [Bacteriovorax marinus SJ] Length = 109 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 32/72 (44%), Gaps = 9/72 (12%) Query: 9 IPNAKKSG-IASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 P AK I E I++K + P G+ANKA + LA +L+++KSS+ + Sbjct: 22 KPGAKVEKSIVGDE----GEIIIYIKER----PIDGQANKAFIKYLAAQLSITKSSVSLS 73 Query: 68 SKQSSPLKIIYI 79 S K Sbjct: 74 RGSKSRFKRFSF 85 >gi|298243513|ref|ZP_06967320.1| protein of unknown function DUF167 [Ktedonobacter racemifer DSM 44963] gi|297556567|gb|EFH90431.1| protein of unknown function DUF167 [Ktedonobacter racemifer DSM 44963] Length = 90 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 49/91 (53%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+IP + ++ + E +K ++TA P G AN A++A+LA+ L+L K + Sbjct: 3 VPVRVIPRSNRNTLEWEE--------GAIKARLTAPPVDGAANAALIALLAETLSLPKRA 54 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + ++ + KI+ I+ + EI + L + Sbjct: 55 ITLIRGTTGRQKIVEIEGLEQVEIIQRLSAS 85 >gi|311260654|ref|XP_001929217.2| PREDICTED: UPF0235 protein C15orf40 homolog [Sus scrofa] Length = 154 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 39/93 (41%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L K Sbjct: 64 VTIAIHAKPGSKQNAVTDLTTEA-------VSVAIAAPPSEGEANAELCRYLSKVFELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 S + + S K++ + +EI E L+ Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKK 149 >gi|85092099|ref|XP_959226.1| hypothetical protein NCU06879 [Neurospora crassa OR74A] gi|28920629|gb|EAA29990.1| predicted protein [Neurospora crassa OR74A] Length = 130 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 5/76 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P A K+ + D + ++I V A ++G+ANKA++ +L++ L L KS+ Sbjct: 30 IHCHVKPGASKNR-EGVTSITDEA----VEICVAAQAKEGEANKAVVKVLSEALNLPKSN 84 Query: 64 LRMLSKQSSPLKIIYI 79 L + S K I + Sbjct: 85 LEITQGLKSRAKTIAV 100 >gi|242021057|ref|XP_002430963.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212516183|gb|EEB18225.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 117 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 39/88 (44%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AK + I ++ +++ A P G+AN ++ ++ L L K+ + + Sbjct: 35 KPGAKTNAITGIDEEGIG-------VQINARPVDGEANSELVNYMSCLLGLRKTEISLEK 87 Query: 69 KQSSPLKIIYIDK---DCKEITELLQNN 93 S KI+ I K +EI E L+N Sbjct: 88 GSKSRQKILLISKKDLSTEEIIEKLKNE 115 >gi|54295945|ref|YP_122257.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris] gi|53755777|emb|CAH17279.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris] Length = 98 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AK + + + +KIK+ A + KAN ++ L+ + KS Sbjct: 17 LSLLIQPGAKCNQVVG-------AVGEELKIKIAAPSIEVKANMELVRYLSVLFKVPKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDK 81 +++ S KII + Sbjct: 70 IKIKRGLKSRHKIIEVIG 87 >gi|260576559|ref|ZP_05844548.1| protein of unknown function DUF167 [Rhodobacter sp. SW2] gi|259021282|gb|EEW24589.1| protein of unknown function DUF167 [Rhodobacter sp. SW2] Length = 66 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 32/60 (53%), Gaps = 8/60 (13%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + E +++ VT P+ GKA + ++A+LAK L ++K Sbjct: 14 ADFAVRVTPKASRNAVVVEE--------GAIRVYVTCVPEDGKATREVVALLAKALGVAK 65 >gi|294949448|ref|XP_002786202.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] gi|239900359|gb|EER17998.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] Length = 132 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 47/95 (49%), Gaps = 9/95 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK-LALS 60 + +R P AK S + ++ + +++ A+ + G+AN+ +L+ L+K+ L + Sbjct: 41 ARIAIRAKPGAKVSCLTGIDAEG------ALGVQLNASARDGEANEELLSFLSKEVLGVK 94 Query: 61 KSSLRMLSKQSSPLKIIYIDK--DCKEITELLQNN 93 K + ++ S K++ I +++ LL++ Sbjct: 95 KKDVALVQGSKSREKVVEIADVLTVDDVSRLLRDE 129 >gi|148358546|ref|YP_001249753.1| hypothetical protein LPC_0417 [Legionella pneumophila str. Corby] gi|296108365|ref|YP_003620066.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99 Alcoy] gi|148280319|gb|ABQ54407.1| conserved hypothetical protein; DUF167 [Legionella pneumophila str. Corby] gi|295650267|gb|ADG26114.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99 Alcoy] Length = 70 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 13/49 (26%), Positives = 29/49 (59%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 + I + A PQ+G+AN +L +++ + K+ + ++ +SS K+I + Sbjct: 8 LHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLIRLP 56 >gi|119357873|ref|YP_912517.1| hypothetical protein Cpha266_2081 [Chlorobium phaeobacteroides DSM 266] gi|187479907|sp|A1BI66|Y2081_CHLPD RecName: Full=UPF0235 protein Cpha266_2081 gi|119355222|gb|ABL66093.1| protein of unknown function DUF167 [Chlorobium phaeobacteroides DSM 266] Length = 101 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 42/89 (47%), Gaps = 8/89 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 ++ P + KS I+ +K+ + A P AN+ + AK L++S S L Sbjct: 19 LKAQPRSSKSAISGAYNGG-------VKVNLKAAPVDDAANRECCDLFAKVLSVSSSRLT 71 Query: 66 MLSKQSSPLKIIYIDK-DCKEITELLQNN 93 +LS +SS K I ++ +E+ LL+ Sbjct: 72 ILSGKSSKNKTIKVEGLGAEEVALLLRPY 100 >gi|154150390|ref|YP_001404008.1| hypothetical protein Mboo_0847 [Candidatus Methanoregula boonei 6A8] gi|153998942|gb|ABS55365.1| protein of unknown function DUF167 [Methanoregula boonei 6A8] Length = 104 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 14/51 (27%), Positives = 28/51 (54%) Query: 35 KVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKE 85 +V A GKANKA++ +++ + +S+ +++ +S K + I KE Sbjct: 44 RVAAPAVNGKANKAVIGIISAGTGVPAASVTIVAGLTSSQKKVRIAGITKE 94 >gi|67078514|ref|NP_001019920.1| hypothetical protein LOC293059 [Rattus norvegicus] gi|81908918|sp|Q505I4|CO040_RAT RecName: Full=UPF0235 protein C15orf40 homolog gi|63100368|gb|AAH94529.1| Similar to RIKEN cDNA 3110040N11 [Rattus norvegicus] gi|149044058|gb|EDL97440.1| rCG63322 [Rattus norvegicus] gi|149057383|gb|EDM08706.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus] gi|149057387|gb|EDM08710.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus] Length = 126 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +E+ E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTE 122 >gi|67523595|ref|XP_659857.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4] gi|40744782|gb|EAA63938.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4] gi|259487644|tpe|CBF86471.1| TPA: DUF167 domain protein (AFU_orthologue; AFUA_5G06647) [Aspergillus nidulans FGSC A4] Length = 130 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 38/78 (48%), Gaps = 9/78 (11%) Query: 4 VIVRLIPNA--KKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + R+ PNA + GI ++ + + V A P+ G+AN A+ + AK ++K Sbjct: 32 ISCRVKPNASGGREGITAV-------GNETVDVCVAAVPRDGEANLAVSQVFAKVFNVAK 84 Query: 62 SSLRMLSKQSSPLKIIYI 79 S + ++ S K++ I Sbjct: 85 SDVGVIHGLKSRDKVLCI 102 >gi|304314780|ref|YP_003849927.1| hypothetical protein MTBMA_c10190 [Methanothermobacter marburgensis str. Marburg] gi|302588239|gb|ADL58614.1| conserved hypothetical protein [Methanothermobacter marburgensis str. Marburg] Length = 102 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 38/89 (42%), Gaps = 9/89 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P + E+ ++IKV A P+KGKAN+ ++ + ++ Sbjct: 13 VDIEVSPASG-----GFEVRSYNEWRKRIEIKVRAPPEKGKANREIIEEFSAAFN---TN 64 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 ++S S K + I D + LL+ Sbjct: 65 ADIVSGHKSRHKTLKIYGMDAETFRTLLE 93 >gi|114658629|ref|XP_001148978.1| PREDICTED: similar to Chromosome 15 open reading frame 40 [Pan troglodytes] Length = 283 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 193 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 245 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 246 SDVVLDKGGKSREKVVKLLASTTPEEI 272 >gi|115495117|ref|NP_001068854.1| hypothetical protein LOC509050 [Bos taurus] gi|122140809|sp|Q3ZBP8|CO040_BOVIN RecName: Full=UPF0235 protein C15orf40 homolog gi|73587092|gb|AAI03180.1| Chromosome 15 open reading frame 40 ortholog [Bos taurus] gi|296475539|gb|DAA17654.1| hypothetical protein LOC509050 [Bos taurus] Length = 126 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 41/93 (44%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 36 VSIAIHAKPGSKQNAVTDVTTEA-------VSVAIAAPPTEGEANAELCRYLSKVLELRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 S + + S K++ + +EI E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTPPEEILEKLKK 121 >gi|307824080|ref|ZP_07654307.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] gi|307734864|gb|EFO05714.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] Length = 108 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 37/78 (47%), Gaps = 8/78 (10%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+AK+ I ++ +K+ VTA P G+A M+ LAK+ ++ + ++ Sbjct: 26 QPSAKQDAIGKVK-------GNQLKVSVTAAPVAGRATDHMVRFLAKEFGVTPKDIEVVF 78 Query: 69 KQSSPLKIIYIDKDCKEI 86 + + K + I K K + Sbjct: 79 GRFNVNKQLRI-KSPKNL 95 >gi|197106855|ref|YP_002132232.1| hypothetical protein PHZ_c3394 [Phenylobacterium zucineum HLK1] gi|196480275|gb|ACG79803.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 86 Score = 51.5 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 43/77 (55%), Gaps = 2/77 (2%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P + I D + +K++V+A P G AN A++A+LAK L + KS++R+ Sbjct: 1 MTPKGGRDAIDGW--GADEAGRPVLKVRVSAAPADGAANAAVVALLAKALKVPKSAVRIA 58 Query: 68 SKQSSPLKIIYIDKDCK 84 + +++ +K + ID + Sbjct: 59 AGETARIKRLEIDGASE 75 >gi|13385576|ref|NP_080353.1| hypothetical protein LOC67290 [Mus musculus] gi|29839616|sp|Q9CRC3|CO040_MOUSE RecName: Full=UPF0235 protein C15orf40 homolog gi|12851848|dbj|BAB29184.1| unnamed protein product [Mus musculus] gi|12857526|dbj|BAB31031.1| unnamed protein product [Mus musculus] gi|12859380|dbj|BAB31634.1| unnamed protein product [Mus musculus] Length = 126 Score = 51.5 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P ++++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSRQNAVTDLSTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +E+ E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTE 122 >gi|145219218|ref|YP_001129927.1| hypothetical protein Cvib_0403 [Prosthecochloris vibrioformis DSM 265] gi|189040268|sp|A4SD66|Y403_PROVI RecName: Full=UPF0235 protein Cvib_0403 gi|145205382|gb|ABP36425.1| protein of unknown function DUF167 [Chlorobium phaeovibrioides DSM 265] Length = 100 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P A K+ ++ +KI + A P AN+ + A ++ + Sbjct: 17 VRVQPRASKTAVSGPYAGG-------LKITLKAAPVDDAANRECCRLFAGMFGIADGRVH 69 Query: 66 MLSKQSSPLKIIYIDK 81 ++S +SS K + ++ Sbjct: 70 VVSGRSSRSKSVMLEG 85 >gi|289615268|emb|CBI58035.1| unnamed protein product [Sordaria macrospora] Length = 133 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 38/76 (50%), Gaps = 5/76 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P A K + D + ++I V A ++G+ANK+++ +L++ L + KS+ Sbjct: 32 IHCHVKPGASKQR-EGVTCITDEA----VEICVAAQAKEGEANKSVVKVLSEALNIPKSN 86 Query: 64 LRMLSKQSSPLKIIYI 79 L + S K I + Sbjct: 87 LEITQGLKSRAKTIAV 102 >gi|148236990|ref|NP_001089221.1| hypothetical protein LOC734268 [Xenopus laevis] gi|57920974|gb|AAH89152.1| MGC85153 protein [Xenopus laevis] Length = 121 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 9/92 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ I + + + A P +G+AN + L+K L L K Sbjct: 31 VTISIHAKPGAKQNAITDVTADAVG-------VAIAAPPTEGEANAELCRYLSKVLVLKK 83 Query: 62 SSLRMLSKQSSPLKIIYIDKD--CKEITELLQ 91 S + + S K++ I + + E L+ Sbjct: 84 SEVSLDKGGKSREKVVKISASITPEVVLERLK 115 >gi|302868916|ref|YP_003837553.1| hypothetical protein Micau_4464 [Micromonospora aurantiaca ATCC 27029] gi|315504614|ref|YP_004083501.1| hypothetical protein ML5_3839 [Micromonospora sp. L5] gi|302571775|gb|ADL47977.1| protein of unknown function DUF167 [Micromonospora aurantiaca ATCC 27029] gi|315411233|gb|ADU09350.1| protein of unknown function DUF167 [Micromonospora sp. L5] Length = 101 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 40/90 (44%), Gaps = 3/90 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VR+ P A ++ + D + + V A G+A +A LA L + + Sbjct: 8 TVAVRVKPGAARARVGG---RFDGPYGPALVVAVHAPAVDGRATEAARRALADALGIRPA 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ + S +S K+ +++ + E+L+ Sbjct: 65 TVSLRSGAASRDKLFLVERPHDGLPEVLRR 94 >gi|20381446|gb|AAH27500.1| RIKEN cDNA 3110040N11 gene [Mus musculus] gi|74206730|dbj|BAE41614.1| unnamed protein product [Mus musculus] gi|148674975|gb|EDL06922.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus] gi|148674977|gb|EDL06924.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus] Length = 126 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P ++++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSRQNAVTDLSTEAVG-------VAIAAPPSQGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +E+ E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTE 122 >gi|297568085|ref|YP_003689429.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus AHT2] gi|296924000|gb|ADH84810.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus AHT2] Length = 76 Score = 51.1 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 38/77 (49%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P ++ S + D S ++ + P +G+AN+ ++A++A K+ Sbjct: 5 LQIKVKPRSQSSSLT---QEADGSWL----ARLKSPPVEGRANRELIALVADHFRCRKAD 57 Query: 64 LRMLSKQSSPLKIIYID 80 + + + S K++ ++ Sbjct: 58 VEIKAGSSGRTKLVRVE 74 >gi|326403567|ref|YP_004283649.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301] gi|325050429|dbj|BAJ80767.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301] Length = 131 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 37/80 (46%), Gaps = 2/80 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIK--VTATPQKGKANKAMLAMLAKKLALSK 61 V +++ P A+++ + + +++ V P+ G+AN A+L LA L + Sbjct: 23 VALKVQPGARRARLGPVVPAAAAPGWPPARLRLAVVVPPEDGRANDAVLKALAAWLGVGA 82 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 + L + + + K++ + Sbjct: 83 ARLALRAGGQARDKLVLVVG 102 >gi|269860472|ref|XP_002649957.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348] gi|220066644|gb|EED44119.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348] Length = 88 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 18/66 (27%), Positives = 35/66 (53%) Query: 14 KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSP 73 K ++S E + ++ I++ A P KAN ++A+L+K K ++ ++ ++S Sbjct: 21 KVKLSSRETAILCQEDDYLIIQIAAPPVDNKANNELIALLSKTYKTKKENISIIKGKTST 80 Query: 74 LKIIYI 79 KII I Sbjct: 81 TKIIKI 86 >gi|149690913|ref|XP_001498126.1| PREDICTED: similar to chromosome 15 open reading frame 40 [Equus caballus] Length = 167 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 77 VTIAIHAKPGSKQNAVTDVTAEA-------VSVAIAAPPSEGEANAELCRYLSKVLDLRK 129 Query: 62 SSLRMLSKQSSPLKIIYI 79 S + + S K++ + Sbjct: 130 SDVVLDKGGKSREKVVKL 147 >gi|73951609|ref|XP_545872.2| PREDICTED: similar to CG14966-PA [Canis familiaris] Length = 152 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 62 VTIAIRAKPGSKQNAVTDVTAEA-------VSVAIAAPPSEGEANAELCRYLSKVLELRK 114 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQN 92 S + + S K++ + +EI E L+ Sbjct: 115 SDVVLDKGGKSREKVVKLLASTTAEEILEKLKQ 147 >gi|148643663|ref|YP_001274176.1| hypothetical protein Msm_1603 [Methanobrevibacter smithii ATCC 35061] gi|148552680|gb|ABQ87808.1| conserved hypothetical protein Msm_1603 [Methanobrevibacter smithii ATCC 35061] Length = 101 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 13/92 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + PN+ K I+ ++ +I++ PQKGKANK ++ L+K Sbjct: 18 IDIEVSPNSNKFQISGFNEWRN-----RFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69 Query: 64 LRMLSKQSSPLKIIY-----IDKDCKEITELL 90 + + + S K I ID ++++E+L Sbjct: 70 VSISKGEKSSQKTIVCYNVSIDCILEKLSEIL 101 >gi|222444857|ref|ZP_03607372.1| hypothetical protein METSMIALI_00470 [Methanobrevibacter smithii DSM 2375] gi|222434422|gb|EEE41587.1| hypothetical protein METSMIALI_00470 [Methanobrevibacter smithii DSM 2375] Length = 101 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 13/92 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + PN+ K I+ ++ +I++ PQKGKANK ++ L+K Sbjct: 18 VDIEVSPNSNKFQISGFNEWRN-----RFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69 Query: 64 LRMLSKQSSPLKIIY-----IDKDCKEITELL 90 + + + S K I ID +++E+L Sbjct: 70 VSISKGEKSSQKTIVCYNVSIDDILDKLSEIL 101 >gi|327289069|ref|XP_003229247.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Anolis carolinensis] Length = 119 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 38/94 (40%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P +K++ + L I + A P G+AN + L+K L + K Sbjct: 30 VTIAVHAKPGSKQNAVTDLSAEAVG-------IAIAAPPSDGEANAELCRYLSKVLEVRK 82 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 SS + S K++ I +E+ + L+ Sbjct: 83 SSSLLKQGGRSREKLVKILAPLTPEEVLQKLRKE 116 >gi|148550788|ref|YP_001260227.1| hypothetical protein Swit_5352 [Sphingomonas wittichii RW1] gi|148503207|gb|ABQ71460.1| protein of unknown function DUF167 [Sphingomonas wittichii RW1] Length = 94 Score = 51.1 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 6/78 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ PNA I + D ++++ TATP+ GKAN+A+L +LA L Sbjct: 22 GRLPVRVAPNASADAI--ILATADG----ILQVRTTATPEGGKANEAVLRLLAAALGQPV 75 Query: 62 SSLRMLSKQSSPLKIIYI 79 S+L +L + KI+ + Sbjct: 76 SALELLRGSTGRNKIVRV 93 >gi|109082186|ref|XP_001111768.1| PREDICTED: UPF0235 protein C15orf40-like isoform 1 [Macaca mulatta] Length = 154 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 40/94 (42%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTITIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +EI E L+ Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKE 150 >gi|38345823|emb|CAD41928.2| OSJNBa0070M12.6 [Oryza sativa Japonica Group] Length = 207 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 120 ISIQAKPGSKLATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 172 Query: 64 LRMLSKQSSPLKIIYI 79 + + S S K++ + Sbjct: 173 VSIGSGSKSREKVVLV 188 >gi|146324075|ref|XP_001481499.1| DUF167 domain protein [Aspergillus fumigatus Af293] gi|129558081|gb|EBA27446.1| DUF167 domain protein [Aspergillus fumigatus Af293] gi|159126298|gb|EDP51414.1| YggU family protein [Aspergillus fumigatus A1163] Length = 133 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 36/80 (45%), Gaps = 9/80 (11%) Query: 4 VIVRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNA + GI ++ + + V A P+ G+AN A+ + A+ + K Sbjct: 34 IACHVKPNASSSREGIIAV-------GAEKVDVCVAAVPRNGEANTAVSRVFAQIFDVPK 86 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S++ ++ S K + I Sbjct: 87 SNVEVIRGLKSRDKTLCITN 106 >gi|291413562|ref|XP_002723040.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus] Length = 154 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 38/94 (40%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGAKQNAVTDLTAEA-------VSVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + EI L+ Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPDEILGKLKRE 150 >gi|251772211|gb|EES52781.1| conserved hypothetical protein [Leptospirillum ferrodiazotrophum] Length = 120 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 37/82 (45%), Gaps = 3/82 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ + P + L + + ++G+AN+ +L LA+ L +S SS Sbjct: 28 LVIEVKPAQSR---VFLRRGASHGPAPIWVVGIREPAREGQANEGVLRTLAEFLGVSPSS 84 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + ++S +S +K + + + Sbjct: 85 VALVSGHTSRIKRLSVRGIDES 106 >gi|159905985|ref|YP_001549647.1| hypothetical protein MmarC6_1603 [Methanococcus maripaludis C6] gi|226734735|sp|A9AAP2|Y1603_METM6 RecName: Full=UPF0235 protein MmarC6_1603 gi|159887478|gb|ABX02415.1| protein of unknown function DUF167 [Methanococcus maripaludis C6] Length = 101 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 47/91 (51%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKINEWR-----KRIEIRIKEQPIEGKANKAIIKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + + S +S K + I DK +++ ++L+ Sbjct: 67 ISINSGTTSSQKTVLIPDKTKEDVVKILKKE 97 >gi|50753027|ref|XP_413838.1| PREDICTED: similar to Chromosome 15 open reading frame 40 [Gallus gallus] Length = 158 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 22/96 (22%), Positives = 39/96 (40%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V VR P ++ S + + + + A P +G+AN + L+K L + K Sbjct: 69 VRVSVRAKPGSRCSAVTDVTAEAVG-------VAIAAPPSEGEANAELCRYLSKVLGVKK 121 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ I E+ E L+ S Sbjct: 122 SDVILEKGGKSRDKVVKILVSVTPDEVLEKLKKEAS 157 >gi|145494428|ref|XP_001433208.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124400325|emb|CAK65811.1| unnamed protein product [Paramecium tetraurelia] Length = 99 Score = 50.7 bits (120), Expect = 6e-05, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 8/86 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P +K + E + +K A P AN+ ++ ML++KL++ +SS Sbjct: 20 LQLFVKPKSKSEMLEFSE--------EFIIVKTKAQPIDNAANEDVIRMLSEKLSIDQSS 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 ++++ Q S K ++I+ + + +L Sbjct: 72 IKIVKGQQSKYKTVFIENEMQAYNKL 97 >gi|225431557|ref|XP_002282176.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|147866968|emb|CAN83055.1| hypothetical protein VITISV_009894 [Vitis vinifera] Length = 128 Score = 50.7 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 36/79 (45%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ V P +K S I + + +++ A + G+AN A+L ++ + + + Sbjct: 39 VSITVHAKPGSKVSSITDFDDEA-------LGVQIDAPAKDGEANAALLDYISSVVGVKR 91 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + S S K++ ++ Sbjct: 92 RQVSISSGSKSRDKVVIVE 110 >gi|189913153|ref|YP_001965041.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913490|ref|YP_001964718.1| hypothetical protein LEPBI_p0042 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777829|gb|ABZ96128.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781558|gb|ABZ99854.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 75 Score = 50.7 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 41/77 (53%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++++ N I SLE +T +K + P KGKAN+ ++ +L+K ++K Sbjct: 2 KIVIKVKSN---QKIQSLEFKSETECIAKLK----SLPVKGKANQELVGLLSKHYGVTKK 54 Query: 63 SLRMLSKQSSPLKIIYI 79 ++++S S +K + I Sbjct: 55 EIQIISGHFSNIKTVEI 71 >gi|323356791|ref|YP_004223187.1| hypothetical protein MTES_0343 [Microbacterium testaceum StLB037] gi|323273162|dbj|BAJ73307.1| uncharacterized conserved protein [Microbacterium testaceum StLB037] Length = 73 Score = 50.4 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 37/77 (48%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P +++ + +DT++ + + ++ A G AN ++ LA + Sbjct: 3 LTVRVKPGSRRGPLV-----EDTAEGLVVHVRERAV--DGAANSGVVKALAAHFGVPARD 55 Query: 64 LRMLSKQSSPLKIIYID 80 + +L ++ +K + +D Sbjct: 56 VEILRGHAARVKRVEVD 72 >gi|150402236|ref|YP_001329530.1| hypothetical protein MmarC7_0309 [Methanococcus maripaludis C7] gi|166229378|sp|A6VG03|Y309_METM7 RecName: Full=UPF0235 protein MmarC7_0309 gi|150033266|gb|ABR65379.1| protein of unknown function DUF167 [Methanococcus maripaludis C7] Length = 101 Score = 50.4 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 47/91 (51%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKINEWR-----KRIEIRIREQPIEGKANKAIVKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + + S +S K + I DK +++ ++L+ Sbjct: 67 IFINSGTTSSQKTVLIPDKTKEDVVKILKKE 97 >gi|261350574|ref|ZP_05975991.1| conserved hypothetical protein [Methanobrevibacter smithii DSM 2374] gi|288861357|gb|EFC93655.1| conserved hypothetical protein [Methanobrevibacter smithii DSM 2374] Length = 101 Score = 50.4 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 13/92 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + PN+ K I+ ++ +I++ PQKGKANK ++ L+K Sbjct: 18 VDIEVSPNSNKFQISGFNEWRN-----RFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69 Query: 64 LRMLSKQSSPLKIIY-----IDKDCKEITELL 90 + + + S K I I+ ++ E+L Sbjct: 70 VSISKGEKSSQKTIVCYDVSIEYILDKLGEIL 101 >gi|331695278|ref|YP_004331517.1| hypothetical protein Psed_1425 [Pseudonocardia dioxanivorans CB1190] gi|326949967|gb|AEA23664.1| UPF0235 protein yggU [Pseudonocardia dioxanivorans CB1190] Length = 89 Score = 50.4 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 40/89 (44%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P A ++ + + ++V G A A L +A + + Sbjct: 7 RLTIRVRPGASRTSVGGAYADA-------LVVRVNERAVDGAATAAALRAVAGAFGVPRG 59 Query: 63 SLRMLSKQSSPLKIIYID-KDCKEITELL 90 ++R++S +S KI+ ++ D + + ELL Sbjct: 60 AVRLVSGATSRTKILDVEGGDPERLRELL 88 >gi|221059313|ref|XP_002260302.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] gi|193810375|emb|CAQ41569.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] Length = 176 Score = 50.4 bits (119), Expect = 8e-05, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 35/78 (44%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PNAK + I SD + I + P ++N A++ + L L K Sbjct: 90 INLRVKPNAKNTSI------YFNSDREVLNINIQEQPINNQSNIAIIGYFSDILDLKKRD 143 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++S S K++ + Sbjct: 144 ISIVSGLKSRDKVLMVSN 161 >gi|296242831|ref|YP_003650318.1| hypothetical protein Tagg_1097 [Thermosphaera aggregans DSM 11486] gi|296095415|gb|ADG91366.1| protein of unknown function DUF167 [Thermosphaera aggregans DSM 11486] Length = 111 Score = 50.4 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 40/93 (43%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P + I D + + + ++G+AN +++ L+++L + Sbjct: 23 VILQVRVKPGS---EPEGFTIESDE-----LVFRTSEPAERGRANASLIKYLSRELKIPV 74 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE-ITELLQNN 93 S + ++ Q LK + I + + I E L Sbjct: 75 SKIDIVYGQREKLKKVLIMDEPADKIIEKLARV 107 >gi|330469281|ref|YP_004407024.1| hypothetical protein VAB18032_26756 [Verrucosispora maris AB-18-032] gi|328812252|gb|AEB46424.1| hypothetical protein VAB18032_26756 [Verrucosispora maris AB-18-032] Length = 98 Score = 50.4 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 3/78 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VR+ P A ++ + D + I V A G+A +A LA+ L + + Sbjct: 7 TVAVRVKPGASRARVGG---RYDGPHGPALVIAVNAPAVDGRATEAARRALAEALGVRPA 63 Query: 63 SLRMLSKQSSPLKIIYID 80 ++ + + SS K+ I Sbjct: 64 TVSLRTGASSRDKLFQIT 81 >gi|300871287|ref|YP_003786160.1| hypothetical protein BP951000_1678 [Brachyspira pilosicoli 95/1000] gi|300688988|gb|ADK31659.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000] Length = 83 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 21/85 (24%), Positives = 42/85 (49%), Gaps = 8/85 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 N+ V++ +AK + + I+++A GKANKA++ L+ +L L K Sbjct: 2 NIEVKVTASAKSNSFK--------KENGIYYIRISAKAIDGKANKAIIDFLSSELNLKKK 53 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEIT 87 + +L + S K+I ++ D ++ Sbjct: 54 DVEILKGEKSSKKLISLNIDEYKLE 78 >gi|302916303|ref|XP_003051962.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI 77-13-4] gi|256732901|gb|EEU46249.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI 77-13-4] Length = 75 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 40/76 (52%), Gaps = 5/76 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A K+ + + D +++ V A ++G+ANKA++ +L+ L + KSS Sbjct: 5 LQLHVKPGASKNREGVIAVTDDA-----IELCVAAQAREGEANKAVVQVLSSVLGVPKSS 59 Query: 64 LRMLSKQSSPLKIIYI 79 L++ S K + + Sbjct: 60 LQLTHGLKSRDKTVVL 75 >gi|156100181|ref|XP_001615818.1| hypothetical protein [Plasmodium vivax SaI-1] gi|148804692|gb|EDL46091.1| hypothetical protein, conserved [Plasmodium vivax] Length = 178 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 35/78 (44%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PNAK + I SD + I + P ++N A++ + L L K Sbjct: 92 INLRVKPNAKNTSI------YFNSDREVLNINIQEQPINNQSNVAIIGYFSDILDLKKRD 145 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++S S K++ + Sbjct: 146 ISIVSGLKSRDKVLMVSN 163 >gi|52345734|ref|NP_001004913.1| chromosome 15 open reading frame 40 [Xenopus (Silurana) tropicalis] gi|49523245|gb|AAH75357.1| MGC89060 protein [Xenopus (Silurana) tropicalis] Length = 120 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 38/92 (41%), Gaps = 9/92 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ I + + + A P +G+AN + L+K L L K Sbjct: 30 VIISIHAKPGAKQNAITDVTADAVG-------VAIAAPPTEGEANAELCRYLSKVLVLKK 82 Query: 62 SSLRMLSKQSSPLKIIYIDKD--CKEITELLQ 91 S + + S K++ I + + E L+ Sbjct: 83 SEVSLDKGGKSREKVVKISASITPEVVLEKLK 114 >gi|238494338|ref|XP_002378405.1| DUF167 domain protein [Aspergillus flavus NRRL3357] gi|220695055|gb|EED51398.1| DUF167 domain protein [Aspergillus flavus NRRL3357] Length = 131 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 9/75 (12%) Query: 8 LIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 + PNA + GI ++ + + V A P+ G+AN A+ + A+ L + KS++ Sbjct: 37 VKPNASANREGIIAV-------GPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKSTVD 89 Query: 66 MLSKQSSPLKIIYID 80 ++ S K + + Sbjct: 90 VIRGLKSRDKTLCVS 104 >gi|147921111|ref|YP_685078.1| hypothetical protein RCIX287 [uncultured methanogenic archaeon RC-I] gi|56295557|emb|CAH04800.1| conserved hypothetical protein [uncultured archaeon] gi|110620474|emb|CAJ35752.1| conserved hypothetical protein [uncultured methanogenic archaeon RC-I] Length = 104 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 40/81 (49%), Gaps = 6/81 (7%) Query: 8 LIPNAKKSGI-ASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRM 66 + P AK + + + + + +++K+ A P++G+AN+ ++ L+ L SS+ + Sbjct: 20 VTPGAKSTVVPSGYSVWR-----KRIEVKLKAPPERGRANEELIEALSDLFHLPASSIEI 74 Query: 67 LSKQSSPLKIIYIDKDCKEIT 87 + ++ K I + ++ Sbjct: 75 TAGATNSRKSIKVHGISTDVV 95 >gi|116754806|ref|YP_843924.1| hypothetical protein Mthe_1512 [Methanosaeta thermophila PT] gi|116754813|ref|YP_843931.1| hypothetical protein Mthe_1519 [Methanosaeta thermophila PT] gi|116754820|ref|YP_843938.1| hypothetical protein Mthe_1526 [Methanosaeta thermophila PT] gi|116666257|gb|ABK15284.1| protein of unknown function DUF167 [Methanosaeta thermophila PT] gi|116666264|gb|ABK15291.1| protein of unknown function DUF167 [Methanosaeta thermophila PT] gi|116666271|gb|ABK15298.1| protein of unknown function DUF167 [Methanosaeta thermophila PT] Length = 122 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 13/63 (20%), Positives = 35/63 (55%), Gaps = 1/63 (1%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 +++++T KGKAN+ +L +++ L++ + ++S S K++ + D + + L Sbjct: 58 IEVRLTERADKGKANRQLLQEISRAFGLTQGDVEIMSGHRSQRKVVLMRGIDAESVLAAL 117 Query: 91 QNN 93 + + Sbjct: 118 RRS 120 >gi|224062575|ref|XP_002197110.1| PREDICTED: hypothetical protein, partial [Taeniopygia guttata] Length = 138 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 2/64 (3%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEITELLQ 91 + + A P +G+AN + L+K L + KS + + S K++ I EI E L+ Sbjct: 74 VAIAAPPSEGEANAELCRYLSKVLEVKKSDVILEKGGKSRDKVVKISVSATPDEILEKLK 133 Query: 92 NNDS 95 S Sbjct: 134 KEAS 137 >gi|169777211|ref|XP_001823071.1| yggU family protein [Aspergillus oryzae RIB40] gi|83771808|dbj|BAE61938.1| unnamed protein product [Aspergillus oryzae] Length = 131 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 36/75 (48%), Gaps = 9/75 (12%) Query: 8 LIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 + PNA + GI ++ + + V A P+ G+AN A+ + A+ L + KS++ Sbjct: 37 VKPNASANREGIIAV-------GPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKSTVV 89 Query: 66 MLSKQSSPLKIIYID 80 ++ S K + + Sbjct: 90 VIRGLKSRDKTLCVS 104 >gi|145509569|ref|XP_001440723.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124407951|emb|CAK73326.1| unnamed protein product [Paramecium tetraurelia] Length = 99 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 44/86 (51%), Gaps = 8/86 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P +K + E + K A P +AN+ ++ ML++KL++ +SS Sbjct: 20 LQLFVKPKSKAEMLEFSE--------EFVIAKTKAQPIDNEANEDVIRMLSEKLSIDQSS 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 ++++ Q S K ++I+ + + +L Sbjct: 72 IKIVKGQQSKYKTVFIENEMQAYNKL 97 >gi|310801877|gb|EFQ36770.1| hypothetical protein GLRG_11914 [Glomerella graminicola M1.001] Length = 92 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 36/73 (49%), Gaps = 5/73 (6%) Query: 7 RLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRM 66 R+ P A + + D +++ V A ++G+AN+A++ +L++ L L KS L + Sbjct: 25 RVKPGASRVR-EGIVAVTDGG----VELCVAAQAREGEANRAVIKLLSEILGLPKSDLII 79 Query: 67 LSKQSSPLKIIYI 79 S K + + Sbjct: 80 SQGLKSRDKTVAV 92 >gi|284161412|ref|YP_003400035.1| hypothetical protein Arcpr_0292 [Archaeoglobus profundus DSM 5631] gi|284011409|gb|ADB57362.1| protein of unknown function DUF167 [Archaeoglobus profundus DSM 5631] Length = 97 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 35/78 (44%), Gaps = 8/78 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + PNAK++ I + + +K+ V + P+ GKAN+ + L + Sbjct: 10 IEVDVSPNAKRTAITGYDPWR-----KALKVSVKSPPRGGKANRELTDFLG---GIFNCK 61 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++ + S K + + Sbjct: 62 VEIVKGEKSTKKTVLLKG 79 >gi|290996548|ref|XP_002680844.1| predicted protein [Naegleria gruberi] gi|284094466|gb|EFC48100.1| predicted protein [Naegleria gruberi] Length = 73 Score = 49.6 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 38/78 (48%), Gaps = 7/78 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + PN+ S IA++ + + + A P++G+ANK + ++ L +SKS Sbjct: 3 RLTILAKPNSSSSQIANINDEEIG-------VHIAAPPKEGEANKELCDYVSGVLGVSKS 55 Query: 63 SLRMLSKQSSPLKIIYID 80 + + S K++ ++ Sbjct: 56 RVTLDRGGKSRHKLLLVE 73 >gi|124515876|gb|EAY57385.1| conserved hypothetical protein [Leptospirillum rubarum] Length = 104 Score = 49.6 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 13/92 (14%) Query: 2 CNVIVRLIPNAKKS--GIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 V +R+ P KK ++ E D ++A ++G AN +L L+ LA Sbjct: 12 GRVRIRVRPGKKKDVLLLSGEEFSAD----------LSAPAREGAANDRLLRNLSYWLAW 61 Query: 60 SKSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S++R+ +SS LK I I +E+ + L Sbjct: 62 PVSNIRIEKGESSRLKTIAIRGMTGEEVRKRL 93 >gi|86740601|ref|YP_481001.1| hypothetical protein Francci3_1896 [Frankia sp. CcI3] gi|86567463|gb|ABD11272.1| protein of unknown function DUF167 [Frankia sp. CcI3] Length = 97 Score = 49.6 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 17/94 (18%), Positives = 41/94 (43%), Gaps = 2/94 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++R+ P A ++ + +D + ++VT G+A +A L L L + + Sbjct: 2 RLMIRVQPGAGRTAVGG--RREDPLHGPLLIVRVTEPAVDGRATEAALRALCAALRVRRG 59 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 +R++ + +K + ++ + L D L Sbjct: 60 DVRLVRGATGRVKAVEVEVAAVDEPALRIRIDEL 93 >gi|91788526|ref|YP_549478.1| hypothetical protein Bpro_2664 [Polaromonas sp. JS666] gi|91697751|gb|ABE44580.1| protein of unknown function DUF167 [Polaromonas sp. JS666] Length = 103 Score = 49.6 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 8/82 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P A + I + +K+ VTA P+ GKA M+ LA ++ + + ++ Sbjct: 27 KPAASRDAIGKPK-------GTQLKVSVTAAPKSGKATDHMVRFLAPLFGVAVADIEVVF 79 Query: 69 KQSSPLKIIYIDKDCKEITELL 90 Q + K + I K K++ E+ Sbjct: 80 GQENVNKQLRI-KAPKKLPEVF 100 >gi|322418791|ref|YP_004198014.1| hypothetical protein GM18_1269 [Geobacter sp. M18] gi|320125178|gb|ADW12738.1| protein of unknown function DUF167 [Geobacter sp. M18] Length = 105 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 37/78 (47%), Gaps = 8/78 (10%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I ++ + + VT P+ G+A M+ LA++ +S ++++ Sbjct: 21 TPNAKRDAIGKVK-------GHQLCVSVTEFPRAGRATDHMVRFLAEEFGVSTGDIQVVF 73 Query: 69 KQSSPLKIIYIDKDCKEI 86 + + K + I K K + Sbjct: 74 GRMNVNKQLRI-KAPKRL 90 >gi|206602120|gb|EDZ38602.1| Conserved hypothetical protein [Leptospirillum sp. Group II '5-way CG'] Length = 102 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 42/92 (45%), Gaps = 13/92 (14%) Query: 2 CNVIVRLIPNAKKSGI--ASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 V +R+ P KK + + E D ++A ++G AN +L L++ LA Sbjct: 10 GRVRIRVRPGKKKDVLFLSGEEFSAD----------LSAPAREGAANDRLLRNLSQWLAW 59 Query: 60 SKSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S +R+ Q+S LK I I +EI + L Sbjct: 60 PVSKIRLEKGQASRLKTIVIAGMTGEEIRKRL 91 >gi|296204187|ref|XP_002749223.1| PREDICTED: UPF0235 protein C15orf40-like [Callithrix jacchus] Length = 154 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGSKQNAVTDLTAEA-------INVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 117 SDVVLDKGCKSREKVVKLLASTTPEEI 143 >gi|237858662|ref|NP_653198.2| hypothetical protein LOC123207 isoform a [Homo sapiens] gi|119582835|gb|EAW62431.1| chromosome 15 open reading frame 40, isoform CRA_a [Homo sapiens] Length = 153 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 116 SDVVLDKGGKSREKVVKLLASTTPEEI 142 >gi|312197091|ref|YP_004017152.1| hypothetical protein FraEuI1c_3270 [Frankia sp. EuI1c] gi|311228427|gb|ADP81282.1| protein of unknown function DUF167 [Frankia sp. EuI1c] Length = 97 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 37/83 (44%), Gaps = 2/83 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P A + + D + + ++V A GKA +A L LA L L ++ Sbjct: 2 RVTIRVRPGASGTAVGGELGGPDGEPS--LVVRVCARAVDGKATEAALRALADALGLRRA 59 Query: 63 SLRMLSKQSSPLKIIYIDKDCKE 85 + ++ +S K++ I + Sbjct: 60 DVSLVHGATSRTKLVEIAASPAD 82 >gi|326504128|dbj|BAK02850.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 129 Score = 48.8 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 26/52 (50%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + +++ A + G+AN A++ ++ L + K + + S S K++ + Sbjct: 60 GEEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLV 111 >gi|167932988|ref|ZP_02520075.1| hypothetical protein cdivTM7_00062 [candidate division TM7 single-cell isolate TM7b] gi|169836860|ref|ZP_02870048.1| hypothetical protein cdivTM_07094 [candidate division TM7 single-cell isolate TM7a] Length = 59 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 15/54 (27%), Positives = 29/54 (53%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 D + + A +G+AN A + +LAK ++ S +++L +S K+ +DK Sbjct: 4 DDGVLTVYTKAPAIEGRANLATVKLLAKYFGVASSKVKLLRGAASKYKVFEVDK 57 >gi|68074573|ref|XP_679202.1| hypothetical protein [Plasmodium berghei strain ANKA] gi|56499889|emb|CAH95244.1| conserved hypothetical protein [Plasmodium berghei] Length = 106 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 42/91 (46%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PN+K + I DT + I + P ++N A+++ + L L K Sbjct: 20 INLRVKPNSKNTSI------YFNVDTEVLNINIQEQPVNNQSNVAIISYFSDILNLKKRD 73 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + +++ S K++ + ++++ + N Sbjct: 74 ISIVAGLKSRDKVLMVSNISVEDLSNKINQN 104 >gi|29839587|sp|Q8WUR7|CO040_HUMAN RecName: Full=UPF0235 protein C15orf40 gi|18043732|gb|AAH19820.1| Chromosome 15 open reading frame 40 [Homo sapiens] gi|189053288|dbj|BAG35094.1| unnamed protein product [Homo sapiens] Length = 126 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEI 115 >gi|168032813|ref|XP_001768912.1| predicted protein [Physcomitrella patens subsp. patens] gi|162679824|gb|EDQ66266.1| predicted protein [Physcomitrella patens subsp. patens] Length = 123 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 36/78 (46%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P +K S I + +++ A ++G+AN A+L +A+ L + + Sbjct: 38 ITVHAKPGSKLSAITDTDDGAVG-------VQIDAPAREGEANAALLEYIAEVLGIKRRQ 90 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + S S K++ ++ Sbjct: 91 VSLGSGSRSREKLVTVEG 108 >gi|20093713|ref|NP_613560.1| hypothetical protein MK0273 [Methanopyrus kandleri AV19] gi|29839574|sp|Q8TYM3|Y273_METKA RecName: Full=UPF0235 protein MK0273 gi|19886604|gb|AAM01490.1| Uncharacterized conserved protein [Methanopyrus kandleri AV19] Length = 96 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P+A + + ++ + +++ V A P KGKAN+ +L L +KL ++ Sbjct: 14 IRVRVNPDADTTDLKGVDEWR-----GVLEVDVAAPPVKGKANRELLEFLGRKLN---TT 65 Query: 64 LRMLSKQSSPLK-IIYIDKDCKEITELL 90 ++S + S K ++ D E+ E L Sbjct: 66 CELVSGEKSREKLVLARDVSVDEVKERL 93 >gi|119498429|ref|XP_001265972.1| YggU family protein [Neosartorya fischeri NRRL 181] gi|119414136|gb|EAW24075.1| YggU family protein [Neosartorya fischeri NRRL 181] Length = 133 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 9/79 (11%) Query: 4 VIVRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNA + GI ++ + + V A P+ G+AN A+ + A+ + K Sbjct: 34 IACHVKPNASSSREGIIAV-------GAEKVDVCVAAVPRNGEANAAVSRVFAQIFDVPK 86 Query: 62 SSLRMLSKQSSPLKIIYID 80 S+ ++ S K + I Sbjct: 87 SNAEVIRGLKSRDKTLCIT 105 >gi|148674974|gb|EDL06921.1| RIKEN cDNA 3110040N11, isoform CRA_c [Mus musculus] Length = 114 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 2/60 (3%) Query: 36 VTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 VTA P +G+AN + L+K L L KS + + S K++ + +E+ E L+ Sbjct: 51 VTAPPSQGEANAELCRYLSKVLDLRKSDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTE 110 >gi|328772106|gb|EGF82145.1| hypothetical protein BATDEDRAFT_86894 [Batrachochytrium dendrobatidis JAM81] Length = 128 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 34/73 (46%), Gaps = 7/73 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P K S + ++ I++ A ++G+AN ++ +A L L K + ++ Sbjct: 45 VKPGTKVSQVIDIQGDAVG-------IQIAAVAREGEANAELIQTVADVLKLRKYQVAIV 97 Query: 68 SKQSSPLKIIYID 80 + S K++ ID Sbjct: 98 AGHKSRTKVLKID 110 >gi|154309869|ref|XP_001554267.1| predicted protein [Botryotinia fuckeliana B05.10] gi|150851643|gb|EDN26836.1| predicted protein [Botryotinia fuckeliana B05.10] Length = 130 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 44/89 (49%), Gaps = 8/89 (8%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + PN+ S + ++ DTS + +++ V + +ANK ++ +L+ L K+ L ++ Sbjct: 29 VKPNSSASRV-GVKAFSDTSSS--IEVCVAQPARDNEANKGVVEVLSHILKCPKTDLEVI 85 Query: 68 SKQSSPLKIIYIDKDCKEITELLQNNDSL 96 ++S KII E L NDSL Sbjct: 86 RGKTSKNKIIAYKG-----IEDLLANDSL 109 >gi|326522380|dbj|BAK07652.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326531514|dbj|BAJ97761.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 129 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 26/52 (50%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + +++ A + G+AN A++ ++ L + K + + S S K++ + Sbjct: 60 GEEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLV 111 >gi|242077752|ref|XP_002448812.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor] gi|241939995|gb|EES13140.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor] Length = 131 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 26/52 (50%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + +++ A + G+AN A++ ++ L + K + + S S K++ + Sbjct: 61 GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLV 112 >gi|196001449|ref|XP_002110592.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens] gi|190586543|gb|EDV26596.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens] Length = 132 Score = 48.4 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 43/87 (49%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P +K++ + +I D + I++ A ++G+AN ++ L+ L + KSS Sbjct: 43 ITIKAKPGSKENAVT--DISSDG-----IGIQIAAPAREGEANSELIKFLSSILKVKKSS 95 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S K I ++K+ + + Sbjct: 96 ILLDKGSKSRHKTICVNKNADLTEKQV 122 >gi|89899559|ref|YP_522030.1| hypothetical protein Rfer_0749 [Rhodoferax ferrireducens T118] gi|89344296|gb|ABD68499.1| protein of unknown function DUF167 [Rhodoferax ferrireducens T118] Length = 114 Score = 48.4 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 30/72 (41%), Gaps = 7/72 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A K I +K+ V A P G+A M+ LA + +S S + ++ Sbjct: 28 KPSASKDAIGK-------PFGKQLKVSVAAAPVAGRATDHMVRFLAVQFGVSASDIEVVF 80 Query: 69 KQSSPLKIIYID 80 + + K + I Sbjct: 81 GRMNVNKQVRIK 92 >gi|324527094|gb|ADY48748.1| Unknown [Ascaris suum] Length = 181 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 46/93 (49%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + PNAK S + + + +++ + A P KG+AN+A+ +A+ L L K+ Sbjct: 92 LKIHAKPNAKISRVTEI-------NETEIEVAIAAPPHKGQANEALTDAIAEILGLRKND 144 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQNN 93 + + S K++ I+ +E+ E L+ + Sbjct: 145 VFFDTGARSRSKLLVINSQRITVEEVREKLKKS 177 >gi|149057381|gb|EDM08704.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus] gi|149057386|gb|EDM08709.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus] Length = 114 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 18/60 (30%), Positives = 30/60 (50%), Gaps = 2/60 (3%) Query: 36 VTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 VTA P +G+AN + L+K L L KS + + S K++ + +E+ E L+ Sbjct: 51 VTAPPSEGEANAELCRYLSKVLDLRKSDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTE 110 >gi|74318638|ref|YP_316378.1| hypothetical protein Tbd_2620 [Thiobacillus denitrificans ATCC 25259] gi|74058133|gb|AAZ98573.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC 25259] Length = 106 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 7/72 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+AK I + +K+ VTA P+ G+A M+ LA + ++ S++ ++ Sbjct: 22 KPSAKVDAIGKPK-------GHQLKVSVTAAPRAGRATDHMVRFLADEFGVATSAIEVVF 74 Query: 69 KQSSPLKIIYID 80 + + K + I Sbjct: 75 GRMNVNKQLRIK 86 >gi|326926706|ref|XP_003209539.1| PREDICTED: UPF0235 protein C15orf40 homolog [Meleagris gallopavo] Length = 85 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 2/69 (2%) Query: 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEI 86 + + + A P +G+AN + L+K L + KS + + S K++ I E+ Sbjct: 16 AEAVGVAIAAPPSEGEANAELCRYLSKVLQVKKSDVILEKGGKSRDKVVKILVSLTPDEV 75 Query: 87 TELLQNNDS 95 E L+ S Sbjct: 76 LEKLKKEAS 84 >gi|226496211|ref|NP_001150562.1| LOC100284194 [Zea mays] gi|195640232|gb|ACG39584.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea mays] Length = 129 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 26/52 (50%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + +++ A + G+AN A++ ++ L + K + + S S K++ + Sbjct: 59 GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLV 110 >gi|45358618|ref|NP_988175.1| hypothetical protein MMP1055 [Methanococcus maripaludis S2] gi|74554395|sp|Q6LYD6|Y1055_METMP RecName: Full=UPF0235 protein MMP1055 gi|45047484|emb|CAF30611.1| conserved hypothetical protein [Methanococcus maripaludis S2] Length = 101 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 8/82 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKINEWR-----KRIEIRIKEQPIEGKANKAIIKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + S +S K + I K+ Sbjct: 67 ILINSGTTSSQKTVLIPDKTKD 88 >gi|307353453|ref|YP_003894504.1| hypothetical protein Mpet_1306 [Methanoplanus petrolearius DSM 11571] gi|307156686|gb|ADN36066.1| protein of unknown function DUF167 [Methanoplanus petrolearius DSM 11571] Length = 105 Score = 48.0 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 5/90 (5%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P +KK+ + ++ ++ + +GKAN ++ +A ++ K Sbjct: 14 VQISLDVSPGSKKT----VFPAGYNEWRNAIECRIKSPATEGKANAEIIKTIADYFSVKK 69 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + ++S S K I I ++ E L Sbjct: 70 SDVVIVSGAISGQKKIKISGVSLEDALERL 99 >gi|283778809|ref|YP_003369564.1| hypothetical protein Psta_1020 [Pirellula staleyi DSM 6068] gi|283437262|gb|ADB15704.1| protein of unknown function DUF167 [Pirellula staleyi DSM 6068] Length = 102 Score = 48.0 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 28/97 (28%), Positives = 47/97 (48%), Gaps = 12/97 (12%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V+ AKK+ + +KI VT P+KGKAN A+ +LA LA+ +S+ Sbjct: 13 VGVKAQAAAKKNSLRG-------EHAGLLKISVTTAPEKGKANDAIADLLAAALAVRRSA 65 Query: 64 LRMLSKQSSPLKIIYIDKDC-----KEITELLQNNDS 95 +++++ + PLK I ++I L+N Sbjct: 66 VQIVAGHTQPLKKFLISGASLDEVREKIARALENAQP 102 >gi|156373856|ref|XP_001629526.1| predicted protein [Nematostella vectensis] gi|156216528|gb|EDO37463.1| predicted protein [Nematostella vectensis] Length = 133 Score = 48.0 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + P AK++ I L + +++ A P++G+AN ++ ++ + KSS Sbjct: 44 VKIHAKPGAKQNRITELSPD-------FVGVQIAAQPKEGEANDELVRYMSSVFGVKKSS 96 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + S KII I + + Sbjct: 97 VTLDKGAKSRDKIIRISSSGLTLKQA 122 >gi|330507436|ref|YP_004383864.1| hypothetical protein MCON_1367 [Methanosaeta concilii GP-6] gi|328928244|gb|AEB68046.1| hypothetical protein MCON_1367 [Methanosaeta concilii GP-6] Length = 105 Score = 47.7 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 16/84 (19%), Positives = 42/84 (50%), Gaps = 4/84 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + ++P + + + S P ++ ++T P +G+AN+ ++ LA+ L + + Sbjct: 14 CIIRFEVVPGSSRLAVPSGFNPW----RRSLEARLTEKPSRGRANRQLVEELARILGVDE 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 S + ++ + S K++ + K+ Sbjct: 70 SGIEVIKGEKSGRKLLLVKGIEKD 93 >gi|118430921|ref|NP_147028.2| hypothetical protein APE_0182.1 [Aeropyrum pernix K1] gi|150421715|sp|Q9YFR7|Y182_AERPE RecName: Full=UPF0235 protein APE_0182.1 gi|116062246|dbj|BAA79094.2| conserved hypothetical protein [Aeropyrum pernix K1] Length = 108 Score = 47.7 bits (112), Expect = 5e-04, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 38/93 (40%), Gaps = 10/93 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P ++ + E + P +G+AN +++ LA+ L +S Sbjct: 19 VRIRVYVKPEGRERRLRLEEGE--------LVFYTDEPPLEGRANASLINFLARGLKVSV 70 Query: 62 SSLRMLSKQSSPLKIIYIDK--DCKEITELLQN 92 ++ ++ S K++ I D + E L + Sbjct: 71 KNIEIVHGARSRSKVVEIRDVADPDALLERLAS 103 >gi|224131202|ref|XP_002328480.1| predicted protein [Populus trichocarpa] gi|222838195|gb|EEE76560.1| predicted protein [Populus trichocarpa] Length = 131 Score = 47.7 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I L +++ A + G+AN A+L ++ L + + Sbjct: 44 ITIHAKPGSKSASITDLSDEAVG-------VQIDAPAKDGEANAALLDYISSVLGVKRRQ 96 Query: 64 LRMLSKQSSPLKIIYID 80 + + S S K++ ++ Sbjct: 97 VSIGSGSKSRDKVVIVE 113 >gi|156039347|ref|XP_001586781.1| predicted protein [Sclerotinia sclerotiorum 1980] gi|154697547|gb|EDN97285.1| predicted protein [Sclerotinia sclerotiorum 1980 UF-70] Length = 128 Score = 47.7 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 37/77 (48%), Gaps = 4/77 (5%) Query: 3 NVIVR--LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + +R + PN S +++ DTS ++I V + +AN+ ++ +L L Sbjct: 19 TIHLRCHVKPNVSASRAGAIKPFTDTSPI--LEICVPEPARDNEANEGVIELLGDVLLCP 76 Query: 61 KSSLRMLSKQSSPLKII 77 K+ L ++ + S K+I Sbjct: 77 KTDLEVIRGKKSRDKVI 93 >gi|223975045|gb|ACN31710.1| unknown [Zea mays] Length = 95 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 26/52 (50%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + +++ A + G+AN A++ ++ L + K + + S S K++ + Sbjct: 25 GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLV 76 >gi|134045581|ref|YP_001097067.1| hypothetical protein MmarC5_0538 [Methanococcus maripaludis C5] gi|166227318|sp|A4FXC3|Y538_METM5 RecName: Full=UPF0235 protein MmarC5_0538 gi|132663206|gb|ABO34852.1| protein of unknown function DUF167 [Methanococcus maripaludis C5] Length = 101 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 46/91 (50%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKINEWR-----KRIEIRIKEQPIEGKANKAIMKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + + S +S K + I DK +I ++L+ Sbjct: 67 ILINSGTTSAQKTVLIPDKTKDDIVKILKKE 97 >gi|121603294|ref|YP_980623.1| hypothetical protein Pnap_0379 [Polaromonas naphthalenivorans CJ2] gi|120592263|gb|ABM35702.1| protein of unknown function DUF167 [Polaromonas naphthalenivorans CJ2] Length = 107 Score = 47.3 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 36/82 (43%), Gaps = 8/82 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A K I + +K+ VTA P GKA M+ LA ++ + + ++ Sbjct: 29 KPSAGKDAIGKPK-------GTQLKVSVTAAPLAGKATDHMVRFLAPLFGVAVADIEVVF 81 Query: 69 KQSSPLKIIYIDKDCKEITELL 90 + + K + I + K++ + Sbjct: 82 GRENVNKQLRI-RAPKKLPAVF 102 >gi|296088594|emb|CBI37585.3| unnamed protein product [Vitis vinifera] Length = 89 Score = 47.3 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 29/56 (51%) Query: 25 DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 D D + +++ A + G+AN A+L ++ + + + + + S S K++ ++ Sbjct: 16 DYFDDEALGVQIDAPAKDGEANAALLDYISSVVGVKRRQVSISSGSKSRDKVVIVE 71 >gi|126340695|ref|XP_001369778.1| PREDICTED: similar to chromosome 15 open reading frame 40, [Monodelphis domestica] Length = 73 Score = 47.3 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 29/60 (48%), Gaps = 2/60 (3%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEITELLQ 91 + + A P +G+AN + L+K L L KS + + S K++ I +EI E + Sbjct: 8 VAIAAPPSEGEANTELCCYLSKVLELRKSDVILDKGGKSHEKVVPILASTTPEEILEKFK 67 >gi|297852564|ref|XP_002894163.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp. lyrata] gi|297340005|gb|EFH70422.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp. lyrata] Length = 126 Score = 47.3 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + +++ A + G+AN A+L ++ L + + Sbjct: 39 ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEFISSVLGVKRRQ 91 Query: 64 LRMLSKQSSPLKIIYID 80 + + S S K++ ++ Sbjct: 92 VSLGSGSKSRDKVVIVE 108 >gi|325959811|ref|YP_004291277.1| hypothetical protein Metbo_2086 [Methanobacterium sp. AL-21] gi|325331243|gb|ADZ10305.1| UPF0235 protein yggU [Methanobacterium sp. AL-21] Length = 100 Score = 47.3 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 4/77 (5%) Query: 18 ASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKII 77 + I +IK+ A PQKGKANK ++ AK L+ + ++S S K + Sbjct: 25 DNFRITGYNDWRKSFEIKIKAVPQKGKANKEIILEFAK---LTNKRVEIISGHKSHRKTL 81 Query: 78 YI-DKDCKEITELLQNN 93 I D + +++ +L++ Sbjct: 82 KIYDINEEDLLKLIEQE 98 >gi|297603606|ref|NP_001054326.2| Os04g0686300 [Oryza sativa Japonica Group] gi|215687220|dbj|BAG91785.1| unnamed protein product [Oryza sativa Japonica Group] gi|222629814|gb|EEE61946.1| hypothetical protein OsJ_16702 [Oryza sativa Japonica Group] gi|255675903|dbj|BAF16240.2| Os04g0686300 [Oryza sativa Japonica Group] Length = 129 Score = 46.9 bits (110), Expect = 8e-04, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 26/52 (50%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + +++ A + G+AN A++ ++ L + K + + S S K++ + Sbjct: 59 GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLV 110 >gi|297618780|ref|YP_003706885.1| protein of unknown function DUF167 [Methanococcus voltae A3] gi|297377757|gb|ADI35912.1| protein of unknown function DUF167 [Methanococcus voltae A3] Length = 112 Score = 46.9 bits (110), Expect = 8e-04, Method: Composition-based stats. Identities = 24/94 (25%), Positives = 48/94 (51%), Gaps = 7/94 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + NAKK+ I + + ++I++ P +GKANKA+L + +L L K+ Sbjct: 24 VNIDVSTNAKKNEIGKINEWR-----KRLEIRIKQQPVEGKANKAILKFIKSELNL-KTD 77 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNNDSL 96 + + + ++ K ++ D D I + L +S+ Sbjct: 78 VEIATGSTNSQKTLFFKDLDKNTILKKLNLLNSI 111 >gi|20092890|ref|NP_618965.1| hypothetical protein MA4097 [Methanosarcina acetivorans C2A] gi|19918198|gb|AAM07445.1| conserved hypothetical protein [Methanosarcina acetivorans C2A] Length = 123 Score = 46.9 bits (110), Expect = 9e-04, Method: Composition-based stats. Identities = 16/48 (33%), Positives = 27/48 (56%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +K+T QKGKAN+ ++ LA+ +S S + + S +S K + I Sbjct: 60 VKLTKNAQKGKANEQLIESLAELFGISSSEILINSGATSSKKSLLIKG 107 >gi|169600439|ref|XP_001793642.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15] gi|111068664|gb|EAT89784.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15] Length = 132 Score = 46.9 bits (110), Expect = 9e-04, Method: Composition-based stats. Identities = 16/77 (20%), Positives = 36/77 (46%), Gaps = 5/77 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P + + D+ H+ + V +G ANK++ +LAK L++ KS Sbjct: 26 ITLFVKPGVSRMR-EGIAAVSDS----HVFMNVANYAFEGSANKSVQVLLAKTLSVPKSH 80 Query: 64 LRMLSKQSSPLKIIYID 80 + ++ +S K+ + Sbjct: 81 VSIVKGLTSREKVAEVK 97 >gi|90399179|emb|CAJ86041.1| H0723C07.11 [Oryza sativa Indica Group] gi|125550304|gb|EAY96126.1| hypothetical protein OsI_18003 [Oryza sativa Indica Group] Length = 125 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 10/52 (19%), Positives = 26/52 (50%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + +++ A + G+AN A++ ++ L + K + + S S K++ + Sbjct: 55 GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKREVSIGSGSKSREKVVLV 106 >gi|296235859|ref|XP_002763077.1| PREDICTED: UPF0235 protein C15orf40-like [Callithrix jacchus] Length = 154 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 38/87 (43%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P ++++ + L + + + + A P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGSRQNAVTDLTVEA-------INVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 117 SDVVLDKSGKSREKVVKLLASTTPEEI 143 >gi|302877711|ref|YP_003846275.1| hypothetical protein Galf_0467 [Gallionella capsiferriformans ES-2] gi|302580500|gb|ADL54511.1| protein of unknown function DUF167 [Gallionella capsiferriformans ES-2] Length = 104 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 38/83 (45%), Gaps = 8/83 (9%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 PN+K+ I + +++ A P++G A M+ LA ++ + ++ ++ Sbjct: 27 PNSKQDAIGRVI-------GHQLEVYAAAVPRRGGATAHMVQYLASIFSVPEHAITVVFG 79 Query: 70 QSSPLKIIYIDKDCKEITELLQN 92 + + K + I+ ++ ++Q Sbjct: 80 EKNVNKQLRIE-SPGKLPSVIQR 101 >gi|291412810|ref|XP_002722671.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus] Length = 154 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 35/94 (37%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ + L A P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGAKQNAVTDLTAEAVIVAIA-------APPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + EI L+ Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPDEILGKLKQE 150 >gi|198282577|ref|YP_002218898.1| hypothetical protein Lferr_0437 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218665107|ref|YP_002424767.1| conserved hypothetical protein TIGR00251, putative [Acidithiobacillus ferrooxidans ATCC 23270] gi|198247098|gb|ACH82691.1| protein of unknown function DUF167 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218517320|gb|ACK77906.1| conserved hypothetical protein TIGR00251, putative [Acidithiobacillus ferrooxidans ATCC 23270] Length = 116 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P AK I +KI++ A P G AN A+ A+L L L + Sbjct: 20 VRIHVQPGAKTDAIGGCH-------GDALKIRLRARPVDGAANAALSALLCATLRLKRQQ 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ +S+ K++ I + L Sbjct: 73 ITLVQGESARDKVLRISAASTHVQTQLAKY 102 >gi|298674092|ref|YP_003725842.1| hypothetical protein Metev_0115 [Methanohalobium evestigatum Z-7303] gi|298287080|gb|ADI73046.1| protein of unknown function DUF167 [Methanohalobium evestigatum Z-7303] Length = 109 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 39/81 (48%), Gaps = 4/81 (4%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + PN+K I + + I +K+ A + GKAN ++ ++ ++ Sbjct: 13 LVVIDISVTPNSK--TINVPDNYNQWRNRIEVKLTQKA--ESGKANNQLIENFSEFFGVN 68 Query: 61 KSSLRMLSKQSSPLKIIYIDK 81 KSS+++ S + S K + + Sbjct: 69 KSSIKITSGEKSSQKSVSVKG 89 >gi|282162781|ref|YP_003355166.1| hypothetical protein MCP_0111 [Methanocella paludicola SANAE] gi|282155095|dbj|BAI60183.1| conserved hypothetical protein [Methanocella paludicola SANAE] Length = 104 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 12/63 (19%), Positives = 31/63 (49%), Gaps = 1/63 (1%) Query: 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEIT 87 ++ ++ A P++G+AN+ ++ L+ L + +S + + S K + + +E+ Sbjct: 37 RRRIEARLKAPPERGRANEELIGELSALLGIPESRIEITSGARDSRKSVKVLGASREEVL 96 Query: 88 ELL 90 L Sbjct: 97 RRL 99 >gi|30694498|ref|NP_175343.2| unknown protein [Arabidopsis thaliana] gi|34365605|gb|AAQ65114.1| At1g49170 [Arabidopsis thaliana] gi|51971533|dbj|BAD44431.1| similar to serine/threonine kinase 9 gb|AAD28798.1 [Arabidopsis thaliana] gi|332194278|gb|AEE32399.1| uncharacterized protein [Arabidopsis thaliana] Length = 126 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + +++ A + G+AN A+L ++ L + + Sbjct: 39 ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEYMSSVLGVKRRQ 91 Query: 64 LRMLSKQSSPLKIIYID 80 + + S S K++ ++ Sbjct: 92 VSLGSGSKSRDKVVIVE 108 >gi|288917345|ref|ZP_06411712.1| protein of unknown function DUF167 [Frankia sp. EUN1f] gi|288351210|gb|EFC85420.1| protein of unknown function DUF167 [Frankia sp. EUN1f] Length = 118 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 41/86 (47%), Gaps = 4/86 (4%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +R+ P + ++ + + D + ++V +G+AN+A L +A+ L + ++ Sbjct: 19 VAIRVRPASDRTAVGPV--TADPIHGQLLVVRVREPAVEGRANEAALRAIAQALGVRRAD 76 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + S+ +K + +D +I Sbjct: 77 VTL-SRSIGRVKFVAVDA-PDDIVAA 100 >gi|133930345|ref|NP_001076616.1| hypothetical protein W01A8.2 [Caenorhabditis elegans] gi|114420882|emb|CAL44973.1| C. elegans protein W01A8.2, confirmed by transcript evidence [Caenorhabditis elegans] Length = 127 Score = 46.1 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKS + ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 39 LHIHAKPGAKKSCVVAI-------GDSEVDVAIGAAPREGAANEELISYLMSALGLRKNE 91 Query: 64 LRMLSKQSSPLKIIYID 80 L+ S K++ ID Sbjct: 92 LQFDKGAKSRSKVVLID 108 >gi|21226924|ref|NP_632846.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1] gi|29839564|sp|Q8PYN9|Y822_METMA RecName: Full=UPF0235 protein MM_0822 gi|20905233|gb|AAM30518.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1] Length = 108 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 28/50 (56%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +++K+T QKGKAN+ ++ LA+ + S + + S +S K + I Sbjct: 43 IEVKLTRNAQKGKANEQLIESLAELFGICSSDIFISSGATSSKKSLLIKG 92 >gi|289641324|ref|ZP_06473490.1| protein of unknown function DUF167 [Frankia symbiont of Datisca glomerata] gi|289508922|gb|EFD29855.1| protein of unknown function DUF167 [Frankia symbiont of Datisca glomerata] Length = 122 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 39/106 (36%), Gaps = 23/106 (21%) Query: 3 NVIVRLIPNAKKSGIAS--------------------LEIPKDTSD---TIHMKIKVTAT 39 + +R+ P + ++ + + P + S + ++VTA Sbjct: 2 RLTIRVSPRSARTSVGGSSPASGAGAPGESPAENPTAADDPTNGSPPDAATPLVVRVTAP 61 Query: 40 PQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKE 85 G+A ++ L LA + + + ++ +S KI+ I + Sbjct: 62 AVDGQATESALRALATAFGVRRREVVLVRGATSRTKIVEISGLPES 107 >gi|255628955|gb|ACU14822.1| unknown [Glycine max] Length = 227 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQLAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + ++ K++ + Sbjct: 196 LSQMTLQRGWNNKSKLLVVS 215 >gi|150401269|ref|YP_001325035.1| hypothetical protein Maeo_0841 [Methanococcus aeolicus Nankai-3] gi|166235100|sp|A6UVA1|Y841_META3 RecName: Full=UPF0235 protein Maeo_0841 gi|150013972|gb|ABR56423.1| protein of unknown function DUF167 [Methanococcus aeolicus Nankai-3] Length = 101 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + NAKK+ I + + + IK+ A P GKANK + K Sbjct: 17 IDVEISSNAKKNEIGEINEWR-----KRLIIKIKALPVDGKANKEIAKFFKKTFG---KD 68 Query: 64 LRMLSKQSSPLKIIYIDKDCKE--ITELLQNN 93 + ++S +S K I + K+ I ++L+NN Sbjct: 69 IIIVSGLTSSQKTICVIGATKDEIIDKILKNN 100 >gi|218889133|ref|YP_002437997.1| hypothetical protein PLES_03891 [Pseudomonas aeruginosa LESB58] gi|218769356|emb|CAW25116.1| conserved hypothetical protein [Pseudomonas aeruginosa LESB58] Length = 56 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 26/49 (53%), Gaps = 3/49 (6%) Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI---DKDCKEIT 87 +GKAN +LA L K ++KS + + S + + K + I + +E+ Sbjct: 2 EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRIRHPTRLPEELG 50 >gi|255540269|ref|XP_002511199.1| conserved hypothetical protein [Ricinus communis] gi|223550314|gb|EEF51801.1| conserved hypothetical protein [Ricinus communis] Length = 232 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 40/97 (41%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 S + + ++ K++ ++ +LL+ Sbjct: 196 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKLLEAVQP 232 >gi|187251200|ref|YP_001875682.1| hypothetical protein Emin_0790 [Elusimicrobium minutum Pei191] gi|186971360|gb|ACC98345.1| Conserved hypothetical protein DUF167 [Elusimicrobium minutum Pei191] Length = 93 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 15/89 (16%), Positives = 37/89 (41%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+IP A ++ + S +++KV + +AN + LA+ + + Sbjct: 9 IKVRVIPTAGENEVVS-------RIGSVLRVKVKTKSIEDEANNIIQYFLAEFFGVENNY 61 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++ K + I +E + + + Sbjct: 62 INIVKGAKGKEKTVEIRGKSEEHLKKVMD 90 >gi|225456297|ref|XP_002283689.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|147823132|emb|CAN75279.1| hypothetical protein VITISV_030868 [Vitis vinifera] gi|297734405|emb|CBI15652.3| unnamed protein product [Vitis vinifera] Length = 232 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 40/97 (41%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 S + + ++ K++ ++ +LL+ Sbjct: 196 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKLLEAVQP 232 >gi|255627953|gb|ACU14321.1| unknown [Glycine max] Length = 126 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 13/84 (15%), Positives = 35/84 (41%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK + I + +++ A + G+AN A+L ++ L + + Sbjct: 39 ITIHAKPGAKSASITDISDEAVG-------VQIDAPARDGEANAALLDYISSVLGVKRRQ 91 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + + + S K + ++ ++ Sbjct: 92 VSLGTGSKSRDKTVIVEDVTQQYV 115 >gi|121712776|ref|XP_001273999.1| YggU family protein [Aspergillus clavatus NRRL 1] gi|119402152|gb|EAW12573.1| YggU family protein [Aspergillus clavatus NRRL 1] Length = 133 Score = 45.7 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 9/79 (11%) Query: 4 VIVRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNA + G+ ++ + + V A P+ G+AN A+ + A+ + K Sbjct: 34 IACHVKPNASSSREGVVAI-------GPEKVDVCVAAVPRNGEANIAVARVFAQIFDVPK 86 Query: 62 SSLRMLSKQSSPLKIIYID 80 S+ ++ S KI+ I Sbjct: 87 SNAEVIRGLKSRDKILCIT 105 >gi|255630115|gb|ACU15411.1| unknown [Glycine max] Length = 225 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQLAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + ++ K++ ++ Sbjct: 196 LSQMTLQRGWNNKSKLLVVE 215 >gi|29839573|sp|Q8TIP5|Y4097_METAC RecName: Full=UPF0235 protein MA_4097 Length = 109 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 16/48 (33%), Positives = 27/48 (56%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +K+T QKGKAN+ ++ LA+ +S S + + S +S K + I Sbjct: 46 VKLTKNAQKGKANEQLIESLAELFGISSSEILINSGATSSKKSLLIKG 93 >gi|116054122|ref|YP_788565.1| hypothetical protein PA14_05120 [Pseudomonas aeruginosa UCBPP-PA14] gi|115589343|gb|ABJ15358.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14] Length = 56 Score = 45.3 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 26/49 (53%), Gaps = 3/49 (6%) Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD---CKEIT 87 +GKAN +LA L K ++KS + + S + + K + I + +E+ Sbjct: 2 EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRIRRPTRLPEELG 50 >gi|224133940|ref|XP_002321697.1| predicted protein [Populus trichocarpa] gi|222868693|gb|EEF05824.1| predicted protein [Populus trichocarpa] Length = 231 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 40/97 (41%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 142 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFVGKVLGLK 194 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 S + + ++ K++ ++ +LL+ Sbjct: 195 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKLLEAAQP 231 >gi|148260376|ref|YP_001234503.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5] gi|146402057|gb|ABQ30584.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5] Length = 75 Score = 45.0 bits (105), Expect = 0.003, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 23/47 (48%) Query: 39 TPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKE 85 P+ G+AN A+L LA L + + L + + + K++ + + Sbjct: 4 PPEDGRANDAVLKALAAWLGIGAARLALRAGGQARDKLVLVAGGSAD 50 >gi|115471615|ref|NP_001059406.1| Os07g0295200 [Oryza sativa Japonica Group] gi|34394983|dbj|BAC84531.1| unknown protein [Oryza sativa Japonica Group] gi|113610942|dbj|BAF21320.1| Os07g0295200 [Oryza sativa Japonica Group] gi|215765278|dbj|BAG86975.1| unnamed protein product [Oryza sativa Japonica Group] gi|218199457|gb|EEC81884.1| hypothetical protein OsI_25695 [Oryza sativa Indica Group] gi|222636860|gb|EEE66992.1| hypothetical protein OsJ_23901 [Oryza sativa Japonica Group] Length = 232 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 40/97 (41%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 S + + ++ K++ ++ +LL+ Sbjct: 196 LSQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEAVQP 232 >gi|238060239|ref|ZP_04604948.1| hypothetical protein MCAG_01205 [Micromonospora sp. ATCC 39149] gi|237882050|gb|EEP70878.1| hypothetical protein MCAG_01205 [Micromonospora sp. ATCC 39149] Length = 99 Score = 45.0 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 3/90 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VR+ P A ++ + D + I V A P G+A +A LA L + + Sbjct: 6 TVAVRVKPGAARARVGG---RHDGPHGPALVIAVNAPPVDGRATEAARRALADALGIRPA 62 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++ + + +S K+ +++ + E L+ Sbjct: 63 AVALRAGAASRDKLFLVERPTPGLAEALRR 92 >gi|187251746|ref|YP_001876228.1| hypothetical protein Emin_1341 [Elusimicrobium minutum Pei191] gi|186971906|gb|ACC98891.1| Uncharacterized conserved protein DUF167 [Elusimicrobium minutum Pei191] Length = 71 Score = 44.6 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 15/52 (28%), Positives = 29/52 (55%) Query: 27 SDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIY 78 + +I V A ++G AN+A+ +LAK++ + +R++ +SP KI Sbjct: 19 KNEDTFEIWVKAPAERGLANEAVREILAKEIGVGVKKIRLIKGATSPSKIFE 70 >gi|107099377|ref|ZP_01363295.1| hypothetical protein PaerPA_01000389 [Pseudomonas aeruginosa PACS2] gi|152985685|ref|YP_001345886.1| hypothetical protein PSPA7_0491 [Pseudomonas aeruginosa PA7] gi|254243491|ref|ZP_04936813.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192] gi|296386890|ref|ZP_06876389.1| hypothetical protein PaerPAb_02102 [Pseudomonas aeruginosa PAb1] gi|313111988|ref|ZP_07797775.1| hypothetical protein PA39016_004070010 [Pseudomonas aeruginosa 39016] gi|126196869|gb|EAZ60932.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192] gi|150960843|gb|ABR82868.1| protein VV1_1522 [Pseudomonas aeruginosa PA7] gi|310884277|gb|EFQ42871.1| hypothetical protein PA39016_004070010 [Pseudomonas aeruginosa 39016] Length = 56 Score = 44.6 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 14/49 (28%), Positives = 26/49 (53%), Gaps = 3/49 (6%) Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD---CKEIT 87 +GKAN +LA L K ++KS + + S + + K + I + +E+ Sbjct: 2 EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRIRRPTRLPEELG 50 >gi|30697845|ref|NP_568972.2| unknown protein [Arabidopsis thaliana] gi|26452404|dbj|BAC43287.1| unknown protein [Arabidopsis thaliana] gi|332010365|gb|AED97748.1| uncharacterized protein [Arabidopsis thaliana] Length = 232 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + +S K++ ++ Sbjct: 196 LSQMTLQRGWNSKSKLLVVE 215 >gi|297793925|ref|XP_002864847.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp. lyrata] gi|297310682|gb|EFH41106.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp. lyrata] Length = 232 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + +S K++ ++ Sbjct: 196 LSQMTLQRGWNSKSKLLVVE 215 >gi|158314062|ref|YP_001506570.1| hypothetical protein Franean1_2229 [Frankia sp. EAN1pec] gi|158109467|gb|ABW11664.1| protein of unknown function DUF167 [Frankia sp. EAN1pec] Length = 121 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 41/90 (45%), Gaps = 3/90 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A ++ + D + + ++V +G+AN+A L LA+ L + + Sbjct: 20 VRVAIRVRPAADRTAVG--PATSDPTHGRLLVVRVREPAVEGRANEAALRALAQALGVRR 77 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S + + +K I +D IT ++ Sbjct: 78 SDVTLTR-SIGRVKFIEVDAPDDVITRRVE 106 >gi|302756509|ref|XP_002961678.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii] gi|300170337|gb|EFJ36938.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii] Length = 233 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 38/80 (47%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L +AK L+L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMAKVLSLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + + + ++ K++ ++ Sbjct: 196 ATQMTLQRGWNNKSKLLVVE 215 >gi|149057385|gb|EDM08708.1| similar to RIKEN cDNA 3110040N11, isoform CRA_e [Rattus norvegicus] Length = 130 Score = 43.8 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 28/65 (43%), Gaps = 7/65 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRM 66 S + + Sbjct: 89 SDVVL 93 >gi|312084117|ref|XP_003144143.1| hypothetical protein LOAG_08565 [Loa loa] gi|307760694|gb|EFO19928.1| hypothetical protein LOAG_08565 [Loa loa] Length = 129 Score = 43.8 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 42/93 (45%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + PNAK + + + +++ + A P G+AN+A++ + L L K+ Sbjct: 39 LRIHAKPNAKTTRVIDI-------GANEVELAIAAPPHDGQANEALINAMMDILELRKNE 91 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQNN 93 + + S K++ + +E+ E L+ N Sbjct: 92 ITFDTGARSRSKVLRLMSKRITLEEVREKLERN 124 >gi|225620549|ref|YP_002721806.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1] gi|225215368|gb|ACN84102.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1] Length = 56 Score = 43.8 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 16/53 (30%), Positives = 29/53 (54%) Query: 38 ATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 A GKANKA++ LA +L + K + +L + + K+I I+ + E+ + Sbjct: 2 AKAIDGKANKAIIDFLADELNIKKRDVEILKGEKNSKKLISININDNELKKYF 54 >gi|91773909|ref|YP_566601.1| hypothetical protein Mbur_1971 [Methanococcoides burtonii DSM 6242] gi|91712924|gb|ABE52851.1| protein of unknown function DUF167 [Methanococcoides burtonii DSM 6242] Length = 107 Score = 43.8 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 44/90 (48%), Gaps = 5/90 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P +KK+ ++IK+T+ QKGKAN ++ ++A + + Sbjct: 18 IDLEVTPGSKKACF----PAGYNQWRERIEIKLTSAAQKGKANGQLIEIVADFFNIGQRE 73 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + S S K + I++ D +++ L++ Sbjct: 74 VIIGSGAKSSKKTVIINRPDQEQVVLALES 103 >gi|255632017|gb|ACU16361.1| unknown [Glycine max] Length = 126 Score = 43.8 bits (102), Expect = 0.007, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 27/54 (50%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEIT 87 +++ A + G+AN A+L ++ L + + + + + S K + ++ ++ Sbjct: 62 VQIDAPARDGEANAALLDYISSVLGVKRRQVSLGTGSKSRDKTVIVEDVTQQYV 115 >gi|116780073|gb|ABK21543.1| unknown [Picea sitchensis] Length = 265 Score = 43.8 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + + + ++ K++ + Sbjct: 196 LTQMTLQRGWNNKSKLLVVS 215 >gi|237858666|ref|NP_001153586.1| hypothetical protein LOC123207 isoform c [Homo sapiens] Length = 153 Score = 43.8 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRMLSKQSSPLKII 77 S + + K LKI+ Sbjct: 116 SDVVLDKKLRDLLKIV 131 >gi|224119652|ref|XP_002318126.1| predicted protein [Populus trichocarpa] gi|222858799|gb|EEE96346.1| predicted protein [Populus trichocarpa] Length = 231 Score = 43.8 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 41/97 (42%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 142 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 194 Query: 61 KSSLRMLSKQSSPLKIIYIDK--DCKEITELLQNNDS 95 S + + ++ K++ ++ + +LL+ Sbjct: 195 LSQMTLQRGWNNKSKLLVVEDLYARQVYEKLLEAVQP 231 >gi|261402905|ref|YP_003247129.1| protein of unknown function DUF167 [Methanocaldococcus vulcanius M7] gi|261369898|gb|ACX72647.1| protein of unknown function DUF167 [Methanocaldococcus vulcanius M7] Length = 102 Score = 43.8 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 19/87 (21%), Positives = 38/87 (43%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAK++ IA + + + +K+ A +GKANK ++ K Sbjct: 15 IDIDVQANAKRNEIAGINEWR-----KRLSVKIKAPAIEGKANKEIIKFFGNLF---KKD 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ ++S K I I K E + Sbjct: 67 VEIILGKTSSQKTILILGAKKGYVEEI 93 >gi|212541150|ref|XP_002150730.1| DUF167 domain protein [Penicillium marneffei ATCC 18224] gi|210068029|gb|EEA22121.1| DUF167 domain protein [Penicillium marneffei ATCC 18224] Length = 124 Score = 43.8 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 17/70 (24%), Positives = 36/70 (51%), Gaps = 6/70 (8%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P ++ G ++ D IH V + P+KG+AN A++ +L++ L + KS + ++ Sbjct: 30 VTPGSR--GFQGVKKICDEQVYIH----VASAPRKGEANAAVVKVLSEVLGIPKSDITIM 83 Query: 68 SKQSSPLKII 77 K + + Sbjct: 84 GKHRDKVGQV 93 >gi|73668032|ref|YP_304047.1| hypothetical protein Mbar_A0485 [Methanosarcina barkeri str. Fusaro] gi|72395194|gb|AAZ69467.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro] Length = 100 Score = 43.8 bits (102), Expect = 0.008, Method: Composition-based stats. Identities = 15/50 (30%), Positives = 31/50 (62%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 +++K+T QKGKAN+ ++ LA+ ++S S++ + S +S K + + Sbjct: 31 IEVKLTKNAQKGKANEQLVECLAELFSISSSNILINSGATSSKKSLLLKG 80 >gi|15668799|ref|NP_247602.1| hypothetical protein MJ_0618 [Methanocaldococcus jannaschii DSM 2661] gi|2496078|sp|Q58035|Y618_METJA RecName: Full=UPF0235 protein MJ0618 gi|1591329|gb|AAB98613.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM 2661] Length = 98 Score = 43.8 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 43/91 (47%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + + + IK+ A +GKANK ++ + K Sbjct: 15 IDIDVQANAKKNEIVGINEWR-----KRLSIKIKAPATEGKANKEIIKFFKEIF---KKD 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + ++S + +P K + I D E+ E+L+ Sbjct: 67 VEIVSGKLNPQKTVLIGDIKKDEVIEILKRY 97 >gi|9758285|dbj|BAB08809.1| unnamed protein product [Arabidopsis thaliana] Length = 213 Score = 43.8 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 124 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 176 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + +S K++ ++ Sbjct: 177 LSQMTLQRGWNSKSKLLVVE 196 >gi|307199144|gb|EFN79854.1| UPF0235 protein C15orf40-like protein [Harpegnathos saltator] Length = 100 Score = 43.8 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 28/65 (43%), Gaps = 7/65 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P AK + I + + ++A P +G+AN ++ LA L K Sbjct: 35 VTIKIQAKPGAKCNNITDITNEGVG-------VAISAPPTEGEANAELVKYLASIFGLRK 87 Query: 62 SSLRM 66 S + + Sbjct: 88 SHVSL 92 >gi|302763307|ref|XP_002965075.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii] gi|300167308|gb|EFJ33913.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii] Length = 233 Score = 43.8 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 38/80 (47%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L +AK L+L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMAKVLSLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + + + ++ K++ ++ Sbjct: 196 VTQMTLQRGWNNKSKLLVVE 215 >gi|257077103|ref|ZP_05571464.1| hypothetical protein Faci_08566 [Ferroplasma acidarmanus fer1] Length = 66 Score = 43.4 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 16/55 (29%), Positives = 27/55 (49%) Query: 26 TSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 S+ +KI +A + KAN ++ LA + ++++S Q S KI ID Sbjct: 11 ESEGDRLKIYTSAPRENNKANYDIMKQLATYYNVEFYKIKLISGQKSRKKIFSID 65 >gi|326386210|ref|ZP_08207834.1| hypothetical protein Y88_2102 [Novosphingobium nitrogenifigens DSM 19370] gi|326209435|gb|EGD60228.1| hypothetical protein Y88_2102 [Novosphingobium nitrogenifigens DSM 19370] Length = 95 Score = 43.4 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 42/79 (53%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A++ +A ++ + +KV A P+ GKA A+L+++A L ++ Sbjct: 22 GRLAVRVTPGAREETVAIVDGR--------VLVKVRAKPEDGKATTAVLSLVAAALGVAA 73 Query: 62 SSLRMLSKQSSPLKIIYID 80 S + +L +S K++ + Sbjct: 74 SRVELLRGATSREKLLRLP 92 >gi|291615278|ref|YP_003525435.1| hypothetical protein Slit_2823 [Sideroxydans lithotrophicus ES-1] gi|291585390|gb|ADE13048.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus ES-1] Length = 102 Score = 43.4 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 15/70 (21%), Positives = 33/70 (47%), Gaps = 7/70 (10%) Query: 10 PNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSK 69 P AKK+ I + +++ V P +G+A ++ LA + +++SS+ ++ Sbjct: 20 PRAKKTKIGKVI-------GNQLEVHVAENPVRGRATAHLVKFLAGEFDVTESSITVVFG 72 Query: 70 QSSPLKIIYI 79 + K + I Sbjct: 73 VYNVNKQLRI 82 >gi|1730921|sp|P52064|YPI4_VIBAL RecName: Full=UPF0235 protein in proC 3'region Length = 54 Score = 43.0 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 20/48 (41%), Gaps = 7/48 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLA 51 + + + P A + I L +KI +TA P GKAN + Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTK 54 >gi|307108344|gb|EFN56584.1| hypothetical protein CHLNCDRAFT_34979 [Chlorella variabilis] Length = 251 Score = 43.0 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 29/71 (40%), Gaps = 9/71 (12%) Query: 20 LEIPKDTSDTIHMKIKVTAT---------PQKGKANKAMLAMLAKKLALSKSSLRMLSKQ 70 + + D + ++VTA G AN+ +L ML L + L + + Sbjct: 144 VALEVDDRGKRALVLRVTADFVRVQLKSGANAGHANEELLEMLRGVLGVRLGQLSLQRGE 203 Query: 71 SSPLKIIYIDK 81 SS K++ ++ Sbjct: 204 SSRHKVLLVEG 214 >gi|145334887|ref|NP_001078789.1| unknown protein [Arabidopsis thaliana] gi|332010366|gb|AED97749.1| uncharacterized protein [Arabidopsis thaliana] Length = 213 Score = 42.7 bits (99), Expect = 0.016, Method: Composition-based stats. Identities = 15/80 (18%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 124 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 176 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + +S K++ ++ Sbjct: 177 LSQMTLQRGWNSKSKLLVVE 196 >gi|116784383|gb|ABK23322.1| unknown [Picea sitchensis] gi|224284776|gb|ACN40118.1| unknown [Picea sitchensis] Length = 232 Score = 42.7 bits (99), Expect = 0.018, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + + + ++ K++ ++ Sbjct: 196 LTQMTLQRGWNNKSKLLVVE 215 >gi|150399132|ref|YP_001322899.1| hypothetical protein Mevan_0378 [Methanococcus vannielii SB] gi|166232643|sp|A6UP65|Y378_METVS RecName: Full=UPF0235 protein Mevan_0378 gi|150011835|gb|ABR54287.1| protein of unknown function DUF167 [Methanococcus vannielii SB] Length = 101 Score = 42.3 bits (98), Expect = 0.020, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 8/84 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + + +++K+ P +G+ANKA+L L + K+ Sbjct: 15 IDIEVTTNAKKNEIGKINKWR-----KRLEVKIKEQPIEGRANKAILKFLKEIF---KTD 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + + +SP K + I D KE Sbjct: 67 VELNPVTTSPQKTVLISCDTKEYV 90 >gi|312221888|emb|CBY01828.1| similar to DUF167 domain protein [Leptosphaeria maculans] Length = 118 Score = 42.3 bits (98), Expect = 0.021, Method: Composition-based stats. Identities = 12/73 (16%), Positives = 34/73 (46%), Gaps = 5/73 (6%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P S + + + +D +++ V + G+AN A++ ++ + L + +R++ Sbjct: 24 VKPGVSSSRLGIIAVTEDA-----VEVGVAEQAKNGEANDAVVHVICRALHAPRDEVRIV 78 Query: 68 SKQSSPLKIIYID 80 S +K + + Sbjct: 79 RGWKSRVKTVVVS 91 >gi|237858670|ref|NP_001153588.1| hypothetical protein LOC123207 isoform e [Homo sapiens] Length = 149 Score = 42.3 bits (98), Expect = 0.024, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 7/65 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRM 66 S + + Sbjct: 116 SDVVL 120 >gi|237858664|ref|NP_001153585.1| hypothetical protein LOC123207 isoform b [Homo sapiens] Length = 167 Score = 42.3 bits (98), Expect = 0.024, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 7/65 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRM 66 S + + Sbjct: 116 SDVVL 120 >gi|167999197|ref|XP_001752304.1| predicted protein [Physcomitrella patens subsp. patens] gi|162696699|gb|EDQ83037.1| predicted protein [Physcomitrella patens subsp. patens] Length = 231 Score = 42.3 bits (98), Expect = 0.026, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 142 LVQVAIEVEDRAQRSQITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 194 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + + + ++ K++ ++ Sbjct: 195 LTQMTLQRGWNNKSKLLAVE 214 >gi|218884121|ref|YP_002428503.1| hypothetical protein DKAM_0810 [Desulfurococcus kamchatkensis 1221n] gi|218765737|gb|ACL11136.1| Uncharacterized conserved protein [Desulfurococcus kamchatkensis 1221n] Length = 115 Score = 41.9 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 38/90 (42%), Gaps = 9/90 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + I T + + +KG+ N A++ LA++L + S Sbjct: 29 LSIRVKPGDVEDYI--------TIEGDELVFHTAEQSEKGRENAALVKYLARELKIPVSK 80 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + ++ + LK + + D D E+ L Sbjct: 81 IDIVYGRRETLKKVLLNDVDPDELVIKLAK 110 >gi|317509370|ref|ZP_07966989.1| hypothetical protein HMPREF9336_03361 [Segniliparus rugosus ATCC BAA-974] gi|316252293|gb|EFV11744.1| hypothetical protein HMPREF9336_03361 [Segniliparus rugosus ATCC BAA-974] Length = 79 Score = 41.9 bits (97), Expect = 0.027, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 39/76 (51%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V + P +++ S+E +D S + + V +G+A A +LAK L ++KS Sbjct: 8 VTVTVKPGSRRGP--SVEAAEDGS----LTVCVREPAVEGRATAAAAVVLAKHLGVAKSR 61 Query: 64 LRMLSKQSSPLKIIYI 79 + ++S +S +K + Sbjct: 62 VALVSGATSRVKRFAV 77 >gi|294494893|ref|YP_003541386.1| hypothetical protein Mmah_0207 [Methanohalophilus mahii DSM 5219] gi|292665892|gb|ADE35741.1| protein of unknown function DUF167 [Methanohalophilus mahii DSM 5219] Length = 103 Score = 41.9 bits (97), Expect = 0.030, Method: Composition-based stats. Identities = 14/60 (23%), Positives = 33/60 (55%), Gaps = 1/60 (1%) Query: 35 KVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 K+T + QKGKAN ++ L+ ++ SS+ +++ + K ++++ K ++L+ Sbjct: 43 KLTESAQKGKANDQLIQRLSHIFQINSSSITIVAGAKTTKKSVHLENVYPKTAEDVLEQY 102 >gi|221106236|ref|XP_002164251.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 1280 Score = 41.9 bits (97), Expect = 0.030, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 7/61 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++ + + + I++ A P+ GKAN +L+ L++ + KS Sbjct: 915 LQIYAKPGAKRNKVTDI-------SVEFIGIQLAAQPRDGKANDELLSYLSELFNIKKSG 967 Query: 64 L 64 + Sbjct: 968 I 968 >gi|289192506|ref|YP_003458447.1| protein of unknown function DUF167 [Methanocaldococcus sp. FS406-22] gi|288938956|gb|ADC69711.1| protein of unknown function DUF167 [Methanocaldococcus sp. FS406-22] Length = 98 Score = 41.9 bits (97), Expect = 0.032, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 38/82 (46%), Gaps = 8/82 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + + + IK+ A +GKANK ++ L K Sbjct: 15 IDIDVQANAKKNEIIGINEWR-----KRLTIKIKAPATEGKANKEIIKFLKDIF---KKD 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + +++ + +P K + + K+ Sbjct: 67 VEIVAGKLNPQKTVLVKDIKKD 88 >gi|281212114|gb|EFA86275.1| hypothetical protein PPL_00837 [Polysphondylium pallidum PN500] Length = 83 Score = 41.5 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 15/56 (26%), Positives = 33/56 (58%), Gaps = 5/56 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + + V + PNAK+S + S+ D+ D I+++ P +G+AN+ ++ L+++ Sbjct: 9 IVKLKVNVHPNAKQSSVVSVNELADSVD-----IRISQPPTEGRANEEVIEYLSEQ 59 >gi|194688626|gb|ACF78397.1| unknown [Zea mays] Length = 232 Score = 41.5 bits (96), Expect = 0.037, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 40/97 (41%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 + + + ++ K++ ++ +LL+ Sbjct: 196 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEAVQP 232 >gi|307721935|ref|YP_003893075.1| hypothetical protein Saut_2020 [Sulfurimonas autotrophica DSM 16294] gi|306980028|gb|ADN10063.1| protein of unknown function DUF167 [Sulfurimonas autotrophica DSM 16294] Length = 118 Score = 41.1 bits (95), Expect = 0.046, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 24/56 (42%) Query: 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84 +KI V P GKA ++ LA + + + ++ + + K + + + K Sbjct: 55 GNQLKISVACAPVNGKATDHLVKFLAHEFDVKIKDIEVVFGRMNVNKQLRVTRPNK 110 >gi|237858668|ref|NP_001153587.1| hypothetical protein LOC123207 isoform d [Homo sapiens] Length = 167 Score = 41.1 bits (95), Expect = 0.047, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 7/65 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRM 66 S + + Sbjct: 116 SDVVL 120 >gi|297297113|ref|XP_002804962.1| PREDICTED: UPF0235 protein C15orf40-like isoform 2 [Macaca mulatta] Length = 150 Score = 41.1 bits (95), Expect = 0.053, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 7/65 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTITIHAKPGSKQNAVTDLTAEA-------VNVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRM 66 S + + Sbjct: 117 SDVVL 121 >gi|168035145|ref|XP_001770071.1| predicted protein [Physcomitrella patens subsp. patens] gi|162678597|gb|EDQ65053.1| predicted protein [Physcomitrella patens subsp. patens] Length = 232 Score = 40.3 bits (93), Expect = 0.080, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVGIEVEDRAQRSQITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + + + ++ K++ ++ Sbjct: 196 LTQMTLQRGWNNKSKLLVVE 215 >gi|256810180|ref|YP_003127549.1| protein of unknown function DUF167 [Methanocaldococcus fervens AG86] gi|256793380|gb|ACV24049.1| protein of unknown function DUF167 [Methanocaldococcus fervens AG86] Length = 98 Score = 40.3 bits (93), Expect = 0.087, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 40/91 (43%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + AKK I + + + IK+ A +GKANK ++ + K Sbjct: 15 IDIDVQAGAKKDEITGINEWR-----KRLSIKIKAPATEGKANKEIIKFFKEIF---KKD 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKE-ITELLQNN 93 + +++ + +P K I + K+ + E L+ Sbjct: 67 IEIVAGKLNPQKTILVKDIKKDEVIETLKKY 97 >gi|281348884|gb|EFB24468.1| hypothetical protein PANDA_020551 [Ailuropoda melanoleuca] Length = 87 Score = 40.0 bits (92), Expect = 0.10, Method: Composition-based stats. Identities = 13/65 (20%), Positives = 29/65 (44%), Gaps = 7/65 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 26 VTIAIHAKPGSKQNAVTDVTAEA-------VSVAIAAPPSEGEANAELCRYLSKVLELRK 78 Query: 62 SSLRM 66 S + + Sbjct: 79 SDVVL 83 >gi|223944751|gb|ACN26459.1| unknown [Zea mays] Length = 194 Score = 39.6 bits (91), Expect = 0.13, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 40/97 (41%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 105 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 157 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 + + + ++ K++ ++ +LL+ Sbjct: 158 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEAVQP 194 >gi|72388068|ref|XP_844458.1| hypothetical protein [Trypanosoma brucei TREU927] gi|62359409|gb|AAX79847.1| hypothetical protein, conserved [Trypanosoma brucei] gi|70800991|gb|AAZ10899.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain 927/4 GUTat10.1] Length = 165 Score = 39.2 bits (90), Expect = 0.20, Method: Composition-based stats. Identities = 15/72 (20%), Positives = 34/72 (47%), Gaps = 4/72 (5%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++ P A+ + +A+ D + +++++ A P GKAN ++ + L + Sbjct: 16 RLMIHAKPGARSTALAAQPQALDEA----LEVRLAAPPVDGKANTELVEFMQTLLEQQLA 71 Query: 63 SLRMLSKQSSPL 74 +LR+ KQ Sbjct: 72 TLRITQKQHLRK 83 >gi|261327632|emb|CBH10608.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] gi|289742777|gb|ADD20136.1| hypothetical protein Tb927.4.3080 [Glossina morsitans morsitans] Length = 165 Score = 38.8 bits (89), Expect = 0.26, Method: Composition-based stats. Identities = 16/72 (22%), Positives = 34/72 (47%), Gaps = 4/72 (5%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++ P A+ + +A+ D + +++++ A P GKAN ++ + L + Sbjct: 16 RLMIHAKPGARSTALAAQPQALDEA----LEVRLAAPPVDGKANTELVEFMQTLLEQQLA 71 Query: 63 SLRMLSKQSSPL 74 +LRM KQ Sbjct: 72 TLRMTQKQHLRK 83 >gi|226529615|ref|NP_001152637.1| hypothetical protein LOC100286278 [Zea mays] gi|195658405|gb|ACG48670.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea mays] Length = 194 Score = 38.8 bits (89), Expect = 0.27, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 40/97 (41%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S + + +++ V A +G+AN +L + K L L Sbjct: 105 LVQVAIEVEDRAQRSAVTRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 157 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 + + + ++ K++ ++ +LL+ Sbjct: 158 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEAVQP 194 >gi|303245044|ref|ZP_07331365.1| protein of unknown function DUF167 [Methanothermococcus okinawensis IH1] gi|302484607|gb|EFL47550.1| protein of unknown function DUF167 [Methanothermococcus okinawensis IH1] Length = 101 Score = 38.8 bits (89), Expect = 0.27, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 44/88 (50%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + PNAKK+ I + + + IK+ A P +GKANK ++ + K Sbjct: 17 IDIDISPNAKKNEIGGINEWR-----KRIIIKIKAQPIEGKANKEIIK---FLKKIFKKD 68 Query: 64 LRMLSKQSSPLK-IIYIDKDCKEITELL 90 + ++S +S K ++ I + +EI ++ Sbjct: 69 VEIVSGLTSSQKTVLVIGGNREEIINII 96 >gi|194671931|ref|XP_001789164.1| PREDICTED: C21H15orf40 protein-like [Bos taurus] Length = 234 Score = 38.4 bits (88), Expect = 0.30, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 28/77 (36%), Gaps = 5/77 (6%) Query: 8 LIPNAK-----KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + P S + T + + + P +G+AN + L+K L L S Sbjct: 98 VYPKGGFTMAIHDKAGSKQNAMTDVTTEAVSVGIAGPPIEGEANVELCCCLSKILELRTS 157 Query: 63 SLRMLSKQSSPLKIIYI 79 + + S K++ + Sbjct: 158 DVVLDKGSKSHEKVVKL 174 >gi|297470564|ref|XP_002684034.1| PREDICTED: C21H15orf40 protein-like, partial [Bos taurus] gi|296491729|gb|DAA33762.1| C21H15orf40 protein-like [Bos taurus] Length = 107 Score = 38.4 bits (88), Expect = 0.33, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 28/77 (36%), Gaps = 5/77 (6%) Query: 8 LIPNAK-----KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + P S + T + + + P +G+AN + L+K L L S Sbjct: 8 VYPKGGFTMAIHDKAGSKQNAMTDVTTEAVSVGIAGPPIEGEANVELCCCLSKILELRTS 67 Query: 63 SLRMLSKQSSPLKIIYI 79 + + S K++ + Sbjct: 68 DVVLDKGSKSHEKVVKL 84 >gi|327289071|ref|XP_003229248.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Anolis carolinensis] Length = 104 Score = 38.0 bits (87), Expect = 0.39, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 25/63 (39%), Gaps = 7/63 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P +K++ + L I + A P G+AN + L+K L + + Sbjct: 48 VTIAVHAKPGSKQNAVTDLSAEAVG-------IAIAAPPSDGEANAELCRYLSKVLEVKR 100 Query: 62 SSL 64 L Sbjct: 101 VRL 103 >gi|322822396|gb|EFZ28456.1| hypothetical protein TCSYLVIO_5305 [Trypanosoma cruzi] Length = 156 Score = 37.6 bits (86), Expect = 0.57, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 48/140 (34%), Gaps = 55/140 (39%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLA----MLAKKLAL 59 + V P A+ S +A D + ++++V A P +GKAN ++ +L ++LA Sbjct: 17 LTVHAKPGARSSSLACHPAVMDAA----LEVRVGAPPVEGKANAELVDFMQMLLEQELAR 72 Query: 60 SKS---------------------------------------------SLRMLSKQSSPL 74 ++ + ++S S+ Sbjct: 73 VRATQQHTPLESNGAVMNCVYGGHSKKDKKKNKPNTKMECPVNYPDKVRVSLVSGASARH 132 Query: 75 KIIYI--DKDCKEITELLQN 92 K + + +E+ +LQ+ Sbjct: 133 KTLEVAFPGTQEELISVLQS 152 >gi|312136610|ref|YP_004003947.1| hypothetical protein Mfer_0383 [Methanothermus fervidus DSM 2088] gi|311224329|gb|ADP77185.1| protein of unknown function DUF167 [Methanothermus fervidus DSM 2088] Length = 97 Score = 37.6 bits (86), Expect = 0.60, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 45/89 (50%), Gaps = 9/89 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + +IP++KK GI + + + + V + +KGKANK ++ +K Sbjct: 13 LQIHVIPSSKKFGIEKYDSWR-----KRLYVTVKSPARKGKANKEIIEEFSKLFN---KE 64 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQ 91 ++++ S K + + D + K+I E+++ Sbjct: 65 VKIVKGIKSRDKTLVVKDVEYKKIMEIIR 93 >gi|242799550|ref|XP_002483404.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500] gi|218716749|gb|EED16170.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500] Length = 124 Score = 37.3 bits (85), Expect = 0.79, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 42/86 (48%), Gaps = 10/86 (11%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P ++ G ++ + +H V + P+KG+AN A+ +L++ L KS + ++ Sbjct: 30 VTPGSR--GFQGIKQIHNEQVYVH----VGSEPRKGEANTAVARVLSEVLGFPKSDVIIV 83 Query: 68 SKQSSPLKIIYIDKDCKEITELLQNN 93 KQ +KI + +I +L + Sbjct: 84 GKQ--RVKIGQVTG--VDIGKLSEEE 105 >gi|183982913|ref|YP_001851204.1| hypothetical protein MMAR_2910 [Mycobacterium marinum M] gi|226701561|sp|B2HE69|Y2910_MYCMM RecName: Full=UPF0235 protein MMAR_2910 gi|183176239|gb|ACC41349.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 76 Score = 36.9 bits (84), Expect = 0.85, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+VR+ P ++K + +E D + I V GKAN+A +LA L L +S Sbjct: 5 VVVRVKPGSRKGPL--VETGSDAE----LTIYVRERAVDGKANEAAARLLAAHLQLPRSR 58 Query: 64 LRMLSKQSSPLKIIYIDK 81 + +++ +S LK +++ Sbjct: 59 VELVAGATSRLKRFRVER 76 >gi|71668144|ref|XP_821011.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70886377|gb|EAN99160.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 156 Score = 36.9 bits (84), Expect = 1.0, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 46/140 (32%), Gaps = 55/140 (39%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAM--------LAK 55 + V P A+ S +A D + ++++V A P +GKAN ++ LA+ Sbjct: 17 LTVHAKPGARSSSLACHPAVTDAA----IEVRVGAPPVEGKANAELVEFMQMLLEQELAR 72 Query: 56 ------------------------------------KLALS-----KSSLRMLSKQSSPL 74 K+ K + ++S S+ Sbjct: 73 VRAAQQHTPLESNGAVMNCVYGGHSKKDKRKNKPNTKMECPVNYPEKVRVSLVSGASARH 132 Query: 75 KIIYI--DKDCKEITELLQN 92 K + + +E+ +LQ+ Sbjct: 133 KTLEVAFPGTQEELISVLQS 152 >gi|50551895|ref|XP_503422.1| YALI0E01584p [Yarrowia lipolytica] gi|49649291|emb|CAG79001.1| YALI0E01584p [Yarrowia lipolytica] Length = 344 Score = 36.5 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 2/73 (2%) Query: 17 IASLEIPKDTSDTIHMKIKVTATPQKGKA-NKAMLAMLAKKLALSKSSLRMLSKQSSPLK 75 + + MK VTA K N L ++A+ L L K+S+ + LK Sbjct: 118 LDGPNRFPEGKTGEEMKQAVTAYLDAMKEMNFKFLQLVAECLNLPKNSVTDFLGEMDRLK 177 Query: 76 IIYI-DKDCKEIT 87 +I D + + Sbjct: 178 LIRYPPGDGQGVG 190 >gi|325120013|emb|CBZ55566.1| hypothetical protein NCLIV_059910 [Neospora caninum Liverpool] Length = 531 Score = 36.5 bits (83), Expect = 1.3, Method: Composition-based stats. Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 5/57 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 + V P AK+S I S+ + + +++ A ++G AN+ + LA +L Sbjct: 51 TLAVHAKPGAKQSQIPSINEQAEQ-----LDVQIDAPAREGAANEELCDFLADACSL 102 >gi|71660987|ref|XP_817521.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70882718|gb|EAN95670.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 156 Score = 36.1 bits (82), Expect = 1.7, Method: Composition-based stats. Identities = 12/50 (24%), Positives = 25/50 (50%), Gaps = 4/50 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAML 53 + V P A+ S +A D + +++++ A P +GKAN ++ + Sbjct: 17 LTVHAKPGARSSSLACHPAVTDAA----LEVRIGAPPVEGKANAELVDFM 62 >gi|11499654|ref|NP_070896.1| hypothetical protein AF2072 [Archaeoglobus fulgidus DSM 4304] gi|29839698|sp|O28207|Y2072_ARCFU RecName: Full=UPF0235 protein AF_2072 gi|2648454|gb|AAB89177.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304] Length = 78 Score = 36.1 bits (82), Expect = 1.8, Method: Composition-based stats. Identities = 12/74 (16%), Positives = 36/74 (48%), Gaps = 10/74 (13%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P +K+ + E + +++++ + ++GKAN+ +L + + Sbjct: 10 ISVHVSPGSKEVSFSYDEWRR------AVEVRIKSPAKEGKANRELLGIFRQIFG----E 59 Query: 64 LRMLSKQSSPLKII 77 + ++S + S K++ Sbjct: 60 VELVSGEKSRSKVL 73 >gi|124028038|ref|YP_001013358.1| hypothetical protein Hbut_1179 [Hyperthermus butylicus DSM 5456] gi|123978732|gb|ABM81013.1| hypothetical protein Hbut_1179 [Hyperthermus butylicus DSM 5456] Length = 108 Score = 35.7 bits (81), Expect = 2.1, Method: Composition-based stats. Identities = 14/98 (14%), Positives = 38/98 (38%), Gaps = 13/98 (13%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +V + + P A +G+ + + +G+ N +++ ++ L +S Sbjct: 16 VDVTLYVKPEASFTGLRM--------ELGELVFYTEELDVEGRVNASIVMFFSRLLGVSP 67 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-----KEITELLQNND 94 S + ++ K + I ++I E L+ ++ Sbjct: 68 SMIDIVYGTREKTKRVRIKNVTWNQVFEKIVEALRESE 105 >gi|326517290|dbj|BAK00012.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326529833|dbj|BAK08196.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 232 Score = 35.3 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 35/97 (36%), Gaps = 9/97 (9%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADDVRVAVAAPA-------ARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKE--ITELLQNNDS 95 S + + ++ K++ ++ +LL+ Sbjct: 196 LSQMTLQRGWNNKSKLLIVEDLSARQVYEKLLEAVQP 232 >gi|331664329|ref|ZP_08365235.1| putative electron transfer flavoprotein subunit YgcQ [Escherichia coli TA143] gi|331058260|gb|EGI30241.1| putative electron transfer flavoprotein subunit YgcQ [Escherichia coli TA143] Length = 286 Score = 34.9 bits (79), Expect = 3.4, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 3/81 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIK-VTATPQKGKANKAMLAMLAKKLA--LS 60 + + ++P A + S+E K+ + + + V Q G+A+ +AMLA+KL + Sbjct: 138 LQLNIVPGALPDWLVSIENLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVG 197 Query: 61 KSSLRMLSKQSSPLKIIYIDK 81 S R+++ K+I I Sbjct: 198 YSRARVMNGGVDAEKVIGISG 218 >gi|301027216|ref|ZP_07190579.1| electron transfer flavoprotein FAD-binding domain protein [Escherichia coli MS 69-1] gi|300395138|gb|EFJ78676.1| electron transfer flavoprotein FAD-binding domain protein [Escherichia coli MS 69-1] Length = 286 Score = 34.9 bits (79), Expect = 3.8, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 3/79 (3%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIK-VTATPQKGKANKAMLAMLAKKLA--LSKS 62 + ++P A + S+E K+ S + + V Q G+A+ +AMLA+KL + S Sbjct: 140 LNIVPGALPDWLVSIENLKNVSRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYS 199 Query: 63 SLRMLSKQSSPLKIIYIDK 81 R+++ K+I I Sbjct: 200 RARVMNGGVDAEKVIGISG 218 >gi|331654249|ref|ZP_08355249.1| putative electron transfer flavoprotein subunit YgcQ [Escherichia coli M718] gi|331047631|gb|EGI19708.1| putative electron transfer flavoprotein subunit YgcQ [Escherichia coli M718] Length = 297 Score = 34.9 bits (79), Expect = 4.1, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 3/79 (3%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIK-VTATPQKGKANKAMLAMLAKKLA--LSKS 62 + ++P A + S+E K+ S + + V Q G+A+ +AMLA+KL + S Sbjct: 151 LNIVPGALPDWLVSIENLKNVSRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYS 210 Query: 63 SLRMLSKQSSPLKIIYIDK 81 R+++ K+I I Sbjct: 211 RARVMNGGVDAEKVIGISG 229 >gi|308456125|ref|XP_003090529.1| hypothetical protein CRE_03518 [Caenorhabditis remanei] gi|308262652|gb|EFP06605.1| hypothetical protein CRE_03518 [Caenorhabditis remanei] Length = 102 Score = 34.6 bits (78), Expect = 5.0, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 7/63 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKS + ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 34 LRIHAKPGAKKSCVVAI-------GESEIDVSIGAAPREGAANEELISYLMAALGLRKNE 86 Query: 64 LRM 66 L+ Sbjct: 87 LQF 89 >gi|111223913|ref|YP_714707.1| hypothetical protein FRAAL4520 [Frankia alni ACN14a] gi|111151445|emb|CAJ63162.1| hypothetical protein FRAAL4520 [Frankia alni ACN14a] Length = 93 Score = 33.8 bits (76), Expect = 7.2, Method: Composition-based stats. Identities = 17/74 (22%), Positives = 31/74 (41%) Query: 23 PKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD 82 D + ++VT G+A +A L LA L L ++ +R++ +S +K + Sbjct: 5 WTDPRAGSIVIVRVTERAVDGRATEAALRALAGALGLRRTQVRLVRGATSRVKTFELTTP 64 Query: 83 CKEITELLQNNDSL 96 + L D L Sbjct: 65 AADEPALRARLDRL 78 >gi|289523066|ref|ZP_06439920.1| 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase [Anaerobaculum hydrogeniformans ATCC BAA-1850] gi|289503609|gb|EFD24773.1| 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase [Anaerobaculum hydrogeniformans ATCC BAA-1850] Length = 353 Score = 33.8 bits (76), Expect = 8.8, Method: Composition-based stats. Identities = 18/85 (21%), Positives = 40/85 (47%), Gaps = 5/85 (5%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATP-QKGKANKAMLAMLAKKLALSKSSLRM 66 ++P+ A + + M+I ++A ++ KA A+L + L L K + + Sbjct: 208 VMPDGLVKSAAGMALLLTQGLGDTMRISLSAPAVEEVKAGYALL----RSLGLRKKGVEI 263 Query: 67 LSKQSSPLKIIYIDKDCKEITELLQ 91 +S + K + + K ++I+ LL+ Sbjct: 264 ISCPTCGRKHLDVAKIVEDISPLLK 288 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.310 0.143 0.367 Lambda K H 0.267 0.0439 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,501,181,801 Number of Sequences: 13984884 Number of extensions: 48248111 Number of successful extensions: 148513 Number of sequences better than 10.0: 829 Number of HSP's better than 10.0 without gapping: 1056 Number of HSP's successfully gapped in prelim test: 94 Number of HSP's that attempted gapping in prelim test: 147000 Number of HSP's gapped (non-prelim): 1159 length of query: 98 length of database: 4,792,584,752 effective HSP length: 67 effective length of query: 31 effective length of database: 3,855,597,524 effective search space: 119523523244 effective search space used: 119523523244 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 76 (33.8 bits)