BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] (98 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done >gi|159185312|ref|NP_530540.1| hypothetical protein Atu2660 [Agrobacterium tumefaciens str. C58] gi|17741163|gb|AAL43641.1| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 183 Score = 114 bits (287), Expect = 3e-24, Method: Composition-based stats. Identities = 35/93 (37%), Positives = 55/93 (59%), Gaps = 2/93 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + I +E D H+K +V+A P+ GKANKA++ +LAKKL L K Sbjct: 92 VRLSVRLTPNGGRDAIDGVEQDADG--NAHLKARVSAVPEGGKANKALIVLLAKKLGLPK 149 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 SS+ +S +++ KI+ ID D ++ +L + Sbjct: 150 SSITFISGETARKKILRIDTDPEDFEKLFKKLA 182 >gi|116254232|ref|YP_770070.1| hypothetical protein RL4503 [Rhizobium leguminosarum bv. viciae 3841] gi|166227262|sp|Q1MAP9|Y4503_RHIL3 RecName: Full=UPF0235 protein RL4503 gi|115258880|emb|CAK09988.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 103 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 56/89 (62%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + + +E D +K +VTA P+KGKANKA++ ++AK L + KS Sbjct: 13 RLAVRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALMLLIAKSLRIPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ ++S +++ KI+ ID D +++ + L+ Sbjct: 71 SVSLVSGETARKKILRIDGDPEDLVKKLE 99 >gi|190893761|ref|YP_001980303.1| hypothetical protein RHECIAT_CH0004196 [Rhizobium etli CIAT 652] gi|226706142|sp|B3PQB3|Y4196_RHIE6 RecName: Full=UPF0235 protein RHECIAT_CH0004196 gi|190699040|gb|ACE93125.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 103 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 30/92 (32%), Positives = 58/92 (63%), Gaps = 2/92 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E + ++K +VTA P+KGKANKA++A+++K L ++KS Sbjct: 13 RLTVRLTPNGGRDAFDGIETGSEGET--YLKARVTAIPEKGKANKALIALVSKSLGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 S+ ++S +++ KI+ I+ D +++ + L+ Sbjct: 71 SITLVSGETARKKILRIEGDPEDLAKKLETLS 102 >gi|29839726|sp|Q8UC38|Y2660_AGRT5 RecName: Full=UPF0235 protein Atu2660 Length = 112 Score = 107 bits (267), Expect = 7e-22, Method: Composition-based stats. Identities = 35/93 (37%), Positives = 55/93 (59%), Gaps = 2/93 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + I +E D H+K +V+A P+ GKANKA++ +LAKKL L K Sbjct: 21 VRLSVRLTPNGGRDAIDGVEQDADG--NAHLKARVSAVPEGGKANKALIVLLAKKLGLPK 78 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 SS+ +S +++ KI+ ID D ++ +L + Sbjct: 79 SSITFISGETARKKILRIDTDPEDFEKLFKKLA 111 >gi|241206712|ref|YP_002977808.1| hypothetical protein Rleg_4028 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860602|gb|ACS58269.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 103 Score = 106 bits (266), Expect = 8e-22, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 55/89 (61%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + + +E D +K +VTA P+KGKANKA++ ++A+ L + KS Sbjct: 13 RLAVRLTPNGGRDALDGIE--ADGEGEAFLKARVTAVPEKGKANKALILLIAQSLRIPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ ++S ++ KI+ ID D +++ + L+ Sbjct: 71 SVSLISGDTARKKILRIDGDPEDLVKKLE 99 >gi|327188872|gb|EGE56064.1| hypothetical protein RHECNPAF_750023 [Rhizobium etli CNPAF512] Length = 103 Score = 106 bits (266), Expect = 9e-22, Method: Composition-based stats. Identities = 30/92 (32%), Positives = 59/92 (64%), Gaps = 2/92 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E D+ ++K +VTA P+KGKANKA++A+++K + ++KS Sbjct: 13 RLTVRLTPNGGRDAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSKSVGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 S+ ++S +++ KI+ I+ D +++ + L+ Sbjct: 71 SITLVSGETARKKILRIEGDPEDLAKKLETLS 102 >gi|218680333|ref|ZP_03528230.1| hypothetical protein RetlC8_16135 [Rhizobium etli CIAT 894] Length = 103 Score = 106 bits (266), Expect = 9e-22, Method: Composition-based stats. Identities = 30/91 (32%), Positives = 56/91 (61%), Gaps = 2/91 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + I +E D ++ +VT+ P+KGKANKA++ ++A+ L + KS Sbjct: 13 RLAVRLTPNGGRDAIDGIE--ADGEGETFLRARVTSVPEKGKANKALILLVAQSLRIPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S+ ++S +++ KI+ ID D +++ + L+ Sbjct: 71 SISLVSGETARKKILRIDGDPEDLAKKLETL 101 >gi|237807736|ref|YP_002892176.1| hypothetical protein Tola_0962 [Tolumonas auensis DSM 9187] gi|259710178|sp|C4LCM6|Y962_TOLAT RecName: Full=UPF0235 protein Tola_0962 gi|237499997|gb|ACQ92590.1| protein of unknown function DUF167 [Tolumonas auensis DSM 9187] Length = 96 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 8/91 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A + I +KI +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LDVYIQPKASRDQIQGWH-------GEELKIAITAPPVDGQANAHLIKFLAKQFKVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + + K + I +++ +L + Sbjct: 66 IVIHKGELGRHKTVRIT-SPQQLPAILDQSA 95 >gi|325294029|ref|YP_004279893.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3] gi|325061882|gb|ADY65573.1| hypothetical protein AGROH133_08898 [Agrobacterium sp. H13-3] Length = 112 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 33/93 (35%), Positives = 53/93 (56%), Gaps = 2/93 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + I +E D H+K +V+A P+ GKANKA++ +LAKK L K Sbjct: 21 IRLSVRLTPNGGRDAIDGVEQDADG--NAHLKARVSAVPEGGKANKALVILLAKKFGLPK 78 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 S + +S +++ KI+ ID D ++ L + + Sbjct: 79 SPITFISGETARKKILRIDTDPEDFETLFRKLE 111 >gi|222150063|ref|YP_002551020.1| hypothetical protein Avi_4153 [Agrobacterium vitis S4] gi|221737045|gb|ACM38008.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 104 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 31/94 (32%), Positives = 61/94 (64%), Gaps = 2/94 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + GI +++ + H+K++V+ P+KG+ANKA++A+LAK+L ++K Sbjct: 12 IRLAVRLTPNGGRDGIDGVDVNANGE--AHLKVRVSDVPEKGRANKALIALLAKRLGVAK 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 S++ ++S ++ KI+ ID D +++ L+ + Sbjct: 70 SAVSLISGDAARQKILRIDGDPEDLIGRLETITA 103 >gi|209551279|ref|YP_002283196.1| hypothetical protein Rleg2_3707 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|226705832|sp|B5ZTD4|Y3707_RHILW RecName: Full=UPF0235 protein Rleg2_3707 gi|209537035|gb|ACI56970.1| protein of unknown function DUF167 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 103 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 31/92 (33%), Positives = 55/92 (59%), Gaps = 2/92 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +RL PN + I E D ++K +VT P+KGKANKA++ ++AK L ++K Sbjct: 12 VRLAIRLTPNGGRDAIDGAET--DGEGEAYLKTRVTTVPEKGKANKALILLIAKSLGIAK 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SS+ ++S ++ KI+ ID D +++ + L+ Sbjct: 70 SSVSLVSGDTARKKILRIDGDPEDLGKKLETL 101 >gi|222087485|ref|YP_002546022.1| hypothetical protein Arad_4365 [Agrobacterium radiobacter K84] gi|221724933|gb|ACM28089.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 104 Score = 105 bits (262), Expect = 3e-21, Method: Composition-based stats. Identities = 34/92 (36%), Positives = 58/92 (63%), Gaps = 2/92 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL PN + + +E D ++K +V+A P+KGKANKA++A+LAK+L++ K Sbjct: 12 VRLSVRLTPNGGRDAVDGIETGADGE--AYLKARVSAVPEKGKANKALIALLAKRLSIPK 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SSL ++S ++ KI+ ID D +++ L+ Sbjct: 70 SSLSLISGDTARKKILRIDGDPEDLIGRLKAI 101 >gi|86359493|ref|YP_471385.1| hypothetical protein RHE_CH03912 [Rhizobium etli CFN 42] gi|123510540|sp|Q2K3C8|Y3912_RHIEC RecName: Full=UPF0235 protein RHE_CH03912 gi|86283595|gb|ABC92658.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 112 Score = 104 bits (261), Expect = 4e-21, Method: Composition-based stats. Identities = 28/89 (31%), Positives = 57/89 (64%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + ++ D+ ++ +VTA P+KGKANKA++A+++K + ++KS Sbjct: 22 RLTVRLTPNGGRDAFDGIDT--DSEGETYLGARVTAVPEKGKANKALIALVSKSVGVAKS 79 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ ++S +++ KI+ I+ D +++ L+ Sbjct: 80 SVSVISGETARKKILRIEGDPEDLARKLE 108 >gi|227823716|ref|YP_002827689.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234] gi|227342718|gb|ACP26936.1| hypothetical protein NGR_c32030 [Sinorhizobium fredii NGR234] Length = 105 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 32/91 (35%), Positives = 51/91 (56%), Gaps = 2/91 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + I EI D H+K++V A P+KGKAN A++ +LAK L+K+ Sbjct: 14 RLTVRLTPNGGRDAIDGFEIAADGE--EHLKVRVRAVPEKGKANDALIGLLAKAFGLAKN 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++S + KI+ I+ D + I + L Sbjct: 72 RIALVSGDTQRKKILRIEADPEAIQKRLTEI 102 >gi|116751493|ref|YP_848180.1| hypothetical protein Sfum_4080 [Syntrophobacter fumaroxidans MPOB] gi|116700557|gb|ABK19745.1| protein of unknown function DUF167 [Syntrophobacter fumaroxidans MPOB] Length = 105 Score = 104 bits (260), Expect = 5e-21, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 40/87 (45%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V L P A K+ A + +KI++TA P +G+ANK + LA +S+S Sbjct: 19 LDVYLQPRASKNEWAGMHQ-------GCLKIRLTAPPVEGEANKECVKFLAGAFGVSRSD 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ S K I I EI + Sbjct: 72 VEIIRGHKSRRKTILIRNSTPEILRAV 98 >gi|304313340|ref|YP_003812938.1| hypothetical protein HDN1F_37330 [gamma proteobacterium HdN1] gi|301799073|emb|CBL47316.1| Conserved hypothetical protein [gamma proteobacterium HdN1] Length = 108 Score = 104 bits (259), Expect = 5e-21, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 38/88 (43%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A G +KI++TA P G+AN ++ LAK + + Sbjct: 18 LHCYLQPRAANDGFVG-------EHGGRLKIRITAPPVDGQANAHLIRFLAKAFGVPQQQ 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +++ ++ K I I + +I + L+ Sbjct: 71 VQIEQGETGRSKRIRI-RTPSKIPQELR 97 >gi|218461695|ref|ZP_03501786.1| hypothetical protein RetlK5_20376 [Rhizobium etli Kim 5] Length = 103 Score = 103 bits (258), Expect = 7e-21, Method: Composition-based stats. Identities = 29/92 (31%), Positives = 59/92 (64%), Gaps = 2/92 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E D+ ++K +VTA P+KGKANKA++A++++ + ++KS Sbjct: 13 RLTVRLTPNGGRGAFDGIET--DSEGETYLKARVTAVPEKGKANKALIALVSQSVGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 S+ ++S +++ KI+ I+ D +++ + L+ Sbjct: 71 SVSLVSGETARKKILRIEGDPEDLAQKLEKLS 102 >gi|292493759|ref|YP_003529198.1| hypothetical protein Nhal_3796 [Nitrosococcus halophilus Nc4] gi|291582354|gb|ADE16811.1| protein of unknown function DUF167 [Nitrosococcus halophilus Nc4] Length = 102 Score = 103 bits (258), Expect = 8e-21, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P A I +K+++TA P +GKAN ++ LAK +SKS Sbjct: 15 IQIRLQPRASCDEIIG-------PHGDRLKVRITAPPVEGKANADLIRFLAKTFRVSKSQ 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 +R+LS + K + I+K K + Sbjct: 68 VRLLSGATGRDKRVCIEKPAKLLP 91 >gi|189423484|ref|YP_001950661.1| hypothetical protein Glov_0413 [Geobacter lovleyi SZ] gi|189419743|gb|ACD94141.1| protein of unknown function DUF167 [Geobacter lovleyi SZ] Length = 101 Score = 103 bits (257), Expect = 9e-21, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A ++ + +K+++T+ P G AN+ LAK+L + KS+ Sbjct: 18 LRVFVQPRASRNQFCGIH-------EGELKLRLTSPPVDGAANECCREFLAKQLKVPKSA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++S SS K + I + E L Sbjct: 71 VTLISGDSSRHKRLRIAGATTQQIEQL 97 >gi|188581339|ref|YP_001924784.1| hypothetical protein Mpop_2087 [Methylobacterium populi BJ001] gi|259646581|sp|B1ZLG8|Y2087_METPB RecName: Full=UPF0235 protein Mpop_2087 gi|179344837|gb|ACB80249.1| protein of unknown function DUF167 [Methylobacterium populi BJ001] Length = 110 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 46/86 (53%), Gaps = 2/86 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + D + ++V A P +G AN A+ A +AK L L KS Sbjct: 15 LAVRLTPRAGRTGLDGVRTEPDGRP--ILCLRVAAPPVEGAANAALTAFVAKSLGLRKSE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + ++S +++ K +++ D + + Sbjct: 73 VTLVSGETARTKRLHLSGDPQALAAR 98 >gi|146309167|ref|YP_001189632.1| hypothetical protein Pmen_4153 [Pseudomonas mendocina ymp] gi|205829317|sp|A4XZY4|Y4153_PSEMY RecName: Full=UPF0235 protein Pmen_4153 gi|145577368|gb|ABP86900.1| protein of unknown function DUF167 [Pseudomonas mendocina ymp] Length = 98 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A K A L +KI++TA P +GKAN +LA LAK ++K+ Sbjct: 13 LDCHLQPKASKDEFAGLH-------GERLKIRLTAPPVEGKANAHLLAFLAKAFGVAKAQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + S + + K + I Sbjct: 66 VSLESGELNRHKRLRIHAPQ 85 >gi|113969534|ref|YP_733327.1| hypothetical protein Shewmr4_1190 [Shewanella sp. MR-4] gi|123130613|sp|Q0HKZ7|Y1190_SHESM RecName: Full=UPF0235 protein Shewmr4_1190 gi|113884218|gb|ABI38270.1| conserved hypothetical protein [Shewanella sp. MR-4] Length = 96 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + +L + K + I EI LL+ Sbjct: 66 VHILKGELGRHKQVRISAPKNVPAEIATLLE 96 >gi|167630164|ref|YP_001680663.1| conserved hypothetical protein, uncharacterized acr, yggu family [Heliobacterium modesticaldum Ice1] gi|259646567|sp|B0TGP1|Y2027_HELMI RecName: Full=UPF0235 protein Helmi_20270 gi|167592904|gb|ABZ84652.1| conserved hypothetical protein, uncharacterized acr, yggu family [Heliobacterium modesticaldum Ice1] Length = 96 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 48/90 (53%), Gaps = 8/90 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +R+ P A K+ + L +K+++TA P G+AN A L +AK L LS+ Sbjct: 12 IRFRIRVQPRASKNEVCGL-------LDDALKVRLTAPPVDGEANAACLQFIAKTLGLSR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S +R+++ ++S LK + ++ +++ + Sbjct: 65 SQVRLVAGETSRLKTLEVEGVSAEDLRKRF 94 >gi|226942467|ref|YP_002797540.1| hypothetical protein Avin_03050 [Azotobacter vinelandii DJ] gi|259646933|sp|C1DI68|Y305_AZOVD RecName: Full=UPF0235 protein Avin_03050 gi|226717394|gb|ACO76565.1| conserved hypothetical protein [Azotobacter vinelandii DJ] Length = 99 Score = 102 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 25/79 (31%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A K A L +KI++TA P +GKAN +LA LA + KS Sbjct: 13 LACHLQPKASKDEFAGLH-------GERLKIRLTAPPVEGKANAHLLAFLAGVFGVPKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + S +S+ K + I + Sbjct: 66 VSLESGESNRQKRVRIRRP 84 >gi|117926928|ref|YP_867545.1| hypothetical protein Mmc1_3654 [Magnetococcus sp. MC-1] gi|166990883|sp|A0LDU6|Y3654_MAGSM RecName: Full=UPF0235 protein Mmc1_3654 gi|117610684|gb|ABK46139.1| protein of unknown function DUF167 [Magnetococcus sp. MC-1] Length = 98 Score = 102 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 7/88 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ +R+ P A + + + +K+ + A P G ANKA+ LAK+L ++K Sbjct: 12 HLTIRVQPKAAQERVMGWQ-------GEQLKVALNAPPVDGAANKALCHFLAKQLGIAKG 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + S K + I I + Sbjct: 65 QVTLVRGEKSREKQLVIQGISPSIWQQF 92 >gi|117919640|ref|YP_868832.1| hypothetical protein Shewana3_1191 [Shewanella sp. ANA-3] gi|166228912|sp|A0KUF7|Y1191_SHESA RecName: Full=UPF0235 protein Shewana3_1191 gi|117611972|gb|ABK47426.1| conserved hypothetical protein [Shewanella sp. ANA-3] Length = 96 Score = 102 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQ 91 + +L + K + I EI LL+ Sbjct: 66 VHILKGELGRHKQVRISPPKNVPAEIATLLE 96 >gi|24374867|ref|NP_718910.1| hypothetical protein SO_3356 [Shewanella oneidensis MR-1] gi|29839709|sp|Q8EBY9|Y3356_SHEON RecName: Full=UPF0235 protein SO_3356 gi|24349562|gb|AAN56354.1|AE015772_14 conserved hypothetical protein TIGR00251 [Shewanella oneidensis MR-1] Length = 96 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + +L + K + I+ EI+ LL+ Sbjct: 66 VHILKGELGRHKQVRINAPKSVPAEISALLE 96 >gi|146293784|ref|YP_001184208.1| hypothetical protein Sputcn32_2690 [Shewanella putrefaciens CN-32] gi|166228432|sp|A4Y8X6|Y2690_SHEPC RecName: Full=UPF0235 protein Sputcn32_2690 gi|145565474|gb|ABP76409.1| protein of unknown function DUF167 [Shewanella putrefaciens CN-32] gi|319427156|gb|ADV55230.1| protein of unknown function DUF167 [Shewanella putrefaciens 200] Length = 96 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNVYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + ++ + K I I E++ELL+ Sbjct: 66 VYIIKGELGRHKQIRIVTPKLIPPEVSELLE 96 >gi|70733126|ref|YP_262899.1| hypothetical protein PFL_5841 [Pseudomonas fluorescens Pf-5] gi|123652292|sp|Q4K4D4|Y5841_PSEF5 RecName: Full=UPF0235 protein PFL_5841 gi|68347425|gb|AAY95031.1| conserved hypothetical protein TIGR00251 [Pseudomonas fluorescens Pf-5] Length = 97 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A A L +KI++TA P +GKAN ++A LAK + KS+ Sbjct: 13 LDCHLQPKASSDEFAGLH-------GERLKIRLTAPPVEGKANAHLMAFLAKAFGIPKSN 65 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + ++S + + K + + K Sbjct: 66 VSLVSGELNRQKRVRLQAPKK 86 >gi|114046767|ref|YP_737317.1| hypothetical protein Shewmr7_1261 [Shewanella sp. MR-7] gi|123131606|sp|Q0HX95|Y1261_SHESR RecName: Full=UPF0235 protein Shewmr7_1261 gi|113888209|gb|ABI42260.1| conserved hypothetical protein [Shewanella sp. MR-7] Length = 96 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + +L + K++ I EI LL+ Sbjct: 66 VHILKGELGRHKLVRISAPKNVPAEIATLLE 96 >gi|330505393|ref|YP_004382262.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01] gi|328919679|gb|AEB60510.1| hypothetical protein MDS_4479 [Pseudomonas mendocina NK-01] Length = 96 Score = 101 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A K A L +KI++TA P GKAN + A LAK ++KS Sbjct: 13 LDCHLQPKASKDEFAGLH-------GERLKIRLTAPPVDGKANAHLQAFLAKAFGVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + S + + K + I Sbjct: 66 VILESGELNRQKRLRIRAPQ 85 >gi|114564027|ref|YP_751541.1| hypothetical protein Sfri_2863 [Shewanella frigidimarina NCIMB 400] gi|122299086|sp|Q07Z62|Y2863_SHEFN RecName: Full=UPF0235 protein Sfri_2863 gi|114335320|gb|ABI72702.1| conserved hypothetical protein [Shewanella frigidimarina NCIMB 400] Length = 104 Score = 101 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P A + I L +KI +TA P GKAN + LAK ++KS Sbjct: 17 LFVYVQPKASRDQIVGL-------YGNELKIAITAPPIDGKANAYLSKYLAKACKVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + ++ + K I I + EI LL Sbjct: 70 VHIIKGEQGRHKQIRISQPQVIPPEIAALLS 100 >gi|120598144|ref|YP_962718.1| hypothetical protein Sputw3181_1321 [Shewanella sp. W3-18-1] gi|166200340|sp|A1RHL9|Y1321_SHESW RecName: Full=UPF0235 protein Sputw3181_1321 gi|120558237|gb|ABM24164.1| protein of unknown function DUF167 [Shewanella sp. W3-18-1] Length = 96 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKAFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + ++ + K I I E++ELL+ Sbjct: 66 VYIIKGELGRHKQIRIVTPKLIPPEVSELLE 96 >gi|116623504|ref|YP_825660.1| hypothetical protein Acid_4414 [Candidatus Solibacter usitatus Ellin6076] gi|116226666|gb|ABJ85375.1| protein of unknown function DUF167 [Candidatus Solibacter usitatus Ellin6076] Length = 88 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 44/93 (47%), Gaps = 8/93 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + VR+ P A++S I K+ + A P GKAN + LA + Sbjct: 1 MARLTVRVHPRARRSEITG-------RLGDAWKLALAAPPVDGKANDECVRFLAGWAGVP 53 Query: 61 KSSLRMLSKQSSPLKIIYIDKDC-KEITELLQN 92 +S +R+++ +S +K++ I+ +++ L+ Sbjct: 54 RSRVRIVTGLTSRIKVVEIEGVPQEDLERRLKA 86 >gi|91975173|ref|YP_567832.1| hypothetical protein RPD_0693 [Rhodopseudomonas palustris BisB5] gi|91681629|gb|ABE37931.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisB5] Length = 107 Score = 100 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 31/92 (33%), Positives = 50/92 (54%), Gaps = 2/92 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D +K++V A G+AN+A++ +LAK L + K + Sbjct: 13 VAVRVTPRGGRDEIDGLETLSDGRP--VVKVRVRAIADGGEANRAVIELLAKSLGVPKRN 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 +R+LS +S K I ID D ++ E L+ + Sbjct: 71 VRLLSGATSRQKQIAIDGDPTKLGEALRRLTA 102 >gi|77166447|ref|YP_344972.1| hypothetical protein Noc_3000 [Nitrosococcus oceani ATCC 19707] gi|254435182|ref|ZP_05048689.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani AFC27] gi|123593242|sp|Q3J6V4|Y3000_NITOC RecName: Full=UPF0235 protein Noc_3000 gi|76884761|gb|ABA59442.1| Conserved hypothetical protein 251 [Nitrosococcus oceani ATCC 19707] gi|207088293|gb|EDZ65565.1| conserved hypothetical protein TIGR00251 [Nitrosococcus oceani AFC27] Length = 102 Score = 100 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 26/84 (30%), Positives = 43/84 (51%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P AK + +KI++TA P +GKAN +L LAK +S++ Sbjct: 15 IQIRLQPRAKGDEVIG-------PHGDRLKIRITAPPVEGKANTHLLRFLAKTFQVSRNQ 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + +LS +S K + I+K K + Sbjct: 68 VYLLSGATSRDKRVRIEKPTKLLP 91 >gi|126175230|ref|YP_001051379.1| hypothetical protein Sbal_3028 [Shewanella baltica OS155] gi|153001556|ref|YP_001367237.1| hypothetical protein Shew185_3043 [Shewanella baltica OS185] gi|160876292|ref|YP_001555608.1| hypothetical protein Sbal195_3186 [Shewanella baltica OS195] gi|304410074|ref|ZP_07391693.1| protein of unknown function DUF167 [Shewanella baltica OS183] gi|307302214|ref|ZP_07581972.1| protein of unknown function DUF167 [Shewanella baltica BA175] gi|166229359|sp|A3D6Z7|Y3028_SHEB5 RecName: Full=UPF0235 protein Sbal_3028 gi|166229364|sp|A6WQT5|Y3043_SHEB8 RecName: Full=UPF0235 protein Shew185_3043 gi|189039841|sp|A9KXP6|Y3186_SHEB9 RecName: Full=UPF0235 protein Sbal195_3186 gi|125998435|gb|ABN62510.1| protein of unknown function DUF167 [Shewanella baltica OS155] gi|151366174|gb|ABS09174.1| protein of unknown function DUF167 [Shewanella baltica OS185] gi|160861814|gb|ABX50348.1| protein of unknown function DUF167 [Shewanella baltica OS195] gi|304351483|gb|EFM15882.1| protein of unknown function DUF167 [Shewanella baltica OS183] gi|306914252|gb|EFN44673.1| protein of unknown function DUF167 [Shewanella baltica BA175] gi|315268481|gb|ADT95334.1| protein of unknown function DUF167 [Shewanella baltica OS678] Length = 99 Score = 100 bits (251), Expect = 5e-20, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + K I + D K I ++ Sbjct: 66 IHIMKGELGRHKQIRVI-DPKIIPSII 91 >gi|329849894|ref|ZP_08264740.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19] gi|328841805|gb|EGF91375.1| hypothetical protein ABI_27900 [Asticcacaulis biprosthecum C19] Length = 86 Score = 100 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 28/83 (33%), Positives = 44/83 (53%), Gaps = 2/83 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P + I +D +K++V A P +G+AN+A++ LAK L + K Sbjct: 1 MRLAVRLTPRSSADAIDGW--GEDEQGRRFLKVRVRAAPIEGRANEALIVFLAKTLGVPK 58 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84 S L +++ +S LK I ID D Sbjct: 59 SRLSLVAGDTSRLKQIEIDGDVD 81 >gi|121602415|ref|YP_988648.1| hypothetical protein BARBAKC583_0328 [Bartonella bacilliformis KC583] gi|120614592|gb|ABM45193.1| conserved hypothetical protein TIGR00251 [Bartonella bacilliformis KC583] Length = 107 Score = 100 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 48/94 (51%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + + + S K + ++I ++L + + T Sbjct: 70 ISLENGAISRYKQLRFSGGVEKIEKILHSLGNYT 103 >gi|86747236|ref|YP_483732.1| hypothetical protein RPB_0109 [Rhodopseudomonas palustris HaA2] gi|123293376|sp|Q2J3Y9|Y109_RHOP2 RecName: Full=UPF0235 protein RPB_0109 gi|86570264|gb|ABD04821.1| Protein of unknown function DUF167 [Rhodopseudomonas palustris HaA2] Length = 108 Score = 100 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 32/92 (34%), Positives = 50/92 (54%), Gaps = 2/92 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D +K++V A G+AN+A++ +LAK L + K + Sbjct: 14 VAVRVTPRGDRDEIDGLETLSDGRP--VVKLRVRAIADGGEANRAVIELLAKALGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 +R+LS +S K I ID D K + E L+ + Sbjct: 72 VRLLSGATSRQKQIAIDGDPKSLGETLRQLTA 103 >gi|1388023|gb|AAB88056.1| putative [Bartonella bacilliformis] Length = 109 Score = 100 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 48/94 (51%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 14 LFVRLTPKASMNNIVGVESRDDGKQ--YLIIRLCAVPEDGKANKALIKFLAKQWKIPSSC 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + + + S K + ++I ++L + + T Sbjct: 72 ISLENGAISRYKQLRFSGGVEKIEKILHSLGNYT 105 >gi|217972515|ref|YP_002357266.1| hypothetical protein Sbal223_1335 [Shewanella baltica OS223] gi|254800042|sp|B8E927|Y1335_SHEB2 RecName: Full=UPF0235 protein Sbal223_1335 gi|217497650|gb|ACK45843.1| protein of unknown function DUF167 [Shewanella baltica OS223] Length = 99 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 38/87 (43%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK + KS Sbjct: 13 LNLYIQPKASRDQIVGLH-------GDELKVAITAPPIDGKANAHLSKYLAKTFKVPKSD 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + K I + D K I ++ Sbjct: 66 IHIMKGELGRHKQIRVI-DPKIIPSVI 91 >gi|157960952|ref|YP_001500986.1| hypothetical protein Spea_1124 [Shewanella pealeana ATCC 700345] gi|157845952|gb|ABV86451.1| protein of unknown function DUF167 [Shewanella pealeana ATCC 700345] Length = 90 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + L+K + K Sbjct: 8 LQLYIQPKASRDQIVGLH-------GEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGD 60 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + ++ Q K + I E+ LL Sbjct: 61 IDIIKGQMGRHKQVRITAPKLIPAEVQNLL 90 >gi|193213756|ref|YP_001994955.1| hypothetical protein Ctha_0036 [Chloroherpeton thalassium ATCC 35110] gi|193087233|gb|ACF12508.1| protein of unknown function DUF167 [Chloroherpeton thalassium ATCC 35110] Length = 97 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 7/83 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A K+ I +KI++ A P + ANKA + LAK ++K Sbjct: 11 VDFSVRLQPRASKNEIVG-------EYDGALKIRIAAPPVENAANKACIEFLAKTFGIAK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84 S + +LS +S K+I I K Sbjct: 64 SQVEILSGDTSRNKLIRIYGIDK 86 >gi|212636425|ref|YP_002312950.1| cytosolic protein [Shewanella piezotolerans WP3] gi|212557909|gb|ACJ30363.1| Cytosolic protein, putative [Shewanella piezotolerans WP3] Length = 90 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + L+K + K Sbjct: 8 LQLYVQPKASRDQIVGLH-------GNELKIAITAPPIDGKANIHLAKYLSKAFKVPKGD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + +L + K + I K I +QN Sbjct: 61 IDILKGLTGRHKQVLIS-SPKVIPPEIQNL 89 >gi|111075036|gb|ABH04883.1| uncharacterized conserved protein yggY [Heliobacillus mobilis] Length = 101 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 25/98 (25%), Positives = 50/98 (51%), Gaps = 8/98 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +R+ P A K+ + L +K+++TA P G+AN A A AK L+L K Sbjct: 11 VRFKIRVQPRASKNEVCGL-------LDDALKVRLTAPPVDGEANGACQAFFAKTLSLPK 63 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSLTL 98 S +R+++ ++S K + + ++I +L + + ++ Sbjct: 64 SQVRLVAGETSRTKTVEVIGVSKEQILKLFDSKQTSSV 101 >gi|39995970|ref|NP_951921.1| hypothetical protein GSU0864 [Geobacter sulfurreducens PCA] gi|39982735|gb|AAR34194.1| conserved hypothetical protein TIGR00251 [Geobacter sulfurreducens PCA] gi|307634758|gb|ADI83708.2| protein of unknown function DUF167 [Geobacter sulfurreducens KN400] Length = 107 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 8/89 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V + P A ++ I ++ +K+++T+ P +G+AN+ + LAK+L + KS + Sbjct: 23 SVHVQPRASRNEICGVQ-------GEAIKLRLTSPPVEGEANRLCVEFLAKRLGVPKSCV 75 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQN 92 +++ + S K I + D + LL+N Sbjct: 76 AIIAGEKSRHKTIRVSGSDAAAVLALLEN 104 >gi|77461543|ref|YP_351050.1| hypothetical protein Pfl01_5322 [Pseudomonas fluorescens Pf0-1] gi|123602932|sp|Q3K595|Y5322_PSEPF RecName: Full=UPF0235 protein Pfl01_5322 gi|77385546|gb|ABA77059.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1] Length = 96 Score = 100 bits (249), Expect = 9e-20, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 8/86 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A+ L +KI++TA P +GKAN ++ LAK +SKS Sbjct: 13 LECHLQPAARSDDFCGLH-------GDRLKIRLTAPPVEGKANAYLMGFLAKAFGVSKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + +LS + + K + I K++ +L Sbjct: 66 VSLLSGELNRQKRVRI-GAPKKLPDL 90 >gi|182413313|ref|YP_001818379.1| hypothetical protein Oter_1495 [Opitutus terrae PB90-1] gi|177840527|gb|ACB74779.1| protein of unknown function DUF167 [Opitutus terrae PB90-1] Length = 90 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 8/94 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + ++ IPNA ++ I +K+KV A P +G+AN+ + LA +L L + Sbjct: 4 CTIAIKAIPNAPRNQIVGW-------LGDALKVKVHAPPLEGRANEELCEFLADELGLPR 56 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94 ++ +L +S K++ I+ D ++ L + D Sbjct: 57 RAVSVLRGDTSRQKLVQIEGLDLAQLKAKLSSLD 90 >gi|330812367|ref|YP_004356829.1| hypothetical protein PSEBR_a5324 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327380475|gb|AEA71825.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 97 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 8/86 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P A+ A L +KI++TA P +GKAN ++A LAK +SKS Sbjct: 13 LECHLQPAARSDDFAGLH-------GDRLKIRLTAPPVEGKANAYLMAFLAKAFGVSKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + ++S + + K + I K++ EL Sbjct: 66 VSLVSGELNRQKRVRIH-SPKKLPEL 90 >gi|294139811|ref|YP_003555789.1| hypothetical protein SVI_1040 [Shewanella violacea DSS12] gi|293326280|dbj|BAJ01011.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 100 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +KI +TA P GKAN + L+K + K Sbjct: 13 LNLYIQPKASRDKIIGVH-------GNELKIAITAPPVDGKANAHLTKFLSKAFKVPKGD 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K + I ++I +LL Sbjct: 66 IIIHKGELGRHKQVEILTPRVIPEQIADLL 95 >gi|297620780|ref|YP_003708917.1| yggU family protein [Waddlia chondrophila WSU 86-1044] gi|297376081|gb|ADI37911.1| yggU family protein [Waddlia chondrophila WSU 86-1044] Length = 94 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 48/90 (53%), Gaps = 7/90 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+LIPNA K I+ E +K+++TA P+KGKAN ++ LA +L +SK Sbjct: 9 VVLPVKLIPNAGKDEISGWE-------NGILKVRITAVPEKGKANAHLIKFLASQLKVSK 61 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S + ++ + + K + I D + E L Sbjct: 62 SDITLIKGEKNRHKTLLIKGDADVVGERLS 91 >gi|119775613|ref|YP_928353.1| hypothetical protein Sama_2480 [Shewanella amazonensis SB2B] gi|189039696|sp|A1S8H7|Y2480_SHEAM RecName: Full=UPF0235 protein Sama_2480 gi|119768113|gb|ABM00684.1| conserved hypothetical protein [Shewanella amazonensis SB2B] Length = 95 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P GKAN + +LAK + K Sbjct: 13 LALYVQPKASRDELVGLH-------GEELKLAITAPPVDGKANAHICKLLAKAFKVPKGK 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K++ I + + L Sbjct: 66 VSIERGELGRHKLVRIQAPEIIPDDFAQFL 95 >gi|83313474|ref|YP_423738.1| hypothetical protein amb4375 [Magnetospirillum magneticum AMB-1] gi|82948315|dbj|BAE53179.1| Uncharacterized conserved protein [Magnetospirillum magneticum AMB-1] Length = 108 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 27/88 (30%), Positives = 46/88 (52%), Gaps = 2/88 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V VRL P A + I D + + +K +VTA P+ GKAN A+L +L+K + KS Sbjct: 14 KVAVRLTPKASRDRINGPAAEADGA--VVLKAQVTAVPEDGKANAALLKLLSKAWKIPKS 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ + K+I I + +++ L Sbjct: 72 DMDIVLGATDRRKVILISGETEDLRHRL 99 >gi|254780231|ref|YP_003064644.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] gi|254039908|gb|ACT56704.1| hypothetical protein CLIBASIA_00580 [Candidatus Liberibacter asiaticus str. psy62] Length = 98 Score = 99.4 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS Sbjct: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL Sbjct: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 >gi|119899753|ref|YP_934966.1| hypothetical protein azo3464 [Azoarcus sp. BH72] gi|166232594|sp|A1KB74|Y3464_AZOSB RecName: Full=UPF0235 protein azo3464 gi|119672166|emb|CAL96080.1| conserved hypothetical protein [Azoarcus sp. BH72] Length = 98 Score = 99.4 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 45/90 (50%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A+++G A L MKI++ A P GKAN A+ A LA + KS+ Sbjct: 15 LTLHIQPGARQTGFAGLH-------GEAMKIRLAAPPVDGKANAALCAFLADFCEVPKSA 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++S ++S K + ++ + L+ Sbjct: 68 VTLVSGETSRAKRVRVETKTPGLAGRLRAL 97 >gi|254561320|ref|YP_003068415.1| hypothetical protein METDI2902 [Methylobacterium extorquens DM4] gi|254268598|emb|CAX24557.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium extorquens DM4] Length = 105 Score = 99.0 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGRP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++S ++S K +++ D + + ++ Sbjct: 73 VTLVSGETSRTKRLHLSGDPQMLAARVEA 101 >gi|167623103|ref|YP_001673397.1| hypothetical protein Shal_1169 [Shewanella halifaxensis HAW-EB4] gi|167353125|gb|ABZ75738.1| protein of unknown function DUF167 [Shewanella halifaxensis HAW-EB4] Length = 90 Score = 98.7 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 35/90 (38%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + L+K + K Sbjct: 8 LQLYIQPKASRDQIVGLH-------GEEIKIAITAPPVDGKANAHLTKYLSKAFKVPKGD 60 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + Q K + I E+ LL Sbjct: 61 IEITKGQMGRHKQVKITAPKLIPAEVQNLL 90 >gi|163851553|ref|YP_001639596.1| hypothetical protein Mext_2130 [Methylobacterium extorquens PA1] gi|259646591|sp|A9W4L9|Y2130_METEP RecName: Full=UPF0235 protein Mext_2130 gi|163663158|gb|ABY30525.1| protein of unknown function DUF167 [Methylobacterium extorquens PA1] Length = 105 Score = 98.7 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGRP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +LS ++S K + + D + + ++ Sbjct: 73 VTLLSGETSRTKRLLLSGDPQTLAARVEA 101 >gi|240138720|ref|YP_002963192.1| hypothetical protein MexAM1_META1p2120 [Methylobacterium extorquens AM1] gi|240008689|gb|ACS39915.1| conserved hypothetical protein,UPF0235 protein [Methylobacterium extorquens AM1] Length = 105 Score = 98.7 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGQP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +LS ++S K +++ D + + ++ Sbjct: 73 VTLLSGETSRTKRLHLSGDPQTLAARVEA 101 >gi|71909498|ref|YP_287085.1| hypothetical protein Daro_3887 [Dechloromonas aromatica RCB] gi|123626353|sp|Q478W6|Y3887_DECAR RecName: Full=UPF0235 protein Daro_3887 gi|71849119|gb|AAZ48615.1| Conserved hypothetical protein 251 [Dechloromonas aromatica RCB] Length = 97 Score = 98.7 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 27/89 (30%), Positives = 45/89 (50%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AKKS A L +KI++ A P GKAN+A++ +A L L+KS+ Sbjct: 15 LTLHIQPGAKKSEFAGLH-------GDALKIRLAAPPVDGKANEALIRFIADALGLAKSA 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S Q+S K++ I L + Sbjct: 68 VHLKSGQTSRRKVLEILGTSTTTIAGLAD 96 >gi|144900433|emb|CAM77297.1| protein containing DUF167 [Magnetospirillum gryphiswaldense MSR-1] Length = 107 Score = 98.7 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 28/88 (31%), Positives = 45/88 (51%), Gaps = 2/88 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +RL P ++ I L D + +K VTA P+ GKAN A++ MLAK+ ++KS Sbjct: 14 RVFIRLTPKGSRNKIDGLAAEADG--GMVLKASVTAVPEDGKANAALIKMLAKEWRVAKS 71 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 +++ + K + I D E+ L Sbjct: 72 DFEIVAGATDRRKTVLISGDGAEMAARL 99 >gi|262273743|ref|ZP_06051556.1| hypothetical protein VHA_000718 [Grimontia hollisae CIP 101886] gi|262222158|gb|EEY73470.1| hypothetical protein VHA_000718 [Grimontia hollisae CIP 101886] Length = 101 Score = 98.7 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 38/92 (41%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + +K+ +TA P GKAN + L+K+ ++KS Sbjct: 16 LRLYIQPKASRDQWMGQH-------GEEIKLAITAPPVDGKANAHLTKFLSKQFKVAKSQ 68 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQN 92 + + + K + I + I++LL Sbjct: 69 IDIEKGELGRHKQVRIHRPVLCPDAISDLLPA 100 >gi|134298585|ref|YP_001112081.1| hypothetical protein Dred_0717 [Desulfotomaculum reducens MI-1] gi|189040591|sp|A4J2F3|Y717_DESRM RecName: Full=UPF0235 protein Dred_0717 gi|134051285|gb|ABO49256.1| protein of unknown function DUF167 [Desulfotomaculum reducens MI-1] Length = 94 Score = 98.7 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 41/89 (46%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V VR+ P A K+ +A +K+++TA P G AN+A + ++K Sbjct: 11 VVVKVRVQPRASKNSLAG-------EMEGALKVRLTAPPVDGAANEACCKFFGELFGVAK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S + +++ + K+++I ++ + Sbjct: 64 SKVEIIAGHTGRNKLVHIQGVTEKQARFI 92 >gi|145300496|ref|YP_001143337.1| hypothetical protein ASA_3628 [Aeromonas salmonicida subsp. salmonicida A449] gi|166232617|sp|A4SRR7|Y3628_AERS4 RecName: Full=UPF0235 protein ASA_3628 gi|142853268|gb|ABO91589.1| conserved hypothetical protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 99 Score = 98.7 bits (245), Expect = 3e-19, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 13 LHLMIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQNN 93 +R++ + K + I+ EI LL+ Sbjct: 66 VRIVRGELGRHKTVAIESPRQIPAEIHALLETQ 98 >gi|315497739|ref|YP_004086543.1| hypothetical protein Astex_0706 [Asticcacaulis excentricus CB 48] gi|315415751|gb|ADU12392.1| protein of unknown function DUF167 [Asticcacaulis excentricus CB 48] Length = 89 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 52/89 (58%), Gaps = 2/89 (2%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M ++VRL P A + ++ D ++K++VTA P +G+AN+A++A LAK+L L Sbjct: 1 MARLVVRLTPKAAADRVDGWDM--DEQGRPYLKVRVTAPPIEGRANEALIAFLAKRLKLP 58 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITEL 89 KS L +L+ SS LK I ++ + + Sbjct: 59 KSRLSLLAGDSSRLKQIEVEGFDEAALKA 87 >gi|218530362|ref|YP_002421178.1| hypothetical protein Mchl_2407 [Methylobacterium chloromethanicum CM4] gi|259646623|sp|B7KZL8|Y2407_METC4 RecName: Full=UPF0235 protein Mchl_2407 gi|218522665|gb|ACK83250.1| protein of unknown function DUF167 [Methylobacterium chloromethanicum CM4] Length = 105 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + ++V A P +G AN A+ A +AK L L K+ Sbjct: 15 LAVRLTPRASRTGLDGVRTEASGRP--VLSLRVAAPPVEGAANAALTAFVAKSLGLRKAE 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + +LS ++S K +++ D + + ++ Sbjct: 73 VTLLSGETSRTKRLHLSGDPQMLAARVEA 101 >gi|153874132|ref|ZP_02002465.1| conserved hypothetical protein [Beggiatoa sp. PS] gi|152069404|gb|EDN67535.1| conserved hypothetical protein [Beggiatoa sp. PS] Length = 91 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 38/81 (46%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P + ++ + + +K+K+ A P GKAN + + +K+ ++KS Sbjct: 13 LSVHVQPRSSQTSVVGVH-------GDRLKVKIMAAPVDGKANAEVCKLFSKQFGVAKSK 65 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + + +S K I I K Sbjct: 66 IIIENGHTSRDKRICIKSPQK 86 >gi|49474981|ref|YP_033022.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1] gi|49237786|emb|CAF26979.1| hypothetical protein BH01670 [Bartonella henselae str. Houston-1] Length = 111 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 48/91 (52%), Gaps = 2/91 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V +IP A I +E D H+ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVYIIPKASGDKIMGIECKNDGKR--HLVIRLRAIPENGKANKALIKFLAKQWKIPSSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + S +S K +Y +++ E+L+ + Sbjct: 70 ISLKSGGTSRYKQLYFSGYLEKLKEILRALE 100 >gi|260773605|ref|ZP_05882521.1| hypothetical protein VIB_002079 [Vibrio metschnikovii CIP 69.14] gi|260612744|gb|EEX37947.1| hypothetical protein VIB_002079 [Vibrio metschnikovii CIP 69.14] Length = 97 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 36/90 (40%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + +KI +TA P GKAN + LAK +SKS+ Sbjct: 13 LRIYVQPKASRDSLVGQH-------GDELKIAITAPPVDGKANAHLSRYLAKLCKVSKSA 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + Q K + I I +Q Sbjct: 66 VEIEKGQLGRHKQVRIL-SPTVIPAEIQAL 94 >gi|49473825|ref|YP_031867.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse] gi|49239328|emb|CAF25660.1| hypothetical protein BQ01570 [Bartonella quintana str. Toulouse] Length = 114 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 29/94 (30%), Positives = 49/94 (52%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP A I +E D + I++ P+ GKANKA++ LAK+ + S Sbjct: 12 LFVYLIPKASVDKIIGVECRDDGKQ--RLVIRLRTLPENGKANKALIKFLAKQWKIPSSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + + S ++S K +Y +E+ E+LQ+ + T Sbjct: 70 ISLKSGETSRYKQLYFSGYLQEVGEILQSLYTST 103 >gi|316931646|ref|YP_004106628.1| hypothetical protein Rpdx1_0252 [Rhodopseudomonas palustris DX-1] gi|315599360|gb|ADU41895.1| protein of unknown function DUF167 [Rhodopseudomonas palustris DX-1] Length = 108 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 31/88 (35%), Positives = 50/88 (56%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D +K++V A G+AN+A+ +LAK + + K + Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRP--VLKVRVRAIADGGEANRAVTELLAKAVGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +R+LS +S K I ID D K++ E+L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEVLR 99 >gi|90421586|ref|YP_529956.1| hypothetical protein RPC_0058 [Rhodopseudomonas palustris BisB18] gi|122477773|sp|Q21D99|Y058_RHOPB RecName: Full=UPF0235 protein RPC_0058 gi|90103600|gb|ABD85637.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisB18] Length = 107 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 29/94 (30%), Positives = 52/94 (55%), Gaps = 2/94 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +V +R+ P + I LE + +K++V A + G+AN+A+ +LAK L + K Sbjct: 12 ISVALRVTPRGGRDDIDGLETLANG--RTVVKVRVRAIAEGGEANRAVTELLAKALGVPK 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 ++R+LS +S LK + +D D E+ E L+ + Sbjct: 70 RAVRVLSGTTSRLKQVAVDGDPNELGEALRKLTA 103 >gi|114566491|ref|YP_753645.1| hypothetical protein Swol_0959 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|122318444|sp|Q0AYD0|Y959_SYNWW RecName: Full=UPF0235 protein Swol_0959 gi|114337426|gb|ABI68274.1| conserved hypothetical protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 102 Score = 98.3 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 7/80 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P + ++ I +KIKV A P +G AN+A+ LA+ L K +R Sbjct: 19 VKVQPRSSRNQIVG-------EHEGDLKIKVMAPPVEGAANQALQKFLAELFKLPKKDIR 71 Query: 66 MLSKQSSPLKIIYIDKDCKE 85 ++ ++S KI+ I E Sbjct: 72 IVRGETSRHKIVEIRGIEAE 91 >gi|330831217|ref|YP_004394169.1| hypothetical protein B565_3517 [Aeromonas veronii B565] gi|328806353|gb|AEB51552.1| hypothetical protein B565_3517 [Aeromonas veronii B565] Length = 100 Score = 97.9 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 41/90 (45%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 13 LHLVIQPKASRDQIIGLH-------GEELKVAITAPPVDGQANSHLIKFLAKQFKVAKGQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + K + ID K++ + + Sbjct: 66 ITIVRGELGRHKTVAID-SPKQLPQEVSAL 94 >gi|323493577|ref|ZP_08098698.1| hypothetical protein VIBR0546_04984 [Vibrio brasiliensis LMG 20546] gi|323312100|gb|EGA65243.1| hypothetical protein VIBR0546_04984 [Vibrio brasiliensis LMG 20546] Length = 96 Score = 97.5 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 36/90 (40%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I +K+ +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGQH-------GEELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K + I +I +Q Sbjct: 67 ITIEKGELGRHKQVRIS-SPTQIPNEIQAI 95 >gi|39933491|ref|NP_945767.1| hypothetical protein RPA0414 [Rhodopseudomonas palustris CGA009] gi|39647337|emb|CAE25858.1| DUF167 [Rhodopseudomonas palustris CGA009] Length = 112 Score = 97.5 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D +K++V A G+AN+A+ +LAK + + K + Sbjct: 18 VAVRVTPRGGRDDIDGLETLSDGRP--VVKVRVRAIADGGEANRAVTELLAKAVGVPKRN 75 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +R+LS +S K I ID D K++ E L+ Sbjct: 76 VRLLSGATSRQKQIAIDGDPKQLGEALR 103 >gi|59711034|ref|YP_203810.1| hypothetical protein VF_0427 [Vibrio fischeri ES114] gi|59479135|gb|AAW84922.1| conserved protein [Vibrio fischeri ES114] Length = 95 Score = 97.5 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + I + +KI +TA P GKAN ++ +K ++K Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGK 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + + K + I + I +Q Sbjct: 65 ITVEKGELNRHKQVRIH-SPELIPNEIQQL 93 >gi|257095142|ref|YP_003168783.1| hypothetical protein CAP2UW1_3597 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257047666|gb|ACV36854.1| protein of unknown function DUF167 [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 96 Score = 97.5 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 45/90 (50%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + L P AK + IA +K+++TA P G+AN A++ LA++L LS+S+ Sbjct: 14 VTIHLQPGAKANEIAG-------RHGDALKVRITAPPVDGRANAALVDFLAQRLGLSRSA 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + S +S K++ I E L Sbjct: 67 VELKSGLTSRRKVLRISGASAEAVLCLLAE 96 >gi|209966140|ref|YP_002299055.1| hypothetical protein RC1_2875 [Rhodospirillum centenum SW] gi|209959606|gb|ACJ00243.1| conserved hypothetical protein [Rhodospirillum centenum SW] Length = 117 Score = 97.5 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A ++ + +K+ VTA P+ GKAN A++A+LAK L K Sbjct: 23 VRVALRVTPKASRTAVQG--PMDGPEGRTLLKLAVTAVPEDGKANAAVIALLAKHWRLPK 80 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 SS+ ++S + K+++I D ++ + Sbjct: 81 SSMSIVSGGTDRTKVLFIAGDAADLLARI 109 >gi|323702139|ref|ZP_08113806.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans DSM 574] gi|323532826|gb|EGB22698.1| protein of unknown function DUF167 [Desulfotomaculum nigrificans DSM 574] Length = 95 Score = 97.1 bits (241), Expect = 6e-19, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 42/89 (47%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A K+ +A +K+++TA P G AN+A LA+ ++K Sbjct: 11 VVLRVRVQPRAAKNSLAG-------EMEGALKVRLTAPPVDGAANEACCRFLAEVFGVAK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S++ ++S + K++ + + + Sbjct: 64 SNVEIISGHTGRNKVVRVAGIDEARARRV 92 >gi|78188297|ref|YP_378635.1| hypothetical protein Cag_0319 [Chlorobium chlorochromatii CaD3] gi|123580414|sp|Q3ATT3|Y319_CHLCH RecName: Full=UPF0235 protein Cag_0319 gi|78170496|gb|ABB27592.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3] Length = 99 Score = 97.1 bits (241), Expect = 6e-19, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 36/87 (41%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR P + KS ++ +K+ + + P AN+ +LA+ + S Sbjct: 13 IAVRAQPRSSKSMVSG-------EWNGALKVHLQSPPVDDAANEECCRLLARLFQVPPSR 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + +++ SS K + ++ + L Sbjct: 66 VHLVAGHSSRNKRVMVEGVSAAMATEL 92 >gi|163803734|ref|ZP_02197592.1| hypothetical protein 1103602000580_AND4_13653 [Vibrio sp. AND4] gi|159172453|gb|EDP57321.1| hypothetical protein AND4_13653 [Vibrio sp. AND4] Length = 96 Score = 97.1 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 38/93 (40%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 12 VVLRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKVAK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + + K I I ++I ++ Sbjct: 65 GLVHIEKGELGRHKQIRIK-SPEQIPTEIKAIT 96 >gi|319408144|emb|CBI81797.1| conserved hypothetical protein [Bartonella schoenbuchensis R1] Length = 108 Score = 97.1 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E + H+ I++ P+ GKANKA++ L ++ + S Sbjct: 12 LFVRLIPKASMDSIVGVESRDGETQ--HLVIRLRTVPEDGKANKALIKFLGRQWKIPPSY 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K + +E+ + LQ Sbjct: 70 ISLKSGMTSRYKQLRFSGYVEELEQKLQ 97 >gi|197335333|ref|YP_002155183.1| hypothetical protein VFMJ11_0427 [Vibrio fischeri MJ11] gi|197316823|gb|ACH66270.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 95 Score = 97.1 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + I + +KI +TA P GKAN ++ +K ++K Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKIAITAPPVDGKANAHLIKYFSKLFKVAKGK 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + + K + I + I +Q Sbjct: 65 ITVEKGELNRHKQVRIH-SPEIIPNEIQQL 93 >gi|157374368|ref|YP_001472968.1| hypothetical protein Ssed_1229 [Shewanella sediminis HAW-EB3] gi|189038638|sp|A8FSL7|Y1229_SHESH RecName: Full=UPF0235 protein Ssed_1229 gi|157316742|gb|ABV35840.1| protein of unknown function DUF167 [Shewanella sediminis HAW-EB3] Length = 95 Score = 97.1 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +KI +TA P GKAN ++ L+K + K Sbjct: 13 LNLYIQPKASRDQIVGVH-------GEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGD 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + +L Q K I I + I E++ Sbjct: 66 IVILKGQLGRHKQIKIL-SPRLIPEIINAL 94 >gi|78222380|ref|YP_384127.1| hypothetical protein Gmet_1164 [Geobacter metallireducens GS-15] gi|78193635|gb|ABB31402.1| protein of unknown function DUF167 [Geobacter metallireducens GS-15] Length = 102 Score = 97.1 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 22/86 (25%), Positives = 42/86 (48%), Gaps = 7/86 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V + P A K+GI ++ +K+++T+ P +G+AN+ LAK L + KS++ Sbjct: 22 SVHVQPRASKNGICGIQ-------GDAIKLRLTSPPVEGEANRLCTEYLAKLLKVPKSAV 74 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90 +++ S K I + + L Sbjct: 75 TIIAGDKSRHKTIRVSGATAQAVHNL 100 >gi|192362044|ref|YP_001980615.1| hypothetical protein CJA_0091 [Cellvibrio japonicus Ueda107] gi|226734040|sp|B3PFH5|Y091_CELJU RecName: Full=UPF0235 protein CJA_0091 gi|190688209|gb|ACE85887.1| conserved hypothetical protein TIGR00251 [Cellvibrio japonicus Ueda107] Length = 101 Score = 97.1 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +L P A I + +KI++TA P GKAN+ ++ L+K+ + K + Sbjct: 17 LHCQLQPKASGDDIVGVH-------GDRLKIRITAPPVDGKANEYLIKWLSKQFRVPKGN 69 Query: 64 LRMLSKQSSPLKIIYIDKD 82 +++L + K + I Sbjct: 70 IKILQGELGRHKTLGIHAP 88 >gi|258514375|ref|YP_003190597.1| hypothetical protein Dtox_1088 [Desulfotomaculum acetoxidans DSM 771] gi|257778080|gb|ACV61974.1| protein of unknown function DUF167 [Desulfotomaculum acetoxidans DSM 771] Length = 98 Score = 97.1 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 47/87 (54%), Gaps = 8/87 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A K +A L +KI++TA P +G+AN+A+ LAK L ++++ + Sbjct: 14 KVRVQPRASKDQVAGL-------WEDAVKIRLTAPPVEGEANRALCDFLAKHLGVTRAQV 66 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELL 90 +++ Q+ K++ + + + + L Sbjct: 67 DLVTGQTGRNKLVRVSGITAESVLQRL 93 >gi|254172671|ref|ZP_04879346.1| conserved hypothetical protein TIGR00251 [Thermococcus sp. AM4] gi|214033600|gb|EEB74427.1| conserved hypothetical protein TIGR00251 [Thermococcus sp. AM4] Length = 113 Score = 97.1 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 43/87 (49%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P AKK+ I ++ +K++V A P GKANK ++ L+K L + Sbjct: 33 LFVYVQPKAKKNEIEGID-----EWRGRLKVRVKAPPVGGKANKELVKFLSKLLG---AE 84 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ ++S K + + +E+ L Sbjct: 85 VELVRGETSREKDLLVRLSAEEVRRKL 111 >gi|192288849|ref|YP_001989454.1| hypothetical protein Rpal_0418 [Rhodopseudomonas palustris TIE-1] gi|226706140|sp|B3QA92|Y418_RHOPT RecName: Full=UPF0235 protein Rpal_0418 gi|192282598|gb|ACE98978.1| protein of unknown function DUF167 [Rhodopseudomonas palustris TIE-1] Length = 108 Score = 97.1 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 31/88 (35%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + I LE D +K++V A G+AN+A+ +LAK + + K + Sbjct: 14 VAVRVTPRGGRDDIDGLETLSDGRP--VVKVRVRAIADGGEANRAVTELLAKAVGVPKRN 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +R+LS +S K I ID D K++ E L+ Sbjct: 72 VRLLSGATSRQKQIAIDGDPKQLGEALR 99 >gi|170725690|ref|YP_001759716.1| hypothetical protein Swoo_1329 [Shewanella woodyi ATCC 51908] gi|226695928|sp|B1KIX3|Y1329_SHEWM RecName: Full=UPF0235 protein Swoo_1329 gi|169811037|gb|ACA85621.1| protein of unknown function DUF167 [Shewanella woodyi ATCC 51908] Length = 95 Score = 97.1 bits (241), Expect = 8e-19, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +KI +TA P GKAN ++ L+K + K Sbjct: 13 LNLYIQPKASRDQIVGVH-------GEELKIAITAPPVDGKANAHLIKYLSKAFKVPKGD 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + +L + K + + + I E + + Sbjct: 66 INILKGEQGRHKQVKVI-SPRVIPENISSQ 94 >gi|50122551|ref|YP_051718.1| hypothetical protein ECA3630 [Pectobacterium atrosepticum SCRI1043] gi|81644033|sp|Q6D118|Y3630_ERWCT RecName: Full=UPF0235 protein ECA3630 gi|49613077|emb|CAG76528.1| conserved hypothetical protein [Pectobacterium atrosepticum SCRI1043] Length = 96 Score = 96.7 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I ++ E ++ Sbjct: 66 VVIEKGELGRHKQIRITHPQHIPADVAEFIE 96 >gi|85060008|ref|YP_455710.1| hypothetical protein SG2030 [Sodalis glossinidius str. 'morsitans'] gi|123518874|sp|Q2NRC0|Y2030_SODGM RecName: Full=UPF0235 protein SG2030 gi|84780528|dbj|BAE75305.1| conserved hypothetical protein [Sodalis glossinidius str. 'morsitans'] Length = 101 Score = 96.7 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + IA +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 17 LRLYIQPRASRDHIAGAH-------GDEIKVAITAPPVDGQANSHLIRFLAKEFGVAKSR 69 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K + ID+ + I LL Sbjct: 70 VILEKGELGRHKQLRIDQPRQLPEVIARLL 99 >gi|240103723|ref|YP_002960032.1| hypothetical protein TGAM_1666 [Thermococcus gammatolerans EJ3] gi|239911277|gb|ACS34168.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3] Length = 124 Score = 96.7 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 45/87 (51%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 ++V + P AKK+ I ++ +K+KV A P GKANK ++ L+K L + Sbjct: 44 LLVYVQPKAKKNEIEGID-----EWRGRLKVKVKAPPVGGKANKELVKFLSKVLG---AE 95 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ ++S K + + +++ + L Sbjct: 96 VELVRGETSREKDLLVRMSAEDVKKRL 122 >gi|281355950|ref|ZP_06242443.1| protein of unknown function DUF167 [Victivallis vadensis ATCC BAA-548] gi|281317319|gb|EFB01340.1| protein of unknown function DUF167 [Victivallis vadensis ATCC BAA-548] Length = 100 Score = 96.7 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 45/91 (49%), Gaps = 8/91 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C V +R+ P A ++ + + +KI + A P GKAN+ + + A+ L K Sbjct: 16 CLVSLRVQPGASRNAVVGM-------YGDAVKIALQAPPVDGKANQLLCRLFAEWSGLPK 68 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 S++ + S Q+ K++ + +++ +L+ Sbjct: 69 SAVELRSGQTGRSKVLELSGITAEQLKAILE 99 >gi|163867463|ref|YP_001608662.1| hypothetical protein Btr_0184 [Bartonella tribocorum CIP 105476] gi|161017109|emb|CAK00667.1| conserved hypothetical protein [Bartonella tribocorum CIP 105476] Length = 108 Score = 96.7 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP + I +E ++ I++ A P+ GKANKA++ AK+ + SS Sbjct: 12 LFVYLIPKSSVDKIIGIECRDGEKQ--YLVIRLRAVPEDGKANKALIKFFAKQWKIPSSS 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K ++ +E+ ++ Q+ Sbjct: 70 ISLKSGATSRYKQLHFSTHLEELKQIWQS 98 >gi|157372267|ref|YP_001480256.1| hypothetical protein Spro_4033 [Serratia proteamaculans 568] gi|166979954|sp|A8GJ38|Y4033_SERP5 RecName: Full=UPF0235 protein Spro_4033 gi|157324031|gb|ABV43128.1| protein of unknown function DUF167 [Serratia proteamaculans 568] Length = 96 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 39/89 (43%), Gaps = 9/89 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K + Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFKVAKGN 65 Query: 64 LRMLSKQSSPLKIIYIDKDCK--EITELL 90 + + + K + I + ++ L Sbjct: 66 VTIEKGELGRHKQLRIVNPQQIPDVVAAL 94 >gi|282891527|ref|ZP_06300019.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281498618|gb|EFB40945.1| hypothetical protein pah_c178o054 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 93 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++IPNA ++ I E +K+ + + P+KGKAN+A++ LAK L L K Sbjct: 8 LAIKVIPNASRNAILGWE-------NDELKMYIASVPEKGKANEAVIKFLAKFLGLRKQQ 60 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 ++++ +++ KI+ I+ K Sbjct: 61 IQIIRGETNRHKILQIEGIDK 81 >gi|117621083|ref|YP_858117.1| hypothetical protein AHA_3661 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|166232625|sp|A0KPB6|Y3661_AERHH RecName: Full=UPF0235 protein AHA_3661 gi|117562490|gb|ABK39438.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 99 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 41/89 (46%), Gaps = 10/89 (11%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P A + I L +K+ +TA P G+AN ++ LAK+ ++K +R++ Sbjct: 17 IQPKASRDQIVGLH-------GEELKVAITAPPVDGQANSHLIKYLAKQFKVAKGQVRIV 69 Query: 68 SKQSSPLKIIYIDKD---CKEITELLQNN 93 + K + I+ E++ LL N Sbjct: 70 RGELGRHKTVAIEAPRQIPAEVSALLDNQ 98 >gi|323498669|ref|ZP_08103660.1| hypothetical protein VISI1226_18936 [Vibrio sinaloensis DSM 21326] gi|323316269|gb|EGA69289.1| hypothetical protein VISI1226_18936 [Vibrio sinaloensis DSM 21326] Length = 96 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K Sbjct: 14 IRLYIQPKASRDKIVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K ++I +I ++ Sbjct: 67 IDIEKGELGRHKQLWI-CSPAQIPTEIEAI 95 >gi|37523404|ref|NP_926781.1| hypothetical protein glr3835 [Gloeobacter violaceus PCC 7421] gi|47117439|sp|Q7NEP3|Y3835_GLOVI RecName: Full=UPF0235 protein glr3835 gi|35214408|dbj|BAC91776.1| glr3835 [Gloeobacter violaceus PCC 7421] Length = 111 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 36/91 (39%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P A S + + K+++ A P +GKAN +A++A + + Sbjct: 28 LTVWAQPRASCSEVVGWQQ-------NAFKVRLAAPPVEGKANAECVALIAAFFGVPRRQ 80 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + ++ Q K I I+ + LQ Sbjct: 81 VSLVQGQQGRHKKIRIEAPADLLLVALQKLS 111 >gi|271501907|ref|YP_003334933.1| hypothetical protein Dd586_3394 [Dickeya dadantii Ech586] gi|270345462|gb|ACZ78227.1| protein of unknown function DUF167 [Dickeya dadantii Ech586] Length = 96 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGM 65 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + + K I I Sbjct: 66 VTIEKGELGRHKQIRIVNPQ 85 >gi|261822847|ref|YP_003260953.1| hypothetical protein Pecwa_3610 [Pectobacterium wasabiae WPP163] gi|261606860|gb|ACX89346.1| protein of unknown function DUF167 [Pectobacterium wasabiae WPP163] Length = 96 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I +I + ++ Sbjct: 66 VVIEKGELGRHKQIRITHPQHIPADIADFIE 96 >gi|115522078|ref|YP_778989.1| hypothetical protein RPE_0048 [Rhodopseudomonas palustris BisA53] gi|115516025|gb|ABJ04009.1| protein of unknown function DUF167 [Rhodopseudomonas palustris BisA53] Length = 107 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 2/94 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +V +R+ P + I LE + +K++V A + G+AN+A+ +LAK L + K Sbjct: 11 ISVALRVTPRGGRDAIDGLETLANG--RTVVKVRVRAIAEGGEANRAVTELLAKALGVPK 68 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 ++R+LS +S LK I +D + + + L+ + Sbjct: 69 RAVRVLSGTTSRLKQIAVDGNPELLGAALRKLTA 102 >gi|218710632|ref|YP_002418253.1| hypothetical protein VS_2686 [Vibrio splendidus LGP32] gi|218323651|emb|CAV19947.1| Hypothetical protein VS_2686 [Vibrio splendidus LGP32] Length = 96 Score = 96.3 bits (239), Expect = 1e-18, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K + I ++ ++ Sbjct: 67 ITIEKGELGRHKQVRI-CSPSQLPTEVKAI 95 >gi|293394481|ref|ZP_06638777.1| conserved hypothetical protein [Serratia odorifera DSM 4582] gi|291422946|gb|EFE96179.1| conserved hypothetical protein [Serratia odorifera DSM 4582] Length = 97 Score = 96.0 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 19/76 (25%), Positives = 36/76 (47%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P A + I L +K+ +TA P G+AN +L +AK+ ++KS+ Sbjct: 13 VRLYIQPKASRDQIIGLH-------GDEIKVAITAPPVDGQANAHLLKFIAKQFKVAKSN 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VTIEKGELGRHKQLRI 81 >gi|13473522|ref|NP_105090.1| hypothetical protein msl4154 [Mesorhizobium loti MAFF303099] gi|29839606|sp|Q98EP2|Y4154_RHILO RecName: Full=UPF0235 protein msl4154 gi|14024272|dbj|BAB50876.1| msl4154 [Mesorhizobium loti MAFF303099] Length = 102 Score = 96.0 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 2/92 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ VRL P + + +E D H+K +V A P+ G AN+A+ ++AK L + Sbjct: 12 IDLFVRLTPKSSLDRLEGVETSADG--RSHLKARVRAVPENGAANQALERLVAKTLGVPA 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SS+ +++ +S LK + I D + + + ++ Sbjct: 70 SSVSVVAGGTSRLKTVRIVGDPEALAQRVEAL 101 >gi|23015143|ref|ZP_00054928.1| COG1872: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] Length = 94 Score = 96.0 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 27/86 (31%), Positives = 44/86 (51%), Gaps = 2/86 (2%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VRL P A + I D S + +K +VT P+ GKAN A+L +L+K + KS + Sbjct: 2 AVRLTPKASRDRIMGAAPEADGS--VVLKAQVTTVPEDGKANAALLKLLSKAWKIPKSDM 59 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90 ++ + K+I I D + + + L Sbjct: 60 DIVLGATDRRKVILISGDSEVLRKRL 85 >gi|95930347|ref|ZP_01313084.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684] gi|95133599|gb|EAT15261.1| conserved hypothetical protein [Desulfuromonas acetoxidans DSM 684] Length = 103 Score = 96.0 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A K+ + L+ +K+++T+ P +G ANK AK L +SK Sbjct: 17 VVIALFVQPRASKNSLCGLQ-------GEELKVRLTSPPVEGAANKLCCTFFAKLLGVSK 69 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 SS+ ++ S K I ++ E+ + L Sbjct: 70 SSVTLIRGDKSRHKQIVVEGVSLDEVKQRLAK 101 >gi|242238224|ref|YP_002986405.1| hypothetical protein Dd703_0772 [Dickeya dadantii Ech703] gi|242130281|gb|ACS84583.1| protein of unknown function DUF167 [Dickeya dadantii Ech703] Length = 99 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 35/80 (43%), Gaps = 6/80 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH------GNDEVKVAITAPPVDGQANAHLIQFMAKQFRVAKSR 66 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + + K + I Sbjct: 67 VTIEKGELGRHKQLRIHSPQ 86 >gi|153835271|ref|ZP_01987938.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|156975836|ref|YP_001446743.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116] gi|269960447|ref|ZP_06174820.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|166232611|sp|A7MZ92|Y3581_VIBHB RecName: Full=UPF0235 protein VIBHAR_03581 gi|148868246|gb|EDL67386.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|156527430|gb|ABU72516.1| hypothetical protein VIBHAR_03581 [Vibrio harveyi ATCC BAA-1116] gi|269834874|gb|EEZ88960.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 96 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 38/92 (41%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 12 VVLRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKVAK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K I I+ +I ++ Sbjct: 65 GLVHIEKGELGRHKQIRIE-SPVQIPTEIKAI 95 >gi|332172191|gb|AEE21445.1| protein of unknown function DUF167 [Glaciecola agarilytica 4H-3-7+YE-5] Length = 98 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 39/93 (41%), Gaps = 10/93 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P A + I + +KI +TA P GKAN + LAK+ ++KS Sbjct: 13 QLRIYIQPKAARDEIVGMH-------GDALKIAITAPPVDGKANAHLCKYLAKQCGVAKS 65 Query: 63 SLRMLSKQSSPLKIIYIDKD---CKEITELLQN 92 + + Q + K + + + I LL Sbjct: 66 KVAITKGQLNRHKTVVVCAPGVIPEAIQALLNE 98 >gi|227112386|ref|ZP_03826042.1| hypothetical protein PcarbP_05447 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] gi|253689815|ref|YP_003019005.1| hypothetical protein PC1_3453 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|259646966|sp|C6DE33|Y3453_PECCP RecName: Full=UPF0235 protein PC1_3453 gi|251756393|gb|ACT14469.1| protein of unknown function DUF167 [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 96 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I Sbjct: 66 VVIEKGELGRHKQIRIT 82 >gi|109896736|ref|YP_659991.1| hypothetical protein Patl_0407 [Pseudoalteromonas atlantica T6c] gi|109699017|gb|ABG38937.1| conserved hypothetical protein [Pseudoalteromonas atlantica T6c] Length = 90 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 37/81 (45%), Gaps = 7/81 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ + P A + + L +K+ +TA P GKAN ++ LAK+ ++K Sbjct: 1 MHLRIYTQPKASRDEVVGLH-------GDELKVAITAPPVDGKANTHLIKYLAKQCGVAK 53 Query: 62 SSLRMLSKQSSPLKIIYIDKD 82 S + + Q + K + I K Sbjct: 54 SKVVITKGQLNRHKTVLISKP 74 >gi|311278130|ref|YP_003940361.1| hypothetical protein Entcl_0802 [Enterobacter cloacae SCF1] gi|308747325|gb|ADO47077.1| protein of unknown function DUF167 [Enterobacter cloacae SCF1] Length = 96 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSLVGLH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VIIEKGELGRHKQVRI 81 >gi|295691445|ref|YP_003595138.1| hypothetical protein Cseg_4109 [Caulobacter segnis ATCC 21756] gi|295433348|gb|ADG12520.1| protein of unknown function DUF167 [Caulobacter segnis ATCC 21756] Length = 94 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 46/88 (52%), Gaps = 3/88 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P + + D ++K++V + P G AN A++A LAK L + +S+ Sbjct: 5 LAIRLTPRGGRDAVEGW--ALDPEGRPYLKVRVASPPVDGAANAALIAFLAKSLKIPRSA 62 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 +R+ S +++ +K + ID D + T Sbjct: 63 VRLASGETARIKRLEIDDVDQADFTRAF 90 >gi|218672707|ref|ZP_03522376.1| hypothetical protein RetlG_14247 [Rhizobium etli GR56] Length = 89 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 28/79 (35%), Positives = 50/79 (63%), Gaps = 2/79 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VRL PN + +E ++K +VTA P+KGKANKA++A+++K L ++KS Sbjct: 13 RLTVRLTPNGGRDAFDGIETDSQGET--YLKARVTAVPEKGKANKALIALVSKSLGVAKS 70 Query: 63 SLRMLSKQSSPLKIIYIDK 81 S+ ++S +++ KI+ I+ Sbjct: 71 SVSLVSGETARKKILRIEG 89 >gi|118578470|ref|YP_899720.1| hypothetical protein Ppro_0022 [Pelobacter propionicus DSM 2379] gi|118501180|gb|ABK97662.1| protein of unknown function DUF167 [Pelobacter propionicus DSM 2379] Length = 103 Score = 96.0 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 8/91 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A K+ I L +K+++T+ P G ANK LA L + K Sbjct: 16 VVLNLYIQPRASKNEICGLV-------DNSLKLRLTSPPVDGAANKLCREFLADLLHVPK 68 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQ 91 S++ ++S ++S K + I D I+ +Q Sbjct: 69 SAVEIISGETSRHKRVRIATADSDLISSRIQ 99 >gi|262393226|ref|YP_003285080.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25] gi|262336820|gb|ACY50615.1| hypothetical protein VEA_002453 [Vibrio sp. Ex25] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K I I+ +I ++ Sbjct: 67 VHIEKGELGRHKQIRIE-SPTQIPTEIKAI 95 >gi|257465218|ref|ZP_05629589.1| hypothetical protein AM202_01815 [Actinobacillus minor 202] gi|257450878|gb|EEV24921.1| hypothetical protein AM202_01815 [Actinobacillus minor 202] Length = 100 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +TA P G AN +L L+K + K Sbjct: 14 IRLRIFLQPKASRDQIVGLH-------DEELKIAITAPPVDGAANAHLLKFLSKLFKVPK 66 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SS+ + + K I+I + K+I + ++N Sbjct: 67 SSIALEKGELQRHKQIFIP-EPKQIPQEIENL 97 >gi|291286027|ref|YP_003502843.1| hypothetical protein Dacet_0081 [Denitrovibrio acetiphilus DSM 12809] gi|290883187|gb|ADD66887.1| protein of unknown function DUF167 [Denitrovibrio acetiphilus DSM 12809] Length = 84 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 50/91 (54%), Gaps = 7/91 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P AKK+ ++ + +KI+V A P +G AN+ ++ L+K+L +SK Sbjct: 1 MKLSVYVQPGAKKTELSGMH-------DGKIKIRVCAPPVEGAANEVLVKFLSKQLKISK 53 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S ++++S + S KI+ I+ D ++ L Sbjct: 54 SGIKIISGEKSRHKIVEINMDTLDVMNCLSK 84 >gi|269965742|ref|ZP_06179839.1| conserved hypothetical protein [Vibrio alginolyticus 40B] gi|269829610|gb|EEZ83847.1| conserved hypothetical protein [Vibrio alginolyticus 40B] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K + I+ +I ++ Sbjct: 67 VHIEKGELGRHKQVRIE-SPTQIPTEIKAI 95 >gi|27364893|ref|NP_760421.1| hypothetical protein VV1_1522 [Vibrio vulnificus CMCP6] gi|29839706|sp|Q8DCB7|Y1522_VIBVU RecName: Full=UPF0235 protein VV1_1522 gi|27361038|gb|AAO09948.1| UPF0235 protein [Vibrio vulnificus CMCP6] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I L +KI +TA P GKAN + +L K ++K Sbjct: 12 VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90 S + + + K + + E+ +L Sbjct: 65 SLVTIEKGELGRHKQVRVHTPQQIPDEVKAIL 96 >gi|56476391|ref|YP_157980.1| hypothetical protein ebA1762 [Aromatoleum aromaticum EbN1] gi|81358142|sp|Q5P6I2|Y954_AZOSE RecName: Full=UPF0235 protein AZOSEA09540 gi|56312434|emb|CAI07079.1| conserved hypothetical protein [Aromatoleum aromaticum EbN1] Length = 97 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 41/90 (45%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AKK+ MK+++ A P GKAN A+ LA + +S+ Sbjct: 14 LSLHVQPGAKKTEFVG-------PHGEAMKLRLAAPPVDGKANAALTVFLAAFCGVGRSA 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + +LS ++S K + I+ E L+ Sbjct: 67 VSLLSGETSRAKRVRIEGAGSEALARLRAL 96 >gi|294634379|ref|ZP_06712916.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685] gi|291092187|gb|EFE24748.1| putative cytoplasmic protein [Edwardsiella tarda ATCC 23685] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LRLYIQPKASRDQIVGLH-------GEELKVAITAPPVDGQANAHLIKFIAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K + I + + L+ Sbjct: 66 ITIEKGELGRHKQLRIHQPQQIPDVVAAALE 96 >gi|227325979|ref|ZP_03830003.1| hypothetical protein PcarcW_01116 [Pectobacterium carotovorum subsp. carotovorum WPP14] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLTKFLAKQFRVAKSL 65 Query: 64 LRMLSKQSSPLKIIYID 80 + + + K I I Sbjct: 66 VVIEKGELGRHKQIRIT 82 >gi|226942094|ref|YP_002797168.1| hypothetical protein LHK_03181 [Laribacter hongkongensis HLHK9] gi|254801627|sp|C1D6C4|Y3181_LARHH RecName: Full=UPF0235 protein LHK_03181 gi|226717021|gb|ACO76159.1| Uncharacterized conserved protein [Laribacter hongkongensis HLHK9] Length = 97 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A+++ +A L +KI++ A P GKAN +LA LA+ L +S+ Sbjct: 11 VRLTLHVQPGARRTEVAGLH-------GDALKIRLAAPPVDGKANACLLAFLARGLGVSR 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S++ + S S K++ I E L Sbjct: 64 SAVTLKSGDCSRHKVVDIRGITPEAAAGL 92 >gi|260767468|ref|ZP_05876405.1| hypothetical protein VFA_000519 [Vibrio furnissii CIP 102972] gi|260617580|gb|EEX42762.1| hypothetical protein VFA_000519 [Vibrio furnissii CIP 102972] gi|315181248|gb|ADT88162.1| hypothetical protein vfu_A03054 [Vibrio furnissii NCTC 11218] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQCKVAKGL 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K I I +I +Q Sbjct: 66 IDIEKGELGRHKQIRIH-TPAQIPPEVQAI 94 >gi|329890913|ref|ZP_08269256.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568] gi|328846214|gb|EGF95778.1| hypothetical protein BDIM_26220 [Brevundimonas diminuta ATCC 11568] Length = 93 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 27/92 (29%), Positives = 43/92 (46%), Gaps = 3/92 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+L P A I ++ D +K++V A P +G+AN A+ A LAK L + K Sbjct: 3 ARIPVKLTPRASADRIDGWDVDPDGRP--VLKVRVRAQPVEGEANAALTAFLAKALGVPK 60 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + S LK+I +D E+ L Sbjct: 61 RDVALARGGQSRLKMIEVDGLTDAEVRARLPA 92 >gi|240949737|ref|ZP_04754069.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305] gi|240295769|gb|EER46456.1| hypothetical protein AM305_00329 [Actinobacillus minor NM305] Length = 100 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 42/92 (45%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +TA P G AN +L L+K + K Sbjct: 14 IRLRIFLQPKASRDQIVGLH-------DEELKIAITAPPVDGAANAHLLKFLSKLFKVPK 66 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SS+ + + K I+I + K+I + ++N Sbjct: 67 SSIALEKGELQRHKQIFIP-EPKQIPQEIENL 97 >gi|299139441|ref|ZP_07032616.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8] gi|298598710|gb|EFI54873.1| protein of unknown function DUF167 [Acidobacterium sp. MP5ACTX8] Length = 99 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 45/90 (50%), Gaps = 8/90 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C++ VR+ P AK++ + +KI +T P G+AN A++A L+ +L + + Sbjct: 12 CSLPVRVHPGAKQNAVTGTH-------DGSLKISLTTPPTDGRANTALIAFLSDRLNIPR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + + +L+ +S K + I E+ L Sbjct: 65 AHIELLTGATSRSKTLRIAGLTSAEVEARL 94 >gi|123443630|ref|YP_001007602.1| hypothetical protein YE3436 [Yersinia enterocolitica subsp. enterocolitica 8081] gi|166232591|sp|A1JPU6|Y3436_YERE8 RecName: Full=UPF0235 protein YE3436 gi|122090591|emb|CAL13460.1| conserved hypothetical protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I E+ LL+ Sbjct: 66 VIIEKGELGRHKQIKIVNPQQIPPEVATLLE 96 >gi|52424377|ref|YP_087514.1| hypothetical protein MS0322 [Mannheimia succiniciproducens MBEL55E] gi|81387574|sp|Q65VT1|Y322_MANSM RecName: Full=UPF0235 protein MS0322 gi|52306429|gb|AAU36929.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 95 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 41/89 (46%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A + I + +KI +TA P G AN +L L+K + KS Sbjct: 12 RLRIFLQPKASRDKIIGIH-------DDELKIAITAPPVDGAANAHLLKYLSKAFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 ++ + + + K ++I + K I E LQ Sbjct: 65 AIILEKGELNRHKQLFIP-EPKLIPEELQ 92 >gi|152979630|ref|YP_001345259.1| hypothetical protein Asuc_1977 [Actinobacillus succinogenes 130Z] gi|171704390|sp|A6VQS7|Y1977_ACTSZ RecName: Full=UPF0235 protein Asuc_1977 gi|150841353|gb|ABR75324.1| protein of unknown function DUF167 [Actinobacillus succinogenes 130Z] Length = 98 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 10/94 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A K I L +KI +TA P G AN ++ L+K + K Sbjct: 11 IRLRIMLQPKASKDAIIGLH-------DEELKISITAPPVDGAANAHLIKYLSKAFKVPK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELLQN 92 S++++ + + K ++I + + +LL N Sbjct: 64 SAVQLEKGELNRHKQVFIPAPKIIPEAVRQLLDN 97 >gi|261250161|ref|ZP_05942737.1| hypothetical protein VIA_000181 [Vibrio orientalis CIP 102891] gi|260939277|gb|EEX95263.1| hypothetical protein VIA_000181 [Vibrio orientalis CIP 102891] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGQH-------GEELKIAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K + I +I + ++ Sbjct: 67 ITIEKGELGRHKQVRIQ-SPVQIPQEIKAI 95 >gi|77918231|ref|YP_356046.1| hypothetical protein Pcar_0617 [Pelobacter carbinolicus DSM 2380] gi|123574815|sp|Q3A6Y1|Y617_PELCD RecName: Full=UPF0235 protein Pcar_0617 gi|77544314|gb|ABA87876.1| conserved hypothetical protein TIGR00251 [Pelobacter carbinolicus DSM 2380] Length = 95 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 8/90 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P A ++ +A L+ +KI++T+ P +G ANK LAK L ++K Sbjct: 12 VVLSVHVQPRASRNELAGLQ-------GESLKIRLTSPPVEGAANKLCREFLAKLLGVAK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELL 90 S + ++S S K + I+ E+ L Sbjct: 65 SRVTLVSGDKSRHKRLLIEGVTLDEVRNKL 94 >gi|161486600|ref|NP_935670.2| hypothetical protein VV2877 [Vibrio vulnificus YJ016] gi|47117406|sp|Q7MHJ2|Y2877_VIBVY RecName: Full=UPF0235 protein VV2877 Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I L +KI +TA P GKAN + +L K ++K Sbjct: 12 VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90 S + + + K + + E+ +L Sbjct: 65 SLVTIEKGELGRHKQVRVHTPQQIPDEVKAIL 96 >gi|91228690|ref|ZP_01262604.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01] gi|91187761|gb|EAS74079.1| hypothetical protein V12G01_14044 [Vibrio alginolyticus 12G01] Length = 96 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K I I+ +I ++ Sbjct: 67 VHIEKGELGRHKQIRIE-SPTQIPTEIKAI 95 >gi|330444266|ref|YP_004377252.1| hypothetical protein G5S_0576 [Chlamydophila pecorum E58] gi|328807376|gb|AEB41549.1| conserved hypothetical protein [Chlamydophila pecorum E58] Length = 100 Score = 95.6 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 53/90 (58%), Gaps = 10/90 (11%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 +C + V++ P AK++ I + +K++VT P+KGKAN+A++++LAK L + Sbjct: 16 LCILEVQVTPKAKENKIVGFQ-------GEVLKVRVTEPPEKGKANEAVVSLLAKALGIP 68 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 K + ++S +SS K + I K++ E L Sbjct: 69 KRDVTLVSGESSRKKKLMI---PKKVQEKL 95 >gi|296133363|ref|YP_003640610.1| protein of unknown function DUF167 [Thermincola sp. JR] gi|296031941|gb|ADG82709.1| protein of unknown function DUF167 [Thermincola potens JR] Length = 106 Score = 95.2 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 20/85 (23%), Positives = 45/85 (52%), Gaps = 7/85 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 +++ P A K+ + ++ +K+K+TA P +G AN+A + A+ +++KS + Sbjct: 14 KIKVQPKASKNELKGVQ-------GDSLKVKLTAPPVEGAANEACIRFFAELFSVAKSQV 66 Query: 65 RMLSKQSSPLKIIYIDKDCKEITEL 89 +++ +S K++ + KE E Sbjct: 67 EIITGHTSRTKLLKVKGLTKEEAEK 91 >gi|193212041|ref|YP_001997994.1| hypothetical protein Cpar_0370 [Chlorobaculum parvum NCIB 8327] gi|226705834|sp|B3QL06|Y370_CHLP8 RecName: Full=UPF0235 protein Cpar_0370 gi|193085518|gb|ACF10794.1| protein of unknown function DUF167 [Chlorobaculum parvum NCIB 8327] Length = 105 Score = 95.2 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 45/89 (50%), Gaps = 8/89 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P + K+GIA +KI + + P ANK +LAK L + +S+ Sbjct: 13 LSVRVQPRSSKTGIAG-------RYGDQVKICLKSAPVDNAANKECCQLLAKTLGVPRSN 65 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + +++ Q+S K++ ++ E+ + L Sbjct: 66 VSVMNGQTSRSKVLKVEGMTPSELRKALA 94 >gi|332162814|ref|YP_004299391.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|318604345|emb|CBY25843.1| upf0235 protein VC0458 [Yersinia enterocolitica subsp. palearctica Y11] gi|325667044|gb|ADZ43688.1| hypothetical protein YE105_C3194 [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|330859004|emb|CBX69362.1| UPF0235 protein YE3436 [Yersinia enterocolitica W22703] Length = 96 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I E+ LL+ Sbjct: 66 VIIEKGELGRHKQIKIVNPQQIPPEVAALLE 96 >gi|251788394|ref|YP_003003115.1| hypothetical protein Dd1591_0757 [Dickeya zeae Ech1591] gi|247537015|gb|ACT05636.1| protein of unknown function DUF167 [Dickeya zeae Ech1591] Length = 100 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++K Sbjct: 17 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLAKQFRVAKGM 69 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + + K I I Sbjct: 70 VTIEKGELGRHKQIRIVNPQ 89 >gi|320155276|ref|YP_004187655.1| osmotic shock response integral membrane protein YggT [Vibrio vulnificus MO6-24/O] gi|319930588|gb|ADV85452.1| integral membrane protein YggT, involved in response to extracytoplasmic stress (osmotic shock) [Vibrio vulnificus MO6-24/O] Length = 96 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I L +KI +TA P GKAN + +L K ++K Sbjct: 12 VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90 S + + + K + + E+ +L Sbjct: 65 SLVTIEKGELGRHKQVRVHAPQQIPDEVKAIL 96 >gi|238752326|ref|ZP_04613805.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380] gi|238709487|gb|EEQ01726.1| hypothetical protein yrohd0001_19180 [Yersinia rohdei ATCC 43380] Length = 90 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFKVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K + I E+ LL+ Sbjct: 60 VIIEKGELGRHKQLKIVNPQQIPPEVAALLE 90 >gi|270263061|ref|ZP_06191331.1| threonine dehydratase [Serratia odorifera 4Rx13] gi|270042749|gb|EFA15843.1| threonine dehydratase [Serratia odorifera 4Rx13] Length = 96 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 41/90 (45%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS+ Sbjct: 13 IRLYIQPKASRDQIIGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFKVAKSN 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K + I ++I ++ Sbjct: 66 VTIEKGELGRHKQLRIVN-PQQIPAVVAAL 94 >gi|254784473|ref|YP_003071901.1| hypothetical protein TERTU_0220 [Teredinibacter turnerae T7901] gi|237683524|gb|ACR10788.1| conserved hypothetical protein [Teredinibacter turnerae T7901] Length = 99 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 8/81 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 L P A A L+ +KI++TA P GKAN ++ LA++ ++KS + ++ Sbjct: 18 LQPKASSDAFAGLQA-------DRLKIRITAPPTDGKANAHLVKYLARQFGVAKSDIEIV 70 Query: 68 SKQSSPLKIIYIDKDCKEITE 88 Q S K + I K I Sbjct: 71 RGQLSRQKTLQI-NHPKHIPA 90 >gi|238795055|ref|ZP_04638648.1| hypothetical protein yinte0001_4160 [Yersinia intermedia ATCC 29909] gi|238725603|gb|EEQ17164.1| hypothetical protein yinte0001_4160 [Yersinia intermedia ATCC 29909] Length = 90 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLIKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I E+ LL+ Sbjct: 60 VVIEKGELGRHKQIKIVNPQQIPPEVAALLE 90 >gi|295106861|emb|CBL04404.1| Uncharacterized conserved protein [Gordonibacter pamelaeae 7-10-1-b] Length = 96 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 46/92 (50%), Gaps = 2/92 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P + + ++ + + + ++VTA P GKANKA+ ++A+ L + K Sbjct: 4 AIIAVHVTPRSGRDEVSGVRADAAGA--DEVCVRVTAPPDGGKANKAVCKLVAEALGVPK 61 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S + + S ++ K + ++ D ++ L + Sbjct: 62 SRVGVASGHTARRKRLSVEADQAQVDAWLASL 93 >gi|308050656|ref|YP_003914222.1| hypothetical protein Fbal_2946 [Ferrimonas balearica DSM 9799] gi|307632846|gb|ADN77148.1| protein of unknown function DUF167 [Ferrimonas balearica DSM 9799] Length = 96 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L K+ +TA P GKAN ++ LAK+ ++K Sbjct: 14 LKLYIQPKASRDQLVGLH-------GEEFKVAITAPPVDGKANAHLVKFLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + ++ + K + I + +LL Sbjct: 67 ISIVKGELGRHKQLKIQSPTVIPDPLAQLL 96 >gi|269137631|ref|YP_003294331.1| hypothetical protein ETAE_0273 [Edwardsiella tarda EIB202] gi|267983291|gb|ACY83120.1| hypothetical protein ETAE_0273 [Edwardsiella tarda EIB202] gi|304557696|gb|ADM40360.1| hypothetical protein ETAF_0236 [Edwardsiella tarda FL6-60] Length = 96 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN +L +AK+ ++KS Sbjct: 13 LRLYIQPKASRDLIIGLH-------GDELKVAITAPPVDGQANAHLLKFIAKQFRVAKSR 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K + I + + L+ Sbjct: 66 ITLEKGELGRHKQLRISQPQQIPDAVAAALE 96 >gi|307132433|ref|YP_003884449.1| hypothetical protein Dda3937_02598 [Dickeya dadantii 3937] gi|306529962|gb|ADM99892.1| conserved protein [Dickeya dadantii 3937] Length = 100 Score = 95.2 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LA++ ++K Sbjct: 17 IRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFLARQFRVAKGM 69 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I ++ EL+ Sbjct: 70 VTIEKGELGRHKQIRIVNPQAIPADVAELIS 100 >gi|288959535|ref|YP_003449876.1| hypothetical protein AZL_026940 [Azospirillum sp. B510] gi|288911843|dbj|BAI73332.1| hypothetical protein AZL_026940 [Azospirillum sp. B510] Length = 113 Score = 94.8 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 47/89 (52%), Gaps = 2/89 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P A ++ + + +K+ VTA P+ GKAN+A++ +L+K + K+ Sbjct: 15 RVALRVTPKASRNAVTGMADTAAG--GRVLKLAVTAVPENGKANEAVIKLLSKAWKVPKT 72 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 SL +++ + KI+++ D + L Sbjct: 73 SLTVVAGATDRNKILHVAGDPAALLARLS 101 >gi|322418850|ref|YP_004198073.1| hypothetical protein GM18_1329 [Geobacter sp. M18] gi|320125237|gb|ADW12797.1| protein of unknown function DUF167 [Geobacter sp. M18] Length = 104 Score = 94.8 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 42/88 (47%), Gaps = 8/88 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V + P A +S I ++I++T+ P ANK + ++AK L L+KS + Sbjct: 18 TVHVQPRASRSEICG-------PKDGELRIRLTSPPVDDAANKQCVELIAKSLGLAKSKV 70 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + S S K++ ++ D ++ L + Sbjct: 71 SIKSGAKSRHKVVRVEGVDQDDLLRLFK 98 >gi|284046655|ref|YP_003396995.1| hypothetical protein Cwoe_5214 [Conexibacter woesei DSM 14684] gi|283950876|gb|ADB53620.1| protein of unknown function DUF167 [Conexibacter woesei DSM 14684] Length = 88 Score = 94.8 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 46/81 (56%), Gaps = 7/81 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M ++ VRL P AK++ I + ++VTA P GKAN A+ +LAK L ++ Sbjct: 1 MGDLRVRLQPRAKRNEIVG-------ERDGALVVRVTAPPVDGKANAALCRLLAKALGVA 53 Query: 61 KSSLRMLSKQSSPLKIIYIDK 81 S++ ++ QS+ K++++D Sbjct: 54 PSTVTVVRGQSARDKVVHVDA 74 >gi|238763264|ref|ZP_04624229.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC 33638] gi|238698537|gb|EEP91289.1| hypothetical protein ykris0001_28410 [Yersinia kristensenii ATCC 33638] Length = 90 Score = 94.8 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I E+ LL+ Sbjct: 60 VIIEKGELGRHKQIKIVNPQQIPPEVAALLE 90 >gi|328474077|gb|EGF44882.1| hypothetical protein VP10329_15255 [Vibrio parahaemolyticus 10329] Length = 96 Score = 94.8 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K I I+ E+ +L Sbjct: 67 VHIEKGELGRHKQIRIESPVQIPAEVKAIL 96 >gi|317493545|ref|ZP_07951966.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316918488|gb|EFV39826.1| hypothetical protein HMPREF0864_02731 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 99 Score = 94.8 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 41/92 (44%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + +AK+ ++KS Sbjct: 15 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLQKFIAKQFRVAKSQ 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + + K + I ++I E++ S Sbjct: 68 VVIEKGELGRHKQVRIS-QPQQIPEVVSALRS 98 >gi|148980491|ref|ZP_01816088.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3] gi|145961216|gb|EDK26530.1| hypothetical protein VSWAT3_21090 [Vibrionales bacterium SWAT-3] Length = 96 Score = 94.8 bits (235), Expect = 3e-18, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ + K + I +I ++ Sbjct: 67 IKIEKGELGRHKQVRI-CSPSQIPTEVKAI 95 >gi|42522071|ref|NP_967451.1| hypothetical protein Bd0463 [Bdellovibrio bacteriovorus HD100] gi|39574602|emb|CAE78444.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 94 Score = 94.8 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 8/91 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P + K+ + +KIK+TA P GKAN+ ++ L+ + K Sbjct: 9 VRLHLFIQPKSSKNEVVG-------PHNGEIKIKLTAPPVDGKANECLIEFLSDLFDIPK 61 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + ++ ++ K++ + D ++ E L+ Sbjct: 62 RDVHLIKGETGRHKVVELAGLDVEKTREALR 92 >gi|90412037|ref|ZP_01220044.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK] gi|90327015|gb|EAS43394.1| hypothetical protein P3TCK_24666 [Photobacterium profundum 3TCK] Length = 97 Score = 94.8 bits (235), Expect = 4e-18, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++KS Sbjct: 15 IRLYIQPKASRDQIVGLH-------GEELKIAITAPPVDGKANAHLSKFLAKQFRVAKSQ 67 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + K + I+ I ELL Sbjct: 68 VLIEKGMQGRHKQVRIESPREIPPVIAELL 97 >gi|261342370|ref|ZP_05970228.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316] gi|288315005|gb|EFC53943.1| putative cytoplasmic protein [Enterobacter cancerogenus ATCC 35316] Length = 98 Score = 94.4 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85 >gi|37199811|dbj|BAC95641.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 101 Score = 94.4 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 37/92 (40%), Gaps = 10/92 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I L +KI +TA P GKAN + +L K ++K Sbjct: 17 VVLRLYIQPKASRDKILGLH-------GDELKIAITAPPVDGKANGHLTKLLGKWFKVAK 69 Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90 S + + + K + + E+ +L Sbjct: 70 SLVTIEKGELGRHKQVRVHTPQQIPDEVKAIL 101 >gi|86146416|ref|ZP_01064740.1| hypothetical protein MED222_22586 [Vibrio sp. MED222] gi|85835895|gb|EAQ54029.1| hypothetical protein MED222_22586 [Vibrio sp. MED222] Length = 96 Score = 94.4 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ + K + I +I ++ Sbjct: 67 IKIEKGELGRHKQVRI-CSPSQIPTEVKAI 95 >gi|295097492|emb|CBK86582.1| conserved hypothetical protein TIGR00251 [Enterobacter cloacae subsp. cloacae NCTC 9394] Length = 98 Score = 94.4 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85 >gi|238754614|ref|ZP_04615968.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473] gi|238707245|gb|EEP99608.1| hypothetical protein yruck0001_4830 [Yersinia ruckeri ATCC 29473] Length = 93 Score = 94.4 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 10 IRLYIQPKASRDQIIGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSH 62 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + K I I ++ LL+ Sbjct: 63 VIIEKGDLGRHKQIKIINPQQIPPQVAALLE 93 >gi|238918244|ref|YP_002931758.1| hypothetical protein NT01EI_0281 [Edwardsiella ictaluri 93-146] gi|259646912|sp|C5BCR7|Y281_EDWI9 RecName: Full=UPF0235 protein NT01EI_0281 gi|238867812|gb|ACR67523.1| conserved hypothetical protein [Edwardsiella ictaluri 93-146] Length = 96 Score = 94.4 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 40/89 (44%), Gaps = 8/89 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN +L +AK+ ++KS Sbjct: 13 LRLYIQPKASRDLIIGLH-------GDELKVAITAPPVDGQANAHLLKFIAKQFRVAKSR 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + + K + I ++I + + Sbjct: 66 ITLEKGELGRHKQLRIS-QPQQIPDAVAA 93 >gi|289208238|ref|YP_003460304.1| hypothetical protein TK90_1056 [Thioalkalivibrio sp. K90mix] gi|288943869|gb|ADC71568.1| protein of unknown function DUF167 [Thioalkalivibrio sp. K90mix] Length = 85 Score = 94.4 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 46/92 (50%), Gaps = 8/92 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + VR+ P +K+ I +K++V A P+KG+AN+A+ A+LAK L Sbjct: 1 MARLRVRVAPGSKRDAIGPW-------MGDILKLRVQAPPEKGRANEAVCALLAKALGCP 53 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCK-EITELLQ 91 + +++ ++ K + I+ + ++ L Sbjct: 54 ARDVSVVAGATARDKTVAIEGYSEADLRRALS 85 >gi|54310243|ref|YP_131263.1| hypothetical protein PBPRA3146 [Photobacterium profundum SS9] gi|46914684|emb|CAG21461.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 101 Score = 94.4 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 37/90 (41%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 19 IRLYIQPKASRDQIVGLH-------GEELKIAITAPPVDGKANAHLSKFLAKQFRVAKGQ 71 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + K + I+ I ELL Sbjct: 72 VLIEKGMQGRHKQVRIESPREIPPVIAELL 101 >gi|269837719|ref|YP_003319947.1| hypothetical protein Sthe_1691 [Sphaerobacter thermophilus DSM 20745] gi|269786982|gb|ACZ39125.1| protein of unknown function DUF167 [Sphaerobacter thermophilus DSM 20745] Length = 102 Score = 94.4 bits (234), Expect = 5e-18, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 41/92 (44%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V VR+ P A ++ + + +++++ A P +G AN+A+ LA L L K Sbjct: 14 TQVTVRVTPRASRTQVDGV-------ADGALRVRLAAPPVEGAANRALTEFLANLLRLPK 66 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + +++ K + + +++E L Sbjct: 67 RDVELVAGARGRQKTVLLRGLTPADVSERLTA 98 >gi|325577506|ref|ZP_08147868.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae ATCC 33392] gi|325160610|gb|EGC72734.1| hypothetical protein HMPREF9417_0609 [Haemophilus parainfluenzae ATCC 33392] Length = 95 Score = 94.0 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLKIILQPKASKDQIVGLH-------DDELKITITAPPVDGQANAHLLKFLSKTFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S+ + + + K ++I K I +QN Sbjct: 65 SIVLEKGELNRHKQVWIP-SPKLIPSEIQNL 94 >gi|322834241|ref|YP_004214268.1| hypothetical protein Rahaq_3549 [Rahnella sp. Y9602] gi|321169442|gb|ADW75141.1| protein of unknown function DUF167 [Rahnella sp. Y9602] Length = 101 Score = 94.0 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 17 LRLVIQPKASRDSLVGLH-------GDELKVAITAPPVDGQANTHLVKFLAKQFKVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYID---KDCKEITELLQN 92 + + + K + I E+ LL Sbjct: 70 VSIEKGELGRHKQVRITHPQNIPTEVAVLLAE 101 >gi|33151941|ref|NP_873294.1| hypothetical protein HD0778 [Haemophilus ducreyi 35000HP] gi|47117473|sp|Q7VN15|Y778_HAEDU RecName: Full=UPF0235 protein HD_0778 gi|33148163|gb|AAP95683.1| conserved hypothetical protein [Haemophilus ducreyi 35000HP] Length = 97 Score = 94.0 bits (233), Expect = 6e-18, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 39/92 (42%), Gaps = 10/92 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +K+ +TA P G AN +L L+K + K Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKVAITAPPVDGAANAYLLKYLSKLFKVPK 65 Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELL 90 SS+ + + K +++ KEI + L Sbjct: 66 SSIVLEKGELQRHKQLFVPAPKLLPKEIEQWL 97 >gi|146312998|ref|YP_001178072.1| hypothetical protein Ent638_3359 [Enterobacter sp. 638] gi|166990826|sp|A4WE91|Y3359_ENT38 RecName: Full=UPF0235 protein Ent638_3359 gi|145319874|gb|ABP62021.1| protein of unknown function DUF167 [Enterobacter sp. 638] Length = 96 Score = 94.0 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85 >gi|28899393|ref|NP_798998.1| hypothetical protein VP2619 [Vibrio parahaemolyticus RIMD 2210633] gi|260366264|ref|ZP_05778723.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|260878919|ref|ZP_05891274.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|260898280|ref|ZP_05906776.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|33301894|sp|Q87LJ3|Y2619_VIBPA RecName: Full=UPF0235 protein VP2619 gi|28807629|dbj|BAC60882.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085856|gb|EFO35551.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308090459|gb|EFO40154.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308113515|gb|EFO51055.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 96 Score = 94.0 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTKFLAKQFKIAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K I I+ E+ +L Sbjct: 67 VHIEKGELGRHKQIRIESPVQIPAEVKAIL 96 >gi|84394062|ref|ZP_00992798.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01] gi|84375304|gb|EAP92215.1| hypothetical protein V12B01_07755 [Vibrio splendidus 12B01] Length = 96 Score = 94.0 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLAKYLAKQFKVAKGQ 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ + K + I I ++ Sbjct: 67 IKIEKGELGRHKQVRI-CSPSHIPTEVKAI 95 >gi|320539543|ref|ZP_08039210.1| putative conserved protein [Serratia symbiotica str. Tucson] gi|320030396|gb|EFW12408.1| putative conserved protein [Serratia symbiotica str. Tucson] Length = 97 Score = 94.0 bits (233), Expect = 7e-18, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 42/90 (46%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ +++S+ Sbjct: 13 IRLYIQPKASRDKIIGLH-------GDEVKVAITAPPVDGQANAHLIKFLAKQFKVARSN 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K ++I ++I ++ Sbjct: 66 VTIEKGELGRHKQLHII-HPQQIPAVVAAL 94 >gi|238787374|ref|ZP_04631173.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC 33641] gi|238724636|gb|EEQ16277.1| hypothetical protein yfred0001_33630 [Yersinia frederiksenii ATCC 33641] Length = 90 Score = 93.6 bits (232), Expect = 7e-18, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K + I E+ LL+ Sbjct: 60 VIIEKGELGRHKQLKIVNPQQIPPEVAALLK 90 >gi|238786243|ref|ZP_04630189.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC 43970] gi|238798802|ref|ZP_04642272.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC 43969] gi|238712858|gb|EEQ04924.1| hypothetical protein yberc0001_39320 [Yersinia bercovieri ATCC 43970] gi|238717373|gb|EEQ09219.1| hypothetical protein ymoll0001_31720 [Yersinia mollaretii ATCC 43969] Length = 90 Score = 93.6 bits (232), Expect = 7e-18, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I E+ LL+ Sbjct: 60 VIIEKGELGRHKQIKIVNPQQIPPEVAVLLE 90 >gi|167854882|ref|ZP_02477658.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755] gi|219871635|ref|YP_002476010.1| hypothetical protein HAPS_1504 [Haemophilus parasuis SH0165] gi|254800538|sp|B8F6W0|Y1504_HAEPS RecName: Full=UPF0235 protein HAPS_1504 gi|167853949|gb|EDS25187.1| hypothetical protein HPS_05418 [Haemophilus parasuis 29755] gi|219691839|gb|ACL33062.1| conserved hypothetical protein [Haemophilus parasuis SH0165] Length = 97 Score = 93.6 bits (232), Expect = 7e-18, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 41/90 (45%), Gaps = 8/90 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +TA P G+AN +L L+K + K Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITAPPIDGQANAHLLKYLSKLFKVPK 65 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 SS+ + + K I++ + K I + ++ Sbjct: 66 SSIVLEKGELQRHKQIFVP-EPKLIPKEIE 94 >gi|254508600|ref|ZP_05120716.1| conserved hypothetical protein TIGR00251 [Vibrio parahaemolyticus 16] gi|219548451|gb|EED25460.1| conserved hypothetical protein TIGR00251 [Vibrio parahaemolyticus 16] Length = 96 Score = 93.6 bits (232), Expect = 7e-18, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + L +K+ +TA P GKAN + LAK+ ++K Sbjct: 14 LRLYIQPKASRDKLVGLH-------GDELKVAITAPPVDGKANAHLSKYLAKQFKVAKGL 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ + K + + +I ++ Sbjct: 67 IKIEKGELGRHKQLRV-NSPTQIPAEIKAI 95 >gi|296533899|ref|ZP_06896426.1| protein of hypothetical function DUF167 [Roseomonas cervicalis ATCC 49957] gi|296265774|gb|EFH11872.1| protein of hypothetical function DUF167 [Roseomonas cervicalis ATCC 49957] Length = 101 Score = 93.6 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 3/90 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+ P A+++G+ D +K+ V P+ G+ANKA+ A+LA L + Sbjct: 12 VELRVKAQPKARRAGLQGWIAAPDGP---RLKLAVHEAPEDGRANKAICALLAGALHVPP 68 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S++ ++ +S K I D + E L+ Sbjct: 69 SAITVVQGATSREKTCRILGDSARLHETLE 98 >gi|89074110|ref|ZP_01160609.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34] gi|89050046|gb|EAR55572.1| hypothetical protein SKA34_22122 [Photobacterium sp. SKA34] Length = 97 Score = 93.6 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN ++ L+K+ ++K Sbjct: 15 IRLYIQPKASRDQIVGLH-------GNEVKIAITAPPVDGKANAHLVKYLSKQFKVAKGL 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + K I I+ K I + Sbjct: 68 IHLEKGLQGRHKQIRIE-TPKVIPSEIATI 96 >gi|82701713|ref|YP_411279.1| hypothetical protein Nmul_A0579 [Nitrosospira multiformis ATCC 25196] gi|82409778|gb|ABB73887.1| Protein of unknown function DUF167 [Nitrosospira multiformis ATCC 25196] Length = 105 Score = 93.6 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 7/92 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A+++ + +KIK+ A P +G AN A+LA LA + + Sbjct: 21 LTLHIQPGARRTEVVGSH-------GDALKIKLAAPPVEGAANVALLAFLAGVFGVPQRQ 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + S KI+ ID LL+ S Sbjct: 74 VILRQGARSRRKIVEIDGTACGADTLLKQTAS 105 >gi|330431471|gb|AEC16530.1| hypothetical protein UMN179_00494 [Gallibacterium anatis UMN179] Length = 95 Score = 93.6 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 8/85 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A K I L +KI +TA P GKAN +L L+K+ ++K+ Sbjct: 12 LNIILQPKAGKDQIVGL-------YGDELKITITAPPIDGKANAHLLKFLSKQFKVAKTQ 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88 + + + S K ++I ++I + Sbjct: 65 IELRKGELSRHKQVFIP-SPEQIPQ 88 >gi|301154876|emb|CBW14339.1| conserved protein [Haemophilus parainfluenzae T3T1] Length = 95 Score = 93.6 bits (232), Expect = 8e-18, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLKIILQPKASKDQIVGLH-------DDELKITITAPPVDGQANAHLLKFLSKAFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S+ + + + K ++I K I +QN Sbjct: 65 SIVLEKGELNRHKQVWIP-SPKLIPSEIQNL 94 >gi|16127852|ref|NP_422416.1| hypothetical protein CC_3622 [Caulobacter crescentus CB15] gi|221236673|ref|YP_002519110.1| YggU superfamily protein [Caulobacter crescentus NA1000] gi|47117618|sp|Q9A2E3|Y3622_CAUCR RecName: Full=UPF0235 protein CC_3622 gi|254803798|sp|B8H6E1|Y3737_CAUCN RecName: Full=UPF0235 protein CCNA_03737 gi|13425372|gb|AAK25584.1| conserved hypothetical protein [Caulobacter crescentus CB15] gi|220965846|gb|ACL97202.1| YggU superfamily protein [Caulobacter crescentus NA1000] Length = 98 Score = 93.6 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 48/90 (53%), Gaps = 3/90 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++VRL P + + D ++K++V + P +G AN A++A LAK L + + Sbjct: 7 VTLVVRLTPRGGRDAAEGWALDADGRL--YLKVRVASPPVEGAANAALIAFLAKTLKIPR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S++R+ + +++ LK + ++ D ++ Sbjct: 65 SAVRLAAGETARLKRLELEGVDPADVARAF 94 >gi|330446896|ref|ZP_08310547.1| conserved hypothetical protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491087|dbj|GAA05044.1| conserved hypothetical protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 97 Score = 93.6 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 38/88 (43%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P A + I L +KI +TA P GKAN ++ L+K+ ++K Sbjct: 15 VRLYIQPKASRDQIVGLH-------GDEIKIAITAPPVDGKANAHLVKYLSKQFKVAKGL 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + K + I+ K I ++ Sbjct: 68 IHVEKGLQGRHKQVRIEA-PKAIPNEIE 94 >gi|260913359|ref|ZP_05919840.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] gi|260632590|gb|EEX50760.1| conserved hypothetical protein [Pasteurella dagmatis ATCC 43325] Length = 100 Score = 93.3 bits (231), Expect = 9e-18, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKASKDQIVGLH-------DDELKITITAPPIDGQANAHLLKFLSKTFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ + + + K I + K I E++ Sbjct: 65 SIVLEKGELNRHKQILVPN-PKIIPEIVS 92 >gi|152971905|ref|YP_001337014.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] gi|206579395|ref|YP_002236595.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae 342] gi|238896484|ref|YP_002921222.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae NTUH-K2044] gi|262042605|ref|ZP_06015761.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|329998593|ref|ZP_08303177.1| TIGR00251 family protein [Klebsiella sp. MS 92-3] gi|166990767|sp|A6TDW3|Y3323_KLEP7 RecName: Full=UPF0235 protein KPN78578_33230 gi|226708043|sp|B5XU96|Y722_KLEP3 RecName: Full=UPF0235 protein KPK_0722 gi|150956754|gb|ABR78784.1| hypothetical protein KPN_03387 [Klebsiella pneumoniae subsp. pneumoniae MGH 78578] gi|206568453|gb|ACI10229.1| conserved hypothetical protein TIGR00251 [Klebsiella pneumoniae 342] gi|238548804|dbj|BAH65155.1| hypothetical protein KP1_4664 [Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044] gi|259040039|gb|EEW41154.1| conserved hypothetical protein [Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884] gi|328538612|gb|EGF64712.1| TIGR00251 family protein [Klebsiella sp. MS 92-3] Length = 96 Score = 93.3 bits (231), Expect = 9e-18, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 39/92 (42%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGVH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + + K + I ++I + Sbjct: 66 VLIEKGELGRHKQVKIIA-PQQIPTAVAALTE 96 >gi|121997739|ref|YP_001002526.1| hypothetical protein Hhal_0948 [Halorhodospira halophila SL1] gi|121589144|gb|ABM61724.1| protein of unknown function DUF167 [Halorhodospira halophila SL1] Length = 101 Score = 93.3 bits (231), Expect = 9e-18, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 40/79 (50%), Gaps = 8/79 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P AK+ + + +++++ P GKAN A+ +LA++ ++KS+ Sbjct: 15 VHVRVTPRAKRESLD--------VEGERLRVRLNTPPVDGKANTALRKLLARQFGVAKSA 66 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + +L + S K + I Sbjct: 67 VSLLRGERSRDKTVRITAP 85 >gi|258620721|ref|ZP_05715756.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258625606|ref|ZP_05720488.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582108|gb|EEW06975.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258586919|gb|EEW11633.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 97 Score = 93.3 bits (231), Expect = 9e-18, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 41/92 (44%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKFLAKQCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQN 92 + + + K + I + EI LL++ Sbjct: 66 VVIEKGELGRHKQVRIQQPSQIPPEIAALLES 97 >gi|283788525|ref|YP_003368390.1| hypothetical protein ROD_50331 [Citrobacter rodentium ICC168] gi|282951979|emb|CBG91706.1| conserved hypothetical protein [Citrobacter rodentium ICC168] Length = 96 Score = 93.3 bits (231), Expect = 9e-18, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDELKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VAIEKGELGRHKQVKI 81 >gi|92116228|ref|YP_575957.1| hypothetical protein Nham_0608 [Nitrobacter hamburgensis X14] gi|91799122|gb|ABE61497.1| protein of unknown function DUF167 [Nitrobacter hamburgensis X14] Length = 106 Score = 93.3 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 26/93 (27%), Positives = 54/93 (58%), Gaps = 2/93 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ +R+ P + GI +E+ D +K++V A + G+AN+A++A+LAK L + K Sbjct: 12 SIALRVTPRGGRDGIDGIEMLADGRP--VVKVRVRAIAEGGEANRAVMAVLAKALGVRKI 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 +R+L+ +S LK + + D ++ + L+ + Sbjct: 70 DVRILAGATSRLKQVAVGGDPVKLGDALRALTA 102 >gi|238759331|ref|ZP_04620497.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236] gi|238702492|gb|EEP95043.1| hypothetical protein yaldo0001_5600 [Yersinia aldovae ATCC 35236] Length = 90 Score = 93.3 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 7 LRLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLIKFIAKQFRVAKSQ 59 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I I E+ L++ Sbjct: 60 VILEKGELGRHKQIKIINPQQVPPEVAALIE 90 >gi|127512067|ref|YP_001093264.1| hypothetical protein Shew_1134 [Shewanella loihica PV-4] gi|166228889|sp|A3QC07|Y1134_SHELP RecName: Full=UPF0235 protein Shew_1134 gi|126637362|gb|ABO23005.1| protein of unknown function DUF167 [Shewanella loihica PV-4] Length = 96 Score = 93.3 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 37/88 (42%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I +K+ +TA P GKAN + LAK+ +K + Sbjct: 13 IRLYIQPKASRDQIVG-------PHGDELKVAITAPPIDGKANAHLCKFLAKQFKTAKGN 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + + K I + KEI E + Sbjct: 66 ILIEKGELGRHKQIRVVM-PKEIPEAIS 92 >gi|315633577|ref|ZP_07888867.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] gi|315477619|gb|EFU68361.1| conserved hypothetical protein [Aggregatibacter segnis ATCC 33393] Length = 97 Score = 93.3 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 8/90 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKDQIVGLH-------DDELKISITAPPVDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+ + + + K + I K I ++ Sbjct: 65 SIVLEKGELNRHKQVLIPC-PKAIPPQVEA 93 >gi|148263238|ref|YP_001229944.1| hypothetical protein Gura_1167 [Geobacter uraniireducens Rf4] gi|146396738|gb|ABQ25371.1| protein of unknown function DUF167 [Geobacter uraniireducens Rf4] Length = 102 Score = 93.3 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 44/88 (50%), Gaps = 8/88 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V + P A ++ I ++ +K+++TA P + ANK + +LAK L ++KS + Sbjct: 22 TVHVQPRASRNEICGVQ-------GDELKLRLTAPPVEDAANKLCVELLAKALKVAKSRV 74 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + + S K + ++ + + LL+ Sbjct: 75 TITAGAKSRHKTVKVEGITTEPVLSLLK 102 >gi|147678156|ref|YP_001212371.1| hypothetical protein PTH_1821 [Pelotomaculum thermopropionicum SI] gi|189039033|sp|A5D180|Y1821_PELTS RecName: Full=UPF0235 protein PTH_1821 gi|146274253|dbj|BAF60002.1| uncharacterized conserved protein [Pelotomaculum thermopropionicum SI] Length = 95 Score = 93.3 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 7/88 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ +A L +KI++TA P G+AN+A A LA L+L Sbjct: 11 VLLKVRVQPRAARNQVAGL-------YEDALKIRLTAPPVDGEANEACRAFLADSLSLPP 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89 S + ++S +S K++ I E Sbjct: 64 SKVEIVSGHASRTKVVKIAGVGAEKVRR 91 >gi|29840014|ref|NP_829120.1| hypothetical protein CCA00247 [Chlamydophila caviae GPIC] gi|33301875|sp|Q824A6|Y247_CHLCV RecName: Full=UPF0235 protein CCA_00247 gi|29834361|gb|AAP04998.1| conserved hypothetical protein [Chlamydophila caviae GPIC] Length = 92 Score = 93.3 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 27/85 (31%), Positives = 50/85 (58%), Gaps = 7/85 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P ++++ I E +KI+VT P+KGKAN+A++A+LAK L+L K Sbjct: 8 LEVKVTPKSRENKIVGFE-------GEVLKIRVTEAPEKGKANEAVIALLAKTLSLPKRD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88 + ++S ++S K + + K + I Sbjct: 61 VTLISGETSRKKRLLLPKSTESIIS 85 >gi|288933577|ref|YP_003437636.1| hypothetical protein Kvar_0694 [Klebsiella variicola At-22] gi|290511356|ref|ZP_06550725.1| conserved hypothetical protein [Klebsiella sp. 1_1_55] gi|288888306|gb|ADC56624.1| protein of unknown function DUF167 [Klebsiella variicola At-22] gi|289776349|gb|EFD84348.1| conserved hypothetical protein [Klebsiella sp. 1_1_55] Length = 96 Score = 92.9 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 39/92 (42%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I + +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGVH-------GDELKVAITAPPVDGQANAHLVKFLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + + K + I ++I + Sbjct: 66 VLIEKGELGRHKQVKIIA-PQQIPTAVAALTE 96 >gi|302879874|ref|YP_003848438.1| hypothetical protein Galf_2678 [Gallionella capsiferriformans ES-2] gi|302582663|gb|ADL56674.1| protein of unknown function DUF167 [Gallionella capsiferriformans ES-2] Length = 96 Score = 92.9 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 40/90 (44%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AK+S I L +K+K+ A P G+AN+A+L +A+ + Sbjct: 14 LTLHVQPGAKRSEICGLH-------GEALKLKLAAPPIDGRANEALLKYIAELFRVPVRQ 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + S K++ + + LL + Sbjct: 67 VELRQGAQSRHKVVAVTDSAIQPESLLTDQ 96 >gi|62184885|ref|YP_219670.1| hypothetical protein CAB243 [Chlamydophila abortus S26/3] gi|81312941|sp|Q5L6M2|Y243_CHLAB RecName: Full=UPF0235 protein CAB243 gi|62147952|emb|CAH63699.1| conserved hypothetical protein [Chlamydophila abortus S26/3] Length = 96 Score = 92.9 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 28/85 (32%), Positives = 48/85 (56%), Gaps = 7/85 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P +K++ I E +KI+VT P+KGKAN+A++A+LAK L+L K Sbjct: 8 LEVKVTPKSKQNTIVGFE-------GEVLKIRVTEVPEKGKANEAVIALLAKALSLPKRD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88 + ++ +S K I + K + I Sbjct: 61 ITLIPGDTSRKKRILLPKSTESIVS 85 >gi|261867124|ref|YP_003255046.1| hypothetical protein D11S_0417 [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412456|gb|ACX81827.1| hypothetical protein D11S_0417 [Aggregatibacter actinomycetemcomitans D11S-1] Length = 97 Score = 92.9 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKDQIVGLH-------DDELKISITAPPVDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ + + + K + I K I ++ Sbjct: 65 SIVLEKGELNRHKQVLIP-SPKVIPPQIE 92 >gi|262172428|ref|ZP_06040106.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451] gi|261893504|gb|EEY39490.1| hypothetical protein VII_003257 [Vibrio mimicus MB-451] Length = 97 Score = 92.9 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQN 92 + + + K + I + EI LL++ Sbjct: 66 VVIEKGELGRHKQVRIQQPNQIPPEIAALLES 97 >gi|257056705|ref|YP_003134537.1| hypothetical protein Svir_27290 [Saccharomonospora viridis DSM 43017] gi|256586577|gb|ACU97710.1| uncharacterized conserved protein [Saccharomonospora viridis DSM 43017] Length = 118 Score = 92.9 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 19/89 (21%), Positives = 39/89 (43%), Gaps = 3/89 (3%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 +R+ P AK+ + + D + + + V A GKAN+A+ +LA+ L++ L Sbjct: 22 AIRVKPGAKRDAVGGI---WDGALGEALVVSVRAPAVDGKANEAVCRVLAEALSVRARDL 78 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELLQNN 93 ++ + K++ + E L Sbjct: 79 TVVKGHRARDKLVELRDPPPGCAERLAEL 107 >gi|15603178|ref|NP_246251.1| hypothetical protein PM1313 [Pasteurella multocida subsp. multocida str. Pm70] gi|29839744|sp|Q9CLC6|Y1313_PASMU RecName: Full=UPF0235 protein PM1313 gi|12721676|gb|AAK03397.1| unknown [Pasteurella multocida subsp. multocida str. Pm70] Length = 99 Score = 92.9 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 10/92 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L +KI +TA P G+AN +L L+K + KS Sbjct: 15 RLRIFLQPKASKDQIVGLH-------DNELKITITAPPIDGQANAHLLKFLSKTFKVPKS 67 Query: 63 SLRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 S+ + + + K I I E+ LL+ Sbjct: 68 SIVLEKGELNRHKQILIPNPKVIPTEVNVLLK 99 >gi|293390704|ref|ZP_06635038.1| hypothetical protein D7S_0840 [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951238|gb|EFE01357.1| hypothetical protein D7S_0840 [Aggregatibacter actinomycetemcomitans D7S-1] Length = 97 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K+ I L +KI +TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKNQIVGLH-------DDELKISITAPPVDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 S+ + + + K + I K I ++ Sbjct: 65 SIVLEKGELNRHKQVLIP-SPKVIPPQIE 92 >gi|260775569|ref|ZP_05884466.1| hypothetical protein VIC_000947 [Vibrio coralliilyticus ATCC BAA-450] gi|260608750|gb|EEX34915.1| hypothetical protein VIC_000947 [Vibrio coralliilyticus ATCC BAA-450] Length = 96 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + + L +KI +TA P GKAN + L+K+ ++K Sbjct: 12 VLLRLYIQPKASRDKLIGLH-------GDEIKIAITAPPVDGKANAHLSKYLSKQFKVAK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + + + K + I I E ++ Sbjct: 65 GLITIEKGELGRHKQVRIQ-SPAHIPETIKAI 95 >gi|296104617|ref|YP_003614763.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp. cloacae ATCC 13047] gi|295059076|gb|ADF63814.1| hypothetical protein ECL_04282 [Enterobacter cloacae subsp. cloacae ATCC 13047] Length = 95 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 34/80 (42%), Gaps = 7/80 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + LAK+ ++KS Sbjct: 13 LRLYIQPKASRDNIVGLH-------GDELKVAITAPPVDGQANAHLTKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + + K + I Sbjct: 66 VIIEKGELGRHKQVKILNPQ 85 >gi|90580282|ref|ZP_01236089.1| hypothetical protein VAS14_20161 [Vibrio angustum S14] gi|90438584|gb|EAS63768.1| hypothetical protein VAS14_20161 [Vibrio angustum S14] Length = 97 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 34/79 (43%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P GKAN ++ LAK+ ++K Sbjct: 15 IRLYIQPKASRDQIVGLH-------GNEVKIAITAPPVDGKANAHLVKYLAKQFKVAKGL 67 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + K I I+ Sbjct: 68 IHVEKGLQGRHKQIRIEAP 86 >gi|170023077|ref|YP_001719582.1| hypothetical protein YPK_0828 [Yersinia pseudotuberculosis YPIII] gi|226708089|sp|B1JNN7|Y828_YERPY RecName: Full=UPF0235 protein YPK_0828 gi|169749611|gb|ACA67129.1| protein of unknown function DUF167 [Yersinia pseudotuberculosis YPIII] Length = 96 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I + E+T LL+ Sbjct: 66 VIIEKGELGRHKQIKVINPQQIPPEVTILLE 96 >gi|162418314|ref|YP_001604771.1| hypothetical protein YpAngola_A0141 [Yersinia pestis Angola] gi|165924856|ref|ZP_02220688.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. F1991016] gi|165937260|ref|ZP_02225824.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. IP275] gi|166010278|ref|ZP_02231176.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. E1979001] gi|166212808|ref|ZP_02238843.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. B42003004] gi|167422007|ref|ZP_02313760.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167426697|ref|ZP_02318450.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Mediaevalis str. K1973002] gi|186896650|ref|YP_001873762.1| hypothetical protein YPTS_3350 [Yersinia pseudotuberculosis PB1/+] gi|294502893|ref|YP_003566955.1| hypothetical protein YPZ3_0783 [Yersinia pestis Z176003] gi|21960273|gb|AAM86880.1|AE013934_3 hypothetical protein y3330 [Yersinia pestis KIM 10] gi|45438104|gb|AAS63652.1| conserved hypothetical protein [Yersinia pestis biovar Microtus str. 91001] gi|162351129|gb|ABX85077.1| conserved hypothetical protein TIGR00251 [Yersinia pestis Angola] gi|165914734|gb|EDR33347.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. IP275] gi|165923056|gb|EDR40207.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. F1991016] gi|165990764|gb|EDR43065.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. E1979001] gi|166206100|gb|EDR50580.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. B42003004] gi|166960144|gb|EDR56165.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167054300|gb|EDR64119.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Mediaevalis str. K1973002] gi|186699676|gb|ACC90305.1| protein of unknown function DUF167 [Yersinia pseudotuberculosis PB1/+] gi|262360928|gb|ACY57649.1| hypothetical protein YPD4_0740 [Yersinia pestis D106004] gi|294353352|gb|ADE63693.1| hypothetical protein YPZ3_0783 [Yersinia pestis Z176003] Length = 100 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 17 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLVKFIAKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I + E+T LL+ Sbjct: 70 VIIEKGELGRHKQIKVINPQQIPPEVTILLE 100 >gi|121535253|ref|ZP_01667067.1| protein of unknown function DUF167 [Thermosinus carboxydivorans Nor1] gi|121306138|gb|EAX47066.1| protein of unknown function DUF167 [Thermosinus carboxydivorans Nor1] Length = 100 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 8/87 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 +++ P A ++ + L +K+ V + P +G+AN+A LA A ++K+ + Sbjct: 18 KIKVQPRASRNAVIGL-------AGDSLKVCVASPPVEGEANQACLAFFAALFGVAKTRI 70 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELL 90 ++S Q S K+I I D ++ +L Sbjct: 71 VLVSGQKSRSKVIKIMGIDMEQFKTVL 97 >gi|62129245|gb|AAX66948.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] Length = 100 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 70 IVIEKGELGRHKQVKI 85 >gi|300024715|ref|YP_003757326.1| hypothetical protein Hden_3210 [Hyphomicrobium denitrificans ATCC 51888] gi|299526536|gb|ADJ25005.1| protein of unknown function DUF167 [Hyphomicrobium denitrificans ATCC 51888] Length = 116 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 27/88 (30%), Positives = 43/88 (48%), Gaps = 3/88 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V RL P + K I + D + +V A P+ G AN A+ ++A+ L L K S Sbjct: 24 VHFRLTPKSSKDAIEGVTSTSDGP---AFQARVRAVPEHGAANAALEQLVARWLDLPKRS 80 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S LK + ID + +E+ LL+ Sbjct: 81 VSLAKGGKSRLKALQIDGEPEELDRLLE 108 >gi|224584895|ref|YP_002638694.1| hypothetical protein SPC_3167 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] gi|224469423|gb|ACN47253.1| hypothetical protein SPC_3167 [Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594] Length = 93 Score = 92.5 bits (229), Expect = 2e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 10 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 62 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 63 IVIEKGELGRHKQVKI 78 >gi|254443993|ref|ZP_05057469.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] gi|198258301|gb|EDY82609.1| conserved hypothetical protein [Verrucomicrobiae bacterium DG1235] Length = 94 Score = 92.1 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 27/87 (31%), Positives = 50/87 (57%), Gaps = 6/87 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+++PNA +S IA + +KI++ + PQ GKANKA++A LAK+ +SK+ Sbjct: 11 LSVKVLPNASRSEIAGW------LEDGSLKIRIQSPPQDGKANKALIAFLAKETGVSKNQ 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + ++S K+I ++ + L Sbjct: 65 ISIARGETSRQKLIAFERLSSSQWQRL 91 >gi|187735949|ref|YP_001878061.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC BAA-835] gi|187426001|gb|ACD05280.1| protein of unknown function DUF167 [Akkermansia muciniphila ATCC BAA-835] Length = 96 Score = 92.1 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 31/94 (32%), Positives = 53/94 (56%), Gaps = 3/94 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++IPNAKKS E +D +K+++ A P +GKANKA++ L+ L + + Sbjct: 1 MKLALKVIPNAKKSEAVGWE--EDPRAGRALKLRIAAPPVEGKANKAVVLFLSAWLDIPR 58 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNND 94 SS+ L +SS LK++ + D ++ LL D Sbjct: 59 SSISFLRGESSRLKVVELPDGCEGKLARLLSAED 92 >gi|108806318|ref|YP_650234.1| hypothetical protein YPA_0321 [Yersinia pestis Antiqua] gi|108813301|ref|YP_649068.1| hypothetical protein YPN_3141 [Yersinia pestis Nepal516] gi|145597878|ref|YP_001161954.1| hypothetical protein YPDSF_0571 [Yersinia pestis Pestoides F] gi|149367047|ref|ZP_01889080.1| hypothetical protein YPE_2325 [Yersinia pestis CA88-4125] gi|161484758|ref|NP_670629.2| hypothetical protein y3330 [Yersinia pestis KIM 10] gi|161511310|ref|NP_994775.2| hypothetical protein YP_3498 [Yersinia pestis biovar Microtus str. 91001] gi|167399786|ref|ZP_02305304.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167468163|ref|ZP_02332867.1| hypothetical protein YpesF_09749 [Yersinia pestis FV-1] gi|218928116|ref|YP_002345991.1| hypothetical protein YPO0944 [Yersinia pestis CO92] gi|229837637|ref|ZP_04457799.1| conserved protein [Yersinia pestis Pestoides A] gi|229840863|ref|ZP_04461022.1| conserved protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842576|ref|ZP_04462731.1| conserved protein [Yersinia pestis biovar Orientalis str. India 195] gi|229903764|ref|ZP_04518877.1| conserved protein [Yersinia pestis Nepal516] gi|270487542|ref|ZP_06204616.1| conserved hypothetical protein TIGR00251 [Yersinia pestis KIM D27] gi|29839591|sp|Q8ZHF5|Y944_YERPE RecName: Full=UPF0235 protein YPO0944/y3330/YP_3498 gi|122383790|sp|Q1CB83|Y321_YERPA RecName: Full=UPF0235 protein YPA_0321 gi|122384257|sp|Q1CEW2|Y3141_YERPN RecName: Full=UPF0235 protein YPN_3141 gi|166229066|sp|A4TI69|Y571_YERPP RecName: Full=UPF0235 protein YPDSF_0571 gi|108776949|gb|ABG19468.1| hypothetical protein YPN_3141 [Yersinia pestis Nepal516] gi|108778231|gb|ABG12289.1| hypothetical protein YPA_0321 [Yersinia pestis Antiqua] gi|115346727|emb|CAL19610.1| conserved hypothetical protein [Yersinia pestis CO92] gi|145209574|gb|ABP38981.1| hypothetical protein YPDSF_0571 [Yersinia pestis Pestoides F] gi|149290661|gb|EDM40737.1| hypothetical protein YPE_2325 [Yersinia pestis CA88-4125] gi|167050494|gb|EDR61902.1| conserved hypothetical protein TIGR00251 [Yersinia pestis biovar Antiqua str. UG05-0454] gi|229679534|gb|EEO75637.1| conserved protein [Yersinia pestis Nepal516] gi|229690886|gb|EEO82940.1| conserved protein [Yersinia pestis biovar Orientalis str. India 195] gi|229697229|gb|EEO87276.1| conserved protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229704325|gb|EEO91336.1| conserved protein [Yersinia pestis Pestoides A] gi|270336046|gb|EFA46823.1| conserved hypothetical protein TIGR00251 [Yersinia pestis KIM D27] gi|320013974|gb|ADV97545.1| conserved protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 96 Score = 92.1 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANTHLVKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I + E+T LL+ Sbjct: 66 VIIEKGELGRHKQIKVINPQQIPPEVTILLE 96 >gi|319780857|ref|YP_004140333.1| hypothetical protein Mesci_1119 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166745|gb|ADV10283.1| protein of unknown function DUF167 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 105 Score = 92.1 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 49/92 (53%), Gaps = 2/92 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A + +E D + H+K +V A P+ G AN A+ ++AK + + Sbjct: 12 VELFVRLTPKAALDRLEGIETTAD--ERSHLKARVRAVPENGAANHALEKLIAKAIGVPG 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S++ +++ ++ LK + I+ D + + + ++ Sbjct: 70 SAVSVVAGGTARLKTVRIEGDPETLAKSIEAL 101 >gi|237729884|ref|ZP_04560365.1| conserved hypothetical protein [Citrobacter sp. 30_2] gi|283835286|ref|ZP_06355027.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220] gi|226908490|gb|EEH94408.1| conserved hypothetical protein [Citrobacter sp. 30_2] gi|291068444|gb|EFE06553.1| putative cytoplasmic protein [Citrobacter youngae ATCC 29220] Length = 96 Score = 92.1 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VVIEKGELGRHKQVKI 81 >gi|30248412|ref|NP_840482.1| hypothetical protein NE0395 [Nitrosomonas europaea ATCC 19718] gi|47117515|sp|Q82X93|Y395_NITEU RecName: Full=UPF0235 protein NE0395 gi|30138298|emb|CAD84306.1| DUF167 [Nitrosomonas europaea ATCC 19718] Length = 100 Score = 92.1 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 43/96 (44%), Gaps = 10/96 (10%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + P A+++ + +KIK+ A P GKAN+A+ LAK+ + Sbjct: 12 LLILKLYVQPGARQTEAVGI-------CGEELKIKLAALPVDGKANRALTEFLAKRFNVP 64 Query: 61 KSSLRMLSKQSSPLKIIYI---DKDCKEITELLQNN 93 + ++ + + S K++ + + + ++ Sbjct: 65 RKNITLKRGEQSRHKVVEVCQSSNGPEVLFSEMRAE 100 >gi|300115523|ref|YP_003762098.1| hypothetical protein Nwat_3056 [Nitrosococcus watsonii C-113] gi|299541460|gb|ADJ29777.1| protein of unknown function DUF167 [Nitrosococcus watsonii C-113] Length = 102 Score = 92.1 bits (228), Expect = 3e-17, Method: Composition-based stats. Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 7/84 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +RL P A+ + +KI++TA P +GKAN +L L K +S++ Sbjct: 15 VQIRLQPRARGDEVIG-------PHGNRLKIRITAPPVEGKANTQLLRFLVKTFQVSRNQ 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + +LS +S K + I+K K + Sbjct: 68 VYLLSGTASRDKRVRIEKPAKLLP 91 >gi|215488251|ref|YP_002330682.1| hypothetical protein E2348C_3206 [Escherichia coli O127:H6 str. E2348/69] gi|312964784|ref|ZP_07779024.1| conserved hypothetical protein [Escherichia coli 2362-75] gi|331684580|ref|ZP_08385172.1| conserved hypothetical protein [Escherichia coli H299] gi|254814154|sp|B7UI00|YGGU_ECO27 RecName: Full=UPF0235 protein yggU gi|215266323|emb|CAS10754.1| predicted protein [Escherichia coli O127:H6 str. E2348/69] gi|312290340|gb|EFR18220.1| conserved hypothetical protein [Escherichia coli 2362-75] gi|331078195|gb|EGI49401.1| conserved hypothetical protein [Escherichia coli H299] Length = 96 Score = 92.1 bits (228), Expect = 3e-17, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K I I EI LL Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEIAALL 95 >gi|170766160|ref|ZP_02900971.1| conserved hypothetical protein [Escherichia albertii TW07627] gi|170125306|gb|EDS94237.1| conserved hypothetical protein [Escherichia albertii TW07627] Length = 96 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIRIINPQQIPPEI 91 >gi|110635561|ref|YP_675769.1| hypothetical protein Meso_3232 [Mesorhizobium sp. BNC1] gi|110286545|gb|ABG64604.1| protein of unknown function DUF167 [Chelativorans sp. BNC1] Length = 116 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 24/92 (26%), Positives = 40/92 (43%), Gaps = 2/92 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P + I + D +K +V A P+ GKAN+A+ +LA L + + Sbjct: 16 IFVRLTPKSSSDAIEGVTEGPDGQ--AFLKARVRAIPEAGKANEALERLLASALGVPRRD 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + S +S K + I D + L Sbjct: 74 VAVSSGAASRRKTVSITGDAAPLIASLDTIAE 105 >gi|190150700|ref|YP_001969225.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307264052|ref|ZP_07545650.1| hypothetical protein appser13_14550 [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|226734158|sp|B3GYF9|Y1431_ACTP7 RecName: Full=UPF0235 protein APP7_1431 gi|189915831|gb|ACE62083.1| hypothetical protein APP7_1431 [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306870598|gb|EFN02344.1| hypothetical protein appser13_14550 [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 97 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +TA P G AN +L L+K + K Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DSELKIAITAPPVDGAANAHLLKYLSKLFKVPK 65 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SS+ + + K +++ + K I + ++ Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLIPKEIEAL 96 >gi|297580597|ref|ZP_06942523.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535013|gb|EFH73848.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 96 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EIT L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96 >gi|328952006|ref|YP_004369340.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109] gi|328452330|gb|AEB08159.1| UPF0235 protein yggU [Desulfobacca acetoxidans DSM 11109] Length = 101 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + ++P A + I +KI++ A P+KG ANK +L LAK L L K+ Sbjct: 13 LRIHVVPGAASNQIMG-------PHGDRLKIRIAAAPEKGAANKELLNYLAKCLGLPKNR 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 L + S +K++ + E+ E LQ Sbjct: 66 LHLKSGAQDRVKVVEVVGLAPEVQERLQAL 95 >gi|296445622|ref|ZP_06887577.1| protein of unknown function DUF167 [Methylosinus trichosporium OB3b] gi|296256867|gb|EFH03939.1| protein of unknown function DUF167 [Methylosinus trichosporium OB3b] Length = 109 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 49/91 (53%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P + ++ +E D +K +V A P+ G+AN+A++A++A+ L K Sbjct: 14 VVLWVRLTPKGGRDALSGVETLADG--RAVLKARVRAAPEDGRANEALVALIAQALGAPK 71 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S+++ + ++ LK ++I D + L+ Sbjct: 72 RSVQIAAGHTARLKKLFIAGDPASLVAALEK 102 >gi|51597526|ref|YP_071717.1| hypothetical protein YPTB3216 [Yersinia pseudotuberculosis IP 32953] gi|153948675|ref|YP_001399811.1| hypothetical protein YpsIP31758_0827 [Yersinia pseudotuberculosis IP 31758] gi|81638596|sp|Q666N2|Y3216_YERPS RecName: Full=UPF0235 protein YPTB3216 gi|167016819|sp|A7FEY3|Y827_YERP3 RecName: Full=UPF0235 protein YpsIP31758_0827 gi|51590808|emb|CAH22454.1| Conserved hypothetical protein [Yersinia pseudotuberculosis IP 32953] gi|152960170|gb|ABS47631.1| conserved hypothetical protein TIGR00251 [Yersinia pseudotuberculosis IP 31758] Length = 96 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 21/91 (23%), Positives = 41/91 (45%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ +AK+ ++KS Sbjct: 13 LKLYIQPKASRDQIVGLH-------GDELKVAITAPPVDGQANAHLVKFIAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K I + E+T LL+ Sbjct: 66 VIIEKGELGRHKQIKVINPQQIPPEVTILLK 96 >gi|83594855|ref|YP_428607.1| hypothetical protein Rru_A3526 [Rhodospirillum rubrum ATCC 11170] gi|83577769|gb|ABC24320.1| Protein of unknown function DUF167 [Rhodospirillum rubrum ATCC 11170] Length = 113 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 29/88 (32%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +RL P A + G++ + D S +K VTA P+ GKAN A+L +L+++ L +S Sbjct: 18 RLALRLTPKAGRDGVSGVVAEADGSL--VVKASVTAVPEDGKANAALLKLLSRQWKLPRS 75 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELL 90 SL ++ Q+ K+I I + +T L Sbjct: 76 SLAVVHGQTDRRKVIEISGEPALLTPRL 103 >gi|331648708|ref|ZP_08349796.1| conserved hypothetical protein [Escherichia coli M605] gi|331042455|gb|EGI14597.1| conserved hypothetical protein [Escherichia coli M605] Length = 100 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95 >gi|162139546|ref|YP_218029.2| hypothetical protein SC3042 [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|168242905|ref|ZP_02667837.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|194448304|ref|YP_002047090.1| hypothetical protein SeHA_C3341 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197249095|ref|YP_002148016.1| hypothetical protein SeAg_B3265 [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|226730824|sp|B5F5M7|YGGU_SALA4 RecName: Full=UPF0235 protein yggU gi|226730828|sp|B4THI7|YGGU_SALHS RecName: Full=UPF0235 protein yggU gi|194406608|gb|ACF66827.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Heidelberg str. SL476] gi|197212798|gb|ACH50195.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|205338138|gb|EDZ24902.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Heidelberg str. SL486] gi|322716095|gb|EFZ07666.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 96 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|156932605|ref|YP_001436521.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC BAA-894] gi|260599282|ref|YP_003211853.1| hypothetical protein CTU_34900 [Cronobacter turicensis z3032] gi|166229055|sp|A7MP89|Y387_ENTS8 RecName: Full=UPF0235 protein ESA_00387 gi|156530859|gb|ABU75685.1| hypothetical protein ESA_00387 [Cronobacter sakazakii ATCC BAA-894] gi|260218459|emb|CBA33595.1| UPF0235 protein ESA_00387 [Cronobacter turicensis z3032] Length = 96 Score = 91.7 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 40/91 (43%), Gaps = 8/91 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ LAK+ ++KS Sbjct: 13 LRLYIQPKASRDSIIGLH-------GDELKVAITAPPVDGQANAHLVKYLAKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + + K + I + ++I + Sbjct: 66 VVIEKGELGRHKQVKII-EPQQIPTEVAAVT 95 >gi|187732088|ref|YP_001881726.1| hypothetical protein SbBS512_E3385 [Shigella boydii CDC 3083-94] gi|188496122|ref|ZP_03003392.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638] gi|331643646|ref|ZP_08344777.1| conserved hypothetical protein [Escherichia coli H736] gi|882482|gb|AAA69120.1| ORF_o100 [Escherichia coli str. K-12 substr. MG1655] gi|81246835|gb|ABB67543.1| conserved hypothetical protein [Shigella boydii Sb227] gi|187429080|gb|ACD08354.1| conserved hypothetical protein [Shigella boydii CDC 3083-94] gi|188491321|gb|EDU66424.1| conserved hypothetical protein TIGR00251 [Escherichia coli 53638] gi|331037117|gb|EGI09341.1| conserved hypothetical protein [Escherichia coli H736] Length = 100 Score = 91.3 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95 >gi|217979671|ref|YP_002363818.1| protein of unknown function DUF167 [Methylocella silvestris BL2] gi|217505047|gb|ACK52456.1| protein of unknown function DUF167 [Methylocella silvestris BL2] Length = 118 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 25/97 (25%), Positives = 49/97 (50%), Gaps = 2/97 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P + + I D +K +V A PQ G+AN A++ ++AK L L+ Sbjct: 22 VVLTVRLTPKSARDEIEGASQLSDG--RAVLKARVRAAPQDGEANAALIRLVAKALRLAP 79 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 S++R+ + ++ LK + + D + + + L + + Sbjct: 80 SAVRVEAGATARLKTLCLTGDPETLQQSLAELAARAV 116 >gi|168703344|ref|ZP_02735621.1| hypothetical protein GobsU_27681 [Gemmata obscuriglobus UQM 2246] Length = 101 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 28/94 (29%), Positives = 51/94 (54%), Gaps = 8/94 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + VR+ P AKK+ + +++ VTA P+ G+AN A+LA+L L + Sbjct: 12 CTLAVRVQPKAKKNAVLG-------ERASALRVSVTAPPEDGRANDAVLALLCDHFKLQR 64 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94 S L +LS Q++ K+I + +++ +L+ +D Sbjct: 65 SQLALLSGQTNRNKVILVRGVTPQQLADLIPASD 98 >gi|289164111|ref|YP_003454249.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150] gi|288857284|emb|CBJ11111.1| hypothetical protein LLO_0767 [Legionella longbeachae NSW150] Length = 91 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AKKS I + +KI++ A P +G+ANK +L +A+ + S Sbjct: 12 LYLYVQPGAKKSEIVGMH-------EGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQ 64 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + S K++ + Sbjct: 65 VVLKRGDKSRHKVLLVKNS 83 >gi|171914386|ref|ZP_02929856.1| hypothetical protein VspiD_24445 [Verrucomicrobium spinosum DSM 4136] Length = 92 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 2/84 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 N+ ++ PNA++S I D + +K+ A +GKANK ++ LA++L +K Sbjct: 6 VNLACKVTPNARRSEIVGW--GADEQGRGVLLVKLAAPALEGKANKELVRFLAEQLGCAK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 + +L +S K++ + E Sbjct: 64 GEVSLLRGDASRTKLLRVPGKAYE 87 >gi|157148503|ref|YP_001455822.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895] gi|166227247|sp|A8APH3|Y4329_CITK8 RecName: Full=UPF0235 protein CKO_04329 gi|157085708|gb|ABV15386.1| hypothetical protein CKO_04329 [Citrobacter koseri ATCC BAA-895] Length = 96 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VVIEKGELGRHKQVKI 81 >gi|83648992|ref|YP_437427.1| hypothetical protein HCH_06357 [Hahella chejuensis KCTC 2396] gi|83637035|gb|ABC33002.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396] Length = 102 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + L P AKK I +KIK++A P G+AN+ ++ LAK + + Sbjct: 20 LQCHLQPGAKKDEIVGTH-------GDALKIKISAPPIDGRANQQLVRFLAKLCRVKQQD 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 +++L+ +SS K I + +I +LL+ Sbjct: 73 VQILAGESSRQKRIRVQNLT-DIPKLLK 99 >gi|307293981|ref|ZP_07573825.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum L-1] gi|306880132|gb|EFN11349.1| protein of unknown function DUF167 [Sphingobium chlorophenolicum L-1] Length = 109 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 24/84 (28%), Positives = 47/84 (55%), Gaps = 2/84 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + I + + + + +V A P+KG+AN A++A+LAK+L +S+ Sbjct: 13 LAVRLTPGAAREDIGGVWTDEKGAQ--WLGARVRAVPEKGRANTALIALLAKRLDWPRSA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEIT 87 + + S ++ LK + I+ + + Sbjct: 71 ISLESGDTNRLKRLRIEGGGEPLP 94 >gi|51244641|ref|YP_064525.1| hypothetical protein DP0789 [Desulfotalea psychrophila LSv54] gi|50875678|emb|CAG35518.1| hypothetical protein DP0789 [Desulfotalea psychrophila LSv54] Length = 95 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 38/89 (42%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++V P A K+ + L +KI + P GKANK ++ L++ L K Sbjct: 12 VILLVYTQPRASKTKVVGLH-------DGMLKIACCSPPVDGKANKELIVFLSRLLDCRK 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 + +L QSS K + E+ + L Sbjct: 65 CDIELLRGQSSRRKQFVLTGVDAELLDKL 93 >gi|91212335|ref|YP_542321.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89] gi|110806861|ref|YP_690381.1| hypothetical protein SFV_3007 [Shigella flexneri 5 str. 8401] gi|157155173|ref|YP_001464306.1| hypothetical protein EcE24377A_3297 [Escherichia coli E24377A] gi|157162414|ref|YP_001459732.1| hypothetical protein EcHS_A3113 [Escherichia coli HS] gi|237706394|ref|ZP_04536875.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|254038003|ref|ZP_04872061.1| conserved hypothetical protein [Escherichia sp. 1_1_43] gi|331654465|ref|ZP_08355465.1| conserved hypothetical protein [Escherichia coli M718] gi|331674433|ref|ZP_08375193.1| conserved hypothetical protein [Escherichia coli TA280] gi|331678947|ref|ZP_08379621.1| conserved hypothetical protein [Escherichia coli H591] gi|26109782|gb|AAN81987.1|AE016766_75 Hypothetical protein yggU [Escherichia coli CFT073] gi|24053355|gb|AAN44425.1| orf, conserved hypothetical protein [Shigella flexneri 2a str. 301] gi|30042526|gb|AAP18250.1| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T] gi|91073909|gb|ABE08790.1| hypothetical protein UTI89_C3342 [Escherichia coli UTI89] gi|110616409|gb|ABF05076.1| conserved hypothetical protein [Shigella flexneri 5 str. 8401] gi|157068094|gb|ABV07349.1| conserved hypothetical protein TIGR00251 [Escherichia coli HS] gi|157077203|gb|ABV16911.1| conserved hypothetical protein TIGR00251 [Escherichia coli E24377A] gi|195183146|dbj|BAG66691.1| predicted protein [Escherichia coli O111:H-] gi|226839627|gb|EEH71648.1| conserved hypothetical protein [Escherichia sp. 1_1_43] gi|226899434|gb|EEH85693.1| conserved hypothetical protein [Escherichia sp. 3_2_53FAA] gi|294491613|gb|ADE90369.1| conserved hypothetical protein TIGR00251 [Escherichia coli IHE3034] gi|331047847|gb|EGI19924.1| conserved hypothetical protein [Escherichia coli M718] gi|331068527|gb|EGI39922.1| conserved hypothetical protein [Escherichia coli TA280] gi|331073777|gb|EGI45098.1| conserved hypothetical protein [Escherichia coli H591] gi|332102705|gb|EGJ06051.1| conserved hypothetical protein [Shigella sp. D9] Length = 100 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95 >gi|209694133|ref|YP_002262061.1| hypothetical protein VSAL_I0540 [Aliivibrio salmonicida LFI1238] gi|208008084|emb|CAQ78225.1| conserved hypothetical protein [Aliivibrio salmonicida LFI1238] Length = 83 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 15/73 (20%), Positives = 31/73 (42%), Gaps = 7/73 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + I + +K+ +TA P GKAN ++ +K ++K Sbjct: 12 LRLYLQPKASRDQIVGIH-------GEELKVAITAPPVDGKANAHLIKYFSKLFKVAKGK 64 Query: 64 LRMLSKQSSPLKI 76 + + + + K Sbjct: 65 ITVEKGELNRHKQ 77 >gi|28373966|pdb|1N91|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia Coli. Northeast Structural Genomics Consortium Target Er14. gi|60594359|pdb|1YH5|A Chain A, Solution Nmr Structure Of Protein Yggu From Escherichia Coli. Northeast Structural Genomics Consortium Target Er14 Length = 108 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + + K I I + Sbjct: 70 VVIEKGELGRHKQIKIINPQQ 90 >gi|240849835|ref|YP_002971223.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup] gi|240266958|gb|ACS50546.1| hypothetical protein Bgr_01760 [Bartonella grahamii as4aup] Length = 104 Score = 91.3 bits (226), Expect = 4e-17, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V LIP + I +E ++ I++ A P+ GKANKA++ LAK+ + SS Sbjct: 12 LFVYLIPKSSVDKIIGVECRDGEKQ--YLVIRLRAVPEDGKANKALIKFLAKQWKIPSSS 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + + S K +Y +E+ ++ Q+ Sbjct: 70 ISLKNGAISRYKQLYFSTHLEELKQIWQS 98 >gi|149192342|ref|ZP_01870547.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1] gi|148833820|gb|EDL50852.1| hypothetical protein VSAK1_11360 [Vibrio shilonii AK1] Length = 95 Score = 90.9 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 35/90 (38%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + + K+ +TA P GKAN + LAK+ ++K Sbjct: 13 LRIYVQPKASRDKLVG-------EHGEEFKVAITAPPVDGKANAHLSKYLAKQFKVAKGQ 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K + I KE LL Sbjct: 66 VLIEKGELGRHKQLRIVSPALIPKEFMALL 95 >gi|119946664|ref|YP_944344.1| hypothetical protein Ping_3043 [Psychromonas ingrahamii 37] gi|166229363|sp|A1SZ30|Y3043_PSYIN RecName: Full=UPF0235 protein Ping_3043 gi|119865268|gb|ABM04745.1| hypothetical protein DUF167 [Psychromonas ingrahamii 37] Length = 98 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 40/92 (43%), Gaps = 8/92 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P + + L +KI +TA P GKAN ++ L+K+ ++K + Sbjct: 15 LRLVLQPKSSRDQFIGL-------LGDELKIAITAPPVDGKANAHLIKFLSKQFKVAKGA 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + S K + + K++ E + + Sbjct: 68 IIIEKGLLSRHKRVRV-CAPKKMPEFFNSLNE 98 >gi|23011359|ref|ZP_00051742.1| COG1872: Uncharacterized conserved protein [Magnetospirillum magnetotacticum MS-1] Length = 99 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A ++G+ + + D + ++V A P +G AN A+ A +AK L L K+ Sbjct: 8 LSVRLTPRASRTGLDGVRVDADGRP--VLGLRVAAPPVEGAANAALTAFVAKSLKLRKAE 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++S ++S K +++ D KE+ ++ Sbjct: 66 VVLVSGEASRTKRLHLTGDAKELAARVET 94 >gi|161506346|ref|YP_001573458.1| hypothetical protein SARI_04543 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- str. RSK2980] gi|189030103|sp|A9MQS2|YGGU_SALAR RecName: Full=UPF0235 protein yggU gi|160867693|gb|ABX24316.1| hypothetical protein SARI_04543 [Salmonella enterica subsp. arizonae serovar 62:z4,z23:--] Length = 96 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDCIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 VAIEKGELGRHKQVKI 81 >gi|329942568|ref|ZP_08291378.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10] gi|313847795|emb|CBY16785.1| conserved hypothetical protein [Chlamydophila psittaci RD1] gi|325506510|gb|ADZ18148.1| conserved hypothetical protein [Chlamydophila psittaci 6BC] gi|328815478|gb|EGF85466.1| hypothetical protein G5Q_0265 [Chlamydophila psittaci Cal10] gi|328914447|gb|AEB55280.1| conserved hypothetical protein [Chlamydophila psittaci 6BC] Length = 92 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 29/85 (34%), Positives = 50/85 (58%), Gaps = 7/85 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P +K++ I E +KI+VT P+KGKAN+A++A+LAK L+L K Sbjct: 8 LEVKVTPKSKENKIVGFE-------GEVLKIRVTEVPEKGKANEAVIALLAKTLSLPKRD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88 + ++S ++S K I + K + I Sbjct: 61 VTLISGETSKNKRILLPKATESIVS 85 >gi|12517499|gb|AAG58084.1|AE005525_10 orf, hypothetical protein [Escherichia coli O157:H7 str. EDL933] gi|13363301|dbj|BAB37252.1| hypothetical protein [Escherichia coli O157:H7 str. Sakai] gi|209760028|gb|ACI78326.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760030|gb|ACI78327.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760032|gb|ACI78328.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760034|gb|ACI78329.1| hypothetical protein ECs3829 [Escherichia coli] gi|209760036|gb|ACI78330.1| hypothetical protein ECs3829 [Escherichia coli] Length = 100 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + + K I I + Sbjct: 70 VVIEKGELGRHKQIKIINPQQ 90 >gi|56415040|ref|YP_152115.1| hypothetical protein SPA2965 [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|168819868|ref|ZP_02831868.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|194446740|ref|YP_002042361.1| hypothetical protein SNSL254_A3349 [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|197363969|ref|YP_002143606.1| hypothetical protein SSPA2764 [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|204928324|ref|ZP_03219524.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433] gi|238909900|ref|ZP_04653737.1| hypothetical protein SentesTe_02030 [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] gi|81361856|sp|Q5PML0|YGGU_SALPA RecName: Full=UPF0235 protein yggU gi|226730829|sp|B4T5K9|YGGU_SALNS RecName: Full=UPF0235 protein yggU gi|226730830|sp|B5BFQ9|YGGU_SALPK RecName: Full=UPF0235 protein yggU gi|56129297|gb|AAV78803.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150] gi|194405403|gb|ACF65625.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL254] gi|197095446|emb|CAR61005.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU_12601] gi|204322646|gb|EDZ07843.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Javiana str. GA_MM04042433] gi|205343351|gb|EDZ30115.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320087534|emb|CBY97299.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] gi|322613499|gb|EFY10440.1| hypothetical protein SEEM315_07205 [Salmonella enterica subsp. enterica serovar Montevideo str. 315996572] gi|322621091|gb|EFY17949.1| hypothetical protein SEEM971_19929 [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-1] gi|322624155|gb|EFY20989.1| hypothetical protein SEEM973_20050 [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-3] gi|322628106|gb|EFY24895.1| hypothetical protein SEEM974_21245 [Salmonella enterica subsp. enterica serovar Montevideo str. 495297-4] gi|322633225|gb|EFY29967.1| hypothetical protein SEEM201_12335 [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-1] gi|322636197|gb|EFY32905.1| hypothetical protein SEEM202_13733 [Salmonella enterica subsp. enterica serovar Montevideo str. 515920-2] gi|322639535|gb|EFY36223.1| hypothetical protein SEEM954_11862 [Salmonella enterica subsp. enterica serovar Montevideo str. 531954] gi|322647532|gb|EFY44021.1| hypothetical protein SEEM054_10407 [Salmonella enterica subsp. enterica serovar Montevideo str. NC_MB110209-0054] gi|322648716|gb|EFY45163.1| hypothetical protein SEEM675_04396 [Salmonella enterica subsp. enterica serovar Montevideo str. OH_2009072675] gi|322653771|gb|EFY50097.1| hypothetical protein SEEM965_22046 [Salmonella enterica subsp. enterica serovar Montevideo str. CASC_09SCPH15965] gi|322657877|gb|EFY54145.1| hypothetical protein SEEM19N_18161 [Salmonella enterica subsp. enterica serovar Montevideo str. 19N] gi|322663980|gb|EFY60179.1| hypothetical protein SEEM801_04461 [Salmonella enterica subsp. enterica serovar Montevideo str. 81038-01] gi|322669009|gb|EFY65160.1| hypothetical protein SEEM507_10116 [Salmonella enterica subsp. enterica serovar Montevideo str. MD_MDA09249507] gi|322672997|gb|EFY69104.1| hypothetical protein SEEM877_00665 [Salmonella enterica subsp. enterica serovar Montevideo str. 414877] gi|322678012|gb|EFY74075.1| hypothetical protein SEEM867_02147 [Salmonella enterica subsp. enterica serovar Montevideo str. 366867] gi|322681188|gb|EFY77221.1| hypothetical protein SEEM180_20639 [Salmonella enterica subsp. enterica serovar Montevideo str. 413180] gi|322687882|gb|EFY83849.1| hypothetical protein SEEM600_15746 [Salmonella enterica subsp. enterica serovar Montevideo str. 446600] gi|323194922|gb|EFZ80109.1| hypothetical protein SEEM581_15857 [Salmonella enterica subsp. enterica serovar Montevideo str. 609458-1] gi|323199626|gb|EFZ84716.1| hypothetical protein SEEM501_16285 [Salmonella enterica subsp. enterica serovar Montevideo str. 556150-1] gi|323202627|gb|EFZ87667.1| hypothetical protein SEEM460_18160 [Salmonella enterica subsp. enterica serovar Montevideo str. 609460] gi|323207886|gb|EFZ92832.1| hypothetical protein SEEM020_18480 [Salmonella enterica subsp. enterica serovar Montevideo str. 507440-20] gi|323212562|gb|EFZ97379.1| hypothetical protein SEEM6152_10068 [Salmonella enterica subsp. enterica serovar Montevideo str. 556152] gi|323214955|gb|EFZ99703.1| hypothetical protein SEEM0077_03334 [Salmonella enterica subsp. enterica serovar Montevideo str. MB101509-0077] gi|323222685|gb|EGA07050.1| hypothetical protein SEEM0047_09590 [Salmonella enterica subsp. enterica serovar Montevideo str. MB102109-0047] gi|323225428|gb|EGA09660.1| hypothetical protein SEEM0055_15033 [Salmonella enterica subsp. enterica serovar Montevideo str. MB110209-0055] gi|323230557|gb|EGA14675.1| hypothetical protein SEEM0052_19874 [Salmonella enterica subsp. enterica serovar Montevideo str. MB111609-0052] gi|323235092|gb|EGA19178.1| hypothetical protein SEEM3312_06118 [Salmonella enterica subsp. enterica serovar Montevideo str. 2009083312] gi|323239131|gb|EGA23181.1| hypothetical protein SEEM5258_05995 [Salmonella enterica subsp. enterica serovar Montevideo str. 2009085258] gi|323244511|gb|EGA28517.1| hypothetical protein SEEM1156_01212 [Salmonella enterica subsp. enterica serovar Montevideo str. 315731156] gi|323247126|gb|EGA31092.1| hypothetical protein SEEM9199_21270 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2009159199] gi|323253391|gb|EGA37220.1| hypothetical protein SEEM8282_09961 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008282] gi|323256302|gb|EGA40038.1| hypothetical protein SEEM8283_11916 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008283] gi|323262522|gb|EGA46078.1| hypothetical protein SEEM8284_02121 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008284] gi|323267382|gb|EGA50866.1| hypothetical protein SEEM8285_21407 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008285] gi|323269214|gb|EGA52669.1| hypothetical protein SEEM8287_10997 [Salmonella enterica subsp. enterica serovar Montevideo str. IA_2010008287] gi|323669738|emb|CBJ94862.1| conserved hypothetical protein [Salmonella bongori] gi|327412908|emb|CAX67922.1| conserved hypothetical protein [Salmonella bongori] Length = 96 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLIKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|222054893|ref|YP_002537255.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] gi|221564182|gb|ACM20154.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] Length = 102 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 7/80 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V + P A ++ I ++ +++++TA P ANK + +LAK L ++KS L Sbjct: 22 TVHVQPRASRNEICGVQ-------GDELRLRLTAPPVDDAANKLCIELLAKALGVAKSHL 74 Query: 65 RMLSKQSSPLKIIYIDKDCK 84 + S S K I + K Sbjct: 75 ALTSGAKSRHKTIRAEGVSK 94 >gi|81242423|gb|ABB63133.1| conserved hypothetical protein [Shigella dysenteriae Sd197] Length = 100 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANGHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEI 95 >gi|15618408|ref|NP_224693.1| hypothetical protein CPn0497 [Chlamydophila pneumoniae CWL029] gi|15836028|ref|NP_300552.1| hypothetical protein CPj0497 [Chlamydophila pneumoniae J138] gi|16752546|ref|NP_444808.1| hypothetical protein CP0257 [Chlamydophila pneumoniae AR39] gi|33241848|ref|NP_876789.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183] gi|29839694|sp|Q9Z854|Y497_CHLPN RecName: Full=UPF0235 protein CPn_0497/CP_0257/CPj0497/CpB0517 gi|4376783|gb|AAD18637.1| CT388 hypothetical protein [Chlamydophila pneumoniae CWL029] gi|7189184|gb|AAF38120.1| conserved hypothetical protein [Chlamydophila pneumoniae AR39] gi|8978867|dbj|BAA98703.1| CT388 hypothetical protein [Chlamydophila pneumoniae J138] gi|33236357|gb|AAP98446.1| hypothetical protein CpB0517 [Chlamydophila pneumoniae TW-183] gi|269303374|gb|ACZ33474.1| conserved hypothetical protein TIGR00251 [Chlamydophila pneumoniae LPCoLN] Length = 90 Score = 90.9 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 49/87 (56%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P AK++ I + +K++VT P+KGKAN A++++LAK L+L K Sbjct: 7 LEVKVTPKAKENKIVGFD-------GQALKVRVTEPPEKGKANDAVISLLAKALSLPKRD 59 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + +++ ++S K + ++I L Sbjct: 60 VTLIAGETSRKKKFLLPNRVQDIIFSL 86 >gi|229530300|ref|ZP_04419688.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)] gi|229332073|gb|EEN97561.1| hypothetical protein VCG_003420 [Vibrio cholerae 12129(1)] gi|327483312|gb|AEA77719.1| UPF0235 protein [Vibrio cholerae LMA3894-4] Length = 96 Score = 90.9 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EIT L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96 >gi|147674564|ref|YP_001215984.1| hypothetical protein VC0395_A0010 [Vibrio cholerae O395] gi|153216287|ref|ZP_01950380.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|262167148|ref|ZP_06034862.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27] gi|172047756|sp|A5F9H9|Y1210_VIBC3 RecName: Full=UPF0235 protein VC0395_A0010/VC395_0502 gi|124114376|gb|EAY33196.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|146316447|gb|ABQ20986.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227012311|gb|ACP08521.1| conserved hypothetical protein [Vibrio cholerae O395] gi|262024448|gb|EEY43135.1| hypothetical protein VIJ_000308 [Vibrio cholerae RC27] Length = 96 Score = 90.6 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EIT L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEITALIE 96 >gi|281179962|dbj|BAI56292.1| conserved hypothetical protein [Escherichia coli SE15] gi|330908988|gb|EGH37502.1| hypothetical protein ECAA86_03160 [Escherichia coli AA86] Length = 96 Score = 90.6 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|323136497|ref|ZP_08071579.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242] gi|322398571|gb|EFY01091.1| protein of unknown function DUF167 [Methylocystis sp. ATCC 49242] Length = 110 Score = 90.6 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 2/91 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +RL P + I +E D +K +V A P+ G+AN A++ ++AK L K++ Sbjct: 18 VWLRLTPKGGRDAIEGVETLSDG--RAVLKARVRAAPEDGRANAALIELIAKALRAPKNA 75 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + S ++S +K I+I D + L Sbjct: 76 VSIRSGETSRVKKIFIAGDSATYLDALAKLA 106 >gi|291615210|ref|YP_003525367.1| hypothetical protein Slit_2755 [Sideroxydans lithotrophicus ES-1] gi|291585322|gb|ADE12980.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus ES-1] Length = 94 Score = 90.6 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 42/90 (46%), Gaps = 7/90 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + P AK++ +A L +KI++ A P +G+AN+A+L +A+ + Sbjct: 11 ILTLTLHIQPGAKRTEVAGLH-------GAALKIRLAAPPIEGRANEALLKFIAESFGVP 63 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S K++ + E LL Sbjct: 64 LRQVELKQGGQSRHKVVAVTASKIEPESLL 93 >gi|89109730|ref|AP_003510.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|90111518|ref|NP_417428.2| conserved protein, UPF0235 family [Escherichia coli str. K-12 substr. MG1655] gi|161984863|ref|YP_409371.2| hypothetical protein SBO_3037 [Shigella boydii Sb227] gi|170082505|ref|YP_001731825.1| hypothetical protein ECDH10B_3128 [Escherichia coli str. K-12 substr. DH10B] gi|238902075|ref|YP_002927871.1| hypothetical protein BWG_2675 [Escherichia coli BW2952] gi|253772209|ref|YP_003035040.1| hypothetical protein ECBD_0787 [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|300947684|ref|ZP_07161853.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 116-1] gi|300954200|ref|ZP_07166665.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1] gi|301643693|ref|ZP_07243732.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 146-1] gi|307139638|ref|ZP_07498994.1| hypothetical protein EcolH7_16123 [Escherichia coli H736] gi|6920084|sp|P52060|YGGU_ECOLI RecName: Full=UPF0235 protein yggU gi|226730819|sp|B1XFB3|YGGU_ECODH RecName: Full=UPF0235 protein yggU gi|259710253|sp|C5A0M2|YGGU_ECOBW RecName: Full=UPF0235 protein yggU gi|85675763|dbj|BAE77016.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|87082189|gb|AAC75990.2| conserved protein, UPF0235 family [Escherichia coli str. K-12 substr. MG1655] gi|169890340|gb|ACB04047.1| conserved protein [Escherichia coli str. K-12 substr. DH10B] gi|238860521|gb|ACR62519.1| conserved protein [Escherichia coli BW2952] gi|253323253|gb|ACT27855.1| protein of unknown function DUF167 [Escherichia coli 'BL21-Gold(DE3)pLysS AG'] gi|260448004|gb|ACX38426.1| protein of unknown function DUF167 [Escherichia coli DH1] gi|284922896|emb|CBG35985.1| conserved hypothetical protein [Escherichia coli 042] gi|300318784|gb|EFJ68568.1| hypothetical protein HMPREF9547_00147 [Escherichia coli MS 175-1] gi|300452730|gb|EFK16350.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 116-1] gi|301077895|gb|EFK92701.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 146-1] gi|309703308|emb|CBJ02644.1| conserved hypothetical protein [Escherichia coli ETEC H10407] gi|315137550|dbj|BAJ44709.1| hypothetical protein ECDH1ME8569_2853 [Escherichia coli DH1] gi|315614885|gb|EFU95523.1| conserved hypothetical protein [Escherichia coli 3431] gi|320174051|gb|EFW49221.1| hypothetical protein SDB_03448 [Shigella dysenteriae CDC 74-1112] gi|320184300|gb|EFW59112.1| hypothetical protein SGF_03464 [Shigella flexneri CDC 796-83] gi|323173833|gb|EFZ59462.1| hypothetical protein ECLT68_2152 [Escherichia coli LT-68] gi|323936044|gb|EGB32339.1| yggU [Escherichia coli E1520] gi|332091357|gb|EGI96445.1| hypothetical protein SB359474_3531 [Shigella boydii 3594-74] Length = 96 Score = 90.6 bits (224), Expect = 6e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|254497376|ref|ZP_05110179.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254353429|gb|EET12161.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 91 Score = 90.6 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 37/79 (46%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AK + IA +KI++ A P +G+AN+A+L +A+ A+ Sbjct: 12 INLYIQPGAKHTEIAGFH-------GEALKIRLHAPPIEGRANEALLKFIAQIFAVPTRQ 64 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + S LK + I Sbjct: 65 VVLKRGDKSRLKTLIITGS 83 >gi|319899260|ref|YP_004159353.1| hypothetical protein BARCL_1102 [Bartonella clarridgeiae 73] gi|319403224|emb|CBI76783.1| conserved protein of unknown function [Bartonella clarridgeiae 73] Length = 110 Score = 90.6 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D H+ I++ A P+ GKAN+A++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVEDRDDGKQ--HLIIRLRAIPENGKANRALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y K KEI ++LQ+ Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEIEQILQS 98 >gi|262403916|ref|ZP_06080473.1| hypothetical protein VOA_001904 [Vibrio sp. RC586] gi|262349878|gb|EEY99014.1| hypothetical protein VOA_001904 [Vibrio sp. RC586] Length = 96 Score = 90.2 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKQCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPPEIAALIE 96 >gi|110643102|ref|YP_670832.1| hypothetical protein ECP_2947 [Escherichia coli 536] gi|117625180|ref|YP_854168.1| hypothetical protein APECO1_3568 [Escherichia coli APEC O1] gi|161485880|ref|NP_708718.2| hypothetical protein SF2944 [Shigella flexneri 2a str. 301] gi|161486124|ref|NP_755414.2| hypothetical protein c3539 [Escherichia coli CFT073] gi|161486442|ref|NP_838440.2| hypothetical protein S3148 [Shigella flexneri 2a str. 2457T] gi|170018806|ref|YP_001723760.1| hypothetical protein EcolC_0761 [Escherichia coli ATCC 8739] gi|170681809|ref|YP_001745114.1| hypothetical protein EcSMS35_3095 [Escherichia coli SMS-3-5] gi|191167930|ref|ZP_03029733.1| conserved hypothetical protein [Escherichia coli B7A] gi|191171828|ref|ZP_03033374.1| conserved hypothetical protein [Escherichia coli F11] gi|193063515|ref|ZP_03044604.1| conserved hypothetical protein [Escherichia coli E22] gi|193067470|ref|ZP_03048438.1| conserved hypothetical protein [Escherichia coli E110019] gi|194426264|ref|ZP_03058819.1| conserved hypothetical protein [Escherichia coli B171] gi|194431742|ref|ZP_03064033.1| conserved hypothetical protein [Shigella dysenteriae 1012] gi|194436768|ref|ZP_03068868.1| conserved hypothetical protein [Escherichia coli 101-1] gi|209920412|ref|YP_002294496.1| hypothetical protein ECSE_3221 [Escherichia coli SE11] gi|218550200|ref|YP_002383991.1| hypothetical protein EFER_2892 [Escherichia fergusonii ATCC 35469] gi|218555512|ref|YP_002388425.1| hypothetical protein ECIAI1_3086 [Escherichia coli IAI1] gi|218559944|ref|YP_002392857.1| hypothetical protein ECS88_3235 [Escherichia coli S88] gi|218691077|ref|YP_002399289.1| hypothetical protein ECED1_3416 [Escherichia coli ED1a] gi|218696551|ref|YP_002404218.1| hypothetical protein EC55989_3246 [Escherichia coli 55989] gi|218701663|ref|YP_002409292.1| hypothetical protein ECIAI39_3371 [Escherichia coli IAI39] gi|218706468|ref|YP_002413987.1| hypothetical protein ECUMN_3305 [Escherichia coli UMN026] gi|227888508|ref|ZP_04006313.1| protein of hypothetical function DUF167 [Escherichia coli 83972] gi|254162863|ref|YP_003045971.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606] gi|256019245|ref|ZP_05433110.1| hypothetical protein ShiD9_10034 [Shigella sp. D9] gi|260845623|ref|YP_003223401.1| hypothetical protein ECO103_3533 [Escherichia coli O103:H2 str. 12009] gi|260857086|ref|YP_003230977.1| hypothetical protein ECO26_4052 [Escherichia coli O26:H11 str. 11368] gi|260869640|ref|YP_003236042.1| hypothetical protein ECO111_3701 [Escherichia coli O111:H- str. 11128] gi|293406460|ref|ZP_06650386.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412] gi|293416214|ref|ZP_06658854.1| hypothetical protein ECDG_03817 [Escherichia coli B185] gi|293449283|ref|ZP_06663704.1| hypothetical protein ECCG_02314 [Escherichia coli B088] gi|297520255|ref|ZP_06938641.1| hypothetical protein EcolOP_21662 [Escherichia coli OP50] gi|298382197|ref|ZP_06991794.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302] gi|300815577|ref|ZP_07095801.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 107-1] gi|300824812|ref|ZP_07104916.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 119-7] gi|300900230|ref|ZP_07118414.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 198-1] gi|300906483|ref|ZP_07124178.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1] gi|300921295|ref|ZP_07137664.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 115-1] gi|300928104|ref|ZP_07143649.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 187-1] gi|300940765|ref|ZP_07155311.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 21-1] gi|300980105|ref|ZP_07174848.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 45-1] gi|300995466|ref|ZP_07181114.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1] gi|301027296|ref|ZP_07190642.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 69-1] gi|301027722|ref|ZP_07191032.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 196-1] gi|301049252|ref|ZP_07196225.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 185-1] gi|301306562|ref|ZP_07212624.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1] gi|301328107|ref|ZP_07221248.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 78-1] gi|306812143|ref|ZP_07446341.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101] gi|309794042|ref|ZP_07688467.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 145-7] gi|312972805|ref|ZP_07786978.1| conserved hypothetical protein [Escherichia coli 1827-70] gi|331659088|ref|ZP_08360030.1| conserved hypothetical protein [Escherichia coli TA206] gi|331669699|ref|ZP_08370545.1| conserved hypothetical protein [Escherichia coli TA271] gi|29839713|sp|Q8FE28|YGGU_ECOL6 RecName: Full=UPF0235 protein yggU gi|47117526|sp|Q83JS1|YGGU_SHIFL RecName: Full=UPF0235 protein yggU gi|123343643|sp|Q0TDP8|YGGU_ECOL5 RecName: Full=UPF0235 protein yggU gi|166227348|sp|A1AFD9|YGGU_ECOK1 RecName: Full=UPF0235 protein yggU gi|189030102|sp|B1IT54|YGGU_ECOLC RecName: Full=UPF0235 protein yggU gi|226730815|sp|B7MME1|YGGU_ECO45 RecName: Full=UPF0235 protein yggU gi|226730817|sp|B7NI16|YGGU_ECO7I RecName: Full=UPF0235 protein yggU gi|226730818|sp|B7LYY3|YGGU_ECO8A RecName: Full=UPF0235 protein yggU gi|226730820|sp|B7N7K6|YGGU_ECOLU RecName: Full=UPF0235 protein yggU gi|226730821|sp|B6I789|YGGU_ECOSE RecName: Full=UPF0235 protein yggU gi|226730822|sp|B1LDG2|YGGU_ECOSM RecName: Full=UPF0235 protein yggU gi|226730823|sp|B7LPS0|YGGU_ESCF3 RecName: Full=UPF0235 protein yggU gi|254814155|sp|B7LFL6|YGGU_ECO55 RecName: Full=UPF0235 protein yggU gi|254814156|sp|B7MZQ2|YGGU_ECO81 RecName: Full=UPF0235 protein yggU gi|110344694|gb|ABG70931.1| hypothetical protein YggU [Escherichia coli 536] gi|115514304|gb|ABJ02379.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|169753734|gb|ACA76433.1| protein of unknown function DUF167 [Escherichia coli ATCC 8739] gi|170519527|gb|ACB17705.1| conserved hypothetical protein [Escherichia coli SMS-3-5] gi|190902015|gb|EDV61761.1| conserved hypothetical protein [Escherichia coli B7A] gi|190907863|gb|EDV67456.1| conserved hypothetical protein [Escherichia coli F11] gi|192930792|gb|EDV83397.1| conserved hypothetical protein [Escherichia coli E22] gi|192959427|gb|EDV89862.1| conserved hypothetical protein [Escherichia coli E110019] gi|194415572|gb|EDX31839.1| conserved hypothetical protein [Escherichia coli B171] gi|194420098|gb|EDX36176.1| conserved hypothetical protein [Shigella dysenteriae 1012] gi|194424250|gb|EDX40237.1| conserved hypothetical protein [Escherichia coli 101-1] gi|209913671|dbj|BAG78745.1| conserved hypothetical protein [Escherichia coli SE11] gi|218353283|emb|CAU99245.1| conserved hypothetical protein [Escherichia coli 55989] gi|218357741|emb|CAQ90385.1| conserved hypothetical protein [Escherichia fergusonii ATCC 35469] gi|218362280|emb|CAQ99901.1| conserved hypothetical protein [Escherichia coli IAI1] gi|218366713|emb|CAR04470.1| conserved hypothetical protein [Escherichia coli S88] gi|218371649|emb|CAR19488.1| conserved hypothetical protein [Escherichia coli IAI39] gi|218428641|emb|CAR09570.2| conserved hypothetical protein [Escherichia coli ED1a] gi|218433565|emb|CAR14468.1| conserved hypothetical protein [Escherichia coli UMN026] gi|222034648|emb|CAP77390.1| UPF0235 protein yggU [Escherichia coli LF82] gi|227834777|gb|EEJ45243.1| protein of hypothetical function DUF167 [Escherichia coli 83972] gi|242378479|emb|CAQ33263.1| conserved protein [Escherichia coli BL21(DE3)] gi|253974764|gb|ACT40435.1| hypothetical protein ECB_02783 [Escherichia coli B str. REL606] gi|253978930|gb|ACT44600.1| hypothetical protein ECD_02783 [Escherichia coli BL21(DE3)] gi|257755735|dbj|BAI27237.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257760770|dbj|BAI32267.1| conserved predicted protein [Escherichia coli O103:H2 str. 12009] gi|257765996|dbj|BAI37491.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|291322373|gb|EFE61802.1| hypothetical protein ECCG_02314 [Escherichia coli B088] gi|291426466|gb|EFE99498.1| hypothetical protein ECGG_01757 [Escherichia coli FVEC1412] gi|291432403|gb|EFF05385.1| hypothetical protein ECDG_03817 [Escherichia coli B185] gi|298277337|gb|EFI18853.1| hypothetical protein ECFG_01943 [Escherichia coli FVEC1302] gi|299879158|gb|EFI87369.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 196-1] gi|300298948|gb|EFJ55333.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 185-1] gi|300304828|gb|EFJ59348.1| hypothetical protein HMPREF9553_04604 [Escherichia coli MS 200-1] gi|300356246|gb|EFJ72116.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 198-1] gi|300395102|gb|EFJ78640.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 69-1] gi|300401726|gb|EFJ85264.1| hypothetical protein HMPREF9536_04445 [Escherichia coli MS 84-1] gi|300409362|gb|EFJ92900.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 45-1] gi|300411757|gb|EFJ95067.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 115-1] gi|300454465|gb|EFK17958.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 21-1] gi|300463870|gb|EFK27363.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 187-1] gi|300522719|gb|EFK43788.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 119-7] gi|300531506|gb|EFK52568.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 107-1] gi|300838180|gb|EFK65940.1| hypothetical protein HMPREF9347_05170 [Escherichia coli MS 124-1] gi|300845411|gb|EFK73171.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 78-1] gi|305854181|gb|EFM54619.1| hypothetical protein ECNC101_09529 [Escherichia coli NC101] gi|307554935|gb|ADN47710.1| conserved hypothetical protein [Escherichia coli ABU 83972] gi|307625473|gb|ADN69777.1| hypothetical protein UM146_01750 [Escherichia coli UM146] gi|308122449|gb|EFO59711.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 145-7] gi|310332747|gb|EFP99960.1| conserved hypothetical protein [Escherichia coli 1827-70] gi|312947484|gb|ADR28311.1| hypothetical protein NRG857_14505 [Escherichia coli O83:H1 str. NRG 857C] gi|313648003|gb|EFS12449.1| hypothetical protein SF2457T_3601 [Shigella flexneri 2a str. 2457T] gi|315256845|gb|EFU36813.1| putative cytoplasmic protein [Escherichia coli MS 85-1] gi|315289499|gb|EFU48894.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 110-3] gi|315293933|gb|EFU53285.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 153-1] gi|315295642|gb|EFU54965.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 16-3] gi|320181042|gb|EFW55963.1| hypothetical protein SGB_01866 [Shigella boydii ATCC 9905] gi|320195072|gb|EFW69701.1| hypothetical protein EcoM_02835 [Escherichia coli WV_060327] gi|320202617|gb|EFW77187.1| hypothetical protein ECoL_00280 [Escherichia coli EC4100B] gi|323154662|gb|EFZ40861.1| hypothetical protein ECEPECA14_3462 [Escherichia coli EPECa14] gi|323162601|gb|EFZ48448.1| hypothetical protein ECE128010_1205 [Escherichia coli E128010] gi|323180413|gb|EFZ65965.1| hypothetical protein ECOK1180_1095 [Escherichia coli 1180] gi|323183524|gb|EFZ68921.1| hypothetical protein ECOK1357_3303 [Escherichia coli 1357] gi|323188653|gb|EFZ73938.1| hypothetical protein ECRN5871_3073 [Escherichia coli RN587/1] gi|323941961|gb|EGB38140.1| hypothetical protein ERDG_01736 [Escherichia coli E482] gi|323946549|gb|EGB42572.1| yggU [Escherichia coli H120] gi|323951609|gb|EGB47484.1| hypothetical protein ERKG_02252 [Escherichia coli H252] gi|323957323|gb|EGB53045.1| hypothetical protein ERLG_01390 [Escherichia coli H263] gi|323960754|gb|EGB56375.1| hypothetical protein ERGG_02689 [Escherichia coli H489] gi|323966470|gb|EGB61903.1| hypothetical protein ERJG_02059 [Escherichia coli M863] gi|323971760|gb|EGB66987.1| hypothetical protein ERHG_02238 [Escherichia coli TA007] gi|323978752|gb|EGB73833.1| hypothetical protein ERFG_00343 [Escherichia coli TW10509] gi|324005509|gb|EGB74728.1| hypothetical protein HMPREF9532_04861 [Escherichia coli MS 57-2] gi|324011796|gb|EGB81015.1| conserved hypothetical protein TIGR00251 [Escherichia coli MS 60-1] gi|324017226|gb|EGB86445.1| hypothetical protein HMPREF9542_04137 [Escherichia coli MS 117-3] gi|324115032|gb|EGC08997.1| hypothetical protein ERIG_00360 [Escherichia fergusonii B253] gi|324119748|gb|EGC13628.1| hypothetical protein ERBG_00336 [Escherichia coli E1167] gi|325498510|gb|EGC96369.1| hypothetical protein ECD227_2607 [Escherichia fergusonii ECD227] gi|327251721|gb|EGE63407.1| hypothetical protein ECSTEC7V_3572 [Escherichia coli STEC_7v] gi|331053670|gb|EGI25699.1| conserved hypothetical protein [Escherichia coli TA206] gi|331063367|gb|EGI35280.1| conserved hypothetical protein [Escherichia coli TA271] gi|332086882|gb|EGI92018.1| hypothetical protein SB521682_3508 [Shigella boydii 5216-82] gi|332087620|gb|EGI92747.1| hypothetical protein SD15574_3393 [Shigella dysenteriae 155-74] Length = 96 Score = 90.2 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|257062952|ref|YP_003142624.1| hypothetical protein Shel_02050 [Slackia heliotrinireducens DSM 20476] gi|256790605|gb|ACV21275.1| uncharacterized conserved protein [Slackia heliotrinireducens DSM 20476] Length = 106 Score = 90.2 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 22/87 (25%), Positives = 48/87 (55%), Gaps = 2/87 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P A+++ +A + D + + ++++VT P+ GKANKA+ LAK + +SK Sbjct: 14 TQIPIHATPKAQRNAVAGV--KADDTGRLEVQVRVTVAPEGGKANKAVCETLAKAIGVSK 71 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITE 88 S + ++ ++S K+ ++ +I Sbjct: 72 SKVSIVRGETSRHKMAQVEAPSADIEA 98 >gi|302038855|ref|YP_003799177.1| hypothetical protein NIDE3568 [Candidatus Nitrospira defluvii] gi|300606919|emb|CBK43252.1| conserved protein of unknown function DUF167 [Candidatus Nitrospira defluvii] Length = 105 Score = 90.2 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 38/90 (42%), Gaps = 8/90 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P A +S A L +KI++ A P G AN + LA+ + Sbjct: 20 VTISVHVQPKASRSECAGLH-------GHAVKIRIAAPPADGAANAELCRFLARCCEVPL 72 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S++ +LS S K + + +++ L Sbjct: 73 SAVHILSGAGSRQKRVLVKGRTAEQVRAQL 102 >gi|161367529|ref|NP_289525.2| hypothetical protein Z4298 [Escherichia coli O157:H7 str. EDL933] gi|162139760|ref|NP_311856.2| hypothetical protein ECs3829 [Escherichia coli O157:H7 str. Sakai] gi|168747555|ref|ZP_02772577.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4113] gi|168753905|ref|ZP_02778912.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4401] gi|168760095|ref|ZP_02785102.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4501] gi|168766960|ref|ZP_02791967.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4486] gi|168773408|ref|ZP_02798415.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4196] gi|168781812|ref|ZP_02806819.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4076] gi|168785811|ref|ZP_02810818.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC869] gi|168797528|ref|ZP_02822535.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC508] gi|195937091|ref|ZP_03082473.1| hypothetical protein EscherichcoliO157_11661 [Escherichia coli O157:H7 str. EC4024] gi|208807147|ref|ZP_03249484.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4206] gi|208813482|ref|ZP_03254811.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4045] gi|208818265|ref|ZP_03258585.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4042] gi|209399373|ref|YP_002272433.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4115] gi|217327282|ref|ZP_03443365.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. TW14588] gi|254794905|ref|YP_003079742.1| hypothetical protein ECSP_3924 [Escherichia coli O157:H7 str. TW14359] gi|261226265|ref|ZP_05940546.1| hypothetical protein EscherichiacoliO157_16963 [Escherichia coli O157:H7 str. FRIK2000] gi|261256477|ref|ZP_05949010.1| hypothetical protein EscherichiacoliO157EcO_11656 [Escherichia coli O157:H7 str. FRIK966] gi|291284274|ref|YP_003501092.1| hypothetical protein G2583_3612 [Escherichia coli O55:H7 str. CB9615] gi|29839727|sp|Q8XCU6|YGGU_ECO57 RecName: Full=UPF0235 protein yggU gi|226730816|sp|B5YQF1|YGGU_ECO5E RecName: Full=UPF0235 protein yggU gi|187770933|gb|EDU34777.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4196] gi|188017898|gb|EDU56020.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4113] gi|189000572|gb|EDU69558.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4076] gi|189358605|gb|EDU77024.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4401] gi|189363683|gb|EDU82102.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4486] gi|189369349|gb|EDU87765.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC4501] gi|189374116|gb|EDU92532.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC869] gi|189379790|gb|EDU98206.1| conserved hypothetical protein [Escherichia coli O157:H7 str. EC508] gi|208726948|gb|EDZ76549.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4206] gi|208734759|gb|EDZ83446.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4045] gi|208738388|gb|EDZ86070.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4042] gi|209160773|gb|ACI38206.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. EC4115] gi|217319649|gb|EEC28074.1| conserved hypothetical protein TIGR00251 [Escherichia coli O157:H7 str. TW14588] gi|254594305|gb|ACT73666.1| conserved protein [Escherichia coli O157:H7 str. TW14359] gi|290764147|gb|ADD58108.1| conserved hypothetical protein [Escherichia coli O55:H7 str. CB9615] gi|320189302|gb|EFW63961.1| hypothetical protein ECoD_04306 [Escherichia coli O157:H7 str. EC1212] gi|320640599|gb|EFX10138.1| hypothetical protein ECO5101_04264 [Escherichia coli O157:H7 str. G5101] gi|320645846|gb|EFX14831.1| hypothetical protein ECO9389_23751 [Escherichia coli O157:H- str. 493-89] gi|320651146|gb|EFX19586.1| hypothetical protein ECO2687_11738 [Escherichia coli O157:H- str. H 2687] gi|320656642|gb|EFX24538.1| hypothetical protein ECO7815_01895 [Escherichia coli O55:H7 str. 3256-97 TW 07815] gi|320662161|gb|EFX29562.1| hypothetical protein ECO5905_09833 [Escherichia coli O55:H7 str. USDA 5905] gi|320667236|gb|EFX34199.1| hypothetical protein ECOSU61_08919 [Escherichia coli O157:H7 str. LSU-61] gi|326338959|gb|EGD62774.1| hypothetical protein ECoA_04497 [Escherichia coli O157:H7 str. 1044] gi|326343159|gb|EGD66927.1| hypothetical protein ECF_02672 [Escherichia coli O157:H7 str. 1125] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + + K I I + Sbjct: 66 VVIEKGELGRHKQIKIINPQQ 86 >gi|293412313|ref|ZP_06655036.1| hypothetical protein ECEG_02320 [Escherichia coli B354] gi|291469084|gb|EFF11575.1| hypothetical protein ECEG_02320 [Escherichia coli B354] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|153828377|ref|ZP_01981044.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|229519844|ref|ZP_04409278.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80] gi|148876086|gb|EDL74221.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|229343132|gb|EEO08116.1| hypothetical protein VIF_000358 [Vibrio cholerae TM 11079-80] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|261210035|ref|ZP_05924333.1| hypothetical protein VCJ_000277 [Vibrio sp. RC341] gi|260840800|gb|EEX67342.1| hypothetical protein VCJ_000277 [Vibrio sp. RC341] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK+ ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKQCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|161950048|ref|YP_404624.2| hypothetical protein SDY_3119 [Shigella dysenteriae Sd197] gi|309785219|ref|ZP_07679850.1| conserved hypothetical protein [Shigella dysenteriae 1617] gi|308926339|gb|EFP71815.1| conserved hypothetical protein [Shigella dysenteriae 1617] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANGHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|149057384|gb|EDM08707.1| similar to RIKEN cDNA 3110040N11, isoform CRA_d [Rattus norvegicus] Length = 181 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 91 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 143 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 144 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 179 >gi|256024538|ref|ZP_05438403.1| hypothetical protein E4_14269 [Escherichia sp. 4_1_40B] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKLFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|121728574|ref|ZP_01681595.1| conserved hypothetical protein [Vibrio cholerae V52] gi|153802603|ref|ZP_01957189.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|229512520|ref|ZP_04401991.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21] gi|254291169|ref|ZP_04961965.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|121629130|gb|EAX61573.1| conserved hypothetical protein [Vibrio cholerae V52] gi|124121866|gb|EAY40609.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|150422863|gb|EDN14814.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229350413|gb|EEO15362.1| hypothetical protein VCB_000160 [Vibrio cholerae TMA 21] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|16761877|ref|NP_457494.1| hypothetical protein STY3255 [Salmonella enterica subsp. enterica serovar Typhi str. CT18] gi|29143364|ref|NP_806706.1| hypothetical protein t3014 [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|161616066|ref|YP_001590031.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|167552006|ref|ZP_02345759.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|168234347|ref|ZP_02659405.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|168236170|ref|ZP_02661228.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|168264452|ref|ZP_02686425.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|168463711|ref|ZP_02697628.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|194470418|ref|ZP_03076402.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194735973|ref|YP_002116050.1| hypothetical protein SeSA_A3276 [Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633] gi|198244387|ref|YP_002217077.1| hypothetical protein SeD_A3445 [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|200388001|ref|ZP_03214613.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Virchow str. SL491] gi|205354025|ref|YP_002227826.1| hypothetical protein SG2996 [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|207858363|ref|YP_002245014.1| hypothetical protein SEN2945 [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|213027996|ref|ZP_03342443.1| hypothetical protein Salmonelentericaenterica_39070 [Salmonella enterica subsp. enterica serovar Typhi str. 404ty] gi|213051778|ref|ZP_03344656.1| hypothetical protein Salmoneentericaenterica_01893 [Salmonella enterica subsp. enterica serovar Typhi str. E00-7866] gi|213422843|ref|ZP_03355881.1| hypothetical protein Salmonentericaenterica_35767 [Salmonella enterica subsp. enterica serovar Typhi str. E01-6750] gi|213424105|ref|ZP_03356998.1| hypothetical protein SentesTyphi_00030 [Salmonella enterica subsp. enterica serovar Typhi str. E02-1180] gi|213580619|ref|ZP_03362445.1| hypothetical protein SentesTyph_05132 [Salmonella enterica subsp. enterica serovar Typhi str. E98-0664] gi|213609099|ref|ZP_03368925.1| hypothetical protein SentesTyp_00567 [Salmonella enterica subsp. enterica serovar Typhi str. E98-2068] gi|213648208|ref|ZP_03378261.1| hypothetical protein SentesTy_13504 [Salmonella enterica subsp. enterica serovar Typhi str. J185] gi|213850158|ref|ZP_03381056.1| hypothetical protein SentesT_00598 [Salmonella enterica subsp. enterica serovar Typhi str. M223] gi|289825938|ref|ZP_06545097.1| hypothetical protein Salmonellentericaenterica_11175 [Salmonella enterica subsp. enterica serovar Typhi str. E98-3139] gi|29839732|sp|Q8Z3U7|YGGU_SALTI RecName: Full=UPF0235 protein yggU gi|189030121|sp|A9N4P8|YGGU_SALPB RecName: Full=UPF0235 protein yggU gi|226730825|sp|B5FUW5|YGGU_SALDC RecName: Full=UPF0235 protein yggU gi|226730826|sp|B5QY78|YGGU_SALEP RecName: Full=UPF0235 protein yggU gi|226730827|sp|B5RE62|YGGU_SALG2 RecName: Full=UPF0235 protein yggU gi|226730831|sp|B4TV71|YGGU_SALSV RecName: Full=UPF0235 protein yggU gi|25370114|pir||AF0878 conserved hypothetical protein STY3255 [imported] - Salmonella enterica subsp. enterica serovar Typhi (strain CT18) gi|16504179|emb|CAD02926.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi] gi|29138998|gb|AAO70566.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi str. Ty2] gi|161365430|gb|ABX69198.1| hypothetical protein SPAB_03867 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|194456782|gb|EDX45621.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194711475|gb|ACF90696.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633] gi|195633251|gb|EDX51665.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|197290920|gb|EDY30274.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Schwarzengrund str. SL480] gi|197938903|gb|ACH76236.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|199605099|gb|EDZ03644.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Virchow str. SL491] gi|205273806|emb|CAR38801.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205323332|gb|EDZ11171.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205331696|gb|EDZ18460.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|205347059|gb|EDZ33690.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar Hadar str. RI_05P066] gi|206710166|emb|CAR34522.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|326624849|gb|EGE31194.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Dublin str. 3246] gi|326629138|gb|EGE35481.1| UPF0235 protein yggU [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 96 Score = 90.2 bits (223), Expect = 9e-17, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|16766403|ref|NP_462018.1| hypothetical protein STM3102 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167990371|ref|ZP_02571471.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262086|ref|ZP_03162160.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|29839738|sp|Q8ZM46|YGGU_SALTY RecName: Full=UPF0235 protein yggU gi|16421655|gb|AAL21977.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197240341|gb|EDY22961.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205331149|gb|EDZ17913.1| conserved hypothetical protein TIGR00251 [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261248233|emb|CBG26070.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267995267|gb|ACY90152.1| hypothetical protein STM14_3746 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301159657|emb|CBW19176.1| conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312914124|dbj|BAJ38098.1| hypothetical protein STMDT12_C31550 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321225775|gb|EFX50829.1| UPF0235 protein VC [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323131458|gb|ADX18888.1| putative cytoplasmic protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] Length = 96 Score = 90.2 bits (223), Expect = 1e-16, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +KI +TA P G+AN + L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKIAITAPPVDGQANSHLTKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 66 IVIEKGELGRHKQVKI 81 >gi|254295256|ref|YP_003061279.1| hypothetical protein Hbal_2912 [Hirschia baltica ATCC 49814] gi|254043787|gb|ACT60582.1| protein of unknown function DUF167 [Hirschia baltica ATCC 49814] Length = 109 Score = 89.8 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 3/91 (3%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++ R+ PNA K + + +D + ++K++V A P KGKANKA+ +LA L KS Sbjct: 13 DIVARVTPNASKDAVEA--PEQDAAGRTYLKLRVRAIPDKGKANKAVEKLLASHFNLPKS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++ + LK I I D +++ L Sbjct: 71 KVAVVKGSTDRLKTIRIS-DGADLSSQLAEK 100 >gi|114326941|ref|YP_744098.1| putative cytoplasmic protein [Granulibacter bethesdensis CGDNIH1] gi|122328075|sp|Q0BVH7|Y277_GRABC RecName: Full=UPF0235 protein GbCGDNIH1_0277 gi|114315115|gb|ABI61175.1| hypothetical cytosolic protein [Granulibacter bethesdensis CGDNIH1] Length = 107 Score = 89.8 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 49/92 (53%), Gaps = 2/92 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R++P A+K G+ D +KI V+A KG+AN+A+ MLAK L + S Sbjct: 13 LALRVVPKARKIGLGGTVPGADGKP--RLKISVSAPADKGQANEAVRDMLAKALRVPASR 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + +L ++ K++ ++ D + + ++ S Sbjct: 71 ITLLQGLTARDKLVRVEGDPETLGSTVETLAS 102 >gi|262191069|ref|ZP_06049276.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93] gi|262033045|gb|EEY51576.1| hypothetical protein VIH_001440 [Vibrio cholerae CT 5369-93] Length = 96 Score = 89.8 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIIGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVIEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|322513549|ref|ZP_08066649.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC 25976] gi|322120620|gb|EFX92514.1| hypothetical protein HMPREF0027_0401 [Actinobacillus ureae ATCC 25976] Length = 99 Score = 89.8 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 39/90 (43%), Gaps = 8/90 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +T P G AN +L L+K + K Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITTPPVDGAANAHLLKYLSKLFKVPK 65 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 SS+ + + K +++ + K + + ++ Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLLPKEIE 94 >gi|27375661|ref|NP_767190.1| hypothetical protein bsl0550 [Bradyrhizobium japonicum USDA 110] gi|27348798|dbj|BAC45815.1| bsl0550 [Bradyrhizobium japonicum USDA 110] Length = 94 Score = 89.8 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 31/87 (35%), Positives = 47/87 (54%), Gaps = 2/87 (2%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P + I +E D +K++V A G+ANKA+L +LAK L + K+S+R+LS Sbjct: 2 TPRGGRDDIDGIEQLADG--RSVLKVRVRAIADGGEANKAVLVLLAKSLGVPKASVRLLS 59 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDS 95 +S LK I +D D + E L+ S Sbjct: 60 GATSRLKQIAVDGDPARLGETLRQLAS 86 >gi|241762300|ref|ZP_04760381.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241373203|gb|EER62833.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 113 Score = 89.8 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 28/91 (30%), Positives = 54/91 (59%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ A K+GI + DT+ +I+V A P +G +NK ++A L+K ++ K Sbjct: 19 IRLALRVTARASKTGITMFDK--DTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPK 76 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++R+ S + S +KI++I D K +TE+ ++ Sbjct: 77 GAVRIESGEHSKIKILHIAGDVKRLTEIAED 107 >gi|320161245|ref|YP_004174469.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1] gi|319995098|dbj|BAJ63869.1| hypothetical protein ANT_18430 [Anaerolinea thermophila UNI-1] Length = 105 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 46/76 (60%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A K+ I + D +KI++TA P +GKAN+A++ L++ L + ++S Sbjct: 19 ITVRVTPRASKNEI------YEILDDGTVKIRLTAPPVEGKANEALIDFLSEVLDVPRTS 72 Query: 64 LRMLSKQSSPLKIIYI 79 L +++ ++ KI+ + Sbjct: 73 LEIVAGETGRDKIVTV 88 >gi|289811236|ref|ZP_06541865.1| hypothetical protein Salmonellaentericaenterica_45672 [Salmonella enterica subsp. enterica serovar Typhi str. AG3] Length = 94 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN + L K+ ++KS Sbjct: 11 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLTKFLGKQFRVAKSQ 63 Query: 64 LRMLSKQSSPLKIIYI 79 + + + K + I Sbjct: 64 IVIEKGELGRHKQVKI 79 >gi|90407095|ref|ZP_01215284.1| hypothetical protein PCNPT3_02605 [Psychromonas sp. CNPT3] gi|90311817|gb|EAS39913.1| hypothetical protein PCNPT3_02605 [Psychromonas sp. CNPT3] Length = 96 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 38/87 (43%), Gaps = 8/87 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + L P A + L +KI +TA P G+ANK ++ L+K+ + K Sbjct: 12 LRLVLQPKASRDAFIGL-------LGDELKITITAPPVDGQANKHLIKFLSKQFKVPKRD 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + K+I I K+I + Sbjct: 65 ITVEKGLLNRHKLIRIK-SPKKIPDFF 90 >gi|224826207|ref|ZP_03699310.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002] gi|224601844|gb|EEG08024.1| protein of unknown function DUF167 [Lutiella nitroferrum 2002] Length = 108 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P AKK+ +A +K+++ A P +GKAN +LA LA++ + K Sbjct: 23 IRLTLHVQPGAKKTDLAG-------EHGGALKLRLAAPPVEGKANAMLLAWLAERFEVPK 75 Query: 62 SSLRMLSKQSSPLKIIYID 80 + +LS S KI+ I Sbjct: 76 RDVVLLSGDKSRHKIVEIK 94 >gi|225164758|ref|ZP_03726990.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] gi|224800632|gb|EEG18996.1| conserved hypothetical protein [Opitutaceae bacterium TAV2] Length = 111 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + ++ IPNA ++ IA +K+KV+A +G+AN+ + LA+ L + + Sbjct: 20 CILSIKAIPNASRNAIAGW-------LGDALKVKVSAPALEGRANEQLCDFLAETLGIPR 72 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 ++ + + S K I I D + L Sbjct: 73 RAVTVAGGEKSRQKRIQIIGLDLPAVRARLNA 104 >gi|223937408|ref|ZP_03629313.1| protein of unknown function DUF167 [bacterium Ellin514] gi|223893959|gb|EEF60415.1| protein of unknown function DUF167 [bacterium Ellin514] Length = 100 Score = 89.4 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 24/94 (25%), Positives = 41/94 (43%), Gaps = 15/94 (15%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++L P A + I ++IKVTA P AN+A+L +LA L + Sbjct: 16 LSIKLQPRASANQIG-------EPLGNELRIKVTAPPVDAAANEALLRLLADILKCPRGK 68 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + ++ +S K+I + L+ N LT Sbjct: 69 VELVRGHTSRHKVIKLHG--------LEANAVLT 94 >gi|319406122|emb|CBI79752.1| conserved hypothetical protein [Bartonella sp. AR 15-3] Length = 107 Score = 89.4 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 2/88 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D H+ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRGDGKQ--HLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K +Y K KEI ++LQ Sbjct: 70 ISLGSGATSHYKQLYFSKYIKEIEQILQ 97 >gi|149200997|ref|ZP_01877972.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035] gi|149145330|gb|EDM33356.1| hypothetical protein RTM1035_15267 [Roseovarius sp. TM1035] Length = 84 Score = 89.4 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 8/77 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P A ++ I ++ +++ VT P+ GKAN A+ +LAK L + KS Sbjct: 16 LTLRVTPKAARNRIV--------AEDDVLRVYVTTVPEDGKANAAVQKLLAKALGVPKSR 67 Query: 64 LRMLSKQSSPLKIIYID 80 L +L +S K+ +D Sbjct: 68 LTLLRGHTSRDKVFQVD 84 >gi|270157502|ref|ZP_06186159.1| conserved hypothetical protein [Legionella longbeachae D-4968] gi|269989527|gb|EEZ95781.1| conserved hypothetical protein [Legionella longbeachae D-4968] Length = 80 Score = 89.4 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 7/77 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 + + P AKKS I + +KI++ A P +G+ANK +L +A+ + S + Sbjct: 3 LYVQPGAKKSEIVGMH-------EGVLKIRLNAPPIEGRANKELLKYVAQLFKVPPSQVV 55 Query: 66 MLSKQSSPLKIIYIDKD 82 + S K++ + Sbjct: 56 LKRGDKSRHKVLLVKNS 72 >gi|293360116|ref|XP_002729708.1| PREDICTED: hypothetical protein [Rattus norvegicus] Length = 304 Score = 89.4 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 214 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 266 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 267 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 302 >gi|56551705|ref|YP_162544.1| hypothetical protein ZMO0809 [Zymomonas mobilis subsp. mobilis ZM4] gi|260752718|ref|YP_003225611.1| hypothetical protein Za10_0477 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|56543279|gb|AAV89433.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis ZM4] gi|258552081|gb|ACV75027.1| protein of unknown function DUF167 [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 113 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 54/91 (59%), Gaps = 2/91 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ A K+GI + DT+ +I+V A P +G +NK ++A L+K ++ K Sbjct: 19 IRLALRVTARASKTGITMFDK--DTAGRGLFRIRVAAPPVEGASNKNLMAYLSKSFSVPK 76 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++++ S + S +KI++I D K +TE+ ++ Sbjct: 77 GAVKIESGEHSKIKILHIAGDVKRLTEIAED 107 >gi|148251886|ref|YP_001236471.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1] gi|146404059|gb|ABQ32565.1| hypothetical protein BBta_0270 [Bradyrhizobium sp. BTAi1] Length = 111 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 49/90 (54%), Gaps = 2/90 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +R+ P + I +E D +K++V A G+AN+A+ +LAK + ++K++ Sbjct: 15 VALRVTPRGGRDAIDGIETLSDG--RSVLKLRVRAVADGGEANRAVTELLAKAIGVTKAA 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +R+ S ++ LK + I D + + L++ Sbjct: 73 VRITSGATARLKQVTITGDASRLDQALRDL 102 >gi|297697323|ref|XP_002825813.1| PREDICTED: UPF0235 protein C15orf40-like [Pongo abelii] Length = 242 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 9/98 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 152 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 204 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97 S + + S K++ + +EI E L+ + T Sbjct: 205 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKEATKT 242 >gi|298293297|ref|YP_003695236.1| hypothetical protein Snov_3343 [Starkeya novella DSM 506] gi|296929808|gb|ADH90617.1| protein of unknown function DUF167 [Starkeya novella DSM 506] Length = 106 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/90 (26%), Positives = 39/90 (43%), Gaps = 2/90 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + I D +K +V+ + GKAN A+ +LAK ++ S Sbjct: 16 VTVRATPRGGRDAIDGFVELGDG--RTALKARVSVAAEDGKANAALGKLLAKAAGIAPSR 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++S + K ++ D EI LQ Sbjct: 74 VDLVSGATGRTKAFKLNGDAAEIAARLQAL 103 >gi|302342363|ref|YP_003806892.1| hypothetical protein Deba_0928 [Desulfarculus baarsii DSM 2075] gi|301638976|gb|ADK84298.1| protein of unknown function DUF167 [Desulfarculus baarsii DSM 2075] Length = 95 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 43/87 (49%), Gaps = 8/87 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A + +A + +K+++ A P G+AN+A+L ++AK L+L + + Sbjct: 14 AVRVSPRASRDQLAG-------EEGGALKVRLCAPPVDGQANEALLRLVAKALSLPRRDV 66 Query: 65 RMLSKQSSPLKIIYIDKDC-KEITELL 90 + S S K + + +++ L Sbjct: 67 SLASGPRSRQKRLLVKGLGREQLLARL 93 >gi|298528115|ref|ZP_07015519.1| protein of unknown function DUF167 [Desulfonatronospira thiodismutans ASO3-1] gi|298511767|gb|EFI35669.1| protein of unknown function DUF167 [Desulfonatronospira thiodismutans ASO3-1] Length = 119 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 36/88 (40%), Gaps = 7/88 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V L P A + + + +KI V A P GKANKA+ L++ L + K Sbjct: 36 LRVVLKPGADRDEVLGIHA-------GRLKISVKAPPVDGKANKALCIFLSRSLGIRKKQ 88 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S K + + + L+ Sbjct: 89 VWIQRGLQSRNKDLIVSGVAGTVFNALK 116 >gi|300925051|ref|ZP_07140969.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1] gi|300418797|gb|EFK02108.1| hypothetical protein HMPREF9548_03158 [Escherichia coli MS 182-1] Length = 96 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLN-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|21674646|ref|NP_662711.1| hypothetical protein CT1832 [Chlorobium tepidum TLS] gi|29839718|sp|Q8KBF5|Y1832_CHLTE RecName: Full=UPF0235 protein CT1832 gi|21647849|gb|AAM73053.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 105 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 43/88 (48%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P + KSG+A + +KI + + P ANK +LAK L + +SS Sbjct: 13 LSVRVQPRSSKSGVAGM-------YGEQLKICLKSAPVDNAANKECCELLAKALGVPRSS 65 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ SS K++ ++ + E L Sbjct: 66 VSVMKGASSRSKVLKVEGVTPAAVREAL 93 >gi|331664536|ref|ZP_08365442.1| conserved hypothetical protein [Escherichia coli TA143] gi|331058467|gb|EGI30448.1| conserved hypothetical protein [Escherichia coli TA143] Length = 96 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKHIKIINPQQIPPEI 91 >gi|85704715|ref|ZP_01035816.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217] gi|85670533|gb|EAQ25393.1| hypothetical protein ROS217_06535 [Roseovarius sp. 217] Length = 84 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 41/77 (53%), Gaps = 8/77 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P A ++ I ++ +++ VT P+ GKAN A+ +LAK L + KS Sbjct: 16 IALRVTPKAARNRIV--------AEDGALRVYVTTVPEDGKANAAVQKLLAKALGVPKSR 67 Query: 64 LRMLSKQSSPLKIIYID 80 L +L +S K+ +D Sbjct: 68 LSLLRGHTSRDKVFQVD 84 >gi|88859056|ref|ZP_01133697.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2] gi|88819282|gb|EAR29096.1| hypothetical protein PTD2_08629 [Pseudoalteromonas tunicata D2] Length = 101 Score = 89.0 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 17/82 (20%), Positives = 35/82 (42%), Gaps = 7/82 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + P A + L +K+ +TA P G+AN ++ LAK+ ++ Sbjct: 10 ILTLRLYVQPKASQDKFIGLH-------GNELKVAITAPPVDGQANSHLIKFLAKQCKVA 62 Query: 61 KSSLRMLSKQSSPLKIIYIDKD 82 K+ + + K + I K Sbjct: 63 KNQVCIKKGLQGRHKEVQISKP 84 >gi|307310426|ref|ZP_07590074.1| protein of unknown function DUF167 [Escherichia coli W] gi|306909321|gb|EFN39816.1| protein of unknown function DUF167 [Escherichia coli W] gi|315062259|gb|ADT76586.1| conserved hypothetical protein [Escherichia coli W] gi|323377157|gb|ADX49425.1| protein of unknown function DUF167 [Escherichia coli KO11] Length = 96 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+AN ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQANSYLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|15640485|ref|NP_230112.1| hypothetical protein VC0458 [Vibrio cholerae O1 biovar El Tor str. N16961] gi|153823175|ref|ZP_01975842.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229509068|ref|ZP_04398556.1| hypothetical protein VCE_000471 [Vibrio cholerae B33] gi|229519736|ref|ZP_04409179.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9] gi|229606248|ref|YP_002876896.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236] gi|254850689|ref|ZP_05240039.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255744295|ref|ZP_05418248.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101] gi|262147280|ref|ZP_06028079.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1] gi|29839647|sp|Q9KUQ7|Y458_VIBCH RecName: Full=UPF0235 protein VC_0458 gi|9654883|gb|AAF93631.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|126519301|gb|EAZ76524.1| conserved hypothetical protein [Vibrio cholerae B33] gi|229344425|gb|EEO09400.1| hypothetical protein VCC_003768 [Vibrio cholerae RC9] gi|229353993|gb|EEO18927.1| hypothetical protein VCE_000471 [Vibrio cholerae B33] gi|229368903|gb|ACQ59326.1| hypothetical protein VCD_001149 [Vibrio cholerae MJ-1236] gi|254846394|gb|EET24808.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255738235|gb|EET93627.1| hypothetical protein VCH_000606 [Vibrio cholera CIRS 101] gi|262031274|gb|EEY49889.1| hypothetical protein VIG_000128 [Vibrio cholerae INDRE 91/1] Length = 96 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVVEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|121590704|ref|ZP_01678036.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|153819156|ref|ZP_01971823.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|153826562|ref|ZP_01979229.1| conserved hypothetical protein [Vibrio cholerae MZO-2] gi|227080668|ref|YP_002809219.1| hypothetical protein VCM66_0443 [Vibrio cholerae M66-2] gi|229507096|ref|ZP_04396602.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286] gi|298501011|ref|ZP_07010812.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|254803911|sp|C3LRX5|Y443_VIBCM RecName: Full=UPF0235 protein VCM66_0443 gi|121547435|gb|EAX57544.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|126510301|gb|EAZ72895.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|149739654|gb|EDM53868.1| conserved hypothetical protein [Vibrio cholerae MZO-2] gi|227008556|gb|ACP04768.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|229355841|gb|EEO20761.1| hypothetical protein VCF_002318 [Vibrio cholerae BX 330286] gi|297540259|gb|EFH76319.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 96 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 39/91 (42%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + + K + I + EI L++ Sbjct: 66 VVVEKGELGRHKQVRILQPSQIPAEIAALIE 96 >gi|312116202|ref|YP_004013798.1| hypothetical protein Rvan_3519 [Rhodomicrobium vannielii ATCC 17100] gi|311221331|gb|ADP72699.1| protein of unknown function DUF167 [Rhodomicrobium vannielii ATCC 17100] Length = 123 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 3/90 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A + +A +E +K VT P+ GKAN A++ ++A L + K Sbjct: 22 VLLHVRLTPKASSARVAGVEAFDGKP---VLKAYVTTPPEDGKANAALVVLVASWLGVPK 78 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 SS+ M + Q S LK + + ++ + Sbjct: 79 SSVSMAAGQKSRLKTVAVAGKADDLLAKIA 108 >gi|307261870|ref|ZP_07543532.1| hypothetical protein appser12_14270 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306868417|gb|EFN00232.1| hypothetical protein appser12_14270 [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 97 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +TA P G AN +L L+K + K Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITALPVDGAANAHLLKYLSKLFKVPK 65 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SS+ + + K +++ + K I + ++ Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLIPKEIEAL 96 >gi|73856987|gb|AAZ89694.1| conserved hypothetical protein [Shigella sonnei Ss046] Length = 100 Score = 88.6 bits (219), Expect = 2e-16, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 37/90 (41%), Gaps = 10/90 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+ N ++ L K+ ++KS Sbjct: 17 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQGNSHLVKFLGKQFRVAKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELL 90 + + + K I I EI L+ Sbjct: 70 VVIEKGELGRHKQIKIINPQQIPPEIAALI 99 >gi|114332111|ref|YP_748333.1| hypothetical protein Neut_2146 [Nitrosomonas eutropha C91] gi|122313207|sp|Q0AE64|Y2146_NITEC RecName: Full=UPF0235 protein Neut_2146 gi|114309125|gb|ABI60368.1| protein of unknown function DUF167 [Nitrosomonas eutropha C91] Length = 99 Score = 88.6 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 39/93 (41%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A+++ + +KIK+ A P GKAN+A+ LAK+ + Sbjct: 14 LKLYIQPGARQTEAIGVH-------GEELKIKLAAPPMDGKANRALAVFLAKRFNVPLKH 66 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQNN 93 + + S K++ I + + ++ Sbjct: 67 ITLKWGAQSRHKVVEIYQPVNGPEVLFNEIRAE 99 >gi|307824763|ref|ZP_07654986.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] gi|307734121|gb|EFO04975.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] Length = 88 Score = 88.6 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 8/85 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 + + P A K A L +K++V A P GKAN+ ++A +A + +SKS+ Sbjct: 2 NLHVQPKASKDEWAGLH-------GERLKLRVKAAPVDGKANQHLIAFIADEFGVSKSAC 54 Query: 65 RMLSKQSSPLKIIYIDKDCKEITEL 89 ++++ +S K I I K++ L Sbjct: 55 KLITGESGREKRIAI-NSPKKLPSL 78 >gi|320352454|ref|YP_004193793.1| hypothetical protein Despr_0318 [Desulfobulbus propionicus DSM 2032] gi|320120956|gb|ADW16502.1| protein of unknown function DUF167 [Desulfobulbus propionicus DSM 2032] Length = 110 Score = 88.6 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 46/90 (51%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P A + +A L+ +K++VT P GKAN+A++A LAK L KSS Sbjct: 21 LRLQVQPRAAANHLAGLQ-------GDMLKLRVTTPPVDGKANQAVVAYLAKLFHLPKSS 73 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + + S S K + I +E+ +L Sbjct: 74 VVLKSGHQSRGKTVVIASGHEQEVRAVLAA 103 >gi|53728760|ref|ZP_00135410.2| COG1872: Uncharacterized conserved protein [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|126208844|ref|YP_001054069.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae L20] gi|303250781|ref|ZP_07336976.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|303252447|ref|ZP_07338612.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|307246303|ref|ZP_07528382.1| hypothetical protein appser1_15050 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|307248416|ref|ZP_07530437.1| hypothetical protein appser2_13900 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|307250644|ref|ZP_07532582.1| hypothetical protein appser4_14180 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|307253024|ref|ZP_07534909.1| hypothetical protein appser6_15320 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307255287|ref|ZP_07537100.1| hypothetical protein appser9_15200 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307257450|ref|ZP_07539217.1| hypothetical protein appser10_14450 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|307259722|ref|ZP_07541443.1| hypothetical protein appser11_15170 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|166200384|sp|A3N228|Y1380_ACTP2 RecName: Full=UPF0235 protein APL_1380 gi|126097636|gb|ABN74464.1| hypothetical protein APL_1380 [Actinobacillus pleuropneumoniae serovar 5b str. L20] gi|302648720|gb|EFL78911.1| hypothetical protein APP2_1422 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302650386|gb|EFL80547.1| hypothetical protein APP6_1906 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306852773|gb|EFM84999.1| hypothetical protein appser1_15050 [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306855058|gb|EFM87240.1| hypothetical protein appser2_13900 [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306857316|gb|EFM89434.1| hypothetical protein appser4_14180 [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306859482|gb|EFM91510.1| hypothetical protein appser6_15320 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306861736|gb|EFM93717.1| hypothetical protein appser9_15200 [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306864030|gb|EFM95946.1| hypothetical protein appser10_14450 [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306866190|gb|EFM98057.1| hypothetical protein appser11_15170 [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 97 Score = 88.6 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 40/92 (43%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +TA P G AN +L L+K + K Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DNELKIAITALPVDGAANAHLLKYLSKLFKVPK 65 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 SS+ + + K +++ + K I + ++ Sbjct: 66 SSIVLEKGELQRHKQLFVP-EPKLIPKEIEAL 96 >gi|113461793|ref|YP_719862.1| hypothetical protein HS_1657 [Haemophilus somnus 129PT] gi|170718106|ref|YP_001785139.1| hypothetical protein HSM_1819 [Haemophilus somnus 2336] gi|123132241|sp|Q0I525|Y1657_HAES1 RecName: Full=UPF0235 protein HS_1657 gi|226696075|sp|B0UWD6|Y1819_HAES2 RecName: Full=UPF0235 protein HSM_1819 gi|112823836|gb|ABI25925.1| conserved hypothetical protein [Haemophilus somnus 129PT] gi|168826235|gb|ACA31606.1| protein of unknown function DUF167 [Haemophilus somnus 2336] Length = 99 Score = 88.6 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K + L +KI +TA P G+AN +L L+K ++KS Sbjct: 12 RLRIFLQPKASKDHLIGL-------YDNALKISITAPPIDGQANAHLLKFLSKTFKVAKS 64 Query: 63 SLRMLSKQSSPLKIIYI 79 + + + S K I I Sbjct: 65 QIILEKGELSRHKQILI 81 >gi|332039865|gb|EGI76260.1| hypothetical protein HGR_12177 [Hylemonella gracilis ATCC 19624] Length = 115 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 21/83 (25%), Positives = 41/83 (49%), Gaps = 7/83 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V + ++PNA ++ I L +K+++ A P GKAN+A+ LAK L++ Sbjct: 23 VLVDLHVMPNASRTQIQGL-------FDGALKVRLQAPPVDGKANEALRVWLAKTLSIPN 75 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84 SS+ + ++ K +++ Sbjct: 76 SSVTLQHGATARRKQLHVAAHSA 98 >gi|253701327|ref|YP_003022516.1| hypothetical protein GM21_2722 [Geobacter sp. M21] gi|251776177|gb|ACT18758.1| protein of unknown function DUF167 [Geobacter sp. M21] Length = 99 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 44/88 (50%), Gaps = 8/88 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V + P A +S I + +++++T+ P + ANK + ++AK L ++KS + Sbjct: 17 TVHVQPRASRSEICG-------AKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKV 69 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + S S K++ ++ D + LL+ Sbjct: 70 SIRSGAKSRHKVVKVEGVDHDALLSLLK 97 >gi|94266431|ref|ZP_01290126.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1] gi|93452973|gb|EAT03472.1| Protein of unknown function DUF167 [delta proteobacterium MLMS-1] Length = 121 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 45/88 (51%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR P A ++ +A + ++I+V A P GKAN+A+L LA + L +++ Sbjct: 38 LRVRAQPGAARTEVAG-------TYGARLRIRVAAPPVDGKANRALLTFLASRCGLVRNA 90 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ Q K+ ++ +++T L Sbjct: 91 VTLVGGQRGRDKLFRLEGIGPEQLTTCL 118 >gi|78186226|ref|YP_374269.1| hypothetical protein Plut_0338 [Chlorobium luteolum DSM 273] gi|123583481|sp|Q3B605|Y338_PELLD RecName: Full=UPF0235 protein Plut_0338 gi|78166128|gb|ABB23226.1| conserved hypothetical protein [Chlorobium luteolum DSM 273] Length = 101 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 39/91 (42%), Gaps = 7/91 (7%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P + KS ++ +K+ + A P AN+ + +K ++ S + Sbjct: 15 VRVQPRSSKSAVSG-------PYGNALKVTLKAAPVDDAANRECCRLFSKLFSIPDSRVS 67 Query: 66 MLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 ++S +S K + ++ E + N S+ Sbjct: 68 IVSGAASRTKSVMLEGLSAEEARSILRNSSI 98 >gi|197117881|ref|YP_002138308.1| hypothetical protein Gbem_1494 [Geobacter bemidjiensis Bem] gi|197087241|gb|ACH38512.1| protein of unknown function DUF167 [Geobacter bemidjiensis Bem] Length = 99 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 44/88 (50%), Gaps = 8/88 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V + P A +S I + +++++T+ P + ANK + ++AK L ++KS + Sbjct: 17 TVHVQPRASRSEICG-------AKEGELRLRLTSPPVEDAANKQCVELIAKTLGVAKSKV 69 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + S S K++ ++ D + LL+ Sbjct: 70 SIKSGAKSRHKVVKVEGVDHDALLSLLK 97 >gi|157743130|gb|AAI49509.1| C21H15orf40 protein [Bos taurus] Length = 132 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 42 VSIAIHAKPGSKQNAVTDVTT-------EAVSVAIAAPPTEGEANAELCRYLSKVLELRK 94 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +EI E L+ Sbjct: 95 SDVVLDKGGKSREKVVKLLASTPPEEILEKLKKQ 128 >gi|229525149|ref|ZP_04414554.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis VL426] gi|229338730|gb|EEO03747.1| hypothetical protein VCA_002760 [Vibrio cholerae bv. albensis VL426] Length = 96 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 10/91 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P GKAN + LAK ++K S Sbjct: 13 LRLYIQPKASRDSIVGLH-------GEELKVAITAPPIDGKANAHLSKYLAKLCKVAKGS 65 Query: 64 LRMLSKQSSPLKIIYIDKD---CKEITELLQ 91 + + + K + I + EI+ L++ Sbjct: 66 VVVEKGELGRHKQVRILQPSLIPAEISALIE 96 >gi|259089201|ref|NP_001158638.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss] gi|225705490|gb|ACO08591.1| UPF0235 protein C15orf40 [Oncorhynchus mykiss] Length = 182 Score = 88.3 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 21/96 (21%), Positives = 41/96 (42%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P +K++ I + I + + A P G+AN ++ L+K L L + Sbjct: 93 VTISVHAKPGSKQNAITDVSIEAVG-------VAIAAPPTGGEANAELVRYLSKVLELKR 145 Query: 62 SSLRMLSKQSSPLKIIYIDKD--CKEITELLQNNDS 95 S + + S KII + +++ + L+ S Sbjct: 146 SEVVLDKGSRSREKIIKVTGSLTPEQVLDRLKQEAS 181 >gi|293348273|ref|XP_002726816.1| PREDICTED: hypothetical protein [Rattus norvegicus] Length = 178 Score = 87.9 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 88 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 140 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 141 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 176 >gi|256379751|ref|YP_003103411.1| hypothetical protein Amir_5752 [Actinosynnema mirum DSM 43827] gi|255924054|gb|ACU39565.1| protein of unknown function DUF167 [Actinosynnema mirum DSM 43827] Length = 90 Score = 87.9 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 6/84 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M VR+ P +K+ + D + + V A +GKAN+A+ LAK + Sbjct: 1 MLKFAVRVKPGSKRDAVGG------RWDERALVVSVAAPAVEGKANEAVRRALAKAFGVR 54 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCK 84 + + ++S + K++ ID Sbjct: 55 RQDVEIVSGERGRDKVVVIDPAPD 78 >gi|319407608|emb|CBI81258.1| conserved hypothetical protein [Bartonella sp. 1-1C] Length = 101 Score = 87.9 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y K KE+ ++LQ+ Sbjct: 70 ISLGSGTTSHYKQLYFSKYLKEVEQILQS 98 >gi|288942590|ref|YP_003444830.1| hypothetical protein Alvin_2893 [Allochromatium vinosum DSM 180] gi|288897962|gb|ADC63798.1| protein of unknown function DUF167 [Allochromatium vinosum DSM 180] Length = 100 Score = 87.9 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 38/77 (49%), Gaps = 5/77 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P A K A + + ++++ A P +GKAN+A+ +A ++ S Sbjct: 13 LRLRVQPRAPKDAFA-----EPDPSGDYYRVRLKAPPVEGKANQALRRFVADAFEVTLSQ 67 Query: 64 LRMLSKQSSPLKIIYID 80 + +LS + + K + I Sbjct: 68 VEILSGEQARYKRLRIR 84 >gi|319404616|emb|CBI78222.1| conserved hypothetical protein [Bartonella rochalimae ATCC BAA-1498] Length = 101 Score = 87.9 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 31/89 (34%), Positives = 50/89 (56%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRLIP A I +E D ++ I++ A P+ GKANKA++ LAK+ + S Sbjct: 12 LFVRLIPKASVDSIIKVENRDDGKQ--YLIIRLRAIPENGKANKALIKFLAKQWKIPSSC 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + + S +S K +Y K KE+ ++LQ+ Sbjct: 70 ISLGSGATSHYKQLYFSKYLKEVEKILQS 98 >gi|254472462|ref|ZP_05085862.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958745|gb|EEA93945.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 105 Score = 87.9 bits (217), Expect = 4e-16, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 51/89 (57%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P + K I + D + K++ A P+KG ANKA+ A+ AK L++ KSS Sbjct: 16 ITVRLTPKSSKDQIEKIGAQSDGRPLVLAKVR--AVPEKGAANKAVAALFAKALSVPKSS 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +++ ++ +K + + + +++ + L++ Sbjct: 74 AELIAGSTARIKTLRVLGEPQDLAKRLED 102 >gi|114778113|ref|ZP_01453000.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1] gi|114551531|gb|EAU54085.1| hypothetical protein SPV1_00607 [Mariprofundus ferrooxydans PV-1] Length = 101 Score = 87.9 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 41/87 (47%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + P A+K + + +KI V Q GKAN+A++ +A L LS++ Sbjct: 16 VNIHAQPGARKPALRGMH-------GDALKIAVAEAAQDGKANEAIVRFIADALNLSRAD 68 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + S +S K +++ D E+ L Sbjct: 69 VDVASGHTSRRKRLFLHGDGSELRARL 95 >gi|161986454|ref|YP_311929.2| hypothetical protein SSON_3107 [Shigella sonnei Ss046] gi|323167995|gb|EFZ53684.1| hypothetical protein SS53G_1711 [Shigella sonnei 53G] Length = 96 Score = 87.9 bits (217), Expect = 5e-16, Method: Composition-based stats. Identities = 18/86 (20%), Positives = 36/86 (41%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P A + I L +K+ +TA P G+ N ++ L K+ ++KS Sbjct: 13 LRLYIQPKASRDSIVGLH-------GDEVKVAITAPPVDGQGNSHLVKFLGKQFRVAKSQ 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + + K I I + E+ Sbjct: 66 VVIEKGELGRHKQIKIINPQQIPPEI 91 >gi|301789543|ref|XP_002930186.1| PREDICTED: UPF0235 protein C15orf40 homolog [Ailuropoda melanoleuca] Length = 214 Score = 87.5 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 40/94 (42%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 124 VTIAIHAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 176 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +EI E L+ Sbjct: 177 SDVVLDKGGKSREKVVKLLASTTTEEILEKLKQQ 210 >gi|120602738|ref|YP_967138.1| hypothetical protein Dvul_1694 [Desulfovibrio vulgaris DP4] gi|120562967|gb|ABM28711.1| protein of unknown function DUF167 [Desulfovibrio vulgaris DP4] Length = 108 Score = 87.5 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 ++V + P AKK G+A + ++++++A KAN+ + +A L + + Sbjct: 20 LLVWVQPGAKKDGLAGV-------ADGRLRVRLSAPAVDNKANRGLERYMASLLGVRPAR 72 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + S Q+S K + I+ D + Sbjct: 73 VSVASGQTSRRKRVVIESDAE 93 >gi|145590207|ref|YP_001156804.1| hypothetical protein Pnuc_2029 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] gi|145048613|gb|ABP35240.1| protein of unknown function DUF167 [Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1] Length = 98 Score = 87.5 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 23/90 (25%), Positives = 46/90 (51%), Gaps = 9/90 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK + + L +KI + A + KAN+ +L L+K+L + + Sbjct: 15 LNLHCQPGAKVTKVVGLH-------DGCLKISLQAPAIENKANELLLGWLSKQLKIPQKQ 67 Query: 64 LRMLSKQSSPLKIIYIDKD--CKEITELLQ 91 ++ +S Q+S +K + I ++IT++LQ Sbjct: 68 IQFISGQNSRIKRVEIWGSITPEQITQILQ 97 >gi|301064653|ref|ZP_07205047.1| conserved hypothetical protein TIGR00251 [delta proteobacterium NaphS2] gi|300441273|gb|EFK05644.1| conserved hypothetical protein TIGR00251 [delta proteobacterium NaphS2] Length = 95 Score = 87.5 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 26/93 (27%), Positives = 45/93 (48%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+++P + ++ I +IK+TA P +GKANKA++ LAKK K Sbjct: 10 VVIRVKVLPRSSRTEIVG-------KTDGIYRIKLTAPPVEGKANKALINFLAKKTGSPK 62 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 +R++ + S K I I+ +I + L Sbjct: 63 QKIRIVKGEQSRNKTIRIENLSSDDILKYLNEE 95 >gi|46579785|ref|YP_010593.1| hypothetical protein DVU1374 [Desulfovibrio vulgaris str. Hildenborough] gi|46449200|gb|AAS95852.1| conserved hypothetical protein [Desulfovibrio vulgaris str. Hildenborough] gi|311233576|gb|ADP86430.1| protein of unknown function DUF167 [Desulfovibrio vulgaris RCH1] Length = 106 Score = 87.5 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 ++V + P AKK G+A + ++++++A KAN+ + +A L + + Sbjct: 18 LLVWVQPGAKKDGLAGV-------ADGRLRVRLSAPAVDNKANRGLERYMASLLGVRPAR 70 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + + S Q+S K + I+ D + Sbjct: 71 VSVASGQTSRRKRVVIESDAE 91 >gi|114763599|ref|ZP_01443004.1| hypothetical protein 1100011001330_R2601_13754 [Pelagibaca bermudensis HTCC2601] gi|114543879|gb|EAU46891.1| hypothetical protein R2601_13754 [Roseovarius sp. HTCC2601] Length = 85 Score = 87.5 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 8/80 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I + +++ VT P+ GKAN A+ +LAK L L K Sbjct: 14 ATLELRVTPKASRNEI--------REEGGTLRVYVTTVPEDGKANAAVQKLLAKALGLPK 65 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S L ++ +S K I+ Sbjct: 66 SRLVLVRGATSRDKAFRIEA 85 >gi|165976805|ref|YP_001652398.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|226734144|sp|B0BQW9|Y1398_ACTPJ RecName: Full=UPF0235 protein APJL_1398 gi|165876906|gb|ABY69954.1| hypothetical protein APJL_1398 [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 98 Score = 87.5 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + L P A + I L +KI +TA P G AN +L L+K + K Sbjct: 13 IRLRIFLQPKASRDQIVGLH-------DSELKIAITAPPVDGAANAHLLKYLSKLFKVPK 65 Query: 62 SSLRMLSKQSSPL-KIIYIDKDCKEITELLQNN 93 SS+ + + K +++ + K I + ++ Sbjct: 66 SSIVLEKGELQRHNKQLFVP-EPKLIPKEIEAL 97 >gi|167386103|ref|XP_001737619.1| hypothetical protein [Entamoeba dispar SAW760] gi|165899553|gb|EDR26129.1| hypothetical protein, conserved [Entamoeba dispar SAW760] Length = 118 Score = 87.5 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 47/93 (50%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + PNAK S + +E +K+ + A P GKAN ++A +A + K Sbjct: 33 VIIEIEIKPNAKTSELQGVE-------DGILKVAIDAPPIDGKANTEVIAFMASTFGIKK 85 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S++ ++ Q+S K + + +++ +++Q+ Sbjct: 86 SNVSLIKGQTSHHKTLQFENWTREKVLQIIQSK 118 >gi|225873890|ref|YP_002755349.1| hypothetical protein ACP_2305 [Acidobacterium capsulatum ATCC 51196] gi|225793057|gb|ACO33147.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC 51196] Length = 112 Score = 87.5 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 39/80 (48%), Gaps = 3/80 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A +S + + +++ + A P G+AN+ +L LA++L L Sbjct: 14 AVLAVRVTPRASRSSFQGV---LEKEGQTMLRVALHAPPIDGRANEELLDFLARQLDLPG 70 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 SSL ++ S K++ + Sbjct: 71 SSLEIIRGLQSREKLVRMTG 90 >gi|57640703|ref|YP_183181.1| hypothetical protein TK0768 [Thermococcus kodakarensis KOD1] gi|73921163|sp|Q5JHB2|Y768_PYRKO RecName: Full=UPF0235 protein TK0768 gi|57159027|dbj|BAD84957.1| hypothetical protein, conserved, YggU family [Thermococcus kodakarensis KOD1] Length = 94 Score = 87.5 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 45/90 (50%), Gaps = 9/90 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +++ + P AKK+ I ++ +K+++ A P +GKANK ++ +K L Sbjct: 11 VLIMIYVQPKAKKNAIEGVDG-----WRGRLKVRIAAPPVEGKANKEVVKFFSKLLG--- 62 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + + ++ ++S K + + +E+ + L Sbjct: 63 AEVNIVRGETSREKDLLVKGLSVEEVRKKL 92 >gi|212223789|ref|YP_002307025.1| hypothetical protein TON_0641 [Thermococcus onnurineus NA1] gi|226707988|sp|B6YUU2|Y641_THEON RecName: Full=UPF0235 protein TON_0641 gi|212008746|gb|ACJ16128.1| hypothetical protein, conserved [Thermococcus onnurineus NA1] Length = 94 Score = 87.1 bits (215), Expect = 6e-16, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 44/90 (48%), Gaps = 9/90 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +++ + P AKK+ I ++ +K+K+ A P +GKANK ++ +K L Sbjct: 11 AVILLYVQPKAKKNEIEGVD-----EWRGRLKVKIKAPPVEGKANKEVVRFFSKMLG--- 62 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + + ++ +S K + + KE+ + L Sbjct: 63 TEVEIIRGGTSREKDLLVKGFSSKEVLKKL 92 >gi|332107965|gb|EGJ09189.1| hypothetical protein RBXJA2T_02622 [Rubrivivax benzoatilyticus JA2] Length = 103 Score = 87.1 bits (215), Expect = 7e-16, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 7/89 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V ++PNA+++G L +++++ A P GKAN+ +LA LA +L L K Sbjct: 17 RLRVAVVPNARRTGADGLH-------DGALRVRLNAPPVDGKANETLLAWLADELDLPKR 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 ++R+ Q+ K I +D + + L Sbjct: 70 AVRLTHGQTGRRKTIELDAAPEAVAAWLA 98 >gi|328780498|ref|XP_392249.2| PREDICTED: UPF0235 protein C15orf40 homolog [Apis mellifera] Length = 144 Score = 87.1 bits (215), Expect = 8e-16, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P AK + I + + ++A P +G+AN ++ LA L + K Sbjct: 56 VTIKIQAKPGAKHNNITDISEDAVG-------VAISAPPVEGEANTELVKYLASVLGMRK 108 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S + + S KI+ + +++ E L+ Sbjct: 109 SDVTLDRGSKSRQKIVVVSGISVEKVLEKLKGE 141 >gi|84685306|ref|ZP_01013204.1| hypothetical protein 1099457000258_RB2654_10573 [Maritimibacter alkaliphilus HTCC2654] gi|84666463|gb|EAQ12935.1| hypothetical protein RB2654_10573 [Rhodobacterales bacterium HTCC2654] Length = 85 Score = 86.7 bits (214), Expect = 9e-16, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A ++ ++ + H+K+ VT P+ GKA A++ +LA L ++KS Sbjct: 16 LTVRVTPKASRNAVS-------VDEDGHLKVSVTTVPEDGKATAAVVKLLAHALGVAKSD 68 Query: 64 LRMLSKQSSPLKIIYID 80 L ++ +S K+ ID Sbjct: 69 LTLVRGATSRDKVFRID 85 >gi|170742084|ref|YP_001770739.1| hypothetical protein M446_3939 [Methylobacterium sp. 4-46] gi|226706076|sp|B0UH52|Y3939_METS4 RecName: Full=UPF0235 protein M446_3939 gi|168196358|gb|ACA18305.1| protein of unknown function DUF167 [Methylobacterium sp. 4-46] Length = 104 Score = 86.7 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 2/87 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + + +E D +K++V A P+ G AN A+ A+LA+ L + Sbjct: 14 VRVRATPRGGRDAVEGIETRADGLP--VLKVRVRAAPEDGAANAAIRAVLAEALGCPARA 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + ++ +K+ + D + + + Sbjct: 72 VTLAAGATARVKLFRVAGDGQALAARI 98 >gi|157123809|ref|XP_001653923.1| hypothetical protein AaeL_AAEL009675 [Aedes aegypti] gi|108874207|gb|EAT38432.1| conserved hypothetical protein [Aedes aegypti] Length = 149 Score = 86.7 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 10/96 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P AK +GI + +++ A P G+AN ++ L+K L L K Sbjct: 58 VLIKIQAKPGAKTNGITDIGEEGVG-------VQIAAPPVDGEANTELVKYLSKLLELRK 110 Query: 62 SSLRMLSKQSSPLKIIYIDKD---CKEITELLQNND 94 S + + S K I ++K +++ ++ + Sbjct: 111 SDVSLDRGSKSRQKTIVLEKGCRMPEQVLDVFRKEA 146 >gi|171057075|ref|YP_001789424.1| hypothetical protein Lcho_0384 [Leptothrix cholodnii SP-6] gi|170774520|gb|ACB32659.1| protein of unknown function DUF167 [Leptothrix cholodnii SP-6] Length = 125 Score = 86.7 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 7/83 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V ++PNA+++ + L +++++ A P G AN A+ LA +L +SK Sbjct: 35 IDVAVVPNARRTEVVGLHDQA-------LRLRLAAPPVDGAANDALQRWLADELGVSKQQ 87 Query: 64 LRMLSKQSSPLKIIYIDKDCKEI 86 + +L S K + + ++ Sbjct: 88 VSLLRGASGRRKRLRVQVPPAQM 110 >gi|67482694|ref|XP_656664.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56473879|gb|EAL51278.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 118 Score = 86.7 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + PNAK S I +E +K+ + + P GKAN ++A +A + K Sbjct: 33 VIIEVEIKPNAKTSEIQGVE-------DGLLKVSINSPPVDGKANTEVIAFMASTFGIKK 85 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S+++++ Q+S K + + +++ +++Q Sbjct: 86 SNVKLIKGQTSHHKTLQFENWTREKVLQIIQAK 118 >gi|310822209|ref|YP_003954567.1| hypothetical protein STAUR_4962 [Stigmatella aurantiaca DW4/3-1] gi|309395281|gb|ADO72740.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 98 Score = 86.7 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 43/94 (45%), Gaps = 7/94 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P A ++ + +K+++ A P G+AN A++ LAK+L L + Sbjct: 12 VELAVLVQPRASRTRVVG-------EHDGMLKLQLAAPPVDGEANAALVEFLAKRLGLPR 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + +++ ++ K +++ E + + S Sbjct: 65 RQVTLVAGDAARRKRVFLAGVDAARVEAVMSQAS 98 >gi|149456878|ref|XP_001519576.1| PREDICTED: similar to chromosome 15 open reading frame 40, partial [Ornithorhynchus anatinus] Length = 119 Score = 86.3 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 24/98 (24%), Positives = 39/98 (39%), Gaps = 9/98 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P AK++ + + + + + A P +G+AN + LAK L L K Sbjct: 29 VTIAVHAKPGAKQNAVTDVSVEAVG-------VAIAAPPSEGEANAELCRYLAKILELRK 81 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97 S + + S K+I I EI L+ T Sbjct: 82 SDVVLDRGGKSREKVIKILSSTTPDEILAKLKKQTETT 119 >gi|154705806|ref|YP_001424486.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111] gi|154355092|gb|ABS76554.1| hypothetical cytosolic protein [Coxiella burnetii Dugway 5J108-111] Length = 92 Score = 86.3 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 26/79 (32%), Positives = 46/79 (58%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P AK++ I+ H+KI++ A P +GKANKA++ LA++L L+ SS Sbjct: 7 LTVYIQPGAKQTQISGKH-------GEHIKIRLQAPPTEGKANKALIDFLAQRLKLNPSS 59 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + ++ + + LK I I+ Sbjct: 60 ITIIRGEKARLKTIAIESS 78 >gi|85859583|ref|YP_461785.1| putative cytoplasmic protein [Syntrophus aciditrophicus SB] gi|85722674|gb|ABC77617.1| hypothetical cytosolic protein [Syntrophus aciditrophicus SB] Length = 118 Score = 86.3 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 7/83 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V ++P + K +A + ++IK+TA P GKAN L LA L + K + Sbjct: 17 VHVLPRSAKCALAG-------AQEGALRIKLTAPPVDGKANDECLEFLAGILGVKKGQMD 69 Query: 66 MLSKQSSPLKIIYIDKDCKEITE 88 ++S +S KI+ I +E E Sbjct: 70 IISGHTSRRKIVQIMNVPREPLE 92 >gi|194334667|ref|YP_002016527.1| hypothetical protein Paes_1868 [Prosthecochloris aestuarii DSM 271] gi|226696156|sp|B4S4I4|Y1868_PROA2 RecName: Full=UPF0235 protein Paes_1868 gi|194312485|gb|ACF46880.1| protein of unknown function DUF167 [Prosthecochloris aestuarii DSM 271] Length = 104 Score = 86.3 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 7/86 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V+ P + +S I +KI + A P AN LAK L ++ S + Sbjct: 14 FVKAQPRSSRSAIIG-------EYDGKIKISLKAAPVDDAANVECCRFLAKSLGVASSRV 66 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90 R+LS SS +K + ID L Sbjct: 67 RILSGHSSRIKRLTIDGMGAAEAATL 92 >gi|189347519|ref|YP_001944048.1| hypothetical protein Clim_2040 [Chlorobium limicola DSM 245] gi|189341666|gb|ACD91069.1| protein of unknown function DUF167 [Chlorobium limicola DSM 245] Length = 101 Score = 85.9 bits (212), Expect = 1e-15, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 8/88 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V+ P + KS I + +K+ + A P AN+ A+ AK S L Sbjct: 13 QVKAQPRSSKSRITG-------AYDRGVKVTLKAAPVDDAANEECCALFAKVFGFPVSRL 65 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELLQ 91 ++S +SS K + ++ +E++ LL+ Sbjct: 66 CIVSGRSSRNKTLRVEGTSAEEVSRLLR 93 >gi|251793930|ref|YP_003008662.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus NJ8700] gi|247535329|gb|ACS98575.1| hypothetical protein NT05HA_2269 [Aggregatibacter aphrophilus NJ8700] Length = 97 Score = 85.9 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + L P A K I L +KI++TA P G+AN +L L+K + KS Sbjct: 12 RLRIFLQPKAAKDHIVGLH-------DDELKIRITAPPIDGQANAHLLKFLSKLFKVPKS 64 Query: 63 SLRMLSKQSSPLKIIY 78 S+ + + + K + Sbjct: 65 SIVLEKGELNCHKQVL 80 >gi|221632637|ref|YP_002521858.1| hypothetical protein trd_0618 [Thermomicrobium roseum DSM 5159] gi|221157137|gb|ACM06264.1| DUF167 [Thermomicrobium roseum DSM 5159] Length = 102 Score = 85.6 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 28/87 (32%), Positives = 49/87 (56%), Gaps = 8/87 (9%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V++ P A ++ +A S + ++VTA P+ G+AN+A+L +LA+ L L + S+R Sbjct: 17 VQVQPRAPRAEVAG-------SRRDALLVRVTAPPRDGEANEAVLRLLAETLHLPRGSIR 69 Query: 66 MLSKQSSPLKIIYIDK-DCKEITELLQ 91 +++ + K I ID KE+ E L Sbjct: 70 IIAGTAQRRKRIRIDGLTSKELLERLA 96 >gi|158424942|ref|YP_001526234.1| hypothetical protein AZC_3318 [Azorhizobium caulinodans ORS 571] gi|158331831|dbj|BAF89316.1| protein of unknown function [Azorhizobium caulinodans ORS 571] Length = 104 Score = 85.6 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 2/89 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + + + D +KI+V A P+ G A A+ +LA ++ S+ Sbjct: 13 VSVRATPKGGRDALDGVSQLSDG--RDVLKIRVRAAPEDGAATAAVAKVLAGAAGVAPSA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 +R+ S ++ LK+ I D + L+ Sbjct: 71 VRLASGATARLKVFRISGDAARLRATLEA 99 >gi|170029973|ref|XP_001842865.1| conserved hypothetical protein [Culex quinquefasciatus] gi|167865325|gb|EDS28708.1| conserved hypothetical protein [Culex quinquefasciatus] Length = 157 Score = 85.6 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 42/97 (43%), Gaps = 10/97 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K +GI +E +++ A P G+AN ++ LAK L L K Sbjct: 65 VLIKILAKPGSKFNGITGIEDEGVG-------VQIAAPPIDGEANTELVKYLAKLLDLRK 117 Query: 62 SSLRMLSKQSSPLKIIYID---KDCKEITELLQNNDS 95 S + + S K I ++ + ++ E+ + + Sbjct: 118 SDVSLDRGSKSRQKTIVLEKGCRTPDQVLEIFRREAT 154 >gi|83945079|ref|ZP_00957445.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii HTCC2633] gi|83851861|gb|EAP89716.1| hypothetical protein OA2633_10629 [Oceanicaulis alexandrii HTCC2633] Length = 101 Score = 85.6 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 4/94 (4%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VR+ P A ++G D + +V A P KG AN + A+ AK L + KS Sbjct: 6 RLFVRVQPRASRAGFDGARAGTDGRI--RLAARVRAAPDKGAANTELCALTAKTLGVPKS 63 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 ++ +++ + K + + +E +LL +L Sbjct: 64 TVSVIAGATQREKTLLVR--SEESIQLLDAVTAL 95 >gi|303245495|ref|ZP_07331779.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans JJ] gi|302493344|gb|EFL53206.1| protein of unknown function DUF167 [Desulfovibrio fructosovorans JJ] Length = 113 Score = 85.6 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 40/81 (49%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P + +A L +++++ A +G+AN A+ A LA+ + Sbjct: 28 LRVAVAPGGSRDALAGL-------AEDRLRVRLRAKAVEGQANAALTAFLAECFGVRPRQ 80 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 +R++S + S KI+ I+ + + Sbjct: 81 VRIVSGEKSRKKIVRINAESE 101 >gi|220923538|ref|YP_002498840.1| hypothetical protein Mnod_3628 [Methylobacterium nodulans ORS 2060] gi|219948145|gb|ACL58537.1| protein of unknown function DUF167 [Methylobacterium nodulans ORS 2060] Length = 102 Score = 85.6 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 2/81 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR P + I +E D +K++V A P+ G AN A+ +L L + Sbjct: 12 VRVRATPRGGRDAIDGIETRADGLS--VLKVRVRAAPEDGAANTAIRDLLKTALGCPARA 69 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 +R+ + ++ +KI I+ D + Sbjct: 70 VRLTAGATARVKIFRIEGDGE 90 >gi|157826313|ref|YP_001494033.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford] gi|166228807|sp|A8GQ50|Y6510_RICAH RecName: Full=UPF0235 protein A1C_06510 gi|157800271|gb|ABV75525.1| hypothetical protein A1C_06510 [Rickettsia akari str. Hartford] Length = 105 Score = 85.6 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 50/84 (59%), Gaps = 3/84 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ PN+K++ I++ I + ++K+ + ATP+KGKAN+ ++ LAK LS+ Sbjct: 14 VLLNLKVKPNSKQNLISNFVIINNIP---YLKLSIKATPEKGKANEEIINYLAKAWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 S++ ++ + +K I I ++ Sbjct: 71 SNIEIIKGHTHSVKTILIKNINED 94 >gi|296109159|ref|YP_003616108.1| protein of unknown function DUF167 [Methanocaldococcus infernus ME] gi|295433973|gb|ADG13144.1| protein of unknown function DUF167 [Methanocaldococcus infernus ME] Length = 92 Score = 85.2 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 45/90 (50%), Gaps = 9/90 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + PNAKK+ I + +++KV A P +GKANK ++ +K Sbjct: 11 LDIIVTPNAKKTEIVGRD-----EWRNRLEVKVKAPPVEGKANKEIIKFFSKLFG----D 61 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + +++ + S K I I K KE+ E+L + Sbjct: 62 VEIVAGEKSSKKTILIRKPLKEVEEILNSL 91 >gi|311260654|ref|XP_001929217.2| PREDICTED: UPF0235 protein C15orf40 homolog [Sus scrofa] Length = 154 Score = 85.2 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L K Sbjct: 64 VTIAIHAKPGSKQNAVTDLTT-------EAVSVAIAAPPSEGEANAELCRYLSKVFELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +EI E L+ Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKQ 150 >gi|238650830|ref|YP_002916685.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic] gi|259647069|sp|C4K236|Y4990_RICPU RecName: Full=UPF0235 protein RPR_04990 gi|238624928|gb|ACR47634.1| hypothetical protein RPR_04990 [Rickettsia peacockii str. Rustic] Length = 105 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 49/84 (58%), Gaps = 3/84 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ PN+K++ I++ I + ++K+ + A P++GKAN+ ++ LAK+ LS+ Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIP---YLKLSIKAIPEQGKANEEIINYLAKEWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 S++ ++ + LK I I ++ Sbjct: 71 SNIEIIKGHTHSLKTILIKNINED 94 >gi|109082186|ref|XP_001111768.1| PREDICTED: UPF0235 protein C15orf40-like isoform 1 [Macaca mulatta] Length = 154 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 41/98 (41%), Gaps = 9/98 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTITIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSLT 97 S + + S K++ + +EI E L+ T Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPEEILEKLKKEARKT 154 >gi|152991249|ref|YP_001356971.1| hypothetical protein NIS_1507 [Nitratiruptor sp. SB155-2] gi|151423110|dbj|BAF70614.1| conserved hypothetical protein [Nitratiruptor sp. SB155-2] Length = 95 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 7/77 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ ++ PNA K+ IA + +KI + A +G ANK ++ L+K ++K Sbjct: 10 VHMFIKAQPNASKNKIAGI-------LGDSLKIAIKAPAVEGAANKELVKFLSKTFKVAK 62 Query: 62 SSLRMLSKQSSPLKIIY 78 S + S ++S K I Sbjct: 63 SDIVFASGETSKRKHIV 79 >gi|126733001|ref|ZP_01748760.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37] gi|126706530|gb|EBA05608.1| hypothetical protein SSE37_15953 [Sagittula stellata E-37] Length = 84 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 20/78 (25%), Positives = 39/78 (50%), Gaps = 8/78 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + +++ VT P+ GKA A++ +LAK L + K Sbjct: 14 ATLAVRVTPKASRNAVE--------RTDDALRVYVTTVPEGGKATAAVVKLLAKALGVPK 65 Query: 62 SSLRMLSKQSSPLKIIYI 79 S L ++ ++S K+ + Sbjct: 66 SRLELVRGETSRDKVFRV 83 >gi|88812461|ref|ZP_01127710.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231] gi|88790247|gb|EAR21365.1| hypothetical protein NB231_13261 [Nitrococcus mobilis Nb-231] Length = 99 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 37/78 (47%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A + + +++++TA P +GKAN+ + L L +++S Sbjct: 13 LTVRVQPRAARDEL-------KIDADGRLRLRITAPPVEGKANEHLRHFLGHALGVARSQ 65 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + + +S K I + Sbjct: 66 VSVATGATSRNKRIVVQN 83 >gi|126306449|ref|XP_001373757.1| PREDICTED: similar to chromosome 15 open reading frame 40, [Monodelphis domestica] Length = 146 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 9/94 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K++ I + + + + A P +G+AN + L+K L L KS Sbjct: 58 IAIHAKPGSKQNAITDVTTEN-------VSVAIAAPPSEGEANTELCRYLSKVLELRKSD 110 Query: 64 LRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 + + S K++ I +EI E L+ Sbjct: 111 VILDKGGKSREKVVKILASTTPEEILEKLKRQAE 144 >gi|32475275|ref|NP_868269.1| hypothetical protein RB8260 [Rhodopirellula baltica SH 1] gi|47117454|sp|Q7UFY2|Y8260_RHOBA RecName: Full=UPF0235 protein RB8260 gi|32445816|emb|CAD78547.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 108 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 27/76 (35%), Positives = 44/76 (57%), Gaps = 7/76 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P AKK+ + L +K+ V P+ GKANKA++A LAK L +SK + Sbjct: 21 RVRVTPKAKKASVGGLH-------DGALKVSVHTVPEDGKANKAVIASLAKWLRVSKGRV 73 Query: 65 RMLSKQSSPLKIIYID 80 +++ ++S LK I ++ Sbjct: 74 AIVAGETSRLKTIVVE 89 >gi|189500987|ref|YP_001960457.1| hypothetical protein Cphamn1_2066 [Chlorobium phaeobacteroides BS1] gi|226701149|sp|B3EMY7|Y2066_CHLPB RecName: Full=UPF0235 protein Cphamn1_2066 gi|189496428|gb|ACE04976.1| protein of unknown function DUF167 [Chlorobium phaeobacteroides BS1] Length = 101 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 39/87 (44%), Gaps = 8/87 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 ++ P + KS I +K+ + A P G+AN +LA+ L +++SS+ Sbjct: 14 SIKAQPRSSKSMITG-------EYDGSIKVNLKAPPVDGEANLECCRLLARTLGVARSSV 66 Query: 65 RMLSKQSSPLKIIYIDK-DCKEITELL 90 ++S +K + + E TE + Sbjct: 67 EIVSGTRGKMKRVKVFGLSAVEFTEKI 93 >gi|237749382|ref|ZP_04579862.1| predicted protein [Oxalobacter formigenes OXCC13] gi|229380744|gb|EEO30835.1| predicted protein [Oxalobacter formigenes OXCC13] Length = 100 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 28/91 (30%), Positives = 50/91 (54%), Gaps = 10/91 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++ PNAKK+ I +SD ++I++ A P GKAN+A++ +AKKL K Sbjct: 14 RIAVQVSPNAKKTEIV-------SSDGEALRIRLQAPPVDGKANEALVQFIAKKLRTPKR 66 Query: 63 SLRMLSKQSSPLKIIYI---DKDCKEITELL 90 ++ + S+ K++ I D +E+ + L Sbjct: 67 NVSITHGLSAKHKLLEIGLPDIPEEELEKQL 97 >gi|50753027|ref|XP_413838.1| PREDICTED: similar to Chromosome 15 open reading frame 40 [Gallus gallus] Length = 158 Score = 85.2 bits (210), Expect = 3e-15, Method: Composition-based stats. Identities = 22/97 (22%), Positives = 39/97 (40%), Gaps = 9/97 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V VR P ++ S + + + + A P +G+AN + L+K L + K Sbjct: 69 VRVSVRAKPGSRCSAVTDVTAEAVG-------VAIAAPPSEGEANAELCRYLSKVLGVKK 121 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDSL 96 S + + S K++ I E+ E L+ S Sbjct: 122 SDVILEKGGKSRDKVVKILVSVTPDEVLEKLKKEAST 158 >gi|67005257|gb|AAY62183.1| Conserved hypothetical protein [Rickettsia felis URRWXCal2] Length = 110 Score = 84.8 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 51/89 (57%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ PN+K++ I+ I + ++K+ + ATP++GKAN+ ++ LAK+ LS+ Sbjct: 19 VLLNLKVKPNSKQNLISDFVIINNIP---YLKLSIKATPEQGKANEEIINYLAKEWKLSR 75 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ ++ LK I I ++ L+ Sbjct: 76 KDIEIIKGHTNSLKTILIKNIDEDYLNLI 104 >gi|163795292|ref|ZP_02189259.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199] gi|159179278|gb|EDP63809.1| hypothetical protein BAL199_14277 [alpha proteobacterium BAL199] Length = 112 Score = 84.8 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 3/89 (3%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ P A I + D + +++ VTA P+ G+ANKA+ A+LAK+ + KSS+ Sbjct: 17 AVKVTPKAAADRIRGVVQ--DEAGVAWLQVSVTAVPEDGRANKAVTALLAKRWRVPKSSI 74 Query: 65 RMLSKQSSPLKIIYI-DKDCKEITELLQN 92 ++ + K++ + D +T LQ Sbjct: 75 EIVQGTTERRKVLLVRSDDTAALTARLQT 103 >gi|78357213|ref|YP_388662.1| hypothetical protein Dde_2170 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219618|gb|ABB38967.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 118 Score = 84.8 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 42/81 (51%), Gaps = 7/81 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V P AK SGIA L ++I+++A KANK ++ +A+ + ++ Sbjct: 16 RLKVWAQPGAKHSGIAGL-------YDGRVRIRLSAPAVDNKANKELIRFVAQLCGVKQN 68 Query: 63 SLRMLSKQSSPLKIIYIDKDC 83 +R+ S SS K++ I++D Sbjct: 69 RVRLESGVSSRKKVLLIERDT 89 >gi|162022116|ref|YP_247348.2| hypothetical protein RF_1332 [Rickettsia felis URRWXCal2] gi|126253831|sp|Q4UJV6|Y1332_RICFE RecName: Full=UPF0235 protein RF_1332 Length = 105 Score = 84.8 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 51/89 (57%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ PN+K++ I+ I + ++K+ + ATP++GKAN+ ++ LAK+ LS+ Sbjct: 14 VLLNLKVKPNSKQNLISDFVIINNIP---YLKLSIKATPEQGKANEEIINYLAKEWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ ++ LK I I ++ L+ Sbjct: 71 KDIEIIKGHTNSLKTILIKNIDEDYLNLI 99 >gi|258593710|emb|CBE70051.1| conserved hypothetical protein [NC10 bacterium 'Dutch sediment'] Length = 102 Score = 84.8 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 25/87 (28%), Positives = 42/87 (48%), Gaps = 7/87 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P A + I D ++++V A P +G+AN A L +LAK L L Sbjct: 11 ASFRVRLQPKASREAI-------DGEVDGVLRLRVNAPPVEGQANDACLRLLAKTLDLPI 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITE 88 S L +++ Q + +K I + ++ Sbjct: 64 SRLGIVAGQQARVKTIRVTDASADLLR 90 >gi|182680478|ref|YP_001834624.1| hypothetical protein Bind_3579 [Beijerinckia indica subsp. indica ATCC 9039] gi|182636361|gb|ACB97135.1| protein of unknown function DUF167 [Beijerinckia indica subsp. indica ATCC 9039] Length = 87 Score = 84.8 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 43/81 (53%), Gaps = 3/81 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A + I +E D S ++I VT P+ GKAN+ +L +LAK L + Sbjct: 10 VEIAIRVTPKASANRIV-VETAPDGS--ERLRIYVTTVPENGKANRDVLRLLAKHLDIPP 66 Query: 62 SSLRMLSKQSSPLKIIYIDKD 82 SSL ++ + KI+ +D Sbjct: 67 SSLEIIRGSTGRDKIVRFSRD 87 >gi|313673537|ref|YP_004051648.1| hypothetical protein Calni_1577 [Calditerrivibrio nitroreducens DSM 19672] gi|312940293|gb|ADR19485.1| protein of unknown function DUF167 [Calditerrivibrio nitroreducens DSM 19672] Length = 86 Score = 84.8 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 26/93 (27%), Positives = 49/93 (52%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P AKK+ D +KIK+ + P GKAN+ +++ +++ L LSK Sbjct: 1 MRIKIYVQPGAKKTA-------YDGEFNGCIKIKIKSPPTDGKANEELISFISQSLNLSK 53 Query: 62 SSLRMLSKQSSPLKIIYIDK--DCKEITELLQN 92 + ++S + S KII + + D + I E L++ Sbjct: 54 KEVGIISGEKSRYKIIEVPENYDMEFIKEKLKD 86 >gi|291278644|ref|YP_003495479.1| hypothetical protein DEFDS_0212 [Deferribacter desulfuricans SSM1] gi|290753346|dbj|BAI79723.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1] Length = 82 Score = 84.4 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 7/77 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AKK+ +A KIKV + P G ANK ++ LAKKL +SK Sbjct: 3 VRITFYIQPGAKKTEVAG-------EFNNMTKIKVASPPVDGAANKELIKFLAKKLGVSK 55 Query: 62 SSLRMLSKQSSPLKIIY 78 SS++++S + S +K + Sbjct: 56 SSVKIVSGEKSRIKTVE 72 >gi|189485621|ref|YP_001956562.1| hypothetical protein TGRD_618 [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|254806541|sp|B1GYK8|Y618_UNCTG RecName: Full=UPF0235 protein TGRD_618 gi|170287580|dbj|BAG14101.1| conserved hypothetical protein [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 87 Score = 84.4 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+IPN+K++ + + +++K+TA +G+AN+ + L+ + + Sbjct: 1 MIIKVRVIPNSKRNEVV-------SRVGSILRVKITAPAIEGRANEELCDFLSDFFDVKR 53 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S + + + K I + +E+ E+L Sbjct: 54 SMIFLRKGERGREKTIEVLGRLEEELNEVLDTI 86 >gi|126273662|ref|XP_001365632.1| PREDICTED: similar to chromosome 15 open reading frame 40, [Monodelphis domestica] Length = 146 Score = 84.4 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 9/94 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K++ I + + + + A P +G+AN + L+K L L KS Sbjct: 58 IAIHAKPGSKQNAITDVTTEN-------VSVAIAAPPSEGEANTELCRYLSKVLELRKSD 110 Query: 64 LRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 + + S K++ I +EI E L+ Sbjct: 111 VILDKGGKSREKVVKILASTTPEEILEKLKRQAE 144 >gi|239906817|ref|YP_002953558.1| hypothetical protein DMR_21810 [Desulfovibrio magneticus RS-1] gi|239796683|dbj|BAH75672.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 122 Score = 84.4 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 17/81 (20%), Positives = 39/81 (48%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P K +A L +++++ A +G+AN A+ +A+ L + Sbjct: 37 LRVAVTPGGAKDALAGL-------AEDRLRVRLRAKAVEGQANAALTDFVARCLGVKPRQ 89 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 +R++S + S K + I+ + + Sbjct: 90 VRIISGEKSRKKTLRIETESE 110 >gi|317050344|ref|YP_004111460.1| hypothetical protein Selin_0146 [Desulfurispirillum indicum S5] gi|316945428|gb|ADU64904.1| protein of unknown function DUF167 [Desulfurispirillum indicum S5] Length = 92 Score = 84.4 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 16/87 (18%), Positives = 41/87 (47%), Gaps = 7/87 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P + ++ + +K+++ A P G AN+ ++ ++ L++ KS Sbjct: 11 LQLKVQPRSSRTELL-------READGQLKLRLNAPPVDGAANQQVIEFFSRLLSIPKSR 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + ++ Q S K+I + I E + Sbjct: 64 ISIVQGQQSRRKVIALLGVEPAILEKV 90 >gi|94971231|ref|YP_593279.1| hypothetical protein Acid345_4205 [Candidatus Koribacter versatilis Ellin345] gi|166227235|sp|Q1IIU5|Y4205_ACIBL RecName: Full=UPF0235 protein Acid345_4205 gi|94553281|gb|ABF43205.1| protein of unknown function DUF167 [Candidatus Koribacter versatilis Ellin345] Length = 96 Score = 84.4 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 48/91 (52%), Gaps = 8/91 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P AKK+ I +K+ VT P G+AN+A++ +A L +++ Sbjct: 11 VSFAVRLQPKAKKTAIIG-------ELNGALKLGVTDPPIDGRANEALIRFVAGLLKVTR 63 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 SS+ + + +SS K+I I+ +++ L+ Sbjct: 64 SSVTIAAGESSRNKVIRIEGVTAEQVRFRLK 94 >gi|220916321|ref|YP_002491625.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans 2CP-1] gi|254799985|sp|B8JFX1|Y1215_ANAD2 RecName: Full=UPF0235 protein A2cp1_1215 gi|219954175|gb|ACL64559.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans 2CP-1] Length = 95 Score = 84.4 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A ++ +KI++ A P G AN A++ LA L + + Sbjct: 11 AVLEILVQPRASRTRAVG-------EHDGRLKIQLAAPPVDGAANAALVEFLAAALGVRR 63 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + +L ++ K + + Sbjct: 64 ADVELLRGETGRRKTVRV 81 >gi|15893224|ref|NP_360938.1| hypothetical protein RC1301 [Rickettsia conorii str. Malish 7] gi|34581107|ref|ZP_00142587.1| hypothetical protein [Rickettsia sibirica 246] gi|229587206|ref|YP_002845707.1| hypothetical protein RAF_ORF1191 [Rickettsia africae ESF-5] gi|29839741|sp|Q92G24|Y1301_RICCN RecName: Full=UPF0235 protein RC1301 gi|259645748|sp|C3PLX4|Y1191_RICAE RecName: Full=UPF0235 protein RAF_ORF1191 gi|15620440|gb|AAL03839.1| unknown [Rickettsia conorii str. Malish 7] gi|28262492|gb|EAA25996.1| unknown [Rickettsia sibirica 246] gi|228022256|gb|ACP53964.1| Unknown [Rickettsia africae ESF-5] Length = 105 Score = 84.4 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 51/89 (57%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ PN+K++ I++ I + ++K+ + A P++GKAN+ ++ LAK+ LS+ Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIP---YLKLSIKAIPEQGKANEEIINYLAKEWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S++ ++ + LK I I ++ L+ Sbjct: 71 SNIEIIKGHTHSLKTILIKNINEDYLNLI 99 >gi|115495117|ref|NP_001068854.1| hypothetical protein LOC509050 [Bos taurus] gi|122140809|sp|Q3ZBP8|CO040_BOVIN RecName: Full=UPF0235 protein C15orf40 homolog gi|73587092|gb|AAI03180.1| Chromosome 15 open reading frame 40 ortholog [Bos taurus] gi|296475539|gb|DAA17654.1| hypothetical protein LOC509050 [Bos taurus] Length = 126 Score = 84.4 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 41/94 (43%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 36 VSIAIHAKPGSKQNAVTDVTT-------EAVSVAIAAPPTEGEANAELCRYLSKVLELRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +EI E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTPPEEILEKLKKQ 122 >gi|146337438|ref|YP_001202486.1| hypothetical protein BRADO0278 [Bradyrhizobium sp. ORS278] gi|146190244|emb|CAL74236.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 109 Score = 84.4 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 2/78 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +R+ P + I +E D +K++V A G+AN+A+ +LAK + ++K + Sbjct: 13 VALRVTPRGGRDAIDGIETLSDG--RSVLKVRVRAIADGGEANRAVTELLAKAIGVTKKA 70 Query: 64 LRMLSKQSSPLKIIYIDK 81 +R+ S +S LK + ID Sbjct: 71 VRITSGTTSRLKQVAIDG 88 >gi|299133137|ref|ZP_07026332.1| protein of unknown function DUF167 [Afipia sp. 1NLS2] gi|298593274|gb|EFI53474.1| protein of unknown function DUF167 [Afipia sp. 1NLS2] Length = 108 Score = 84.4 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 52/94 (55%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + + +E + + ++++V A + G+AN+A+ + A+ L + KS Sbjct: 14 VAVRVTPRGGRDAVDGIEELANGKSVVKVRVRVAA--EGGEANRAVTELFAELLRVPKSK 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 +R+ S +S +K + I+ D K++ E L+ + T Sbjct: 72 VRVASGVTSRVKQLTIEGDPKQLGEALKAATAAT 105 >gi|218782636|ref|YP_002433954.1| hypothetical protein Dalk_4809 [Desulfatibacillum alkenivorans AK-01] gi|218764020|gb|ACL06486.1| protein of unknown function DUF167 [Desulfatibacillum alkenivorans AK-01] Length = 106 Score = 84.0 bits (207), Expect = 5e-15, Method: Composition-based stats. Identities = 27/97 (27%), Positives = 49/97 (50%), Gaps = 11/97 (11%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 ++++P + + + L+ +KIK+ A P G ANK + LAK L L KSS++ Sbjct: 17 IKVLPRSSVNAVVGLQ-------DGALKIKLKAPPVGGAANKMCIQFLAKTLKLPKSSIK 69 Query: 66 MLSKQSSPLKIIYI----DKDCKEITELLQNNDSLTL 98 +LS ++ K I + + KE+ L + + L + Sbjct: 70 ILSGETGRSKQIMVRPREEGSKKELARLRKIIEELAV 106 >gi|296269396|ref|YP_003652028.1| hypothetical protein Tbis_1417 [Thermobispora bispora DSM 43833] gi|296092183|gb|ADG88135.1| protein of unknown function DUF167 [Thermobispora bispora DSM 43833] Length = 89 Score = 84.0 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 39/88 (44%), Gaps = 6/88 (6%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A + + T + ++V A G+A +A L +A + + Sbjct: 1 MRVSIRVRPGASREYVGG------TYGDGAIVVRVCAPAVDGRATEAALKAVASAFGVRR 54 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89 +R++S ++ K++ I D +E+ Sbjct: 55 GDVRLVSGATARDKVVEIAGDEEELARR 82 >gi|194337426|ref|YP_002019220.1| protein of unknown function DUF167 [Pelodictyon phaeoclathratiforme BU-1] gi|226701435|sp|B4SES7|Y2415_PELPB RecName: Full=UPF0235 protein Ppha_2415 gi|194309903|gb|ACF44603.1| protein of unknown function DUF167 [Pelodictyon phaeoclathratiforme BU-1] Length = 97 Score = 84.0 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 36/85 (42%), Gaps = 7/85 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V+ P + KS + L +K+ + A P AN+ + +K + S + Sbjct: 15 VKAQPRSSKSRVCGL-------YNGGLKVNLKAAPVDDAANRECCELFSKLFRIPPSRVH 67 Query: 66 MLSKQSSPLKIIYIDKDCKEITELL 90 +LS QSS K + ++ + L+ Sbjct: 68 ILSGQSSRTKTVMVEGISSKAAALV 92 >gi|260800225|ref|XP_002595035.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae] gi|229280275|gb|EEN51046.1| hypothetical protein BRAFLDRAFT_237405 [Branchiostoma floridae] Length = 98 Score = 84.0 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 39/92 (42%), Gaps = 7/92 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + P AK + I + +++TA P +G+AN + LA L + KS+ Sbjct: 12 VAIHAKPGAKANAITDVTTETVG-------VQITAPPMEGEANAELCRYLAGVLEVKKSA 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 + + S K + +D I +LQ + Sbjct: 65 VSLERGAKSREKTVRVDTPGTTIDAVLQRIKA 96 >gi|156548436|ref|XP_001604898.1| PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] Length = 122 Score = 84.0 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 42/95 (44%), Gaps = 8/95 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V+ P AK++ I I ++A PQ+G+AN ++ LA L + K Sbjct: 33 VTIKVQAKPGAKQNNITDFSEETVG-------IAISAPPQEGEANAELVKYLASILNVRK 85 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELLQNNDS 95 S + + S K + + +++TE L+ + Sbjct: 86 SDVTLDRGSRSRQKKVIVTGSSVEKVTEKLKAEAA 120 >gi|114658629|ref|XP_001148978.1| PREDICTED: similar to Chromosome 15 open reading frame 40 [Pan troglodytes] Length = 283 Score = 84.0 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 193 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 245 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 246 SDVVLDKGGKSREKVVKLLASTTPEEI 272 >gi|258545474|ref|ZP_05705708.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826] gi|258519174|gb|EEV88033.1| cytoplasmic protein [Cardiobacterium hominis ATCC 15826] Length = 97 Score = 84.0 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 24/79 (30%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ A + + KI +TA P+ GKANK + A LAK A++K + Sbjct: 13 LAVKITARASRDQCQGIH-------DERYKIAITAPPEDGKANKHLTAWLAKTFAVAKKN 65 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + S SPLK + I Sbjct: 66 VALQSGAFSPLKTLRITAP 84 >gi|297617010|ref|YP_003702169.1| hypothetical protein Slip_0824 [Syntrophothermus lipocalidus DSM 12680] gi|297144847|gb|ADI01604.1| protein of unknown function DUF167 [Syntrophothermus lipocalidus DSM 12680] Length = 96 Score = 84.0 bits (207), Expect = 7e-15, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 45/83 (54%), Gaps = 7/83 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 VR++P A K+ + +KIK+TA P +G+AN+A+++ LAK +++ Sbjct: 10 IRFEVRVLPRASKNEVIG-------EVEGAVKIKLTAAPLEGEANQALISFLAKISGVAR 62 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84 ++ ++ ++S K++ I K Sbjct: 63 KNVTIIKGETSRHKLVEITGIDK 85 >gi|85713678|ref|ZP_01044668.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A] gi|85699582|gb|EAQ37449.1| hypothetical protein NB311A_04039 [Nitrobacter sp. Nb-311A] Length = 106 Score = 83.6 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 24/93 (25%), Positives = 54/93 (58%), Gaps = 2/93 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P + GI +E+ D + ++++ A + G+AN+A++A+LAK L + K Sbjct: 12 RIALRVTPRGGRDGIDGIEMLADGRPVVKVRVRAVA--EGGEANRAVMAVLAKALGVRKV 69 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 +R+L+ +S LK + ++ D ++ + L+ + Sbjct: 70 DVRILAGATSRLKQVAVEGDPVQLGDALRALTA 102 >gi|325108674|ref|YP_004269742.1| hypothetical protein Plabr_2117 [Planctomyces brasiliensis DSM 5305] gi|324968942|gb|ADY59720.1| UPF0235 protein yggU [Planctomyces brasiliensis DSM 5305] Length = 109 Score = 83.6 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 27/93 (29%), Positives = 48/93 (51%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P AK++ + + +K+ VT ++GKAN+ +L +LAK L L K Sbjct: 20 VLLPLRVTPGAKRNAVGGVH-------DGALKVAVTQIAERGKANQQVLKILAKALGLKK 72 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 S L ++S ++S K I D E+ + + N Sbjct: 73 SQLTLVSGETSRNKRIACRDVSAAELLQRISNL 105 >gi|300866080|ref|ZP_07110810.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] gi|300335941|emb|CBN55968.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506] Length = 78 Score = 83.6 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 7/79 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + +++ PN+K+ I KI + + P GKAN+ ++ +LAKK + Sbjct: 1 MAILTIKVKPNSKQQNI-------QQEPDGSFKISLKSPPIDGKANEELIKLLAKKFGIP 53 Query: 61 KSSLRMLSKQSSPLKIIYI 79 KS + + S SS K++ + Sbjct: 54 KSQITIKSGLSSKNKLVEL 72 >gi|114798070|ref|YP_761927.1| hypothetical protein HNE_3254 [Hyphomonas neptunium ATCC 15444] gi|114738244|gb|ABI76369.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 99 Score = 83.6 bits (206), Expect = 7e-15, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 2/82 (2%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + R+ P A + D + +K++V A P +G AN A+ A++AK L + KS Sbjct: 4 RLTARVQPKAASDRLDGW--AADEAGRPFLKLRVRALPAEGAANAAVEALVAKALGVPKS 61 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 ++R+++ + LK + I+ Sbjct: 62 AVRVVTGGKNRLKSLEIEGPPD 83 >gi|323697733|ref|ZP_08109645.1| protein of unknown function DUF167 [Desulfovibrio sp. ND132] gi|323457665|gb|EGB13530.1| protein of unknown function DUF167 [Desulfovibrio desulfuricans ND132] Length = 98 Score = 83.6 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P A+K +A +K+++ A KANK ++A +A+ L L KS Sbjct: 12 LDVWAQPGARKDEVAG-------EYQGCLKVRLRAPAVDNKANKGLVAYIARLLQLKKSQ 64 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + ++S +S K + ++ + Sbjct: 65 VEIVSGHASRRKHLALNTAGE 85 >gi|209883467|ref|YP_002287324.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5] gi|226706162|sp|B6JAU6|Y4310_OLICO RecName: Full=UPF0235 protein OCAR_4310 gi|209871663|gb|ACI91459.1| hypothetical protein OCAR_4310 [Oligotropha carboxidovorans OM5] Length = 106 Score = 83.6 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 52/93 (55%), Gaps = 2/93 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + + +E+ + + ++++V A + G+AN+A+ + A L + KS Sbjct: 14 VAVRVTPRGGRDAVDGIEMLANGKSVVKVRVRVAA--EGGEANRAVTELFAGLLRVPKSK 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 +++ S +S +K I I+ D K++ E L+ S+ Sbjct: 72 VKVASGVTSRIKQIAIEGDPKQLGEALKAATSI 104 >gi|317153487|ref|YP_004121535.1| hypothetical protein Daes_1777 [Desulfovibrio aespoeensis Aspo-2] gi|316943738|gb|ADU62789.1| protein of unknown function DUF167 [Desulfovibrio aespoeensis Aspo-2] Length = 102 Score = 83.6 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 43/82 (52%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V + P A+KS +A + +KI++ A KANKA++A +A L + KS Sbjct: 15 RIAVWVQPGARKSEVAGV-------YQQCVKIRLCAPAVDNKANKALVAFVASVLNVKKS 67 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 + + S Q++ K++ ++ + Sbjct: 68 QVVIESGQTTRKKLLALNTVAE 89 >gi|238899239|ref|YP_002924922.1| hypothetical protein HDEF_2231 [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229467000|gb|ACQ68774.1| conserved hypothetical protein [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 103 Score = 83.6 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 44/96 (45%), Gaps = 10/96 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P ++ I + +KI +TA P KANK ++ L+KK ++KS Sbjct: 15 LNVYIQPQSRFDNIVGIHH-------NEIKINLTALPVDNKANKHLIHFLSKKCKVAKSH 67 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQNNDSL 96 + + Q S K + I EI ++L + + L Sbjct: 68 IMIEKGQLSRHKQVRIVEPKVIPVEIKQILADRNKL 103 >gi|258405784|ref|YP_003198526.1| hypothetical protein Dret_1664 [Desulfohalobium retbaense DSM 5692] gi|257798011|gb|ACV68948.1| protein of unknown function DUF167 [Desulfohalobium retbaense DSM 5692] Length = 105 Score = 83.6 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P AKK+ + + +KIK+ A P KAN+A+ +A +L L + Sbjct: 23 RLRLWVQPKAKKTAVVGV-------YQDCLKIKLQAPPVDNKANQAVCRFVAARLGLRPA 75 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 + + S Q+S K I I + Sbjct: 76 DVDLGSGQASRKKTIVIASRDE 97 >gi|153003993|ref|YP_001378318.1| hypothetical protein Anae109_1126 [Anaeromyxobacter sp. Fw109-5] gi|166977708|sp|A7H9D8|Y1126_ANADF RecName: Full=UPF0235 protein Anae109_1126 gi|152027566|gb|ABS25334.1| protein of unknown function DUF167 [Anaeromyxobacter sp. Fw109-5] Length = 95 Score = 83.6 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 15/79 (18%), Positives = 35/79 (44%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A ++ + +KI++ A P G AN A++ LA+ L + K Sbjct: 11 VVLELLVQPRASRTRVLG-------EHGGRLKIQLAAPPVDGAANAALVEFLAEALEVRK 63 Query: 62 SSLRMLSKQSSPLKIIYID 80 + ++ ++ K + + Sbjct: 64 QDVVLVRGETGRRKAVRVT 82 >gi|170290086|ref|YP_001736902.1| hypothetical protein Kcr_0466 [Candidatus Korarchaeum cryptofilum OPF8] gi|170174166|gb|ACB07219.1| protein of unknown function DUF167 [Candidatus Korarchaeum cryptofilum OPF8] Length = 72 Score = 83.6 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 25/79 (31%), Positives = 46/79 (58%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V V ++PNA ++G+ + H+++ V A P KGKAN+A++ +LA+ + K Sbjct: 1 MRVSVLVVPNAGRNGVV--------EEGDHLRVYVRAPPVKGKANEAVIEVLAEFFGVKK 52 Query: 62 SSLRMLSKQSSPLKIIYID 80 S +R++S + S K++ I Sbjct: 53 SDIRIISGERSREKVVEIR 71 >gi|165933862|ref|YP_001650651.1| hypothetical protein RrIowa_1526 [Rickettsia rickettsii str. Iowa] gi|189038764|sp|B0BVI5|Y1526_RICRO RecName: Full=UPF0235 protein RrIowa_1526 gi|165908949|gb|ABY73245.1| hypothetical cytosolic protein [Rickettsia rickettsii str. Iowa] Length = 105 Score = 83.6 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ PN+K++ I++ I + ++K+ + A P++GKAN ++ LAK+ LS+ Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIQ---YLKLSIKAIPEQGKANAEIINYLAKEWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S++ ++ + LK I I ++ L+ Sbjct: 71 SNIEIIKGHTHSLKTILIKNINEDYLNLI 99 >gi|284039659|ref|YP_003389589.1| hypothetical protein Slin_4812 [Spirosoma linguale DSM 74] gi|283818952|gb|ADB40790.1| protein of unknown function DUF167 [Spirosoma linguale DSM 74] Length = 86 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P +K + + K+ A Q GKAN ++ LAK+L + K Sbjct: 1 MTLHLKAKPGSKIDQL-------FYDAAGQLNAKIRAPAQDGKANAYLIEFLAKQLGIPK 53 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S + +++ ++P K I +D + +T+ L Sbjct: 54 SGVSIVAGFTNPHKRIEVDVPEEVLTDFL 82 >gi|13385576|ref|NP_080353.1| hypothetical protein LOC67290 [Mus musculus] gi|29839616|sp|Q9CRC3|CO040_MOUSE RecName: Full=UPF0235 protein C15orf40 homolog gi|12851848|dbj|BAB29184.1| unnamed protein product [Mus musculus] gi|12857526|dbj|BAB31031.1| unnamed protein product [Mus musculus] gi|12859380|dbj|BAB31634.1| unnamed protein product [Mus musculus] Length = 126 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P ++++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSRQNAVTDLSTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTEAE 124 >gi|189219304|ref|YP_001939945.1| hypothetical protein Minf_1293 [Methylacidiphilum infernorum V4] gi|189186162|gb|ACD83347.1| Uncharacterized conserved protein [Methylacidiphilum infernorum V4] Length = 97 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 43/80 (53%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ NAKK+ I S +KI+++A P +GKAN A+L+ L+ +L + K Sbjct: 4 ARLFVKVQANAKKTEICG-------SYADALKIRLSAPPVEGKANDALLSFLSLRLCVPK 56 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 +R+ + + K + I+ Sbjct: 57 RLIRIEKGEKNSKKTVVIEG 76 >gi|91787002|ref|YP_547954.1| hypothetical protein Bpro_1104 [Polaromonas sp. JS666] gi|91696227|gb|ABE43056.1| protein of unknown function DUF167 [Polaromonas sp. JS666] Length = 113 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 39/76 (51%), Gaps = 3/76 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V V ++PNA K+ L + +++++ P GKAN+A++ LA L + + Sbjct: 19 VLVDVHVVPNAAKTQPVGLHGE---PGQLALRLRLQVPPVDGKANQALIRWLAHSLGVPQ 75 Query: 62 SSLRMLSKQSSPLKII 77 +++ + ++S K + Sbjct: 76 NAITPVRGETSRRKQL 91 >gi|170749303|ref|YP_001755563.1| hypothetical protein Mrad2831_2896 [Methylobacterium radiotolerans JCM 2831] gi|170655825|gb|ACB24880.1| protein of unknown function DUF167 [Methylobacterium radiotolerans JCM 2831] Length = 93 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 2/83 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P + + ++K +V A P +G AN A++ ++AK L + + Sbjct: 3 IRLAVRLTPRGGRDAAEGWARDEKGQP--YLKARVAAPPVEGAANAALVVLIAKALKVGR 60 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84 S+R+++ S LKI+ ID + Sbjct: 61 GSVRIVTGDQSRLKILEIDGVAQ 83 >gi|241835850|ref|XP_002415074.1| conserved hypothetical protein [Ixodes scapularis] gi|215509286|gb|EEC18739.1| conserved hypothetical protein [Ixodes scapularis] Length = 104 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 38/92 (41%), Gaps = 9/92 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P A +S I + +++ A P G+AN ++ LAK L L KS Sbjct: 17 IRVHAKPGASESRITDIGTDGVG-------VQIAAPPMDGEANAELVRFLAKVLNLRKSD 69 Query: 64 LRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 + + S K++ I EI LLQ Sbjct: 70 VSLEKGSRSKDKVVMIASPATAAEILSLLQQK 101 >gi|157964988|ref|YP_001499812.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5] gi|157844764|gb|ABV85265.1| hypothetical protein RMA_1324 [Rickettsia massiliae MTU5] Length = 107 Score = 83.2 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ PN+K++ I++ I + ++K+ + A P +GKAN+ ++ LAK+ LS+ Sbjct: 16 ALLSFKVKPNSKQNLISNFVIINNIP---YLKLSIKAIPAQGKANEEIINYLAKEWKLSR 72 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S++ ++ + LK I I ++ L+ Sbjct: 73 SNIEIIKGHTHSLKTILIKNINEDYLNLI 101 >gi|67078514|ref|NP_001019920.1| hypothetical protein LOC293059 [Rattus norvegicus] gi|81908918|sp|Q505I4|CO040_RAT RecName: Full=UPF0235 protein C15orf40 homolog gi|63100368|gb|AAH94529.1| Similar to RIKEN cDNA 3110040N11 [Rattus norvegicus] gi|149044058|gb|EDL97440.1| rCG63322 [Rattus norvegicus] gi|149057383|gb|EDM08706.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus] gi|149057387|gb|EDM08710.1| similar to RIKEN cDNA 3110040N11, isoform CRA_c [Rattus norvegicus] Length = 126 Score = 82.9 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 124 >gi|256828220|ref|YP_003156948.1| protein of unknown function DUF167 [Desulfomicrobium baculatum DSM 4028] gi|256577396|gb|ACU88532.1| protein of unknown function DUF167 [Desulfomicrobium baculatum DSM 4028] Length = 106 Score = 82.9 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + + P A+K+ +A L +KI+V A KAN A+ ++K L + S Sbjct: 18 RLGIWVQPGARKTEVAGLH-------GDFLKIRVQARAVDNKANSALTVFVSKILGIKAS 70 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 + + S +S K + +D + + Sbjct: 71 QVVIESGHASRQKNLLLDVEEE 92 >gi|242279866|ref|YP_002991995.1| hypothetical protein Desal_2400 [Desulfovibrio salexigens DSM 2638] gi|242122760|gb|ACS80456.1| protein of unknown function DUF167 [Desulfovibrio salexigens DSM 2638] Length = 106 Score = 82.9 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 7/82 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V V + P AK GI +++++ A KANKA+ A +A +L L K Sbjct: 21 RVSVWVQPGAKNEGITG-------EYQDSVRVRINAPAVDNKANKALAAFVATRLGLKKR 73 Query: 63 SLRMLSKQSSPLKIIYIDKDCK 84 ++ + S S+ K++ ++ D + Sbjct: 74 NISIASGHSNRKKVLLVESDVE 95 >gi|157829135|ref|YP_001495377.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str. 'Sheila Smith'] gi|166228842|sp|A8GTZ4|Y7140_RICRS RecName: Full=UPF0235 protein A1G_07140 gi|157801616|gb|ABV76869.1| hypothetical protein A1G_07140 [Rickettsia rickettsii str. 'Sheila Smith'] Length = 105 Score = 82.9 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 50/89 (56%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ PN+K++ I++ I + ++K+ + A P++GKAN ++ LAK+ LS+ Sbjct: 14 ALLSFKVKPNSKQNLISNFVIINNIQ---YLKLSIKAIPEQGKANSEIINYLAKEWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S++ ++ + LK I I ++ L+ Sbjct: 71 SNIEIIKGHTHSLKTILIKNINEDYLNLI 99 >gi|220909519|ref|YP_002484830.1| hypothetical protein Cyan7425_4156 [Cyanothece sp. PCC 7425] gi|219866130|gb|ACL46469.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7425] Length = 89 Score = 82.9 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 8/95 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + V+++P++ + + +K+KV A P+KGKAN A++A+LA L + Sbjct: 1 MAKLKVKVVPSSSRDLVVGW-------LGEALKVKVKAPPEKGKANAAVIALLATHLGID 53 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94 ++ + +LS +S K++ I+ D +I L + Sbjct: 54 QTCIEVLSGHTSAAKVLSIEGLDQTQIRAALSDVS 88 >gi|126657163|ref|ZP_01728329.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110] gi|126621434|gb|EAZ92145.1| hypothetical protein CY0110_24581 [Cyanothece sp. CCY0110] Length = 75 Score = 82.9 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + V++ PNAK+ I + H KI V + P GKAN+ ++ +LAK + Sbjct: 2 LLKLQVKVKPNAKQQKI-------EEMADNHFKIAVKSPPTDGKANQELITLLAKHFNVP 54 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS + + S SS K++ ID Sbjct: 55 KSHILIKSGVSSRNKLVEID 74 >gi|239948125|ref|ZP_04699878.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] gi|239922401|gb|EER22425.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] Length = 92 Score = 82.9 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 20/80 (25%), Positives = 45/80 (56%), Gaps = 3/80 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ PN+ ++ I+ + ++K+ + A P++GKAN+ ++ LAK+ LS+ Sbjct: 14 ALLSLKVKPNSNRNLISDFITINNIP---YLKLSIKAVPEQGKANEEIINYLAKEWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S++ ++ + LK I I Sbjct: 71 SNIEIIKGHTHSLKTILIKN 90 >gi|146276431|ref|YP_001166590.1| hypothetical protein Rsph17025_0379 [Rhodobacter sphaeroides ATCC 17025] gi|145554672|gb|ABP69285.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC 17025] Length = 84 Score = 82.9 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A + I +++ VT P+ GKAN+A+ +LAK L ++KS L Sbjct: 17 AVRVTPRASRERIE--------VQEGTVRVHVTCVPEDGKANRAVTEVLAKALGVAKSRL 68 Query: 65 RMLSKQSSPLKIIYID 80 ++ + K +D Sbjct: 69 TLVRGATGRDKTFRLD 84 >gi|115380361|ref|ZP_01467361.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] gi|115362627|gb|EAU61862.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1] Length = 83 Score = 82.5 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 17/88 (19%), Positives = 41/88 (46%), Gaps = 7/88 (7%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P A ++ + +K+++ A P G+AN A++ LAK+L L + + ++ Sbjct: 3 VQPRASRTRVVG-------EHDGMLKLQLAAPPVDGEANAALVEFLAKRLGLPRRQVTLV 55 Query: 68 SKQSSPLKIIYIDKDCKEITELLQNNDS 95 + ++ K +++ E + + S Sbjct: 56 AGDAARRKRVFLAGVDAARVEAVMSQAS 83 >gi|186683679|ref|YP_001866875.1| hypothetical protein Npun_R3528 [Nostoc punctiforme PCC 73102] gi|226703856|sp|B2J1T0|Y3528_NOSP7 RecName: Full=UPF0235 protein Npun_R3528 gi|186466131|gb|ACC81932.1| protein of unknown function DUF167 [Nostoc punctiforme PCC 73102] Length = 75 Score = 82.5 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 7/78 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PN+K+ I + + + + + P GKAN+ ++ +LAKK ++KS + Sbjct: 4 KVKVKPNSKQQKI-------EEQPDGSLTVYLKSPPVDGKANEELIKLLAKKFDVAKSDI 56 Query: 65 RMLSKQSSPLKIIYIDKD 82 R+ S SS K+I ID+D Sbjct: 57 RIKSGLSSRQKLIEIDRD 74 >gi|332025796|gb|EGI65953.1| UPF0235 protein C15orf40-like protein [Acromyrmex echinatior] Length = 139 Score = 82.1 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 37/91 (40%), Gaps = 8/91 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P AK + + + I ++A P +G+AN ++ LA + KS Sbjct: 53 IKIQAKPGAKCNNVTDISDEAIG-------IAISAPPTEGEANAELVKYLASIFGVRKSD 105 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + + S K++ + ++ L+ Sbjct: 106 VSLDRGSRSRQKVVIVSGISTDQVLTKLKGE 136 >gi|325294222|ref|YP_004280736.1| yggU [Desulfurobacterium thermolithotrophum DSM 11699] gi|325064670|gb|ADY72677.1| UPF0235 protein yggU [Desulfurobacterium thermolithotrophum DSM 11699] Length = 101 Score = 82.1 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 45/78 (57%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P A ++ I +E +KIKVT P+ GKAN+ ++ +L+K L + K Sbjct: 13 IEVKVQPKASRNKIEKVE-------EGRLKIKVTVPPEGGKANQKIIELLSKALKVPKRD 65 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++ ++S +K++ I+ Sbjct: 66 IDIVKGETSRIKVVRIEG 83 >gi|20381446|gb|AAH27500.1| RIKEN cDNA 3110040N11 gene [Mus musculus] gi|74206730|dbj|BAE41614.1| unnamed protein product [Mus musculus] gi|148674975|gb|EDL06922.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus] gi|148674977|gb|EDL06924.1| RIKEN cDNA 3110040N11, isoform CRA_d [Mus musculus] Length = 126 Score = 82.1 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 39/96 (40%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P ++++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSRQNAVTDLSTEAVG-------VAIAAPPSQGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTEAE 124 >gi|322792209|gb|EFZ16226.1| hypothetical protein SINV_80163 [Solenopsis invicta] Length = 122 Score = 82.1 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 37/91 (40%), Gaps = 8/91 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P AK + I + I ++A P +G+AN ++ LA + KS Sbjct: 36 IKIQAKPGAKCNNITDISDEAVG-------IAISAPPTEGEANAELVKYLASTFGVRKSD 88 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + + S K++ + ++ L+ Sbjct: 89 VTLDRGSRSRQKVVVVSGITTDQVLTKLKGE 119 >gi|218885259|ref|YP_002434580.1| hypothetical protein DvMF_0151 [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218756213|gb|ACL07112.1| protein of unknown function DUF167 [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 106 Score = 82.1 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 22/81 (27%), Positives = 43/81 (53%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+VR +P A+KS + +K+++ A KANKA+ +A L + ++ Sbjct: 22 VLVRAVPGARKSACEG-------TADGRLKVRLAAPAVDNKANKALEEFVASALGMRRNR 74 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 +R++S +S LK + ++ D + Sbjct: 75 VRLVSGHTSRLKKLIVESDVE 95 >gi|288931799|ref|YP_003435859.1| hypothetical protein Ferp_1433 [Ferroglobus placidus DSM 10642] gi|288894047|gb|ADC65584.1| protein of unknown function DUF167 [Ferroglobus placidus DSM 10642] Length = 94 Score = 82.1 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 8/88 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P++KK+ IA + + +KV A P +GKAN+ + L + Sbjct: 12 VIITVHVTPSSKKNEIAGYD-----PWKKALSVKVKAPPVEGKANRELEKFLKEYFG--- 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89 +++++S + S +K + I +E E Sbjct: 64 KNVKLVSGEKSRVKKVLIVGCSREEVEK 91 >gi|119384906|ref|YP_915962.1| hypothetical protein Pden_2174 [Paracoccus denitrificans PD1222] gi|189039514|sp|A1B422|Y2174_PARDP RecName: Full=UPF0235 protein Pden_2174 gi|119374673|gb|ABL70266.1| protein of unknown function DUF167 [Paracoccus denitrificans PD1222] Length = 82 Score = 82.1 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + D +++ VT P+ GKAN A++ +LAK L ++K Sbjct: 12 AEIAVRVTPRASRNAVI--------LDGEAIRVTVTTVPEDGKANAAVVKLLAKALGVAK 63 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ ++ K+ ID Sbjct: 64 SRLVLVRGATARDKLFRID 82 >gi|47213227|emb|CAF89748.1| unnamed protein product [Tetraodon nigroviridis] Length = 120 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 39/93 (41%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P +K S + ++ +++ + A P G+AN ++ LA+ L L K Sbjct: 32 VTITVHAKPGSKHSRVTAVST-------EAVEVAIAAPPVDGEANVELVRFLAEVLELKK 84 Query: 62 SSLRMLSKQSSPLKIIYIDK--DCKEITELLQN 92 L + S K + +D +E+ L+ Sbjct: 85 GHLHLDKGSRSRDKQVRVDSPLSPEEVLRRLRQ 117 >gi|260467362|ref|ZP_05813535.1| protein of unknown function DUF167 [Mesorhizobium opportunistum WSM2075] gi|259028889|gb|EEW30192.1| protein of unknown function DUF167 [Mesorhizobium opportunistum WSM2075] Length = 103 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 49/92 (53%), Gaps = 2/92 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ VRL P + + +E D H+K++V A P+ G AN+A+ + AK L + Sbjct: 12 IDLFVRLTPKSSVDRLEGVETAADG--RSHLKVRVRAVPENGAANQALERLAAKTLGVPV 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S++ +++ ++ LK + + D + ++ ++ Sbjct: 70 SAVSVVAGGTARLKTLRVAGDPEALSRSIEAL 101 >gi|157804228|ref|YP_001492777.1| hypothetical protein A1E_05380 [Rickettsia canadensis str. McKiel] gi|166227320|sp|A8F059|Y5380_RICCK RecName: Full=UPF0235 protein A1E_05380 gi|157785491|gb|ABV73992.1| hypothetical protein A1E_05380 [Rickettsia canadensis str. McKiel] Length = 105 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 47/84 (55%), Gaps = 3/84 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ P++K++ I+ I + ++K+ + P++GKAN+ ++ LAK LS+ Sbjct: 14 ALLNLKVKPDSKQNLISDFVIINNLP---YLKLFIKTAPEQGKANEEIINYLAKAWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 S++ ++ + LK I I ++ Sbjct: 71 SNIEIIKGHTHSLKTILIKNIDED 94 >gi|308270456|emb|CBX27068.1| UPF0235 protein PTH_1821 [uncultured Desulfobacterium sp.] Length = 106 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 7/91 (7%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V ++P + K+ IA L +KIK+TA P G AN + LAK L++S S++ Sbjct: 14 KVYILPRSSKNMIAGL-------FGDALKIKLTAAPVDGSANNMCIKYLAKILSVSASNI 66 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELLQNNDS 95 ++S + K I + + K ++ + S Sbjct: 67 EIVSGHTGKTKYILLKNNEKTLSSSTEALIS 97 >gi|110596999|ref|ZP_01385289.1| Protein of unknown function DUF167 [Chlorobium ferrooxidans DSM 13031] gi|110341686|gb|EAT60146.1| Protein of unknown function DUF167 [Chlorobium ferrooxidans DSM 13031] Length = 97 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 35/84 (41%), Gaps = 7/84 (8%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 V+ P + KS + L +K+ + A P AN+ + +K + S + Sbjct: 15 VKAQPRSSKSRVCGL-------YNGGLKVSLKAAPVDDAANRECCDLFSKVFHIPPSRVH 67 Query: 66 MLSKQSSPLKIIYIDKDCKEITEL 89 +++ +SS K + +D E L Sbjct: 68 IIAGKSSRTKTVMLDGVTVEAAAL 91 >gi|145224153|ref|YP_001134831.1| hypothetical protein Mflv_3569 [Mycobacterium gilvum PYR-GCK] gi|315444488|ref|YP_004077367.1| hypothetical protein Mspyr1_29120 [Mycobacterium sp. Spyr1] gi|189040164|sp|A4T9S3|Y3569_MYCGI RecName: Full=UPF0235 protein Mflv_3569 gi|145216639|gb|ABP46043.1| protein of unknown function DUF167 [Mycobacterium gilvum PYR-GCK] gi|315262791|gb|ADT99532.1| uncharacterized conserved protein [Mycobacterium sp. Spyr1] Length = 75 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P ++K + + + + + V GKAN+A+ +LA+ L + +S Sbjct: 5 ISVRVKPGSRKGPLV------EAGEDGALTLYVQERAVDGKANEAVTKLLAEHLGVPRSR 58 Query: 64 LRMLSKQSSPLKIIYI 79 + ++S +S K I Sbjct: 59 IELVSGATSRHKRFRI 74 >gi|75910098|ref|YP_324394.1| hypothetical protein Ava_3894 [Anabaena variabilis ATCC 29413] gi|123608489|sp|Q3M687|Y3894_ANAVT RecName: Full=UPF0235 protein Ava_3894 gi|75703823|gb|ABA23499.1| Protein of unknown function DUF167 [Anabaena variabilis ATCC 29413] Length = 75 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PN+K+ IA D + + + + P GKAN+ ++ +LA+K A+ KS + Sbjct: 4 KVKVKPNSKQQKIA-------EQDDGSLTVHLKSPPVDGKANEELIKLLAEKFAVPKSHI 56 Query: 65 RMLSKQSSPLKIIYIDKD 82 + S SS K+I ID D Sbjct: 57 TIKSGLSSRQKLIEIDTD 74 >gi|54295945|ref|YP_122257.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris] gi|53755777|emb|CAH17279.1| hypothetical protein plpp0102 [Legionella pneumophila str. Paris] Length = 98 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P AK + + + +KIK+ A + KAN ++ L+ + KS Sbjct: 17 LSLLIQPGAKCNQVVG-------AVGEELKIKIAAPSIEVKANMELVRYLSVLFKVPKSQ 69 Query: 64 LRMLSKQSSPLKIIYIDK 81 +++ S KII + Sbjct: 70 IKIKRGLKSRHKIIEVIG 87 >gi|291413562|ref|XP_002723040.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus] Length = 154 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 38/96 (39%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGAKQNAVTDLTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + EI L+ Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPDEILGKLKREAE 152 >gi|197121557|ref|YP_002133508.1| hypothetical protein AnaeK_1146 [Anaeromyxobacter sp. K] gi|226696231|sp|B4UGV4|Y1146_ANASK RecName: Full=UPF0235 protein AnaeK_1146 gi|196171406|gb|ACG72379.1| protein of unknown function DUF167 [Anaeromyxobacter sp. K] Length = 95 Score = 81.7 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A ++ +KI++ A P G AN A++ LA L + + Sbjct: 11 AVLELLVQPRASRTRAVG-------EHDGRLKIQLAAPPVDGAANAALVEFLAVALGVRR 63 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + +L ++ K + + Sbjct: 64 ADVALLRGETGRRKTVRV 81 >gi|75674677|ref|YP_317098.1| hypothetical protein Nwi_0479 [Nitrobacter winogradskyi Nb-255] gi|74419547|gb|ABA03746.1| Protein of unknown function DUF167 [Nitrobacter winogradskyi Nb-255] Length = 106 Score = 81.3 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 51/93 (54%), Gaps = 2/93 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + I +E+ D + ++++ A G+AN+A+ A+LAK L + K Sbjct: 13 IALRVTPRGGRDAIDGIEMLADGRPVVKVRVRAVA--DGGEANRAVTAVLAKALGVRKID 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 +R+L+ +S LK + ++ D ++ L+ ++ Sbjct: 71 VRILAGATSRLKQVAVEGDPVQLGNALRALTAV 103 >gi|260892905|ref|YP_003239002.1| protein of unknown function DUF167 [Ammonifex degensii KC4] gi|260865046|gb|ACX52152.1| protein of unknown function DUF167 [Ammonifex degensii KC4] Length = 103 Score = 81.3 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 41/89 (46%), Gaps = 8/89 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + + +++++TA P +GKAN+ +L L++ L + Sbjct: 20 IHLRVTPRSS--------TLALEAGEGFLRVRLTAPPVEGKANELLLEFLSRVLDIPARR 71 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 L+++ K++ +D + E ++ Sbjct: 72 LQLVKGLKGREKVVLVDMPLPLVAEKIEK 100 >gi|152991960|ref|YP_001357681.1| hypothetical protein SUN_0364 [Sulfurovum sp. NBC37-1] gi|151423821|dbj|BAF71324.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1] Length = 95 Score = 81.3 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 11/91 (12%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ ++ P A ++ + +KI++ A +G ANK ++ LAK + K Sbjct: 10 VSMRIKAQPAASRNEFCDIY------GEDAIKIRIKAPAVEGAANKELMKFLAKSFKVPK 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 S + S Q+S +KI+ +TE QN Sbjct: 64 SDIIFKSGQNSKIKIVEFP-----LTEKFQN 89 >gi|149690913|ref|XP_001498126.1| PREDICTED: similar to chromosome 15 open reading frame 40 [Equus caballus] Length = 167 Score = 81.3 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 34/82 (41%), Gaps = 7/82 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 77 VTIAIHAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLDLRK 129 Query: 62 SSLRMLSKQSSPLKIIYIDKDC 83 S + + S K++ + Sbjct: 130 SDVVLDKGGKSREKVVKLLAST 151 >gi|148236990|ref|NP_001089221.1| hypothetical protein LOC734268 [Xenopus laevis] gi|57920974|gb|AAH89152.1| MGC85153 protein [Xenopus laevis] Length = 121 Score = 81.3 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 38/93 (40%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ I + + + A P +G+AN + L+K L L K Sbjct: 31 VTISIHAKPGAKQNAITDVTADAVG-------VAIAAPPTEGEANAELCRYLSKVLVLKK 83 Query: 62 SSLRMLSKQSSPLKIIYIDKD--CKEITELLQN 92 S + + S K++ I + + E L+ Sbjct: 84 SEVSLDKGGKSREKVVKISASITPEVVLERLKE 116 >gi|194748771|ref|XP_001956818.1| GF24383 [Drosophila ananassae] gi|190624100|gb|EDV39624.1| GF24383 [Drosophila ananassae] Length = 126 Score = 81.3 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 42/95 (44%), Gaps = 10/95 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + +++ A P +G+AN ++ L+K L L KS Sbjct: 39 IQILAKPGAKQNGITGISTEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 91 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNNDS 95 + + S KII I K + +LL+ Sbjct: 92 VSLDKGSRSRNKIILITKGAITTEAAEQLLRKESE 126 >gi|242398935|ref|YP_002994359.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739] gi|242265328|gb|ACS90010.1| hypothetical protein TSIB_0952 [Thermococcus sibiricus MM 739] Length = 94 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 42/91 (46%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + I ++ +K+K+ A P +GKANK ++ +K L Sbjct: 9 VILQIYVQPKANTNEIEGVD-----EWRGRLKVKIKAPPVEGKANKEVVKFFSKLLG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + ++ ++S K + I +E+ + L+ Sbjct: 61 AEASLIKGETSREKDLLIRGISIEEVKKKLK 91 >gi|224373471|ref|YP_002607843.1| hypothetical protein NAMH_1451 [Nautilia profundicola AmH] gi|223589921|gb|ACM93657.1| conserved hypothetical protein [Nautilia profundicola AmH] Length = 94 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 18/91 (19%), Positives = 43/91 (47%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ PN+ K+ IA +K+ + A +G ANK ++ + K+ + KS Sbjct: 11 IKIKAQPNSSKNKIAG-------KYGESLKVNIKAPAVEGAANKELIKFIGKEFKIPKSE 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + ++S K++ + +I + L+ + Sbjct: 64 IAI-KGETSKQKVLIVP-FRDDIIKKLEEIN 92 >gi|261855187|ref|YP_003262470.1| hypothetical protein Hneap_0569 [Halothiobacillus neapolitanus c2] gi|261835656|gb|ACX95423.1| protein of unknown function DUF167 [Halothiobacillus neapolitanus c2] Length = 122 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 20/102 (19%), Positives = 45/102 (44%), Gaps = 14/102 (13%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ P A ++ + +K+ +T P GKAN+A++ +AK +++K+ Sbjct: 19 LKLKVQPRASRTAWGEV-------IGGRIKLYLTTPPIDGKANQAVIDFIAKTFSVAKNR 71 Query: 64 LRMLSKQSSPLKIIYI-------DKDCKEITELLQNNDSLTL 98 + + ++S K I + + LLQ S + Sbjct: 72 VEIRRGETSRSKDIALAWLTVPKASSSEAERLLLQALSSKAI 113 >gi|17230688|ref|NP_487236.1| hypothetical protein asl3196 [Nostoc sp. PCC 7120] gi|29839730|sp|Q8YS95|Y3196_ANASP RecName: Full=UPF0235 protein asl3196 gi|17132291|dbj|BAB74895.1| asl3196 [Nostoc sp. PCC 7120] Length = 75 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PN+K+ IA D + + + + P GKAN+ ++ +LA+K + KS + Sbjct: 4 RVKVKPNSKQQKIA-------EQDDGSLTVHLKSPPVDGKANEELIKLLAEKFDVPKSHI 56 Query: 65 RMLSKQSSPLKIIYIDKD 82 + S SS K+I I+ D Sbjct: 57 TIKSGLSSKQKLIEIETD 74 >gi|320108335|ref|YP_004183925.1| hypothetical protein AciPR4_3175 [Terriglobus saanensis SP1PR4] gi|319926856|gb|ADV83931.1| protein of unknown function DUF167 [Terriglobus saanensis SP1PR4] Length = 104 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 7/77 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P AK+SG+ + +KI + A GKAN+A++ +A L + + S+ Sbjct: 19 AVRVQPGAKRSGVVGI-------YGEAVKIALVAPAVDGKANEALVRFVATLLDVPRMSV 71 Query: 65 RMLSKQSSPLKIIYIDK 81 +LS SS K++ + Sbjct: 72 EILSGVSSRSKVVKVLG 88 >gi|24656569|ref|NP_647784.1| CG14966 [Drosophila melanogaster] gi|7292328|gb|AAF47735.1| CG14966 [Drosophila melanogaster] gi|289526411|gb|ADD01328.1| RE68649p [Drosophila melanogaster] Length = 140 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 43/94 (45%), Gaps = 10/94 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + +++ A P +G+AN ++ L+K L L KS Sbjct: 52 IQILAKPGAKQNGITGIGFEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 104 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94 + + S KII I K + I +LL+ Sbjct: 105 VSLDKGSRSRNKIIMITKGVSTVEAIEQLLRKES 138 >gi|73951609|ref|XP_545872.2| PREDICTED: similar to CG14966-PA [Canis familiaris] Length = 152 Score = 80.9 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 20/94 (21%), Positives = 41/94 (43%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 62 VTIAIRAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 114 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNN 93 S + + S K++ + +EI E L+ Sbjct: 115 SDVVLDKGGKSREKVVKLLASTTAEEILEKLKQQ 148 >gi|237747224|ref|ZP_04577704.1| predicted protein [Oxalobacter formigenes HOxBLS] gi|229378575|gb|EEO28666.1| predicted protein [Oxalobacter formigenes HOxBLS] Length = 100 Score = 80.9 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 27/92 (29%), Positives = 54/92 (58%), Gaps = 10/92 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + V++IPNA+KS I +S+ ++I++ A P GKAN+A++ +LAKKL + + Sbjct: 14 RIAVQVIPNARKSEIV-------SSEGETLRIRLQAQPVDGKANEALVQLLAKKLRVPRK 66 Query: 63 SLRMLSKQSSPLKIIYI---DKDCKEITELLQ 91 + + ++ K++ + D+ ++I + LQ Sbjct: 67 QVSITHGLANKRKLLEVIVSDRSQEDIVKQLQ 98 >gi|54298708|ref|YP_125077.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris] gi|53752493|emb|CAH13925.1| hypothetical protein lpp2772 [Legionella pneumophila str. Paris] Length = 95 Score = 80.9 bits (199), Expect = 6e-14, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNAKK+ + ++ + I + A PQ+G+AN +L +++ + K Sbjct: 10 VEIAIYAKPNAKKTKLMAI-------SDDRLHIALHAKPQEGEANNELLFFISQFFKIPK 62 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + ++ +SS K+I + Sbjct: 63 TQIELIKGKSSRHKLIRL 80 >gi|283850632|ref|ZP_06367919.1| protein of unknown function DUF167 [Desulfovibrio sp. FW1012B] gi|283573875|gb|EFC21848.1| protein of unknown function DUF167 [Desulfovibrio sp. FW1012B] Length = 125 Score = 80.5 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 35/81 (43%), Gaps = 7/81 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + P + +A L +++++ A +G+AN + LA L L Sbjct: 40 LRVVVTPGGSRDTLAGL-------AEGRLRVRLRAKAVEGQANAGLTVFLAGCLGLRPRQ 92 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + ++S + S K + I + + Sbjct: 93 VAIVSGEKSRKKTLRISAESE 113 >gi|84996333|ref|XP_952888.1| proton translocating inorganic pyrophosphatase [Theileria annulata strain Ankara] gi|65303885|emb|CAI76264.1| proton translocating inorganic pyrophosphatase, putative [Theileria annulata] Length = 1204 Score = 80.5 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 16/96 (16%), Positives = 39/96 (40%), Gaps = 25/96 (26%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK------- 56 + V + P ++++ I + +++ A P++G+ NKA++ ++K Sbjct: 1098 LKVNVKPGSRQTQIIG-------ESEGRLSVQIAAPPREGECNKALIEFISKTRNFYYFL 1150 Query: 57 -----------LALSKSSLRMLSKQSSPLKIIYIDK 81 + + K ++ +L S KI+ I Sbjct: 1151 AFNTFNVLIFLVGVKKGNVTLLHGHKSRDKILSITG 1186 >gi|15604669|ref|NP_221187.1| hypothetical protein RP839 [Rickettsia prowazekii str. Madrid E] gi|6686136|sp|Q9ZCC0|Y839_RICPR RecName: Full=UPF0235 protein RP839 gi|3861364|emb|CAA15263.1| unknown [Rickettsia prowazekii] gi|292572500|gb|ADE30415.1| hypothetical protein rpr22_CDS819 [Rickettsia prowazekii Rp22] Length = 105 Score = 80.5 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 26/89 (29%), Positives = 51/89 (57%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P AK++ I + I + ++K+ + ATP++GKAN+ ++ LAK+ LS+ Sbjct: 14 ALINVKVKPYAKQNLIGNFVIINNIP---YIKLAIKATPEQGKANEGIIHYLAKEWELSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 SS+ ++ + LK I I ++ L+ Sbjct: 71 SSIEIIKGHTHSLKTILIKNINEDYLNLI 99 >gi|46446134|ref|YP_007499.1| hypothetical protein pc0500 [Candidatus Protochlamydia amoebophila UWE25] gi|46399775|emb|CAF23224.1| conserved hypothetical protein [Candidatus Protochlamydia amoebophila UWE25] Length = 92 Score = 80.5 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 24/89 (26%), Positives = 43/89 (48%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++IP A S E +KI++ A P KG+AN ++ L+ + KSS Sbjct: 11 IKIKVIPLASFSEKVGWE-------GDELKIRLAAIPDKGQANTELIRFLSSLFKIRKSS 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 ++++ Q+S K I I E ++L Sbjct: 64 IQLIQGQTSRHKKICIQDISLERLQVLLA 92 >gi|52842921|ref|YP_096720.1| hypothetical protein lpg2716 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630032|gb|AAU28773.1| hypothetical protein lpg2716 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 95 Score = 80.5 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNAKKS + ++ + I + A PQ+G+AN +L +++ + K Sbjct: 10 VEIAIYAKPNAKKSKLMAI-------SDDRLHIALHAKPQEGEANNELLFFISQFFKIPK 62 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + ++ +SS K+I + Sbjct: 63 TQIELIKGKSSRHKLIRL 80 >gi|330795787|ref|XP_003285952.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum] gi|325084041|gb|EGC37478.1| hypothetical protein DICPUDRAFT_23568 [Dictyostelium purpureum] Length = 110 Score = 80.5 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 48/88 (54%), Gaps = 7/88 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + PN+K++ I S E + ++++ P G+ANK ++ L+K+L L Sbjct: 23 ILKLNINVHPNSKENQIISFE-------NEILSLRISEPPIDGQANKGVVEFLSKELGLR 75 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITE 88 KS++++ S K I ID + + IT+ Sbjct: 76 KSNIQVSKGSKSRNKSIEIDLESESITK 103 >gi|52345734|ref|NP_001004913.1| chromosome 15 open reading frame 40 [Xenopus (Silurana) tropicalis] gi|49523245|gb|AAH75357.1| MGC89060 protein [Xenopus (Silurana) tropicalis] Length = 120 Score = 80.2 bits (197), Expect = 8e-14, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 38/93 (40%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ I + + + A P +G+AN + L+K L L K Sbjct: 30 VIISIHAKPGAKQNAITDVTADAVG-------VAIAAPPTEGEANAELCRYLSKVLVLKK 82 Query: 62 SSLRMLSKQSSPLKIIYIDKD--CKEITELLQN 92 S + + S K++ I + + E L+ Sbjct: 83 SEVSLDKGGKSREKVVKISASITPEVVLEKLKE 115 >gi|224062575|ref|XP_002197110.1| PREDICTED: hypothetical protein, partial [Taeniopygia guttata] Length = 138 Score = 80.2 bits (197), Expect = 8e-14, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 36/89 (40%), Gaps = 9/89 (10%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P A+ S + + + + A P +G+AN + L+K L + KS + + Sbjct: 56 KPGARCSAVTDVTAEAVG-------VAIAAPPSEGEANAELCRYLSKVLEVKKSDVILEK 108 Query: 69 KQSSPLKIIYI--DKDCKEITELLQNNDS 95 S K++ I EI E L+ S Sbjct: 109 GGKSRDKVVKISVSATPDEILEKLKKEAS 137 >gi|294676098|ref|YP_003576713.1| hypothetical protein RCAP_rcc00541 [Rhodobacter capsulatus SB 1003] gi|294474918|gb|ADE84306.1| protein of unknown function DUF167 [Rhodobacter capsulatus SB 1003] Length = 83 Score = 80.2 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 41/79 (51%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I ++ VT P+ GKAN A++ +LAK L ++K Sbjct: 13 AEIALRVTPKASRNEIV--------VAEDGLRAYVTVVPEGGKANAAVVKLLAKSLGVAK 64 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ +++ K+ +D Sbjct: 65 SRLTLIRGETARDKVFRLD 83 >gi|163737739|ref|ZP_02145156.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis BS107] gi|163744042|ref|ZP_02151409.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10] gi|161382658|gb|EDQ07060.1| hypothetical protein RG210_12581 [Phaeobacter gallaeciensis 2.10] gi|161389265|gb|EDQ13617.1| hypothetical protein RGBS107_19448 [Phaeobacter gallaeciensis BS107] Length = 98 Score = 80.2 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 1/76 (1%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ I + + + +KI VTA P+ GKA A+ A+LA+ + ++ Sbjct: 20 AEIPVRVTPKASRNAILPIPQAESGQ-GVSLKITVTAAPENGKATAAVQALLARAMRIAP 78 Query: 62 SSLRMLSKQSSPLKII 77 S L +L +S K+ Sbjct: 79 SDLELLRGATSRDKVF 94 >gi|86157514|ref|YP_464299.1| hypothetical protein Adeh_1087 [Anaeromyxobacter dehalogenans 2CP-C] gi|123499918|sp|Q2IPY3|Y1087_ANADE RecName: Full=UPF0235 protein Adeh_1087 gi|85774025|gb|ABC80862.1| protein of unknown function DUF167 [Anaeromyxobacter dehalogenans 2CP-C] Length = 95 Score = 80.2 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A ++ +KI++ A P G AN A++ LA L + + Sbjct: 11 AVLEILVQPRASRTRAVG-------EHDGRLKIQLAAPPVDGAANAALVEFLAVALGVRR 63 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + +L ++ K + + Sbjct: 64 ADVALLRGEAGRRKTVRV 81 >gi|195428527|ref|XP_002062324.1| GK17477 [Drosophila willistoni] gi|194158409|gb|EDW73310.1| GK17477 [Drosophila willistoni] Length = 127 Score = 80.2 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 41/88 (46%), Gaps = 7/88 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 39 IKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKYLSKVLGLRKSD 91 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S KII + K + L+ Sbjct: 92 VSLDKGSRSRNKIILVSKGVSTVEACLE 119 >gi|195160833|ref|XP_002021278.1| GL25246 [Drosophila persimilis] gi|198465041|ref|XP_001353470.2| GA13391 [Drosophila pseudoobscura pseudoobscura] gi|194118391|gb|EDW40434.1| GL25246 [Drosophila persimilis] gi|198149990|gb|EAL30981.2| GA13391 [Drosophila pseudoobscura pseudoobscura] Length = 125 Score = 80.2 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 25/93 (26%), Positives = 45/93 (48%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK +GI ++++ +++ A P +G+AN ++ L+K L L KS Sbjct: 37 IKILAKPGAKHNGITNIDLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 89 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93 + + S KII I K + I +LL+ Sbjct: 90 VSLDKGSRSRNKIILISKGASTVESIQQLLRKE 122 >gi|329893887|ref|ZP_08269938.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium IMCC3088] gi|328923406|gb|EGG30722.1| hypothetical protein IMCC3088_2481 [gamma proteobacterium IMCC3088] Length = 83 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 4/79 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P AK++ I +D +++ VT P+ GKAN+A++ LA + ++ Sbjct: 6 VSVRVTPKAKQAQI----KVEDIEGQSVLRVYVTVAPEDGKANRAVMRALADYFDVPRTR 61 Query: 64 LRMLSKQSSPLKIIYIDKD 82 +++LS K ID D Sbjct: 62 IKLLSGAKQRDKRFSIDAD 80 >gi|161723222|ref|NP_219898.2| hypothetical protein CT388 [Chlamydia trachomatis D/UW-3/CX] gi|162019865|ref|YP_328205.2| hypothetical protein CTA_0423 [Chlamydia trachomatis A/HAR-13] Length = 100 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 8/95 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ A+++ + LE ++++VT P+KGKAN A++A+LA L++ KS Sbjct: 8 LEVRVTTKARENRVVCLE-------DGILRVRVTEVPEKGKANDAVVALLANFLSIPKSD 60 Query: 64 LRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSLT 97 + +++ ++S K + + + K + E + S T Sbjct: 61 VTLIAGEASRRKKVLLPRSIKAFLLEQFPSESSST 95 >gi|221640262|ref|YP_002526524.1| hypothetical protein RSKD131_2163 [Rhodobacter sphaeroides KD131] gi|221161043|gb|ACM02023.1| Hypothetical Protein RSKD131_2163 [Rhodobacter sphaeroides KD131] Length = 85 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A ++ + +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 18 AVRVTPRAARAKVD--------LQEGVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69 Query: 65 RMLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 70 TLVRGATSRDKTFRLD 85 >gi|118353243|ref|XP_001009893.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila] gi|89291659|gb|EAR89647.1| hypothetical protein TTHERM_00161650 [Tetrahymena thermophila SB210] Length = 127 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 45/93 (48%), Gaps = 11/93 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + PN+K S I+ + + I + A P+ G+AN ++ +++ L + KSS Sbjct: 39 ISIHAKPNSKISQISGI-------SDEGVDINIAAPPKDGEANAELIDYISQVLGVKKSS 91 Query: 64 LRMLSKQSSPLKIIYID----KDCKEITELLQN 92 L + S K++ I D +E+ + L++ Sbjct: 92 LSLDKGGKSRNKLMEISDSGYADVEELYQALKD 124 >gi|307176568|gb|EFN66055.1| UPF0235 protein C15orf40-like protein [Camponotus floridanus] Length = 122 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 37/93 (39%), Gaps = 8/93 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P AK + I + I ++A P +G+AN ++ LA L K Sbjct: 34 VVIKIQAKPGAKCNNITDISDEAVG-------IAISAPPMEGEANAELVKYLASIFELRK 86 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 S++ + S K + + ++ L+ Sbjct: 87 SNVSLDRGSRSRQKTVTVSGITTDQVLAKLKGE 119 >gi|328866439|gb|EGG14823.1| hypothetical protein DFA_10696 [Dictyostelium fasciculatum] Length = 152 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 6/88 (6%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + V + PNAK+S I + ++++ P GKAN ++ L+ +L L Sbjct: 56 IVRLNVNVHPNAKQSTIVQFTDD------GCLDLRISQPPIDGKANDEVIDFLSDELKLK 109 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITE 88 K + + S K+I ID +T+ Sbjct: 110 KRFITVDKGLKSRNKVIAIDLSESSLTK 137 >gi|300087416|ref|YP_003757938.1| hypothetical protein Dehly_0296 [Dehalogenimonas lykanthroporepellens BL-DC-9] gi|299527149|gb|ADJ25617.1| protein of unknown function DUF167 [Dehalogenimonas lykanthroporepellens BL-DC-9] Length = 96 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 48/94 (51%), Gaps = 8/94 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ P + ++ I ++++VTA P+ GKAN+A+ +LA++L L K Sbjct: 10 AVIALKVQPGSGRNEITDTAA-------EIIRVRVTAAPEHGKANRAVAELLAERLGLPK 62 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK-EITELLQNND 94 S + ++ +S K+ + + E+ E L D Sbjct: 63 SRVTIVRGLTSRRKVAAVAGLSEAEVREKLGKTD 96 >gi|54295557|ref|YP_127972.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens] gi|53755389|emb|CAH16885.1| hypothetical protein lpl2644 [Legionella pneumophila str. Lens] Length = 95 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PNAKKS + ++ + I + A PQ+G+AN +L +++ + K Sbjct: 10 VEIAIYAKPNAKKSKLMAI-------SDDSLHIALHAKPQEGEANNELLFFISQFFKIPK 62 Query: 62 SSLRMLSKQSSPLKIIYI 79 + + ++ +SS K+I + Sbjct: 63 TQIELIKGKSSRHKLIRL 80 >gi|162456439|ref|YP_001618806.1| hypothetical protein sce8156 [Sorangium cellulosum 'So ce 56'] gi|161167021|emb|CAN98326.1| hypothetical protein sce8156 [Sorangium cellulosum 'So ce 56'] Length = 139 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 16/88 (18%), Positives = 39/88 (44%), Gaps = 7/88 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P + +S I + + + +TA P +G AN ++ +L++ L + K Sbjct: 52 VRISVQVRPKSSRSAIVGVR-------EGALDVSLTAPPVEGAANAELVKLLSRALDVRK 104 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89 S +++ S K++ + + Sbjct: 105 SDVQIALGASGRSKVVAVRGLKEAEARK 132 >gi|307102424|gb|EFN50700.1| hypothetical protein CHLNCDRAFT_142618 [Chlorella variabilis] Length = 127 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 9/92 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + V P +K + +++ V A P G+AN A++ +A+ L L + Sbjct: 43 CRIGVHAKPGSKVCSVT--------LGPDALEVAVDAKPVDGEANAALIEFVAEVLGLKR 94 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + S +S K++ + D + L+ Sbjct: 95 RDVTLASGTTSRHKVLAVAGIDAHAALQRLRQ 126 >gi|325982093|ref|YP_004294495.1| hypothetical protein NAL212_1446 [Nitrosomonas sp. AL212] gi|325531612|gb|ADZ26333.1| protein of unknown function DUF167 [Nitrosomonas sp. AL212] Length = 113 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 41/93 (44%), Gaps = 8/93 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + AK + A L +KI++ A P +GKAN ++ LA + + Sbjct: 17 LTLHIQTGAKITKAAGL-------LGGALKIRLAAAPVEGKANSTLIKFLAAQFDVPIGQ 69 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 +R+ S K+I I + + +L N + L Sbjct: 70 VRLKQGGKSRHKVIVIHRSVHD-PRVLFNIEGL 101 >gi|269127125|ref|YP_003300495.1| hypothetical protein Tcur_2915 [Thermomonospora curvata DSM 43183] gi|268312083|gb|ACY98457.1| protein of unknown function DUF167 [Thermomonospora curvata DSM 43183] Length = 91 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 7/92 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A ++ + + +KV A +GKA +A L +A L + + Sbjct: 4 VRVAIRVGPGASRTKVGGAH-------GEALVVKVAARAVEGKATEAALRAVADALGVRR 56 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +R++S +S K+I +D D + +TE + Sbjct: 57 RDVRLISGATSREKLIEVDGDERALTEKINAL 88 >gi|194866006|ref|XP_001971712.1| GG14280 [Drosophila erecta] gi|190653495|gb|EDV50738.1| GG14280 [Drosophila erecta] Length = 127 Score = 79.8 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 44/94 (46%), Gaps = 10/94 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 40 IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94 + + S KII I K + I ELL+ Sbjct: 93 VSLDKGSRSRNKIIMITKGVSTVEAIEELLRKES 126 >gi|126463218|ref|YP_001044332.1| hypothetical protein Rsph17029_2458 [Rhodobacter sphaeroides ATCC 17029] gi|126104882|gb|ABN77560.1| protein of unknown function DUF167 [Rhodobacter sphaeroides ATCC 17029] Length = 85 Score = 79.4 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A ++ + +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 18 AVRVTPRAARAKVD--------LQEGVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69 Query: 65 RMLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 70 TLVRGATSRDKTFRLD 85 >gi|315230060|ref|YP_004070496.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP] gi|315183088|gb|ADT83273.1| hypothetical protein TERMP_00296 [Thermococcus barophilus MP] Length = 92 Score = 79.4 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 9/90 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++V + PN K++ I ++ +K+KV+A P GKANK + L+K L Sbjct: 9 VILLVHVQPNTKRNSIEGVD-----KWKGRIKVKVSAPPVGGKANKELTKFLSKLLG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELL 90 + +L ++S K + I +E+ E L Sbjct: 61 KEVVILRGETSREKDLLIKGATIEEVKEKL 90 >gi|118794573|ref|XP_321597.3| AGAP001528-PA [Anopheles gambiae str. PEST] gi|116116359|gb|EAA01322.3| AGAP001528-PA [Anopheles gambiae str. PEST] Length = 147 Score = 79.4 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 12/97 (12%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + P AK SGI + ++ A P G+AN ++ L+K L L KS Sbjct: 55 VKILAKPGAKTSGITDVSEEGIG-------CQIAAPPIDGEANTELIKYLSKLLDLRKSD 107 Query: 64 LRMLSKQSSPLKIIYIDK-----DCKEITELLQNNDS 95 + + S K I +DK +++ + +N S Sbjct: 108 ISLDRGSKSRQKTIVLDKAGCRHSPEQLLVIFRNEAS 144 >gi|126178968|ref|YP_001046933.1| hypothetical protein Memar_1018 [Methanoculleus marisnigri JR1] gi|125861762|gb|ABN56951.1| protein of unknown function DUF167 [Methanoculleus marisnigri JR1] Length = 105 Score = 79.4 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 16/83 (19%), Positives = 36/83 (43%), Gaps = 4/83 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + AK+S +K ++ A GKAN+A+ +LA+ + + Sbjct: 15 VTITLDVTAGAKRSSF----PAGYNEWRKSIKCQIAAPAVGGKANRAITDLLAETFGVPR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84 + + +++ +S K + I K Sbjct: 71 ADVSIITGHTSSSKTVAIAGVSK 93 >gi|166154599|ref|YP_001654717.1| hypothetical protein CTL0644 [Chlamydia trachomatis 434/Bu] gi|166155474|ref|YP_001653729.1| hypothetical protein CTLon_0641 [Chlamydia trachomatis L2b/UCH-1/proctitis] gi|237802813|ref|YP_002888007.1| hypothetical protein JALI_3871 [Chlamydia trachomatis B/Jali20/OT] gi|237804735|ref|YP_002888889.1| hypothetical protein CTB_3871 [Chlamydia trachomatis B/TZ1A828/OT] gi|255311194|ref|ZP_05353764.1| hypothetical protein Ctra62_02010 [Chlamydia trachomatis 6276] gi|255317495|ref|ZP_05358741.1| hypothetical protein Ctra6_02000 [Chlamydia trachomatis 6276s] gi|255348753|ref|ZP_05380760.1| hypothetical protein Ctra70_02050 [Chlamydia trachomatis 70] gi|255503293|ref|ZP_05381683.1| hypothetical protein Ctra7_02060 [Chlamydia trachomatis 70s] gi|255506972|ref|ZP_05382611.1| hypothetical protein CtraD_02040 [Chlamydia trachomatis D(s)2923] gi|301335866|ref|ZP_07224110.1| hypothetical protein CtraL_03535 [Chlamydia trachomatis L2tet1] gi|29839454|sp|O84393|Y388_CHLTR RecName: Full=UPF0235 protein CT_388 gi|123606912|sp|Q3KLW5|Y423_CHLTA RecName: Full=UPF0235 protein CTA_0423 gi|226707987|sp|B0BC23|Y641_CHLTB RecName: Full=UPF0235 protein CTLon_0641 gi|226707989|sp|B0B7V8|Y644_CHLT2 RecName: Full=UPF0235 protein CTL0644 gi|3328814|gb|AAC67985.1| hypothetical protein CT_388 [Chlamydia trachomatis D/UW-3/CX] gi|76167649|gb|AAX50657.1| hypothetical cytosolic protein [Chlamydia trachomatis A/HAR-13] gi|165930587|emb|CAP04084.1| conserved hypothetical protein [Chlamydia trachomatis 434/Bu] gi|165931462|emb|CAP07038.1| conserved hypothetical protein [Chlamydia trachomatis L2b/UCH-1/proctitis] gi|231273035|emb|CAX09948.1| conserved hypothetical protein [Chlamydia trachomatis B/TZ1A828/OT] gi|231274047|emb|CAX10841.1| conserved hypothetical protein [Chlamydia trachomatis B/Jali20/OT] gi|289525430|emb|CBJ14907.1| conserved hypothetical protein [Chlamydia trachomatis Sweden2] gi|296434982|gb|ADH17160.1| hypothetical protein E150_02035 [Chlamydia trachomatis E/150] gi|296435909|gb|ADH18083.1| hypothetical protein G9768_02005 [Chlamydia trachomatis G/9768] gi|296436835|gb|ADH19005.1| hypothetical protein G11222_02005 [Chlamydia trachomatis G/11222] gi|296437769|gb|ADH19930.1| hypothetical protein G11074_02005 [Chlamydia trachomatis G/11074] gi|296438702|gb|ADH20855.1| hypothetical protein E11023_02020 [Chlamydia trachomatis E/11023] gi|297140269|gb|ADH97027.1| hypothetical protein CTG9301_02010 [Chlamydia trachomatis G/9301] gi|297748518|gb|ADI51064.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-EC] gi|297749398|gb|ADI52076.1| Hypothetical cytosolic protein [Chlamydia trachomatis D-LC] Length = 115 Score = 79.4 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 8/95 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ A+++ + LE ++++VT P+KGKAN A++A+LA L++ KS Sbjct: 23 LEVRVTTKARENRVVCLE-------DGILRVRVTEVPEKGKANDAVVALLANFLSIPKSD 75 Query: 64 LRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSLT 97 + +++ ++S K + + + K + E + S T Sbjct: 76 VTLIAGEASRRKKVLLPRSIKAFLLEQFPSESSST 110 >gi|237858662|ref|NP_653198.2| hypothetical protein LOC123207 isoform a [Homo sapiens] gi|119582835|gb|EAW62431.1| chromosome 15 open reading frame 40, isoform CRA_a [Homo sapiens] Length = 153 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 116 SDVVLDKGGKSREKVVKLLASTTPEEI 142 >gi|119493536|ref|ZP_01624202.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106] gi|119452653|gb|EAW33834.1| hypothetical protein L8106_18212 [Lyngbya sp. PCC 8106] Length = 81 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 40/82 (48%), Gaps = 7/82 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + V++ PN+K+ + E I + + P GKAN+ ++ +LAK+ + Sbjct: 1 MVVLHVKVKPNSKQQSMVKNE-------EGTYIIHLKSPPIDGKANQELIKILAKQFNIP 53 Query: 61 KSSLRMLSKQSSPLKIIYIDKD 82 KS + + S SS K+I + Sbjct: 54 KSQVSIKSGLSSKNKLIELPDS 75 >gi|218246502|ref|YP_002371873.1| hypothetical protein PCC8801_1666 [Cyanothece sp. PCC 8801] gi|257059535|ref|YP_003137423.1| hypothetical protein Cyan8802_1684 [Cyanothece sp. PCC 8802] gi|218166980|gb|ACK65717.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8801] gi|256589701|gb|ACV00588.1| protein of unknown function DUF167 [Cyanothece sp. PCC 8802] Length = 72 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 23/76 (30%), Positives = 41/76 (53%), Gaps = 7/76 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PNAKK I + + + + + P +GKAN+ ++ +LAKK +S+S + Sbjct: 4 QVKVKPNAKKQKI-------QEEEDGSLVVYLKSPPIEGKANQELIKLLAKKFGVSQSQI 56 Query: 65 RMLSKQSSPLKIIYID 80 + S SS K + I+ Sbjct: 57 SIKSGLSSRNKWVEIE 72 >gi|332187012|ref|ZP_08388753.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17] gi|332013022|gb|EGI55086.1| hypothetical protein SUS17_2027 [Sphingomonas sp. S17] Length = 84 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 15/85 (17%), Positives = 39/85 (45%), Gaps = 7/85 (8%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P + + H +++A P G AN A++ ++AK ++K ++ +++ Sbjct: 2 TPRGGRDMLT-------AGTEDHFAARLSAPPVDGAANAALVPLVAKHFGVAKRAVTIVA 54 Query: 69 KQSSPLKIIYIDKDCKEITELLQNN 93 +++ LK ++I D + + + Sbjct: 55 GETARLKRLHIAGDPHILARMAEAL 79 >gi|56695839|ref|YP_166190.1| hypothetical protein SPO0937 [Ruegeria pomeroyi DSS-3] gi|56677576|gb|AAV94242.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 92 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 37/78 (47%), Gaps = 8/78 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P A + + ++I VTA P+ GKAN+A+ +LA+ + ++ S Sbjct: 22 IALRVTPKAARDSVT--------LAGEGLRITVTAPPEDGKANEAVRKLLARAMGVAPSR 73 Query: 64 LRMLSKQSSPLKIIYIDK 81 L + Q++ K Sbjct: 74 LTLRRGQTARDKTFVYLG 91 >gi|195491333|ref|XP_002093518.1| GE20708 [Drosophila yakuba] gi|194179619|gb|EDW93230.1| GE20708 [Drosophila yakuba] Length = 128 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 44/94 (46%), Gaps = 10/94 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 40 IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94 + + S KII I K + I +LL+ Sbjct: 93 VSLDKGSRSRNKIIMITKGVSTVEAIEQLLRKES 126 >gi|149177442|ref|ZP_01856046.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797] gi|148843775|gb|EDL58134.1| hypothetical protein PM8797T_19111 [Planctomyces maris DSM 8797] Length = 104 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 26/76 (34%), Positives = 41/76 (53%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR P + K+GI + +K+ VT P+KGKANKA+L +L L L +S Sbjct: 16 LPVRAQPRSSKNGIEGVH-------DGRLKVCVTQVPEKGKANKALLKVLQTALKLKRSQ 68 Query: 64 LRMLSKQSSPLKIIYI 79 + + +++ LKI I Sbjct: 69 IELYKGETAALKIFRI 84 >gi|159038970|ref|YP_001538223.1| hypothetical protein Sare_3430 [Salinispora arenicola CNS-205] gi|157917805|gb|ABV99232.1| protein of unknown function DUF167 [Salinispora arenicola CNS-205] Length = 107 Score = 79.0 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 3/87 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + + + + + VTA P G+AN+A LA L + ++ Sbjct: 18 VAVRVKPGSSRIRVGG---RYVGPHGPALIVAVTAPPVDGRANEAARRALADALGVRSAA 74 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + + K +D +IT L Sbjct: 75 VSLEAGATGRNKTFRVDGVAADITRTL 101 >gi|77464376|ref|YP_353880.1| hypothetical protein RSP_0800 [Rhodobacter sphaeroides 2.4.1] gi|77388794|gb|ABA79979.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 85 Score = 78.6 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 8/76 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A ++ + +++ VT P+ GKAN+A+ LAK L ++KS L Sbjct: 18 AVRVTPRAARAKVE--------LQEGVVRVHVTCVPEDGKANRAVTEALAKALGVAKSRL 69 Query: 65 RMLSKQSSPLKIIYID 80 ++ +S K +D Sbjct: 70 TLVRGATSRDKTFRLD 85 >gi|296204187|ref|XP_002749223.1| PREDICTED: UPF0235 protein C15orf40-like [Callithrix jacchus] Length = 154 Score = 78.6 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGSKQNAVTDLTA-------EAINVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 117 SDVVLDKGCKSREKVVKLLASTTPEEI 143 >gi|195587403|ref|XP_002083454.1| GD13347 [Drosophila simulans] gi|194195463|gb|EDX09039.1| GD13347 [Drosophila simulans] Length = 128 Score = 78.6 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 24/94 (25%), Positives = 44/94 (46%), Gaps = 10/94 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 40 IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94 + + S KII I K + I ++L+ Sbjct: 93 VSLDKGSRSRNKIIMITKGASTVEAIEQMLRKES 126 >gi|209525443|ref|ZP_03273983.1| protein of unknown function DUF167 [Arthrospira maxima CS-328] gi|209494123|gb|EDZ94438.1| protein of unknown function DUF167 [Arthrospira maxima CS-328] Length = 73 Score = 78.6 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M +++ PN+ + I + + + + P GKAN+ ++ +LAKKL + Sbjct: 1 MAFFNIQVKPNSPQQLI-------RKEADGSLTVYLKSPPVDGKANQELIKLLAKKLDVP 53 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS++++ S SS K++ I Sbjct: 54 KSNIKIKSGLSSRRKLVEIS 73 >gi|87311293|ref|ZP_01093415.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM 3645] gi|87286033|gb|EAQ77945.1| hypothetical protein DSM3645_27241 [Blastopirellula marina DSM 3645] Length = 100 Score = 78.6 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR +P +KK+ I +K+ VTA P+ GKANKA++ +LAKKL L K Sbjct: 11 VILPVRALPGSKKNEIRG-------EQQGALKVSVTAAPEDGKANKAIVELLAKKLVLRK 63 Query: 62 SSLRMLSKQSSPLKIIYI 79 S L +++ + K + + Sbjct: 64 SQLEIIAGHTHRQKRVLV 81 >gi|294916777|ref|XP_002778391.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] gi|239886749|gb|EER10186.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] Length = 132 Score = 78.6 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 20/95 (21%), Positives = 46/95 (48%), Gaps = 9/95 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAK-KLALS 60 + +R P AK S + ++ + +++ A + G+AN+ +L+ L+K L + Sbjct: 41 ARIAIRAKPGAKVSCLTGIDA------EGALGVQLNAPARDGEANEELLSFLSKEVLGVK 94 Query: 61 KSSLRMLSKQSSPLKIIYIDK--DCKEITELLQNN 93 K + ++ S K++ I ++++ LL+N Sbjct: 95 KKDVALVQGSKSREKVVEIADVLTVEDVSRLLRNE 129 >gi|328717531|ref|XP_003246233.1| PREDICTED: UPF0235 protein C15orf40 homolog [Acyrthosiphon pisum] Length = 136 Score = 78.6 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 39/95 (41%), Gaps = 10/95 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK + I + +++ A P G+AN ++ L+K L L K Sbjct: 45 VVIKINAKPGAKNNNITDISSDGIG-------VQINAPPTDGEANAELIKYLSKVLGLRK 97 Query: 62 SSLRMLSKQSSPLKIIYIDKDC---KEITELLQNN 93 S L + S KI+ + + ITE ++ Sbjct: 98 SDLSLDRGSRSRNKILIVHNTSLGIEGITEKIKEE 132 >gi|270308462|ref|YP_003330520.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS] gi|270154354|gb|ACZ62192.1| hypothetical protein DhcVS_1075 [Dehalococcoides sp. VS] Length = 97 Score = 78.6 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 41/76 (53%), Gaps = 7/76 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +++ P+++++ ++ E +K+++ A P+KGKANK ++ L++ L K+ Sbjct: 9 RVNLKIFPSSQRNELSGYE-------NGLLKLRIAAQPEKGKANKELIDYLSELLDTPKA 61 Query: 63 SLRMLSKQSSPLKIIY 78 + + + K++ Sbjct: 62 EIEICRGHTGRNKVLV 77 >gi|163783198|ref|ZP_02178192.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1] gi|159881532|gb|EDP75042.1| hypothetical protein HG1285_14279 [Hydrogenivirga sp. 128-5-R1-1] Length = 73 Score = 78.2 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 48/79 (60%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P++K+ G+ + ++++V+A P++GKAN+ ++ +LAK + K Sbjct: 1 MKLKVKVKPSSKREGVREVSP-------GELEVRVSAPPERGKANERLIELLAKHYGVRK 53 Query: 62 SSLRMLSKQSSPLKIIYID 80 ++R+L ++S K++ ID Sbjct: 54 GAVRILRGETSREKVVEID 72 >gi|328541881|ref|YP_004301990.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1] gi|326411632|gb|ADZ68695.1| hypothetical protein SL003B_0257 [Polymorphum gilvum SL003B-26A1] Length = 106 Score = 78.2 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 25/79 (31%), Positives = 40/79 (50%), Gaps = 2/79 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VRL P A K + D + +V A P+KG AN A+ A++AK L + KS+ Sbjct: 17 VDVRLTPRAGKDAVEGCSELSDGRP--VVLARVRAIPEKGAANAALEALIAKALGVPKSA 74 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + + + + LK + + D Sbjct: 75 VAIDAGAGARLKTLKVSGD 93 >gi|255022260|ref|ZP_05294254.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC 51756] gi|254968316|gb|EET25884.1| hypothetical protein ACA_0401 [Acidithiobacillus caldus ATCC 51756] Length = 115 Score = 78.2 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 17/77 (22%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ P A+ + +KI++ A G AN A+L+ LA++L L Sbjct: 15 LTVQVQPGARDDCVVGYH-------GDALKIRLRARAVDGAANAALLSFLARRLDLGPGQ 67 Query: 64 LRMLSKQSSPLKIIYID 80 + + S K++ + Sbjct: 68 VVLRHGTHSRRKVLVLS 84 >gi|307152217|ref|YP_003887601.1| hypothetical protein Cyan7822_2348 [Cyanothece sp. PCC 7822] gi|306982445|gb|ADN14326.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7822] Length = 73 Score = 78.2 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 27/78 (34%), Positives = 42/78 (53%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ PNAK+ I LE + I + + P GKAN+ ++ +LAKK +SK Sbjct: 1 MKIQVKVKPNAKQQKIEELE-------DGSLVISLKSPPVDGKANEELIKLLAKKYQVSK 53 Query: 62 SSLRMLSKQSSPLKIIYI 79 S + + S SS K+I I Sbjct: 54 SQISIQSGLSSRNKLIEI 71 >gi|196231015|ref|ZP_03129875.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428] gi|196224845|gb|EDY19355.1| protein of unknown function DUF167 [Chthoniobacter flavus Ellin428] Length = 93 Score = 78.2 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 43/88 (48%), Gaps = 7/88 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R++PNA++S + + +K+KV A GKAN+A+ LA+ L + Sbjct: 5 AILRLRIVPNARRSEVVGVH-------GDAVKVKVQAPAMDGKANEALRDFLAEVLTVPA 57 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89 ++ +++ + S K++ I + Sbjct: 58 RAVEIVAGEKSRDKVVAIADLETDEARR 85 >gi|145538297|ref|XP_001454854.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124422631|emb|CAK87457.1| unnamed protein product [Paramecium tetraurelia] Length = 106 Score = 78.2 bits (192), Expect = 3e-13, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 42/90 (46%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ PN+K S I + + I + A P+ G+AN + +A+ L + K++ Sbjct: 20 LVINAKPNSKVSQITGI-------SDEAVDINIAAPPKDGEANAELCDFVAQTLGVKKTA 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ K++ I+ K+I + + Sbjct: 73 IQVNKGGKGRNKLVSIESKFKDINDFFEKL 102 >gi|29839587|sp|Q8WUR7|CO040_HUMAN RecName: Full=UPF0235 protein C15orf40 gi|18043732|gb|AAH19820.1| Chromosome 15 open reading frame 40 [Homo sapiens] gi|189053288|dbj|BAG35094.1| unnamed protein product [Homo sapiens] Length = 126 Score = 77.9 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 88 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 89 SDVVLDKGGKSREKVVKLLASTTPEEI 115 >gi|258597870|ref|XP_001348716.2| conserved protein, unknown function [Plasmodium falciparum 3D7] gi|255528895|gb|AAN37155.2| conserved protein, unknown function [Plasmodium falciparum 3D7] Length = 147 Score = 77.9 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 40/91 (43%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PNAKK+ I D + I + P ++N A++ + L L K Sbjct: 61 ITLRVKPNAKKTSI------YFNQDKEVLNINIQEQPVNNQSNVAIIGYFSDILNLKKRD 114 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + ++S S K++ + ++ ++ N Sbjct: 115 ISIVSGLKSRDKVLMVSNISLDDLNNKIEEN 145 >gi|225849498|ref|YP_002729663.1| hypothetical protein SULAZ_1705 [Sulfurihydrogenibium azorense Az-Fu1] gi|225644124|gb|ACN99174.1| conserved domain protein [Sulfurihydrogenibium azorense Az-Fu1] Length = 72 Score = 77.9 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 42/79 (53%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P + K+ + +E +++ T P+KGKAN ++ ++++ L + K Sbjct: 1 MRIKVKVKPGSSKNEVKKIE-------ENFYEVRCTTIPEKGKANDKVIELMSEYLDVPK 53 Query: 62 SSLRMLSKQSSPLKIIYID 80 S ++++ SS K I I+ Sbjct: 54 SRIKIIKGHSSREKEIEIE 72 >gi|15678665|ref|NP_275780.1| hypothetical protein MTH637 [Methanothermobacter thermautotrophicus str. Delta H] gi|29839449|sp|O26734|Y637_METTH RecName: Full=UPF0235 protein MTH_637 gi|2621719|gb|AAB85143.1| conserved protein [Methanothermobacter thermautotrophicus str. Delta H] Length = 104 Score = 77.9 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P + K GI P +++K+ + PQKGKAN+ ++ ++ Sbjct: 16 VNIEVSPASGKFGI-----PSYNEWRKRIEVKIHSPPQKGKANREIIKEFSETFG---RD 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++S Q S K I I +++ L + Sbjct: 68 VEIVSGQKSRQKTIRIQGMGRDLFLKLVSE 97 >gi|149195332|ref|ZP_01872419.1| hypothetical protein CMTB2_08545 [Caminibacter mediatlanticus TB-2] gi|149134524|gb|EDM23013.1| hypothetical protein CMTB2_08545 [Caminibacter mediatlanticus TB-2] Length = 94 Score = 77.9 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 45/88 (51%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+ PN+ K+ IA L +KI + A +G ANK ++ L+K +SK+ Sbjct: 12 LNVKAQPNSSKNKIAGLY------GEDAIKINIKAPAVEGAANKELIKFLSKMFKVSKND 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + ++ ++S K I I +++ E ++ Sbjct: 66 I-IIKGETSKKKQI-IMPINEKVKEFIE 91 >gi|195337079|ref|XP_002035160.1| GM14073 [Drosophila sechellia] gi|194128253|gb|EDW50296.1| GM14073 [Drosophila sechellia] Length = 128 Score = 77.9 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 10/94 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 40 IQILAKPGAKQNGITGIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 92 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94 + + S K I I K + I ++L+ Sbjct: 93 VSLDKGSRSRNKKIMITKGVSTVEAIEQMLRKES 126 >gi|242021057|ref|XP_002430963.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212516183|gb|EEB18225.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 117 Score = 77.5 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 41/93 (44%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK + I ++ +++ A P G+AN ++ ++ L L K+ Sbjct: 30 LQIFAKPGAKTNAITGIDEEGIG-------VQINARPVDGEANSELVNYMSCLLGLRKTE 82 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93 + + S KI+ I K +EI E L+N Sbjct: 83 ISLEKGSKSRQKILLISKKDLSTEEIIEKLKNE 115 >gi|20150521|pdb|1JRM|A Chain A, Nmr Structure Of Mth0637. Ontario Centre For Structural Proteomics Target Mth0637_1_104; Northeast Structural Genomics Target Tt135 Length = 104 Score = 77.5 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 21/90 (23%), Positives = 41/90 (45%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P + K GI P +++K+ + PQKGKAN+ ++ ++ Sbjct: 16 VNIEVSPASGKFGI-----PSYNEWRKRIEVKIHSPPQKGKANREIIKEFSETFG---RD 67 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + ++S Q S K I I +++ L + Sbjct: 68 VEIVSGQKSRQKTIRIQGMGRDLFLKLVSE 97 >gi|66813694|ref|XP_641026.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4] gi|74855711|sp|Q54UW1|U235_DICDI RecName: Full=UPF0235 protein gi|60469052|gb|EAL67049.1| hypothetical protein DDB_G0280783 [Dictyostelium discoideum AX4] Length = 124 Score = 77.5 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 24/88 (27%), Positives = 45/88 (51%), Gaps = 7/88 (7%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + V + PN+K+S I S E + ++++ P GKAN ++ L+K+L + Sbjct: 31 IIKINVNVHPNSKESSIVSFEDQ-------ILSLRISEPPIDGKANIGVIEFLSKELNIR 83 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITE 88 KS++ + S K + ID + IT+ Sbjct: 84 KSNIEVGKGSKSRNKSVEIDISSENITK 111 >gi|296165029|ref|ZP_06847584.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295899677|gb|EFG79128.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 77 Score = 77.5 bits (190), Expect = 5e-13, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 36/78 (46%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P ++K + + + I V GKAN A+ +LA L L +S Sbjct: 6 VAVRVKPGSRKGPLV------EEGPNGELTIYVRERAVDGKANDAVTGLLAAHLELPRSR 59 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++S ++S LK + Sbjct: 60 VELISGRTSRLKRFRVSG 77 >gi|268567307|ref|XP_002639944.1| Hypothetical protein CBG10764 [Caenorhabditis briggsae] Length = 258 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKSG+ ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 35 LRIHAKPGAKKSGVVAIN-------ESEIDVAIGAAPREGAANEELVSYLMSALGLRKNE 87 Query: 64 LRMLSKQSSPLKIIYID 80 L+ S K++ I+ Sbjct: 88 LQFDKGAKSRSKVVLIE 104 >gi|158520306|ref|YP_001528176.1| hypothetical protein Dole_0289 [Desulfococcus oleovorans Hxd3] gi|226701656|sp|A8ZS81|Y289_DESOH RecName: Full=UPF0235 protein Dole_0289 gi|158509132|gb|ABW66099.1| protein of unknown function DUF167 [Desulfococcus oleovorans Hxd3] Length = 103 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 7/76 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P + K+ + +KIK+TA P G+AN+ +A L++ L + K+S+ + Sbjct: 17 VAPRSSKNMVVGAH-------DNALKIKITAPPVDGRANEMCVAFLSRLLGIPKTSITIA 69 Query: 68 SKQSSPLKIIYIDKDC 83 + +S K + + Sbjct: 70 AGAASKRKEVCLALSP 85 >gi|22299537|ref|NP_682784.1| hypothetical protein tsr1994 [Thermosynechococcus elongatus BP-1] gi|29839707|sp|Q8DHG5|Y1994_THEEB RecName: Full=UPF0235 protein tsr1994 gi|22295720|dbj|BAC09546.1| tsr1994 [Thermosynechococcus elongatus BP-1] Length = 74 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 40/79 (50%), Gaps = 7/79 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M V + PNA+++ ++ + + + V A G+AN+ ++A+LA + Sbjct: 1 MGKKHVIVKPNARQASVS-------ITPAGQLLVTVRAPASDGQANQELIALLAAYFGVP 53 Query: 61 KSSLRMLSKQSSPLKIIYI 79 KS ++++ +S K+I + Sbjct: 54 KSRIQLVKGHTSRHKVIEL 72 >gi|309361190|emb|CAP30075.2| hypothetical protein CBG_10764 [Caenorhabditis briggsae AF16] Length = 266 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 38/77 (49%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKSG+ ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 42 LRIHAKPGAKKSGVVAIN-------ESEIDVAIGAAPREGAANEELVSYLMSALGLRKNE 94 Query: 64 LRMLSKQSSPLKIIYID 80 L+ S K++ I+ Sbjct: 95 LQFDKGAKSRSKVVLIE 111 >gi|327540387|gb|EGF26973.1| protein containing DUF167 [Rhodopirellula baltica WH47] Length = 87 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 43/74 (58%), Gaps = 7/74 (9%) Query: 7 RLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRM 66 R+ P AKK+ + L +K+ V P+ GKANKA++A LAK L +SK + + Sbjct: 2 RVTPKAKKASVGGLH-------DGALKVSVHMVPEDGKANKAVIASLAKWLRVSKGRVAI 54 Query: 67 LSKQSSPLKIIYID 80 ++ ++S LK I ++ Sbjct: 55 VAGETSRLKTIVVE 68 >gi|73748976|ref|YP_308215.1| hypothetical protein cbdb_A1230 [Dehalococcoides sp. CBDB1] gi|147669743|ref|YP_001214561.1| hypothetical protein DehaBAV1_1103 [Dehalococcoides sp. BAV1] gi|289432972|ref|YP_003462845.1| hypothetical protein DehalGT_1029 [Dehalococcoides sp. GT] gi|123619917|sp|Q3ZYH5|Y1230_DEHSC RecName: Full=UPF0235 protein cbdbA1230 gi|189038739|sp|A5FQ39|Y1103_DEHSB RecName: Full=UPF0235 protein DehaBAV1_1103 gi|73660692|emb|CAI83299.1| conserved hypothetical protein [Dehalococcoides sp. CBDB1] gi|146270691|gb|ABQ17683.1| protein of unknown function DUF167 [Dehalococcoides sp. BAV1] gi|288946692|gb|ADC74389.1| protein of unknown function DUF167 [Dehalococcoides sp. GT] Length = 97 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 21/75 (28%), Positives = 41/75 (54%), Gaps = 7/75 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +++IP+A+K+ +A E +K+K+ A P+KGKANK ++ L+ L K+ Sbjct: 9 KVNLKIIPSARKNELAGYE-------NGLLKLKIAAQPEKGKANKELIDYLSDLLDTPKA 61 Query: 63 SLRMLSKQSSPLKII 77 + + + K++ Sbjct: 62 EIEICHGHTGRNKVL 76 >gi|145543083|ref|XP_001457228.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124425043|emb|CAK89831.1| unnamed protein product [Paramecium tetraurelia] Length = 106 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 42/90 (46%), Gaps = 7/90 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ PN+K S I + + + + A P+ G+AN + +A+ L + K++ Sbjct: 20 LVINAKPNSKVSQITGI-------SDEAVDVNIAAPPKDGEANAELCDFVAQTLGVKKTA 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 +++ K+I I+ K+I E + Sbjct: 73 IQVQKGGKGRNKLIKIESKFKDINEFYEKL 102 >gi|145219218|ref|YP_001129927.1| hypothetical protein Cvib_0403 [Prosthecochloris vibrioformis DSM 265] gi|189040268|sp|A4SD66|Y403_PROVI RecName: Full=UPF0235 protein Cvib_0403 gi|145205382|gb|ABP36425.1| protein of unknown function DUF167 [Chlorobium phaeovibrioides DSM 265] Length = 100 Score = 77.5 bits (190), Expect = 6e-13, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 7/84 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P A K+ ++ +KI + A P AN+ + A ++ + Sbjct: 16 SVRVQPRASKTAVSG-------PYAGGLKITLKAAPVDDAANRECCRLFAGMFGIADGRV 68 Query: 65 RMLSKQSSPLKIIYIDKDCKEITE 88 ++S +SS K + ++ E Sbjct: 69 HVVSGRSSRSKSVMLEGVSSREAE 92 >gi|198282397|ref|YP_002218718.1| hypothetical protein Lferr_0253 [Acidithiobacillus ferrooxidans ATCC 53993] gi|198246918|gb|ACH82511.1| protein of unknown function DUF167 [Acidithiobacillus ferrooxidans ATCC 53993] Length = 113 Score = 77.5 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 8/88 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A + I +K+ VTA P+ G+A M+ LA + ++ S++ ++ Sbjct: 32 KPSAGRDAIG-------KPKGAQIKVSVTAEPRNGRATDHMVRFLAGEFGVAPSAIEVVF 84 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDSL 96 + + K + I + + Q SL Sbjct: 85 GRMNVNKQLRIKA-PTHLPPVFQVQASL 111 >gi|196001449|ref|XP_002110592.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens] gi|190586543|gb|EDV26596.1| hypothetical protein TRIADDRAFT_54758 [Trichoplax adhaerens] Length = 132 Score = 77.1 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 45/95 (47%), Gaps = 11/95 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P +K++ + + I++ A ++G+AN ++ L+ L + K Sbjct: 41 VLITIKAKPGSKENAVTDISSDGIG-------IQIAAPAREGEANSELIKFLSSILKVKK 93 Query: 62 SSLRMLSKQSSPLKIIYIDKDC----KEITELLQN 92 SS+ + S K I ++K+ K++ ELLQ Sbjct: 94 SSILLDKGSKSRHKTICVNKNADLTEKQVLELLQE 128 >gi|145253637|ref|XP_001398331.1| yggU family protein [Aspergillus niger CBS 513.88] gi|134083900|emb|CAK48804.1| unnamed protein product [Aspergillus niger] Length = 126 Score = 77.1 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 35/78 (44%), Gaps = 5/78 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + PNA + T + + V A P+KG+AN A+ +LA+ + KS Sbjct: 27 QISCHVKPNASSNR-----EGITAIGTDRVDVCVAAVPRKGEANAAVSRVLAQIFQVPKS 81 Query: 63 SLRMLSKQSSPLKIIYID 80 ++ ++ S K + I Sbjct: 82 NVEVIRGLKSREKTLAIS 99 >gi|195376389|ref|XP_002046979.1| GJ12187 [Drosophila virilis] gi|194154137|gb|EDW69321.1| GJ12187 [Drosophila virilis] Length = 124 Score = 77.1 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 36 IKILAKPGAKQNGITDIGLDGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 88 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93 + + S KI+ + K + I +LL+ Sbjct: 89 VSLDKGSRSRNKIVLVTKGASTVEAIEQLLRKE 121 >gi|156373856|ref|XP_001629526.1| predicted protein [Nematostella vectensis] gi|156216528|gb|EDO37463.1| predicted protein [Nematostella vectensis] Length = 133 Score = 77.1 bits (189), Expect = 7e-13, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + P AK++ I L +++ A P++G+AN ++ ++ + KSS Sbjct: 44 VKIHAKPGAKQNRITELSPDFVG-------VQIAAQPKEGEANDELVRYMSSVFGVKKSS 96 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITEL 89 + + S KII I + + Sbjct: 97 VTLDKGAKSRDKIIRISSSGLTLKQA 122 >gi|318041795|ref|ZP_07973751.1| hypothetical protein SCB01_08799 [Synechococcus sp. CB0101] Length = 97 Score = 77.1 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 41/88 (46%), Gaps = 8/88 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V +R+ P A + + L + I + A P G AN+A+L ++A++L +S ++ Sbjct: 12 VAIRVQPRASRERVLGLR-------GEAIAIALKAPPVDGAANEALLKLIARQLKVSAAA 64 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ S K I + ++ L Sbjct: 65 VELVRGASGRSKWIRVAGWSADQVRAAL 92 >gi|296134781|ref|YP_003642023.1| protein of unknown function DUF167 [Thiomonas intermedia K12] gi|294338737|emb|CAZ87069.1| conserved hypothetical protein [Thiomonas sp. 3As] gi|295794903|gb|ADG29693.1| protein of unknown function DUF167 [Thiomonas intermedia K12] Length = 106 Score = 77.1 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 8/88 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A I +K+ VTA P+ G+A M+ LAK+ + S++ ++ Sbjct: 26 KPSAGVDAIG-------KPKGPQLKVSVTAAPRAGRATDHMVRFLAKEFGVPTSAIEVVF 78 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDSL 96 + + K + I +++ + Q L Sbjct: 79 GRMNVNKQLRIHA-PQKLPAVFQQTSLL 105 >gi|120403830|ref|YP_953659.1| hypothetical protein Mvan_2846 [Mycobacterium vanbaalenii PYR-1] gi|166200356|sp|A1T903|Y2846_MYCVP RecName: Full=UPF0235 protein Mvan_2846 gi|119956648|gb|ABM13653.1| protein of unknown function DUF167 [Mycobacterium vanbaalenii PYR-1] Length = 75 Score = 77.1 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 35/76 (46%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P +KK + +T + + V GKAN A++ +LA+ + +S Sbjct: 5 VSVRVKPGSKKGPLV------ETGPDGELTVYVRERAVDGKANAAVIRVLAEHFGVPRSL 58 Query: 64 LRMLSKQSSPLKIIYI 79 + + SS +K I Sbjct: 59 VELTGGASSRIKRFRI 74 >gi|88602392|ref|YP_502570.1| hypothetical protein Mhun_1102 [Methanospirillum hungatei JF-1] gi|88187854|gb|ABD40851.1| conserved hypothetical protein [Methanospirillum hungatei JF-1] Length = 117 Score = 77.1 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 39/82 (47%), Gaps = 4/82 (4%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + +KKS L I V A P +GKANKA++ ++A L + Sbjct: 32 ITIEVSAGSKKS----LFPDGYNPWRKAFGIAVKAPPVEGKANKAIMELIAGYFHLPVHA 87 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + +LS Q+S +K + I ++ Sbjct: 88 VTILSGQTSSVKKVRIHGISRQ 109 >gi|327289069|ref|XP_003229247.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 1 [Anolis carolinensis] Length = 119 Score = 77.1 bits (189), Expect = 8e-13, Method: Composition-based stats. Identities = 21/94 (22%), Positives = 38/94 (40%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P +K++ + L I + A P G+AN + L+K L + K Sbjct: 30 VTIAVHAKPGSKQNAVTDLSAEAVG-------IAIAAPPSDGEANAELCRYLSKVLEVRK 82 Query: 62 SSLRMLSKQSSPLKIIYIDKD--CKEITELLQNN 93 SS + S K++ I +E+ + L+ Sbjct: 83 SSSLLKQGGRSREKLVKILAPLTPEEVLQKLRKE 116 >gi|289547963|ref|YP_003472951.1| hypothetical protein Thal_0188 [Thermocrinis albus DSM 14484] gi|289181580|gb|ADC88824.1| protein of unknown function DUF167 [Thermocrinis albus DSM 14484] Length = 77 Score = 77.1 bits (189), Expect = 9e-13, Method: Composition-based stats. Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + V+ P A K + L ++ V P+ GKAN+ +L +L+K L + Sbjct: 3 MIVIHVKAKPKASKEYVKELSP-------NFYEVAVKEPPEDGKANERILELLSKHLKVP 55 Query: 61 KSSLRMLSKQSSPLKIIYID 80 KS +++L SS +K+ I Sbjct: 56 KSRIKLLRGTSSRIKVFCIS 75 >gi|195127445|ref|XP_002008179.1| GI11963 [Drosophila mojavensis] gi|193919788|gb|EDW18655.1| GI11963 [Drosophila mojavensis] Length = 125 Score = 76.7 bits (188), Expect = 9e-13, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 37 IKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 89 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93 + + S KII + K + I + L+ Sbjct: 90 VSLDKGSRSRNKIILVCKGVTTVEAIEQSLRKE 122 >gi|51474006|ref|YP_067763.1| hypothetical protein RT0827 [Rickettsia typhi str. Wilmington] gi|81390320|sp|Q68Y09|Y827_RICTY RecName: Full=UPF0235 protein RT0827 gi|51460318|gb|AAU04281.1| conserved hypothetical protein [Rickettsia typhi str. Wilmington] Length = 105 Score = 76.7 bits (188), Expect = 9e-13, Method: Composition-based stats. Identities = 22/89 (24%), Positives = 49/89 (55%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ P +K+ + ++ ++K+ +TA P++GKAN+ ++ LAK+ LS+ Sbjct: 14 ALINIKVKPYSKQ---NLINNFVIINNIPYIKLSITAAPEQGKANEGIINYLAKEWKLSR 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 SS+ ++ + LK I I ++ L+ Sbjct: 71 SSIEIIKGHTHSLKTILIKNINEDYLNLI 99 >gi|262197893|ref|YP_003269102.1| hypothetical protein Hoch_4719 [Haliangium ochraceum DSM 14365] gi|262081240|gb|ACY17209.1| protein of unknown function DUF167 [Haliangium ochraceum DSM 14365] Length = 101 Score = 76.7 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P A + + + +K+ VTA P G AN A+LA+LAK L + Sbjct: 9 VLLDILVRPRAAHARVGPVH-------GERLKVAVTAPPADGAANDAVLALLAKHLGRPR 61 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 L ++ QSS K +++ Sbjct: 62 RDLCLVGGQSSRRKTVHVAG 81 >gi|171679393|ref|XP_001904643.1| hypothetical protein [Podospora anserina S mat+] gi|170939322|emb|CAP64550.1| unnamed protein product [Podospora anserina S mat+] Length = 119 Score = 76.7 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 5/75 (6%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P A K+ + ++I V A ++G+ANKA++ +L++ L L KS+L + Sbjct: 26 VKPGASKNR-----EGVASVGEDAVEICVAAQAREGEANKAVIKVLSEVLDLPKSNLEIT 80 Query: 68 SKQSSPLKIIYIDKD 82 S K + + Sbjct: 81 QGHKSRNKTVAVIGP 95 >gi|291412810|ref|XP_002722671.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus] Length = 154 Score = 76.7 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 34/96 (35%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ + L P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGAKQNAVTDLTAEAVIVAIAA-------PPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + EI L+ Sbjct: 117 SDVVLDKGGKSREKVVKLLASTTPDEILGKLKQEAE 152 >gi|126465433|ref|YP_001040542.1| hypothetical protein Smar_0527 [Staphylothermus marinus F1] gi|126014256|gb|ABN69634.1| protein of unknown function DUF167 [Staphylothermus marinus F1] Length = 110 Score = 76.7 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 43/95 (45%), Gaps = 9/95 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + PN+ + + + + T P+KG+AN A++ L+KK+ L Sbjct: 22 VIIPIYVKPNSDRDALV--------LEGDELVFYTTEIPEKGRANAALIRFLSKKVGLPH 73 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNNDS 95 S + ++ + K I + D D +++ E L S Sbjct: 74 SKIDIIYGARTRSKKILVRDIDTEKLAEKLTEIIS 108 >gi|15835282|ref|NP_297041.1| hypothetical protein TC0667 [Chlamydia muridarum Nigg] gi|270285456|ref|ZP_06194850.1| hypothetical protein CmurN_03383 [Chlamydia muridarum Nigg] gi|270289467|ref|ZP_06195769.1| hypothetical protein CmurW_03473 [Chlamydia muridarum Weiss] gi|301336853|ref|ZP_07225055.1| hypothetical protein CmurM_03440 [Chlamydia muridarum MopnTet14] gi|29839670|sp|Q9PK06|Y667_CHLMU RecName: Full=UPF0235 protein TC_0667 gi|7190702|gb|AAF39489.1| conserved hypothetical protein [Chlamydia muridarum Nigg] Length = 100 Score = 76.7 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 49/78 (62%), Gaps = 7/78 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ A+++ + SLE ++++VT P++GKAN A++A+LAK L++ K+ Sbjct: 8 LEIRVTTKARENKVVSLE-------DGILRVRVTEAPERGKANDAVVALLAKFLSIPKND 60 Query: 64 LRMLSKQSSPLKIIYIDK 81 + +++ ++S K + + + Sbjct: 61 VTLIAGEASRRKKVLLPR 78 >gi|15841462|ref|NP_336499.1| hypothetical protein MT2038 [Mycobacterium tuberculosis CDC1551] gi|148661795|ref|YP_001283318.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra] gi|167970491|ref|ZP_02552768.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra] gi|215404212|ref|ZP_03416393.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987] gi|215411676|ref|ZP_03420472.1| PE-PGRS family protein [Mycobacterium tuberculosis 94_M4241A] gi|215427341|ref|ZP_03425260.1| PE-PGRS family protein [Mycobacterium tuberculosis T92] gi|215430902|ref|ZP_03428821.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054] gi|215446194|ref|ZP_03432946.1| PE-PGRS family protein [Mycobacterium tuberculosis T85] gi|218753699|ref|ZP_03532495.1| PE-PGRS family protein [Mycobacterium tuberculosis GM 1503] gi|218753702|ref|ZP_03532498.1| PE-PGRS family protein [Mycobacterium tuberculosis GM 1503] gi|219557946|ref|ZP_03537022.1| PE-PGRS family protein [Mycobacterium tuberculosis T17] gi|253798966|ref|YP_003031967.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 1435] gi|254232156|ref|ZP_04925483.1| hypothetical protein TBCG_01934 [Mycobacterium tuberculosis C] gi|254364802|ref|ZP_04980848.1| hypothetical protein TBHG_01941 [Mycobacterium tuberculosis str. Haarlem] gi|254551007|ref|ZP_05141454.1| PE-PGRS family protein [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186958|ref|ZP_05764432.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A] gi|260201085|ref|ZP_05768576.1| PE-PGRS family protein [Mycobacterium tuberculosis T46] gi|260205263|ref|ZP_05772754.1| PE-PGRS family protein [Mycobacterium tuberculosis K85] gi|289443475|ref|ZP_06433219.1| PE-PGRS family protein [Mycobacterium tuberculosis T46] gi|289447602|ref|ZP_06437346.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A] gi|289570082|ref|ZP_06450309.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289574658|ref|ZP_06454885.1| PE-PGRS family protein [Mycobacterium tuberculosis K85] gi|289750564|ref|ZP_06509942.1| PE-PGRS family protein [Mycobacterium tuberculosis T92] gi|289754087|ref|ZP_06513465.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054] gi|289758097|ref|ZP_06517475.1| PE-PGRS family protein [Mycobacterium tuberculosis T85] gi|294996916|ref|ZP_06802607.1| PE-PGRS family protein [Mycobacterium tuberculosis 210] gi|297634555|ref|ZP_06952335.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 4207] gi|297731543|ref|ZP_06960661.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN R506] gi|306776218|ref|ZP_07414555.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu001] gi|306779999|ref|ZP_07418336.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu002] gi|306784749|ref|ZP_07423071.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu003] gi|306789106|ref|ZP_07427428.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu004] gi|306793440|ref|ZP_07431742.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu005] gi|306797824|ref|ZP_07436126.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu006] gi|306803704|ref|ZP_07440372.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu008] gi|306808278|ref|ZP_07444946.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu007] gi|306968102|ref|ZP_07480763.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu009] gi|306972327|ref|ZP_07484988.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu010] gi|307080037|ref|ZP_07489207.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu011] gi|313658876|ref|ZP_07815756.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN V2475] gi|166227505|sp|A5U406|Y1997_MYCTA RecName: Full=UPF0235 protein MRA_1997 gi|13881702|gb|AAK46313.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] gi|124601215|gb|EAY60225.1| hypothetical protein TBCG_01934 [Mycobacterium tuberculosis C] gi|134150316|gb|EBA42361.1| hypothetical protein TBHG_01941 [Mycobacterium tuberculosis str. Haarlem] gi|148505947|gb|ABQ73756.1| PE-PGRS family protein [Mycobacterium tuberculosis H37Ra] gi|253320469|gb|ACT25072.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 1435] gi|289416394|gb|EFD13634.1| PE-PGRS family protein [Mycobacterium tuberculosis T46] gi|289420560|gb|EFD17761.1| PE-PGRS family protein [Mycobacterium tuberculosis CPHL_A] gi|289539089|gb|EFD43667.1| PE-PGRS family protein [Mycobacterium tuberculosis K85] gi|289543836|gb|EFD47484.1| conserved hypothetical protein [Mycobacterium tuberculosis T17] gi|289691151|gb|EFD58580.1| PE-PGRS family protein [Mycobacterium tuberculosis T92] gi|289694674|gb|EFD62103.1| PE-PGRS family protein [Mycobacterium tuberculosis EAS054] gi|289713661|gb|EFD77673.1| PE-PGRS family protein [Mycobacterium tuberculosis T85] gi|308215330|gb|EFO74729.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu001] gi|308327103|gb|EFP15954.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu002] gi|308330482|gb|EFP19333.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu003] gi|308334316|gb|EFP23167.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu004] gi|308338117|gb|EFP26968.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu005] gi|308341810|gb|EFP30661.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu006] gi|308345297|gb|EFP34148.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu007] gi|308349599|gb|EFP38450.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu008] gi|308354227|gb|EFP43078.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu009] gi|308358205|gb|EFP47056.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu010] gi|308362136|gb|EFP50987.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu011] gi|323719473|gb|EGB28600.1| hypothetical protein TMMG_01247 [Mycobacterium tuberculosis CDC1551A] gi|326903595|gb|EGE50528.1| hypothetical protein TBPG_01471 [Mycobacterium tuberculosis W-148] gi|328458721|gb|AEB04144.1| PE-PGRS family protein [Mycobacterium tuberculosis KZN 4207] Length = 76 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 37/79 (46%), Gaps = 6/79 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +V+VR+ P + K + + + I V GKAN A+ +LA L L KS Sbjct: 4 SVVVRVKPGSHKGPLVEV------GPNGELIIYVREPAIDGKANDAVTRLLAAHLQLPKS 57 Query: 63 SLRMLSKQSSPLKIIYIDK 81 ++++S +S K + + Sbjct: 58 RVKLVSGATSRFKRFRLSR 76 >gi|154244464|ref|YP_001415422.1| hypothetical protein Xaut_0507 [Xanthobacter autotrophicus Py2] gi|154158549|gb|ABS65765.1| protein of unknown function DUF167 [Xanthobacter autotrophicus Py2] Length = 110 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 2/84 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V VR P + + + D +KI+V P+ G A A+ +LA+ ++ Sbjct: 11 VEVTVRATPRGGRDALDGVAELSDG--RAVLKIRVKVAPEDGAATAAVARVLAQAAGVAA 68 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 S +R+ S ++ +K I D + Sbjct: 69 SQVRLASGATARVKTFRIAGDAAK 92 >gi|14520717|ref|NP_126192.1| hypothetical protein PAB7122 [Pyrococcus abyssi GE5] gi|29839688|sp|Q9V1C6|Y501_PYRAB RecName: Full=UPF0235 protein PYRAB05010 gi|5457933|emb|CAB49423.1| Hypothetical protein [Pyrococcus abyssi GE5] Length = 92 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 44/91 (48%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + PNA+++ I ++ +K+ + A P KGKAN+ ++ L+ Sbjct: 9 VILRVIVKPNARENSIEGID-----EWRGRIKVNIKAQPVKGKANRELIKFLSNLFG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + + +L ++S K + + + +E+ L+ Sbjct: 61 AEVEILKGETSREKDVLVRGVNLEEVKRRLK 91 >gi|145595719|ref|YP_001160016.1| hypothetical protein Strop_3204 [Salinispora tropica CNB-440] gi|145305056|gb|ABP55638.1| protein of unknown function DUF167 [Salinispora tropica CNB-440] Length = 106 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 23/87 (26%), Positives = 40/87 (45%), Gaps = 3/87 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P + +S + + I VTA P G+A +A LA+ + +++ Sbjct: 17 VAVRIKPGSSRSRVGG---RYMGPYGPALVIAVTAPPVDGRATEAARRALAEAFGVRRAA 73 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + +S KI Y+ EIT L Sbjct: 74 VSLGAGAASRNKIFYVGGSGVEITRTL 100 >gi|195014316|ref|XP_001984001.1| GH15254 [Drosophila grimshawi] gi|193897483|gb|EDV96349.1| GH15254 [Drosophila grimshawi] Length = 126 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 45/96 (46%), Gaps = 10/96 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++GI + + +++ A P +G+AN ++ L+K L L KS Sbjct: 38 IKILAKPGAKQNGITDIGLEGVG-------VQIAAPPSEGEANAELVKFLSKVLGLRKSD 90 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNNDSL 96 + + S K+I I K + I +LL+ L Sbjct: 91 VSLDKGSRSKNKLILITKGVSTVEAIEQLLRKESEL 126 >gi|222055825|ref|YP_002538187.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] gi|221565114|gb|ACM21086.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] Length = 99 Score = 76.3 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 41/86 (47%), Gaps = 8/86 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + + + VTA P+ G+A M+ LA++ +S S ++++S Sbjct: 20 TPNAKRDAIGKV-------KGHQLCVSVTAVPRAGRATDHMVRFLAEEFGVSVSDIQVVS 72 Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94 + + K + I K + ++ + Sbjct: 73 GRMNVNKQLRIKA-PKRLPSVIGQQE 97 >gi|312375313|gb|EFR22710.1| hypothetical protein AND_14314 [Anopheles darlingi] Length = 155 Score = 75.9 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 42/114 (36%), Gaps = 22/114 (19%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKI-----------------KVTATPQKGKAN 46 V + P AK SGI + + + A P G+AN Sbjct: 39 VKILAKPGAKTSGITDVSEEGVGCQIGMSLVDGSTLTLFDQTIQLHLSPLAAPPIDGEAN 98 Query: 47 KAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDC-----KEITELLQNNDS 95 ++ L+K L L KS + + S K I +DKD +++ + ++ S Sbjct: 99 TELIRYLSKLLELRKSDISLDRGSKSRQKTIVLDKDGCRHTREQLLTIFRSEAS 152 >gi|310814754|ref|YP_003962718.1| hypothetical protein EIO_0232 [Ketogulonicigenium vulgare Y25] gi|308753489|gb|ADO41418.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 82 Score = 75.9 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 40/76 (52%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A ++ I ++ +++ VT P+ GKAN A+ +LAK L + KS Sbjct: 13 IAVRVTPKASRARIL-------RDESGVLRVYVTVVPEDGKANAAVTELLAKALRIPKSK 65 Query: 64 LRMLSKQSSPLKIIYI 79 L + S ++ K+ + Sbjct: 66 LILKSGATARDKVFRL 81 >gi|253701118|ref|YP_003022307.1| hypothetical protein GM21_2508 [Geobacter sp. M21] gi|251775968|gb|ACT18549.1| protein of unknown function DUF167 [Geobacter sp. M21] Length = 101 Score = 75.9 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 8/88 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + I VTA P+ G+A M+ LA++ +S S ++++ Sbjct: 21 TPNAKRDAIG-------KPKGHQLCISVTAVPRAGRATDHMVRFLAEEFEVSVSDIQVVF 73 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDSL 96 + + K + I K + ++ D L Sbjct: 74 GRMNVNKQLRIKA-PKRLPSVIGQQDLL 100 >gi|146324075|ref|XP_001481499.1| DUF167 domain protein [Aspergillus fumigatus Af293] gi|129558081|gb|EBA27446.1| DUF167 domain protein [Aspergillus fumigatus Af293] gi|159126298|gb|EDP51414.1| YggU family protein [Aspergillus fumigatus A1163] Length = 133 Score = 75.9 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 5/79 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + PNA S + + V A P+ G+AN A+ + A+ + KS Sbjct: 33 QIACHVKPNASSSR-----EGIIAVGAEKVDVCVAAVPRNGEANTAVSRVFAQIFDVPKS 87 Query: 63 SLRMLSKQSSPLKIIYIDK 81 ++ ++ S K + I Sbjct: 88 NVEVIRGLKSRDKTLCITN 106 >gi|221059313|ref|XP_002260302.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] gi|193810375|emb|CAQ41569.1| hypothetical protein, conserved in Plasmodium species [Plasmodium knowlesi strain H] Length = 176 Score = 75.9 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PNAK + I SD + I + P ++N A++ + L L K Sbjct: 90 INLRVKPNAKNTSI------YFNSDREVLNINIQEQPINNQSNIAIIGYFSDILDLKKRD 143 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + ++S S K++ + ++ + N Sbjct: 144 ISIVSGLKSRDKVLMVSNISLDDLNSKIAEN 174 >gi|7508740|pir||T26031 hypothetical protein W01A8.2 - Caenorhabditis elegans Length = 263 Score = 75.5 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKS + ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 39 LHIHAKPGAKKSCVVAI-------GDSEVDVAIGAAPREGAANEELISYLMSALGLRKNE 91 Query: 64 LRMLSKQSSPLKIIYID 80 L+ S K++ ID Sbjct: 92 LQFDKGAKSRSKVVLID 108 >gi|222055811|ref|YP_002538173.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] gi|221565100|gb|ACM21072.1| protein of unknown function DUF167 [Geobacter sp. FRC-32] Length = 98 Score = 75.5 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 8/86 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + + + VTA P+ G+A M+ LA++ +S S ++++ Sbjct: 19 TPNAKRDAIGKV-------KGHQLCVSVTAVPRAGRATDHMVRFLAEEFGVSVSDIQVVF 71 Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94 + + K + I K + ++ + Sbjct: 72 GRMNVNKQLRIKA-PKRLPLVIGQQE 96 >gi|119357873|ref|YP_912517.1| hypothetical protein Cpha266_2081 [Chlorobium phaeobacteroides DSM 266] gi|187479907|sp|A1BI66|Y2081_CHLPD RecName: Full=UPF0235 protein Cpha266_2081 gi|119355222|gb|ABL66093.1| protein of unknown function DUF167 [Chlorobium phaeobacteroides DSM 266] Length = 101 Score = 75.5 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 24/86 (27%), Positives = 40/86 (46%), Gaps = 7/86 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 ++ P + KS I+ + +K+ + A P AN+ + AK L++S S L Sbjct: 18 RLKAQPRSSKSAISG-------AYNGGVKVNLKAAPVDDAANRECCDLFAKVLSVSSSRL 70 Query: 65 RMLSKQSSPLKIIYIDKDCKEITELL 90 +LS +SS K I ++ E LL Sbjct: 71 TILSGKSSKNKTIKVEGLGAEEVALL 96 >gi|156100181|ref|XP_001615818.1| hypothetical protein [Plasmodium vivax SaI-1] gi|148804692|gb|EDL46091.1| hypothetical protein, conserved [Plasmodium vivax] Length = 178 Score = 75.5 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PNAK + I SD + I + P ++N A++ + L L K Sbjct: 92 INLRVKPNAKNTSI------YFNSDREVLNINIQEQPINNQSNVAIIGYFSDILDLKKRD 145 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + ++S S K++ + ++ + N Sbjct: 146 ISIVSGLKSRDKVLMVSNISLDDLNSKIAEN 176 >gi|148264159|ref|YP_001230865.1| hypothetical protein Gura_2104 [Geobacter uraniireducens Rf4] gi|146397659|gb|ABQ26292.1| protein of unknown function DUF167 [Geobacter uraniireducens Rf4] Length = 99 Score = 75.5 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 39/86 (45%), Gaps = 8/86 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + + + VTA P+ G+A M+ LA + +S S ++++ Sbjct: 20 TPNAKRDAIGKV-------KGHQLCVSVTAVPRAGRATDHMVRFLADEFGVSVSDIQVVF 72 Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94 + + K + I K + ++ + Sbjct: 73 GRMNVNKQLRIKA-PKRLPSVIGQQE 97 >gi|300175116|emb|CBK20427.2| unnamed protein product [Blastocystis hominis] Length = 152 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 10/88 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PN KK+GI + ++I++++ P + KANK + +++ + KS + ++ Sbjct: 29 KPNCKKTGIE--------WEEEQLQIRLSSPPTENKANKEVCEVVSDIADIPKSQVSLIR 80 Query: 69 KQSSPLKIIYIDKDCKE--ITELLQNND 94 S K + ++ E I LLQ Sbjct: 81 GGKSRDKELMLNGVSSEGLIQSLLQALQ 108 >gi|218437671|ref|YP_002376000.1| hypothetical protein PCC7424_0673 [Cyanothece sp. PCC 7424] gi|226708012|sp|B7KEV7|Y673_CYAP7 RecName: Full=UPF0235 protein PCC7424_0673 gi|218170399|gb|ACK69132.1| protein of unknown function DUF167 [Cyanothece sp. PCC 7424] Length = 73 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ PNAK I + ++ + I + + P +GKAN+ ++ +LA+K ++K Sbjct: 1 MKIQVKVKPNAKHQKI-------EEAEDGSLIISLKSPPVEGKANQELIKLLAQKYRVTK 53 Query: 62 SSLRMLSKQSSPLKIIYI 79 S + + S SS K+I I Sbjct: 54 SQISIQSGLSSRNKLIEI 71 >gi|188996205|ref|YP_001930456.1| protein of unknown function DUF167 [Sulfurihydrogenibium sp. YO3AOP1] gi|259646874|sp|B2V7H9|Y257_SULSY RecName: Full=UPF0235 protein SYO3AOP1_0257 gi|188931272|gb|ACD65902.1| protein of unknown function DUF167 [Sulfurihydrogenibium sp. YO3AOP1] Length = 73 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P K+ + ++ +++ T P+KGKAN+ ++ +L+ + K Sbjct: 1 MRIKVKVKPGTSKNEVKKID-------ENLYEVRTTTIPEKGKANEKVVELLSDFFDVPK 53 Query: 62 SSLRMLSKQSSPLKIIYI 79 S ++++ Q+S K + + Sbjct: 54 SKIKIVKGQTSREKEVEV 71 >gi|57235102|ref|YP_182004.1| hypothetical protein DET1292 [Dehalococcoides ethenogenes 195] gi|123618390|sp|Q3Z6Z5|Y1292_DEHE1 RecName: Full=UPF0235 protein DET1292 gi|57225550|gb|AAW40607.1| conserved hypothetical protein [Dehalococcoides ethenogenes 195] Length = 97 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 7/75 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V ++++P+A+++ + E +KIK+ A P+KGKANKA++ L++ L KS Sbjct: 9 RVNLKILPSAQRNELTGYE-------NGLLKIKIAAQPEKGKANKALVDYLSELLDTPKS 61 Query: 63 SLRMLSKQSSPLKII 77 + + S K++ Sbjct: 62 EIEICRGLSGRNKVV 76 >gi|99080537|ref|YP_612691.1| hypothetical protein TM1040_0696 [Ruegeria sp. TM1040] gi|99036817|gb|ABF63429.1| protein of unknown function DUF167 [Ruegeria sp. TM1040] Length = 92 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 21/81 (25%), Positives = 39/81 (48%), Gaps = 8/81 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + +K+ VT P+ GKAN ++A+L++ L +S Sbjct: 20 AQIAVRVTPKAAQNAVL--------RKRDEIKVLVTTVPEGGKANADVVALLSRALGVSP 71 Query: 62 SSLRMLSKQSSPLKIIYIDKD 82 S L +L +S K+ + Sbjct: 72 SRLTLLRGATSRDKVFLVTAP 92 >gi|307353453|ref|YP_003894504.1| hypothetical protein Mpet_1306 [Methanoplanus petrolearius DSM 11571] gi|307156686|gb|ADN36066.1| protein of unknown function DUF167 [Methanoplanus petrolearius DSM 11571] Length = 105 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 39/90 (43%), Gaps = 5/90 (5%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P +KK+ + ++ ++ + +GKAN ++ +A ++ K Sbjct: 14 VQISLDVSPGSKKT----VFPAGYNEWRNAIECRIKSPATEGKANAEIIKTIADYFSVKK 69 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + ++S S K I I ++ E L Sbjct: 70 SDVVIVSGAISGQKKIKISGVSLEDALERL 99 >gi|147921111|ref|YP_685078.1| hypothetical protein RCIX287 [uncultured methanogenic archaeon RC-I] gi|56295557|emb|CAH04800.1| conserved hypothetical protein [uncultured archaeon] gi|110620474|emb|CAJ35752.1| conserved hypothetical protein [uncultured methanogenic archaeon RC-I] Length = 104 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 39/86 (45%), Gaps = 4/86 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK + + + +++K+ A P++G+AN+ ++ L+ L Sbjct: 14 VVIDFEVTPGAKST----VVPSGYSVWRKRIEVKLKAPPERGRANEELIEALSDLFHLPA 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEIT 87 SS+ + + ++ K I + ++ Sbjct: 70 SSIEITAGATNSRKSIKVHGISTDVV 95 >gi|38505546|ref|NP_942167.1| hypothetical protein ssr5011 [Synechocystis sp. PCC 6803] gi|38423570|dbj|BAD01781.1| ssr5011 [Synechocystis sp. PCC 6803] Length = 73 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 24/76 (31%), Positives = 42/76 (55%), Gaps = 7/76 (9%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PNAK+S + D + I V + P GKAN+ ++ +LAK+ +S+ S+ Sbjct: 4 QVKVKPNAKQSKVV-------YGDDGSLIIHVKSPPVDGKANQELIKLLAKEFNVSQQSI 56 Query: 65 RMLSKQSSPLKIIYID 80 ++ S S KI+ I+ Sbjct: 57 KIKSGAGSRQKIVEIN 72 >gi|189183832|ref|YP_001937617.1| hypothetical protein OTT_0925 [Orientia tsutsugamushi str. Ikeda] gi|189180603|dbj|BAG40383.1| hypothetical protein OTT_0925 [Orientia tsutsugamushi str. Ikeda] Length = 112 Score = 75.2 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 47/90 (52%), Gaps = 3/90 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ AK + I L + S + I + P+ KAN+A++ L++ L + +S+ Sbjct: 20 INLKVKAGAKINKIIGLYYINNKS---FLYISINTIPENNKANQAIIKFLSQWLEVCRSN 76 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 ++++ S LK+I + I+ L+++ Sbjct: 77 IKIVYGLHSNLKVISVINTNANISNLIRSK 106 >gi|206889548|ref|YP_002248254.1| hypothetical protein THEYE_A0407 [Thermodesulfovibrio yellowstonii DSM 11347] gi|206741486|gb|ACI20543.1| conserved hypothetical protein [Thermodesulfovibrio yellowstonii DSM 11347] Length = 87 Score = 74.8 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 42/79 (53%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + AK +GI +E +K+++ A P G ANK ++ ML++ L + KS Sbjct: 14 LKVLVKTGAKITGIGGIE-------GNTLKLRLAAQPHDGLANKELIEMLSEILNIPKSR 66 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + ++ ++S KII + + Sbjct: 67 IEIIKGKTSKHKIIKLKGE 85 >gi|289746071|ref|ZP_06505449.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987] gi|308403079|ref|ZP_07493725.2| PE-PGRS family protein [Mycobacterium tuberculosis SUMu012] gi|289686599|gb|EFD54087.1| PE-PGRS family protein [Mycobacterium tuberculosis 02_1987] gi|308365791|gb|EFP54642.1| PE-PGRS family protein [Mycobacterium tuberculosis SUMu012] Length = 72 Score = 74.8 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 6/76 (7%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 VR+ P + K + + + I V GKAN A+ +LA L L KS ++ Sbjct: 3 VRVKPGSHKGPLVEV------GPNGELIIYVREPAIDGKANDAVTRLLAAHLQLPKSRVK 56 Query: 66 MLSKQSSPLKIIYIDK 81 ++S +S K + + Sbjct: 57 LVSGATSRFKRFRLSR 72 >gi|254466397|ref|ZP_05079808.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687305|gb|EDZ47787.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 105 Score = 74.8 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 8/82 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR P A ++ I + +KI VTA P+ GKAN+A+ +LA + + Sbjct: 32 AEIAVRATPKAARNAIV--------AAEGVLKISVTAVPENGKANEAIRRLLAAAMGTAA 83 Query: 62 SSLRMLSKQSSPLKIIYIDKDC 83 S L + +S K+ Sbjct: 84 SRLELRRGAASRDKLFVYLGPA 105 >gi|118469743|ref|YP_888136.1| hypothetical protein MSMEG_3845 [Mycobacterium smegmatis str. MC2 155] gi|226706061|sp|A0QYZ8|Y3845_MYCS2 RecName: Full=UPF0235 protein MSMEG_3845 gi|118171030|gb|ABK71926.1| conserved domain protein [Mycobacterium smegmatis str. MC2 155] Length = 75 Score = 74.8 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 6/77 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+VR+ P ++K + +T+D + I V GKAN A+ +LA L + S Sbjct: 5 VVVRVKPGSRKGPLV------ETADDGTLTIYVQERAVDGKANAAVTKLLAAHLGVPPSR 58 Query: 64 LRMLSKQSSPLKIIYID 80 + + S ++ LK I Sbjct: 59 VELASGATARLKRFRIS 75 >gi|313223261|emb|CBY43439.1| unnamed protein product [Oikopleura dioica] Length = 180 Score = 74.8 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 28/93 (30%), Positives = 49/93 (52%), Gaps = 2/93 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDT-SDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + P +K I+ + DT S + ++V+A P KGKANKA+L LA KL + Sbjct: 79 VLLNCHVTPKSKTPSISVDDSINDTLSVVHVVNVRVSAPPDKGKANKAVLKSLADKLGVK 138 Query: 61 KSSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 S L + S +S K++ ++ D +++ +L Sbjct: 139 PSKLSIQSGTTSRSKVVLLETD-ADLSTILAQI 170 >gi|296235859|ref|XP_002763077.1| PREDICTED: UPF0235 protein C15orf40-like [Callithrix jacchus] Length = 154 Score = 74.8 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 38/87 (43%), Gaps = 9/87 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P ++++ + L + + + + A P +G+AN + L+K L L K Sbjct: 64 VTIAIHAKPGSRQNAVTDLTV-------EAINVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEI 86 S + + S K++ + +EI Sbjct: 117 SDVVLDKSGKSREKVVKLLASTTPEEI 143 >gi|329902481|ref|ZP_08273126.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium IMCC9480] gi|327548773|gb|EGF33410.1| hypothetical protein IMCC9480_281 [Oxalobacteraceae bacterium IMCC9480] Length = 105 Score = 74.8 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 21/92 (22%), Positives = 45/92 (48%), Gaps = 10/92 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ NAKK+ + + +KIK+ A P +GKAN A++ +A +L + + Sbjct: 16 VRLAVQVAANAKKTEVIGV-------ADDVLKIKLHAQPIEGKANDALVRFVAGQLHVPR 68 Query: 62 SSLRMLSKQSSPLKIIYIDK---DCKEITELL 90 +++ + +S K++ + E+ L Sbjct: 69 TTVSVTHGLTSKRKLLLVRAVGLSVDEVQRAL 100 >gi|148555779|ref|YP_001263361.1| hypothetical protein Swit_2871 [Sphingomonas wittichii RW1] gi|148500969|gb|ABQ69223.1| protein of unknown function DUF167 [Sphingomonas wittichii RW1] Length = 74 Score = 74.4 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 20/68 (29%), Positives = 36/68 (52%) Query: 24 KDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDC 83 D + IK+ A P G AN+A++ ++AK L ++K + + S +S LK +++ D Sbjct: 3 ADADGRRWLSIKLAAAPSDGAANEALVRLVAKALGVAKRDVTLASGATSRLKRLHVSGDP 62 Query: 84 KEITELLQ 91 + LQ Sbjct: 63 AALAAALQ 70 >gi|256394873|ref|YP_003116437.1| hypothetical protein Caci_5738 [Catenulispora acidiphila DSM 44928] gi|256361099|gb|ACU74596.1| protein of unknown function DUF167 [Catenulispora acidiphila DSM 44928] Length = 92 Score = 74.4 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 44/92 (47%), Gaps = 7/92 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P + ++ + + + VTA G A +A L +A+ L + + Sbjct: 5 RIPIRVKPGSSRTKVGGRH------GERSLIVAVTAKAVDGAATEAALRAVAEALGMPRR 58 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 ++++++ +S K++ + D + EL+++ Sbjct: 59 AVQLITGATSRDKVLGVSSEDPDTVRELVRDL 90 >gi|78221372|ref|YP_383119.1| hypothetical protein Gmet_0145 [Geobacter metallireducens GS-15] gi|78192627|gb|ABB30394.1| protein of unknown function DUF167 [Geobacter metallireducens GS-15] Length = 100 Score = 74.4 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 8/86 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + + + VTA P+ G+A M+ LA + +S ++++ Sbjct: 20 TPNAKRDAIGKV-------KGHQLCVSVTAIPRAGRATDHMVRFLADEFGVSVGDIQVVF 72 Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94 + + K + I K + L+ + Sbjct: 73 GRMNVNKQLRIKA-PKRLPPLIGQQE 97 >gi|197118100|ref|YP_002138527.1| hypothetical protein Gbem_1715 [Geobacter bemidjiensis Bem] gi|197087460|gb|ACH38731.1| protein of unknown function DUF167 [Geobacter bemidjiensis Bem] Length = 101 Score = 74.4 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 39/86 (45%), Gaps = 8/86 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + I VTA P+ G+A M+ LA++ ++ S ++++ Sbjct: 21 TPNAKRDAIG-------KPKGHQLCISVTAVPRAGRATDHMVRFLAEEFEVAVSDIQVVF 73 Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94 + + K + I K + ++ D Sbjct: 74 GRMNVNKQLRIKA-PKRLPSVIGQQD 98 >gi|38345823|emb|CAD41928.2| OSJNBa0070M12.6 [Oryza sativa Japonica Group] Length = 207 Score = 74.4 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 41/90 (45%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 120 ISIQAKPGSKLATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 172 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + + S S K++ + D + + + L+ Sbjct: 173 VSIGSGSKSREKVVLVQDATLQGVFDALKK 202 >gi|18978137|ref|NP_579494.1| hypothetical protein PF1765 [Pyrococcus furiosus DSM 3638] gi|29839724|sp|Q8U052|Y1765_PYRFU RecName: Full=UPF0235 protein PF1765 gi|18893938|gb|AAL81889.1| hypothetical protein PF1765 [Pyrococcus furiosus DSM 3638] Length = 92 Score = 74.4 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 43/91 (47%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + PNA+++ I ++ +K+ V A P KGKANK ++ K Sbjct: 9 VILSVIVAPNARETKIVGIDGT-----RGRVKVNVAAPPVKGKANKELMKFFKKLFG--- 60 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + + ++ ++S K + I KE+ E L+ Sbjct: 61 AEVVIVRGETSREKDLLIKGITKKEVIEKLE 91 >gi|85092099|ref|XP_959226.1| hypothetical protein NCU06879 [Neurospora crassa OR74A] gi|28920629|gb|EAA29990.1| predicted protein [Neurospora crassa OR74A] Length = 130 Score = 74.4 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 38/79 (48%), Gaps = 5/79 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P A K+ + D ++I V A ++G+ANKA++ +L++ L L KS+ Sbjct: 30 IHCHVKPGASKNR-EGVTSITD----EAVEICVAAQAKEGEANKAVVKVLSEALNLPKSN 84 Query: 64 LRMLSKQSSPLKIIYIDKD 82 L + S K I + Sbjct: 85 LEITQGLKSRAKTIAVAAP 103 >gi|294949448|ref|XP_002786202.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] gi|239900359|gb|EER17998.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983] Length = 132 Score = 74.0 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 47/95 (49%), Gaps = 9/95 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK-LALS 60 + +R P AK S + ++ + +++ A+ + G+AN+ +L+ L+K+ L + Sbjct: 41 ARIAIRAKPGAKVSCLTGIDA------EGALGVQLNASARDGEANEELLSFLSKEVLGVK 94 Query: 61 KSSLRMLSKQSSPLKIIYIDK--DCKEITELLQNN 93 K + ++ S K++ I +++ LL++ Sbjct: 95 KKDVALVQGSKSREKVVEIADVLTVDDVSRLLRDE 129 >gi|304314780|ref|YP_003849927.1| hypothetical protein MTBMA_c10190 [Methanothermobacter marburgensis str. Marburg] gi|302588239|gb|ADL58614.1| conserved hypothetical protein [Methanothermobacter marburgensis str. Marburg] Length = 102 Score = 74.0 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P + E+ ++IKV A P+KGKAN+ ++ + ++ Sbjct: 13 VDIEVSPAS-----GGFEVRSYNEWRKRIEIKVRAPPEKGKANREIIEEFSAAFN---TN 64 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 ++S S K + I D + LL+ Sbjct: 65 ADIVSGHKSRHKTLKIYGMDAETFRTLLEEK 95 >gi|33359419|ref|NP_877861.1| hypothetical protein PH1669.1n [Pyrococcus horikoshii OT3] Length = 95 Score = 74.0 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 46/89 (51%), Gaps = 9/89 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + PN+K++ I ++ ++I + A P KG+ANK ++ L+K L + Sbjct: 14 IQVIVRPNSKENKIEGVDN-----WKNRIRISIKAPPVKGEANKELIKFLSKILG---AK 65 Query: 64 LRMLSKQSSPLKIIYIDKDC-KEITELLQ 91 + ++ ++S K + + +E+ + L+ Sbjct: 66 VEIIRGETSREKDLLVKGIKLEEVKKRLK 94 >gi|225850940|ref|YP_002731174.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1] gi|259646383|sp|C0QR78|Y1406_PERMH RecName: Full=UPF0235 protein PERMA_1406 gi|225645391|gb|ACO03577.1| protein CPn_0497//CPj0497/CpB0517 [Persephonella marina EX-H1] Length = 73 Score = 74.0 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 26/78 (33%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V V++ PNAKK I ++ + +I+VT P+KGKAN ++ +L+K L + K Sbjct: 1 MIVKVKVKPNAKKEEIREIQK-------DYFEIRVTVPPEKGKANSRVIELLSKHLKIPK 53 Query: 62 SSLRMLSKQSSPLKIIYI 79 S +++ + S KI I Sbjct: 54 SRIKLKKGEKSREKIFEI 71 >gi|67523595|ref|XP_659857.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4] gi|40744782|gb|EAA63938.1| hypothetical protein AN2253.2 [Aspergillus nidulans FGSC A4] gi|259487644|tpe|CBF86471.1| TPA: DUF167 domain protein (AFU_orthologue; AFUA_5G06647) [Aspergillus nidulans FGSC A4] Length = 130 Score = 74.0 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 5/77 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ R+ PNA + + V A P+ G+AN A+ + AK ++KS Sbjct: 31 HISCRVKPNAS-----GGREGITAVGNETVDVCVAAVPRDGEANLAVSQVFAKVFNVAKS 85 Query: 63 SLRMLSKQSSPLKIIYI 79 + ++ S K++ I Sbjct: 86 DVGVIHGLKSRDKVLCI 102 >gi|290996548|ref|XP_002680844.1| predicted protein [Naegleria gruberi] gi|284094466|gb|EFC48100.1| predicted protein [Naegleria gruberi] Length = 73 Score = 74.0 bits (181), Expect = 6e-12, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + PN+ S IA++ + + + A P++G+ANK + ++ L +SK Sbjct: 2 IRLTILAKPNSSSSQIANINDEEIG-------VHIAAPPKEGEANKELCDYVSGVLGVSK 54 Query: 62 SSLRMLSKQSSPLKIIYID 80 S + + S K++ ++ Sbjct: 55 SRVTLDRGGKSRHKLLLVE 73 >gi|332178747|gb|AEE14436.1| UPF0235 protein yggU [Thermodesulfobium narugense DSM 14796] Length = 86 Score = 74.0 bits (181), Expect = 7e-12, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 43/89 (48%), Gaps = 10/89 (11%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +++ PNAKK I + KV+A P+ GKAN+ ++ ++++ + Sbjct: 4 KIELKVTPNAKKESIE--------IKDGKIYCKVSAPPEDGKANRRVIELISEFFDCKRK 55 Query: 63 SLRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S + S KI+ I + I + ++ Sbjct: 56 DVEIFSGEKSKNKILLIK--SENIFKKIK 82 >gi|6503188|gb|AAF14630.1|AF200362_6 unknown [Haemophilus ducreyi] Length = 62 Score = 73.6 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 18/62 (29%), Positives = 31/62 (50%), Gaps = 3/62 (4%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD---CKEITE 88 +K+ +TA P G AN +L L+K + KSS+ + + K +++ KEI + Sbjct: 1 LKVAITAPPVDGAANAYLLKYLSKLFKVPKSSIVLEKGELQRHKQLFVPAPKLLPKEIEQ 60 Query: 89 LL 90 L Sbjct: 61 WL 62 >gi|327401804|ref|YP_004342643.1| hypothetical protein Arcve_1935 [Archaeoglobus veneficus SNP6] gi|327317312|gb|AEA47928.1| UPF0235 protein yggU [Archaeoglobus veneficus SNP6] Length = 101 Score = 73.6 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 42/88 (47%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + PN+K+S + + ++V A P+ GKAN ++ + +K + Sbjct: 16 ISIEVTPNSKQSCVYGYN-----EWRKSIAVRVKAPPKGGKANAEIVELFSKIF---RKK 67 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELL 90 + ++ +S K+I++ +E+ LL Sbjct: 68 VEIVKGHTSSQKVIFVHSASPQEVESLL 95 >gi|261416137|ref|YP_003249820.1| protein of unknown function DUF167 [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372593|gb|ACX75338.1| protein of unknown function DUF167 [Fibrobacter succinogenes subsp. succinogenes S85] gi|302328014|gb|ADL27215.1| conserved hypothetical protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 74 Score = 73.6 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 33/80 (41%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ +K+ + K++V A P G AN A+ ++A + K Sbjct: 1 MRINIKVHARSKRESVT-------PQPDGSYKVEVKAPPVDGAANAAICELIADYFHVHK 53 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 + ++ ++ K+I I Sbjct: 54 RDVSVVMGSTNNKKVIEILG 73 >gi|237756139|ref|ZP_04584711.1| conserved domain protein [Sulfurihydrogenibium yellowstonense SS-5] gi|237691703|gb|EEP60739.1| conserved domain protein [Sulfurihydrogenibium yellowstonense SS-5] Length = 73 Score = 73.6 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 41/78 (52%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ P A ++ + ++ +++ T P+KGKAN+ ++ +L+ + K Sbjct: 1 MRIKVKVKPGASENEVKKIDEYL-------YEVRTTTIPEKGKANEKVIELLSDFFDVPK 53 Query: 62 SSLRMLSKQSSPLKIIYI 79 S ++++ Q+S K + + Sbjct: 54 SKIKIIKGQASREKEVEV 71 >gi|298674092|ref|YP_003725842.1| hypothetical protein Metev_0115 [Methanohalobium evestigatum Z-7303] gi|298287080|gb|ADI73046.1| protein of unknown function DUF167 [Methanohalobium evestigatum Z-7303] Length = 109 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 16/93 (17%), Positives = 43/93 (46%), Gaps = 5/93 (5%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + PN+K + +++K+T + GKAN ++ ++ ++ Sbjct: 13 LVVIDISVTPNSKTINV----PDNYNQWRNRIEVKLTQKAESGKANNQLIENFSEFFGVN 68 Query: 61 KSSLRMLSKQSSPLKIIYIDKDC-KEITELLQN 92 KSS+++ S + S K + + + +L++ Sbjct: 69 KSSIKITSGEKSSQKSVSVKGLSYDDAVSILKS 101 >gi|87198097|ref|YP_495354.1| hypothetical protein Saro_0071 [Novosphingobium aromaticivorans DSM 12444] gi|87133778|gb|ABD24520.1| protein of unknown function DUF167 [Novosphingobium aromaticivorans DSM 12444] Length = 95 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 8/81 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VR+ P AK GI D + +KV A P+ GKA A+ +LA+ L L+ S Sbjct: 23 RLAVRVTPGAKSEGIE--------IDGGRVLVKVRAKPEDGKATAAVQELLARALGLAPS 74 Query: 63 SLRMLSKQSSPLKIIYIDKDC 83 + ML +S K+ I ++ Sbjct: 75 KVEMLRGATSREKLFRIPREA 95 >gi|225431557|ref|XP_002282176.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|147866968|emb|CAN83055.1| hypothetical protein VITISV_009894 [Vitis vinifera] Length = 128 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 36/79 (45%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ V P +K S I + + +++ A + G+AN A+L ++ + + + Sbjct: 39 VSITVHAKPGSKVSSITDFD-------DEALGVQIDAPAKDGEANAALLDYISSVVGVKR 91 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + S S K++ ++ Sbjct: 92 RQVSISSGSKSRDKVVIVE 110 >gi|149057381|gb|EDM08704.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus] gi|149057386|gb|EDM08709.1| similar to RIKEN cDNA 3110040N11, isoform CRA_a [Rattus norvegicus] Length = 114 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 20/96 (20%), Positives = 37/96 (38%), Gaps = 21/96 (21%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ VTA P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSKQNA-------------------VTAPPSEGEANAELCRYLSKVLDLRK 76 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 77 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLRTEAE 112 >gi|289615268|emb|CBI58035.1| unnamed protein product [Sordaria macrospora] Length = 133 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 5/79 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P A K + D ++I V A ++G+ANK+++ +L++ L + KS+ Sbjct: 32 IHCHVKPGASKQR-EGVTCITD----EAVEICVAAQAKEGEANKSVVKVLSEALNIPKSN 86 Query: 64 LRMLSKQSSPLKIIYIDKD 82 L + S K I + Sbjct: 87 LEITQGLKSRAKTIAVAAP 105 >gi|89899559|ref|YP_522030.1| hypothetical protein Rfer_0749 [Rhodoferax ferrireducens T118] gi|89344296|gb|ABD68499.1| protein of unknown function DUF167 [Rhodoferax ferrireducens T118] Length = 114 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 33/88 (37%), Gaps = 8/88 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A K I +K+ V A P G+A M+ LA + +S S + ++ Sbjct: 28 KPSASKDAIG-------KPFGKQLKVSVAAAPVAGRATDHMVRFLAVQFGVSASDIEVVF 80 Query: 69 KQSSPLKIIYIDKDCKEITELLQNNDSL 96 + + K + I + + L Sbjct: 81 GRMNVNKQVRIKA-PTRLPAVFAQGSLL 107 >gi|148674974|gb|EDL06921.1| RIKEN cDNA 3110040N11, isoform CRA_c [Mus musculus] Length = 114 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 21/96 (21%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P ++++ VTA P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSRQNA-------------------VTAPPSQGEANAELCRYLSKVLDLRK 76 Query: 62 SSLRMLSKQSSPLKIIYI--DKDCKEITELLQNNDS 95 S + + S K++ + +E+ E L+ Sbjct: 77 SDVVLDKGGKSREKVVKLLASTTPEEVLEKLKTEAE 112 >gi|326504128|dbj|BAK02850.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 129 Score = 73.2 bits (179), Expect = 1e-11, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 40/90 (44%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 43 ISIHAKPGSKMATITEV-------GEEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 95 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + + S S K++ + D K + E L+ Sbjct: 96 VSIGSGSKSREKVVLVQDATLKGVFEALKK 125 >gi|326926706|ref|XP_003209539.1| PREDICTED: UPF0235 protein C15orf40 homolog [Meleagris gallopavo] Length = 85 Score = 72.8 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 2/69 (2%) Query: 30 IHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEIT 87 + + + A P +G+AN + L+K L + KS + + S K++ I E+ Sbjct: 17 EAVGVAIAAPPSEGEANAELCRYLSKVLQVKKSDVILEKGGKSRDKVVKILVSLTPDEVL 76 Query: 88 ELLQNNDSL 96 E L+ S Sbjct: 77 EKLKKEAST 85 >gi|78777110|ref|YP_393425.1| hypothetical protein Suden_0912 [Sulfurimonas denitrificans DSM 1251] gi|78497650|gb|ABB44190.1| Protein of unknown function DUF167 [Sulfurimonas denitrificans DSM 1251] Length = 101 Score = 72.8 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 22/85 (25%), Positives = 40/85 (47%), Gaps = 8/85 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+AK+ I + +K+ V A P+ GKA M+ LAK+ +S SS+ ++ Sbjct: 25 TPSAKRDVIGKVRA-------NQLKVSVRAQPEGGKATDYMVGFLAKEFGVSVSSIEVVY 77 Query: 69 KQSSPLKIIYIDKDCKEITELLQNN 93 + S K + I K++ ++ Sbjct: 78 GRESIHKQLRIKA-PKKLPNSIEFE 101 >gi|242077752|ref|XP_002448812.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor] gi|241939995|gb|EES13140.1| hypothetical protein SORBIDRAFT_06g033690 [Sorghum bicolor] Length = 131 Score = 72.8 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 40/90 (44%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 44 ISIHAKPGSKVATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 96 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + + S S K++ + D + + + L+ Sbjct: 97 VSIGSGSKSREKVVLVQDATLEGVYDALKK 126 >gi|320101055|ref|YP_004176647.1| hypothetical protein Desmu_0861 [Desulfurococcus mucosus DSM 2162] gi|319753407|gb|ADV65165.1| protein of unknown function DUF167 [Desulfurococcus mucosus DSM 2162] Length = 138 Score = 72.8 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 39/91 (42%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P + T + + + P++G+AN A++ +++L + S Sbjct: 52 LSIRVKPGESED--------FLTVEGDELVFYTSEPPERGRANAALVKFFSRELKIPVSR 103 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + ++ S LK + D + E+ + L Sbjct: 104 IDIVYGHRSTLKKLVFYDVNMDELADKLAKL 134 >gi|260429009|ref|ZP_05782986.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260419632|gb|EEX12885.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 85 Score = 72.8 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ P A ++ I D +++ VT P+ GKAN A+ +LAK L + K Sbjct: 14 ATLALRVTPRASRNEI--------REDGDQLRVLVTTVPEDGKANAAVAKLLAKALGVPK 65 Query: 62 SSLRMLSKQSSPLKIIYID 80 S L ++ +S K+ I+ Sbjct: 66 SRLTLIQGATSRDKVFRIE 84 >gi|119498429|ref|XP_001265972.1| YggU family protein [Neosartorya fischeri NRRL 181] gi|119414136|gb|EAW24075.1| YggU family protein [Neosartorya fischeri NRRL 181] Length = 133 Score = 72.8 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 5/78 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + PNA S + + V A P+ G+AN A+ + A+ + KS Sbjct: 33 QIACHVKPNASSSR-----EGIIAVGAEKVDVCVAAVPRNGEANAAVSRVFAQIFDVPKS 87 Query: 63 SLRMLSKQSSPLKIIYID 80 + ++ S K + I Sbjct: 88 NAEVIRGLKSRDKTLCIT 105 >gi|68074573|ref|XP_679202.1| hypothetical protein [Plasmodium berghei strain ANKA] gi|56499889|emb|CAH95244.1| conserved hypothetical protein [Plasmodium berghei] Length = 106 Score = 72.8 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 42/91 (46%), Gaps = 7/91 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ PN+K + I DT + I + P ++N A+++ + L L K Sbjct: 20 INLRVKPNSKNTSI------YFNVDTEVLNINIQEQPVNNQSNVAIISYFSDILNLKKRD 73 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + +++ S K++ + ++++ + N Sbjct: 74 ISIVAGLKSRDKVLMVSNISVEDLSNKINQN 104 >gi|259418717|ref|ZP_05742634.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259344939|gb|EEW56793.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 92 Score = 72.8 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A + + +K+ VT P+ GKA + A+LA+ L ++ Sbjct: 20 AEITVRVTPKAAYNAVL--------RQGDVIKVMVTTVPEDGKATADVAALLARALGVAP 71 Query: 62 SSLRMLSKQSSPLKIIYID 80 S + + +S K + Sbjct: 72 SQITLRRGATSRDKTFVLT 90 >gi|320101820|ref|YP_004177411.1| hypothetical protein Isop_0265 [Isosphaera pallida ATCC 43644] gi|319749102|gb|ADV60862.1| protein of unknown function DUF167 [Isosphaera pallida ATCC 43644] Length = 131 Score = 72.8 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 42/90 (46%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P +++ G+ P+KGKAN A+L +LA L ++KS Sbjct: 34 LAVMARPRSRRPGVVGTWCGAVVVAIAA-------APEKGKANAAILEILADLLGIAKSR 86 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 L ++S ++ K++ I + ++ + + + Sbjct: 87 LDLVSGATARSKVVRIAGLEPDQVRQAISD 116 >gi|302381263|ref|YP_003817086.1| hypothetical protein Bresu_0148 [Brevundimonas subvibrioides ATCC 15264] gi|302191891|gb|ADK99462.1| protein of unknown function DUF167 [Brevundimonas subvibrioides ATCC 15264] Length = 92 Score = 72.8 bits (178), Expect = 2e-11, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 45/91 (49%), Gaps = 3/91 (3%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + VRL P A I D +K++V A P +G+AN A++ +LAK L + Sbjct: 1 MARLPVRLTPGASTDRIDGW--DADPEGRPVLKVRVRARPVEGEANAALILLLAKALGVP 58 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 +S++ + S LK+I ++ D + L Sbjct: 59 RSTVSLARGGQSRLKMIEVEGLDDAGLRARL 89 >gi|254409972|ref|ZP_05023752.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] gi|196183008|gb|EDX77992.1| conserved hypothetical protein [Microcoleus chthonoplastes PCC 7420] Length = 74 Score = 72.5 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 V++ PN+K I + +K+ + + P GKANK ++ +LA+K ++KS + Sbjct: 4 SVKVKPNSKTQSI-------EEMADGTLKVNLKSPPVDGKANKELIELLAEKFNVTKSQV 56 Query: 65 RMLSKQSSPLKIIYIDKD 82 ++ S SS +K+I I D Sbjct: 57 QIKSGLSSKIKLIEIVAD 74 >gi|326522380|dbj|BAK07652.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326531514|dbj|BAJ97761.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 129 Score = 72.5 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 40/90 (44%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 43 ISIHAKPGSKMATITEV-------GEEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 95 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + + S S K++ + D K + E L+ Sbjct: 96 VSIGSGSKSREKVVLVQDATLKGVFEALKK 125 >gi|226496211|ref|NP_001150562.1| LOC100284194 [Zea mays] gi|195640232|gb|ACG39584.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea mays] Length = 129 Score = 72.5 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 40/90 (44%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 42 ISIHAKPGSKVATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 94 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + + S S K++ + D + + + L+ Sbjct: 95 VSIGSGSKSREKVVLVQDATLEGVYDALKK 124 >gi|332157902|ref|YP_004423181.1| hypothetical protein PNA2_0260 [Pyrococcus sp. NA2] gi|331033365|gb|AEC51177.1| hypothetical protein PNA2_0260 [Pyrococcus sp. NA2] Length = 92 Score = 72.1 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + PNAKK+ I ++ ++I V A P KGKAN+ ++ L L + Sbjct: 11 IYVLVKPNAKKTEIEGVDT-----WKKRIRISVKAPPVKGKANRELVNFLQGLLN---AE 62 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELL 90 + ++ ++S K + I +E+ L Sbjct: 63 VILVRGETSREKELLIKGLKVEEVKRKL 90 >gi|297526283|ref|YP_003668307.1| protein of unknown function DUF167 [Staphylothermus hellenicus DSM 12710] gi|297255199|gb|ADI31408.1| protein of unknown function DUF167 [Staphylothermus hellenicus DSM 12710] Length = 110 Score = 72.1 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 42/95 (44%), Gaps = 9/95 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + PN+ + + + + T P+KG+AN A++ L++ + L Sbjct: 22 VIIPIYVKPNSDRDALV--------LEGDELVFYTTEIPEKGRANAALIRFLSRNIRLPH 73 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNNDS 95 + + ++ + K + + D + +++ E L S Sbjct: 74 NKIDIIYGARTRSKKVLVRDMEAEKLAEKLAEIIS 108 >gi|30694498|ref|NP_175343.2| unknown protein [Arabidopsis thaliana] gi|34365605|gb|AAQ65114.1| At1g49170 [Arabidopsis thaliana] gi|51971533|dbj|BAD44431.1| similar to serine/threonine kinase 9 gb|AAD28798.1 [Arabidopsis thaliana] gi|332194278|gb|AEE32399.1| uncharacterized protein [Arabidopsis thaliana] Length = 126 Score = 72.1 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 39/95 (41%), Gaps = 8/95 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + +++ A + G+AN A+L ++ L + + Sbjct: 39 ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEYMSSVLGVKRRQ 91 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSLT 97 + + S S K++ ++ + + + L T Sbjct: 92 VSLGSGSKSRDKVVIVEDMTQQSVFQALSQASKPT 126 >gi|219852331|ref|YP_002466763.1| protein of unknown function DUF167 [Methanosphaerula palustris E1-9c] gi|219546590|gb|ACL17040.1| protein of unknown function DUF167 [Methanosphaerula palustris E1-9c] Length = 106 Score = 72.1 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 19/90 (21%), Positives = 42/90 (46%), Gaps = 5/90 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ + AK + +T P +GKAN+A++A+L++ L + +S Sbjct: 17 LLLDVNSKAKADRF----PAGYNEWRHAIGCSITTPPVEGKANRAIVALLSRTLTIPQSG 72 Query: 64 LRMLSKQSSPLKIIYIDKDC-KEITELLQN 92 + +LS +S K + I +++ L+ Sbjct: 73 ISILSGATSSQKRVLIQGMTFEQLAGFLRE 102 >gi|262201697|ref|YP_003272905.1| hypothetical protein Gbro_1753 [Gordonia bronchialis DSM 43247] gi|262085044|gb|ACY21012.1| protein of unknown function DUF167 [Gordonia bronchialis DSM 43247] Length = 75 Score = 72.1 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 6/77 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V+V + PN++K + +T + I V +G+ANKA+ +LAK L + KS Sbjct: 4 QVVVTVKPNSRKGPLV------ETGPDGTVTIYVREPATEGRANKAVAELLAKHLGVPKS 57 Query: 63 SLRMLSKQSSPLKIIYI 79 + ++ ++ K + Sbjct: 58 KVALVGGATARTKRFRV 74 >gi|238494338|ref|XP_002378405.1| DUF167 domain protein [Aspergillus flavus NRRL3357] gi|220695055|gb|EED51398.1| DUF167 domain protein [Aspergillus flavus NRRL3357] Length = 131 Score = 72.1 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 5/78 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + PNA + + + V A P+ G+AN A+ + A+ L + KS Sbjct: 32 QISCNVKPNASANR-----EGIIAVGPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKS 86 Query: 63 SLRMLSKQSSPLKIIYID 80 ++ ++ S K + + Sbjct: 87 TVDVIRGLKSRDKTLCVS 104 >gi|282162781|ref|YP_003355166.1| hypothetical protein MCP_0111 [Methanocella paludicola SANAE] gi|282155095|dbj|BAI60183.1| conserved hypothetical protein [Methanocella paludicola SANAE] Length = 104 Score = 72.1 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 39/90 (43%), Gaps = 5/90 (5%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P AK++ + ++ ++ A P++G+AN+ ++ L+ L + + Sbjct: 14 VLIDFEVSPGAKETRV----PSGYNEWRRRIEARLKAPPERGRANEELIGELSALLGIPE 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELL 90 S + + S K + + +E+ L Sbjct: 70 SRIEITSGARDSRKSVKVLGASREEVLRRL 99 >gi|302868916|ref|YP_003837553.1| hypothetical protein Micau_4464 [Micromonospora aurantiaca ATCC 27029] gi|315504614|ref|YP_004083501.1| hypothetical protein ML5_3839 [Micromonospora sp. L5] gi|302571775|gb|ADL47977.1| protein of unknown function DUF167 [Micromonospora aurantiaca ATCC 27029] gi|315411233|gb|ADU09350.1| protein of unknown function DUF167 [Micromonospora sp. L5] Length = 101 Score = 72.1 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 40/88 (45%), Gaps = 3/88 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P A ++ + D + + V A G+A +A LA L + ++ Sbjct: 9 VAVRVKPGAARARVGG---RFDGPYGPALVVAVHAPAVDGRATEAARRALADALGIRPAT 65 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + S +S K+ +++ + E+L+ Sbjct: 66 VSLRSGAASRDKLFLVERPHDGLPEVLR 93 >gi|307824080|ref|ZP_07654307.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] gi|307734864|gb|EFO05714.1| protein of unknown function DUF167 [Methylobacter tundripaludum SV96] Length = 108 Score = 72.1 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 8/84 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+AK+ I + +K+ VTA P G+A M+ LAK+ ++ + ++ Sbjct: 26 QPSAKQDAIGKV-------KGNQLKVSVTAAPVAGRATDHMVRFLAKEFGVTPKDIEVVF 78 Query: 69 KQSSPLKIIYIDKDCKEITELLQN 92 + + K + I K + ++ Sbjct: 79 GRFNVNKQLRIK-SPKNLPSVINK 101 >gi|154251824|ref|YP_001412648.1| hypothetical protein Plav_1371 [Parvibaculum lavamentivorans DS-1] gi|154155774|gb|ABS62991.1| protein of unknown function DUF167 [Parvibaculum lavamentivorans DS-1] Length = 108 Score = 71.7 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 23/89 (25%), Positives = 44/89 (49%), Gaps = 2/89 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ +R+ P I L ++++V+A +KGKAN A+L +LA+ L L + Sbjct: 10 VSLRLRVTPRGGADRIDGLAADASGEP--FLRVRVSAVAEKGKANDAVLKLLARALRLPR 67 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S+ + S ++ K + I D + + L Sbjct: 68 SAFAVASGEAGRTKSVTISGDTERLMADL 96 >gi|224131202|ref|XP_002328480.1| predicted protein [Populus trichocarpa] gi|222838195|gb|EEE76560.1| predicted protein [Populus trichocarpa] Length = 131 Score = 71.7 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 14/77 (18%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I L +++ A + G+AN A+L ++ L + + Sbjct: 44 ITIHAKPGSKSASITDLSDEAVG-------VQIDAPAKDGEANAALLDYISSVLGVKRRQ 96 Query: 64 LRMLSKQSSPLKIIYID 80 + + S S K++ ++ Sbjct: 97 VSIGSGSKSRDKVVIVE 113 >gi|284161412|ref|YP_003400035.1| hypothetical protein Arcpr_0292 [Archaeoglobus profundus DSM 5631] gi|284011409|gb|ADB57362.1| protein of unknown function DUF167 [Archaeoglobus profundus DSM 5631] Length = 97 Score = 71.7 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 38/91 (41%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + PNAK++ I + +K+ V + P+ GKAN+ + L Sbjct: 10 IEVDVSPNAKRTAITGYD-----PWRKALKVSVKSPPRGGKANRELTDFLGGIFNC---K 61 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + ++ + S K + + ++ +L+ Sbjct: 62 VEIVKGEKSTKKTVLLKGLSLEKALSILKEL 92 >gi|54022885|ref|YP_117127.1| hypothetical protein nfa9180 [Nocardia farcinica IFM 10152] gi|54014393|dbj|BAD55763.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 72 Score = 71.7 bits (175), Expect = 4e-11, Method: Composition-based stats. Identities = 20/73 (27%), Positives = 37/73 (50%), Gaps = 6/73 (8%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + PN++K + +T+ + + V A +GKANKA + +LA + S++R+ Sbjct: 5 IKPNSRKGPLV------ETAADGTLTLYVRAPAVEGKANKAAIDLLAAHYGVPTSAVRLT 58 Query: 68 SKQSSPLKIIYID 80 + +S K ID Sbjct: 59 AGATSRHKRFDID 71 >gi|148643663|ref|YP_001274176.1| hypothetical protein Msm_1603 [Methanobrevibacter smithii ATCC 35061] gi|148552680|gb|ABQ87808.1| conserved hypothetical protein Msm_1603 [Methanobrevibacter smithii ATCC 35061] Length = 101 Score = 71.3 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 13/92 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + PN+ K I+ +I++ PQKGKANK ++ L+K Sbjct: 18 IDIEVSPNSNKFQISGFN-----EWRNRFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69 Query: 64 LRMLSKQSSPLKIIY-----IDKDCKEITELL 90 + + + S K I ID ++++E+L Sbjct: 70 VSISKGEKSSQKTIVCYNVSIDCILEKLSEIL 101 >gi|297852564|ref|XP_002894163.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp. lyrata] gi|297340005|gb|EFH70422.1| hypothetical protein ARALYDRAFT_474060 [Arabidopsis lyrata subsp. lyrata] Length = 126 Score = 71.3 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 39/95 (41%), Gaps = 8/95 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + +++ A + G+AN A+L ++ L + + Sbjct: 39 ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEFISSVLGVKRRQ 91 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSLT 97 + + S S K++ ++ + + + L T Sbjct: 92 VSLGSGSKSRDKVVIVEDMTQQSVFQALSQASKPT 126 >gi|91788526|ref|YP_549478.1| hypothetical protein Bpro_2664 [Polaromonas sp. JS666] gi|91697751|gb|ABE44580.1| protein of unknown function DUF167 [Polaromonas sp. JS666] Length = 103 Score = 71.3 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 19/85 (22%), Positives = 35/85 (41%), Gaps = 8/85 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P A + I +K+ VTA P+ GKA M+ LA ++ + + ++ Sbjct: 27 KPAASRDAIG-------KPKGTQLKVSVTAAPKSGKATDHMVRFLAPLFGVAVADIEVVF 79 Query: 69 KQSSPLKIIYIDKDCKEITELLQNN 93 Q + K + I K++ E+ Sbjct: 80 GQENVNKQLRIKA-PKKLPEVFTAK 103 >gi|302916303|ref|XP_003051962.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI 77-13-4] gi|256732901|gb|EEU46249.1| hypothetical protein NECHADRAFT_19877 [Nectria haematococca mpVI 77-13-4] Length = 75 Score = 71.3 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 38/77 (49%), Gaps = 5/77 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ + + P A K+ +++ V A ++G+ANKA++ +L+ L + KS Sbjct: 4 HLQLHVKPGASKNR-----EGVIAVTDDAIELCVAAQAREGEANKAVVQVLSSVLGVPKS 58 Query: 63 SLRMLSKQSSPLKIIYI 79 SL++ S K + + Sbjct: 59 SLQLTHGLKSRDKTVVL 75 >gi|240170218|ref|ZP_04748877.1| PE-PGRS family protein [Mycobacterium kansasii ATCC 12478] Length = 76 Score = 71.3 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 7/82 (8%) Query: 1 MCNVIV-RLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 M +++V ++ P ++ +T + + I V GKAN+A+ +LA L Sbjct: 1 MADIVVVKVKPGSRN------GPRVETVSGVELTIYVPEPAVGGKANEAVARLLAAHFHL 54 Query: 60 SKSSLRMLSKQSSPLKIIYIDK 81 ++ + ++S S LK ID+ Sbjct: 55 PRTRVELVSGARSRLKRFRIDR 76 >gi|74318638|ref|YP_316378.1| hypothetical protein Tbd_2620 [Thiobacillus denitrificans ATCC 25259] gi|74058133|gb|AAZ98573.1| conserved hypothetical protein [Thiobacillus denitrificans ATCC 25259] Length = 106 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 8/84 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+AK I +K+ VTA P+ G+A M+ LA + ++ S++ ++ Sbjct: 22 KPSAKVDAIG-------KPKGHQLKVSVTAAPRAGRATDHMVRFLADEFGVATSAIEVVF 74 Query: 69 KQSSPLKIIYIDKDCKEITELLQN 92 + + K + I + + + Sbjct: 75 GRMNVNKQLRIKA-PTRLPAVFKA 97 >gi|255627953|gb|ACU14321.1| unknown [Glycine max] Length = 126 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 13/82 (15%), Positives = 35/82 (42%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK + I + +++ A + G+AN A+L ++ L + + Sbjct: 39 ITIHAKPGAKSASITDISDEAVG-------VQIDAPARDGEANAALLDYISSVLGVKRRQ 91 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + + S K + ++ ++ Sbjct: 92 VSLGTGSKSRDKTVIVEDVTQQ 113 >gi|169777211|ref|XP_001823071.1| yggU family protein [Aspergillus oryzae RIB40] gi|83771808|dbj|BAE61938.1| unnamed protein product [Aspergillus oryzae] Length = 131 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 5/78 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + PNA + + + V A P+ G+AN A+ + A+ L + KS Sbjct: 32 QISCNVKPNASANR-----EGIIAVGPEKVDVCVAAVPRDGEANAAVSRVFAQILKVPKS 86 Query: 63 SLRMLSKQSSPLKIIYID 80 ++ ++ S K + + Sbjct: 87 TVVVIRGLKSRDKTLCVS 104 >gi|296126259|ref|YP_003633511.1| hypothetical protein Bmur_1218 [Brachyspira murdochii DSM 12563] gi|296018075|gb|ADG71312.1| protein of unknown function DUF167 [Brachyspira murdochii DSM 12563] Length = 84 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 N+ V++ AK + + +++ A GKANKA++ LA +L + K Sbjct: 1 MNIEVKVTAGAKSNSF--------KFENGAYYVRIMAKAIDGKANKAIIEFLADELNIKK 52 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNN 93 + +L + S KII I+ + ++ E N Sbjct: 53 KDIDILKGEKSSKKIIAINIEENKLKEYFSKN 84 >gi|168032813|ref|XP_001768912.1| predicted protein [Physcomitrella patens subsp. patens] gi|162679824|gb|EDQ66266.1| predicted protein [Physcomitrella patens subsp. patens] Length = 123 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 42/93 (45%), Gaps = 8/93 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V P +K S I + +++ A ++G+AN A+L +A+ L + + Sbjct: 38 ITVHAKPGSKLSAITDTDDGAVG-------VQIDAPAREGEANAALLEYIAEVLGIKRRQ 90 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDS 95 + + S S K++ ++ ++ E ++ + Sbjct: 91 VSLGSGSRSREKLVTVEGLTVDKVYEAIRRAST 123 >gi|223975045|gb|ACN31710.1| unknown [Zea mays] Length = 95 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 15/90 (16%), Positives = 40/90 (44%), Gaps = 8/90 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 8 ISIHAKPGSKVATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 60 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQN 92 + + S S K++ + D + + + L+ Sbjct: 61 VSIGSGSKSREKVVLVQDATLEGVYDALKK 90 >gi|91202864|emb|CAJ72503.1| similar to hypothetical protein YggU [Candidatus Kuenenia stuttgartiensis] Length = 97 Score = 70.9 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 28/91 (30%), Positives = 49/91 (53%), Gaps = 8/91 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V++ + K I + +K+ V+A P+KGKANKA++ +LA+ ++ SS Sbjct: 13 VFVKVQAGSGKDRIVG-------NLGGRLKLAVSAAPEKGKANKAVVELLAETFHINSSS 65 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + ++S ++S K I I+ + I LL N Sbjct: 66 IHIISGKTSRDKKIMIEGVTPESINTLLNFN 96 >gi|327399372|ref|YP_004340241.1| hypothetical protein Hipma_1220 [Hippea maritima DSM 10411] gi|327182001|gb|AEA34182.1| UPF0235 protein yggU [Hippea maritima DSM 10411] Length = 84 Score = 70.9 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 9/90 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ PN+K + + D + +K+ P +GKANKA++ LAK+L ++K Sbjct: 1 MILEVKVKPNSK--------VEQFDFDKGVLTLKIKEKPVEGKANKAVVDKLAKRLKVAK 52 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + ++ + S K++ ID D EI L Sbjct: 53 SCIEIVKGEKSRSKLVRIDCLDDDEILLRL 82 >gi|322418791|ref|YP_004198014.1| hypothetical protein GM18_1269 [Geobacter sp. M18] gi|320125178|gb|ADW12738.1| protein of unknown function DUF167 [Geobacter sp. M18] Length = 105 Score = 70.5 bits (172), Expect = 6e-11, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 38/84 (45%), Gaps = 8/84 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PNAK+ I + + + VT P+ G+A M+ LA++ +S ++++ Sbjct: 21 TPNAKRDAIGKV-------KGHQLCVSVTEFPRAGRATDHMVRFLAEEFGVSTGDIQVVF 73 Query: 69 KQSSPLKIIYIDKDCKEITELLQN 92 + + K + I K + ++ + Sbjct: 74 GRMNVNKQLRIKA-PKRLPSVIAS 96 >gi|154309869|ref|XP_001554267.1| predicted protein [Botryotinia fuckeliana B05.10] gi|150851643|gb|EDN26836.1| predicted protein [Botryotinia fuckeliana B05.10] Length = 130 Score = 70.5 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 24/96 (25%), Positives = 45/96 (46%), Gaps = 8/96 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ + PN+ S + ++ DTS +++ V + +ANK ++ +L+ L K Sbjct: 23 IHLSCHVKPNSSASRV-GVKAFSDTSS--SIEVCVAQPARDNEANKGVVEVLSHILKCPK 79 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + L ++ ++S KII E L NDSL Sbjct: 80 TDLEVIRGKTSKNKIIAYKG-----IEDLLANDSLA 110 >gi|310801877|gb|EFQ36770.1| hypothetical protein GLRG_11914 [Glomerella graminicola M1.001] Length = 92 Score = 70.5 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 17/76 (22%), Positives = 36/76 (47%), Gaps = 5/76 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + R+ P A + + +++ V A ++G+AN+A++ +L++ L L KS Sbjct: 22 LQCRVKPGASR-----VREGIVAVTDGGVELCVAAQAREGEANRAVIKLLSEILGLPKSD 76 Query: 64 LRMLSKQSSPLKIIYI 79 L + S K + + Sbjct: 77 LIISQGLKSRDKTVAV 92 >gi|324527094|gb|ADY48748.1| Unknown [Ascaris suum] Length = 181 Score = 70.5 bits (172), Expect = 7e-11, Method: Composition-based stats. Identities = 22/94 (23%), Positives = 45/94 (47%), Gaps = 10/94 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + PNAK S + + +++ + A P KG+AN+A+ +A+ L L K+ Sbjct: 92 LKIHAKPNAKISRVTEIN-------ETEIEVAIAAPPHKGQANEALTDAIAEILGLRKND 144 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNND 94 + + S K++ I+ +E+ E L+ + Sbjct: 145 VFFDTGARSRSKLLVINSQRITVEEVREKLKKSA 178 >gi|119946689|ref|YP_944369.1| hypothetical protein Ping_3071 [Psychromonas ingrahamii 37] gi|119865293|gb|ABM04770.1| hypothetical protein DUF167 [Psychromonas ingrahamii 37] Length = 111 Score = 70.5 bits (172), Expect = 8e-11, Method: Composition-based stats. Identities = 22/90 (24%), Positives = 41/90 (45%), Gaps = 10/90 (11%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AKK + + I ++++VTATP G+A M+ LAK+ ++S + ++ Sbjct: 28 TPGAKKDAVG-------KAQGIQLRVRVTATPVAGRATDHMVRFLAKEFSVSPDDITVVF 80 Query: 69 KQSSPLKIIYIDKD---CKEITELLQNNDS 95 + + K + I + + L DS Sbjct: 81 GRLNINKQLRIKAPKKMPSVVAKALSKKDS 110 >gi|260432898|ref|ZP_05786869.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260416726|gb|EEX09985.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 91 Score = 70.1 bits (171), Expect = 9e-11, Method: Composition-based stats. Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 8/78 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ VR+ P A + I +D + I VTA + GKAN A+ +LAK + ++ S Sbjct: 21 HIQVRVTPKAARDRI--------QADESSVHIAVTAPAEGGKANLAVARILAKAMGIAPS 72 Query: 63 SLRMLSKQSSPLKIIYID 80 +L + Q++ K+ + Sbjct: 73 ALILKQGQTARNKLFVYE 90 >gi|91205448|ref|YP_537803.1| hypothetical protein RBE_0633 [Rickettsia bellii RML369-C] gi|157827193|ref|YP_001496257.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389] gi|122425693|sp|Q1RIV0|Y633_RICBR RecName: Full=UPF0235 protein RBE_0633 gi|226706174|sp|A8GWJ3|Y4385_RICB8 RecName: Full=UPF0235 protein A1I_04385 gi|91068992|gb|ABE04714.1| unknown [Rickettsia bellii RML369-C] gi|157802497|gb|ABV79220.1| hypothetical protein A1I_04385 [Rickettsia bellii OSU 85-389] Length = 104 Score = 70.1 bits (171), Expect = 1e-10, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 3/89 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ +++ AK + I I D H+K+ + A Q+GKAN+ ++ LAK+ L + Sbjct: 13 ASLNIKVKAAAKSNDIKEFIIINDVL---HLKLSIKAHAQQGKANEEIINFLAKEWQLLR 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 S+L + ++ LK I I +E L+ Sbjct: 70 SNLEITKGHTNSLKTILIKNIDEEYLNLI 98 >gi|222444857|ref|ZP_03607372.1| hypothetical protein METSMIALI_00470 [Methanobrevibacter smithii DSM 2375] gi|222434422|gb|EEE41587.1| hypothetical protein METSMIALI_00470 [Methanobrevibacter smithii DSM 2375] Length = 101 Score = 69.8 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 40/92 (43%), Gaps = 13/92 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + PN+ K I+ +I++ PQKGKANK ++ L+K Sbjct: 18 VDIEVSPNSNKFQISGFN-----EWRNRFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69 Query: 64 LRMLSKQSSPLKIIY-----IDKDCKEITELL 90 + + + S K I ID +++E+L Sbjct: 70 VSISKGEKSSQKTIVCYNVSIDDILDKLSEIL 101 >gi|133930345|ref|NP_001076616.1| hypothetical protein W01A8.2 [Caenorhabditis elegans] gi|114420882|emb|CAL44973.1| C. elegans protein W01A8.2, confirmed by transcript evidence [Caenorhabditis elegans] Length = 127 Score = 69.8 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 10/93 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKS + ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 39 LHIHAKPGAKKSCVVAI-------GDSEVDVAIGAAPREGAANEELISYLMSALGLRKNE 91 Query: 64 LRMLSKQSSPLKIIYIDK---DCKEITELLQNN 93 L+ S K++ ID E+ + LQ Sbjct: 92 LQFDKGAKSRSKVVLIDTKRLTIDEVRKKLQEE 124 >gi|261402905|ref|YP_003247129.1| protein of unknown function DUF167 [Methanocaldococcus vulcanius M7] gi|261369898|gb|ACX72647.1| protein of unknown function DUF167 [Methanocaldococcus vulcanius M7] Length = 102 Score = 69.4 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 21/95 (22%), Positives = 41/95 (43%), Gaps = 9/95 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAK++ IA + + +K+ A +GKANK ++ K Sbjct: 15 IDIDVQANAKRNEIAGIN-----EWRKRLSVKIKAPAIEGKANKEIIKFFGNLF---KKD 66 Query: 64 LRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSLT 97 + ++ ++S K I I K + E+L+ T Sbjct: 67 VEIILGKTSSQKTILILGAKKGYVEEILKKELEKT 101 >gi|24214436|ref|NP_711917.1| hypothetical protein LA_1736 [Leptospira interrogans serovar Lai str. 56601] gi|45657916|ref|YP_002002.1| hypothetical protein LIC12068 [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|29839712|sp|Q8F5E6|Y1736_LEPIN RecName: Full=UPF0235 protein LA_1736 gi|73921077|sp|Q72QP5|Y2068_LEPIC RecName: Full=UPF0235 protein LIC_12068 gi|24195381|gb|AAN48935.1|AE011350_4 conserved hypothetical protein [Leptospira interrogans serovar Lai str. 56601] gi|45601157|gb|AAS70639.1| conserved hypothetical protein [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 73 Score = 69.4 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V + PN+KK + + + I V +GKAN+A++ ++K++ + K Sbjct: 1 MKFTVYVKPNSKK-------VFFRKEEDGVLTIAVREPALEGKANEAVIESISKEMKVPK 53 Query: 62 SSLRMLSKQSSPLKIIYID 80 S +R+LS Q + KII ID Sbjct: 54 SKIRILSGQKNKKKIIEID 72 >gi|255632017|gb|ACU16361.1| unknown [Glycine max] Length = 126 Score = 69.4 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 11/82 (13%), Positives = 35/82 (42%), Gaps = 7/82 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + + + +++ A + G+AN A+L ++ L + + Sbjct: 39 ITIHAKPGSKSASVTDISDEAVG-------VQIDAPARDGEANAALLDYISSVLGVKRRQ 91 Query: 64 LRMLSKQSSPLKIIYIDKDCKE 85 + + + S K + ++ ++ Sbjct: 92 VSLGTGSKSRDKTVIVEDVTQQ 113 >gi|159905985|ref|YP_001549647.1| hypothetical protein MmarC6_1603 [Methanococcus maripaludis C6] gi|226734735|sp|A9AAP2|Y1603_METM6 RecName: Full=UPF0235 protein MmarC6_1603 gi|159887478|gb|ABX02415.1| protein of unknown function DUF167 [Methanococcus maripaludis C6] Length = 101 Score = 69.4 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIKEQPIEGKANKAIIKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + + S +S K + I DK +++ ++L+ Sbjct: 67 ISINSGTTSSQKTVLIPDKTKEDVVKILKKE 97 >gi|121712776|ref|XP_001273999.1| YggU family protein [Aspergillus clavatus NRRL 1] gi|119402152|gb|EAW12573.1| YggU family protein [Aspergillus clavatus NRRL 1] Length = 133 Score = 69.4 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 32/78 (41%), Gaps = 5/78 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + PNA S + + V A P+ G+AN A+ + A+ + KS Sbjct: 33 QIACHVKPNASSSR-----EGVVAIGPEKVDVCVAAVPRNGEANIAVARVFAQIFDVPKS 87 Query: 63 SLRMLSKQSSPLKIIYID 80 + ++ S KI+ I Sbjct: 88 NAEVIRGLKSRDKILCIT 105 >gi|94496543|ref|ZP_01303119.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58] gi|94423903|gb|EAT08928.1| hypothetical protein SKA58_17602 [Sphingomonas sp. SKA58] Length = 107 Score = 69.4 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 25/78 (32%), Positives = 44/78 (56%), Gaps = 2/78 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +RL P + + GI + +D + +V A P+KG+AN A++A+LA+ L KS+ Sbjct: 13 LAIRLTPGSARQGIGGV--WRDDRAAPWLTARVRAVPEKGRANTALIALLAQALDWPKSA 70 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + S S+ LK + I Sbjct: 71 IMLESGDSNRLKRLRIIG 88 >gi|301166027|emb|CBW25601.1| conserved hypothetical protein [Bacteriovorax marinus SJ] Length = 109 Score = 69.0 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 9/75 (12%) Query: 4 VIVRLIPNAKKSG-IASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + + P AK I D + I + P G+ANKA + LA +L+++KS Sbjct: 17 LNIWAKPGAKVEKSIVG--------DEGEIIIYIKERPIDGQANKAFIKYLAAQLSITKS 68 Query: 63 SLRMLSKQSSPLKII 77 S+ + S K Sbjct: 69 SVSLSRGSKSRFKRF 83 >gi|118430921|ref|NP_147028.2| hypothetical protein APE_0182.1 [Aeropyrum pernix K1] gi|150421715|sp|Q9YFR7|Y182_AERPE RecName: Full=UPF0235 protein APE_0182.1 gi|116062246|dbj|BAA79094.2| conserved hypothetical protein [Aeropyrum pernix K1] Length = 108 Score = 69.0 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 37/94 (39%), Gaps = 10/94 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P + + + + P +G+AN +++ LA+ L +S Sbjct: 19 VRIRVYVKPEGR--------ERRLRLEEGELVFYTDEPPLEGRANASLINFLARGLKVSV 70 Query: 62 SSLRMLSKQSSPLKIIYID--KDCKEITELLQNN 93 ++ ++ S K++ I D + E L + Sbjct: 71 KNIEIVHGARSRSKVVEIRDVADPDALLERLASI 104 >gi|254475305|ref|ZP_05088691.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214029548|gb|EEB70383.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 96 Score = 69.0 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 3/75 (4%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 +++VR+ P A + + + S + +KI T P+ GKA +A+ +LA + ++ Sbjct: 21 DILVRVTPKAARDSV---QRTSGDSGELVLKITTTTAPENGKATEAVRKLLATAMRVAPR 77 Query: 63 SLRMLSKQSSPLKII 77 L +L +S K+ Sbjct: 78 DLVLLRGATSREKLF 92 >gi|150402236|ref|YP_001329530.1| hypothetical protein MmarC7_0309 [Methanococcus maripaludis C7] gi|166229378|sp|A6VG03|Y309_METM7 RecName: Full=UPF0235 protein MmarC7_0309 gi|150033266|gb|ABR65379.1| protein of unknown function DUF167 [Methanococcus maripaludis C7] Length = 101 Score = 69.0 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 46/91 (50%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIREQPIEGKANKAIVKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + + S +S K + I DK +++ ++L+ Sbjct: 67 IFINSGTTSSQKTVLIPDKTKEDVVKILKKE 97 >gi|195953751|ref|YP_002122041.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1] gi|226734129|sp|B4U5M3|Y1378_HYDS0 RecName: Full=UPF0235 protein HY04AAS1_1378 gi|195933363|gb|ACG58063.1| protein of unknown function DUF167 [Hydrogenobaculum sp. Y04AAS1] Length = 73 Score = 69.0 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 23/78 (29%), Positives = 43/78 (55%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V++ PNAK + LE +KI + + P GKAN+ ++ +L++ L +SK Sbjct: 1 MILRVKVKPNAKTVSVEQLE-------DKSLKISIKSPPVNGKANEELIKVLSEFLKVSK 53 Query: 62 SSLRMLSKQSSPLKIIYI 79 S + + + +SS K++ I Sbjct: 54 SKINIKAGKSSREKLVEI 71 >gi|90399179|emb|CAJ86041.1| H0723C07.11 [Oryza sativa Indica Group] gi|125550304|gb|EAY96126.1| hypothetical protein OsI_18003 [Oryza sativa Indica Group] Length = 125 Score = 69.0 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 38 ISIHAKPGSKLATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 90 Query: 64 LRMLSKQSSPLKIIYI 79 + + S S K++ + Sbjct: 91 VSIGSGSKSREKVVLV 106 >gi|319956454|ref|YP_004167717.1| hypothetical protein Nitsa_0701 [Nitratifractor salsuginis DSM 16511] gi|319418858|gb|ADV45968.1| protein of unknown function DUF167 [Nitratifractor salsuginis DSM 16511] Length = 98 Score = 68.6 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 23/96 (23%), Positives = 48/96 (50%), Gaps = 8/96 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + ++ ++ P A +S A L +K+++ A +G ANK ++ L+K + Sbjct: 9 LASLKIKAQPGASRSEFAGLY------GDEAIKVRIAAAAVEGAANKELVKFLSKAFKVP 62 Query: 61 KSSLRMLSKQSSPLKIIYIDKDC--KEITELLQNND 94 KSS+R S +++ +K++ KE E L++ + Sbjct: 63 KSSIRFKSGETAKIKVVEFPYSEKFKEFLEKLEDRN 98 >gi|114568606|ref|YP_755286.1| hypothetical protein Mmar10_0052 [Maricaulis maris MCS10] gi|114339068|gb|ABI64348.1| protein of unknown function DUF167 [Maricaulis maris MCS10] Length = 108 Score = 68.6 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 2/94 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VRL P A + +A D ++ ++ +V A P+KGKAN A++A+LA++L + K Sbjct: 11 VQVRLAPGASRDCLAG--SQADAAERHYLVARVRAIPEKGKANAALIALLARQLGIPKRD 68 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLT 97 + ++ +S +K + I + E ++ ++ Sbjct: 69 IDVIRGATSRMKTVRISANGPEQDRIVAQLEAFA 102 >gi|89898566|ref|YP_515676.1| hypothetical protein CF0759 [Chlamydophila felis Fe/C-56] gi|89331938|dbj|BAE81531.1| conserved hypothetical protein [Chlamydophila felis Fe/C-56] Length = 65 Score = 68.6 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 23/57 (40%), Positives = 38/57 (66%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITE 88 +KI+VT P+KG+AN+A++A+LAK L+L K + ++S SS K + + K + I Sbjct: 2 LKIRVTEAPEKGRANEAVIALLAKTLSLPKRDVTLISGDSSRKKRLLLPKAAESIIS 58 >gi|20092890|ref|NP_618965.1| hypothetical protein MA4097 [Methanosarcina acetivorans C2A] gi|19918198|gb|AAM07445.1| conserved hypothetical protein [Methanosarcina acetivorans C2A] Length = 123 Score = 68.6 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 35/80 (43%), Gaps = 4/80 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P ++ + + +K+T QKGKAN+ ++ LA+ +S S Sbjct: 34 IEIEVTPGSRSLSV----PSGYNEWRKRIAVKLTKNAQKGKANEQLIESLAELFGISSSE 89 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + S +S K + I Sbjct: 90 ILINSGATSSKKSLLIKGIS 109 >gi|302877711|ref|YP_003846275.1| hypothetical protein Galf_0467 [Gallionella capsiferriformans ES-2] gi|302580500|gb|ADL54511.1| protein of unknown function DUF167 [Gallionella capsiferriformans ES-2] Length = 104 Score = 68.6 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 14/83 (16%), Positives = 38/83 (45%), Gaps = 8/83 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 PN+K+ I + +++ A P++G A M+ LA ++ + ++ ++ Sbjct: 26 RPNSKQDAIGRV-------IGHQLEVYAAAVPRRGGATAHMVQYLASIFSVPEHAITVVF 78 Query: 69 KQSSPLKIIYIDKDCKEITELLQ 91 + + K + I+ ++ ++Q Sbjct: 79 GEKNVNKQLRIE-SPGKLPSVIQ 100 >gi|163757393|ref|ZP_02164482.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43] gi|162284895|gb|EDQ35177.1| hypothetical protein HPDFL43_18322 [Hoeflea phototrophica DFL-43] Length = 63 Score = 68.6 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 24/59 (40%), Positives = 39/59 (66%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 MKI+V A + KAN+A+ A LAK+L L+KS +R++S +S K + ++ D +E+ L Sbjct: 1 MKIRVRAVAENNKANRALEAFLAKRLKLAKSRVRVISGANSRTKTVRLEGDPQELAAKL 59 >gi|330469281|ref|YP_004407024.1| hypothetical protein VAB18032_26756 [Verrucosispora maris AB-18-032] gi|328812252|gb|AEB46424.1| hypothetical protein VAB18032_26756 [Verrucosispora maris AB-18-032] Length = 98 Score = 68.6 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 37/87 (42%), Gaps = 3/87 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P A ++ + D + I V A G+A +A LA+ L + ++ Sbjct: 8 VAVRVKPGASRARVGG---RYDGPHGPALVIAVNAPAVDGRATEAARRALAEALGVRPAT 64 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELL 90 + + + SS K+ I +T L Sbjct: 65 VSLRTGASSRDKLFQITPASPTLTAHL 91 >gi|296284807|ref|ZP_06862805.1| hypothetical protein CbatJ_14361 [Citromicrobium bathyomarinum JL354] Length = 92 Score = 68.2 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 8/77 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P A+ +A ++ +++KV A PQ G AN A+ ++AK L + KS Sbjct: 23 LRVRVTPGARTESLAIVD--------GGVQVKVRAKPQDGAANVAVAELVAKALGIPKSR 74 Query: 64 LRMLSKQSSPLKIIYID 80 +L +S K++ + Sbjct: 75 CTLLRGATSREKVLGVT 91 >gi|261350574|ref|ZP_05975991.1| conserved hypothetical protein [Methanobrevibacter smithii DSM 2374] gi|288861357|gb|EFC93655.1| conserved hypothetical protein [Methanobrevibacter smithii DSM 2374] Length = 101 Score = 68.2 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 39/92 (42%), Gaps = 13/92 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + PN+ K I+ +I++ PQKGKANK ++ L+K Sbjct: 18 VDIEVSPNSNKFQISGFN-----EWRNRFEIRIKQVPQKGKANKEIVKELSKIFNC---D 69 Query: 64 LRMLSKQSSPLKIIY-----IDKDCKEITELL 90 + + + S K I I+ ++ E+L Sbjct: 70 VSISKGEKSSQKTIVCYDVSIEYILDKLGEIL 101 >gi|323356791|ref|YP_004223187.1| hypothetical protein MTES_0343 [Microbacterium testaceum StLB037] gi|323273162|dbj|BAJ73307.1| uncharacterized conserved protein [Microbacterium testaceum StLB037] Length = 73 Score = 68.2 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 12/80 (15%), Positives = 30/80 (37%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P +++ + + + V G AN ++ LA + Sbjct: 1 MQLTVRVKPGSRRGPLV-------EDTAEGLVVHVRERAVDGAANSGVVKALAAHFGVPA 53 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 + +L ++ +K + +D Sbjct: 54 RDVEILRGHAARVKRVEVDA 73 >gi|116754806|ref|YP_843924.1| hypothetical protein Mthe_1512 [Methanosaeta thermophila PT] gi|116754813|ref|YP_843931.1| hypothetical protein Mthe_1519 [Methanosaeta thermophila PT] gi|116754820|ref|YP_843938.1| hypothetical protein Mthe_1526 [Methanosaeta thermophila PT] gi|116666257|gb|ABK15284.1| protein of unknown function DUF167 [Methanosaeta thermophila PT] gi|116666264|gb|ABK15291.1| protein of unknown function DUF167 [Methanosaeta thermophila PT] gi|116666271|gb|ABK15298.1| protein of unknown function DUF167 [Methanosaeta thermophila PT] Length = 122 Score = 68.2 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 15/91 (16%), Positives = 42/91 (46%), Gaps = 5/91 (5%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + +IP + + + +++++T KGKAN+ +L +++ L++ Sbjct: 32 VVLHIDVIPGSS----ELVVPAGFNAWRGSIEVRLTERADKGKANRQLLQEISRAFGLTQ 87 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQ 91 + ++S S K++ + D + + L+ Sbjct: 88 GDVEIMSGHRSQRKVVLMRGIDAESVLAALR 118 >gi|154150390|ref|YP_001404008.1| hypothetical protein Mboo_0847 [Candidatus Methanoregula boonei 6A8] gi|153998942|gb|ABS55365.1| protein of unknown function DUF167 [Methanoregula boonei 6A8] Length = 104 Score = 68.2 bits (166), Expect = 3e-10, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 35/84 (41%), Gaps = 4/84 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V + + AK L + +V A GKANKA++ +++ + Sbjct: 15 VLVSIEVTAGAKSD----LFPAGYNEWRKAIGCRVAAPAVNGKANKAVIGIISAGTGVPA 70 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 +S+ +++ +S K + I KE Sbjct: 71 ASVTIVAGLTSSQKKVRIAGITKE 94 >gi|322697515|gb|EFY89294.1| DUF167 domain protein [Metarhizium acridum CQMa 102] Length = 122 Score = 68.2 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 12/104 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ K + D +++ VTA P+ G+ANKA++ L+ + + K Sbjct: 22 VQLQLRVKAGTSKDR-EGILAVTDR----AIELCVTAQPRHGEANKAVVQALSNAIGIPK 76 Query: 62 SSLRMLSKQSSPLKIIYI-----DKD--CKEITELLQNNDSLTL 98 S R +S S K++ I D + + LL+ TL Sbjct: 77 SRFRFVSGLKSRDKVVAIGDIQGDGPDYTETVLRLLREASYRTL 120 >gi|15668799|ref|NP_247602.1| hypothetical protein MJ_0618 [Methanocaldococcus jannaschii DSM 2661] gi|2496078|sp|Q58035|Y618_METJA RecName: Full=UPF0235 protein MJ0618 gi|1591329|gb|AAB98613.1| conserved hypothetical protein [Methanocaldococcus jannaschii DSM 2661] Length = 98 Score = 68.2 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + NAKK+ I + + IK+ A +GKANK ++ + K Sbjct: 13 VLIDIDVQANAKKNEIVGIN-----EWRKRLSIKIKAPATEGKANKEIIKFFKEIF---K 64 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQ 91 + ++S + +P K + I D E+ E+L+ Sbjct: 65 KDVEIVSGKLNPQKTVLIGDIKKDEVIEILK 95 >gi|119509193|ref|ZP_01628343.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414] gi|119466035|gb|EAW46922.1| hypothetical protein N9414_14620 [Nodularia spumigena CCY9414] Length = 52 Score = 68.2 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 19/49 (38%), Positives = 32/49 (65%) Query: 34 IKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKD 82 +++ + P GKAN+ ++ +LA+K + KS +R+ S SS K+I ID D Sbjct: 3 VRLKSPPVDGKANEELIKLLAEKFHVPKSHIRIKSGVSSRQKLIEIDTD 51 >gi|116328001|ref|YP_797721.1| hypothetical protein LBL_1291 [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116330879|ref|YP_800597.1| hypothetical protein LBJ_1240 [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|122281367|sp|Q04TD1|Y1240_LEPBJ RecName: Full=UPF0235 protein LBJ_1240 gi|122284207|sp|Q052F7|Y1291_LEPBL RecName: Full=UPF0235 protein LBL_1291 gi|116120745|gb|ABJ78788.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116124568|gb|ABJ75839.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 73 Score = 68.2 bits (166), Expect = 4e-10, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 VR+ PN+KK I + + I V +GKAN+A++ +++++ + K Sbjct: 1 MKFTVRVKPNSKK-------IFFRKEEDGSVTIAVREPALEGKANEAVIETISREMKIPK 53 Query: 62 SSLRMLSKQSSPLKIIYID 80 +R++S + K I ID Sbjct: 54 RKIRIVSGEKGKKKTIEID 72 >gi|254419593|ref|ZP_05033317.1| conserved hypothetical protein [Brevundimonas sp. BAL3] gi|196185770|gb|EDX80746.1| conserved hypothetical protein [Brevundimonas sp. BAL3] Length = 69 Score = 67.8 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 18/53 (33%), Positives = 33/53 (62%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84 +K++V A P G+AN A++ LAK L +S+SS+ + S LK++ ++ + Sbjct: 2 LKVRVRARPVDGEANAALVKFLAKALGVSRSSVVLERGGQSRLKMVSVEGLDE 54 >gi|288818090|ref|YP_003432438.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6] gi|288787490|dbj|BAI69237.1| hypothetical protein HTH_0777 [Hydrogenobacter thermophilus TK-6] Length = 73 Score = 67.8 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR P AK+ + + ++ V PQ+GKAN+ + +L+ L + K Sbjct: 1 MILEVRAKPKAKREYVKKIT-------ESVYEVAVKEPPQEGKANERIAVLLSYHLGIPK 53 Query: 62 SSLRMLSKQSSPLKIIYID 80 S +++L +S +K+ +D Sbjct: 54 SRIKLLKGHTSKIKLFQVD 72 >gi|124515876|gb|EAY57385.1| conserved hypothetical protein [Leptospirillum rubarum] Length = 104 Score = 67.8 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 39/91 (42%), Gaps = 9/91 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P KK + ++A ++G AN +L L+ LA S Sbjct: 13 RVRIRVRPGKKKDVLL--------LSGEEFSADLSAPAREGAANDRLLRNLSYWLAWPVS 64 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 ++R+ +SS LK I I +E+ + L Sbjct: 65 NIRIEKGESSRLKTIAIRGMTGEEVRKRLLA 95 >gi|297568085|ref|YP_003689429.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus AHT2] gi|296924000|gb|ADH84810.1| protein of unknown function DUF167 [Desulfurivibrio alkaliphilus AHT2] Length = 76 Score = 67.8 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +++ P ++ S + D S +K + P +G+AN+ ++A++A K Sbjct: 3 MILQIKVKPRSQSSSLT---QEADGSWLARLK----SPPVEGRANRELIALVADHFRCRK 55 Query: 62 SSLRMLSKQSSPLKIIYID 80 + + + + S K++ ++ Sbjct: 56 ADVEIKAGSSGRTKLVRVE 74 >gi|117927694|ref|YP_872245.1| hypothetical protein Acel_0486 [Acidothermus cellulolyticus 11B] gi|117648157|gb|ABK52259.1| protein of unknown function DUF167 [Acidothermus cellulolyticus 11B] Length = 85 Score = 67.8 bits (165), Expect = 4e-10, Method: Composition-based stats. Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 8/91 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +++ + P + ++ + + + V + G+A A LA +A+ + K Sbjct: 1 MRLVIHVRPGSSRATVGGSH-------NGALIVAVREPAEHGRATDAALAAVAQAFGVPK 53 Query: 62 SSLRMLSKQSSPLKII-YIDKDCKEITELLQ 91 S +R++S +S KII ID D + ELL Sbjct: 54 SQVRLVSGATSRRKIIDVIDGDPVRLAELLA 84 >gi|148358546|ref|YP_001249753.1| hypothetical protein LPC_0417 [Legionella pneumophila str. Corby] gi|296108365|ref|YP_003620066.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99 Alcoy] gi|148280319|gb|ABQ54407.1| conserved hypothetical protein; DUF167 [Legionella pneumophila str. Corby] gi|295650267|gb|ADG26114.1| hypothetical protein lpa_03962 [Legionella pneumophila 2300/99 Alcoy] Length = 70 Score = 67.8 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 33/65 (50%), Gaps = 1/65 (1%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEIT 87 + I + A PQ+G+AN +L +++ + K+ + ++ +SS K+I + + + Sbjct: 4 SDDRLHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLIRLP-LSESVF 62 Query: 88 ELLQN 92 L N Sbjct: 63 RFLNN 67 >gi|297603606|ref|NP_001054326.2| Os04g0686300 [Oryza sativa Japonica Group] gi|215687220|dbj|BAG91785.1| unnamed protein product [Oryza sativa Japonica Group] gi|222629814|gb|EEE61946.1| hypothetical protein OsJ_16702 [Oryza sativa Japonica Group] gi|255675903|dbj|BAF16240.2| Os04g0686300 [Oryza sativa Japonica Group] Length = 129 Score = 67.8 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 35/76 (46%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + ++ P +K + I + + +++ A + G+AN A++ ++ L + K Sbjct: 42 ISIQAKPGSKLATITEI-------GDEAVGVQIDAPARDGEANAALVDFISSVLGVKKRE 94 Query: 64 LRMLSKQSSPLKIIYI 79 + + S S K++ + Sbjct: 95 VSIGSGSKSREKVVLV 110 >gi|45358618|ref|NP_988175.1| hypothetical protein MMP1055 [Methanococcus maripaludis S2] gi|74554395|sp|Q6LYD6|Y1055_METMP RecName: Full=UPF0235 protein MMP1055 gi|45047484|emb|CAF30611.1| conserved hypothetical protein [Methanococcus maripaludis S2] Length = 101 Score = 67.8 bits (165), Expect = 5e-10, Method: Composition-based stats. Identities = 23/91 (25%), Positives = 44/91 (48%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIKEQPIEGKANKAIIKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + + S +S K + I DK ++ +L+ Sbjct: 67 ILINSGTTSSQKTVLIPDKTKDDVVTILKKE 97 >gi|149185290|ref|ZP_01863607.1| hypothetical protein ED21_19592 [Erythrobacter sp. SD-21] gi|148831401|gb|EDL49835.1| hypothetical protein ED21_19592 [Erythrobacter sp. SD-21] Length = 93 Score = 67.5 bits (164), Expect = 5e-10, Method: Composition-based stats. Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 8/77 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P A+ + + + KV A PQ G AN A+ ++A + + Sbjct: 23 RVAIRVTPGARTESLT--------LEGGVLAAKVRAKPQDGAANDAVRKLIAAAYRVPPT 74 Query: 63 SLRMLSKQSSPLKIIYI 79 + +L +S K++ I Sbjct: 75 RVELLRGATSREKLLRI 91 >gi|149057385|gb|EDM08708.1| similar to RIKEN cDNA 3110040N11, isoform CRA_e [Rattus norvegicus] Length = 130 Score = 67.5 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 28/67 (41%), Gaps = 7/67 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + A P +G+AN + L+K L L K Sbjct: 36 VTIAIHAKPGSKQNAVTDLNTEAVG-------VAIAAPPSEGEANAELCRYLSKVLDLRK 88 Query: 62 SSLRMLS 68 S + + Sbjct: 89 SDVVLDK 95 >gi|331695278|ref|YP_004331517.1| hypothetical protein Psed_1425 [Pseudonocardia dioxanivorans CB1190] gi|326949967|gb|AEA23664.1| UPF0235 protein yggU [Pseudonocardia dioxanivorans CB1190] Length = 89 Score = 67.5 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 18/89 (20%), Positives = 41/89 (46%), Gaps = 8/89 (8%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + +R+ P A ++ + + + ++V G A A L +A + + Sbjct: 7 RLTIRVRPGASRTSVGG-------AYADALVVRVNERAVDGAATAAALRAVAGAFGVPRG 59 Query: 63 SLRMLSKQSSPLKIIYID-KDCKEITELL 90 ++R++S +S KI+ ++ D + + ELL Sbjct: 60 AVRLVSGATSRTKILDVEGGDPERLRELL 88 >gi|296242831|ref|YP_003650318.1| hypothetical protein Tagg_1097 [Thermosphaera aggregans DSM 11486] gi|296095415|gb|ADG91366.1| protein of unknown function DUF167 [Thermosphaera aggregans DSM 11486] Length = 111 Score = 67.5 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 19/96 (19%), Positives = 44/96 (45%), Gaps = 9/96 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P + E T ++ + + + ++G+AN +++ L+++L + Sbjct: 23 VILQVRVKPGS--------EPEGFTIESDELVFRTSEPAERGRANASLIKYLSRELKIPV 74 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE-ITELLQNNDSL 96 S + ++ Q LK + I + + I E L +L Sbjct: 75 SKIDIVYGQREKLKKVLIMDEPADKIIEKLARVLNL 110 >gi|322708042|gb|EFY99619.1| DUF167 domain protein [Metarhizium anisopliae ARSEF 23] Length = 126 Score = 67.5 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 25/104 (24%), Positives = 44/104 (42%), Gaps = 12/104 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + +R+ K + D +++ V A P+ G+ANKA++ L+ + + K Sbjct: 26 VQLQLRVKAGTSKDR-EGILAVTDR----AIELCVAAQPRHGEANKAVVQALSNAIGIPK 80 Query: 62 SSLRMLSKQSSPLKIIYI-----DKD--CKEITELLQNNDSLTL 98 S R +S S K++ I D + I LL+ TL Sbjct: 81 SRFRFVSGIKSRDKVVAIGDIQGDGPEYTETILRLLREASHRTL 124 >gi|121603294|ref|YP_980623.1| hypothetical protein Pnap_0379 [Polaromonas naphthalenivorans CJ2] gi|120592263|gb|ABM35702.1| protein of unknown function DUF167 [Polaromonas naphthalenivorans CJ2] Length = 107 Score = 67.5 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 18/82 (21%), Positives = 34/82 (41%), Gaps = 8/82 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+A K I +K+ VTA P GKA M+ LA ++ + + ++ Sbjct: 29 KPSAGKDAIG-------KPKGTQLKVSVTAAPLAGKATDHMVRFLAPLFGVAVADIEVVF 81 Query: 69 KQSSPLKIIYIDKDCKEITELL 90 + + K + I K++ + Sbjct: 82 GRENVNKQLRIRA-PKKLPAVF 102 >gi|297297113|ref|XP_002804962.1| PREDICTED: UPF0235 protein C15orf40-like isoform 2 [Macaca mulatta] Length = 150 Score = 67.5 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 64 VTITIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 116 Query: 62 SSLRMLS 68 S + + Sbjct: 117 SDVVLDK 123 >gi|237858670|ref|NP_001153588.1| hypothetical protein LOC123207 isoform e [Homo sapiens] Length = 149 Score = 67.5 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRMLS 68 S + + Sbjct: 116 SDVVLDK 122 >gi|189501744|ref|YP_001957461.1| hypothetical protein Aasi_0294 [Candidatus Amoebophilus asiaticus 5a2] gi|259646928|sp|B3ER80|Y294_AMOA5 RecName: Full=UPF0235 protein Aasi_0294 gi|189497185|gb|ACE05732.1| hypothetical protein Aasi_0294 [Candidatus Amoebophilus asiaticus 5a2] Length = 97 Score = 67.1 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 42/79 (53%), Gaps = 7/79 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V+ IP +K S I + I++T+ P+ GKAN+ ++ ++AK L L +++ Sbjct: 5 IEVKAIPKSKISAIT-------IDKLGRLCIRITSAPENGKANREIIKLIAKTLKLPQAN 57 Query: 64 LRMLSKQSSPLKIIYIDKD 82 + +++ + LK I I Sbjct: 58 VEIIAGLTIKLKRIRITSS 76 >gi|330507436|ref|YP_004383864.1| hypothetical protein MCON_1367 [Methanosaeta concilii GP-6] gi|328928244|gb|AEB68046.1| hypothetical protein MCON_1367 [Methanosaeta concilii GP-6] Length = 105 Score = 67.1 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 14/84 (16%), Positives = 40/84 (47%), Gaps = 4/84 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + ++P + + + ++ ++T P +G+AN+ ++ LA+ L + + Sbjct: 14 CIIRFEVVPGSSRLAV----PSGFNPWRRSLEARLTEKPSRGRANRQLVEELARILGVDE 69 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 S + ++ + S K++ + K+ Sbjct: 70 SGIEVIKGEKSGRKLLLVKGIEKD 93 >gi|148284889|ref|YP_001248979.1| hypothetical protein OTBS_1640 [Orientia tsutsugamushi str. Boryong] gi|146740328|emb|CAM80735.1| conserved hypothetical protein [Orientia tsutsugamushi str. Boryong] Length = 112 Score = 67.1 bits (163), Expect = 8e-10, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 44/95 (46%), Gaps = 7/95 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ AK + I L + S + I + P+ KAN+ ++ L++ L +S+ Sbjct: 20 INLKVKAGAKINKIIGLYHINNKS---FLYISINTIPENNKANQLIIKFLSQWLETGRSN 76 Query: 64 LRMLSKQSSPLKIIYIDKD----CKEITELLQNND 94 ++++ S LK+I + I L++ + Sbjct: 77 IKIVYGLHSNLKVISVMNTNGNISNLIISKLKSIN 111 >gi|307611594|emb|CBX01276.1| hypothetical protein LPW_29741 [Legionella pneumophila 130b] Length = 70 Score = 67.1 bits (163), Expect = 9e-10, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 29/52 (55%) Query: 28 DTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI 79 + I + A PQ+G+AN +L +++ + K+ + ++ +SS K+I + Sbjct: 4 SDDSLHIALHAKPQEGEANNELLFFISQFFKIPKTQIELIKGKSSRHKLIRL 55 >gi|237858664|ref|NP_001153585.1| hypothetical protein LOC123207 isoform b [Homo sapiens] Length = 167 Score = 67.1 bits (163), Expect = 9e-10, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRMLS 68 S + + Sbjct: 116 SDVVLDK 122 >gi|237858666|ref|NP_001153586.1| hypothetical protein LOC123207 isoform c [Homo sapiens] Length = 153 Score = 66.7 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 18/76 (23%), Positives = 34/76 (44%), Gaps = 7/76 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRMLSKQSSPLKII 77 S + + K LKI+ Sbjct: 116 SDVVLDKKLRDLLKIV 131 >gi|206602120|gb|EDZ38602.1| Conserved hypothetical protein [Leptospirillum sp. Group II '5-way CG'] Length = 102 Score = 66.7 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 39/91 (42%), Gaps = 9/91 (9%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 V +R+ P KK + ++A ++G AN +L L++ LA S Sbjct: 11 RVRIRVRPGKKKDVL--------FLSGEEFSADLSAPAREGAANDRLLRNLSQWLAWPVS 62 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 +R+ Q+S LK I I +EI + L Sbjct: 63 KIRLEKGQASRLKTIVIAGMTGEEIRKRLLA 93 >gi|20093713|ref|NP_613560.1| hypothetical protein MK0273 [Methanopyrus kandleri AV19] gi|29839574|sp|Q8TYM3|Y273_METKA RecName: Full=UPF0235 protein MK0273 gi|19886604|gb|AAM01490.1| Uncharacterized conserved protein [Methanopyrus kandleri AV19] Length = 96 Score = 66.7 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 44/88 (50%), Gaps = 9/88 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VR+ P+A + + ++ +++ V A P KGKAN+ +L L +KL ++ Sbjct: 14 IRVRVNPDADTTDLKGVD-----EWRGVLEVDVAAPPVKGKANRELLEFLGRKLN---TT 65 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELL 90 ++S + S K++ D E+ E L Sbjct: 66 CELVSGEKSREKLVLARDVSVDEVKERL 93 >gi|86136531|ref|ZP_01055110.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193] gi|85827405|gb|EAQ47601.1| hypothetical protein MED193_20449 [Roseobacter sp. MED193] Length = 91 Score = 66.7 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 8/77 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A + I + +K+ VT+ P+ GKA +A+ ++LAK + ++ Sbjct: 20 AKIAVRVTPKAASNSI--------SVSEAGLKVTVTSVPENGKATEAVRSLLAKAMGVAA 71 Query: 62 SSLRMLSKQSSPLKIIY 78 S L + +S K+ Sbjct: 72 SKLDLSQGATSRNKVFV 88 >gi|150401269|ref|YP_001325035.1| hypothetical protein Maeo_0841 [Methanococcus aeolicus Nankai-3] gi|166235100|sp|A6UVA1|Y841_META3 RecName: Full=UPF0235 protein Maeo_0841 gi|150013972|gb|ABR56423.1| protein of unknown function DUF167 [Methanococcus aeolicus Nankai-3] Length = 101 Score = 66.7 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 10/92 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V + NAKK+ I + + IK+ A P GKANK + K Sbjct: 17 IDVEISSNAKKNEIGEIN-----EWRKRLIIKIKALPVDGKANKEIAKFFKKTFG---KD 68 Query: 64 LRMLSKQSSPLKIIYIDKDCKE--ITELLQNN 93 + ++S +S K I + K+ I ++L+NN Sbjct: 69 IIIVSGLTSSQKTICVIGATKDEIIDKILKNN 100 >gi|312084117|ref|XP_003144143.1| hypothetical protein LOAG_08565 [Loa loa] gi|307760694|gb|EFO19928.1| hypothetical protein LOAG_08565 [Loa loa] Length = 129 Score = 66.7 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 19/95 (20%), Positives = 43/95 (45%), Gaps = 10/95 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + PNAK + + + +++ + A P G+AN+A++ + L L K+ Sbjct: 39 LRIHAKPNAKTTRVIDI-------GANEVELAIAAPPHDGQANEALINAMMDILELRKNE 91 Query: 64 LRMLSKQSSPLKIIYI---DKDCKEITELLQNNDS 95 + + S K++ + +E+ E L+ N + Sbjct: 92 ITFDTGARSRSKVLRLMSKRITLEEVREKLERNAT 126 >gi|156039347|ref|XP_001586781.1| predicted protein [Sclerotinia sclerotiorum 1980] gi|154697547|gb|EDN97285.1| predicted protein [Sclerotinia sclerotiorum 1980 UF-70] Length = 128 Score = 66.3 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 23/107 (21%), Positives = 45/107 (42%), Gaps = 15/107 (14%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 ++ + PN S +++ DTS ++I V + +AN+ ++ +L L K Sbjct: 20 IHLRCHVKPNVSASRAGAIKPFTDTSP--ILEICVPEPARDNEANEGVIELLGDVLLCPK 77 Query: 62 SSLRMLSKQSSPLKII---YIDK----------DCKEITELLQNNDS 95 + L ++ + S K+I ++ KE+ E LQ S Sbjct: 78 TDLEVIRGKKSRDKVIAYRSLEGLWINCAIARMTVKELRERLQEASS 124 >gi|21226924|ref|NP_632846.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1] gi|29839564|sp|Q8PYN9|Y822_METMA RecName: Full=UPF0235 protein MM_0822 gi|20905233|gb|AAM30518.1| hypothetical protein MM_0822 [Methanosarcina mazei Go1] Length = 108 Score = 66.3 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 4/78 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P ++ + +++K+T QKGKAN+ ++ LA+ + S Sbjct: 19 VDIEVTPGSRSLSV----PSGYNEWRKRIEVKLTRNAQKGKANEQLIESLAELFGICSSD 74 Query: 64 LRMLSKQSSPLKIIYIDK 81 + + S +S K + I Sbjct: 75 IFISSGATSSKKSLLIKG 92 >gi|73668032|ref|YP_304047.1| hypothetical protein Mbar_A0485 [Methanosarcina barkeri str. Fusaro] gi|72395194|gb|AAZ69467.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro] Length = 100 Score = 66.3 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 38/80 (47%), Gaps = 4/80 (5%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + P ++ + +++K+T QKGKAN+ ++ LA+ ++S Sbjct: 5 VIIDIEVTPGSRSISV----PSGYNEWRKRIEVKLTKNAQKGKANEQLVECLAELFSISS 60 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 S++ + S +S K + + Sbjct: 61 SNILINSGATSSKKSLLLKG 80 >gi|91773909|ref|YP_566601.1| hypothetical protein Mbur_1971 [Methanococcoides burtonii DSM 6242] gi|91712924|gb|ABE52851.1| protein of unknown function DUF167 [Methanococcoides burtonii DSM 6242] Length = 107 Score = 66.3 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 19/91 (20%), Positives = 44/91 (48%), Gaps = 5/91 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P +KK+ ++IK+T+ QKGKAN ++ ++A + + Sbjct: 18 IDLEVTPGSKKACF----PAGYNQWRERIEIKLTSAAQKGKANGQLIEIVADFFNIGQRE 73 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNN 93 + + S S K + I++ D +++ L++ Sbjct: 74 VIIGSGAKSSKKTVIINRPDQEQVVLALESI 104 >gi|294010668|ref|YP_003544128.1| hypothetical protein SJA_C1-06820 [Sphingobium japonicum UT26S] gi|292673998|dbj|BAI95516.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 103 Score = 66.3 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 23/85 (27%), Positives = 44/85 (51%), Gaps = 2/85 (2%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + VRL P A + + + D + +V A P++GKAN A++A+LAK+L + + Sbjct: 13 LSVRLTPGAAREEVGGV--WTDDKGANWLSARVRAVPERGKANAALIALLAKRLDWPRGA 70 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITE 88 + + S ++ LK + I + + Sbjct: 71 ISLESGDANRLKRLRIKGGGEALAS 95 >gi|134045581|ref|YP_001097067.1| hypothetical protein MmarC5_0538 [Methanococcus maripaludis C5] gi|166227318|sp|A4FXC3|Y538_METM5 RecName: Full=UPF0235 protein MmarC5_0538 gi|132663206|gb|ABO34852.1| protein of unknown function DUF167 [Methanococcus maripaludis C5] Length = 101 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + NAKK+ I + ++I++ P +GKANKA++ L + KS Sbjct: 15 IDIEVTTNAKKNEIGKIN-----EWRKRIEIRIKEQPIEGKANKAIMKFLK---GIFKSE 66 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + + S +S K + I DK +I ++L+ Sbjct: 67 ILINSGTTSAQKTVLIPDKTKDDIVKILKKE 97 >gi|169629033|ref|YP_001702682.1| hypothetical protein MAB_1946 [Mycobacterium abscessus ATCC 19977] gi|169241000|emb|CAM62028.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 76 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 39/72 (54%), Gaps = 6/72 (8%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P ++K + +D + + V GKANKA +A+LA+ L + KS++R++ Sbjct: 10 IKPGSRK------GPAVEVADDGALTLFVREPAIDGKANKAAIALLAEYLDVPKSTVRLV 63 Query: 68 SKQSSPLKIIYI 79 + Q+S LK + Sbjct: 64 AGQTSRLKRFSV 75 >gi|29839573|sp|Q8TIP5|Y4097_METAC RecName: Full=UPF0235 protein MA_4097 Length = 109 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 17/80 (21%), Positives = 35/80 (43%), Gaps = 4/80 (5%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P ++ + + +K+T QKGKAN+ ++ LA+ +S S Sbjct: 20 IEIEVTPGSRSLSV----PSGYNEWRKRIAVKLTKNAQKGKANEQLIESLAELFGISSSE 75 Query: 64 LRMLSKQSSPLKIIYIDKDC 83 + + S +S K + I Sbjct: 76 ILINSGATSSKKSLLIKGIS 95 >gi|256810180|ref|YP_003127549.1| protein of unknown function DUF167 [Methanocaldococcus fervens AG86] gi|256793380|gb|ACV24049.1| protein of unknown function DUF167 [Methanocaldococcus fervens AG86] Length = 98 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 9/92 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + AKK I + + IK+ A +GKANK ++ + K Sbjct: 13 VLIDIDVQAGAKKDEITGIN-----EWRKRLSIKIKAPATEGKANKEIIKFFKEIF---K 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK-EITELLQN 92 + +++ + +P K I + K E+ E L+ Sbjct: 65 KDIEIVAGKLNPQKTILVKDIKKDEVIETLKK 96 >gi|86740601|ref|YP_481001.1| hypothetical protein Francci3_1896 [Frankia sp. CcI3] gi|86567463|gb|ABD11272.1| protein of unknown function DUF167 [Frankia sp. CcI3] Length = 97 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 17/95 (17%), Positives = 41/95 (43%), Gaps = 2/95 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +++R+ P A ++ + +D + ++VT G+A +A L L L + + Sbjct: 1 MRLMIRVQPGAGRTAVGG--RREDPLHGPLLIVRVTEPAVDGRATEAALRALCAALRVRR 58 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSL 96 +R++ + +K + ++ + L D L Sbjct: 59 GDVRLVRGATGRVKAVEVEVAAVDEPALRIRIDEL 93 >gi|289192506|ref|YP_003458447.1| protein of unknown function DUF167 [Methanocaldococcus sp. FS406-22] gi|288938956|gb|ADC69711.1| protein of unknown function DUF167 [Methanocaldococcus sp. FS406-22] Length = 98 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 18/84 (21%), Positives = 37/84 (44%), Gaps = 8/84 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + NAKK+ I + + IK+ A +GKANK ++ L K Sbjct: 13 VLIDIDVQANAKKNEIIGIN-----EWRKRLTIKIKAPATEGKANKEIIKFLKDIF---K 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE 85 + +++ + +P K + + K+ Sbjct: 65 KDVEIVAGKLNPQKTVLVKDIKKD 88 >gi|255540269|ref|XP_002511199.1| conserved hypothetical protein [Ricinus communis] gi|223550314|gb|EEF51801.1| conserved hypothetical protein [Ricinus communis] Length = 232 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + ++ K++ ++ +++ E L Sbjct: 196 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKL 226 >gi|268326229|emb|CBH39817.1| conserved hypothetical protein, DUF167 family [uncultured archaeon] Length = 70 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 24/80 (30%), Positives = 42/80 (52%), Gaps = 12/80 (15%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + +++IPN+K I E M I+V P KGKANKA++ +L++ Sbjct: 1 MKRIAIKVIPNSKTEEIIYAEP---------MIIRVKEPPTKGKANKAVVMLLSRYFN-- 49 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + +R++S S KI+ ++ Sbjct: 50 -ADVRIVSGAKSRRKIVEVE 68 >gi|289641324|ref|ZP_06473490.1| protein of unknown function DUF167 [Frankia symbiont of Datisca glomerata] gi|289508922|gb|EFD29855.1| protein of unknown function DUF167 [Frankia symbiont of Datisca glomerata] Length = 122 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 15/119 (12%), Positives = 38/119 (31%), Gaps = 24/119 (20%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIH-----------------------MKIKVTA 38 + +R+ P + ++ + + ++VTA Sbjct: 1 MRLTIRVSPRSARTSVGGSSPASGAGAPGESPAENPTAADDPTNGSPPDAATPLVVRVTA 60 Query: 39 TPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK-EITELLQNNDSL 96 G+A ++ L LA + + + ++ +S KI+ I + + L + Sbjct: 61 PAVDGQATESALRALATAFGVRRREVVLVRGATSRTKIVEISGLPESALRSRLAELTAT 119 >gi|312197091|ref|YP_004017152.1| hypothetical protein FraEuI1c_3270 [Frankia sp. EuI1c] gi|311228427|gb|ADP81282.1| protein of unknown function DUF167 [Frankia sp. EuI1c] Length = 97 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 2/88 (2%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A + + D + ++V A GKA +A L LA L L + Sbjct: 1 MRVTIRVRPGASGTAVGGELGGPDGEP--SLVVRVCARAVDGKATEAALRALADALGLRR 58 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITEL 89 + + ++ +S K++ I + L Sbjct: 59 ADVSLVHGATSRTKLVEIAASPADEPAL 86 >gi|237858668|ref|NP_001153587.1| hypothetical protein LOC123207 isoform d [Homo sapiens] Length = 167 Score = 65.9 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + L + + + A P +G+AN + L+K L L K Sbjct: 63 VTIAIHAKPGSKQNAVTDLTA-------EAVNVAIAAPPSEGEANAELCRYLSKVLELRK 115 Query: 62 SSLRMLS 68 S + + Sbjct: 116 SDVVLDK 122 >gi|30697845|ref|NP_568972.2| unknown protein [Arabidopsis thaliana] gi|26452404|dbj|BAC43287.1| unknown protein [Arabidopsis thaliana] gi|332010365|gb|AED97748.1| uncharacterized protein [Arabidopsis thaliana] Length = 232 Score = 65.5 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + +S K++ ++ +++ E L Sbjct: 196 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 226 >gi|288917345|ref|ZP_06411712.1| protein of unknown function DUF167 [Frankia sp. EUN1f] gi|288351210|gb|EFC85420.1| protein of unknown function DUF167 [Frankia sp. EUN1f] Length = 118 Score = 65.5 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 14/93 (15%), Positives = 40/93 (43%), Gaps = 3/93 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +V +R+ P + ++ + + D + ++V +G+AN+A L +A+ L + + Sbjct: 17 VHVAIRVRPASDRTAVGPV--TADPIHGQLLVVRVREPAVEGRANEAALRAIAQALGVRR 74 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 + + + +K + +D + + Sbjct: 75 ADVTLSR-SIGRVKFVAVDAPDDIVAARVAELA 106 >gi|225456297|ref|XP_002283689.1| PREDICTED: hypothetical protein [Vitis vinifera] gi|147823132|emb|CAN75279.1| hypothetical protein VITISV_030868 [Vitis vinifera] gi|297734405|emb|CBI15652.3| unnamed protein product [Vitis vinifera] Length = 232 Score = 65.5 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + ++ K++ ++ +++ E L Sbjct: 196 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKL 226 >gi|297793925|ref|XP_002864847.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp. lyrata] gi|297310682|gb|EFH41106.1| hypothetical protein ARALYDRAFT_332565 [Arabidopsis lyrata subsp. lyrata] Length = 232 Score = 65.5 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + +S K++ ++ +++ E L Sbjct: 196 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 226 >gi|198282577|ref|YP_002218898.1| hypothetical protein Lferr_0437 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218665107|ref|YP_002424767.1| conserved hypothetical protein TIGR00251, putative [Acidithiobacillus ferrooxidans ATCC 23270] gi|198247098|gb|ACH82691.1| protein of unknown function DUF167 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218517320|gb|ACK77906.1| conserved hypothetical protein TIGR00251, putative [Acidithiobacillus ferrooxidans ATCC 23270] Length = 116 Score = 65.1 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 7/89 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + P AK I +KI++ A P G AN A+ A+L L L + Sbjct: 20 VRIHVQPGAKTDAIGGCH-------GDALKIRLRARPVDGAANAALSALLCATLRLKRQQ 72 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQN 92 + ++ +S+ K++ I + L Sbjct: 73 ITLVQGESARDKVLRISAASTHVQTQLAK 101 >gi|328772106|gb|EGF82145.1| hypothetical protein BATDEDRAFT_86894 [Batrachochytrium dendrobatidis JAM81] Length = 128 Score = 65.1 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 16/73 (21%), Positives = 35/73 (47%), Gaps = 7/73 (9%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRML 67 + P K S + ++ + I++ A ++G+AN ++ +A L L K + ++ Sbjct: 45 VKPGTKVSQVIDIQ-------GDAVGIQIAAVAREGEANAELIQTVADVLKLRKYQVAIV 97 Query: 68 SKQSSPLKIIYID 80 + S K++ ID Sbjct: 98 AGHKSRTKVLKID 110 >gi|326403567|ref|YP_004283649.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301] gi|325050429|dbj|BAJ80767.1| hypothetical protein ACMV_14200 [Acidiphilium multivorum AIU301] Length = 131 Score = 65.1 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 15/83 (18%), Positives = 37/83 (44%), Gaps = 2/83 (2%) Query: 4 VIVRLIPNAKKSGIASL--EIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +++ P A+++ + + +++ V P+ G+AN A+L LA L + Sbjct: 23 VALKVQPGARRARLGPVVPAAAAPGWPPARLRLAVVVPPEDGRANDAVLKALAAWLGVGA 82 Query: 62 SSLRMLSKQSSPLKIIYIDKDCK 84 + L + + + K++ + Sbjct: 83 ARLALRAGGQARDKLVLVVGGSA 105 >gi|224133940|ref|XP_002321697.1| predicted protein [Populus trichocarpa] gi|222868693|gb|EEF05824.1| predicted protein [Populus trichocarpa] Length = 231 Score = 64.8 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 142 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFVGKVLGLK 194 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + ++ K++ ++ +++ E L Sbjct: 195 LSQMTLQRGWNNKSKLLVVEDLSARQVYEKL 225 >gi|9758285|dbj|BAB08809.1| unnamed protein product [Arabidopsis thaliana] Length = 213 Score = 64.8 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 124 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 176 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + +S K++ ++ +++ E L Sbjct: 177 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 207 >gi|251772211|gb|EES52781.1| conserved hypothetical protein [Leptospirillum ferrodiazotrophum] Length = 120 Score = 64.8 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 37/81 (45%), Gaps = 3/81 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +++ + P + L + + ++G+AN+ +L LA+ L +S SS Sbjct: 28 LVIEVKPAQSR---VFLRRGASHGPAPIWVVGIREPAREGQANEGVLRTLAEFLGVSPSS 84 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 + ++S +S +K + + + Sbjct: 85 VALVSGHTSRIKRLSVRGIDE 105 >gi|187251200|ref|YP_001875682.1| hypothetical protein Emin_0790 [Elusimicrobium minutum Pei191] gi|186971360|gb|ACC98345.1| Conserved hypothetical protein DUF167 [Elusimicrobium minutum Pei191] Length = 93 Score = 64.8 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 14/89 (15%), Positives = 36/89 (40%), Gaps = 7/89 (7%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+IP A ++ + + +++KV + +AN + LA+ + Sbjct: 7 MIIKVRVIPTAGENEVV-------SRIGSVLRVKVKTKSIEDEANNIIQYFLAEFFGVEN 59 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELL 90 + + ++ K + I +E + + Sbjct: 60 NYINIVKGAKGKEKTVEIRGKSEEHLKKV 88 >gi|302763307|ref|XP_002965075.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii] gi|300167308|gb|EFJ33913.1| hypothetical protein SELMODRAFT_230469 [Selaginella moellendorffii] Length = 233 Score = 64.8 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 43/93 (46%), Gaps = 8/93 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L +AK L+L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMAKVLSLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + + ++ K++ ++ +++ E L Sbjct: 196 VTQMTLQRGWNNKSKLLVVEDLSVRDVYEKLLA 228 >gi|255628955|gb|ACU14822.1| unknown [Glycine max] Length = 227 Score = 64.8 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQLAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + ++ K++ + Sbjct: 196 LSQMTLQRGWNNKSKLLVVS 215 >gi|300871287|ref|YP_003786160.1| hypothetical protein BP951000_1678 [Brachyspira pilosicoli 95/1000] gi|300688988|gb|ADK31659.1| conserved hypothetical protein [Brachyspira pilosicoli 95/1000] Length = 83 Score = 64.8 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 21/87 (24%), Positives = 42/87 (48%), Gaps = 8/87 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 N+ V++ +AK + + I+++A GKANKA++ L+ +L L K Sbjct: 1 MNIEVKVTASAKSNSF--------KKENGIYYIRISAKAIDGKANKAIIDFLSSELNLKK 52 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITE 88 + +L + S K+I ++ D ++ Sbjct: 53 KDVEILKGEKSSKKLISLNIDEYKLES 79 >gi|302756509|ref|XP_002961678.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii] gi|300170337|gb|EFJ36938.1| hypothetical protein SELMODRAFT_76023 [Selaginella moellendorffii] Length = 233 Score = 64.4 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 43/93 (46%), Gaps = 8/93 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L +AK L+L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMAKVLSLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + + ++ K++ ++ +++ E L Sbjct: 196 ATQMTLQRGWNNKSKLLVVEDLSVRDVYEKLLA 228 >gi|255630115|gb|ACU15411.1| unknown [Glycine max] Length = 225 Score = 64.4 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + + + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQLAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 S + + ++ K++ ++ Sbjct: 196 LSQMTLQRGWNNKSKLLVVE 215 >gi|307199144|gb|EFN79854.1| UPF0235 protein C15orf40-like protein [Harpegnathos saltator] Length = 100 Score = 64.4 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 14/67 (20%), Positives = 28/67 (41%), Gaps = 7/67 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + ++ P AK + I + + ++A P +G+AN ++ LA L K Sbjct: 35 VTIKIQAKPGAKCNNITDITNEGVG-------VAISAPPTEGEANAELVKYLASIFGLRK 87 Query: 62 SSLRMLS 68 S + + Sbjct: 88 SHVSLDR 94 >gi|115471615|ref|NP_001059406.1| Os07g0295200 [Oryza sativa Japonica Group] gi|34394983|dbj|BAC84531.1| unknown protein [Oryza sativa Japonica Group] gi|113610942|dbj|BAF21320.1| Os07g0295200 [Oryza sativa Japonica Group] gi|215765278|dbj|BAG86975.1| unnamed protein product [Oryza sativa Japonica Group] gi|218199457|gb|EEC81884.1| hypothetical protein OsI_25695 [Oryza sativa Indica Group] gi|222636860|gb|EEE66992.1| hypothetical protein OsJ_23901 [Oryza sativa Japonica Group] Length = 232 Score = 64.0 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + ++ K++ ++ +++ E L Sbjct: 196 LSQMTLQRGWNNKSKLLIVEDLSARQVYEKL 226 >gi|126340695|ref|XP_001369778.1| PREDICTED: similar to chromosome 15 open reading frame 40, [Monodelphis domestica] Length = 73 Score = 64.0 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 2/65 (3%) Query: 32 MKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYI--DKDCKEITEL 89 + + + A P +G+AN + L+K L L KS + + S K++ I +EI E Sbjct: 6 VSVAIAAPPSEGEANTELCCYLSKVLELRKSDVILDKGGKSHEKVVPILASTTPEEILEK 65 Query: 90 LQNND 94 + Sbjct: 66 FKMQA 70 >gi|194671931|ref|XP_001789164.1| PREDICTED: C21H15orf40 protein-like [Bos taurus] Length = 234 Score = 64.0 bits (155), Expect = 6e-09, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 34/86 (39%), Gaps = 9/86 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 + +K++ + + + + + P +G+AN + L+K L L S + Sbjct: 107 AIHDKAGSKQNAMTDVTT-------EAVSVGIAGPPIEGEANVELCCCLSKILELRTSDV 159 Query: 65 RMLSKQSSPLKIIYIDK--DCKEITE 88 + S K++ + +EI E Sbjct: 160 VLDKGSKSHEKVVKLLACTPPEEILE 185 >gi|307721935|ref|YP_003893075.1| hypothetical protein Saut_2020 [Sulfurimonas autotrophica DSM 16294] gi|306980028|gb|ADN10063.1| protein of unknown function DUF167 [Sulfurimonas autotrophica DSM 16294] Length = 118 Score = 64.0 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 14/85 (16%), Positives = 33/85 (38%), Gaps = 8/85 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P+AK + I +KI V P GKA ++ LA + + + ++ Sbjct: 42 KPSAKITKIG-------KPFGNQLKISVACAPVNGKATDHLVKFLAHEFDVKIKDIEVVF 94 Query: 69 KQSSPLKIIYIDKDCKEITELLQNN 93 + + K + + + ++ + + Sbjct: 95 GRMNVNKQLRVTR-PNKLPSVFKAK 118 >gi|116784383|gb|ABK23322.1| unknown [Picea sitchensis] gi|224284776|gb|ACN40118.1| unknown [Picea sitchensis] Length = 232 Score = 64.0 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + + + ++ K++ ++ +++ E L Sbjct: 196 LTQMTLQRGWNNKSKLLVVEDLSARDVYEKL 226 >gi|167648715|ref|YP_001686378.1| hypothetical protein Caul_4760 [Caulobacter sp. K31] gi|167351145|gb|ABZ73880.1| protein of unknown function DUF167 [Caulobacter sp. K31] Length = 96 Score = 64.0 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 31/90 (34%), Positives = 46/90 (51%), Gaps = 3/90 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VRL P + I + D ++K++V A P +G AN A+LA LAK L L K Sbjct: 1 MRLAVRLTPRGGREAIDGWAVDGDGRP--YLKVRVAAPPVEGAANAALLAFLAKALGLPK 58 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S+L + S + LK+I I D + +L Sbjct: 59 SALTLASGAGARLKLIEIAGCDPLSLERVL 88 >gi|291615278|ref|YP_003525435.1| hypothetical protein Slit_2823 [Sideroxydans lithotrophicus ES-1] gi|291585390|gb|ADE13048.1| protein of unknown function DUF167 [Sideroxydans lithotrophicus ES-1] Length = 102 Score = 63.6 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 16/86 (18%), Positives = 36/86 (41%), Gaps = 8/86 (9%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P AKK+ I + +++ V P +G+A ++ LA + +++SS+ ++ Sbjct: 19 RPRAKKTKIGKV-------IGNQLEVHVAENPVRGRATAHLVKFLAGEFDVTESSITVVF 71 Query: 69 KQSSPLKIIYIDKDCKEITELLQNND 94 + K + I K + + Sbjct: 72 GVYNVNKQLRIVA-PKRLPAAIAKQQ 96 >gi|145334887|ref|NP_001078789.1| unknown protein [Arabidopsis thaliana] gi|332010366|gb|AED97749.1| uncharacterized protein [Arabidopsis thaliana] Length = 213 Score = 63.2 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 124 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 176 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + +S K++ ++ +++ E L Sbjct: 177 LSQMTLQRGWNSKSKLLVVEDLSARQVYEKL 207 >gi|224119652|ref|XP_002318126.1| predicted protein [Populus trichocarpa] gi|222858799|gb|EEE96346.1| predicted protein [Populus trichocarpa] Length = 231 Score = 63.2 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + L L Sbjct: 142 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGRVLGLR 194 Query: 61 KSSLRMLSKQSSPLKIIYIDKD-CKEITELL 90 S + + ++ K++ ++ +++ E L Sbjct: 195 LSQMTLQRGWNNKSKLLVVEDLYARQVYEKL 225 >gi|297618780|ref|YP_003706885.1| protein of unknown function DUF167 [Methanococcus voltae A3] gi|297377757|gb|ADI35912.1| protein of unknown function DUF167 [Methanococcus voltae A3] Length = 112 Score = 63.2 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 23/94 (24%), Positives = 46/94 (48%), Gaps = 7/94 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V + + NAKK+ I + ++I++ P +GKANKA+L + +L L K+ Sbjct: 24 VNIDVSTNAKKNEIGKIN-----EWRKRLEIRIKQQPVEGKANKAILKFIKSELNL-KTD 77 Query: 64 LRMLSKQSSPLKIIYIDK-DCKEITELLQNNDSL 96 + + + ++ K ++ D I + L +S+ Sbjct: 78 VEIATGSTNSQKTLFFKDLDKNTILKKLNLLNSI 111 >gi|281348884|gb|EFB24468.1| hypothetical protein PANDA_020551 [Ailuropoda melanoleuca] Length = 87 Score = 63.2 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 13/67 (19%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + P +K++ + + + + + A P +G+AN + L+K L L K Sbjct: 26 VTIAIHAKPGSKQNAVTDVTA-------EAVSVAIAAPPSEGEANAELCRYLSKVLELRK 78 Query: 62 SSLRMLS 68 S + + Sbjct: 79 SDVVLDK 85 >gi|167999197|ref|XP_001752304.1| predicted protein [Physcomitrella patens subsp. patens] gi|162696699|gb|EDQ83037.1| predicted protein [Physcomitrella patens subsp. patens] Length = 231 Score = 63.2 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 41/93 (44%), Gaps = 8/93 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 142 LVQVAIEVEDRAQRSQITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 194 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + + ++ K++ ++ +E+ E L Sbjct: 195 LTQMTLQRGWNNKSKLLAVEDLSVREVYEKLLA 227 >gi|150399132|ref|YP_001322899.1| hypothetical protein Mevan_0378 [Methanococcus vannielii SB] gi|166232643|sp|A6UP65|Y378_METVS RecName: Full=UPF0235 protein Mevan_0378 gi|150011835|gb|ABR54287.1| protein of unknown function DUF167 [Methanococcus vannielii SB] Length = 101 Score = 62.8 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + NAKK+ I + +++K+ P +G+ANKA+L L + K Sbjct: 13 VLIDIEVTTNAKKNEIGKIN-----KWRKRLEVKIKEQPIEGRANKAILKFLKEIF---K 64 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKE-ITELLQNN 93 + + + +SP K + I D KE + +L+ Sbjct: 65 TDVELNPVTTSPQKTVLISCDTKEYVVNILKRE 97 >gi|298243513|ref|ZP_06967320.1| protein of unknown function DUF167 [Ktedonobacter racemifer DSM 44963] gi|297556567|gb|EFH90431.1| protein of unknown function DUF167 [Ktedonobacter racemifer DSM 44963] Length = 90 Score = 62.8 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 49/94 (52%), Gaps = 9/94 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V VR+IP + ++ + + +K ++TA P G AN A++A+LA+ L+L K Sbjct: 1 MQVPVRVIPRSNRNTLE--------WEEGAIKARLTAPPVDGAANAALIALLAETLSLPK 52 Query: 62 SSLRMLSKQSSPLKIIYIDK-DCKEITELLQNND 94 ++ ++ + KI+ I+ + EI + L + Sbjct: 53 RAITLIRGTTGRQKIVEIEGLEQVEIIQRLSASS 86 >gi|169600439|ref|XP_001793642.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15] gi|111068664|gb|EAT89784.1| hypothetical protein SNOG_03053 [Phaeosphaeria nodorum SN15] Length = 132 Score = 62.8 bits (152), Expect = 1e-08, Method: Composition-based stats. Identities = 15/77 (19%), Positives = 34/77 (44%), Gaps = 5/77 (6%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P + + H+ + V +G ANK++ +LAK L++ KS Sbjct: 26 ITLFVKPGVSR-----MREGIAAVSDSHVFMNVANYAFEGSANKSVQVLLAKTLSVPKSH 80 Query: 64 LRMLSKQSSPLKIIYID 80 + ++ +S K+ + Sbjct: 81 VSIVKGLTSREKVAEVK 97 >gi|297470564|ref|XP_002684034.1| PREDICTED: C21H15orf40 protein-like, partial [Bos taurus] gi|296491729|gb|DAA33762.1| C21H15orf40 protein-like [Bos taurus] Length = 107 Score = 62.8 bits (152), Expect = 2e-08, Method: Composition-based stats. Identities = 15/86 (17%), Positives = 34/86 (39%), Gaps = 9/86 (10%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 + +K++ + + + + + P +G+AN + L+K L L S + Sbjct: 17 AIHDKAGSKQNAMTDVTT-------EAVSVGIAGPPIEGEANVELCCCLSKILELRTSDV 69 Query: 65 RMLSKQSSPLKIIYIDK--DCKEITE 88 + S K++ + +EI E Sbjct: 70 VLDKGSKSHEKVVKLLACTPPEEILE 95 >gi|322822396|gb|EFZ28456.1| hypothetical protein TCSYLVIO_5305 [Trypanosoma cruzi] Length = 156 Score = 62.4 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 45/141 (31%), Gaps = 55/141 (39%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAM--------LA 54 ++ V P A+ S +A D ++++V A P +GKAN ++ LA Sbjct: 16 HLTVHAKPGARSSSLACHPAVMDA----ALEVRVGAPPVEGKANAELVDFMQMLLEQELA 71 Query: 55 KK------------------------------------LALS-----KSSLRMLSKQSSP 73 + + K + ++S S+ Sbjct: 72 RVRATQQHTPLESNGAVMNCVYGGHSKKDKKKNKPNTKMECPVNYPDKVRVSLVSGASAR 131 Query: 74 LKIIYI--DKDCKEITELLQN 92 K + + +E+ +LQ+ Sbjct: 132 HKTLEVAFPGTQEELISVLQS 152 >gi|145494428|ref|XP_001433208.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124400325|emb|CAK65811.1| unnamed protein product [Paramecium tetraurelia] Length = 99 Score = 62.4 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 39/79 (49%), Gaps = 8/79 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P +K + +K A P AN+ ++ ML++KL++ +SS Sbjct: 20 LQLFVKPKSK--------SEMLEFSEEFIIVKTKAQPIDNAANEDVIRMLSEKLSIDQSS 71 Query: 64 LRMLSKQSSPLKIIYIDKD 82 ++++ Q S K ++I+ + Sbjct: 72 IKIVKGQQSKYKTVFIENE 90 >gi|116780073|gb|ABK21543.1| unknown [Picea sitchensis] Length = 265 Score = 62.1 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 7/80 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYID 80 + + + ++ K++ + Sbjct: 196 LTQMTLQRGWNNKSKLLVVS 215 >gi|269860472|ref|XP_002649957.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348] gi|220066644|gb|EED44119.1| hypothetical cytosolic protein [Enterocytozoon bieneusi H348] Length = 88 Score = 62.1 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 40/76 (52%), Gaps = 7/76 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +++ +++++ I + ++ I++ A P KAN ++A+L+K K + Sbjct: 18 LNIKVKLSSRETAIL-------CQEDDYLIIQIAAPPVDNKANNELIALLSKTYKTKKEN 70 Query: 64 LRMLSKQSSPLKIIYI 79 + ++ ++S KII I Sbjct: 71 ISIIKGKTSTTKIIKI 86 >gi|221106236|ref|XP_002164251.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 1280 Score = 62.1 bits (150), Expect = 3e-08, Method: Composition-based stats. Identities = 14/61 (22%), Positives = 30/61 (49%), Gaps = 7/61 (11%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AK++ + + + I++ A P+ GKAN +L+ L++ + KS Sbjct: 915 LQIYAKPGAKRNKVTDISVEFIG-------IQLAAQPRDGKANDELLSYLSELFNIKKSG 967 Query: 64 L 64 + Sbjct: 968 I 968 >gi|296088594|emb|CBI37585.3| unnamed protein product [Vitis vinifera] Length = 89 Score = 61.7 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 29/56 (51%) Query: 25 DTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 D D + +++ A + G+AN A+L ++ + + + + + S S K++ ++ Sbjct: 16 DYFDDEALGVQIDAPAKDGEANAALLDYISSVVGVKRRQVSISSGSKSRDKVVIVE 71 >gi|168035145|ref|XP_001770071.1| predicted protein [Physcomitrella patens subsp. patens] gi|162678597|gb|EDQ65053.1| predicted protein [Physcomitrella patens subsp. patens] Length = 232 Score = 61.3 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 17/93 (18%), Positives = 41/93 (44%), Gaps = 8/93 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVGIEVEDRAQRSQITRVNADD-------VRVTVAAPAARGEANNELLEYMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELLQN 92 + + + ++ K++ ++ +E+ E L Sbjct: 196 LTQMTLQRGWNNKSKLLVVEDLTVREVYEKLLA 228 >gi|167932988|ref|ZP_02520075.1| hypothetical protein cdivTM7_00062 [candidate division TM7 single-cell isolate TM7b] gi|169836860|ref|ZP_02870048.1| hypothetical protein cdivTM_07094 [candidate division TM7 single-cell isolate TM7a] Length = 59 Score = 60.9 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 13/52 (25%), Positives = 27/52 (51%) Query: 29 TIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 + + A +G+AN A + +LAK ++ S +++L +S K+ +D Sbjct: 5 DGVLTVYTKAPAIEGRANLATVKLLAKYFGVASSKVKLLRGAASKYKVFEVD 56 >gi|158314062|ref|YP_001506570.1| hypothetical protein Franean1_2229 [Frankia sp. EAN1pec] gi|158109467|gb|ABW11664.1| protein of unknown function DUF167 [Frankia sp. EAN1pec] Length = 121 Score = 60.9 bits (147), Expect = 5e-08, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 41/93 (44%), Gaps = 3/93 (3%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V +R+ P A ++ + D + + ++V +G+AN+A L LA+ L + + Sbjct: 20 VRVAIRVRPAADRTAVG--PATSDPTHGRLLVVRVREPAVEGRANEAALRALAQALGVRR 77 Query: 62 SSLRMLSKQSSPLKIIYIDKDCKEITELLQNND 94 S + + +K I +D IT ++ Sbjct: 78 SDVTLTR-SIGRVKFIEVDAPDDVITRRVEELA 109 >gi|212541150|ref|XP_002150730.1| DUF167 domain protein [Penicillium marneffei ATCC 18224] gi|210068029|gb|EEA22121.1| DUF167 domain protein [Penicillium marneffei ATCC 18224] Length = 124 Score = 60.5 bits (146), Expect = 6e-08, Method: Composition-based stats. Identities = 16/78 (20%), Positives = 38/78 (48%), Gaps = 8/78 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P ++ G ++ D + I V + P+KG+AN A++ +L++ L + KS Sbjct: 26 IRCHVTPGSR--GFQGVKKICD----EQVYIHVASAPRKGEANAAVVKVLSEVLGIPKSD 79 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++ K+ ++ Sbjct: 80 ITIM--GKHRDKVGQVNG 95 >gi|312221888|emb|CBY01828.1| similar to DUF167 domain protein [Leptosphaeria maculans] Length = 118 Score = 60.5 bits (146), Expect = 7e-08, Method: Composition-based stats. Identities = 12/78 (15%), Positives = 36/78 (46%), Gaps = 5/78 (6%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 ++ + P S + + + +D +++ V + G+AN A++ ++ + L + Sbjct: 19 HLRCHVKPGVSSSRLGIIAVTED-----AVEVGVAEQAKNGEANDAVVHVICRALHAPRD 73 Query: 63 SLRMLSKQSSPLKIIYID 80 +R++ S +K + + Sbjct: 74 EVRIVRGWKSRVKTVVVS 91 >gi|124028038|ref|YP_001013358.1| hypothetical protein Hbut_1179 [Hyperthermus butylicus DSM 5456] gi|123978732|gb|ABM81013.1| hypothetical protein Hbut_1179 [Hyperthermus butylicus DSM 5456] Length = 108 Score = 60.1 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 39/99 (39%), Gaps = 13/99 (13%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +V + + P A +G+ + + +G+ N +++ ++ L +S Sbjct: 16 VDVTLYVKPEASFTGL--------RMELGELVFYTEELDVEGRVNASIVMFFSRLLGVSP 67 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-----KEITELLQNNDS 95 S + ++ K + I ++I E L+ +++ Sbjct: 68 SMIDIVYGTREKTKRVRIKNVTWNQVFEKIVEALRESEA 106 >gi|145509569|ref|XP_001440723.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124407951|emb|CAK73326.1| unnamed protein product [Paramecium tetraurelia] Length = 99 Score = 59.7 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 39/79 (49%), Gaps = 8/79 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + + P +K + K A P +AN+ ++ ML++KL++ +SS Sbjct: 20 LQLFVKPKSK--------AEMLEFSEEFVIAKTKAQPIDNEANEDVIRMLSEKLSIDQSS 71 Query: 64 LRMLSKQSSPLKIIYIDKD 82 ++++ Q S K ++I+ + Sbjct: 72 IKIVKGQQSKYKTVFIENE 90 >gi|312136610|ref|YP_004003947.1| hypothetical protein Mfer_0383 [Methanothermus fervidus DSM 2088] gi|311224329|gb|ADP77185.1| protein of unknown function DUF167 [Methanothermus fervidus DSM 2088] Length = 97 Score = 59.7 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 20/91 (21%), Positives = 43/91 (47%), Gaps = 9/91 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + +IP++KK GI + S + + V + +KGKANK ++ +K Sbjct: 11 VLLQIHVIPSSKKFGIEKYD-----SWRKRLYVTVKSPARKGKANKEIIEEFSKLFN--- 62 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-KEITELLQ 91 ++++ S K + + K+I E+++ Sbjct: 63 KEVKIVKGIKSRDKTLVVKDVEYKKIMEIIR 93 >gi|71660987|ref|XP_817521.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70882718|gb|EAN95670.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 156 Score = 59.7 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 44/141 (31%), Gaps = 55/141 (39%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAM--------LA 54 ++ V P A+ S +A D +++++ A P +GKAN ++ LA Sbjct: 16 HLTVHAKPGARSSSLACHPAVTDA----ALEVRIGAPPVEGKANAELVDFMQMLLEQELA 71 Query: 55 KK------------------------------------LALS-----KSSLRMLSKQSSP 73 + + K + ++ S+ Sbjct: 72 RVRAAQQHTPVESNGAFMNCVYDGHSKKDKKKNKPNTKMECPVNYPDKVRVSLVGGASAR 131 Query: 74 LKIIYI--DKDCKEITELLQN 92 K + + +E+ +LQ+ Sbjct: 132 HKTLEVAFPGTEEELISVLQS 152 >gi|238060239|ref|ZP_04604948.1| hypothetical protein MCAG_01205 [Micromonospora sp. ATCC 39149] gi|237882050|gb|EEP70878.1| hypothetical protein MCAG_01205 [Micromonospora sp. ATCC 39149] Length = 99 Score = 59.7 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 40/88 (45%), Gaps = 3/88 (3%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V VR+ P A ++ + D + I V A P G+A +A LA L + ++ Sbjct: 7 VAVRVKPGAARARVGGRH---DGPHGPALVIAVNAPPVDGRATEAARRALADALGIRPAA 63 Query: 64 LRMLSKQSSPLKIIYIDKDCKEITELLQ 91 + + + +S K+ +++ + E L+ Sbjct: 64 VALRAGAASRDKLFLVERPTPGLAEALR 91 >gi|218884121|ref|YP_002428503.1| hypothetical protein DKAM_0810 [Desulfurococcus kamchatkensis 1221n] gi|218765737|gb|ACL11136.1| Uncharacterized conserved protein [Desulfurococcus kamchatkensis 1221n] Length = 115 Score = 59.4 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 36/91 (39%), Gaps = 9/91 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + +R+ P T + + +KG+ N A++ LA++L + S Sbjct: 29 LSIRVKP--------GDVEDYITIEGDELVFHTAEQSEKGRENAALVKYLARELKIPVSK 80 Query: 64 LRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + ++ + LK + + D D E+ L Sbjct: 81 IDIVYGRRETLKKVLLNDVDPDELVIKLAKL 111 >gi|71668144|ref|XP_821011.1| hypothetical protein [Trypanosoma cruzi strain CL Brener] gi|70886377|gb|EAN99160.1| hypothetical protein, conserved [Trypanosoma cruzi] Length = 156 Score = 59.0 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 45/141 (31%), Gaps = 55/141 (39%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAM--------LA 54 ++ V P A+ S +A D ++++V A P +GKAN ++ LA Sbjct: 16 HLTVHAKPGARSSSLACHPAVTDA----AIEVRVGAPPVEGKANAELVEFMQMLLEQELA 71 Query: 55 KK------------------------------------LALS-----KSSLRMLSKQSSP 73 + + K + ++S S+ Sbjct: 72 RVRAAQQHTPLESNGAVMNCVYGGHSKKDKRKNKPNTKMECPVNYPEKVRVSLVSGASAR 131 Query: 74 LKIIYI--DKDCKEITELLQN 92 K + + +E+ +LQ+ Sbjct: 132 HKTLEVAFPGTQEELISVLQS 152 >gi|148550788|ref|YP_001260227.1| hypothetical protein Swit_5352 [Sphingomonas wittichii RW1] gi|148503207|gb|ABQ71460.1| protein of unknown function DUF167 [Sphingomonas wittichii RW1] Length = 94 Score = 58.2 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 6/77 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VR+ PNA I + ++++ TATP+ GKAN+A+L +LA L S Sbjct: 23 RLPVRVAPNASADAII------LATADGILQVRTTATPEGGKANEAVLRLLAAALGQPVS 76 Query: 63 SLRMLSKQSSPLKIIYI 79 +L +L + KI+ + Sbjct: 77 ALELLRGSTGRNKIVRV 93 >gi|260576559|ref|ZP_05844548.1| protein of unknown function DUF167 [Rhodobacter sp. SW2] gi|259021282|gb|EEW24589.1| protein of unknown function DUF167 [Rhodobacter sp. SW2] Length = 66 Score = 58.2 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 16/60 (26%), Positives = 32/60 (53%), Gaps = 8/60 (13%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + VR+ P A ++ + + +++ VT P+ GKA + ++A+LAK L ++K Sbjct: 14 ADFAVRVTPKASRNAVV--------VEEGAIRVYVTCVPEDGKATREVVALLAKALGVAK 65 >gi|327289071|ref|XP_003229248.1| PREDICTED: UPF0235 protein C15orf40 homolog isoform 2 [Anolis carolinensis] Length = 104 Score = 58.2 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 13/60 (21%), Positives = 24/60 (40%), Gaps = 7/60 (11%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V P +K++ + L I + A P G+AN + L+K L + + Sbjct: 48 VTIAVHAKPGSKQNAVTDLSAEAVG-------IAIAAPPSDGEANAELCRYLSKVLEVKR 100 >gi|194688626|gb|ACF78397.1| unknown [Zea mays] Length = 232 Score = 58.2 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + + + ++ K++ ++ +++ E L Sbjct: 196 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKL 226 >gi|11499654|ref|NP_070896.1| hypothetical protein AF2072 [Archaeoglobus fulgidus DSM 4304] gi|29839698|sp|O28207|Y2072_ARCFU RecName: Full=UPF0235 protein AF_2072 gi|2648454|gb|AAB89177.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304] Length = 78 Score = 57.8 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 11/77 (14%), Positives = 33/77 (42%), Gaps = 10/77 (12%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + V + P +K+ +++++ + ++GKAN+ +L + + Sbjct: 8 VLISVHVSPGSKE------VSFSYDEWRRAVEVRIKSPAKEGKANRELLGIFRQIFG--- 58 Query: 62 SSLRMLSKQSSPLKIIY 78 + ++S + S K++ Sbjct: 59 -EVELVSGEKSRSKVLK 74 >gi|325959811|ref|YP_004291277.1| hypothetical protein Metbo_2086 [Methanobacterium sp. AL-21] gi|325331243|gb|ADZ10305.1| UPF0235 protein yggU [Methanobacterium sp. AL-21] Length = 100 Score = 57.8 bits (139), Expect = 5e-07, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 41/93 (44%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + + I +IK+ A PQKGKANK ++ AK L+ Sbjct: 14 VLLNIEVGTKSDNFRITGYND-----WRKSFEIKIKAVPQKGKANKEIILEFAK---LTN 65 Query: 62 SSLRMLSKQSSPLKIIYI-DKDCKEITELLQNN 93 + ++S S K + I D + +++ +L++ Sbjct: 66 KRVEIISGHKSHRKTLKIYDINEEDLLKLIEQE 98 >gi|1730921|sp|P52064|YPI4_VIBAL RecName: Full=UPF0235 protein in proC 3'region Length = 54 Score = 57.4 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 20/48 (41%), Gaps = 7/48 (14%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLA 51 + + + P A + I L +KI +TA P GKAN + Sbjct: 14 LKLYIQPKASRDKIVGLH-------GEELKIAITAPPVDGKANAHLTK 54 >gi|325120013|emb|CBZ55566.1| hypothetical protein NCLIV_059910 [Neospora caninum Liverpool] Length = 531 Score = 57.4 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 26/56 (46%), Gaps = 5/56 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLAL 59 + V P AK+S I S+ + +++ A ++G AN+ + LA +L Sbjct: 52 LAVHAKPGAKQSQIPSINEQA-----EQLDVQIDAPAREGAANEELCDFLADACSL 102 >gi|218889133|ref|YP_002437997.1| hypothetical protein PLES_03891 [Pseudomonas aeruginosa LESB58] gi|218769356|emb|CAW25116.1| conserved hypothetical protein [Pseudomonas aeruginosa LESB58] Length = 56 Score = 57.4 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 1/49 (2%) Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 +GKAN +LA L K ++KS + + S + + K + I + + E L Sbjct: 2 EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRI-RHPTRLPEEL 49 >gi|294494893|ref|YP_003541386.1| hypothetical protein Mmah_0207 [Methanohalophilus mahii DSM 5219] gi|292665892|gb|ADE35741.1| protein of unknown function DUF167 [Methanohalophilus mahii DSM 5219] Length = 103 Score = 57.4 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 41/92 (44%), Gaps = 8/92 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 C + + P + K + ++ K+T + QKGKAN ++ L+ ++ Sbjct: 14 CIIDFEINPGSSK----LVVPSGYNIWRKRVEGKLTESAQKGKANDQLIQRLSHIFQINS 69 Query: 62 SSLRMLSKQSSPLKIIYIDK----DCKEITEL 89 SS+ +++ + K ++++ +++ E Sbjct: 70 SSITIVAGAKTTKKSVHLENVYPKTAEDVLEQ 101 >gi|116054122|ref|YP_788565.1| hypothetical protein PA14_05120 [Pseudomonas aeruginosa UCBPP-PA14] gi|115589343|gb|ABJ15358.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14] Length = 56 Score = 57.4 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 1/49 (2%) Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 +GKAN +LA L K ++KS + + S + + K + I + + E L Sbjct: 2 EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRIRR-PTRLPEEL 49 >gi|257077103|ref|ZP_05571464.1| hypothetical protein Faci_08566 [Ferroplasma acidarmanus fer1] Length = 66 Score = 56.7 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 16/55 (29%), Positives = 27/55 (49%) Query: 26 TSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYID 80 S+ +KI +A + KAN ++ LA + ++++S Q S KI ID Sbjct: 11 ESEGDRLKIYTSAPRENNKANYDIMKQLATYYNVEFYKIKLISGQKSRKKIFSID 65 >gi|107099377|ref|ZP_01363295.1| hypothetical protein PaerPA_01000389 [Pseudomonas aeruginosa PACS2] gi|152985685|ref|YP_001345886.1| hypothetical protein PSPA7_0491 [Pseudomonas aeruginosa PA7] gi|254243491|ref|ZP_04936813.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192] gi|296386890|ref|ZP_06876389.1| hypothetical protein PaerPAb_02102 [Pseudomonas aeruginosa PAb1] gi|313111988|ref|ZP_07797775.1| hypothetical protein PA39016_004070010 [Pseudomonas aeruginosa 39016] gi|126196869|gb|EAZ60932.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192] gi|150960843|gb|ABR82868.1| protein VV1_1522 [Pseudomonas aeruginosa PA7] gi|310884277|gb|EFQ42871.1| hypothetical protein PA39016_004070010 [Pseudomonas aeruginosa 39016] Length = 56 Score = 56.7 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 1/49 (2%) Query: 42 KGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELL 90 +GKAN +LA L K ++KS + + S + + K + I + + E L Sbjct: 2 EGKANAHLLAFLGKAFGVAKSLVSLESGELNRQKRVRIRR-PTRLPEEL 49 >gi|189913153|ref|YP_001965041.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913490|ref|YP_001964718.1| hypothetical protein LEPBI_p0042 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777829|gb|ABZ96128.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781558|gb|ABZ99854.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 75 Score = 56.7 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 19/78 (24%), Positives = 40/78 (51%), Gaps = 7/78 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 +++++ I SLE +T +K + P KGKAN+ ++ +L+K ++K Sbjct: 1 MKIVIKVK---SNQKIQSLEFKSETECIAKLK----SLPVKGKANQELVGLLSKHYGVTK 53 Query: 62 SSLRMLSKQSSPLKIIYI 79 ++++S S +K + I Sbjct: 54 KEIQIISGHFSNIKTVEI 71 >gi|303245044|ref|ZP_07331365.1| protein of unknown function DUF167 [Methanothermococcus okinawensis IH1] gi|302484607|gb|EFL47550.1| protein of unknown function DUF167 [Methanothermococcus okinawensis IH1] Length = 101 Score = 56.3 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 22/93 (23%), Positives = 42/93 (45%), Gaps = 9/93 (9%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 + + + PNAKK+ I + + IK+ A P +GKANK ++ + K Sbjct: 15 VLIDIDISPNAKKNEIGGIN-----EWRKRIIIKIKAQPIEGKANKEIIK---FLKKIFK 66 Query: 62 SSLRMLSKQSSPLKIIY-IDKDCKEITELLQNN 93 + ++S +S K + I + +EI ++ Sbjct: 67 KDVEIVSGLTSSQKTVLVIGGNREEIINIITKQ 99 >gi|223944751|gb|ACN26459.1| unknown [Zea mays] Length = 194 Score = 55.9 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + K L L Sbjct: 105 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 157 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + + + ++ K++ ++ +++ E L Sbjct: 158 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKL 188 >gi|283778809|ref|YP_003369564.1| hypothetical protein Psta_1020 [Pirellula staleyi DSM 6068] gi|283437262|gb|ADB15704.1| protein of unknown function DUF167 [Pirellula staleyi DSM 6068] Length = 102 Score = 55.5 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 12/96 (12%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V V+ AKK+ + +KI VT P+KGKAN A+ +LA LA+ + Sbjct: 11 VLVGVKAQAAAKKNSLRGEHA-------GLLKISVTTAPEKGKANDAIADLLAAALAVRR 63 Query: 62 SSLRMLSKQSSPLKIIYIDKDC-----KEITELLQN 92 S++++++ + PLK I ++I L+N Sbjct: 64 SAVQIVAGHTQPLKKFLISGASLDEVREKIARALEN 99 >gi|226529615|ref|NP_001152637.1| hypothetical protein LOC100286278 [Zea mays] gi|195658405|gb|ACG48670.1| uncharacterized ACR, YggU family COG1872 containing protein [Zea mays] Length = 194 Score = 54.7 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 15/91 (16%), Positives = 41/91 (45%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S + + +++ V A +G+AN +L + K L L Sbjct: 105 LVQVAIEVEDRAQRSAVTRVNADD-------VRVTVAALAARGEANSELLEFMGKVLGLR 157 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + + + ++ K++ ++ +++ E L Sbjct: 158 LTQMTLQRGWNNKSKLLIVEDLSARQVYEKL 188 >gi|225620549|ref|YP_002721806.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1] gi|225215368|gb|ACN84102.1| hypothetical protein BHWA1_01632 [Brachyspira hyodysenteriae WA1] Length = 56 Score = 54.7 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 16/55 (29%), Positives = 29/55 (52%) Query: 38 ATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQN 92 A GKANKA++ LA +L + K + +L + + K+I I+ + E+ + Sbjct: 2 AKAIDGKANKAIIDFLADELNIKKRDVEILKGEKNSKKLISININDNELKKYFNK 56 >gi|307108344|gb|EFN56584.1| hypothetical protein CHLNCDRAFT_34979 [Chlorella variabilis] Length = 251 Score = 54.4 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 13/80 (16%), Positives = 34/80 (42%), Gaps = 7/80 (8%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSK 61 V + + K++ + + +++++ + G AN+ +L ML L + Sbjct: 142 TQVALEVDDRGKRALVLRVTA-------DFVRVQLKSGANAGHANEELLEMLRGVLGVRL 194 Query: 62 SSLRMLSKQSSPLKIIYIDK 81 L + +SS K++ ++ Sbjct: 195 GQLSLQRGESSRHKVLLVEG 214 >gi|261327632|emb|CBH10608.1| hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972] gi|289742777|gb|ADD20136.1| hypothetical protein Tb927.4.3080 [Glossina morsitans morsitans] Length = 165 Score = 54.0 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 4/55 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKL 57 +++ P A+ + +A+ D +++++ A P GKAN ++ + L Sbjct: 16 RLMIHAKPGARSTALAAQPQALD----EALEVRLAAPPVDGKANTELVEFMQTLL 66 Score = 39.3 bits (91), Expect = 0.16, Method: Composition-based stats. Identities = 6/36 (16%), Positives = 19/36 (52%), Gaps = 2/36 (5%) Query: 61 KSSLRMLSKQSSPLKIIYID--KDCKEITELLQNND 94 K + ++S +S K++ + +++ +L++ D Sbjct: 128 KVRVSLVSGLTSRNKVLEVTFPGTEEDLLAVLKSAD 163 >gi|317509370|ref|ZP_07966989.1| hypothetical protein HMPREF9336_03361 [Segniliparus rugosus ATCC BAA-974] gi|316252293|gb|EFV11744.1| hypothetical protein HMPREF9336_03361 [Segniliparus rugosus ATCC BAA-974] Length = 79 Score = 53.6 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 16/76 (21%), Positives = 36/76 (47%), Gaps = 6/76 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V V + P +++ + ++ + + V +G+A A +LAK L ++KS Sbjct: 8 VTVTVKPGSRR------GPSVEAAEDGSLTVCVREPAVEGRATAAAAVVLAKHLGVAKSR 61 Query: 64 LRMLSKQSSPLKIIYI 79 + ++S +S +K + Sbjct: 62 VALVSGATSRVKRFAV 77 >gi|72388068|ref|XP_844458.1| hypothetical protein [Trypanosoma brucei TREU927] gi|62359409|gb|AAX79847.1| hypothetical protein, conserved [Trypanosoma brucei] gi|70800991|gb|AAZ10899.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain 927/4 GUTat10.1] Length = 165 Score = 53.6 bits (128), Expect = 8e-06, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 4/55 (7%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKL 57 +++ P A+ + +A+ D +++++ A P GKAN ++ + L Sbjct: 16 RLMIHAKPGARSTALAAQPQALD----EALEVRLAAPPVDGKANTELVEFMQTLL 66 Score = 38.9 bits (90), Expect = 0.20, Method: Composition-based stats. Identities = 6/36 (16%), Positives = 19/36 (52%), Gaps = 2/36 (5%) Query: 61 KSSLRMLSKQSSPLKIIYID--KDCKEITELLQNND 94 K + ++S +S K++ + +++ +L++ D Sbjct: 128 KVRVSLVSGLTSRNKVLEVTFPGTEEDLLAVLKSAD 163 >gi|187251746|ref|YP_001876228.1| hypothetical protein Emin_1341 [Elusimicrobium minutum Pei191] gi|186971906|gb|ACC98891.1| Uncharacterized conserved protein DUF167 [Elusimicrobium minutum Pei191] Length = 71 Score = 53.6 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 17/75 (22%), Positives = 38/75 (50%), Gaps = 7/75 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + V++ + K++ + + +I V A ++G AN+A+ +LAK++ + Sbjct: 3 IKVKVHADEKQNKLI-------KKNEDTFEIWVKAPAERGLANEAVREILAKEIGVGVKK 55 Query: 64 LRMLSKQSSPLKIIY 78 +R++ +SP KI Sbjct: 56 IRLIKGATSPSKIFE 70 >gi|326386210|ref|ZP_08207834.1| hypothetical protein Y88_2102 [Novosphingobium nitrogenifigens DSM 19370] gi|326209435|gb|EGD60228.1| hypothetical protein Y88_2102 [Novosphingobium nitrogenifigens DSM 19370] Length = 95 Score = 53.2 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 41/77 (53%), Gaps = 8/77 (10%) Query: 3 NVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKS 62 + VR+ P A++ +A + + +KV A P+ GKA A+L+++A L ++ S Sbjct: 23 RLAVRVTPGAREETVAIV--------DGRVLVKVRAKPEDGKATTAVLSLVAAALGVAAS 74 Query: 63 SLRMLSKQSSPLKIIYI 79 + +L +S K++ + Sbjct: 75 RVELLRGATSREKLLRL 91 >gi|242799550|ref|XP_002483404.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500] gi|218716749|gb|EED16170.1| DUF167 domain protein [Talaromyces stipitatus ATCC 10500] Length = 124 Score = 52.8 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 35/78 (44%), Gaps = 8/78 (10%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P ++ G ++ + + V + P+KG+AN A+ +L++ L KS Sbjct: 26 IRCHVTPGSR--GFQGIKQI----HNEQVYVHVGSEPRKGEANTAVARVLSEVLGFPKSD 79 Query: 64 LRMLSKQSSPLKIIYIDK 81 + ++ +KI + Sbjct: 80 VIIV--GKQRVKIGQVTG 95 >gi|326517290|dbj|BAK00012.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326529833|dbj|BAK08196.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 232 Score = 51.7 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 15/91 (16%), Positives = 36/91 (39%), Gaps = 8/91 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +G+AN +L + K L L Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADDVRVAVAAPA-------ARGEANNELLEFMGKVLGLR 195 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 S + + ++ K++ ++ +++ E L Sbjct: 196 LSQMTLQRGWNNKSKLLIVEDLSARQVYEKL 226 >gi|281212114|gb|EFA86275.1| hypothetical protein PPL_00837 [Polysphondylium pallidum PN500] Length = 83 Score = 51.7 bits (123), Expect = 4e-05, Method: Composition-based stats. Identities = 14/56 (25%), Positives = 33/56 (58%), Gaps = 5/56 (8%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + + V + PNAK+S + S+ D+ + I+++ P +G+AN+ ++ L+++ Sbjct: 9 IVKLKVNVHPNAKQSSVVSVNELADS-----VDIRISQPPTEGRANEEVIEYLSEQ 59 >gi|148260376|ref|YP_001234503.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5] gi|146402057|gb|ABQ30584.1| hypothetical protein Acry_1373 [Acidiphilium cryptum JF-5] Length = 75 Score = 51.3 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 23/47 (48%) Query: 39 TPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKE 85 P+ G+AN A+L LA L + + L + + + K++ + + Sbjct: 4 PPEDGRANDAVLKALAAWLGIGAARLALRAGGQARDKLVLVAGGSAD 50 >gi|308456125|ref|XP_003090529.1| hypothetical protein CRE_03518 [Caenorhabditis remanei] gi|308262652|gb|EFP06605.1| hypothetical protein CRE_03518 [Caenorhabditis remanei] Length = 102 Score = 50.9 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 16/74 (21%), Positives = 32/74 (43%), Gaps = 9/74 (12%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P AKKS + ++ + + + A P++G AN+ +++ L L L K+ Sbjct: 34 LRIHAKPGAKKSCVVAI-------GESEIDVSIGAAPREGAANEELISYLMAALGLRKNE 86 Query: 64 LRMLS--KQSSPLK 75 L+ K Sbjct: 87 LQFDKVLGLMGNDK 100 >gi|183982913|ref|YP_001851204.1| hypothetical protein MMAR_2910 [Mycobacterium marinum M] gi|226701561|sp|B2HE69|Y2910_MYCMM RecName: Full=UPF0235 protein MMAR_2910 gi|183176239|gb|ACC41349.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 76 Score = 48.2 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 6/78 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 V+VR+ P ++K + +T + I V GKAN+A +LA L L +S Sbjct: 5 VVVRVKPGSRKGPLV------ETGSDAELTIYVRERAVDGKANEAAARLLAAHLQLPRSR 58 Query: 64 LRMLSKQSSPLKIIYIDK 81 + +++ +S LK +++ Sbjct: 59 VELVAGATSRLKRFRVER 76 >gi|322494953|emb|CBZ30256.1| conserved hypothetical protein [Leishmania mexicana MHOM/GT/2001/U1103] Length = 204 Score = 47.0 bits (111), Expect = 8e-04, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 4/53 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + V P+A+ S A+ P T +++ A P G+AN +L L + Sbjct: 25 LTVHAKPSARASAFAAPVTPTLTEAD----LRIAAPPVDGQANAELLRYLGEL 73 Score = 34.7 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 6/34 (17%), Positives = 14/34 (41%), Gaps = 2/34 (5%) Query: 63 SLRMLSKQSSPLKIIYI--DKDCKEITELLQNND 94 + ++ +S K + I E+T +L+ Sbjct: 169 EVSLVRGGTSREKTVLIVFPGTRAELTAVLEKVS 202 >gi|302418742|ref|XP_003007202.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102] gi|261354804|gb|EEY17232.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102] Length = 116 Score = 46.7 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 26/53 (49%), Gaps = 5/53 (9%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + + P A K+ ++I V A P++G+ANKA+L +L++ Sbjct: 24 LHCYVKPGAAKAR-----EGVTGLTEDAIEICVAAQPREGQANKAVLRLLSEA 71 >gi|146098877|ref|XP_001468495.1| hypothetical protein [Leishmania infantum] gi|134072863|emb|CAM71579.1| conserved hypothetical protein [Leishmania infantum JPCM5] Length = 204 Score = 46.7 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + V P+A+ S A+ P T +++ A P +G+AN +L L + Sbjct: 25 LTVHAKPSARASAFAAPLTPALTEAD----LRIAAPPVEGQANAELLRYLGEL 73 >gi|322502521|emb|CBZ37604.1| unnamed protein product [Leishmania donovani BPK282A1] Length = 204 Score = 46.7 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 4/53 (7%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + V P+A+ S A+ P T +++ A P +G+AN +L L + Sbjct: 25 LTVHAKPSARASAFAAPLTPALTEAD----LRIAAPPVEGQANAELLRYLGEL 73 >gi|255634374|gb|ACU17552.1| unknown [Glycine max] Length = 77 Score = 46.3 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 8/51 (15%), Positives = 22/51 (43%), Gaps = 7/51 (13%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLA 54 + + P +K + + + +++ A + G+AN A+L ++ Sbjct: 20 ITIHAKPGSKSASVTDISDEAVG-------VQIDAPARDGEANAALLDYIS 63 >gi|301774793|ref|XP_002922805.1| PREDICTED: LOW QUALITY PROTEIN: UPF0235 protein C15orf40-like [Ailuropoda melanoleuca] Length = 129 Score = 46.3 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 37/94 (39%), Gaps = 4/94 (4%) Query: 2 CNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQ-KGKANKAMLAMLAKKLALS 60 + + P +K++ + + A Q +G+AN + L+K L L Sbjct: 36 VTIAIHAKPGSKQNATTDVTAKVVSVAITAPPPTPPAPRQSEGEANAELSWCLSKVLELR 95 Query: 61 KS-SLRMLSKQSSPLKIIYI--DKDCKEITELLQ 91 KS + + S K++ + +EI E ++ Sbjct: 96 KSDDVILDKGGXSHEKVVKLLASTTAEEILEKVK 129 >gi|157875650|ref|XP_001686209.1| hypothetical protein [Leishmania major strain Friedlin] gi|68129283|emb|CAJ07823.1| conserved hypothetical protein [Leishmania major strain Friedlin] Length = 204 Score = 46.3 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 4/50 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAML 53 + V P+A+ S A+ P T +++ A P +G+AN +L L Sbjct: 25 LTVHAKPSARASAFAAPLTPVLTEAD----LRIAAPPVEGQANAELLRYL 70 Score = 34.3 bits (78), Expect = 5.1, Method: Composition-based stats. Identities = 4/34 (11%), Positives = 14/34 (41%), Gaps = 2/34 (5%) Query: 63 SLRMLSKQSSPLKIIYI--DKDCKEITELLQNND 94 + ++ +S K + + ++T +L+ Sbjct: 169 EVSLVRGGTSREKTVLVMFPGTRAQLTAVLEKES 202 >gi|197106855|ref|YP_002132232.1| hypothetical protein PHZ_c3394 [Phenylobacterium zucineum HLK1] gi|196480275|gb|ACG79803.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 86 Score = 45.9 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 42/76 (55%), Gaps = 2/76 (2%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P + I D + +K++V+A P G AN A++A+LAK L + KS++R+ + Sbjct: 2 TPKGGRDAIDGW--GADEAGRPVLKVRVSAAPADGAANAAVVALLAKALKVPKSAVRIAA 59 Query: 69 KQSSPLKIIYIDKDCK 84 +++ +K + ID + Sbjct: 60 GETARIKRLEIDGASE 75 >gi|255089425|ref|XP_002506634.1| predicted protein [Micromonas sp. RCC299] gi|226521907|gb|ACO67892.1| predicted protein [Micromonas sp. RCC299] Length = 319 Score = 45.5 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 15/91 (16%), Positives = 36/91 (39%), Gaps = 10/91 (10%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 M + + + A+ ++ + + + +T P G+ + +L L K L L Sbjct: 141 MIQIAIDVEDKARTKAVSKITADEVG-------VALT-LPV-GQCDDELLEFLGKVLHLR 191 Query: 61 KSSLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + +L S+ K++ + ++ E L Sbjct: 192 LPQMSLLRGWSTRSKLLMVQGLSATQVYERL 222 >gi|320590983|gb|EFX03422.1| dash complex subunit [Grosmannia clavigera kw1407] Length = 312 Score = 45.5 bits (107), Expect = 0.003, Method: Composition-based stats. Identities = 12/81 (14%), Positives = 23/81 (28%), Gaps = 25/81 (30%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 + + P A K + ++I V L + KSS Sbjct: 229 LQCHIKPGASK-----IRQGVTAVTDDAIEICV--------------------LDVPKSS 263 Query: 64 LRMLSKQSSPLKIIYIDKDCK 84 L++ S K + + + Sbjct: 264 LQITRGLKSRDKTVAVAGLSE 284 >gi|111223913|ref|YP_714707.1| hypothetical protein FRAAL4520 [Frankia alni ACN14a] gi|111151445|emb|CAJ63162.1| hypothetical protein FRAAL4520 [Frankia alni ACN14a] Length = 93 Score = 45.1 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 17/75 (22%), Positives = 31/75 (41%) Query: 22 IPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK 81 D + ++VT G+A +A L LA L L ++ +R++ +S +K + Sbjct: 4 RWTDPRAGSIVIVRVTERAVDGRATEAALRALAGALGLRRTQVRLVRGATSRVKTFELTT 63 Query: 82 DCKEITELLQNNDSL 96 + L D L Sbjct: 64 PAADEPALRARLDRL 78 >gi|322504678|emb|CBZ14505.1| conserved hypothetical protein [Leishmania braziliensis MHOM/BR/75/M2904] Length = 193 Score = 44.7 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 4/50 (8%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAML 53 + V P A+ S A+ P T +++ A P +G+AN +L L Sbjct: 15 LRVYAKPGARASAFAAPLTPSLTEAD----LRIAAAPVEGQANAELLRYL 60 Score = 33.6 bits (76), Expect = 9.2, Method: Composition-based stats. Identities = 3/34 (8%), Positives = 13/34 (38%), Gaps = 2/34 (5%) Query: 63 SLRMLSKQSSPLKIIYI--DKDCKEITELLQNND 94 + ++ +S K + + ++ +L+ Sbjct: 158 EVSLVRGGTSREKTLLVVFPGTRAQLAAILEKES 191 >gi|7770327|gb|AAF69697.1|AC016041_2 F27J15.6 [Arabidopsis thaliana] Length = 91 Score = 43.6 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 22/53 (41%), Gaps = 7/53 (13%) Query: 4 VIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + + P +K + I + +++ A + G+AN A+L ++ Sbjct: 39 ITIHAKPGSKAASITDVSDEAVG-------VQIDAPARDGEANAALLEYMSSV 84 >gi|30697842|ref|NP_851256.1| unknown protein [Arabidopsis thaliana] gi|16226478|gb|AAL16178.1|AF428410_1 AT5g63440/MLE2_7 [Arabidopsis thaliana] gi|22137224|gb|AAM91457.1| AT5g63440/MLE2_7 [Arabidopsis thaliana] gi|332010364|gb|AED97747.1| uncharacterized protein [Arabidopsis thaliana] Length = 205 Score = 43.2 bits (101), Expect = 0.011, Method: Composition-based stats. Identities = 11/64 (17%), Positives = 27/64 (42%), Gaps = 10/64 (15%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 + V + + A++S I + +++ V A +G+AN +L + + + Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAAPAARGEANNELLEFMGR---VP 192 Query: 61 KSSL 64 S+ Sbjct: 193 NQSV 196 >gi|303284124|ref|XP_003061353.1| predicted protein [Micromonas pusilla CCMP1545] gi|226457704|gb|EEH55003.1| predicted protein [Micromonas pusilla CCMP1545] Length = 224 Score = 42.0 bits (98), Expect = 0.025, Method: Composition-based stats. Identities = 17/89 (19%), Positives = 33/89 (37%), Gaps = 6/89 (6%) Query: 8 LIPNAKKSGIASLEIPKDTSDTIHMKIKVTAT--PQKGK---ANKAMLAMLAKKLALSKS 62 + P + +LEI KI K AN ++ L K L L Sbjct: 136 IQPKTPGTVQIALEIEDRKKWRAITKITADEVGVAVNAKCAVANDEIVEFLGKTLHLRLP 195 Query: 63 SLRMLSKQSSPLKIIYIDK-DCKEITELL 90 + +L+ S+ K++ + +++ + L Sbjct: 196 QMSLLAGWSARSKLLVVQGLTPQQVYDRL 224 >gi|212723588|ref|NP_001132046.1| hypothetical protein LOC100193457 [Zea mays] gi|194693286|gb|ACF80727.1| unknown [Zea mays] Length = 213 Score = 39.7 bits (92), Expect = 0.13, Method: Composition-based stats. Identities = 11/56 (19%), Positives = 24/56 (42%), Gaps = 7/56 (12%) Query: 1 MCNVIVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKK 56 + V + + A++S I + +++ V A +G+AN +L + K Sbjct: 143 LVQVAIEVEDRAQRSAITRVNADD-------VRVTVAALAARGEANSELLEFMGKV 191 >gi|315122788|ref|YP_004063277.1| hypothetical protein CKC_05215 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496190|gb|ADR52789.1| hypothetical protein CKC_05215 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 50 Score = 38.9 bits (90), Expect = 0.21, Method: Composition-based stats. Identities = 33/50 (66%), Positives = 43/50 (86%) Query: 49 MLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCKEITELLQNNDSLTL 98 ML +LA++L+L+KSSL+MLSK SSP+K IYIDKDCKEI EL + N+ +TL Sbjct: 1 MLTILAERLSLNKSSLKMLSKHSSPIKKIYIDKDCKEIIELFKRNNPVTL 50 >gi|302833257|ref|XP_002948192.1| hypothetical protein VOLCADRAFT_103816 [Volvox carteri f. nagariensis] gi|300266412|gb|EFJ50599.1| hypothetical protein VOLCADRAFT_103816 [Volvox carteri f. nagariensis] Length = 243 Score = 38.6 bits (89), Expect = 0.26, Method: Composition-based stats. Identities = 13/64 (20%), Positives = 30/64 (46%), Gaps = 1/64 (1%) Query: 30 IHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEITE 88 +++ VT ++ + + AK L + S L + +SS +I+ ++ ++I E Sbjct: 162 DVVRVHVTGLMANDAVHEELFDLFAKVLNVRLSQLDIRRAKSSRNRIMTVESLTPEQIYE 221 Query: 89 LLQN 92 L+ Sbjct: 222 RLRE 225 >gi|304312491|ref|YP_003812089.1| tRNA pseudouridine synthase D [gamma proteobacterium HdN1] gi|301798224|emb|CBL46446.1| tRNA pseudouridine synthase D [gamma proteobacterium HdN1] Length = 352 Score = 37.0 bits (85), Expect = 0.78, Method: Composition-based stats. Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 7/60 (11%) Query: 15 SGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPL 74 + E H+ ++V A +G+ ++ LA+ + +SS+ S Q Sbjct: 32 DEVLGFEPDG---SGEHLCVQVWA---RGQNTAWLVRQLAQWANVPRSSVSF-SGQKDRH 84 >gi|159465355|ref|XP_001690888.1| predicted protein [Chlamydomonas reinhardtii] gi|158279574|gb|EDP05334.1| predicted protein [Chlamydomonas reinhardtii] Length = 243 Score = 37.0 bits (85), Expect = 0.84, Method: Composition-based stats. Identities = 9/65 (13%), Positives = 30/65 (46%), Gaps = 1/65 (1%) Query: 30 IHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLSKQSSPLKIIYIDK-DCKEITE 88 +++ VT ++ + +++K L + S L + + + +I+ ++ +++ E Sbjct: 162 DVVRVHVTGLMAGDAVHEELFDLISKVLNVRLSQLDIRRAKHNRNRIMTVEGLTPEQVFE 221 Query: 89 LLQNN 93 L+ Sbjct: 222 RLREQ 226 >gi|254522602|ref|ZP_05134657.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] gi|219720193|gb|EED38718.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] Length = 1475 Score = 36.6 bits (84), Expect = 1.1, Method: Composition-based stats. Identities = 10/55 (18%), Positives = 24/55 (43%), Gaps = 2/55 (3%) Query: 11 NAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 A + + + +D + +I T +GKA+ + +L+ +L + K + Sbjct: 974 KASVTAVERASVTRDEALGE--RINTTNAALEGKASTGSVQLLSSELGVQKGRID 1026 >gi|254523366|ref|ZP_05135421.1| p22 [Stenotrophomonas sp. SKA14] gi|219720957|gb|EED39482.1| p22 [Stenotrophomonas sp. SKA14] Length = 1651 Score = 36.6 bits (84), Expect = 1.3, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 25/54 (46%), Gaps = 2/54 (3%) Query: 12 AKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 A + + + +D++ +I T +GKA+ + +L+ +L + K + Sbjct: 1151 ASVNAVEQASVSRDSALGE--RINTTNAALEGKASTGSVQLLSSELGVQKGRID 1202 >gi|254522586|ref|ZP_05134641.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] gi|219720177|gb|EED38702.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] Length = 1671 Score = 36.6 bits (84), Expect = 1.3, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 25/54 (46%), Gaps = 2/54 (3%) Query: 12 AKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 A + + + +D++ +I T +GKA+ + +L+ +L + K + Sbjct: 1171 ASVNAVEQASVSRDSALGE--RINTTNAALEGKASTGSVQLLSSELGVQKGRID 1222 >gi|254521915|ref|ZP_05133970.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] gi|219719506|gb|EED38031.1| Carbohydrate binding domain protein [Stenotrophomonas sp. SKA14] Length = 1553 Score = 36.2 bits (83), Expect = 1.5, Method: Composition-based stats. Identities = 9/54 (16%), Positives = 23/54 (42%), Gaps = 2/54 (3%) Query: 12 AKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLR 65 A + + + +D + + T +GKA+ + +L+ +L + K + Sbjct: 1076 ASVNAVDRASVSRDAALGERL--NTTDAALEGKASTGSVQLLSSELGVQKGRID 1127 >gi|255582001|ref|XP_002531798.1| conserved hypothetical protein [Ricinus communis] gi|223528564|gb|EEF30586.1| conserved hypothetical protein [Ricinus communis] Length = 176 Score = 35.9 bits (82), Expect = 1.8, Method: Composition-based stats. Identities = 4/24 (16%), Positives = 11/24 (45%) Query: 56 KLALSKSSLRMLSKQSSPLKIIYI 79 L + + + + S S K++ + Sbjct: 122 VLGVKRRQVSIRSGSKSRDKVVIV 145 >gi|296535729|ref|ZP_06897896.1| carbon-monoxide dehydrogenase large subunit [Roseomonas cervicalis ATCC 49957] gi|296263942|gb|EFH10400.1| carbon-monoxide dehydrogenase large subunit [Roseomonas cervicalis ATCC 49957] Length = 813 Score = 35.5 bits (81), Expect = 2.4, Method: Composition-based stats. Identities = 7/69 (10%), Positives = 20/69 (28%), Gaps = 18/69 (26%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 +R+ P S +T +++++L + S + Sbjct: 505 NIRVHPTGSISVFTGTHSHGQGHETTF------------------AQLVSEQLGVPLSQV 546 Query: 65 RMLSKQSSP 73 ++ +S Sbjct: 547 EIVHGDTSK 555 >gi|103487614|ref|YP_617175.1| hypothetical protein Sala_2133 [Sphingopyxis alaskensis RB2256] gi|98977691|gb|ABF53842.1| protein of unknown function DUF167 [Sphingopyxis alaskensis RB2256] Length = 67 Score = 35.5 bits (81), Expect = 2.7, Method: Composition-based stats. Identities = 18/74 (24%), Positives = 34/74 (45%), Gaps = 8/74 (10%) Query: 9 IPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSLRMLS 68 P A+ + + + ++VT P G AN A+L +LA L + L ++ Sbjct: 2 TPGARSEAV--------RIEGGTVHLRVTVPPADGAANAAVLRLLAAALDVPPRDLTLIR 53 Query: 69 KQSSPLKIIYIDKD 82 S+ +K+I I ++ Sbjct: 54 GASARIKLIGIARN 67 >gi|219850376|ref|YP_002464809.1| Xanthine dehydrogenase [Chloroflexus aggregans DSM 9485] gi|219544635|gb|ACL26373.1| Xanthine dehydrogenase [Chloroflexus aggregans DSM 9485] Length = 756 Score = 35.5 bits (81), Expect = 2.8, Method: Composition-based stats. Identities = 8/71 (11%), Positives = 23/71 (32%), Gaps = 9/71 (12%) Query: 9 IPNAKKS-GIASLEIPKDTSD-------TIHMKIKVTATPQKGKANKAMLAMLAKKLALS 60 P + I + +++ V A G N + + + A+ L + Sbjct: 445 TPGSGIGLAIGGWPCGMSPAAAVCRVDTDGTVRVHVGAVDISG-VNSSFVLVAAEILGVP 503 Query: 61 KSSLRMLSKQS 71 + +++ + Sbjct: 504 PEQVEIVAGDT 514 >gi|269837428|ref|YP_003319656.1| aldehyde oxidase and xanthine dehydrogenase molybdopterin binding protein [Sphaerobacter thermophilus DSM 20745] gi|269786691|gb|ACZ38834.1| aldehyde oxidase and xanthine dehydrogenase molybdopterin binding protein [Sphaerobacter thermophilus DSM 20745] Length = 785 Score = 35.1 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 8/69 (11%), Positives = 20/69 (28%), Gaps = 18/69 (26%) Query: 5 IVRLIPNAKKSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSSL 64 VR+ P+ K S +T ++A++L + + Sbjct: 484 TVRVHPSGKVSVFTGSNPHGQGEETTF------------------AQLVAEELGVPLDDI 525 Query: 65 RMLSKQSSP 73 ++ + Sbjct: 526 EIVHGDTGR 534 >gi|293607793|ref|ZP_06690123.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292813808|gb|EFF72959.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 383 Score = 35.1 bits (80), Expect = 3.0, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 10/79 (12%) Query: 6 VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +R+ P + K+G+A +E+ I V GKA + +LA+ Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIDV-----DGKATDDVAKLLARLDDRKVGD 360 Query: 64 LRMLSKQ---SSPLKIIYI 79 + +LS + S + + Sbjct: 361 VVVLSVERAGKSREVRVEL 379 >gi|241662231|ref|YP_002980591.1| 2-alkenal reductase [Ralstonia pickettii 12D] gi|240864258|gb|ACS61919.1| 2-alkenal reductase [Ralstonia pickettii 12D] Length = 383 Score = 35.1 bits (80), Expect = 3.4, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 10/79 (12%) Query: 6 VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +R+ P + K+G+A +E+ I V GKA + +LA+ Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIDV-----DGKATDDVAKLLARLDDRKVGD 360 Query: 64 LRMLSKQ---SSPLKIIYI 79 + +LS + S + + Sbjct: 361 VVVLSVERAGKSREVRVEL 379 >gi|299136787|ref|ZP_07029970.1| isoleucyl-tRNA synthetase [Acidobacterium sp. MP5ACTX8] gi|298601302|gb|EFI57457.1| isoleucyl-tRNA synthetase [Acidobacterium sp. MP5ACTX8] Length = 959 Score = 34.7 bits (79), Expect = 3.9, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 28/61 (45%), Gaps = 5/61 (8%) Query: 29 TIHMKIKVTATPQKGKANKAML----AMLAKKLALSKSSLRMLSKQSSPLKIIYIDKDCK 84 ++ + Q+G A+ +L L + +S++S++++ ++ K+I I Sbjct: 856 GKSLEASIQIMAQEGSADAILLVKYEEALPEFFNVSQASVQVV-GATNEDKVILIRATVA 914 Query: 85 E 85 E Sbjct: 915 E 915 >gi|161525478|ref|YP_001580490.1| 2-alkenal reductase [Burkholderia multivorans ATCC 17616] gi|189349793|ref|YP_001945421.1| putative trypsin-like serine protease [Burkholderia multivorans ATCC 17616] gi|160342907|gb|ABX15993.1| 2-alkenal reductase [Burkholderia multivorans ATCC 17616] gi|189333815|dbj|BAG42885.1| putative trypsin-like serine protease [Burkholderia multivorans ATCC 17616] Length = 383 Score = 34.7 bits (79), Expect = 4.4, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 10/79 (12%) Query: 6 VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +R+ P + K+G+A +E+ I V GKA + +LA+ Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIDV-----DGKATDDVAKLLARLDDRKVGD 360 Query: 64 LRMLSKQ---SSPLKIIYI 79 + +LS + S + + Sbjct: 361 VVVLSVERAGKSREVRVEL 379 >gi|119476064|ref|ZP_01616416.1| putative coenzyme F390 synthetase [marine gamma proteobacterium HTCC2143] gi|119450691|gb|EAW31925.1| putative coenzyme F390 synthetase [marine gamma proteobacterium HTCC2143] Length = 452 Score = 34.3 bits (78), Expect = 5.6, Method: Composition-based stats. Identities = 15/100 (15%), Positives = 34/100 (34%), Gaps = 23/100 (23%) Query: 6 VRLIPNAKKSGIASLEIPKDTSDTIHMKIK--VTATPQK----------GKANKAMLAML 53 V + PNA + + + I + V A P + G A+ A+ L Sbjct: 352 VNVFPNAVRDIVNKHSAQTTGNIRIMLPEPGPVAAPPIRVLVETHQRLQGAADVALCKEL 411 Query: 54 AKKLA---LSKSSLRMLS--------KQSSPLKIIYIDKD 82 ++ + ++ + + + + K++ I D Sbjct: 412 SELVHHHLRFRAKIELQAEGVFQVQTGATGKSKLVEIIGD 451 >gi|78358459|ref|YP_389908.1| DegP2 peptidase [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78220864|gb|ABB40213.1| DegP2 peptidase, Serine peptidase, MEROPS family S01B [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 383 Score = 33.6 bits (76), Expect = 10.0, Method: Composition-based stats. Identities = 15/78 (19%), Positives = 32/78 (41%), Gaps = 8/78 (10%) Query: 6 VRLIPNAK--KSGIASLEIPKDTSDTIHMKIKVTATPQKGKANKAMLAMLAKKLALSKSS 63 +R+ P + K+G+A +E+ I V GKA + +LA+ Sbjct: 306 LRVTPGSAAHKAGLAGVEVTPQGIVPGDRIIGV-----DGKATDNVAKLLARLDDRKVGD 360 Query: 64 LRMLSKQS-SPLKIIYID 80 + +LS + + + ++ Sbjct: 361 VVVLSVERAGKTREVRVE 378 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.309 0.168 0.486 Lambda K H 0.267 0.0520 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,268,908,179 Number of Sequences: 13984884 Number of extensions: 68017790 Number of successful extensions: 147259 Number of sequences better than 10.0: 860 Number of HSP's better than 10.0 without gapping: 832 Number of HSP's successfully gapped in prelim test: 28 Number of HSP's that attempted gapping in prelim test: 145640 Number of HSP's gapped (non-prelim): 878 length of query: 98 length of database: 4,792,584,752 effective HSP length: 67 effective length of query: 31 effective length of database: 3,855,597,524 effective search space: 119523523244 effective search space used: 119523523244 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.8 bits) S2: 76 (33.5 bits)