BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780542|ref|YP_003064955.1| hypothetical protein CLIBASIA_02145 [Candidatus Liberibacter asiaticus str. psy62] (210 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780542|ref|YP_003064955.1| hypothetical protein CLIBASIA_02145 [Candidatus Liberibacter asiaticus str. psy62] gi|254040219|gb|ACT57015.1| hypothetical protein CLIBASIA_02145 [Candidatus Liberibacter asiaticus str. psy62] Length = 210 Score = 285 bits (730), Expect = 2e-75, Method: Composition-based stats. Identities = 210/210 (100%), Positives = 210/210 (100%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII Sbjct: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 Query: 61 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ Sbjct: 61 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 Query: 121 CKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMD 180 CKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMD Sbjct: 121 CKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMD 180 Query: 181 LKGRPIQELGNNLSDSGLNEQDHNDVQISK 210 LKGRPIQELGNNLSDSGLNEQDHNDVQISK Sbjct: 181 LKGRPIQELGNNLSDSGLNEQDHNDVQISK 210 >gi|170738647|ref|YP_001767302.1| hypothetical protein M446_0297 [Methylobacterium sp. 4-46] gi|168192921|gb|ACA14868.1| conserved hypothetical proteinn [Methylobacterium sp. 4-46] Length = 210 Score = 208 bits (530), Expect = 4e-52, Method: Composition-based stats. Identities = 67/185 (36%), Positives = 105/185 (56%), Gaps = 15/185 (8%) Query: 6 LLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVC 65 L+L L V + A A+ R N A F+G+DKITGR++TF+V ++++ QFG+L + P VC Sbjct: 14 LVLSLAGVLAPAAQADKIR--NPTAVFSGLDKITGRIVTFEVSVDETVQFGALQLTPRVC 71 Query: 66 YSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPI 125 Y+R E+ R AF+ + E+ + R IF+GWMFA SP ++AI+H IYD+WL+ CK Sbjct: 72 YTRPPTESARTTAFLEVDEVTLENKYRRIFTGWMFAASPGLHAIEHPIYDVWLVDCKG-G 130 Query: 126 NDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMD---LK 182 D I+ ++ E + + E+ N+ E +QP+ +D L+ Sbjct: 131 TDIIAEAK--------EQDDAPVAAAKPERRR-RDPNQQQEARRAQPVNRQGQVDVAPLR 181 Query: 183 GRPIQ 187 G P+Q Sbjct: 182 GTPVQ 186 >gi|220925866|ref|YP_002501168.1| hypothetical protein Mnod_6040 [Methylobacterium nodulans ORS 2060] gi|219950473|gb|ACL60865.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 210 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 58/168 (34%), Positives = 96/168 (57%), Gaps = 13/168 (7%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSI 82 + N A F+G+DKITGR+++F+V ++++ QFG+L + P VCY+R E+ + AF+ + Sbjct: 29 DKIRNPTAVFSGLDKITGRIVSFEVAVDETVQFGALQLTPRVCYTRPPTESAKTTAFLEV 88 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSE 142 E+ + R IF+GWMFA SP ++AI+H IYD+WL+ CK D I+ ++ E Sbjct: 89 DEVTLENKYRRIFTGWMFAASPGLHAIEHPIYDVWLVDCKG-GTDIIAEAK--------E 139 Query: 143 YSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMD---LKGRPIQ 187 + + E+ N+ E +QP+ +D L+G P+Q Sbjct: 140 QDDAPVAAAKPERRR-RDPNQREEARRAQPVNRQGQVDVTPLRGTPVQ 186 >gi|254294263|ref|YP_003060286.1| hypothetical protein Hbal_1904 [Hirschia baltica ATCC 49814] gi|254042794|gb|ACT59589.1| conserved hypothetical protein [Hirschia baltica ATCC 49814] Length = 217 Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats. Identities = 39/179 (21%), Positives = 80/179 (44%), Gaps = 21/179 (11%) Query: 28 KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE-AQRIDAFVSISEI- 85 + ++KITG+ ++E++++ QFG L + C+ + A++ + + Sbjct: 30 PGVKVRALEKITGKATDIEIELDETVQFGGLGLTVRACHQSPPEDQPPEAAAYLEVISMG 89 Query: 86 ------FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKA 139 +FSGWMFA SP +NA++HS+YD+W++ C + + + Sbjct: 90 VNAETGTAKDDDPRLFSGWMFASSPGLNALEHSLYDVWVISCSAALPGTEAK-----PLD 144 Query: 140 LSEYSSTDITSQGSEKSSGSSSNK-------TLEKESSQPLENNLSMDLKGRPIQELGN 191 L E S+ + E S N+ +++ + +P+ DL+ P++ G+ Sbjct: 145 LYEESNLGFDAIPEEALPSSDINESASMGLPSIDDFNPEPIFVE-EADLEAVPVERSGS 202 >gi|170746752|ref|YP_001753012.1| hypothetical protein Mrad2831_0305 [Methylobacterium radiotolerans JCM 2831] gi|170653274|gb|ACB22329.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 218 Score = 182 bits (463), Expect = 2e-44, Method: Composition-based stats. Identities = 62/173 (35%), Positives = 98/173 (56%), Gaps = 19/173 (10%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A + + N A F+G+DKITGR++ F+V ++++ QFG+L + P VCY+R E+ + AF Sbjct: 25 AAADKIKNPTAVFSGLDKITGRIVNFEVAVDETVQFGALQLTPRVCYTRPPTESAKTTAF 84 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKA 139 + + E+ D R IF+GWMFA SP ++AI+H IYD+WL+ CK +D I+ ++ Sbjct: 85 LEVDEVTLDNKYRRIFTGWMFASSPGLHAIEHPIYDVWLVDCKG-GSDVIAEAK------ 137 Query: 140 LSEYSSTDITSQGSE--KSSGSSSNKTLEKESSQPLENNLSMDL---KGRPIQ 187 E + E K G + KT +Q L N +D+ +G P+Q Sbjct: 138 --EQEDVPAVAAKPEKAKRPGKDATKT-----AQQLNANGQVDVEAPRGVPVQ 183 >gi|188580300|ref|YP_001923745.1| hypothetical protein Mpop_1034 [Methylobacterium populi BJ001] gi|179343798|gb|ACB79210.1| conserved hypothetical proteinn [Methylobacterium populi BJ001] Length = 219 Score = 180 bits (456), Expect = 2e-43, Method: Composition-based stats. Identities = 63/177 (35%), Positives = 96/177 (54%), Gaps = 19/177 (10%) Query: 16 HAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR 75 A++ + N A F+G+DKITGR++TF+V I+++ QFG+L + P VCYSR E + Sbjct: 22 SVLPASADKIKNPTAVFSGLDKITGRIVTFEVAIDETVQFGALQMTPRVCYSRPPTETPK 81 Query: 76 IDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESI 135 AF+ + E+ D R IF+GWMFA SP ++AI+H IYD+WL CK D I+ ++ Sbjct: 82 TTAFLEVDEVTLDSKYRRIFTGWMFASSPGLHAIEHPIYDVWLTDCKG-GTDVIAEAK-- 138 Query: 136 SKKALSEYSSTDITSQGSE--KSSGSSSNKTLEKESSQPLENNLSMDL---KGRPIQ 187 E + E K G+ KT + + N +D+ +G P+Q Sbjct: 139 ------EQEDVPALASRQEKPKKKGADPTKT-----ASQVNQNGQVDVEGPRGVPVQ 184 >gi|163850532|ref|YP_001638575.1| hypothetical protein Mext_1100 [Methylobacterium extorquens PA1] gi|218529229|ref|YP_002420045.1| hypothetical protein Mchl_1229 [Methylobacterium chloromethanicum CM4] gi|240137597|ref|YP_002962068.1| hypothetical protein MexAM1_META1p0870 [Methylobacterium extorquens AM1] gi|254560069|ref|YP_003067164.1| hypothetical protein METDI1582 [Methylobacterium extorquens DM4] gi|163662137|gb|ABY29504.1| conserved hypothetical proteinn [Methylobacterium extorquens PA1] gi|218521532|gb|ACK82117.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] gi|240007565|gb|ACS38791.1| conserved hypothetical protein; putative exported protein [Methylobacterium extorquens AM1] gi|254267347|emb|CAX23179.1| conserved hypothetical protein; putative exported protein [Methylobacterium extorquens DM4] Length = 219 Score = 172 bits (435), Expect = 4e-41, Method: Composition-based stats. Identities = 61/170 (35%), Positives = 94/170 (55%), Gaps = 19/170 (11%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSI 82 + N A F+G+DKITGR++TF+V I+++ QFG+L + P VCYSR E + AF+ + Sbjct: 29 DKIKNPTAVFSGLDKITGRIVTFEVAIDETVQFGALQMTPRVCYSRPPTETPKTTAFLEV 88 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSE 142 E+ D R IF+GWMFA SP ++AI+H IYD+WL CK +D I+ ++ E Sbjct: 89 DEVTLDSKYRRIFTGWMFAASPGLHAIEHPIYDVWLTDCKG-GSDVIAEAK--------E 139 Query: 143 YSSTDITSQGSE--KSSGSSSNKTLEKESSQPLENNLSMDL---KGRPIQ 187 + + + G+ KT S + N +D+ +G P+Q Sbjct: 140 QEDVPALASRQDKPRKKGADPTKT-----SAQVNQNGQVDVEGPRGVPVQ 184 >gi|254456041|ref|ZP_05069470.1| conserved hypothetical protein [Candidatus Pelagibacter sp. HTCC7211] gi|207083043|gb|EDZ60469.1| conserved hypothetical protein [Candidatus Pelagibacter sp. HTCC7211] Length = 135 Score = 168 bits (426), Expect = 4e-40, Method: Composition-based stats. Identities = 28/122 (22%), Positives = 60/122 (49%), Gaps = 2/122 (1%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 + +L ++ + FA + + +DKI+ + ++ + +F L I Sbjct: 14 LNLFYFILFIYLFLCNFSFAKNNT-EGVFTDLKILDKISSKNTLIQLKNGELVKFKDLSI 72 Query: 61 KPMVCYSRDDREAQRIDAFVSISEIFT-DRIVRSIFSGWMFADSPAMNAIDHSIYDIWLM 119 K + C + + + I A++ + ++ D+ +F+GWMF+ SP++ DH +YD+WL+ Sbjct: 73 KSLKCKNSEFDDNPEITAYIQVKDLTDQDKDEVFVFNGWMFSSSPSITPFDHPVYDVWLV 132 Query: 120 QC 121 C Sbjct: 133 NC 134 >gi|222148330|ref|YP_002549287.1| hypothetical protein Avi_1785 [Agrobacterium vitis S4] gi|221735318|gb|ACM36281.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 171 Score = 163 bits (413), Expect = 1e-38, Method: Composition-based stats. Identities = 65/115 (56%), Positives = 85/115 (73%) Query: 8 LILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYS 67 L + F + A++AR +N VA F+G+DKITGR+ FDV +N++ QFG+L + P CYS Sbjct: 47 LTVAFSITATVPADAARISNAVAVFSGLDKITGRITEFDVYLNETVQFGALQVTPKACYS 106 Query: 68 RDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 RD+ EAQ +DAFV + EI DR +R IFSGWMFADSPA+NAI+H IYD+WL CK Sbjct: 107 RDETEAQHVDAFVQVDEITLDRRIRQIFSGWMFADSPALNAIEHPIYDVWLKDCK 161 >gi|91762149|ref|ZP_01264114.1| hypothetical protein PU1002_02751 [Candidatus Pelagibacter ubique HTCC1002] gi|91717951|gb|EAS84601.1| hypothetical protein PU1002_02751 [Candidatus Pelagibacter ubique HTCC1002] Length = 135 Score = 162 bits (409), Expect = 4e-38, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 2/120 (1%) Query: 4 RVLLLILFFVFSHAK-FANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKP 62 + +LI FF+ S + + K E +DK++ + ++I + +F SL+IK Sbjct: 15 FLFILIYFFLTSISSPLVANENSEGKFVEIKILDKVSSKTDLLKLKIGEELRFKSLLIKS 74 Query: 63 MVCYSRDDREAQRIDAFVSISE-IFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + C + + + I +++ + + I D IF+GW F+ SPA+N DH +YDIWL +C Sbjct: 75 LKCKNSEFDDNPEITSYIQVKDTINNDNNEVFIFNGWTFSSSPAVNPFDHPVYDIWLTRC 134 >gi|315122870|ref|YP_004063359.1| hypothetical protein CKC_05630 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496272|gb|ADR52871.1| hypothetical protein CKC_05630 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 193 Score = 162 bits (409), Expect = 4e-38, Method: Composition-based stats. Identities = 112/204 (54%), Positives = 138/204 (67%), Gaps = 13/204 (6%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 MK++VL L + F F+ A SARF NK+AEFAGMDKITGR+L FDV+IN+S QFGSL I Sbjct: 1 MKHKVLFLAVLFFFNTAGIVKSARFENKIAEFAGMDKITGRILRFDVDINRSVQFGSLKI 60 Query: 61 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 PMVCYSRDD+E QR+D+FVSISEI TD VRSIFSGWMFADSPAMNAIDHSIYD+WL+Q Sbjct: 61 TPMVCYSRDDKEIQRVDSFVSISEISTDHTVRSIFSGWMFADSPAMNAIDHSIYDVWLIQ 120 Query: 121 CKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMD 180 CK+PI DS NS S T + + + ++ K SSQ +E + Sbjct: 121 CKNPIKDSDKNSTRYS------------TPVPKMTVTENPDDNSIPKASSQSIEIS-DAH 167 Query: 181 LKGRPIQELGNNLSDSGLNEQDHN 204 L Q+ NNL+ S L+ +D + Sbjct: 168 LDKNYNQKSENNLNTSDLDRKDDD 191 >gi|163868079|ref|YP_001609283.1| hypothetical protein Btr_0884 [Bartonella tribocorum CIP 105476] gi|161017730|emb|CAK01288.1| conserved hypothetical protein [Bartonella tribocorum CIP 105476] Length = 138 Score = 162 bits (409), Expect = 4e-38, Method: Composition-based stats. Identities = 50/127 (39%), Positives = 77/127 (60%), Gaps = 1/127 (0%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 ++ + L+ + S + R +N +A FAG+DKITGR F+V + + Q+G+L + Sbjct: 8 LRIHIFLIGILVFLSLNSGGRAERISNGIAVFAGLDKITGRTTRFEVSLGEVYQYGALQV 67 Query: 61 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 P CY+ E R FV ++E+ D+ +R IF+GWMFADSP +NA++H IYD+WL Sbjct: 68 TPRACYTSSKDEPTRTTGFVEVNEVTLDKKIRRIFTGWMFADSPGLNAVEHPIYDVWLKD 127 Query: 121 CK-DPIN 126 CK + N Sbjct: 128 CKQNSQN 134 >gi|144898205|emb|CAM75069.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 127 Score = 161 bits (408), Expect = 5e-38, Method: Composition-based stats. Identities = 36/113 (31%), Positives = 65/113 (57%) Query: 10 LFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRD 69 L ++ + A + + +A G+DKIT RV+T + + + +FG+L + C R Sbjct: 12 LLWLTAPAIAQQAPELSLDMAVLGGLDKITARVVTIEAPVGEPVRFGTLEVVARACKKRR 71 Query: 70 DREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 E+ AF+ I +I + + +F GWMFA SPA++A++H +YD+W++ C+ Sbjct: 72 PEESPESAAFLDIWDIKQGQPAQGVFRGWMFASSPALSAMEHPVYDVWVLDCR 124 >gi|153009606|ref|YP_001370821.1| hypothetical protein Oant_2276 [Ochrobactrum anthropi ATCC 49188] gi|151561494|gb|ABS14992.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 156 Score = 160 bits (406), Expect = 9e-38, Method: Composition-based stats. Identities = 65/118 (55%), Positives = 83/118 (70%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 V L+ L S + A + R N VAEF+G+DKITGR+ TFDV IN++ QFG+L + P V Sbjct: 27 VALISLIATTSSFQAAMAERITNPVAEFSGLDKITGRITTFDVYINETVQFGALQVTPKV 86 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 CYSR + EA R D FV + EI DR +R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 87 CYSRTENEAPRTDGFVEVEEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCK 144 >gi|225627393|ref|ZP_03785430.1| Hypothetical protein, conserved [Brucella ceti str. Cudo] gi|237815335|ref|ZP_04594333.1| Hypothetical protein, conserved [Brucella abortus str. 2308 A] gi|225617398|gb|EEH14443.1| Hypothetical protein, conserved [Brucella ceti str. Cudo] gi|237790172|gb|EEP64382.1| Hypothetical protein, conserved [Brucella abortus str. 2308 A] Length = 157 Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats. Identities = 62/122 (50%), Positives = 85/122 (69%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 + L+ + A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P V Sbjct: 28 IALITVLAGMGSLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKV 87 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 CYSR + EA R DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 88 CYSRTEDEAPRTDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQK 147 Query: 125 IN 126 + Sbjct: 148 SD 149 >gi|17987348|ref|NP_539982.1| hypothetical protein BMEI1065 [Brucella melitensis bv. 1 str. 16M] gi|189024089|ref|YP_001934857.1| hypothetical protein BAbS19_I08620 [Brucella abortus S19] gi|260545406|ref|ZP_05821147.1| conserved hypothetical protein [Brucella abortus NCTC 8038] gi|260566541|ref|ZP_05837011.1| conserved hypothetical protein [Brucella suis bv. 4 str. 40] gi|260754652|ref|ZP_05867000.1| conserved hypothetical protein [Brucella abortus bv. 6 str. 870] gi|260757875|ref|ZP_05870223.1| conserved hypothetical protein [Brucella abortus bv. 4 str. 292] gi|260761698|ref|ZP_05874041.1| conserved hypothetical protein [Brucella abortus bv. 2 str. 86/8/59] gi|260883678|ref|ZP_05895292.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|261314352|ref|ZP_05953549.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261325009|ref|ZP_05964206.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|265991001|ref|ZP_06103558.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. Rev.1] gi|265999496|ref|ZP_06111709.1| conserved hypothetical protein [Brucella melitensis bv. 2 str. 63/9] gi|294852259|ref|ZP_06792932.1| hypothetical protein BAZG_01178 [Brucella sp. NVSL 07-0026] gi|17983032|gb|AAL52246.1| retrovirus-related pol polyprotein [Brucella melitensis bv. 1 str. 16M] gi|189019661|gb|ACD72383.1| hypothetical protein BAbS19_I08620 [Brucella abortus S19] gi|260096813|gb|EEW80688.1| conserved hypothetical protein [Brucella abortus NCTC 8038] gi|260156059|gb|EEW91139.1| conserved hypothetical protein [Brucella suis bv. 4 str. 40] gi|260668193|gb|EEX55133.1| conserved hypothetical protein [Brucella abortus bv. 4 str. 292] gi|260672130|gb|EEX58951.1| conserved hypothetical protein [Brucella abortus bv. 2 str. 86/8/59] gi|260674760|gb|EEX61581.1| conserved hypothetical protein [Brucella abortus bv. 6 str. 870] gi|260873206|gb|EEX80275.1| conserved hypothetical protein [Brucella abortus bv. 9 str. C68] gi|261300989|gb|EEY04486.1| conserved hypothetical protein [Brucella neotomae 5K33] gi|261303378|gb|EEY06875.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|263001785|gb|EEZ14360.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. Rev.1] gi|263094290|gb|EEZ18151.1| conserved hypothetical protein [Brucella melitensis bv. 2 str. 63/9] gi|294820848|gb|EFG37847.1| hypothetical protein BAZG_01178 [Brucella sp. NVSL 07-0026] Length = 151 Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats. Identities = 62/122 (50%), Positives = 85/122 (69%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 + L+ + A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P V Sbjct: 22 IALITVLAGMGSLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKV 81 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 CYSR + EA R DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 82 CYSRTEDEAPRTDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQK 141 Query: 125 IN 126 + Sbjct: 142 SD 143 >gi|23501790|ref|NP_697917.1| hypothetical protein BR0904 [Brucella suis 1330] gi|62289847|ref|YP_221640.1| hypothetical protein BruAb1_0915 [Brucella abortus bv. 1 str. 9-941] gi|82699773|ref|YP_414347.1| hypothetical protein BAB1_0922 [Brucella melitensis biovar Abortus 2308] gi|148559612|ref|YP_001258882.1| hypothetical protein BOV_0900 [Brucella ovis ATCC 25840] gi|161618862|ref|YP_001592749.1| hypothetical protein BCAN_A0917 [Brucella canis ATCC 23365] gi|163843175|ref|YP_001627579.1| hypothetical protein BSUIS_A0943 [Brucella suis ATCC 23445] gi|225852416|ref|YP_002732649.1| hypothetical protein BMEA_A0943 [Brucella melitensis ATCC 23457] gi|254689153|ref|ZP_05152407.1| hypothetical protein Babob68_02995 [Brucella abortus bv. 6 str. 870] gi|254693636|ref|ZP_05155464.1| hypothetical protein Babob3T_03019 [Brucella abortus bv. 3 str. Tulya] gi|254697288|ref|ZP_05159116.1| hypothetical protein Babob28_06109 [Brucella abortus bv. 2 str. 86/8/59] gi|254701668|ref|ZP_05163496.1| hypothetical protein Bsuib55_12531 [Brucella suis bv. 5 str. 513] gi|254704211|ref|ZP_05166039.1| hypothetical protein Bsuib36_09839 [Brucella suis bv. 3 str. 686] gi|254706887|ref|ZP_05168715.1| hypothetical protein BpinM_07851 [Brucella pinnipedialis M163/99/10] gi|254710005|ref|ZP_05171816.1| hypothetical protein BpinB_06966 [Brucella pinnipedialis B2/94] gi|254714006|ref|ZP_05175817.1| hypothetical protein BcetM6_11741 [Brucella ceti M644/93/1] gi|254716935|ref|ZP_05178746.1| hypothetical protein BcetM_11037 [Brucella ceti M13/05/1] gi|254730186|ref|ZP_05188764.1| hypothetical protein Babob42_03034 [Brucella abortus bv. 4 str. 292] gi|256031500|ref|ZP_05445114.1| hypothetical protein BpinM2_12743 [Brucella pinnipedialis M292/94/1] gi|256044577|ref|ZP_05447481.1| hypothetical protein Bmelb1R_08773 [Brucella melitensis bv. 1 str. Rev.1] gi|256061009|ref|ZP_05451166.1| hypothetical protein Bneo5_11692 [Brucella neotomae 5K33] gi|256113450|ref|ZP_05454291.1| hypothetical protein Bmelb3E_11952 [Brucella melitensis bv. 3 str. Ether] gi|256159625|ref|ZP_05457387.1| hypothetical protein BcetM4_11713 [Brucella ceti M490/95/1] gi|256254905|ref|ZP_05460441.1| hypothetical protein BcetB_11530 [Brucella ceti B1/94] gi|256257403|ref|ZP_05462939.1| hypothetical protein Babob9C_08584 [Brucella abortus bv. 9 str. C68] gi|256369332|ref|YP_003106840.1| hypothetical protein BMI_I903 [Brucella microti CCM 4915] gi|260168633|ref|ZP_05755444.1| hypothetical protein BruF5_09729 [Brucella sp. F5/99] gi|260563928|ref|ZP_05834414.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|261213902|ref|ZP_05928183.1| conserved hypothetical protein [Brucella abortus bv. 3 str. Tulya] gi|261218741|ref|ZP_05933022.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261222087|ref|ZP_05936368.1| conserved hypothetical protein [Brucella ceti B1/94] gi|261317553|ref|ZP_05956750.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261321760|ref|ZP_05960957.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|261752220|ref|ZP_05995929.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] gi|261754879|ref|ZP_05998588.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] gi|261758106|ref|ZP_06001815.1| conserved hypothetical protein [Brucella sp. F5/99] gi|265988587|ref|ZP_06101144.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|265994838|ref|ZP_06107395.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] gi|265998052|ref|ZP_06110609.1| conserved hypothetical protein [Brucella ceti M490/95/1] gi|297248252|ref|ZP_06931970.1| hypothetical protein BAYG_01190 [Brucella abortus bv. 5 str. B3196] gi|23347721|gb|AAN29832.1| conserved hypothetical protein [Brucella suis 1330] gi|62195979|gb|AAX74279.1| conserved hypothetical protein [Brucella abortus bv. 1 str. 9-941] gi|82615874|emb|CAJ10878.1| conserved hypothetical protein [Brucella melitensis biovar Abortus 2308] gi|148370869|gb|ABQ60848.1| conserved hypothetical protein [Brucella ovis ATCC 25840] gi|161335673|gb|ABX61978.1| Hypothetical protein BCAN_A0917 [Brucella canis ATCC 23365] gi|163673898|gb|ABY38009.1| Hypothetical protein BSUIS_A0943 [Brucella suis ATCC 23445] gi|225640781|gb|ACO00695.1| Hypothetical protein, conserved [Brucella melitensis ATCC 23457] gi|255999492|gb|ACU47891.1| hypothetical protein BMI_I903 [Brucella microti CCM 4915] gi|260153944|gb|EEW89036.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|260915509|gb|EEX82370.1| conserved hypothetical protein [Brucella abortus bv. 3 str. Tulya] gi|260920671|gb|EEX87324.1| conserved hypothetical protein [Brucella ceti B1/94] gi|260923830|gb|EEX90398.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261294450|gb|EEX97946.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|261296776|gb|EEY00273.1| conserved hypothetical protein [Brucella pinnipedialis B2/94] gi|261738090|gb|EEY26086.1| conserved hypothetical protein [Brucella sp. F5/99] gi|261741973|gb|EEY29899.1| conserved hypothetical protein [Brucella suis bv. 5 str. 513] gi|261744632|gb|EEY32558.1| conserved hypothetical protein [Brucella suis bv. 3 str. 686] gi|262552520|gb|EEZ08510.1| conserved hypothetical protein [Brucella ceti M490/95/1] gi|262765951|gb|EEZ11740.1| conserved hypothetical protein [Brucella melitensis bv. 3 str. Ether] gi|264660784|gb|EEZ31045.1| conserved hypothetical protein [Brucella pinnipedialis M292/94/1] gi|297175421|gb|EFH34768.1| hypothetical protein BAYG_01190 [Brucella abortus bv. 5 str. B3196] Length = 156 Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats. Identities = 62/122 (50%), Positives = 85/122 (69%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 + L+ + A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P V Sbjct: 27 IALITVLAGMGSLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKV 86 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 CYSR + EA R DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 87 CYSRTEDEAPRTDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQK 146 Query: 125 IN 126 + Sbjct: 147 SD 148 >gi|306843802|ref|ZP_07476400.1| Hypothetical protein BIBO1_0463 [Brucella sp. BO1] gi|306275880|gb|EFM57596.1| Hypothetical protein BIBO1_0463 [Brucella sp. BO1] Length = 151 Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats. Identities = 62/122 (50%), Positives = 85/122 (69%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 + L+ + A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P V Sbjct: 22 LALITVLAGIGSLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKV 81 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 CYSR + EA R DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 82 CYSRTEDEAPRTDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQK 141 Query: 125 IN 126 + Sbjct: 142 SD 143 >gi|239831781|ref|ZP_04680110.1| Hypothetical protein, conserved [Ochrobactrum intermedium LMG 3301] gi|239824048|gb|EEQ95616.1| Hypothetical protein, conserved [Ochrobactrum intermedium LMG 3301] Length = 162 Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats. Identities = 65/118 (55%), Positives = 82/118 (69%), Gaps = 1/118 (0%) Query: 6 LLLILFFVFSHAKFAN-SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 L LI F + A + R N VAEF+G+DKITGR+ TFDV IN++ QFG+L + P V Sbjct: 33 LALISFMATASCFQAAMAERITNPVAEFSGLDKITGRITTFDVYINETVQFGALQVTPKV 92 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 CYSR + EA R D FV + EI DR +R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 93 CYSRTENEAPRTDGFVQVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCK 150 >gi|328543660|ref|YP_004303769.1| Cellulase-like protein [polymorphum gilvum SL003B-26A1] gi|326413404|gb|ADZ70467.1| Cellulase-like protein [Polymorphum gilvum SL003B-26A1] Length = 183 Score = 160 bits (404), Expect = 1e-37, Method: Composition-based stats. Identities = 56/139 (40%), Positives = 83/139 (59%), Gaps = 10/139 (7%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A++ + N VA F+G+DKITGR+++FDV I ++ QFG+L + P VCYSR E + DAF Sbjct: 31 AHADKIENPVAVFSGLDKITGRIISFDVYIGETVQFGALQVTPRVCYSRPQTETPQTDAF 90 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKA 139 V + EI + VR IFSGWMFA SP ++A++H++YD+WL C+ + S+ Sbjct: 91 VQVDEITLNNEVRRIFSGWMFAASPGLHAVEHAVYDVWLTDCRM--------TSSVPPPE 142 Query: 140 LSEYSSTDITSQGSEKSSG 158 S + + E G Sbjct: 143 GY--SGPPVAASVPEGEDG 159 >gi|306840398|ref|ZP_07473163.1| Hypothetical protein BIBO2_0198 [Brucella sp. BO2] gi|306289636|gb|EFM60840.1| Hypothetical protein BIBO2_0198 [Brucella sp. BO2] Length = 156 Score = 159 bits (403), Expect = 2e-37, Method: Composition-based stats. Identities = 62/122 (50%), Positives = 85/122 (69%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 + L+ + A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P V Sbjct: 27 LALITVLAGMGSLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKV 86 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 CYSR + EA R DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 87 CYSRTEDEAPRTDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQK 146 Query: 125 IN 126 + Sbjct: 147 SD 148 >gi|326408925|gb|ADZ65990.1| conserved hypothetical protein [Brucella melitensis M28] gi|326538641|gb|ADZ86856.1| conserved hypothetical protein [Brucella melitensis M5-90] Length = 121 Score = 159 bits (403), Expect = 2e-37, Method: Composition-based stats. Identities = 61/111 (54%), Positives = 81/111 (72%) Query: 16 HAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR 75 A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P VCYSR + EA R Sbjct: 3 SLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKVCYSRTEDEAPR 62 Query: 76 IDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK + Sbjct: 63 TDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQKSD 113 >gi|265983997|ref|ZP_06096732.1| conserved hypothetical protein [Brucella sp. 83/13] gi|264662589|gb|EEZ32850.1| conserved hypothetical protein [Brucella sp. 83/13] Length = 151 Score = 159 bits (403), Expect = 2e-37, Method: Composition-based stats. Identities = 61/111 (54%), Positives = 81/111 (72%) Query: 16 HAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR 75 A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P VCYSR + EA R Sbjct: 33 SLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKVCYSRTEDEAPR 92 Query: 76 IDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK + Sbjct: 93 TDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQKSD 143 >gi|254719007|ref|ZP_05180818.1| hypothetical protein Bru83_05609 [Brucella sp. 83/13] gi|306840106|ref|ZP_07472892.1| Hypothetical protein BROD_2979 [Brucella sp. NF 2653] gi|306404834|gb|EFM61127.1| Hypothetical protein BROD_2979 [Brucella sp. NF 2653] Length = 156 Score = 159 bits (403), Expect = 2e-37, Method: Composition-based stats. Identities = 61/111 (54%), Positives = 81/111 (72%) Query: 16 HAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR 75 A + R +N VA+F+G+DKITGR+ TFDV IN++ QFG+L + P VCYSR + EA R Sbjct: 38 SLHAARAERISNPVAQFSGLDKITGRITTFDVYINETVQFGALQVTPKVCYSRTEDEAPR 97 Query: 76 IDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 DAFV++ EI DR +R IF+GWMFADSP +NA++H IYD+WL CK + Sbjct: 98 TDAFVTVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQKSD 148 >gi|240850282|ref|YP_002971675.1| hypothetical protein Bgr_06800 [Bartonella grahamii as4aup] gi|240267405|gb|ACS50993.1| hypothetical protein Bgr_06800 [Bartonella grahamii as4aup] Length = 141 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. Identities = 51/129 (39%), Positives = 77/129 (59%), Gaps = 2/129 (1%) Query: 1 MKYRVLLLIL--FFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSL 58 +K+ + + ++ S + R +N +A FAG+DKITGR F+V + + Q+G+L Sbjct: 9 VKHFIYIFLMGVLVFLSLNSGVRAERISNGIAVFAGLDKITGRTTRFEVTLGKIYQYGAL 68 Query: 59 IIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWL 118 + P CY+ E R FV ++E+ D+ VR IF+GWMFADSP +NA++H IYD+WL Sbjct: 69 QVTPRACYTSSKDEPTRTTGFVEVNEVTLDKKVRRIFTGWMFADSPGLNAVEHPIYDVWL 128 Query: 119 MQCKDPIND 127 CK D Sbjct: 129 KDCKQNSQD 137 >gi|116251846|ref|YP_767684.1| hypothetical protein RL2086 [Rhizobium leguminosarum bv. viciae 3841] gi|115256494|emb|CAK07578.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 146 Score = 159 bits (402), Expect = 2e-37, Method: Composition-based stats. Identities = 61/110 (55%), Positives = 79/110 (71%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 AN+AR N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRD EAQ+I Sbjct: 25 PVAANAARIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDQSEAQKI 84 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 D FV + EI DR +R IF+GWMFA SP +NA++H IYD+WL CK + Sbjct: 85 DGFVEVDEITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCKTSSD 134 >gi|217976383|ref|YP_002360530.1| hypothetical protein Msil_0187 [Methylocella silvestris BL2] gi|217501759|gb|ACK49168.1| conserved hypothetical protein [Methylocella silvestris BL2] Length = 240 Score = 158 bits (401), Expect = 3e-37, Method: Composition-based stats. Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 2/165 (1%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A + R + +A F+G+DKITGR++TF+V +++ QFG+L I CY+R EA + F Sbjct: 35 AQADRIKHPIAVFSGLDKITGRIITFEVATDETVQFGTLQITERACYTRPATEAPQTTTF 94 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK--DPINDSISNSESISK 137 V + E+ + IFSGWMFA SP ++ I+H IYDIWL CK I S S + + Sbjct: 95 VEVDEVDAKNDYKRIFSGWMFAASPGLHGIEHPIYDIWLTDCKGGKEIVVSPSAAAEPTP 154 Query: 138 KALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMDLK 182 S T + + ++ PL ++ Sbjct: 155 PPPENASPTPKKATKPRRVQPQLPQPPVDNFGEAPLPFQDQAPVE 199 >gi|327193632|gb|EGE60515.1| hypothetical protein RHECNPAF_1440013 [Rhizobium etli CNPAF512] Length = 146 Score = 158 bits (401), Expect = 3e-37, Method: Composition-based stats. Identities = 61/110 (55%), Positives = 79/110 (71%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 AN+AR N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRD EAQ+I Sbjct: 25 PVAANAARIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDQAEAQKI 84 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 D FV + EI DR +R IF+GWMFA SP +NA++H IYD+WL CK + Sbjct: 85 DGFVEVDEITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCKTSSD 134 >gi|218508641|ref|ZP_03506519.1| hypothetical protein RetlB5_14231 [Rhizobium etli Brasil 5] Length = 146 Score = 158 bits (401), Expect = 3e-37, Method: Composition-based stats. Identities = 61/110 (55%), Positives = 79/110 (71%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 AN+AR N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRD EAQ+I Sbjct: 25 PVAANAARIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDQAEAQKI 84 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 D FV + EI DR +R IF+GWMFA SP +NA++H IYD+WL CK + Sbjct: 85 DGFVEVDEITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCKTSSD 134 >gi|190891554|ref|YP_001978096.1| hypothetical protein RHECIAT_CH0001952 [Rhizobium etli CIAT 652] gi|218515121|ref|ZP_03511961.1| hypothetical protein Retl8_16241 [Rhizobium etli 8C-3] gi|190696833|gb|ACE90918.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 146 Score = 158 bits (401), Expect = 4e-37, Method: Composition-based stats. Identities = 63/117 (53%), Positives = 80/117 (68%), Gaps = 4/117 (3%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 AN+AR N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRD EAQ+I Sbjct: 25 PIAANAARIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDQAEAQKI 84 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD----PINDSI 129 D FV + EI DR +R IF+GWMFA SP +NA++H IYD+WL CK P D Sbjct: 85 DGFVEVDEITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCKTTSDVPAPDGT 141 >gi|316934413|ref|YP_004109395.1| hypothetical protein Rpdx1_3081 [Rhodopseudomonas palustris DX-1] gi|315602127|gb|ADU44662.1| Protein of unknown function DUF2155 [Rhodopseudomonas palustris DX-1] Length = 324 Score = 158 bits (400), Expect = 4e-37, Method: Composition-based stats. Identities = 53/117 (45%), Positives = 76/117 (64%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F G+DKITGR ++FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 161 AQKIVNKKASFTGLDKITGRTISFDADIGETVQFGALRVKTDACYTRPSTEATNTDAFVE 220 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKK 138 + EI V+ IFSGWMFA SP ++A++H IYDIWL CK P ++++++ K Sbjct: 221 VDEITLQGEVKRIFSGWMFAASPGLHAVEHPIYDIWLTDCKGPETPNVASAQPEPPK 277 >gi|254502067|ref|ZP_05114218.1| hypothetical protein SADFL11_2105 [Labrenzia alexandrii DFL-11] gi|222438138|gb|EEE44817.1| hypothetical protein SADFL11_2105 [Labrenzia alexandrii DFL-11] Length = 169 Score = 158 bits (399), Expect = 6e-37, Method: Composition-based stats. Identities = 51/143 (35%), Positives = 80/143 (55%), Gaps = 7/143 (4%) Query: 18 KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRID 77 A S + N VA F+G+DKITGR++ FDV + ++ QFG+L + P VC++R E+ Sbjct: 17 AQAESEKIENPVAVFSGLDKITGRIINFDVYVGETVQFGALQVTPRVCHTRPQTESPLTT 76 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISK 137 FV + EI + VR IFSGWM+A SP ++A++H +YDIWL C+ S+ Sbjct: 77 GFVQVDEITLNNEVRRIFSGWMYAASPGLHAVEHPVYDIWLTDCRLA-------SKVPPP 129 Query: 138 KALSEYSSTDITSQGSEKSSGSS 160 + + ++G + +G Sbjct: 130 EDYDGPPIKGVVAEGEDPLAGPD 152 >gi|90417714|ref|ZP_01225626.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337386|gb|EAS51037.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 135 Score = 157 bits (397), Expect = 9e-37, Method: Composition-based stats. Identities = 61/125 (48%), Positives = 83/125 (66%), Gaps = 3/125 (2%) Query: 1 MKYRVLLLILFFV---FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGS 57 M R+ ILF V + + R ANKVA F+G+DKITGR+ +FDV I+++ QFG+ Sbjct: 1 MNRRLCASILFAVTTGLALVPASAQQRIANKVAVFSGLDKITGRITSFDVYIDETVQFGA 60 Query: 58 LIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIW 117 L + P VCY+ + EA + DAFV + EI R +R IFSGWMFA SP +NA++H +YD+W Sbjct: 61 LQVTPKVCYTSAEGEAAKTDAFVKVDEITLQRDIRQIFSGWMFAASPGLNAVEHPVYDVW 120 Query: 118 LMQCK 122 L CK Sbjct: 121 LKSCK 125 >gi|118589092|ref|ZP_01546499.1| hypothetical protein SIAM614_13608 [Stappia aggregata IAM 12614] gi|118438421|gb|EAV45055.1| hypothetical protein SIAM614_13608 [Stappia aggregata IAM 12614] Length = 193 Score = 157 bits (397), Expect = 1e-36, Method: Composition-based stats. Identities = 54/146 (36%), Positives = 83/146 (56%), Gaps = 7/146 (4%) Query: 15 SHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ 74 + A++ + N VA F+G+DKITGR+++FDV I ++ QFG+L + P VCY+R E+ Sbjct: 38 TPLVSAHAEKIENPVAVFSGLDKITGRIISFDVYIGETVQFGALQVTPRVCYTRPQTESP 97 Query: 75 RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSES 134 FV + EI + VR I+SGWMFA SP ++A++H +YDIWL CK S Sbjct: 98 LTTGFVQVDEITLNNEVRRIYSGWMFAASPGLHAVEHPVYDIWLTDCKLA-------STV 150 Query: 135 ISKKALSEYSSTDITSQGSEKSSGSS 160 + + T ++G + +G Sbjct: 151 PPPEDYAGPPITGTVAEGEDPLAGPD 176 >gi|227821674|ref|YP_002825644.1| hypothetical protein NGR_c11050 [Sinorhizobium fredii NGR234] gi|227340673|gb|ACP24891.1| hypothetical protein NGR_c11050 [Sinorhizobium fredii NGR234] Length = 150 Score = 157 bits (396), Expect = 1e-36, Method: Composition-based stats. Identities = 61/123 (49%), Positives = 86/123 (69%), Gaps = 1/123 (0%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 + +L+ + S + A + R +N VA F+G+DKITGR+ TFDV I ++ QFG+L + P V Sbjct: 19 LAVLLALPLISAGEPARATRLSNAVAVFSGIDKITGRITTFDVYIGETVQFGALQVTPHV 78 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 CYSRD+ EA + FV + EI DR +R IF+GWMFADSP +NA++H +YD+WL CK P Sbjct: 79 CYSRDETEAPKTTTFVDVDEITLDRKIRRIFTGWMFADSPGLNAVEHPVYDVWLQSCK-P 137 Query: 125 IND 127 +D Sbjct: 138 TSD 140 >gi|86750143|ref|YP_486639.1| hypothetical protein RPB_3026 [Rhodopseudomonas palustris HaA2] gi|86573171|gb|ABD07728.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 326 Score = 157 bits (396), Expect = 1e-36, Method: Composition-based stats. Identities = 53/109 (48%), Positives = 71/109 (65%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F G+DKITGR + FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 178 AQKIVNKKASFTGLDKITGRTINFDADIGETVQFGALRVKTDACYTRPSTEAANTDAFVE 237 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSIS 130 + EI V+ IFSGWMFA SP ++A++H IYDIWL CK+P +S Sbjct: 238 VDEITLQGEVKRIFSGWMFAASPGLHAVEHPIYDIWLTDCKNPETPVVS 286 >gi|115525001|ref|YP_781912.1| hypothetical protein RPE_2995 [Rhodopseudomonas palustris BisA53] gi|115518948|gb|ABJ06932.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 308 Score = 156 bits (395), Expect = 2e-36, Method: Composition-based stats. Identities = 55/148 (37%), Positives = 77/148 (52%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++ FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 161 AQKIVNKKAVFSGLDKITGRIINFDADIGETVQFGALRVKTDACYTRPSTEATNTDAFVE 220 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALS 141 + EI V+ IFSGWMFA SP ++ I+H IYDIWL CK P + E+ Sbjct: 221 VDEITLQGEVKRIFSGWMFAASPGLHGIEHPIYDIWLTDCKGPETVVAAQPEAPKPPPAQ 280 Query: 142 EYSSTDITSQGSEKSSGSSSNKTLEKES 169 + + + S L Sbjct: 281 KRAPKQQPRPQPQVYPQSPPQNPLPPFR 308 >gi|209885432|ref|YP_002289289.1| hypothetical protein OCAR_6311 [Oligotropha carboxidovorans OM5] gi|209873628|gb|ACI93424.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 297 Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats. Identities = 53/114 (46%), Positives = 76/114 (66%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++TFD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 148 AIKIPNKKAVFSGLDKITGRIITFDQDIGETVQFGALRVKTDACYTRPATEAANTDAFVE 207 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESI 135 + EI V+ IFSGWMFA SP ++A++H IYD+WL+ CK P + +E + Sbjct: 208 VDEITLQNEVKRIFSGWMFAASPGLHAVEHPIYDVWLIDCKSPEQPVTAQNEPV 261 >gi|150396172|ref|YP_001326639.1| hypothetical protein Smed_0949 [Sinorhizobium medicae WSM419] gi|150027687|gb|ABR59804.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 150 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 61/124 (49%), Positives = 82/124 (66%), Gaps = 4/124 (3%) Query: 13 VFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE 72 V S + A +AR N VA F+G+DKITGR+ +FDV I ++ QFG+L + P VCYSRD+ E Sbjct: 27 VVSTTEPAQAARLPNAVAVFSGIDKITGRITSFDVYIGETVQFGALQVTPRVCYSRDETE 86 Query: 73 AQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD----PINDS 128 A + FV + EI DR +R IF+GWMFADSP +NA++H +YD+WL CK P D+ Sbjct: 87 APKTTTFVEVDEITLDRKIRRIFTGWMFADSPGLNAVEHPVYDVWLQSCKTTSEVPPPDT 146 Query: 129 ISNS 132 Sbjct: 147 AEKQ 150 >gi|15965070|ref|NP_385423.1| hypothetical protein SMc01347 [Sinorhizobium meliloti 1021] gi|307301141|ref|ZP_07580910.1| Protein of unknown function DUF2155 [Sinorhizobium meliloti BL225C] gi|307317874|ref|ZP_07597312.1| Protein of unknown function DUF2155 [Sinorhizobium meliloti AK83] gi|15074249|emb|CAC45896.1| Conserved hypothetical protein [Sinorhizobium meliloti 1021] gi|306896636|gb|EFN27384.1| Protein of unknown function DUF2155 [Sinorhizobium meliloti AK83] gi|306904096|gb|EFN34682.1| Protein of unknown function DUF2155 [Sinorhizobium meliloti BL225C] Length = 150 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 60/124 (48%), Positives = 84/124 (67%), Gaps = 4/124 (3%) Query: 13 VFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE 72 V S + A +AR +N VA F+G+DKITGR+ +FDV I ++ QFG+L + P VC+SRD+ E Sbjct: 27 VTSAVETAQAARLSNAVAVFSGIDKITGRITSFDVYIGETVQFGALQVTPRVCHSRDETE 86 Query: 73 AQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD----PINDS 128 A + FV + EI DR +R IF+GWMFADSP +NA++H +YD+WL CK P D+ Sbjct: 87 APKTTTFVEVDEITLDRKIRRIFTGWMFADSPGLNAVEHPVYDVWLQSCKSTSEVPPPDT 146 Query: 129 ISNS 132 + Sbjct: 147 AAKQ 150 >gi|75675950|ref|YP_318371.1| hypothetical protein Nwi_1758 [Nitrobacter winogradskyi Nb-255] gi|74420820|gb|ABA05019.1| conserved hypothetical protein [Nitrobacter winogradskyi Nb-255] Length = 397 Score = 155 bits (393), Expect = 3e-36, Method: Composition-based stats. Identities = 54/112 (48%), Positives = 73/112 (65%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A + R NK A F+G+DKITGR++ FD +I ++ QFG+L +K CY+R EA DAF Sbjct: 233 APAERIVNKKAVFSGLDKITGRIIHFDEDIGETVQFGALRVKTSACYTRPATEAANTDAF 292 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISN 131 V + EI V+ IFSGWMFA SP ++ ++H IYD+WL CKDP I+ Sbjct: 293 VEVDEITLQGEVKRIFSGWMFASSPGLHGVEHPIYDVWLTDCKDPETTVIAE 344 >gi|209549131|ref|YP_002281048.1| hypothetical protein Rleg2_1532 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534887|gb|ACI54822.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 146 Score = 155 bits (393), Expect = 3e-36, Method: Composition-based stats. Identities = 60/106 (56%), Positives = 78/106 (73%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 A++AR N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRD EAQ+I Sbjct: 25 PVAAHAARIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDQAEAQKI 84 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 D FV + EI DR +R IF+GWMFA SP +NA++H IYD+WL CK Sbjct: 85 DGFVEVDEITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCK 130 >gi|91976894|ref|YP_569553.1| hypothetical protein RPD_2422 [Rhodopseudomonas palustris BisB5] gi|91683350|gb|ABE39652.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 316 Score = 155 bits (393), Expect = 3e-36, Method: Composition-based stats. Identities = 53/117 (45%), Positives = 77/117 (65%), Gaps = 1/117 (0%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F G+DKITGR ++FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 168 AQKIVNKKASFTGLDKITGRTISFDADIGETVQFGALRVKTDACYTRPSTEAANTDAFVE 227 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKK 138 + EI V+ I+SGWMFA SP ++A++H IYDIWL CK+P ++ N++ + K Sbjct: 228 VDEITLQGEVKRIYSGWMFAASPGLHAVEHPIYDIWLTDCKNPET-TVVNAQPEAPK 283 >gi|222085635|ref|YP_002544165.1| hypothetical protein Arad_1920 [Agrobacterium radiobacter K84] gi|221723083|gb|ACM26239.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 146 Score = 155 bits (392), Expect = 3e-36, Method: Composition-based stats. Identities = 60/110 (54%), Positives = 82/110 (74%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 + A +AR +N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRDD E Q++ Sbjct: 27 PQAAEAARISNPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDDTEQQKV 86 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 D FV + EI DR +R IF+GWMFADSP +NA++H IYD+WL +CK + Sbjct: 87 DGFVEVDEITLDRRIRRIFTGWMFADSPGLNAVEHPIYDVWLKECKQKSD 136 >gi|218462710|ref|ZP_03502801.1| hypothetical protein RetlK5_26127 [Rhizobium etli Kim 5] Length = 146 Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats. Identities = 61/119 (51%), Positives = 81/119 (68%) Query: 8 LILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYS 67 L+ A++ R N VA F+G+DKITGR+ TFDV +N++ QFG+L + P VCYS Sbjct: 16 LLALTALLPIGAAHATRIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKVCYS 75 Query: 68 RDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 RD EAQ+ID FV + EI DR +R IF+GWMFA SP +NA++H IYD+WL CK + Sbjct: 76 RDQAEAQKIDGFVEVDEITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCKTSSD 134 >gi|192291083|ref|YP_001991688.1| hypothetical protein Rpal_2704 [Rhodopseudomonas palustris TIE-1] gi|192284832|gb|ACF01213.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 329 Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats. Identities = 54/110 (49%), Positives = 74/110 (67%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A FAG+DKITGR + FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 164 AQKIVNKKASFAGLDKITGRTINFDADIGETVQFGALRVKTDACYTRPSTEAANTDAFVE 223 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISN 131 + EI V+ IFSGWMFA SP ++A++H IYDIWL CKDP ++++ Sbjct: 224 VDEITLQGEVKRIFSGWMFAASPGLHAVEHPIYDIWLTDCKDPETSNVAS 273 >gi|39935492|ref|NP_947768.1| hypothetical protein RPA2426 [Rhodopseudomonas palustris CGA009] gi|39649344|emb|CAE27867.1| hypothetical protein RPA2426 [Rhodopseudomonas palustris CGA009] Length = 329 Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats. Identities = 54/110 (49%), Positives = 74/110 (67%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A FAG+DKITGR + FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 164 AQKIVNKKASFAGLDKITGRTINFDADIGETVQFGALRVKTDACYTRPSTEAANTDAFVE 223 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISN 131 + EI V+ IFSGWMFA SP ++A++H IYDIWL CKDP ++++ Sbjct: 224 VDEITLQGEVKRIFSGWMFAASPGLHAVEHPIYDIWLTDCKDPETSNVAS 273 >gi|90424380|ref|YP_532750.1| hypothetical protein RPC_2883 [Rhodopseudomonas palustris BisB18] gi|90106394|gb|ABD88431.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 309 Score = 155 bits (391), Expect = 5e-36, Method: Composition-based stats. Identities = 53/137 (38%), Positives = 81/137 (59%), Gaps = 10/137 (7%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++ FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 160 AQKIVNKKASFSGLDKITGRIINFDADIGETVQFGALRVKTDACYTRPATEAANTDAFVE 219 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALS 141 + EI V+ IFSGWMFA SP ++ ++H IYDIWL CK P ++ + Sbjct: 220 VDEITLQGEVKRIFSGWMFAASPGLHGVEHPIYDIWLTDCKGPETTVVA----------A 269 Query: 142 EYSSTDITSQGSEKSSG 158 + + + +Q ++K + Sbjct: 270 QPDAKPVAAQPAQKRAA 286 >gi|218661471|ref|ZP_03517401.1| hypothetical protein RetlI_19085 [Rhizobium etli IE4771] Length = 125 Score = 155 bits (391), Expect = 5e-36, Method: Composition-based stats. Identities = 60/108 (55%), Positives = 79/108 (73%) Query: 19 FANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDA 78 A++ R N VA F+G+DKITGR+ TFDV +N++ QFG+L + P VCYSRD EAQ+ID Sbjct: 6 AAHATRIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKVCYSRDQAEAQKIDG 65 Query: 79 FVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 FV + EI DR +R IF+GWMFA SP +NA++H IYD+WL CK + Sbjct: 66 FVEVDEITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCKTSSD 113 >gi|299133762|ref|ZP_07026956.1| Protein of unknown function DUF2155 [Afipia sp. 1NLS2] gi|298591598|gb|EFI51799.1| Protein of unknown function DUF2155 [Afipia sp. 1NLS2] Length = 306 Score = 155 bits (391), Expect = 5e-36, Method: Composition-based stats. Identities = 53/112 (47%), Positives = 74/112 (66%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++TFD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 153 AVKIPNKKAVFSGLDKITGRIITFDEDIGETVQFGALRVKTDACYTRPATEAANTDAFVE 212 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSE 133 + EI V+ IFSGWMFA SP ++A++H IYD+WL CK P + +E Sbjct: 213 VDEITLQNEVKRIFSGWMFAASPGLHAVEHPIYDVWLTDCKGPEQPVTAQNE 264 >gi|148255598|ref|YP_001240183.1| hypothetical protein BBta_4221 [Bradyrhizobium sp. BTAi1] gi|146407771|gb|ABQ36277.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 316 Score = 154 bits (390), Expect = 5e-36, Method: Composition-based stats. Identities = 53/118 (44%), Positives = 75/118 (63%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++ FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 151 AQKIVNKKASFSGLDKITGRIINFDEDIGETVQFGALRVKTDACYTRPATEAANTDAFVQ 210 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKA 139 + EI V+ IFSGWMFA SP ++ ++H IYDIWL+ CK+P +S + A Sbjct: 211 VDEITLQGEVKRIFSGWMFAASPGLHGVEHPIYDIWLVDCKEPQTTVVSTAPDQKPAA 268 >gi|83858504|ref|ZP_00952026.1| hypothetical protein OA2633_03356 [Oceanicaulis alexandrii HTCC2633] gi|83853327|gb|EAP91179.1| hypothetical protein OA2633_03356 [Oceanicaulis alexandrii HTCC2633] Length = 157 Score = 154 bits (390), Expect = 6e-36, Method: Composition-based stats. Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 5/101 (4%) Query: 27 NKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIF 86 V G+DK+T R F+ I + +FG+L I C R E + AF+ I + Sbjct: 56 GSVVVLRGLDKVTARTRDFEAPIGEEVRFGALSITVPYCRKRPPEEPPEVYAFLEIEDRR 115 Query: 87 TDR-----IVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 TD +FSGWMFA +PA+NA++H +YD+W++ C+ Sbjct: 116 TDGFGVQAEGELMFSGWMFASNPALNALEHPVYDVWVIDCR 156 >gi|114704669|ref|ZP_01437577.1| hypothetical protein FP2506_07031 [Fulvimarina pelagi HTCC2506] gi|114539454|gb|EAU42574.1| hypothetical protein FP2506_07031 [Fulvimarina pelagi HTCC2506] Length = 113 Score = 153 bits (388), Expect = 1e-35, Method: Composition-based stats. Identities = 53/102 (51%), Positives = 71/102 (69%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 R N VA F+G+DKITGR+ FDV I+++ QFG+L + P VC + + EA + DAFV Sbjct: 3 QQRLENPVAVFSGLDKITGRLTDFDVFIDETVQFGALQVTPRVCKTSAEGEATQTDAFVE 62 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD 123 + EI DR +R IFSGWMFA SP +NA++H +YD+WL CK Sbjct: 63 VDEITLDREIRRIFSGWMFAASPGLNAVEHPVYDVWLKSCKT 104 >gi|325292687|ref|YP_004278551.1| hypothetical protein AGROH133_05769 [Agrobacterium sp. H13-3] gi|325060540|gb|ADY64231.1| hypothetical protein AGROH133_05769 [Agrobacterium sp. H13-3] Length = 145 Score = 153 bits (387), Expect = 2e-35, Method: Composition-based stats. Identities = 64/123 (52%), Positives = 87/123 (70%), Gaps = 3/123 (2%) Query: 3 YRVLLLILFFVFSHA---KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLI 59 R L + LF S ++AR N+VA F+G+DKITGR+ +FDV I+++ QFG+L Sbjct: 10 LRALTVSLFAAVSAVILVSPVSAARLENRVAVFSGIDKITGRITSFDVYIDETVQFGALQ 69 Query: 60 IKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLM 119 + P VCYSRD EAQ+IDAF+ + EI DR ++ IF+GWMFADSP +NA++H IYD+WL Sbjct: 70 VTPKVCYSRDQTEAQKIDAFIEVDEITLDRKIKRIFTGWMFADSPGLNAVEHPIYDVWLT 129 Query: 120 QCK 122 CK Sbjct: 130 GCK 132 >gi|241204456|ref|YP_002975552.1| hypothetical protein Rleg_1727 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858346|gb|ACS56013.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 146 Score = 153 bits (386), Expect = 2e-35, Method: Composition-based stats. Identities = 58/99 (58%), Positives = 74/99 (74%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSIS 83 R N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRD EAQ+ID FV + Sbjct: 32 RIENPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDQSEAQKIDGFVEVD 91 Query: 84 EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 EI DR +R IF+GWMFA SP +NA++H IYD+WL CK Sbjct: 92 EITLDRKIRRIFTGWMFAASPGLNAVEHPIYDVWLKDCK 130 >gi|300022476|ref|YP_003755087.1| hypothetical protein Hden_0952 [Hyphomicrobium denitrificans ATCC 51888] gi|299524297|gb|ADJ22766.1| Protein of unknown function DUF2155 [Hyphomicrobium denitrificans ATCC 51888] Length = 215 Score = 153 bits (386), Expect = 2e-35, Method: Composition-based stats. Identities = 48/133 (36%), Positives = 76/133 (57%), Gaps = 1/133 (0%) Query: 13 VFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE 72 + A A + R N VA F+ +DK+T R+ F+VE+N++ +FG+L + P CYSR E Sbjct: 59 LLGPASPARADRIENGVAVFSALDKVTARISKFEVELNKTVEFGALRVTPRSCYSRPPTE 118 Query: 73 AQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNS 132 + FV + E D + IF+GWMFA+SP + ++H YD+WL C+ P S++ Sbjct: 119 EPKTTTFVEVDETQLDGTEKRIFTGWMFAESPGIYGLEHPTYDVWLTDCEKP-RRSVAEK 177 Query: 133 ESISKKALSEYSS 145 + +A SE + Sbjct: 178 KPAPAEAPSEGND 190 >gi|86357492|ref|YP_469384.1| hypothetical protein RHE_CH01866 [Rhizobium etli CFN 42] gi|86281594|gb|ABC90657.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 146 Score = 153 bits (386), Expect = 2e-35, Method: Composition-based stats. Identities = 61/116 (52%), Positives = 82/116 (70%), Gaps = 1/116 (0%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 +AR N VA F+G+DKITGR+ TFDV +N++ QFG+L + P CYSRD EAQ+ID FV Sbjct: 30 AARIDNPVAVFSGLDKITGRITTFDVYVNETVQFGALQVTPKACYSRDQAEAQKIDGFVE 89 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISK 137 + EI DR +R IF+GWMFADSP +NA++H IYD+WL CK +D + + + Sbjct: 90 VDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLKDCK-ATSDVPAPDSAKAP 144 >gi|298291312|ref|YP_003693251.1| hypothetical protein Snov_1322 [Starkeya novella DSM 506] gi|296927823|gb|ADH88632.1| Protein of unknown function DUF2155 [Starkeya novella DSM 506] Length = 233 Score = 152 bits (385), Expect = 2e-35, Method: Composition-based stats. Identities = 48/101 (47%), Positives = 68/101 (67%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + NK A F+G+DKITGR+++FDV +N++ QFG+L I P CY+R + E Q FV Sbjct: 120 EQKIENKTAVFSGLDKITGRIISFDVSVNETVQFGALRITPRACYTRPETEQQNTTGFVE 179 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 + EI D V+ +F GWMFA SP ++ ++H IYD+WL CK Sbjct: 180 VQEITLDGKVQPLFGGWMFASSPGLHGVEHPIYDVWLTDCK 220 >gi|146340455|ref|YP_001205503.1| putative signal peptide [Bradyrhizobium sp. ORS278] gi|146193261|emb|CAL77277.1| conserved hypothetical protein; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 321 Score = 152 bits (385), Expect = 2e-35, Method: Composition-based stats. Identities = 54/111 (48%), Positives = 74/111 (66%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++ FD EI ++ QFG+L +K CY+R EA DAFV Sbjct: 153 AQKIVNKKASFSGLDKITGRIINFDEEIGETVQFGALRVKTDACYTRPASEAANTDAFVQ 212 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNS 132 + EI V+ IFSGWMFA SP ++ ++H IYDIWL+ CK+P N S + Sbjct: 213 VDEITLQGEVKRIFSGWMFAASPGLHGVEHPIYDIWLVDCKEPQNTVASAA 263 >gi|114569813|ref|YP_756493.1| hypothetical protein Mmar10_1263 [Maricaulis maris MCS10] gi|114340275|gb|ABI65555.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 158 Score = 152 bits (385), Expect = 2e-35, Method: Composition-based stats. Identities = 38/101 (37%), Positives = 54/101 (53%), Gaps = 5/101 (4%) Query: 27 NKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIF 86 V G+DK+T R F+VEI + QFG+L I C R E AF+ I++ Sbjct: 57 GTVVVLRGLDKVTARTRDFEVEIGDTVQFGALSITAQYCRKRPPEETPETYAFLQINDRR 116 Query: 87 TDR-----IVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 TD +FSGWMFA PA N ++H +YD+W++ C+ Sbjct: 117 TDGFGVDVEGEQVFSGWMFASRPAQNPLEHPVYDVWVIDCR 157 >gi|159184714|ref|NP_354334.2| hypothetical protein Atu1328 [Agrobacterium tumefaciens str. C58] gi|159140002|gb|AAK87119.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58] Length = 145 Score = 152 bits (384), Expect = 3e-35, Method: Composition-based stats. Identities = 65/123 (52%), Positives = 86/123 (69%), Gaps = 3/123 (2%) Query: 3 YRVLLLILFFVFSHAKFAN---SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLI 59 R L + LF S + +AR N+VA F+G+DKITGR+ +FDV I+++ QFG+L Sbjct: 10 LRALTVSLFAAVSAVLIVSPVAAARLENRVAVFSGIDKITGRITSFDVYIDETVQFGALQ 69 Query: 60 IKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLM 119 + P VCYSRD E Q+IDAFV + EI DR +R IF+GWMFADSP +NA++H IYD+WL Sbjct: 70 VTPKVCYSRDQTETQKIDAFVEVDEITLDRKIRRIFTGWMFADSPGLNAVEHPIYDVWLT 129 Query: 120 QCK 122 CK Sbjct: 130 GCK 132 >gi|307947206|ref|ZP_07662541.1| putative signal peptide protein [Roseibium sp. TrichSKD4] gi|307770870|gb|EFO30096.1| putative signal peptide protein [Roseibium sp. TrichSKD4] Length = 180 Score = 152 bits (383), Expect = 3e-35, Method: Composition-based stats. Identities = 54/160 (33%), Positives = 86/160 (53%), Gaps = 10/160 (6%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 S + N VA F+G+DKITGR++ FDV I ++ QFG+L + P VCY+R E+ Sbjct: 28 QSQPQSQKIENPVAVFSGLDKITGRIINFDVYIGETVQFGALQVTPRVCYTRPQTESPLT 87 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESIS 136 F+ + EI + VR IFSGWM+A SP ++A++H +YDIWL CK + ++ Sbjct: 88 TGFIQVDEITLNNEVRRIFSGWMYAASPGLHAVEHGVYDIWLTNCK--------RTSTVP 139 Query: 137 KKALSEYSSTD-ITSQGSEKSSGSSSN-KTLEKESSQPLE 174 + + +TS+ + +G T+ +P + Sbjct: 140 PPEGYDGPPVEQVTSEDQDPLAGPDDGVDTILAPRPKPFQ 179 >gi|163759361|ref|ZP_02166447.1| hypothetical protein HPDFL43_06335 [Hoeflea phototrophica DFL-43] gi|162283765|gb|EDQ34050.1| hypothetical protein HPDFL43_06335 [Hoeflea phototrophica DFL-43] Length = 145 Score = 152 bits (383), Expect = 4e-35, Method: Composition-based stats. Identities = 57/105 (54%), Positives = 79/105 (75%) Query: 18 KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRID 77 A++AR N VA F+G+DKITGR+ TFDV + ++ QFG+L + P VCYSRD+ EA + Sbjct: 27 AQASAARIENPVAVFSGIDKITGRITTFDVYVGETVQFGALQVTPKVCYSRDESEAPKTT 86 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 FV + EI DR +R +F+GWMFADSP +NA+DH++YD+WL +CK Sbjct: 87 TFVEVDEITLDRKIRRLFTGWMFADSPGLNAVDHAVYDVWLKECK 131 >gi|71083428|ref|YP_266147.1| hypothetical protein SAR11_0725 [Candidatus Pelagibacter ubique HTCC1062] gi|71062541|gb|AAZ21544.1| conserved hypothetical protein [Candidatus Pelagibacter ubique HTCC1062] Length = 135 Score = 152 bits (383), Expect = 4e-35, Method: Composition-based stats. Identities = 34/109 (31%), Positives = 56/109 (51%), Gaps = 2/109 (1%) Query: 14 FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA 73 S AN K E +DK++ + ++I + +F SL+IK + C + + + Sbjct: 27 LSSPLIANENN-EGKFVEIKILDKVSSKTDLLKLKIGEELRFKSLLIKSLKCKNSEFDDN 85 Query: 74 QRIDAFVSISE-IFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 I ++ + + I D IF+GW F+ SPA+N DH +YDIWL +C Sbjct: 86 PEITVYIQVKDTIKNDNNEVFIFNGWTFSSSPAVNPFDHPVYDIWLTRC 134 >gi|254469527|ref|ZP_05082932.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211961362|gb|EEA96557.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 181 Score = 152 bits (383), Expect = 4e-35, Method: Composition-based stats. Identities = 60/159 (37%), Positives = 89/159 (55%), Gaps = 10/159 (6%) Query: 14 FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA 73 +S A + N VA F G+DKITGR+ TFDV I+++ QFG+L + P VC SR EA Sbjct: 26 YSLPAQAQT-PIHNPVAVFKGLDKITGRITTFDVYIDETVQFGALQVTPRVCNSRPLTEA 84 Query: 74 QRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSE 133 + AF+ + E+ D VR IFSGWMFA +P ++A++HS+YDIWL+ CK + Sbjct: 85 SQTTAFIEVDELTLDSKVRRIFSGWMFASNPGVHAVEHSVYDIWLINCK--------KTT 136 Query: 134 SISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQP 172 S+ + ++ S+ E +G + E +P Sbjct: 137 SVPPPEGYAGPAVELVSEEDE-LAGKDFVSSGEIPVPRP 174 >gi|85716172|ref|ZP_01047147.1| hypothetical protein NB311A_05700 [Nitrobacter sp. Nb-311A] gi|85697005|gb|EAQ34888.1| hypothetical protein NB311A_05700 [Nitrobacter sp. Nb-311A] Length = 306 Score = 151 bits (382), Expect = 5e-35, Method: Composition-based stats. Identities = 53/111 (47%), Positives = 73/111 (65%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A + + NK A F+G+DKITGR++ FD +I ++ QFG+L +K CY+R EA DAF Sbjct: 144 APAEKVINKKAVFSGLDKITGRIIHFDEDIGETVQFGALRVKTDACYTRPATEAANTDAF 203 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSIS 130 V + EI V+ IFSGWMFA SP ++ ++H IYD+WL CKDP I+ Sbjct: 204 VEVDEITLQGEVKRIFSGWMFAASPGLHGVEHPIYDVWLTDCKDPETTVIA 254 >gi|319404107|emb|CBI77697.1| conserved exported hypothetical protein [Bartonella rochalimae ATCC BAA-1498] Length = 140 Score = 151 bits (382), Expect = 6e-35, Method: Composition-based stats. Identities = 53/131 (40%), Positives = 77/131 (58%), Gaps = 9/131 (6%) Query: 1 MKYRVLLLILFF---------VFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQ 51 MK + FF V F + R +N++ F G+DKITG+V +F+V I Q Sbjct: 1 MKLLLSKFEYFFYTFLLGGIAVLFTVSFVQAERISNEIVIFTGLDKITGQVTSFEVHIGQ 60 Query: 52 SAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDH 111 Q+G+L + P VCY+ E R +F+ +SE+ ++ R IF+GWMFADSP +NA++H Sbjct: 61 VYQYGALQVIPRVCYTSSKNEPARTTSFIEVSEMTLEKKTRRIFTGWMFADSPGLNAVEH 120 Query: 112 SIYDIWLMQCK 122 IYD+WL CK Sbjct: 121 PIYDVWLKDCK 131 >gi|154253733|ref|YP_001414557.1| cellulase-like protein [Parvibaculum lavamentivorans DS-1] gi|154157683|gb|ABS64900.1| cellulase-like protein [Parvibaculum lavamentivorans DS-1] Length = 135 Score = 150 bits (380), Expect = 8e-35, Method: Composition-based stats. Identities = 45/119 (37%), Positives = 65/119 (54%), Gaps = 3/119 (2%) Query: 18 KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRID 77 A + VA F+G+DK T RV +F V++++ AQFGSL + C R E + Sbjct: 17 SAAPAFADKYPVAVFSGLDKTTARVTSFSVKVDEPAQFGSLEVLVRACDKRPPEEPPQTA 76 Query: 78 AFVSISEIFTDRIVR---SIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSE 133 AF+ I +I D IF GWMFA+SP +N ++H +YDIW+ CK + + SE Sbjct: 77 AFLEIRQIDRDDDSVQPAPIFEGWMFAESPGLNGLEHPVYDIWVTDCKTASGGASTGSE 135 >gi|312116102|ref|YP_004013698.1| hypothetical protein Rvan_3418 [Rhodomicrobium vannielii ATCC 17100] gi|311221231|gb|ADP72599.1| Protein of unknown function DUF2155 [Rhodomicrobium vannielii ATCC 17100] Length = 145 Score = 150 bits (380), Expect = 9e-35, Method: Composition-based stats. Identities = 46/132 (34%), Positives = 73/132 (55%) Query: 7 LLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCY 66 +L + + A++ R AN A FA +DK+TGRV ++ + ++ FG+L I P CY Sbjct: 13 VLAGLALVAPGTPASADRIANSTAVFAALDKVTGRVQPLEIPMGRTVTFGALTITPRACY 72 Query: 67 SRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 +R E AF+ + E+ D IF+GW FA+SP ++A++H +D+WL CK P Sbjct: 73 TRPSTETPLTSAFIEVDEVVLDGSSHRIFTGWTFAESPGLHAVEHPTFDVWLTSCKTPSA 132 Query: 127 DSISNSESISKK 138 D + S + K Sbjct: 133 DISAGRRSNAPK 144 >gi|83311840|ref|YP_422104.1| hypothetical protein amb2741 [Magnetospirillum magneticum AMB-1] gi|82946681|dbj|BAE51545.1| Uncharacterized protein [Magnetospirillum magneticum AMB-1] Length = 167 Score = 150 bits (380), Expect = 9e-35, Method: Composition-based stats. Identities = 35/100 (35%), Positives = 56/100 (56%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSI 82 A + A G+DK+T RV+T + + G+L I C R + AF+ I Sbjct: 63 ADLSFDTAVLQGLDKVTARVVTVEAPVGAPVHVGALEIIVRACKKRRPEDQPESAAFLDI 122 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 E+ D+ ++F GWMFA SPA++A++H +YDIW++ C+ Sbjct: 123 WELHKDQPASALFRGWMFASSPALSAMEHPVYDIWVLDCR 162 >gi|114799292|ref|YP_760947.1| hypothetical protein HNE_2252 [Hyphomonas neptunium ATCC 15444] gi|114739466|gb|ABI77591.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 184 Score = 150 bits (379), Expect = 1e-34, Method: Composition-based stats. Identities = 46/181 (25%), Positives = 67/181 (37%), Gaps = 20/181 (11%) Query: 1 MKY--RVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSL 58 MK R L L V + + S A +DKITGR V++ + +GSL Sbjct: 1 MKTAARFLALASLSVLAALPASASTMAQKNEATLRALDKITGRSTDIVVKVGEPVVYGSL 60 Query: 59 IIKPMVCYSRDDREAQRIDAFVSI-----------------SEIFTDRIVRSI-FSGWMF 100 + CY E AF+ I ++ I FSGWM+ Sbjct: 61 RVDLKACYQAPPEEVPESAAFLRIASTQPVAVETMEAAVAAKDVPPSEADSPILFSGWMY 120 Query: 101 ADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSS 160 A SP +NA++H +YDIW+++C P + I + Y SE Sbjct: 121 ASSPGLNALEHPVYDIWVIRCTAPDPVKLPERAIIPESEEPLYEDMPAGVTESETPPDED 180 Query: 161 S 161 Sbjct: 181 I 181 >gi|92117356|ref|YP_577085.1| hypothetical protein Nham_1811 [Nitrobacter hamburgensis X14] gi|91800250|gb|ABE62625.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 306 Score = 150 bits (379), Expect = 1e-34, Method: Composition-based stats. Identities = 50/109 (45%), Positives = 72/109 (66%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++ FD ++ ++ QFG+L +K CY+R EA DAFV Sbjct: 145 AQKVINKKAVFSGLDKITGRIIHFDEDVGETVQFGALRVKTDACYTRPATEAANTDAFVE 204 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSIS 130 + EI V+ IFSGWMFA SP ++ ++H +YD+WL CKDP I+ Sbjct: 205 VDEITLQGEVKRIFSGWMFAASPGLHGVEHPVYDVWLTDCKDPETTVIA 253 >gi|110634070|ref|YP_674278.1| cellulase-like protein [Mesorhizobium sp. BNC1] gi|110285054|gb|ABG63113.1| cellulase-like protein [Chelativorans sp. BNC1] Length = 141 Score = 150 bits (378), Expect = 1e-34, Method: Composition-based stats. Identities = 55/103 (53%), Positives = 78/103 (75%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A++ R N VAEF+G+DKITGR+ FDV ++++ QFG+L + P VCYS + E + DAF Sbjct: 28 AHAERIKNPVAEFSGIDKITGRITNFDVYMDETVQFGALQVTPRVCYSSPETEEPKTDAF 87 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 V ++EI DR +R IF+GWMFA+SP +NAI+H++YD+WL CK Sbjct: 88 VEVNEITLDRQIRRIFTGWMFAESPGVNAIEHAVYDVWLKSCK 130 >gi|49475388|ref|YP_033429.1| hypothetical protein BH05970 [Bartonella henselae str. Houston-1] gi|49238194|emb|CAF27404.1| hypothetical protein BH05970 [Bartonella henselae str. Houston-1] Length = 141 Score = 150 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 50/111 (45%), Positives = 70/111 (63%) Query: 14 FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA 73 S + R +N +A FAG+DKITGR F+V + + Q+G+L + P VCY+ E Sbjct: 24 LSSMDGVQAERVSNGIAVFAGLDKITGRTTRFEVSLGEVYQYGALQVTPRVCYTSSKDEP 83 Query: 74 QRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 R FV ++E+ D+ VR IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 84 TRTTGFVEVNEVTLDKKVRRIFTGWMFADSPGLNAVEHPIYDVWLKDCKQS 134 >gi|294084611|ref|YP_003551369.1| cellulase-like protein [Candidatus Puniceispirillum marinum IMCC1322] gi|292664184|gb|ADE39285.1| cellulase-like protein [Candidatus Puniceispirillum marinum IMCC1322] Length = 139 Score = 149 bits (377), Expect = 2e-34, Method: Composition-based stats. Identities = 37/121 (30%), Positives = 64/121 (52%), Gaps = 2/121 (1%) Query: 4 RVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPM 63 +++ + S+A + K+ G+DKIT R+ T I+ +FG+L + Sbjct: 17 KIIFAAMVLYVSYAMPVAAEWIDGKIVVLQGLDKITARITTLTTAIDTPLRFGTLQLTVN 76 Query: 64 VCYSRDDREAQRIDAFVSISEIFTDRI--VRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 C R E AF++I + D +S+F+GWMF+ SPA++A++H +YDI L+ C Sbjct: 77 RCAFRPPEEPPENVAFLTILDRGHDLSLAPKSVFTGWMFSSSPAVSAMEHPVYDITLLSC 136 Query: 122 K 122 + Sbjct: 137 R 137 >gi|163795175|ref|ZP_02189143.1| hypothetical protein BAL199_05889 [alpha proteobacterium BAL199] gi|159179573|gb|EDP64102.1| hypothetical protein BAL199_05889 [alpha proteobacterium BAL199] Length = 142 Score = 149 bits (377), Expect = 2e-34, Method: Composition-based stats. Identities = 43/130 (33%), Positives = 63/130 (48%), Gaps = 1/130 (0%) Query: 4 RVLLLILFFVFSHAKFANS-ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKP 62 +L + + A+ A+ VA+ G+DK+T R+ T V + S FG+L I Sbjct: 9 LLLCVAIALSPQDAQSADEPDWLPRPVAKLQGLDKVTARISTVTVPVGDSVVFGTLHITA 68 Query: 63 MVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 C A AF+ I + D R IF GWMFA SPA+N++DH +YD+W++ C Sbjct: 69 QTCQEHPPTLAPESAAFLIIEDQPPDEAPRRIFDGWMFASSPALNSVDHPVYDVWMLACS 128 Query: 123 DPINDSISNS 132 S S Sbjct: 129 SDSTAGQSPS 138 >gi|13470480|ref|NP_102049.1| cellulase-like protein [Mesorhizobium loti MAFF303099] gi|14021222|dbj|BAB47835.1| cellulase-like protein [Mesorhizobium loti MAFF303099] Length = 198 Score = 149 bits (376), Expect = 2e-34, Method: Composition-based stats. Identities = 59/151 (39%), Positives = 91/151 (60%), Gaps = 8/151 (5%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 S R N VAEFAG+DKITGR++TFDV I+++ QFG+L + P VCYSR E + D+FV Sbjct: 45 SDRITNPVAEFAGIDKITGRIITFDVYIDETVQFGALQVTPRVCYSRPQNEEPKTDSFVE 104 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALS 141 + EI DR +R IF+GWMFA+SP +NA++H++YD+WL +CK + + Sbjct: 105 VDEITLDRKIRRIFTGWMFAESPGLNAVEHAVYDVWLKECK--------QKSDVPAPDAT 156 Query: 142 EYSSTDITSQGSEKSSGSSSNKTLEKESSQP 172 + + + + +++ ++ QP Sbjct: 157 KADAPKADASKPVATKPAAAKPAASPDAEQP 187 >gi|319405550|emb|CBI79169.1| conserved exported hypothetical protein [Bartonella sp. AR 15-3] Length = 139 Score = 148 bits (375), Expect = 3e-34, Method: Composition-based stats. Identities = 51/124 (41%), Positives = 76/124 (61%) Query: 3 YRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKP 62 + LL+ V + R +N++ F G+DKITG+V +F+V I Q Q+G+L + P Sbjct: 12 FYTYLLVGIAVLFTVSCVQAERISNEIVIFTGLDKITGQVTSFEVHIGQVYQYGALQVIP 71 Query: 63 MVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 VCY+ EA R FV ++E+ ++ R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 72 RVCYTSSKNEAARTIGFVEVNEMTLEKKTRRIFTGWMFADSPGLNAVEHPIYDVWLKDCK 131 Query: 123 DPIN 126 + Sbjct: 132 KSSD 135 >gi|319898785|ref|YP_004158878.1| hypothetical protein BARCL_0615 [Bartonella clarridgeiae 73] gi|319402749|emb|CBI76296.1| conserved protein of unknown function [Bartonella clarridgeiae 73] Length = 147 Score = 148 bits (373), Expect = 6e-34, Method: Composition-based stats. Identities = 52/120 (43%), Positives = 79/120 (65%) Query: 3 YRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKP 62 + + L+ + S + R +N++ F+G+DKITGRV +F+V I Q Q+G+L I P Sbjct: 21 FYIYLVGGIAILSAVSRVLAERVSNEIGIFSGLDKITGRVTSFEVHIGQVYQYGALQIIP 80 Query: 63 MVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 VCY+ + E R +FV ++E+ ++ R IF+GWMFADSP +NA++HSIYD+WL CK Sbjct: 81 RVCYTSSENEPARTTSFVEVNEMTLEKKTRRIFTGWMFADSPGLNAVEHSIYDVWLKDCK 140 >gi|304391690|ref|ZP_07373632.1| putative signal peptide protein [Ahrensia sp. R2A130] gi|303295919|gb|EFL90277.1| putative signal peptide protein [Ahrensia sp. R2A130] Length = 142 Score = 147 bits (372), Expect = 7e-34, Method: Composition-based stats. Identities = 52/107 (48%), Positives = 72/107 (67%) Query: 16 HAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR 75 + +R ANKVA FAG+DKITGR+ TFDV ++++ +FG L + P CYS E + Sbjct: 23 FTEEEQISRIANKVAVFAGLDKITGRITTFDVYMDETVKFGQLELTPRACYSSSAAETPK 82 Query: 76 IDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 +F+ + EI DR +R IFSGWMFA+SP +NAI+H + D+WL CK Sbjct: 83 TTSFIEVDEITLDRRIRRIFSGWMFAESPGLNAIEHPVNDVWLKACK 129 >gi|27379396|ref|NP_770925.1| hypothetical protein blr4285 [Bradyrhizobium japonicum USDA 110] gi|27352547|dbj|BAC49550.1| blr4285 [Bradyrhizobium japonicum USDA 110] Length = 360 Score = 147 bits (372), Expect = 8e-34, Method: Composition-based stats. Identities = 50/103 (48%), Positives = 70/103 (67%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + NK A F+G+DKITGR++ FD +I ++ QFG+L +K CY+R EA DAFV Sbjct: 188 AQKIVNKKATFSGLDKITGRIINFDEDIGETVQFGALRVKTDACYTRPATEAANTDAFVE 247 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 + EI V+ IFSGWM+A SP ++ ++H IYDIWL CK+P Sbjct: 248 VDEITLQGEVKRIFSGWMYAASPGLHGVEHPIYDIWLTDCKEP 290 >gi|260460719|ref|ZP_05808969.1| cellulase-like protein [Mesorhizobium opportunistum WSM2075] gi|259033296|gb|EEW34557.1| cellulase-like protein [Mesorhizobium opportunistum WSM2075] Length = 197 Score = 147 bits (372), Expect = 8e-34, Method: Composition-based stats. Identities = 64/157 (40%), Positives = 91/157 (57%), Gaps = 12/157 (7%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 S R N VAEFAG+DKITGR++TFDV I+++ QFG+L + P VCYSR E + D+FV Sbjct: 42 SDRITNPVAEFAGIDKITGRIITFDVYIDETVQFGALQVTPRVCYSRPQNEEPKTDSFVE 101 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD----PINDSISNSESISK 137 + EI DR +R IF+GWMFA+SP +NA++H++YD+WL CK P D+ + + Sbjct: 102 VDEITLDRKIRRIFTGWMFAESPGLNAVEHAVYDVWLKACKQKSDVPAPDATKPDATKAD 161 Query: 138 KALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLE 174 + + +E S + E P E Sbjct: 162 ASEPAVAKPAAAKPNAEVSP--------DVEQPDPTE 190 >gi|319407121|emb|CBI80758.1| conserved hypothetical protein [Bartonella sp. 1-1C] Length = 121 Score = 147 bits (371), Expect = 1e-33, Method: Composition-based stats. Identities = 49/110 (44%), Positives = 72/110 (65%) Query: 13 VFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE 72 V F + R +N++ F G+DKITG+V +F+V I Q Q+G+L + P VCY+ E Sbjct: 3 VLFTVSFLQAERISNEIVIFTGLDKITGQVTSFEVHIGQVYQYGALQVIPRVCYTSSKNE 62 Query: 73 AQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 R +F+ +SE+ ++ R IF+GWMFADSP +NA++H IYD+WL CK Sbjct: 63 PARTTSFIEVSEMTLEKKTRRIFTGWMFADSPGLNAVEHPIYDVWLKDCK 112 >gi|154247423|ref|YP_001418381.1| hypothetical protein Xaut_3495 [Xanthobacter autotrophicus Py2] gi|154161508|gb|ABS68724.1| conserved hypothetical protein [Xanthobacter autotrophicus Py2] Length = 228 Score = 147 bits (370), Expect = 1e-33, Method: Composition-based stats. Identities = 47/101 (46%), Positives = 69/101 (68%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + + AN A +AG+DKITGR+ FDV I ++AQFG+L + P VCY+R E Q +F Sbjct: 117 TQKIANAFAVYAGLDKITGRITAFDVAIGETAQFGALQVTPRVCYTRPATETQNTTSFTE 176 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 ++E+ + IF+GWMFA SP ++A++H IYD+WL+ CK Sbjct: 177 VNEVTLQGQAKRIFTGWMFASSPGLHAVEHPIYDVWLIGCK 217 >gi|49474317|ref|YP_032359.1| hypothetical protein BQ07260 [Bartonella quintana str. Toulouse] gi|49239821|emb|CAF26212.1| hypothetical protein BQ07260 [Bartonella quintana str. Toulouse] Length = 141 Score = 147 bits (370), Expect = 1e-33, Method: Composition-based stats. Identities = 49/132 (37%), Positives = 75/132 (56%), Gaps = 10/132 (7%) Query: 1 MKYRVL----------LLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEIN 50 M + +L + + + S + R +N + FAG+DKITGR F+V + Sbjct: 1 MNFFLLPGLRRIFCACFMGIVVLLSSMGGVPAERVSNAIVVFAGLDKITGRTTLFEVSLG 60 Query: 51 QSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAID 110 + Q+G+L + P VCY+ E FV ++EI ++ VR IF+GWMFADSP +NA++ Sbjct: 61 EVYQYGALQVTPRVCYTGSKDEPTHTTGFVEVNEITLEKKVRRIFTGWMFADSPGLNAVE 120 Query: 111 HSIYDIWLMQCK 122 H +YD+WL CK Sbjct: 121 HPVYDVWLKDCK 132 >gi|288958846|ref|YP_003449187.1| hypothetical protein AZL_020050 [Azospirillum sp. B510] gi|288911154|dbj|BAI72643.1| hypothetical protein AZL_020050 [Azospirillum sp. B510] Length = 134 Score = 146 bits (369), Expect = 2e-33, Method: Composition-based stats. Identities = 31/97 (31%), Positives = 51/97 (52%) Query: 25 FANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISE 84 A+ +DK+T R TF + + ++ SL I C E AF+ ++E Sbjct: 36 IERPAAKLQWLDKVTARTSTFTMRVGETKAMSSLRITLRACRENPPIETPESAAFLEVTE 95 Query: 85 IFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 I +FSGWMF+ SPA++A+++ IYD+W++ C Sbjct: 96 IKPGEQAEQVFSGWMFSSSPALSAMENPIYDVWVLGC 132 >gi|182678917|ref|YP_001833063.1| hypothetical protein Bind_1951 [Beijerinckia indica subsp. indica ATCC 9039] gi|182634800|gb|ACB95574.1| conserved hypothetical protein [Beijerinckia indica subsp. indica ATCC 9039] Length = 247 Score = 146 bits (368), Expect = 2e-33, Method: Composition-based stats. Identities = 51/137 (37%), Positives = 76/137 (55%), Gaps = 9/137 (6%) Query: 19 FANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDA 78 FA + R + +A F+G+DKITGR+++F+V +++ QFGSL I CY+R E + Sbjct: 29 FARADRIKHPIAVFSGLDKITGRIISFEVATDETVQFGSLQITERACYTRPSTETPQTIT 88 Query: 79 FVSISEI---FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD-----PINDSIS 130 FV + EI + + IF+GWMFA SP ++A++H +YDIWL CK P D + Sbjct: 89 FVEVDEIDAADKTKTPKQIFAGWMFAASPGLHALEHPVYDIWLNDCKGGKEVLPSPD-TA 147 Query: 131 NSESISKKALSEYSSTD 147 + E S D Sbjct: 148 AGLPATPDNAKEASDID 164 >gi|319783248|ref|YP_004142724.1| hypothetical protein Mesci_3554 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317169136|gb|ADV12674.1| Protein of unknown function DUF2155 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 196 Score = 145 bits (367), Expect = 3e-33, Method: Composition-based stats. Identities = 57/104 (54%), Positives = 78/104 (75%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSI 82 R N VAEFAG+DKITGR++TFDV I+++ QFG+L + P VCYSR EA + D+FV + Sbjct: 42 DRVTNAVAEFAGIDKITGRIITFDVYIDETVQFGALQVTPRVCYSRPQAEAPKTDSFVEV 101 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 EI DR +R IF+GWMFA+SP +NA++H++YD+WL CK + Sbjct: 102 DEITLDRKIRRIFTGWMFAESPGLNAVEHAVYDVWLKACKQKSD 145 >gi|319408372|emb|CBI82025.1| conserved hypothetical protein [Bartonella schoenbuchensis R1] Length = 118 Score = 145 bits (365), Expect = 4e-33, Method: Composition-based stats. Identities = 51/109 (46%), Positives = 71/109 (65%) Query: 14 FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA 73 S + R +N V FAG+DKITGR + F+V I + Q+G+L + P VCY+ + E Sbjct: 1 MSSVNGVQAERVSNAVVVFAGLDKITGRTIRFEVSIGEVYQYGALRVTPRVCYTSSEGEP 60 Query: 74 QRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 R + FV + EI ++ +R IF+GWMFADSP +NA++H IYDIWL CK Sbjct: 61 TRTNGFVEVDEITLNKEMRRIFTGWMFADSPGLNAVEHPIYDIWLKDCK 109 >gi|121602347|ref|YP_988869.1| hypothetical protein BARBAKC583_0556 [Bartonella bacilliformis KC583] gi|120614524|gb|ABM45125.1| conserved hypothetical protein [Bartonella bacilliformis KC583] Length = 132 Score = 143 bits (362), Expect = 1e-32, Method: Composition-based stats. Identities = 48/118 (40%), Positives = 76/118 (64%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 V L+ + V + R +N +A F+G+DKITGR F+V I++ QFG+L + P + Sbjct: 15 VFLMGMVGVLLWVGNMQAKRVSNTIAVFSGLDKITGRTTRFEVPIDRVYQFGALQVTPRI 74 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 CY+ + E R +F+ ++E+ D+ + IF+GW+FADSP +NA++H IYD+WL CK Sbjct: 75 CYTSSEDEPARPASFIEVNEVTLDKKTQRIFTGWIFADSPGLNAVEHPIYDVWLKDCK 132 >gi|83593772|ref|YP_427524.1| cellulase-like protein [Rhodospirillum rubrum ATCC 11170] gi|83576686|gb|ABC23237.1| cellulase-like protein [Rhodospirillum rubrum ATCC 11170] Length = 163 Score = 143 bits (361), Expect = 1e-32, Method: Composition-based stats. Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 2/143 (1%) Query: 10 LFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRD 69 L + + A + A +DK T RV + + + GSL I C R Sbjct: 19 LCLMLAATAPAGAEDINADTARLGWLDKTTARVGESSIAVGGDLRLGSLTITVRSCVRRV 78 Query: 70 DREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSI 129 + AF+ I E + IF GWMFA SP+++A+DH++YD+W+++C+ P + Sbjct: 79 PPDDPESAAFLDIVERAEGVAAKQIFEGWMFASSPSLSAMDHAVYDVWVLRCEIPADRDA 138 Query: 130 SNSESISKKALSEYSSTDITSQG 152 +S ++ E + + G Sbjct: 139 --GDSGKPESAPEAAPIPVDPGG 159 >gi|296447987|ref|ZP_06889893.1| Protein of unknown function DUF2155 [Methylosinus trichosporium OB3b] gi|296254497|gb|EFH01618.1| Protein of unknown function DUF2155 [Methylosinus trichosporium OB3b] Length = 236 Score = 138 bits (349), Expect = 3e-31, Method: Composition-based stats. Identities = 51/153 (33%), Positives = 83/153 (54%), Gaps = 7/153 (4%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSI 82 + A FAG+DK TGR++ FDV I+++ QFGSL I P VC +R EA + +FV + Sbjct: 31 DPIRHPTAVFAGLDKTTGRIINFDVAIDETVQFGSLQITPRVCNTRPQTEAPQTTSFVEV 90 Query: 83 SEIFT-DRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALS 141 + + IFSGWMFA SP ++ ++HS+YD+WL CK I + + ++ A + Sbjct: 91 DDQDPAKNEAKRIFSGWMFAASPGLHGVEHSVYDVWLTDCKG--GKEIVQAPASAEPAAA 148 Query: 142 EYSSTDITSQGSEKSSGSSSNKTLEKESSQPLE 174 + ++ ++S +E + P+E Sbjct: 149 DPAAATPAPVEKKRSRSR----KVEPVAPTPIE 177 >gi|296534672|ref|ZP_06897071.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] gi|296264997|gb|EFH11223.1| conserved hypothetical protein [Roseomonas cervicalis ATCC 49957] Length = 124 Score = 137 bits (345), Expect = 9e-31, Method: Composition-based stats. Identities = 29/104 (27%), Positives = 56/104 (53%) Query: 19 FANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDA 78 + A + A+ +DK+T RV + +NQ QFG+L + C +R E A Sbjct: 21 QQDPGWVAARTAKLQALDKVTARVTVLETPVNQPIQFGTLRVTVRACNARPPEEVPDAAA 80 Query: 79 FVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 ++ + + D + F GWMFA++P ++ ++H +YD+ +++C+ Sbjct: 81 WLEVLDTRNDPNGAAAFRGWMFANAPGVSMLEHPVYDLRILECR 124 >gi|254459519|ref|ZP_05072935.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206676108|gb|EDZ40595.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 120 Score = 136 bits (344), Expect = 1e-30, Method: Composition-based stats. Identities = 38/114 (33%), Positives = 56/114 (49%), Gaps = 3/114 (2%) Query: 11 FFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 F+ + A A A + AE +DK++G+ +D+ + G L I C R Sbjct: 10 LFLCASAVHAQQAVSSGTGAELRVLDKVSGQSSNYDLASGSKMEIGQLTIALRAC--RYP 67 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 A DAF I E+ IF GWM A SPA+NA++H YD+W+++CK Sbjct: 68 EGAPANDAFAYI-EVNETESATGIFGGWMIASSPALNAMEHPRYDVWVLRCKTS 120 >gi|323137691|ref|ZP_08072767.1| Protein of unknown function DUF2155 [Methylocystis sp. ATCC 49242] gi|322396988|gb|EFX99513.1| Protein of unknown function DUF2155 [Methylocystis sp. ATCC 49242] Length = 256 Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats. Identities = 50/182 (27%), Positives = 92/182 (50%), Gaps = 24/182 (13%) Query: 14 FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA 73 F+ + A + + A FAG+DK TGR++ FDV I+++ QFG+L + P VC +R E Sbjct: 16 FALSGVAVAEPIRHPTATFAGLDKTTGRIINFDVAIDETVQFGALQVTPRVCNTRPQTET 75 Query: 74 QRIDAFVSISEI-------------------FTDRIVRSIFSGWMFADSPAMNAIDHSIY 114 + +FV + E+ + + IFSGWMFA SP ++ ++H +Y Sbjct: 76 PQTTSFVEVDELILKPERQGRPEAKPEQAKTDGKQEAKRIFSGWMFAASPGLHGVEHPVY 135 Query: 115 DIWLMQCKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLE 174 D+WL+ CK + + + + ++ + + ++ G ++ S + +E + P+E Sbjct: 136 DVWLVDCKGGKESAPAPAAAAAEPSAAPDAAAPAAETGKKRRS-----RKVEPAAPAPVE 190 Query: 175 NN 176 N Sbjct: 191 NQ 192 >gi|260575570|ref|ZP_05843568.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259022213|gb|EEW25511.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 145 Score = 133 bits (335), Expect = 1e-29, Method: Composition-based stats. Identities = 42/150 (28%), Positives = 69/150 (46%), Gaps = 8/150 (5%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 MK +LL +L + A +DK++G ++ QSA G L I Sbjct: 1 MKRLLLLAVL-----ASPAAAQEVADAPGGILRWLDKVSGETADIELSRGQSAVSGRLTI 55 Query: 61 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 + C R + +AF ++ I D++ +FSGWM A SPA++A+DH YD+W+++ Sbjct: 56 QLDAC--RYPVDNPASNAFAHLT-ITEDKVATPVFSGWMVAASPALSALDHRRYDVWVLR 112 Query: 121 CKDPINDSISNSESISKKALSEYSSTDITS 150 C P D I E + +E + + Sbjct: 113 CITPTTDQIEVPEDAPVEDAAEPPALPEDA 142 >gi|83949686|ref|ZP_00958419.1| hypothetical protein ISM_01290 [Roseovarius nubinhibens ISM] gi|83837585|gb|EAP76881.1| hypothetical protein ISM_01290 [Roseovarius nubinhibens ISM] Length = 129 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 3/107 (2%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 A A + A +DK+ G V ++ SA+FG L I C + A Sbjct: 24 VSAAQEAASLGQGAILRALDKVNGSVTDLELGNASSARFGRLTINLGECRFPEGDPAGDA 83 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD 123 AF++I E D + FSGWM A SPA++A+DH YD+W+M+CK Sbjct: 84 YAFLTIQE---DGQTQPQFSGWMIASSPALSALDHPRYDVWVMRCKT 127 >gi|114767025|ref|ZP_01445933.1| hypothetical protein 1100011001191_R2601_18438 [Pelagibaca bermudensis HTCC2601] gi|114540809|gb|EAU43873.1| hypothetical protein R2601_18438 [Roseovarius sp. HTCC2601] Length = 119 Score = 130 bits (328), Expect = 9e-29, Method: Composition-based stats. Identities = 33/104 (31%), Positives = 50/104 (48%), Gaps = 3/104 (2%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFV 80 A A G+DK+TGR ++ ++AQF I C + A AF+ Sbjct: 19 QEATNTGTGAVLRGLDKLTGRAYDIEMRAGETAQFARTEISLQECRYPEGDPAGDAFAFL 78 Query: 81 SISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 ++ E +F GWM A SPA+NA+DH YD+W+++C Sbjct: 79 TVREA---GNAEPVFRGWMIASSPALNAMDHQRYDVWVLRCTTS 119 >gi|85704727|ref|ZP_01035828.1| hypothetical protein ROS217_06595 [Roseovarius sp. 217] gi|85670545|gb|EAQ25405.1| hypothetical protein ROS217_06595 [Roseovarius sp. 217] Length = 119 Score = 130 bits (327), Expect = 1e-28, Method: Composition-based stats. Identities = 33/102 (32%), Positives = 53/102 (51%), Gaps = 3/102 (2%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFV 80 + A G+DKI G+ D+ + FG+L ++ C + A A++ Sbjct: 20 QEVATVAQGAILRGLDKINGQASDLDLANGEMGAFGTLDVELGECRYPEGNPAGDSYAYL 79 Query: 81 SISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 +I E +FSGWM A SPA+NA++H+ YDIW+++CK Sbjct: 80 TIRE---QNGGAVVFSGWMLASSPALNALEHARYDIWVLRCK 118 >gi|149201009|ref|ZP_01877984.1| hypothetical protein RTM1035_15327 [Roseovarius sp. TM1035] gi|149145342|gb|EDM33368.1| hypothetical protein RTM1035_15327 [Roseovarius sp. TM1035] Length = 111 Score = 129 bits (325), Expect = 2e-28, Method: Composition-based stats. Identities = 39/103 (37%), Positives = 53/103 (51%), Gaps = 3/103 (2%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A + A G+DKI G ++ QS FGSL + C D A A+ Sbjct: 11 AQEVAAVAQGALLRGLDKINGSAQDLELANGQSGVFGSLDVVLGECRYPQDDPAADAYAY 70 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 ++ISE IFSGWM A SPA+NA++H YDIW+++CK Sbjct: 71 LTISEQAGG---AVIFSGWMLASSPALNALEHPRYDIWVLRCK 110 >gi|295689564|ref|YP_003593257.1| hypothetical protein Cseg_2176 [Caulobacter segnis ATCC 21756] gi|295431467|gb|ADG10639.1| Protein of unknown function DUF2155 [Caulobacter segnis ATCC 21756] Length = 221 Score = 129 bits (325), Expect = 2e-28, Method: Composition-based stats. Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 7/105 (6%) Query: 29 VAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE-AQRIDAFVSISEIFT 87 +A +DK+T + F+V + Q ++ +L+ C + E A A+V + Sbjct: 110 IAILQALDKVTTETMRFEVPVGQPIRYKTLVFTVRACETAAADEIAPETTAYVIVDTQPK 169 Query: 88 DRIVRS------IFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 + R I+ GWM+A SP +N ++H +YD WL+ CK I Sbjct: 170 AQAGRPAPPGRQIYKGWMYASSPGLNPLEHPVYDAWLIACKQSIP 214 >gi|260426797|ref|ZP_05780776.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260421289|gb|EEX14540.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 119 Score = 129 bits (324), Expect = 3e-28, Method: Composition-based stats. Identities = 38/115 (33%), Positives = 58/115 (50%), Gaps = 3/115 (2%) Query: 7 LLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCY 66 L + + + A A A + A G+DK+TGR ++ Q+AQFG + I C Sbjct: 5 LALSLCLSATALSAQEATSSGSGAVLRGLDKLTGRASDIELGTGQTAQFGRIEISLAECR 64 Query: 67 SRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +A AF+++ E F GWM A SPA+NA+DH YD+W+++C Sbjct: 65 YPVGNQAGDAFAFLTVREAGH---PDPAFRGWMVASSPALNAMDHQRYDVWVLRC 116 >gi|86137322|ref|ZP_01055899.1| hypothetical protein MED193_05669 [Roseobacter sp. MED193] gi|85825657|gb|EAQ45855.1| hypothetical protein MED193_05669 [Roseobacter sp. MED193] Length = 133 Score = 128 bits (323), Expect = 3e-28, Method: Composition-based stats. Identities = 30/102 (29%), Positives = 50/102 (49%), Gaps = 3/102 (2%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEI 85 A G+DK+ G +V+I SA+ LI+ C A+++I + Sbjct: 33 TGDSAVLRGLDKVNGHHTDIEVQIGGSAEIYGLIVTLTECRYPAANPTGDAYAYLTIRDP 92 Query: 86 FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIND 127 F GWM A SPA++A+DH+ YD+W+++CK + + Sbjct: 93 L---NGEVFFDGWMIASSPALSALDHARYDVWVIRCKSSVGE 131 >gi|254486361|ref|ZP_05099566.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043230|gb|EEB83868.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 117 Score = 128 bits (321), Expect = 6e-28, Method: Composition-based stats. Identities = 38/118 (32%), Positives = 57/118 (48%), Gaps = 4/118 (3%) Query: 7 LLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCY 66 LLIL V S A A + +DK++G + Q+A G+L + C Sbjct: 4 LLILLAVLSSPLHAEEAT-SAPGGVLRALDKVSGAAQDIVMFRGQTAHVGNLDVLMTDC- 61 Query: 67 SRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 R + DA+V + T R + +FSGWM A SPA++A++H YDIW ++C Sbjct: 62 -RFPKGNPAGDAYVELEIKTTGRDDK-LFSGWMIASSPALSALEHPRYDIWAIRCTTS 117 >gi|304570805|ref|YP_002517336.2| hypothetical protein CCNA_01963 [Caulobacter crescentus NA1000] Length = 194 Score = 128 bits (321), Expect = 7e-28, Method: Composition-based stats. Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 7/112 (6%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE-AQRIDAFV 80 + R VA +DK+T + F+V I Q ++ +LI C + E A A+V Sbjct: 76 AKRARYSVAILQALDKVTTETMRFEVPIGQPIRYKTLIFTVRACETAAADEVAPESAAYV 135 Query: 81 SISEIFTDR------IVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 + + R I+ GWM+A SP +N + H +YD WL+ CK I Sbjct: 136 VVDTQPKAQAGRAAPPGRQIYKGWMYASSPGLNPLQHPVYDAWLIACKQSIP 187 >gi|16126129|ref|NP_420693.1| hypothetical protein CC_1886 [Caulobacter crescentus CB15] gi|13423333|gb|AAK23861.1| hypothetical protein CC_1886 [Caulobacter crescentus CB15] Length = 223 Score = 128 bits (321), Expect = 7e-28, Method: Composition-based stats. Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 7/112 (6%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE-AQRIDAFV 80 + R VA +DK+T + F+V I Q ++ +LI C + E A A+V Sbjct: 105 AKRARYSVAILQALDKVTTETMRFEVPIGQPIRYKTLIFTVRACETAAADEVAPESAAYV 164 Query: 81 SISEIFTDR------IVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 + + R I+ GWM+A SP +N + H +YD WL+ CK I Sbjct: 165 VVDTQPKAQAGRAAPPGRQIYKGWMYASSPGLNPLQHPVYDAWLIACKQSIP 216 >gi|259416815|ref|ZP_05740735.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348254|gb|EEW60031.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 145 Score = 127 bits (319), Expect = 9e-28, Method: Composition-based stats. Identities = 31/99 (31%), Positives = 51/99 (51%), Gaps = 7/99 (7%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ--RIDAFVSIS 83 +A G+DK+ G+ DV+ S + LI+ C R E A+++I Sbjct: 50 KGSIASLRGLDKVNGKSTDVDVQTGGSVEVFGLIVTMREC--RYPTENPSGDAFAYLTIR 107 Query: 84 EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 + + + F GWM A SPA++A+DH YD+W+++CK Sbjct: 108 D---RQDGKVFFDGWMIASSPALSALDHRRYDVWVLRCK 143 >gi|255263053|ref|ZP_05342395.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255105388|gb|EET48062.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 119 Score = 127 bits (319), Expect = 1e-27, Method: Composition-based stats. Identities = 36/122 (29%), Positives = 57/122 (46%), Gaps = 4/122 (3%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 M V +IL + A A A +DKITGRV +++ +A+ G L + Sbjct: 1 MARFVSAMILLCALAAPVGAQQVT-AGSGAMLRILDKITGRVADVELDTGGTARQGRLSV 59 Query: 61 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 C + A +++ E +F GWM A +PA+NA++H YD+W+M+ Sbjct: 60 TLAECRYPSGNRSGNAYALLTVIEA---GTPDPVFRGWMIASAPALNAMEHPRYDVWVMR 116 Query: 121 CK 122 CK Sbjct: 117 CK 118 >gi|149916369|ref|ZP_01904889.1| hypothetical protein RAZWK3B_12282 [Roseobacter sp. AzwK-3b] gi|149809823|gb|EDM69675.1| hypothetical protein RAZWK3B_12282 [Roseobacter sp. AzwK-3b] Length = 121 Score = 127 bits (319), Expect = 1e-27, Method: Composition-based stats. Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 3/106 (2%) Query: 18 KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRID 77 + A G+DKI G + + +SA+ G+L + C A Sbjct: 18 AASAQDVEIGTGAALRGLDKINGDTVDMMLATGESAELGNLEVTLGECRYPAGDAASDAF 77 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD 123 A++++ + + +F GWM A SPA+NA++H YD+W+++C+ Sbjct: 78 AYITVRDPRIG---QPVFEGWMIASSPALNAMEHQRYDVWVLRCRT 120 >gi|220964072|gb|ACL95428.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 112 Score = 126 bits (318), Expect = 2e-27, Method: Composition-based stats. Identities = 32/104 (30%), Positives = 49/104 (47%), Gaps = 7/104 (6%) Query: 30 AEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE-AQRIDAFVSISEIFTD 88 A +DK+T + F+V I Q ++ +LI C + E A A+V + Sbjct: 2 AILQALDKVTTETMRFEVPIGQPIRYKTLIFTVRACETAAADEVAPESAAYVVVDTQPKA 61 Query: 89 R------IVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 + R I+ GWM+A SP +N + H +YD WL+ CK I Sbjct: 62 QAGRAAPPGRQIYKGWMYASSPGLNPLQHPVYDAWLIACKQSIP 105 >gi|126732550|ref|ZP_01748348.1| hypothetical protein SSE37_10642 [Sagittula stellata E-37] gi|126706996|gb|EBA06064.1| hypothetical protein SSE37_10642 [Sagittula stellata E-37] Length = 124 Score = 126 bits (317), Expect = 2e-27, Method: Composition-based stats. Identities = 29/105 (27%), Positives = 48/105 (45%), Gaps = 3/105 (2%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 A A + A G+DK+ +V F + ++ G L + C Sbjct: 20 AAQAQEEVNSGTGAVLRGLDKLNAKVADFTLSNGENHVMGLLEVVLRECRYPVGDPTGNA 79 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 AF++I E + + +F GWM A SPA+ +DH YD+W+++C Sbjct: 80 YAFLTIREA---GVAQPVFEGWMVASSPALYPLDHPRYDVWVLRC 121 >gi|329850493|ref|ZP_08265338.1| hypothetical protein ABI_34000 [Asticcacaulis biprosthecum C19] gi|328840808|gb|EGF90379.1| hypothetical protein ABI_34000 [Asticcacaulis biprosthecum C19] Length = 233 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 8/110 (7%) Query: 29 VAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ-RIDAFVSIS---- 83 A +DK+ + F+ I Q +F LI C D EAQ + A++++ Sbjct: 124 TAIIEALDKVNAESVRFEAPIGQPVRFKGLIYLVKACEMTADDEAQNDVMAYMTVRTNPV 183 Query: 84 ---EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSIS 130 + IF GW F+ SP++N + H IYD W++ C+ P+ + S Sbjct: 184 AATNTSIGSKSKQIFQGWSFSSSPSLNPMQHPIYDAWVIGCRKPLGGTTS 233 >gi|83943116|ref|ZP_00955576.1| hypothetical protein EE36_13083 [Sulfitobacter sp. EE-36] gi|83846124|gb|EAP84001.1| hypothetical protein EE36_13083 [Sulfitobacter sp. EE-36] Length = 117 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 56/118 (47%), Gaps = 4/118 (3%) Query: 7 LLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCY 66 +L++ + + A A +DK++G ++ ++A+ G+L + C Sbjct: 4 ILMMLAIMAAPVQAQQVASAE-GGVLRALDKVSGVSRDVEMRRGETARVGNLNVTMNEC- 61 Query: 67 SRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 R DA+ + EI +F+GWM A +PA++A++H YDIW+++C Sbjct: 62 -RYPSGNPAGDAYAEL-EIVETGDENRLFAGWMIASAPALSALEHPRYDIWVIRCTTS 117 >gi|99081361|ref|YP_613515.1| hypothetical protein TM1040_1520 [Ruegeria sp. TM1040] gi|99037641|gb|ABF64253.1| hypothetical protein TM1040_1520 [Ruegeria sp. TM1040] Length = 145 Score = 126 bits (316), Expect = 3e-27, Method: Composition-based stats. Identities = 30/99 (30%), Positives = 52/99 (52%), Gaps = 7/99 (7%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ--RIDAFVSIS 83 + G+DK+ G+ + +V+ +A+ LI+ C R E A+++I Sbjct: 50 EGTLTSLRGLDKVNGKSVDVEVQTGGTAEIFGLIVTLREC--RYPTENPSGDAFAYLTIR 107 Query: 84 EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 + + + F GWM A SPA+NA+DH YD+W+++CK Sbjct: 108 D---RQDGKVFFDGWMIASSPALNALDHRRYDVWVLRCK 143 >gi|254475589|ref|ZP_05088975.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214029832|gb|EEB70667.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 122 Score = 125 bits (315), Expect = 3e-27, Method: Composition-based stats. Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 3/101 (2%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFV 80 + A G+DK+ G+ +V + SA+ +I+ M C D A++ Sbjct: 22 QGDAVQGQSAILRGLDKVNGQTQDLEVPVGSSAEIFGVIVNVMDCRYPADNPTGDAFAYL 81 Query: 81 SISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 ++ + F GWM A SPA+NA+DHS YDIW+M+C Sbjct: 82 TVRD---PNDGTVFFDGWMIASSPALNALDHSRYDIWVMRC 119 >gi|163741338|ref|ZP_02148730.1| hypothetical protein RG210_17800 [Phaeobacter gallaeciensis 2.10] gi|161385691|gb|EDQ10068.1| hypothetical protein RG210_17800 [Phaeobacter gallaeciensis 2.10] Length = 120 Score = 125 bits (315), Expect = 3e-27, Method: Composition-based stats. Identities = 31/105 (29%), Positives = 49/105 (46%), Gaps = 3/105 (2%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 A A A G+DK+ G+ ++ + SA+ +I+ C D Sbjct: 16 TASAQGAAENGSSAVLRGLDKVNGQTQDLEIPVGGSAEIFGVIVSLRECRYPADNPTGDA 75 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 A++++ F GWM A SPA+NA+DHS YD+W+M+C Sbjct: 76 YAYLTVR---NPNDATVYFDGWMIASSPALNALDHSRYDVWVMRC 117 >gi|167646524|ref|YP_001684187.1| hypothetical protein Caul_2562 [Caulobacter sp. K31] gi|167348954|gb|ABZ71689.1| conserved hypothetical protein [Caulobacter sp. K31] Length = 227 Score = 125 bits (315), Expect = 3e-27, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 49/111 (44%), Gaps = 7/111 (6%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA-QRIDAFVS 81 R + VA +DK+T + F+ + Q ++ +L+ C + E A+V Sbjct: 110 KRSRSSVAIIQALDKVTTETMRFEAPVGQPIRYKTLVFTVRACETTTPDEDAPDSVAYVV 169 Query: 82 ISEIFTD------RIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 + R I+ GWM+A+SP +N + H +YD WL+ CK Sbjct: 170 VDTQPKALPGRVAPPGRQIYKGWMYANSPGLNPLQHPVYDAWLIACKTSAP 220 >gi|83954273|ref|ZP_00962993.1| hypothetical protein NAS141_18244 [Sulfitobacter sp. NAS-14.1] gi|83841310|gb|EAP80480.1| hypothetical protein NAS141_18244 [Sulfitobacter sp. NAS-14.1] Length = 141 Score = 125 bits (314), Expect = 3e-27, Method: Composition-based stats. Identities = 29/118 (24%), Positives = 56/118 (47%), Gaps = 4/118 (3%) Query: 7 LLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCY 66 +L++ + + A A +DK++G ++ ++A+ G+L + C Sbjct: 28 ILMMLAIMAAPVQAQQVASAE-GGILRALDKVSGVSRDVEMRRGETARVGNLNVTMNEC- 85 Query: 67 SRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 R DA+ + EI +F+GWM A +PA++A++H YDIW+++C Sbjct: 86 -RYPSGNPAGDAYAEL-EIVETGDENRLFAGWMIASAPALSALEHPRYDIWVIRCTTS 141 >gi|163736132|ref|ZP_02143551.1| hypothetical protein RGBS107_13411 [Phaeobacter gallaeciensis BS107] gi|161390002|gb|EDQ14352.1| hypothetical protein RGBS107_13411 [Phaeobacter gallaeciensis BS107] Length = 120 Score = 125 bits (314), Expect = 4e-27, Method: Composition-based stats. Identities = 31/105 (29%), Positives = 49/105 (46%), Gaps = 3/105 (2%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 A A A G+DK+ G+ ++ + SA+ +I+ C D Sbjct: 16 TASAQGAADNGSSAVLRGLDKVNGQTQDLEIPVGGSAEIFGVIVSLRECRYPADNPTGDA 75 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 A++++ F GWM A SPA+NA+DHS YD+W+M+C Sbjct: 76 YAYLTVR---NPNDATVYFDGWMIASSPALNALDHSRYDVWVMRC 117 >gi|114771113|ref|ZP_01448553.1| hypothetical protein OM2255_03407 [alpha proteobacterium HTCC2255] gi|114548395|gb|EAU51281.1| hypothetical protein OM2255_03407 [alpha proteobacterium HTCC2255] Length = 131 Score = 125 bits (313), Expect = 5e-27, Method: Composition-based stats. Identities = 31/117 (26%), Positives = 57/117 (48%), Gaps = 5/117 (4%) Query: 7 LLILFFVFSHAKFAN-SARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 L LF +F+ A S + N A +D+++G V F + + G++ + Sbjct: 4 LTFLFIIFAQTVLAQGSIKVTNGSGALLRTLDRLSGNVTDFKITNGEEIILGNINVLMKE 63 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 C R ++ +AF + I + F GWM + SPA++A++H YD+W+++C Sbjct: 64 C--RYPSQSIDSNAFAFLV-ISGQETEKLFFEGWMISSSPALSALEHPRYDVWVLKC 117 >gi|77463761|ref|YP_353265.1| hypothetical protein RSP_0193 [Rhodobacter sphaeroides 2.4.1] gi|126462591|ref|YP_001043705.1| hypothetical protein Rsph17029_1826 [Rhodobacter sphaeroides ATCC 17029] gi|332558617|ref|ZP_08412939.1| hypothetical protein RSWS8N_06160 [Rhodobacter sphaeroides WS8N] gi|77388179|gb|ABA79364.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] gi|126104255|gb|ABN76933.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] gi|332276329|gb|EGJ21644.1| hypothetical protein RSWS8N_06160 [Rhodobacter sphaeroides WS8N] Length = 123 Score = 125 bits (313), Expect = 5e-27, Method: Composition-based stats. Identities = 31/101 (30%), Positives = 49/101 (48%), Gaps = 3/101 (2%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFV 80 + A +DK++G +++ QSA G L I+ C A A + Sbjct: 18 DQRTGEGTGALLRWLDKMSGETADVELQRGQSAVSGHLTIELDECRFPAGDPASDAYAHL 77 Query: 81 SISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +I + R +F GWM A SPA++A+DH YD+WL++C Sbjct: 78 TIRDS---RAAEPVFDGWMIASSPALSALDHPRYDVWLLRC 115 >gi|330813928|ref|YP_004358167.1| hypothetical protein SAR11G3_00953 [Candidatus Pelagibacter sp. IMCC9063] gi|327487023|gb|AEA81428.1| hypothetical protein SAR11G3_00953 [Candidatus Pelagibacter sp. IMCC9063] Length = 131 Score = 125 bits (313), Expect = 5e-27, Method: Composition-based stats. Identities = 35/130 (26%), Positives = 58/130 (44%), Gaps = 9/130 (6%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFAN--------KVAEFAGMDKITGRVLTFDVEINQS 52 M R+L F+F +A + A +DKIT + T ++IN+ Sbjct: 1 MATRILAYFFLFIFLSPIYALGQGLKDVKILDSNANTANIVILDKITSKKNTHTIQINKK 60 Query: 53 AQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIV-RSIFSGWMFADSPAMNAIDH 111 +F SL + C + + + AFV I + I++GWMF+ P++N ++H Sbjct: 61 YKFYSLEVLVKRCVLDNSDGSLKTSAFVQIQDPNKKNKDQVYIYNGWMFSGFPSINPMEH 120 Query: 112 SIYDIWLMQC 121 YDIW+ C Sbjct: 121 VNYDIWIESC 130 >gi|84515419|ref|ZP_01002781.1| hypothetical protein SKA53_02136 [Loktanella vestfoldensis SKA53] gi|84510702|gb|EAQ07157.1| hypothetical protein SKA53_02136 [Loktanella vestfoldensis SKA53] Length = 131 Score = 125 bits (313), Expect = 5e-27, Method: Composition-based stats. Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 3/109 (2%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 + A+ +DK+TG V +V Q+ G+L + C + A A + Sbjct: 20 QEMLSGIGADIRILDKLTGAVTDLEVSNGQTDNVGALSVTLGDCRYPAENVASEGYAALV 79 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSIS 130 I V IF+GWM A SPA+NA+DH YD+W+++C + + Sbjct: 80 IH---YRAEVAPIFAGWMLASSPALNALDHPRYDVWVLRCITSLGAGTA 125 >gi|254511429|ref|ZP_05123496.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221535140|gb|EEE38128.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 119 Score = 124 bits (311), Expect = 8e-27, Method: Composition-based stats. Identities = 31/112 (27%), Positives = 54/112 (48%), Gaps = 3/112 (2%) Query: 13 VFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE 72 V + A + A G+DK++G+ L ++ Q+ L + C + Sbjct: 11 VIATGATAQQKAESGPGAMLRGLDKVSGQTLDVEIRNGQTETVFGLDVALGDCRYPAENP 70 Query: 73 AQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 A+++I E + +F GWM A SPA+NA+DH+ YD+W+++C P Sbjct: 71 TGDAFAYLTIWE---QGKAQQLFDGWMVATSPALNALDHARYDVWVIRCMTP 119 >gi|126727094|ref|ZP_01742931.1| hypothetical protein RB2150_00624 [Rhodobacterales bacterium HTCC2150] gi|126703522|gb|EBA02618.1| hypothetical protein RB2150_00624 [Rhodobacterales bacterium HTCC2150] Length = 119 Score = 123 bits (310), Expect = 1e-26, Method: Composition-based stats. Identities = 36/120 (30%), Positives = 57/120 (47%), Gaps = 3/120 (2%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMV 64 + +L + + A +DK++G+ FD+ QSA G+L + Sbjct: 2 IRVLAVIAALCPFAALAEDTTSTTTANMRALDKVSGQTWDFDISSGQSASLGNLTLFSKE 61 Query: 65 CYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 C R + +A+V + I +R +F GWM A SPA+NA DH+ YD+WL+ C P Sbjct: 62 C--RYPTDDPSSNAYVYL-SIQDERDGGELFRGWMVAASPALNAFDHARYDVWLLSCALP 118 >gi|260432985|ref|ZP_05786956.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260416813|gb|EEX10072.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 119 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. Identities = 32/101 (31%), Positives = 53/101 (52%), Gaps = 7/101 (6%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ--RIDAFVSIS 83 + A G+DK++G+ + +++ ++A L + C R E AF++I Sbjct: 24 SGAGAVLRGLDKVSGQTVDVEMQPGETASIFGLDVALGDC--RYPTENPTGDAFAFLTIW 81 Query: 84 EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 E +F GWM A SPA+NA+DHS YD+W+++C P Sbjct: 82 E---KGEAEQLFDGWMIATSPALNALDHSRYDVWVIRCITP 119 >gi|254464893|ref|ZP_05078304.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206685801|gb|EDZ46283.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 125 Score = 123 bits (308), Expect = 2e-26, Method: Composition-based stats. Identities = 28/105 (26%), Positives = 48/105 (45%), Gaps = 3/105 (2%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A S+ A +DK+ G + ++ + SA+ L++ C + AF Sbjct: 24 AQSSAAQGTAAVLRALDKVNGHSMDAEIAVGSSAEMFGLLVTVSDCRYPAENPTGDAYAF 83 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 +++ F GWM A SPA+N +DHS YD+W+++C Sbjct: 84 LTVR---NPGDSAVQFEGWMIASSPALNPLDHSRYDVWVIRCSSS 125 >gi|304321753|ref|YP_003855396.1| hypothetical protein PB2503_11034 [Parvularcula bermudensis HTCC2503] gi|303300655|gb|ADM10254.1| hypothetical protein PB2503_11034 [Parvularcula bermudensis HTCC2503] Length = 259 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 30/147 (20%), Positives = 49/147 (33%), Gaps = 54/147 (36%) Query: 30 AEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSI------- 82 +DKIT + + ++A FG L + P C R E F+ + Sbjct: 107 VTLRALDKITATFTDITIPLGETAAFGPLTLLPRTCDRRPPEEPPETTVFLEVYAGDGDV 166 Query: 83 -----SEIFTDRIV------------------------------------------RSIF 95 + +R +F Sbjct: 167 QGQRARDARAEREAMQVEAPRSTLQLPGTQMSSGAEADTPPSALAQENVIDTEALGEDVF 226 Query: 96 SGWMFADSPAMNAIDHSIYDIWLMQCK 122 GWMFA SP++NA++H +YD+W++ CK Sbjct: 227 KGWMFASSPSLNAMEHPVYDVWVIDCK 253 >gi|84499427|ref|ZP_00997715.1| hypothetical protein OB2597_05850 [Oceanicola batsensis HTCC2597] gi|84392571|gb|EAQ04782.1| hypothetical protein OB2597_05850 [Oceanicola batsensis HTCC2597] Length = 127 Score = 122 bits (307), Expect = 3e-26, Method: Composition-based stats. Identities = 34/105 (32%), Positives = 46/105 (43%), Gaps = 2/105 (1%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A A G+DK+ G V + G L I C A AF Sbjct: 25 AQEDVSVGTGAVLRGLDKMNGETRDVSVPSGTAVMVGKLSITMWECRYPAGNPAGDAYAF 84 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 ++I+E + IFSGWM A SPA+NA+DH YD+W++ C Sbjct: 85 MTITE--PAKSSDPIFSGWMVASSPALNALDHFRYDVWVLSCTTS 127 >gi|254440647|ref|ZP_05054140.1| hypothetical protein OA307_62 [Octadecabacter antarcticus 307] gi|198250725|gb|EDY75040.1| hypothetical protein OA307_62 [Octadecabacter antarcticus 307] Length = 106 Score = 121 bits (304), Expect = 5e-26, Method: Composition-based stats. Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 3/101 (2%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVS 81 +DKITGR + Q+ + I C ++ AF++ Sbjct: 7 QQAITASGGTLRVLDKITGRTQDLEFGNGQTQTVELMAITMTECRYPSGNQSGDAYAFLT 66 Query: 82 ISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 I + +F GWM A +PA+NA+DH YD+W ++C Sbjct: 67 I---LYNNAADPVFRGWMIASAPALNALDHPRYDVWALRCS 104 >gi|119384014|ref|YP_915070.1| hypothetical protein Pden_1269 [Paracoccus denitrificans PD1222] gi|119373781|gb|ABL69374.1| conserved hypothetical protein [Paracoccus denitrificans PD1222] Length = 163 Score = 121 bits (304), Expect = 6e-26, Method: Composition-based stats. Identities = 33/96 (34%), Positives = 52/96 (54%), Gaps = 3/96 (3%) Query: 27 NKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIF 86 A+ G+DKITGR F + + + A+FG L + C R DA+ ++ I Sbjct: 69 GTGAQLRGLDKITGRTQDFTLAVGEVAEFGRLQLSLAEC--RYPAADPTSDAYAELT-IT 125 Query: 87 TDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 + +FSGWM A SPA++++D S YD+W++ C Sbjct: 126 DSQANARLFSGWMIASSPALSSLDDSRYDVWVISCN 161 >gi|146277492|ref|YP_001167651.1| hypothetical protein Rsph17025_1452 [Rhodobacter sphaeroides ATCC 17025] gi|145555733|gb|ABP70346.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17025] Length = 123 Score = 121 bits (304), Expect = 6e-26, Method: Composition-based stats. Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 3/96 (3%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEI 85 A +DK++G ++ QSA G L I+ C A A ++I + Sbjct: 23 EGSGALLRWLDKMSGETADAELMRGQSAVSGHLTIELDECRYPAGDPASDAFAHLTIRDS 82 Query: 86 FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 R +F GWM A SPA++++DH YD+WL++C Sbjct: 83 ---RAAEPVFDGWMIASSPALSSLDHPRYDVWLLRC 115 >gi|126741250|ref|ZP_01756929.1| hypothetical protein RSK20926_17467 [Roseobacter sp. SK209-2-6] gi|126717655|gb|EBA14378.1| hypothetical protein RSK20926_17467 [Roseobacter sp. SK209-2-6] Length = 126 Score = 121 bits (303), Expect = 8e-26, Method: Composition-based stats. Identities = 30/96 (31%), Positives = 50/96 (52%), Gaps = 3/96 (3%) Query: 27 NKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIF 86 +A G+DK+ G+ +V++ +SA+ L++ C D A++ I + Sbjct: 32 GSMAILRGLDKVNGQSTDVEVQVGRSAEVFGLLVTLAQCRYPVDNPTGDAFAYLIIRD-- 89 Query: 87 TDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 F GWM A SPA+NA+DHS YD+W+++C Sbjct: 90 -PNNGAQFFEGWMIASSPALNALDHSRYDVWVIRCS 124 >gi|56695909|ref|YP_166260.1| hypothetical protein SPO1008 [Ruegeria pomeroyi DSS-3] gi|56677646|gb|AAV94312.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 119 Score = 120 bits (302), Expect = 9e-26, Method: Composition-based stats. Identities = 31/99 (31%), Positives = 52/99 (52%), Gaps = 3/99 (3%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEI 85 + A G+DK++G+ F V +A+ L + C + A+++I E Sbjct: 24 SGTGAMLRGLDKVSGQTEDFRVATGGTAEIYGLDVALGDCRYPVENPTGDAFAYLTIWE- 82 Query: 86 FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 ++IF GWM A SPA++A+DHS YD+W+++C P Sbjct: 83 --RGQRQAIFDGWMIASSPALSALDHSRYDVWVIRCMTP 119 >gi|89055209|ref|YP_510660.1| hypothetical protein Jann_2718 [Jannaschia sp. CCS1] gi|88864758|gb|ABD55635.1| hypothetical protein Jann_2718 [Jannaschia sp. CCS1] Length = 184 Score = 120 bits (302), Expect = 9e-26, Method: Composition-based stats. Identities = 26/106 (24%), Positives = 48/106 (45%), Gaps = 4/106 (3%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 + A +DK+ G+ ++ + Q+ FG + I+ + C Sbjct: 82 TSISQPATEVGTTVSLRALDKMLGQPTDIELSMGQTVVFGRVAIRVIECRYPAADPGGDA 141 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 A + + + ++F GWM A SPA+NA++HS YD+W++ C Sbjct: 142 FALLEV----LNMEGETLFDGWMIASSPALNALEHSRYDVWVLGCS 183 >gi|84686584|ref|ZP_01014477.1| hypothetical protein 1099457000254_RB2654_07826 [Maritimibacter alkaliphilus HTCC2654] gi|84665497|gb|EAQ11974.1| hypothetical protein RB2654_07826 [Rhodobacterales bacterium HTCC2654] Length = 179 Score = 120 bits (302), Expect = 1e-25, Method: Composition-based stats. Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 3/96 (3%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEI 85 A G+DK+ G + ++ ++ + G L + C R + + DAF + I Sbjct: 84 QGTGAVLRGLDKLAGTSIDLNLATGETGELGWLQVTMAEC--RYPNDNPQGDAFAHLV-I 140 Query: 86 FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +F GWM A SPA++A+DHS +D+W+M C Sbjct: 141 RNGNDEEPLFDGWMIASSPALSALDHSRFDVWVMNC 176 >gi|294678127|ref|YP_003578742.1| hypothetical protein RCAP_rcc02605 [Rhodobacter capsulatus SB 1003] gi|294476947|gb|ADE86335.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 122 Score = 119 bits (299), Expect = 2e-25, Method: Composition-based stats. Identities = 34/103 (33%), Positives = 52/103 (50%), Gaps = 3/103 (2%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF 79 A A G+DKI G + + QS +GSL ++ C D A AF Sbjct: 21 APEGLAEAPGATLRGLDKIAGAATDLPLSVGQSLDYGSLSVRLTDCRYPADDPASNAYAF 80 Query: 80 VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 + I++ R +F GWM A +PA++A+DH YD+W+++CK Sbjct: 81 LEITDTAIGRE---VFRGWMIAQNPALSALDHQRYDVWVLRCK 120 >gi|159043941|ref|YP_001532735.1| hypothetical protein Dshi_1392 [Dinoroseobacter shibae DFL 12] gi|157911701|gb|ABV93134.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12] Length = 180 Score = 119 bits (299), Expect = 2e-25, Method: Composition-based stats. Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 6/110 (5%) Query: 16 HAKFANSARFANKVA-EFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ 74 F R + A +DK TGRV T ++ ++ Q G L I + C R E Sbjct: 54 FQTFEQELRVSAAEAGLIRVLDKTTGRVETLEIPAGEARQSGRLSITLIEC--RFPEENP 111 Query: 75 RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 DAFV + ++ GWM A SPA+ A+DH YD+W ++C P Sbjct: 112 ASDAFVHLQ---ITERDTPLYDGWMIASSPALAALDHHRYDVWALRCATP 158 >gi|254453527|ref|ZP_05066964.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198267933|gb|EDY92203.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 105 Score = 119 bits (299), Expect = 2e-25, Method: Composition-based stats. Identities = 34/106 (32%), Positives = 48/106 (45%), Gaps = 4/106 (3%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 FA A +DKITGR + Q+ G L I C R Q Sbjct: 2 PVFAQEAT-TASGGTLRVLDKITGRTHDLEFGNGQTQTVGLLAITMTEC--RYPSGNQSG 58 Query: 77 DAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 DA+ ++ I + +F GWM A +PA+NA+DH YD+W ++C Sbjct: 59 DAY-TLLTIVYNNAADPVFRGWMIASAPALNALDHPRYDVWTLRCS 103 >gi|126733140|ref|ZP_01748887.1| hypothetical protein RCCS2_03274 [Roseobacter sp. CCS2] gi|126716006|gb|EBA12870.1| hypothetical protein RCCS2_03274 [Roseobacter sp. CCS2] Length = 146 Score = 118 bits (297), Expect = 3e-25, Method: Composition-based stats. Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 5/97 (5%) Query: 28 KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFT 87 + +DK+TG+V +E Q+A G L +K C R E DAF I Sbjct: 55 SGGDLRILDKLTGQVSDVSLETGQTATLGFLSVKLNEC--RYPIENPSGDAFTQIVVRDN 112 Query: 88 DRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 + ++FSGWM A +PA+NA+DH YD+W ++C Sbjct: 113 EG---TLFSGWMLASAPALNAMDHPRYDVWALRCMTS 146 >gi|197105316|ref|YP_002130693.1| hypothetical protein PHZ_c1853 [Phenylobacterium zucineum HLK1] gi|196478736|gb|ACG78264.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 227 Score = 118 bits (297), Expect = 4e-25, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 45/101 (44%), Gaps = 7/101 (6%) Query: 29 VAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRID-AFVSISEIFT 87 A +DK++ L F+ + + ++ LI C E A+V+I Sbjct: 111 TAVLQALDKVSAETLKFEAPVGRPVRWKGLIFTVRACERSAPDEPVEDAIAYVTIDSQPR 170 Query: 88 DRIVR------SIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 + R F GWM+A SP +N ++H+ YD W++ C+ Sbjct: 171 PQPGRPTPPPRQAFRGWMYASSPGLNPMEHATYDAWVISCR 211 >gi|315499994|ref|YP_004088797.1| hypothetical protein Astex_3010 [Asticcacaulis excentricus CB 48] gi|315418006|gb|ADU14646.1| Protein of unknown function DUF2155 [Asticcacaulis excentricus CB 48] Length = 272 Score = 118 bits (296), Expect = 5e-25, Method: Composition-based stats. Identities = 22/112 (19%), Positives = 49/112 (43%), Gaps = 7/112 (6%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA-QRIDAFV 80 R A +DK+TG + F+ + + ++ ++ C + EA ++ Sbjct: 154 EKRLRYSAAILTVLDKVTGEAIRFEAPVGKPKRYRGMVYTVKACETSAQDEAMSDTMTYL 213 Query: 81 SISEIFTD------RIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIN 126 + + +F GW +A +P +N ++H +YD+W++ C+ P+ Sbjct: 214 EVRTSPQPAANGTVPKPKDVFKGWTYASTPGVNGMEHPVYDVWVVSCRTPLP 265 >gi|310816207|ref|YP_003964171.1| hypothetical protein EIO_1753 [Ketogulonicigenium vulgare Y25] gi|308754942|gb|ADO42871.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 121 Score = 117 bits (294), Expect = 8e-25, Method: Composition-based stats. Identities = 33/121 (27%), Positives = 57/121 (47%), Gaps = 3/121 (2%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 M+ +++ + A A A + +DK+ G V ++ QSA G L++ Sbjct: 1 MRKLTSVILAACLLPVAAAAQEAATSAPGGTVRVLDKLNGSVTDLELTNGQSATVGRLVV 60 Query: 61 KPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 C R + DAF I+ + + + +F GWM A SPA++A D++ YD+W + Sbjct: 61 TLGEC--RFPTDNPMGDAFQMIT-LQFEGNLEPVFMGWMIASSPAVSAFDNARYDVWPLS 117 Query: 121 C 121 C Sbjct: 118 C 118 >gi|110680058|ref|YP_683065.1| hypothetical protein RD1_2853 [Roseobacter denitrificans OCh 114] gi|109456174|gb|ABG32379.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 119 Score = 116 bits (292), Expect = 1e-24, Method: Composition-based stats. Identities = 34/99 (34%), Positives = 50/99 (50%), Gaps = 7/99 (7%) Query: 28 KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAF--VSISEI 85 A+ G+D+I G V Q+A+ + I+ C R DAF + + +I Sbjct: 26 TGAQLRGVDRINGETFEIIVPKGQTAKLDRISIRLNAC--RYPVGNPSGDAFASLEVRDI 83 Query: 86 FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 + IFSGWM A SPA++A+DH YDIW+M+C Sbjct: 84 DSG---AFIFSGWMIASSPALSAMDHPRYDIWVMRCTTS 119 >gi|163731530|ref|ZP_02138977.1| hypothetical protein RLO149_19539 [Roseobacter litoralis Och 149] gi|161394984|gb|EDQ19306.1| hypothetical protein RLO149_19539 [Roseobacter litoralis Och 149] Length = 119 Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats. Identities = 38/116 (32%), Positives = 55/116 (47%), Gaps = 7/116 (6%) Query: 11 FFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 F + A A A A+ G+D+I G V Q+A+ + I C R Sbjct: 9 FVFTASAALAQQAATEATGAQLRGVDRINGDTFEIIVPRGQTAELERISITLNSC--RYP 66 Query: 71 REAQRIDAF--VSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 DAF +++ +I + IFSGWM A SPA++A+DH YDIW+M+C Sbjct: 67 VGNPSGDAFASLNVRDINSGAN---IFSGWMIASSPALSAMDHPRYDIWVMRCTTS 119 >gi|89068966|ref|ZP_01156348.1| hypothetical protein OG2516_01791 [Oceanicola granulosus HTCC2516] gi|89045547|gb|EAR51611.1| hypothetical protein OG2516_01791 [Oceanicola granulosus HTCC2516] Length = 117 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 3/96 (3%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEI 85 + +DK TG V +++ Q+AQ G + + C A A +++ Sbjct: 22 SAPGGVLRVLDKQTGHVEDLELQAGQTAQSGLVEVSLGACRYPAGNPAGDAYALLTVH-- 79 Query: 86 FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 V +F GWM A +PA+NA+DH YD+W+++C Sbjct: 80 -YRGQVEPVFRGWMIASAPALNAMDHPRYDVWVLRC 114 >gi|85375052|ref|YP_459114.1| hypothetical protein ELI_11125 [Erythrobacter litoralis HTCC2594] gi|84788135|gb|ABC64317.1| hypothetical protein ELI_11125 [Erythrobacter litoralis HTCC2594] Length = 157 Score = 115 bits (288), Expect = 4e-24, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 49/106 (46%), Gaps = 4/106 (3%) Query: 20 ANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ-RIDA 78 + + +VA ++K ++ +S + G +I++ C E A Sbjct: 36 SGATPMEERVATLGLLNKRNNISQDLEMSPGESRRIGDIIVRLSACERTAPWEMPQETGA 95 Query: 79 FVSIS---EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 FV + + + R IFSGWMF SP++N ++H +YD+W+ C Sbjct: 96 FVQVLVEGKGEDEGEWRKIFSGWMFQRSPSLNVVEHPVYDVWVKDC 141 >gi|329890102|ref|ZP_08268445.1| hypothetical protein BDIM_17990 [Brevundimonas diminuta ATCC 11568] gi|328845403|gb|EGF94967.1| hypothetical protein BDIM_17990 [Brevundimonas diminuta ATCC 11568] Length = 231 Score = 114 bits (286), Expect = 8e-24, Method: Composition-based stats. Identities = 30/106 (28%), Positives = 52/106 (49%), Gaps = 7/106 (6%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEIN-QSAQFG-SLIIKPMVCYSRDDREAQRID-AFV 80 R ++A +DK T + F+VE+ + +FG +L+ K C E A++ Sbjct: 125 RQRRRIAVIQAVDKTTAETMRFEVEVGGRPVRFGKTLLFKARACEVSASDEMTEDAIAYM 184 Query: 81 SI----SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 + + R +F GWMFA SP+++ + H +YD W++ CK Sbjct: 185 EVGVQPRGLAAPTEARQVFKGWMFASSPSVSGLQHPVYDAWVVGCK 230 >gi|254418951|ref|ZP_05032675.1| hypothetical protein BBAL3_1261 [Brevundimonas sp. BAL3] gi|196185128|gb|EDX80104.1| hypothetical protein BBAL3_1261 [Brevundimonas sp. BAL3] Length = 221 Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 7/109 (6%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEIN-QSAQFG-SLIIKPMVCYSRDDRE-AQRIDA 78 + R K A +DK T + F+VE+ + +F +LI C E + A Sbjct: 113 ARRQRRKFAVIQAIDKTTAETMKFEVEVGGRPVRFNRNLIFSVRACEVSTPDELTEDAIA 172 Query: 79 FVSI----SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKD 123 +V + R I+ GWMFA SPA++ + + YD W++ CK+ Sbjct: 173 YVDVSLQSRGANQPAEPRQIYRGWMFASSPAVSGLQNPNYDAWVVGCKN 221 >gi|85710256|ref|ZP_01041321.1| hypothetical protein NAP1_15263 [Erythrobacter sp. NAP1] gi|85688966|gb|EAQ28970.1| hypothetical protein NAP1_15263 [Erythrobacter sp. NAP1] Length = 164 Score = 111 bits (278), Expect = 6e-23, Method: Composition-based stats. Identities = 26/118 (22%), Positives = 50/118 (42%), Gaps = 2/118 (1%) Query: 14 FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA 73 + + S +VA ++K ++ ++A+ G +I++ C E Sbjct: 47 LTPLEVGESTPMDERVATIGLLNKRNNVSQDLELSPGETAEVGPVIVRLEACERTAPYEF 106 Query: 74 Q-RIDAFVSISEIFTD-RIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSI 129 AFV + + IFSGW+F ++P++N ++H IYD+W+ C Sbjct: 107 PQETGAFVQVDVLERGASEHARIFSGWLFKENPSLNVVEHPIYDVWVKDCAMSFPGDE 164 >gi|163746707|ref|ZP_02154064.1| hypothetical protein OIHEL45_14929 [Oceanibulbus indolifex HEL-45] gi|161379821|gb|EDQ04233.1| hypothetical protein OIHEL45_14929 [Oceanibulbus indolifex HEL-45] Length = 146 Score = 109 bits (272), Expect = 3e-22, Method: Composition-based stats. Identities = 27/99 (27%), Positives = 50/99 (50%), Gaps = 3/99 (3%) Query: 26 ANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEI 85 + +DKI+G + ++ + G+L I + C R +A+ ++ EI Sbjct: 51 SATGGVLRVLDKISGDTIDLEITKGDNQSLGNLQITMVDC--RYPVGDPAANAYAAL-EI 107 Query: 86 FTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDP 124 ++FSGWM A +PA++A++H YDIW+++C Sbjct: 108 TESGDSGTLFSGWMIAAAPALHALEHFRYDIWVLRCSTS 146 >gi|296282309|ref|ZP_06860307.1| hypothetical protein CbatJ_01745 [Citromicrobium bathyomarinum JL354] Length = 173 Score = 108 bits (271), Expect = 4e-22, Method: Composition-based stats. Identities = 28/115 (24%), Positives = 52/115 (45%), Gaps = 6/115 (5%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ-RIDAFVS 81 A +VA ++K F+++ ++ + G ++I+ C E AFV Sbjct: 59 TPMAERVATIGLLNKRNNVSRDFEMKPGEATRVGDVVIRLRACEKTAPWELPQDEGAFVQ 118 Query: 82 I-----SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISN 131 + T+R +FSGW+F +SP++N ++H IYD+W+ C + Sbjct: 119 VFVRERRGAETERSWNKVFSGWLFRNSPSLNVVEHPIYDVWVKSCAMSFPGEEED 173 >gi|157825620|ref|YP_001493340.1| hypothetical protein A1C_02685 [Rickettsia akari str. Hartford] gi|157799578|gb|ABV74832.1| hypothetical protein A1C_02685 [Rickettsia akari str. Hartford] Length = 160 Score = 108 bits (270), Expect = 5e-22, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 51/111 (45%), Gaps = 1/111 (0%) Query: 12 FVFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 + + +S+ F N + ++KIT + ++ + FG++ IK C D Sbjct: 46 ILNPNDNINDSSEFKNYTNGKIIALNKITATSEEINFKVGEEKYFGNIKIKLHKCIKNLD 105 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I++ C Sbjct: 106 PYNEDNYLLMTITEYTIDEDPNLLFQGWMTSSSISLSTFEHPIYEIFVKDC 156 >gi|241761853|ref|ZP_04759939.1| conserved hypothetical protein [Zymomonas mobilis subsp. mobilis ATCC 10988] gi|241373767|gb|EER63327.1| conserved hypothetical protein [Zymomonas mobilis subsp. mobilis ATCC 10988] Length = 214 Score = 108 bits (269), Expect = 6e-22, Method: Composition-based stats. Identities = 35/140 (25%), Positives = 60/140 (42%), Gaps = 5/140 (3%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR-IDAFVSI 82 + +VA ++K TG + + F LIIK C EA+ AFV + Sbjct: 79 PMSQRVAVLGVLNKKTGEWQDITLHTGEITHFPDLIIKLQACDETMPWEAEHLTGAFVQV 138 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSE 142 + + + IFSGW++ ++P++N ++ YDIW C + + S A+ Sbjct: 139 ESLKYNHRWQRIFSGWLYKEAPSLNVLESPDYDIWPKSCTMRAPLGHAKEKEASPGAVPS 198 Query: 143 YSSTDITSQGSEKSSGSSSN 162 S + S K +S+N Sbjct: 199 VSK----AMPSVKGKSTSAN 214 >gi|302383315|ref|YP_003819138.1| hypothetical protein Bresu_2205 [Brevundimonas subvibrioides ATCC 15264] gi|302193943|gb|ADL01515.1| Protein of unknown function DUF2155 [Brevundimonas subvibrioides ATCC 15264] Length = 228 Score = 108 bits (269), Expect = 7e-22, Method: Composition-based stats. Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 6/105 (5%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEINQS-AQFGS-LIIKPMVCYSRDDRE-AQRIDAFV 80 R +VA +DKIT + F+VE+ +F + LI C D E A++ Sbjct: 123 RQRRRVAIVEAIDKITAESMRFEVEVGGPPVRFNNNLIFTARACEVSADNELVNDAIAYL 182 Query: 81 SIS---EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 I+ R IF GWMF+ +PA++ + H IYD W++ CK Sbjct: 183 DITLQPRATPAAAPRQIFRGWMFSSTPAISGLQHPIYDAWIVGCK 227 >gi|260752613|ref|YP_003225506.1| hypothetical protein Za10_0371 [Zymomonas mobilis subsp. mobilis NCIMB 11163] gi|258551976|gb|ACV74922.1| conserved hypothetical protein [Zymomonas mobilis subsp. mobilis NCIMB 11163] Length = 216 Score = 108 bits (269), Expect = 7e-22, Method: Composition-based stats. Identities = 35/140 (25%), Positives = 60/140 (42%), Gaps = 5/140 (3%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR-IDAFVSI 82 + +VA ++K TG + + F LIIK C EA+ AFV + Sbjct: 81 PMSQRVAVLGVLNKKTGEWQDITLHTGEITHFPDLIIKLQACDETMPWEAEHLTGAFVQV 140 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSE 142 + + + IFSGW++ ++P++N ++ YDIW C + + S A+ Sbjct: 141 ESLKYNHRWQRIFSGWLYKEAPSLNVLESPDYDIWPKSCTMRAPLGHAKEKETSPAAVPS 200 Query: 143 YSSTDITSQGSEKSSGSSSN 162 S + S K +S+N Sbjct: 201 VSK----AMPSVKGKSASAN 216 >gi|239947487|ref|ZP_04699240.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] gi|239921763|gb|EER21787.1| conserved hypothetical protein [Rickettsia endosymbiont of Ixodes scapularis] Length = 157 Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats. Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 1/111 (0%) Query: 12 FVFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 + + NS+ F N + ++KIT D ++ + FG++ IK C D Sbjct: 46 ILNPNDNINNSSEFKNYTNGKIIALNKITATSEEIDFKVGEEKYFGNIKIKLHKCIKNLD 105 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 106 PYNEDNYLLMTITEYKIDEDPNLLFQGWMISSSISLSTFEHPIYEIFAKDC 156 >gi|149179614|ref|ZP_01858130.1| hypothetical protein PM8797T_18696 [Planctomyces maris DSM 8797] gi|148841545|gb|EDL55992.1| hypothetical protein PM8797T_18696 [Planctomyces maris DSM 8797] Length = 161 Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 3/103 (2%) Query: 22 SARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE-AQRIDAFV 80 + A ++K +++ + + G +II+ C E + AFV Sbjct: 51 KTPMEERTATIGLLNKRNNLSQDLELKPGEQRRVGDVIIRLRACERTAPWEMEKDEGAFV 110 Query: 81 SI--SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + E + R +FSGW+F + P++N ++H IYD+W+ C Sbjct: 111 QVLVRERGSTSDFRRVFSGWLFKNKPSINVVEHPIYDVWVKSC 153 >gi|329114933|ref|ZP_08243689.1| Hypothetical protein APO_1737 [Acetobacter pomorum DM001] gi|326695830|gb|EGE47515.1| Hypothetical protein APO_1737 [Acetobacter pomorum DM001] Length = 195 Score = 106 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 24/108 (22%), Positives = 46/108 (42%), Gaps = 3/108 (2%) Query: 15 SHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ 74 A +A A +D++ V V + +A + SL I P C R + Sbjct: 31 PPAVYAPDTWQGKNTAVVRVLDRLDAHVEVISVPVGTTAHYKSLDITPSRCLQRPPTLSP 90 Query: 75 RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 A++++ + + F GWM A PA+ + +YD+ +++C+ Sbjct: 91 DAAAWLAVQDKHPNGAA---FQGWMLAAEPALGVFESPVYDVRMVRCE 135 >gi|283856366|ref|YP_003377799.1| hypothetical protein ZMO2007 [Zymomonas mobilis subsp. mobilis ZM4] gi|283775365|gb|ADB28965.1| conserved hypothetical protein [Zymomonas mobilis subsp. mobilis ZM4] Length = 214 Score = 106 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 35/140 (25%), Positives = 60/140 (42%), Gaps = 5/140 (3%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQR-IDAFVSI 82 + +VA ++K TG + + F LIIK C EA+ AFV + Sbjct: 79 PMSQRVAILGVLNKKTGEWQDITLHTGEITHFPDLIIKLQACDETMPWEAEHLTGAFVQV 138 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSE 142 + + + IFSGW++ ++P++N ++ YDIW C + + S A+ Sbjct: 139 ESLKYNHRWQRIFSGWLYKEAPSLNVLESPDYDIWPKSCTMRAPLGHAKEKEASPGAVPS 198 Query: 143 YSSTDITSQGSEKSSGSSSN 162 S + S K +S+N Sbjct: 199 VSK----AMPSVKGKSASAN 214 >gi|149186415|ref|ZP_01864728.1| hypothetical protein ED21_23038 [Erythrobacter sp. SD-21] gi|148830004|gb|EDL48442.1| hypothetical protein ED21_23038 [Erythrobacter sp. SD-21] Length = 161 Score = 106 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 25/102 (24%), Positives = 47/102 (46%), Gaps = 3/102 (2%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDRE-AQRIDAFVS 81 + A ++K +++ + + G +II+ C E + AFV Sbjct: 52 TPMEERTATIGLLNKRNNLSQDLELKPGEQRRVGDVIIRLRACERTAPWEMEKDEGAFVQ 111 Query: 82 I--SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + E + R +FSGW+F + P++N ++H IYD+W+ C Sbjct: 112 VLVRERGSTSDFRRVFSGWLFKNKPSINVVEHPIYDVWVKSC 153 >gi|67458961|ref|YP_246585.1| hypothetical protein RF_0569 [Rickettsia felis URRWXCal2] gi|67004494|gb|AAY61420.1| unknown [Rickettsia felis URRWXCal2] Length = 157 Score = 106 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 23/101 (22%), Positives = 46/101 (45%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFV 80 +S + + ++KIT + ++ + FG++ IK C D + + Sbjct: 56 SSEFKSYTNGKIIALNKITATSEEINFKVGEEKYFGNIKIKLHKCIKNLDPYNEDNYLLM 115 Query: 81 SISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 116 TITEYKIDEDPNLLFQGWMISSSISLSTFEHPIYEIFAKDC 156 >gi|15604227|ref|NP_220743.1| hypothetical protein RP359 [Rickettsia prowazekii str. Madrid E] gi|6647974|sp|Q9ZDG9|Y359_RICPR RecName: Full=Uncharacterized protein RP359 gi|3860919|emb|CAA14819.1| unknown [Rickettsia prowazekii] gi|292571968|gb|ADE29883.1| hypothetical protein rpr22_CDS352 [Rickettsia prowazekii Rp22] Length = 155 Score = 106 bits (266), Expect = 2e-21, Method: Composition-based stats. Identities = 27/110 (24%), Positives = 48/110 (43%), Gaps = 1/110 (0%) Query: 13 VFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDR 71 + + SA F N + ++KIT ++ + FG++ IK C D Sbjct: 45 ILNQKDNIYSAEFKNYTNGKIIALNKITATSEEIGLKAGEEKYFGNIKIKLHKCIKNLDP 104 Query: 72 EAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 Q ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 105 YNQDNYLLMTITEYKIDEDPTLLFQGWMVSSSISLSTFEHPIYEIFAKDC 154 >gi|258541331|ref|YP_003186764.1| hypothetical protein APA01_02320 [Acetobacter pasteurianus IFO 3283-01] gi|256632409|dbj|BAH98384.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01] gi|256635466|dbj|BAI01435.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-03] gi|256638521|dbj|BAI04483.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-07] gi|256641575|dbj|BAI07530.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-22] gi|256644630|dbj|BAI10578.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-26] gi|256647685|dbj|BAI13626.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-32] gi|256650738|dbj|BAI16672.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-01-42C] gi|256653729|dbj|BAI19656.1| hypothetical protein [Acetobacter pasteurianus IFO 3283-12] Length = 195 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. Identities = 24/108 (22%), Positives = 45/108 (41%), Gaps = 3/108 (2%) Query: 15 SHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ 74 A +A A +DK+ V V + +A + SL I P C R + Sbjct: 31 PPAVYAPDTWQGKNTAVVRVLDKLDAHVEVLSVPVGTTAHYKSLDITPSRCLQRPPTLSP 90 Query: 75 RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 A++++ + + F GWM A P + + +YD+ +++C+ Sbjct: 91 DAAAWLALQDKHPNGAT---FQGWMLAAEPTLGVFESPVYDVRMVRCE 135 >gi|91205215|ref|YP_537570.1| hypothetical protein RBE_0400 [Rickettsia bellii RML369-C] gi|157827446|ref|YP_001496510.1| hypothetical protein A1I_05755 [Rickettsia bellii OSU 85-389] gi|91068759|gb|ABE04481.1| unknown [Rickettsia bellii RML369-C] gi|157802750|gb|ABV79473.1| hypothetical protein A1I_05755 [Rickettsia bellii OSU 85-389] Length = 158 Score = 105 bits (261), Expect = 5e-21, Method: Composition-based stats. Identities = 26/110 (23%), Positives = 52/110 (47%), Gaps = 1/110 (0%) Query: 13 VFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDR 71 + S+ ++S F N E ++K T + ++ + FG++ IK C D Sbjct: 48 LNSNQAISDSTEFKNCDNCEITALNKTTAKSEKLTFKVGEEQYFGNIKIKIHKCVKNLDP 107 Query: 72 EAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D + +F GWM + S +++ +H IY+I+ +C Sbjct: 108 YNEDNYILMTITEYIIDEDPKLLFQGWMTSGSISLSTFEHPIYEIFAKEC 157 >gi|15892410|ref|NP_360124.1| hypothetical protein RC0487 [Rickettsia conorii str. Malish 7] gi|15619561|gb|AAL03025.1| unknown [Rickettsia conorii str. Malish 7] Length = 157 Score = 103 bits (258), Expect = 1e-20, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 12 FVFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 + + +S+ F N + ++ IT D ++ + FG++ IK C D Sbjct: 46 ILNPNDNINDSSEFKNYTNGKIIALNNITATSEEIDFKVGEEKYFGNIKIKLHKCIKNLD 105 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 106 PYNEDNYLLMTITEYKIDEDPNVLFQGWMISSSISLSTFEHPIYEIFAKDC 156 >gi|238651032|ref|YP_002916889.1| hypothetical protein RPR_06840 [Rickettsia peacockii str. Rustic] gi|238625130|gb|ACR47836.1| hypothetical protein RPR_06840 [Rickettsia peacockii str. Rustic] Length = 157 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 12 FVFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 + + +S+ F N + ++ IT D ++ + FG++ IK C D Sbjct: 46 ILNPNDNINDSSEFKNYTNGKIIALNNITATSEEIDFKVGEEKYFGNIKIKLHKCIKNLD 105 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 106 PYNEDNYLLMTITEYKIDEDPNLLFQGWMISSSISLSTFEHPIYEIFAKDC 156 >gi|51473553|ref|YP_067310.1| hypothetical protein RT0348 [Rickettsia typhi str. Wilmington] gi|51459865|gb|AAU03828.1| conserved hypothetical protein [Rickettsia typhi str. Wilmington] Length = 155 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 26/101 (25%), Positives = 46/101 (45%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFV 80 N+ + ++KIT D++ + FG++ IK C D Q + Sbjct: 54 NAEFKNYTNGKIIALNKITATSEEIDLKTGEEKYFGNIKIKLHKCIKNLDPYNQDNYLLM 113 Query: 81 SISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +I+E D +F GWM + S +++ +HSIY+I+ C Sbjct: 114 TITEYKIDEDPSLLFQGWMVSSSISLSTFEHSIYEIFAKDC 154 >gi|34580588|ref|ZP_00142068.1| hypothetical protein [Rickettsia sibirica 246] gi|157828361|ref|YP_001494603.1| hypothetical protein A1G_02765 [Rickettsia rickettsii str. 'Sheila Smith'] gi|165933069|ref|YP_001649858.1| hypothetical protein RrIowa_0581 [Rickettsia rickettsii str. Iowa] gi|28261973|gb|EAA25477.1| unknown [Rickettsia sibirica 246] gi|157800842|gb|ABV76095.1| hypothetical protein A1G_02765 [Rickettsia rickettsii str. 'Sheila Smith'] gi|165908156|gb|ABY72452.1| hypothetical protein RrIowa_0581 [Rickettsia rickettsii str. Iowa] Length = 157 Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 12 FVFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 + + +S+ F N + ++ IT D ++ + FG++ IK C D Sbjct: 46 ILNPNDNINDSSEFKNYTNGKIIALNNITATSEEIDFKVGEEKYFGNIKIKLHKCIKNLD 105 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 106 PYNEDNYLLMTITEYKIDEDPNLLFQGWMISSSISLSTFEHPIYEIFAKDC 156 >gi|87198847|ref|YP_496104.1| hypothetical protein Saro_0825 [Novosphingobium aromaticivorans DSM 12444] gi|87134528|gb|ABD25270.1| conserved hypothetical protein [Novosphingobium aromaticivorans DSM 12444] Length = 217 Score = 102 bits (255), Expect = 3e-20, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 56/137 (40%), Gaps = 8/137 (5%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD-REAQRIDAFVS 81 ++VA ++K ++ +S + G+ I+K C + AFV Sbjct: 55 TPIKDRVATLGFLNKRNNITQDVVLKSGESRRIGNAIVKLATCEKTAPWEDPPETGAFVQ 114 Query: 82 IS-----EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESIS 136 + R +FSGW+F ++P++N ++H +YD+W+ C + S Sbjct: 115 LFVEERATTQEKLAWRKVFSGWLFRNAPSLNVVEHPVYDVWVKDCAMTFPG--EEEPAPS 172 Query: 137 KKALSEYSSTDITSQGS 153 ++ ++ + + + Sbjct: 173 ARSAAKPAGSPSAAASP 189 >gi|307295108|ref|ZP_07574950.1| Protein of unknown function DUF2155 [Sphingobium chlorophenolicum L-1] gi|306879582|gb|EFN10800.1| Protein of unknown function DUF2155 [Sphingobium chlorophenolicum L-1] Length = 218 Score = 101 bits (251), Expect = 8e-20, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 66/172 (38%), Gaps = 9/172 (5%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA-QRIDAFVS 81 +VA ++K G ++ ++ + G I++ C + E Q AFV Sbjct: 54 TPMNERVAVIGLLNKRNGITTDLQMKPGEALRVGDAIVRLQACETTAPWENVQETGAFVQ 113 Query: 82 IS-EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKAL 140 + D R FSGW+F + P N + H IYD+W+ C ++ +++ Sbjct: 114 LDVRSTADNKWRRNFSGWLFRERPDRNVVQHPIYDVWVRSCTMSWPET--GPDTVKLGDK 171 Query: 141 SEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMDLKGRPIQELGNN 192 E S+ ++ S S N + ++ +P + P N+ Sbjct: 172 GEASAGG----PAQASPASGENAS-SAQTPEPPASTPRPAPSATPSSATAND 218 >gi|157803899|ref|YP_001492448.1| hypothetical protein A1E_03660 [Rickettsia canadensis str. McKiel] gi|157785162|gb|ABV73663.1| hypothetical protein A1E_03660 [Rickettsia canadensis str. McKiel] Length = 157 Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 47/111 (42%), Gaps = 2/111 (1%) Query: 13 VFSHAKFANSARFANKVAE--FAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 V + N++ A ++KIT + + + FG++ IK C D Sbjct: 46 VLNPNYNINNSSEFKNYANGKIIVLNKITATSKEMNFTVGEEQYFGNIKIKLHKCIKNLD 105 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 106 PYNEDNYLLMTITEYKIDEDPNLLFQGWMTSSSISLSTFEHPIYEIFAKDC 156 >gi|229586626|ref|YP_002845127.1| hypothetical protein RAF_ORF0454 [Rickettsia africae ESF-5] gi|228021676|gb|ACP53384.1| Unknown [Rickettsia africae ESF-5] Length = 157 Score = 100 bits (250), Expect = 1e-19, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 12 FVFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 + + +S+ F N + ++ IT D+++ + F ++ IK C D Sbjct: 46 ILNPNDNINDSSEFKNYTNGKIIALNNITATSEEIDLKVGEEKYFCNIKIKLHKCIKNLD 105 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 106 PYNEDNYLLMTITEYKIDEDPNLLFQGWMISSSISLSTFEHPIYEIFAKDC 156 >gi|262277373|ref|ZP_06055166.1| conserved hypothetical protein [alpha proteobacterium HIMB114] gi|262224476|gb|EEY74935.1| conserved hypothetical protein [alpha proteobacterium HIMB114] Length = 127 Score = 98.1 bits (243), Expect = 6e-19, Method: Composition-based stats. Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 1/100 (1%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSIS 83 + N AE +DKIT R+ T + + I C + A V I Sbjct: 28 KNDNNYAEIKIIDKITSRLSTKKINLKTLKNIKDFEIFIDKCVLDTRKGFLETSALVQIK 87 Query: 84 EIFTDRIV-RSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 ++ +F+ WMFA + ++N I+H YDI L C Sbjct: 88 DVKNQTKDRVFLFNNWMFASNSSINEIEHPNYDISLKSCN 127 >gi|94498773|ref|ZP_01305321.1| hypothetical protein SKA58_14272 [Sphingomonas sp. SKA58] gi|94421782|gb|EAT06835.1| hypothetical protein SKA58_14272 [Sphingomonas sp. SKA58] Length = 176 Score = 94.6 bits (234), Expect = 8e-18, Method: Composition-based stats. Identities = 28/101 (27%), Positives = 45/101 (44%), Gaps = 2/101 (1%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREA-QRIDAFVS 81 A + A ++K G ++ ++ + G I++ C + E Q AFV Sbjct: 17 TPMAERSAVLGLLNKRNGLTRDLTLKPGEAVRVGDAIVRLQACETTAPWENIQDTGAFVQ 76 Query: 82 ISEIFT-DRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + + D R FSGW+F D P N + H IYD+W+ C Sbjct: 77 LDVRSSADNKWRRAFSGWLFRDRPDRNVVQHPIYDVWVRSC 117 >gi|296114615|ref|ZP_06833268.1| hypothetical protein GXY_02521 [Gluconacetobacter hansenii ATCC 23769] gi|295978971|gb|EFG85696.1| hypothetical protein GXY_02521 [Gluconacetobacter hansenii ATCC 23769] Length = 247 Score = 94.2 bits (233), Expect = 1e-17, Method: Composition-based stats. Identities = 25/139 (17%), Positives = 50/139 (35%), Gaps = 10/139 (7%) Query: 23 ARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSI 82 VA +DK+ V ++ Q A + SL + C R A++++ Sbjct: 43 TWKGRGVAIVRILDKLDAHVQILNIPAGQDATYKSLTLHARACLERPPTLPADTAAWLAV 102 Query: 83 SEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSE 142 + + F GWM PA+ + +YD+ ++ C + A+ + Sbjct: 103 RDA---HEGMTPFDGWMLTQEPALGLFQNPLYDVQVVGC-----AGADVAPIPPPLAVVQ 154 Query: 143 YSSTDIT--SQGSEKSSGS 159 +T + S + G+ Sbjct: 155 QQATPADVPAAPSTAALGT 173 >gi|58039577|ref|YP_191541.1| hypothetical protein GOX1117 [Gluconobacter oxydans 621H] gi|58001991|gb|AAW60885.1| Hypothetical protein GOX1117 [Gluconobacter oxydans 621H] Length = 229 Score = 86.5 bits (213), Expect = 2e-15, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 64/197 (32%), Gaps = 15/197 (7%) Query: 15 SHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ 74 A + + A +D++ + + + SA + L + C SR A Sbjct: 34 PPAMYPAATWQGQSQAVVRVLDRLDAHLELLTIPVGGSATYHGLSVGVEACVSRPQTLAA 93 Query: 75 RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSES 134 A + + + R F GWM A P++ +YD+ ++ C + Sbjct: 94 DAGALLHLKDSSD--PQRPPFDGWMLAQEPSVATYGSPLYDVRVVSCAGAPTAPQAGPLP 151 Query: 135 ISKKALSEYSSTDITSQG--SEKSSGSSSNKTL----EKESSQPLENNLSMD---LKGRP 185 + K + + + G GS+S + + + PL P Sbjct: 152 VVKAPVLASAEVPVEEGGDAPASQPGSASGGPVPLAPDSHNPIPLAPPSGAAPSLAPAMP 211 Query: 186 IQELGNNLS----DSGL 198 Q G LS D GL Sbjct: 212 AQPSGQPLSPPEADPGL 228 >gi|209543277|ref|YP_002275506.1| hypothetical protein Gdia_1108 [Gluconacetobacter diazotrophicus PAl 5] gi|209530954|gb|ACI50891.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 307 Score = 84.6 bits (208), Expect = 8e-15, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 36/104 (34%), Gaps = 3/104 (2%) Query: 18 KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRID 77 + VA +D + V + + + Q + +L I C R Sbjct: 30 VYPADTWQGRSVATVRVLDGLDSHVQSLTIPVGQDVTYRALTIHVGACRDRPATLVPDSA 89 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +++I + D F GWM A P + +Y + ++ C Sbjct: 90 GWLTIRDTRQDGRG---FDGWMLAGEPFLGVFQDPVYTVQIVSC 130 >gi|162146736|ref|YP_001601195.1| hypothetical protein GDI_0914 [Gluconacetobacter diazotrophicus PAl 5] gi|161785311|emb|CAP54857.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 334 Score = 84.6 bits (208), Expect = 8e-15, Method: Composition-based stats. Identities = 19/104 (18%), Positives = 36/104 (34%), Gaps = 3/104 (2%) Query: 18 KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRID 77 + VA +D + V + + + Q + +L I C R Sbjct: 30 VYPADTWQGRSVATVRVLDGLDSHVQSLTIPVGQDVTYRALTIHVGACRDRPATLVPDSA 89 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +++I + D F GWM A P + +Y + ++ C Sbjct: 90 GWLTIRDTRQDGRG---FDGWMLAGEPFLGVFQDPVYTVQIVSC 130 >gi|157964439|ref|YP_001499263.1| hypothetical protein RMA_0506 [Rickettsia massiliae MTU5] gi|157844215|gb|ABV84716.1| hypothetical protein RMA_0506 [Rickettsia massiliae MTU5] Length = 162 Score = 81.5 bits (200), Expect = 7e-14, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 49/111 (44%), Gaps = 1/111 (0%) Query: 12 FVFSHAKFANSARFAN-KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDD 70 + + +SA F N + ++ IT D ++ + FG++ IK C D Sbjct: 51 ILNPNDNINDSAEFKNYTNGKIIALNNITATSEEIDFKVGEEKYFGNIKIKLHRCIKNLD 110 Query: 71 REAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 + ++I+E D +F GWM + S +++ +H IY+I+ C Sbjct: 111 PYNEDNYLLMTITEYKIDEDPNLLFQGWMISSSISLSMFEHPIYEIFAKDC 161 >gi|114328397|ref|YP_745554.1| hypothetical protein GbCGDNIH1_1733 [Granulibacter bethesdensis CGDNIH1] gi|114316571|gb|ABI62631.1| hypothetical secreted protein [Granulibacter bethesdensis CGDNIH1] Length = 237 Score = 79.2 bits (194), Expect = 3e-13, Method: Composition-based stats. Identities = 21/98 (21%), Positives = 38/98 (38%), Gaps = 3/98 (3%) Query: 24 RFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSIS 83 + ++K++ RV ++ G L + C Q A++ Sbjct: 142 WQPGHSVQLQILEKLSDRVSRVTLKDGDRHTIGHLTVVMRNCLKHAAEAPQDFAAWL--- 198 Query: 84 EIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQC 121 +I D FSGWM A P + + +YD+ +M+C Sbjct: 199 DITADTEGAPRFSGWMLAKEPWVAVYESPLYDVRVMRC 236 >gi|218680000|ref|ZP_03527897.1| hypothetical protein RetlC8_14346 [Rhizobium etli CIAT 894] Length = 40 Score = 61.1 bits (147), Expect = 8e-08, Method: Composition-based stats. Identities = 18/34 (52%), Positives = 22/34 (64%), Gaps = 4/34 (11%) Query: 99 MFADSPAMNAIDHSIYDIWLMQCKD----PINDS 128 MFA SP +NA++H IYD+WL CK P DS Sbjct: 1 MFAASPGLNAVEHPIYDVWLKDCKTNSDVPAPDS 34 >gi|326402345|ref|YP_004282426.1| hypothetical protein ACMV_01970 [Acidiphilium multivorum AIU301] gi|325049206|dbj|BAJ79544.1| hypothetical protein ACMV_01970 [Acidiphilium multivorum AIU301] Length = 193 Score = 57.6 bits (138), Expect = 9e-07, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 32/108 (29%), Gaps = 3/108 (2%) Query: 15 SHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ 74 + K + A ++K G V + S G+L + C R Sbjct: 87 APPKQVKPIWDPRQAAILDVLEKADGAVNRIIAPVGSSFTEGALRVTIGACVVRPADMPP 146 Query: 75 RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 ++++ +F GW+ P + + L+ C Sbjct: 147 DAAVYMTVRHGMAAPD---LFRGWLIRSEPGATVVGDAAVTFRLIGCS 191 >gi|148259192|ref|YP_001233319.1| hypothetical protein Acry_0172 [Acidiphilium cryptum JF-5] gi|146400873|gb|ABQ29400.1| hypothetical protein Acry_0172 [Acidiphilium cryptum JF-5] Length = 193 Score = 57.6 bits (138), Expect = 9e-07, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 32/108 (29%), Gaps = 3/108 (2%) Query: 15 SHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQ 74 + K + A ++K G V + S G+L + C R Sbjct: 87 APPKQVKPIWDPRQAAILDVLEKADGAVNRIIAPVGSSFTEGALRVTIGACVVRPADMPP 146 Query: 75 RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCK 122 ++++ +F GW+ P + + L+ C Sbjct: 147 DAAVYMTVRHGMAAPD---LFRGWLIRSEPGATVVGDAAVTFRLIGCS 191 >gi|319790199|ref|YP_004151832.1| hypothetical protein Theam_1227 [Thermovibrio ammonificans HB-1] gi|317114701|gb|ADU97191.1| hypothetical protein Theam_1227 [Thermovibrio ammonificans HB-1] Length = 248 Score = 56.5 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 27/103 (26%), Positives = 43/103 (41%), Gaps = 15/103 (14%) Query: 28 KVAEFAGMDKITGRV-LTFDVEINQSAQFGSLIIKPMVC---------YSRDDREAQRID 77 K A +DK TG+V F V Q+ +G L IK + Y+ E Q Sbjct: 147 KHATIDIVDKTTGKVVKEFKVSKGQTVNYGGLEIKILYIVPHLVLDNGYTSASNEPQNPA 206 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 V + E ++I++G ++ P M I+H Y++ L Sbjct: 207 ILVEVKE-----NGKTIYAGPIYQKFPTMYNINHPRYELILKN 244 >gi|222054292|ref|YP_002536654.1| hypothetical protein Geob_1193 [Geobacter sp. FRC-32] gi|221563581|gb|ACM19553.1| conserved hypothetical protein [Geobacter sp. FRC-32] Length = 161 Score = 53.8 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 22/106 (20%), Positives = 36/106 (33%), Gaps = 17/106 (16%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQF--GSLIIKPM----------VCYSR 68 ++ + K + A DK T + + V I G+L +K + Sbjct: 50 DNVKGKWKAVKIAVTDKTTKKDTIYTVNIGAEVTLPGGNLTLKVDNFLPQFVMEGTTLTS 109 Query: 69 DDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIY 114 E + A + I + IF GW+F P +A H Y Sbjct: 110 QSNEPKNPAA--QVRVIENGKE---IFKGWLFTLYPTTHAFQHPRY 150 >gi|148262371|ref|YP_001229077.1| hypothetical protein Gura_0288 [Geobacter uraniireducens Rf4] gi|146395871|gb|ABQ24504.1| hypothetical protein Gura_0288 [Geobacter uraniireducens Rf4] Length = 157 Score = 50.7 bits (120), Expect = 1e-04, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 37/112 (33%), Gaps = 17/112 (15%) Query: 21 NSARFANKVAEFAGMDKITGRVLTFDVEINQSAQF--GSLIIKPM----------VCYSR 68 +S + K + A DK T + + V I +L IK + Sbjct: 46 DSVKGKWKAVKIAVTDKNTKKDTVYTVNIGSELALPNSNLTIKVENFLPHFMMEGTTLTS 105 Query: 69 DDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 E + A + I + IF GW+F P +A H Y L+ Sbjct: 106 QSNEPKNPAA--QVRVIENGKE---IFKGWLFTLYPTTHAFQHPRYGFTLVD 152 >gi|39995149|ref|NP_951100.1| putative lipoprotein [Geobacter sulfurreducens PCA] gi|39981911|gb|AAR33373.1| lipoprotein, putative [Geobacter sulfurreducens PCA] gi|298504179|gb|ADI82902.1| lipoprotein, putative [Geobacter sulfurreducens KN400] Length = 162 Score = 44.9 bits (105), Expect = 0.007, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 39/118 (33%), Gaps = 17/118 (14%) Query: 15 SHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQF--GSLIIKPM--------- 63 S ++ + K + A DK + + + + +L I Sbjct: 45 SVVVVPDNVKGKWKSVKIAVTDKAANKESVYTINVGAELAIPESNLTIAVDNFLPHFTMD 104 Query: 64 -VCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 + E + A + I E + +F GW+F+ P +A +H Y L+ Sbjct: 105 GTTLTSQSNEPKNPAAQIRILEGGKE-----VFKGWLFSLYPTTHAFNHPKYGFTLVD 157 >gi|253698746|ref|YP_003019935.1| hypothetical protein GM21_0090 [Geobacter sp. M21] gi|251773596|gb|ACT16177.1| conserved hypothetical protein [Geobacter sp. M21] Length = 159 Score = 44.2 bits (103), Expect = 0.011, Method: Composition-based stats. Identities = 18/105 (17%), Positives = 34/105 (32%), Gaps = 17/105 (16%) Query: 28 KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPM------------VCYSRDDREAQR 75 K E A DK + + +++ + + + E Sbjct: 55 KAVEIAVSDKQHNQQKVYTLQLGSEVKIPGSNLTLRVENFLPHFVMEGTTLTSQSNELVN 114 Query: 76 IDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 A + I E + I+ GW+F+ P +A H +Y L+ Sbjct: 115 PAAQIVIRE-----DAKEIYKGWLFSLYPTTHAFQHPLYGFTLVD 154 >gi|197116509|ref|YP_002136936.1| lipoprotein [Geobacter bemidjiensis Bem] gi|197085869|gb|ACH37140.1| lipoprotein, putative [Geobacter bemidjiensis Bem] Length = 159 Score = 42.2 bits (98), Expect = 0.044, Method: Composition-based stats. Identities = 17/105 (16%), Positives = 34/105 (32%), Gaps = 17/105 (16%) Query: 28 KVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPM------------VCYSRDDREAQR 75 K E A DK + + +++ + + + + Sbjct: 55 KAVEIAVSDKQHNQQKVYTIKLGSELKIPGSNLTLRVENFLPHFVMEGTTLTSQSNQLVN 114 Query: 76 IDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 A + I E + I+ GW+F+ P +A H +Y L+ Sbjct: 115 PAAQIVIRE-----DAKEIYKGWLFSLYPTTHAFQHPLYGFTLVD 154 >gi|114776981|ref|ZP_01452001.1| hypothetical protein SPV1_06454 [Mariprofundus ferrooxydans PV-1] gi|114552502|gb|EAU54962.1| hypothetical protein SPV1_06454 [Mariprofundus ferrooxydans PV-1] Length = 168 Score = 41.1 bits (95), Expect = 0.090, Method: Composition-based stats. Identities = 18/100 (18%), Positives = 39/100 (39%), Gaps = 14/100 (14%) Query: 30 AEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPM------VCYSRDDREAQRI---DAFV 80 AE + K T ++ + + +A I+ + + + + + A V Sbjct: 63 AELVWLQKSTTHLVHTKLALGDAADVEGWHIRLLGLASGLRVKNSTFLDDENVHNPAALV 122 Query: 81 SISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 IS R + ++ GW+F + P + +D + +WL Sbjct: 123 EIS-----RGGKVVYRGWLFQEFPELFGLDDPEWKVWLKG 157 >gi|117925509|ref|YP_866126.1| hypothetical protein Mmc1_2219 [Magnetococcus sp. MC-1] gi|117609265|gb|ABK44720.1| hypothetical protein Mmc1_2219 [Magnetococcus sp. MC-1] Length = 168 Score = 40.7 bits (94), Expect = 0.13, Method: Composition-based stats. Identities = 17/99 (17%), Positives = 33/99 (33%), Gaps = 17/99 (17%) Query: 30 AEFAGMDKITGRVLTFDVEINQS------------AQFGSLIIKPMVCYSRDDREAQRID 77 F +DK T ++ F V + + A L+I Sbjct: 65 VRFQVLDKRTLKIHAFVVSVGEPTAAPWNGGVLVHAFVPDLLIY-QSQAIHGPDGHINPA 123 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDI 116 ++ + R + ++ GW+F + A DH +D+ Sbjct: 124 VWLELR----GRDHQLLYEGWLFVRDGSQVAWDHPRFDL 158 >gi|196019061|ref|XP_002118919.1| hypothetical protein TRIADDRAFT_62904 [Trichoplax adhaerens] gi|190577724|gb|EDV18595.1| hypothetical protein TRIADDRAFT_62904 [Trichoplax adhaerens] Length = 367 Score = 40.7 bits (94), Expect = 0.13, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 44/117 (37%), Gaps = 13/117 (11%) Query: 14 FSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQF-GSLIIKPMVCYSR-DDR 71 + + +A+ +D TG ++++ ++ + L + C D Sbjct: 251 ITKQSIIQGELESFNLAKIRILDYNTGHSSNKELKLEENLELTEGLFVNLKECKKDIKDT 310 Query: 72 EAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYD---IWLMQCKDPI 125 AF+S++ + I+ GW+F+ + ++ D I+L C + + Sbjct: 311 LNPVSMAFISVT-----NHDKIIYEGWIFSKNTSIAL---PKIDDGLIYLTSCDNQV 359 >gi|332296137|ref|YP_004438060.1| hypothetical protein Thena_1312 [Thermodesulfobium narugense DSM 14796] gi|332179240|gb|AEE14929.1| hypothetical protein Thena_1312 [Thermodesulfobium narugense DSM 14796] Length = 171 Score = 40.7 bits (94), Expect = 0.15, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 40/97 (41%), Gaps = 12/97 (12%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMD----------KITGRVLTFDVEIN 50 MK+ + +L L F+F+ + +A+ N + +D K TG++ + Sbjct: 1 MKFLIFILALIFLFTASAYADETN--NTYFQLLNLDVQADSSIYIPKGTGKITERKMGDE 58 Query: 51 QSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFT 87 ++F +I + ++ I V+++++ Sbjct: 59 DLSKFYDIIKSDLNAHNVGINPNSNIHLIVTVTDVKK 95 >gi|195470348|ref|XP_002087470.1| GE15931 [Drosophila yakuba] gi|194173571|gb|EDW87182.1| GE15931 [Drosophila yakuba] Length = 717 Score = 40.3 bits (93), Expect = 0.19, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 41/111 (36%), Gaps = 31/111 (27%) Query: 40 GRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWM 99 + + +E + L ++P C + + +E + + + + D M Sbjct: 366 AQTTSIKMEFEEE-----LKVEPEQCPNPETQENPDV---MEVDKQEQDPQ--------M 409 Query: 100 FADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSEYSSTDITS 150 F P N ++H+IY + S E ++ + +E+ S+ S Sbjct: 410 F---PGENTMEHTIYKLQ------------SEEEEVNPQPETEHLSSYFAS 445 >gi|157819983|ref|NP_001101208.1| sperm specific antigen 2 [Rattus norvegicus] gi|149022373|gb|EDL79267.1| sperm specific antigen 2 (predicted) [Rattus norvegicus] Length = 1255 Score = 39.5 bits (91), Expect = 0.25, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S+ K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSVGPKGVLLRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|78224709|ref|YP_386456.1| putative lipoprotein [Geobacter metallireducens GS-15] gi|78195964|gb|ABB33731.1| lipoprotein, putative [Geobacter metallireducens GS-15] Length = 163 Score = 39.5 bits (91), Expect = 0.28, Method: Composition-based stats. Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 5/54 (9%) Query: 67 SRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQ 120 + E + A + I E + +F GW+F+ P ++ H Y L+ Sbjct: 110 TSQSNEPKNPAAQIRIIEGGKE-----VFKGWLFSLYPTTHSFSHPKYGFTLVD 158 >gi|325295194|ref|YP_004281708.1| hypothetical protein Dester_1010 [Desulfurobacterium thermolithotrophum DSM 11699] gi|325065642|gb|ADY73649.1| hypothetical protein Dester_1010 [Desulfurobacterium thermolithotrophum DSM 11699] Length = 217 Score = 39.2 bits (90), Expect = 0.41, Method: Composition-based stats. Identities = 23/102 (22%), Positives = 37/102 (36%), Gaps = 15/102 (14%) Query: 28 KVAEFAGMDKITG-RVLTFDVEINQSAQFGSLIIKPMVC---------YSRDDREAQRID 77 K A +DK TG V V + +F L IK + Y+ E Sbjct: 116 KYATIEVVDKTTGKVVKKEKVTKDSDVKFQDLEIKVLYIVPHLVYDQQYTSGSNEPNNPA 175 Query: 78 AFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLM 119 V + + I++G ++ P M I H Y++ L+ Sbjct: 176 VIVEVKS-----NGKVIYAGPIYQKFPTMYNIKHPKYELKLV 212 >gi|122889500|emb|CAM14507.1| sperm specific antigen 2 [Mus musculus] gi|123232496|emb|CAM17565.1| sperm specific antigen 2 [Mus musculus] Length = 1219 Score = 39.2 bits (90), Expect = 0.42, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|34850469|dbj|BAC87833.1| KRAP [Mus musculus] Length = 1252 Score = 39.2 bits (90), Expect = 0.42, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|134047942|sp|Q922B9|SSFA2_MOUSE RecName: Full=Sperm-specific antigen 2 homolog; AltName: Full=Ki-ras-induced actin-interacting protein gi|146327765|gb|AAI41884.1| Sperm specific antigen 2 [Mus musculus] gi|148695302|gb|EDL27249.1| sperm specific antigen 2, isoform CRA_a [Mus musculus] Length = 1252 Score = 38.8 bits (89), Expect = 0.44, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|115305112|gb|AAI22520.1| Ssfa2 protein [Mus musculus] Length = 1252 Score = 38.8 bits (89), Expect = 0.44, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|26006285|dbj|BAC41485.1| mKIAA1927 protein [Mus musculus] Length = 1248 Score = 38.8 bits (89), Expect = 0.44, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 74 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 133 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 134 AKERRLQFHQKGRSMNSTGSGKS 156 >gi|122889499|emb|CAM14506.1| sperm specific antigen 2 [Mus musculus] gi|123232495|emb|CAM17564.1| sperm specific antigen 2 [Mus musculus] Length = 1230 Score = 38.8 bits (89), Expect = 0.45, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|194473671|ref|NP_542125.3| sperm-specific antigen 2 homolog [Mus musculus] gi|122889498|emb|CAM14505.1| sperm specific antigen 2 [Mus musculus] gi|123232494|emb|CAM17563.1| sperm specific antigen 2 [Mus musculus] Length = 1252 Score = 38.8 bits (89), Expect = 0.45, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|74143785|dbj|BAE41220.1| unnamed protein product [Mus musculus] Length = 943 Score = 38.8 bits (89), Expect = 0.56, Method: Composition-based stats. Identities = 19/83 (22%), Positives = 32/83 (38%), Gaps = 4/83 (4%) Query: 116 IWLMQCKDPINDSISNSESISKKALSEYSSTDIT---SQGSEKSSGSSSNKTLEK-ESSQ 171 IWL C+ P+ S+ S + K + + S G+E + + +E + Sbjct: 78 IWLKDCRTPLGASLDEQSSGTPKGVLVRNGGSFEDDLSLGAEANHLHEPDAQVENCNNIL 137 Query: 172 PLENNLSMDLKGRPIQELGNNLS 194 E L KGR + G+ S Sbjct: 138 AKERRLQFHQKGRSMNSTGSGKS 160 >gi|168178342|ref|ZP_02613006.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916] gi|182670599|gb|EDT82573.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916] Length = 143 Score = 38.4 bits (88), Expect = 0.63, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 28/78 (35%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 N K G+ KIT + ++D++IN + G L IK + + Sbjct: 51 NSSGNEYNVKFKYFNGKGVKKITSKKSSYDIKINSKIESGDLNIKIYDDKKTLFNKNGTL 110 Query: 77 DAFVSISEIFTDRIVRSI 94 D + IS + I Sbjct: 111 DETIRISNTDNKEVKIEI 128 >gi|170754667|ref|YP_001780571.1| hypothetical protein CLD_3621 [Clostridium botulinum B1 str. Okra] gi|169119879|gb|ACA43715.1| conserved hypothetical protein [Clostridium botulinum B1 str. Okra] Length = 143 Score = 38.0 bits (87), Expect = 0.82, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 28/78 (35%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 N K G+ KIT + ++D++IN + G L IK + + Sbjct: 51 NSSGNEYNVKFKYFNGKGVKKITSKKSSYDIKINSKIESGDLNIKIYDDKKTLFNKNGTL 110 Query: 77 DAFVSISEIFTDRIVRSI 94 D + IS + I Sbjct: 111 DETIRISNTDNKEVKIEI 128 >gi|268557756|ref|XP_002636868.1| C. briggsae CBR-ASPM-1 protein [Caenorhabditis briggsae] gi|187031942|emb|CAP29242.1| CBR-ASPM-1 protein [Caenorhabditis briggsae AF16] Length = 1275 Score = 38.0 bits (87), Expect = 0.87, Method: Composition-based stats. Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 1/67 (1%) Query: 115 DIWLMQCKDPINDSISNSESISKKALSEYSSTDIT-SQGSEKSSGSSSNKTLEKESSQPL 173 D+ C++ I D ++SESI+ +E +S D + G + S N + + L Sbjct: 781 DVQNADCEEVIEDLEASSESITPDKNNEEASEDHENAHGPVEISPEDVNVLKNDFTPEVL 840 Query: 174 ENNLSMD 180 EN++ D Sbjct: 841 ENDIVAD 847 >gi|198427746|ref|XP_002130249.1| PREDICTED: similar to RNA polymerase I-specific transcription initiation factor RRN3 (Transcription initiation factor IA) (TIF-IA) [Ciona intestinalis] Length = 588 Score = 37.6 bits (86), Expect = 1.1, Method: Composition-based stats. Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 4/69 (5%) Query: 107 NAIDHSIYDIWLMQ---CKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNK 163 + H IY +W + C+D ND N E+I + L + + I S + S + Sbjct: 520 STFIHPIYKVWEGRSPHCEDEDNDD-PNKENIEDQGLFDEDDSGIKGSFSNQVPPSPLSP 578 Query: 164 TLEKESSQP 172 + + P Sbjct: 579 GFQHVTPSP 587 >gi|187778057|ref|ZP_02994530.1| hypothetical protein CLOSPO_01649 [Clostridium sporogenes ATCC 15579] gi|187774985|gb|EDU38787.1| hypothetical protein CLOSPO_01649 [Clostridium sporogenes ATCC 15579] Length = 143 Score = 37.2 bits (85), Expect = 1.3, Method: Composition-based stats. Identities = 18/78 (23%), Positives = 28/78 (35%), Gaps = 2/78 (2%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 N K G+ KIT + ++D++IN + G L IK + + Sbjct: 51 NSSGNEYNIKFKHFNGKGVKKITSKKSSYDIKINSKIESGDLNIKIYDNKRTLFNKNGTL 110 Query: 77 DAFVSISEIFTDRIVRSI 94 D +I TD I Sbjct: 111 DE--TIRIPNTDNKDVKI 126 >gi|66475652|ref|XP_627642.1| cullin domain containing protein [Cryptosporidium parvum Iowa II] gi|32398872|emb|CAD98582.1| hypothetical predicted protein, unknown function [Cryptosporidium parvum] gi|46229077|gb|EAK89926.1| cullin domain containing protein [Cryptosporidium parvum Iowa II] Length = 1467 Score = 37.2 bits (85), Expect = 1.3, Method: Composition-based stats. Identities = 20/88 (22%), Positives = 38/88 (43%), Gaps = 3/88 (3%) Query: 110 DHSIYDIWLMQCKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKES 169 +H+ YDI + +C+ I ++ + +++ L I SQG +K S S+ ++ Sbjct: 1370 EHANYDI-IKECELLIRATLQLNGAMAPAVLFARVRAAIASQGEDKFSQKDSDDHQGSKA 1428 Query: 170 SQPLENNLSMDLKGRPIQELGNNLSDSG 197 S + L + NN+ D G Sbjct: 1429 S--TNTDTQYTLTWPQHVQAINNMVDRG 1454 >gi|262403712|ref|ZP_06080270.1| multidrug resistance efflux pump [Vibrio sp. RC586] gi|262350216|gb|EEY99351.1| multidrug resistance efflux pump [Vibrio sp. RC586] Length = 354 Score = 37.2 bits (85), Expect = 1.3, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 26/71 (36%), Gaps = 3/71 (4%) Query: 1 MKYRV-LLLILFFVFSHAKFANSARFANKVA--EFAGMDKITGRVLTFDVEINQSAQFGS 57 M+ + L ++LFF A + +V +++G+V + NQ G Sbjct: 11 MRTLIVLFIVLFFYIIFADQHSPITTEGRVQGYVVQVAPEVSGKVTQVQIRNNQQVHQGD 70 Query: 58 LIIKPMVCYSR 68 ++ R Sbjct: 71 VLFTIDARKYR 81 >gi|217968174|ref|YP_002353680.1| 5'-nucleotidase [Dictyoglomus turgidum DSM 6724] gi|217337273|gb|ACK43066.1| 5'-nucleotidase [Dictyoglomus turgidum DSM 6724] Length = 504 Score = 37.2 bits (85), Expect = 1.4, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 42/103 (40%), Gaps = 16/103 (15%) Query: 2 KYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKIT---GRVLTFDVE-INQSAQFGS 57 K LLLI+FF+FS FA K E + I GR+ + V+ I+++ G Sbjct: 5 KKFSLLLIVFFLFSSLIFAQEL----KPIEIKILH-INDFHGRLQPYIVKSISETIPVGG 59 Query: 58 ---LIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSG 97 L + + + +S ++F + +IF G Sbjct: 60 GAYLSYLI----NEERSKNPDGTILLSAGDMFQGTPISNIFKG 98 >gi|168182789|ref|ZP_02617453.1| conserved hypothetical protein [Clostridium botulinum Bf] gi|237794236|ref|YP_002861788.1| hypothetical protein CLJ_B0990 [Clostridium botulinum Ba4 str. 657] gi|182673985|gb|EDT85946.1| conserved hypothetical protein [Clostridium botulinum Bf] gi|229262600|gb|ACQ53633.1| conserved hypothetical protein [Clostridium botulinum Ba4 str. 657] Length = 143 Score = 37.2 bits (85), Expect = 1.5, Method: Composition-based stats. Identities = 17/78 (21%), Positives = 28/78 (35%) Query: 17 AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRI 76 N K G+ KIT + ++D++IN + G L IK + + Sbjct: 51 NSNGNEYNIKFKYFNGKGVKKITSKKSSYDIKINSKIESGDLNIKIYDDKKILFNKNGTL 110 Query: 77 DAFVSISEIFTDRIVRSI 94 D + IS + I Sbjct: 111 DETIRISNTDDKDVKIEI 128 >gi|229175216|ref|ZP_04302732.1| Sodium export permease protein [Bacillus cereus MM3] gi|228608352|gb|EEK65658.1| Sodium export permease protein [Bacillus cereus MM3] Length = 407 Score = 37.2 bits (85), Expect = 1.5, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 15/105 (14%) Query: 5 VLLLILFFVFSHAKFANSARFANKVAEFAGMDKIT--GRVLTFDVEINQSAQF---GSLI 59 +L LI+F +F+ F +S DKI T+ ++ + + L Sbjct: 27 ILFLIVFGIFAFNHFTSSNDKNKDK------DKIAVVTESSTYKIQKEELTKLLPSAKLT 80 Query: 60 IKPMVCYS--RDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFAD 102 I ++ E +D ++E V +F+G FA Sbjct: 81 IGSKEDFNKLHKQVEEGELDGLFRVTEKNGVPEVTYMFNG--FAS 123 >gi|261403755|ref|YP_003247979.1| hypothetical protein Metvu_1644 [Methanocaldococcus vulcanius M7] gi|261370748|gb|ACX73497.1| hypothetical protein Metvu_1644 [Methanocaldococcus vulcanius M7] Length = 548 Score = 36.8 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 19/79 (24%), Positives = 28/79 (35%), Gaps = 18/79 (22%) Query: 1 MKYRVLLLILFFV-FSHAKFANSARFANKVAE------FAGMDKITGRVLTF-------- 45 MK +L LIL F+ H FA+ +A + DKI ++ Sbjct: 1 MKKVILFLILIFIYLFHPLFADENISIEGMATNGTDVMISVYDKINSKMYEILYNGKNFE 60 Query: 46 ---DVEINQSAQFGSLIIK 61 IN+S F + I Sbjct: 61 VILKFPINESELFNNSKIN 79 >gi|126699903|ref|YP_001088800.1| two-component sensor histidine kinase [Clostridium difficile 630] gi|115251340|emb|CAJ69172.1| Two-component sensor histidine kinase [Clostridium difficile] Length = 689 Score = 36.8 bits (84), Expect = 1.9, Method: Composition-based stats. Identities = 9/95 (9%), Positives = 26/95 (27%), Gaps = 9/95 (9%) Query: 9 ILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKP------ 62 + + S+A + + ++KI G+ + + +L ++ Sbjct: 137 VFYTNTSYASLYDFKQKTKSYCNVEIINKI-GKSSYIKTINGEKFELNNLSLELNEVDEG 195 Query: 63 MVCYSRDDREA--QRIDAFVSISEIFTDRIVRSIF 95 Y E + + + +F Sbjct: 196 FEAYVSFPEEPTIEDGIVYTNFQIFKQATEKVRLF 230 >gi|255101431|ref|ZP_05330408.1| two-component sensor histidine kinase [Clostridium difficile QCD-63q42] gi|255307304|ref|ZP_05351475.1| two-component sensor histidine kinase [Clostridium difficile ATCC 43255] Length = 686 Score = 36.8 bits (84), Expect = 2.0, Method: Composition-based stats. Identities = 9/95 (9%), Positives = 26/95 (27%), Gaps = 9/95 (9%) Query: 9 ILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKP------ 62 + + S+A + + ++KI G+ + + +L ++ Sbjct: 134 VFYTNTSYASLYDFKQKTKSYCNVEIINKI-GKSSYIKTINGEKFELNNLSLELNEVDEG 192 Query: 63 MVCYSRDDREA--QRIDAFVSISEIFTDRIVRSIF 95 Y E + + + +F Sbjct: 193 FEAYVSFPEEPTIEDGIVYTNFQIFKQATEKVRLF 227 >gi|225619012|ref|YP_002720238.1| aerobic-type carbon monoxide dehydrogenase large subunit CoxL/CutL [Brachyspira hyodysenteriae WA1] gi|152963776|gb|ABS50203.1| CoxL [Brachyspira hyodysenteriae] gi|225213831|gb|ACN82565.1| aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL s [Brachyspira hyodysenteriae WA1] Length = 711 Score = 36.8 bits (84), Expect = 2.0, Method: Composition-based stats. Identities = 12/80 (15%), Positives = 25/80 (31%), Gaps = 4/80 (5%) Query: 23 ARFANKVAEFAGMDKITGRVLTF-DVEINQSAQFGSLIIKPM---VCYSRDDREAQRIDA 78 N V+ +DKITG+ D++ + ++ + E + Sbjct: 6 KELKNSVSRVDALDKITGKTKYLNDIDFGKEVLHAKIVHSTKARAKILKINIPELPEGYS 65 Query: 79 FVSISEIFTDRIVRSIFSGW 98 + ++ I S W Sbjct: 66 VIDYKDVPGKNAATMIISDW 85 >gi|262173840|ref|ZP_06041517.1| multidrug resistance efflux pump [Vibrio mimicus MB-451] gi|261891198|gb|EEY37185.1| multidrug resistance efflux pump [Vibrio mimicus MB-451] Length = 354 Score = 36.8 bits (84), Expect = 2.0, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 3/71 (4%) Query: 1 MKYRV-LLLILFFVFSHAKFANSARFANKVA--EFAGMDKITGRVLTFDVEINQSAQFGS 57 M+ + L ++LFF A +V +++G+V + NQ G Sbjct: 11 MRTLIVLFIVLFFYIIFADQHAPITTEGRVQGYVVQVAPEVSGKVTQVQIRNNQQVHQGD 70 Query: 58 LIIKPMVCYSR 68 ++ R Sbjct: 71 VLFTIDARKYR 81 >gi|258620758|ref|ZP_05715793.1| putative secretion protein [Vibrio mimicus VM573] gi|258586956|gb|EEW11670.1| putative secretion protein [Vibrio mimicus VM573] Length = 358 Score = 36.8 bits (84), Expect = 2.0, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 3/71 (4%) Query: 1 MKYRV-LLLILFFVFSHAKFANSARFANKVA--EFAGMDKITGRVLTFDVEINQSAQFGS 57 M+ + L ++LFF A +V +++G+V + NQ G Sbjct: 15 MRTLIVLFIVLFFYIIFADQHAPITTEGRVQGYVVQVAPEVSGKVTQVQIRNNQQVHQGD 74 Query: 58 LIIKPMVCYSR 68 ++ R Sbjct: 75 VLFTIDARKYR 85 >gi|258625295|ref|ZP_05720198.1| putative secretion protein [Vibrio mimicus VM603] gi|258582403|gb|EEW07249.1| putative secretion protein [Vibrio mimicus VM603] Length = 344 Score = 36.8 bits (84), Expect = 2.0, Method: Composition-based stats. Identities = 13/71 (18%), Positives = 25/71 (35%), Gaps = 3/71 (4%) Query: 1 MKYRV-LLLILFFVFSHAKFANSARFANKVA--EFAGMDKITGRVLTFDVEINQSAQFGS 57 M+ + L ++LFF A +V +++G+V + NQ G Sbjct: 1 MRTLIVLFIVLFFYIIFADQHAPITTEGRVQGYVVQVAPEVSGKVTQVQIRNNQQVHQGD 60 Query: 58 LIIKPMVCYSR 68 ++ R Sbjct: 61 VLFTIDARKYR 71 >gi|310778137|ref|YP_003966470.1| hypothetical protein Ilyop_0333 [Ilyobacter polytropus DSM 2926] gi|309747460|gb|ADO82122.1| conserved hypothetical protein [Ilyobacter polytropus DSM 2926] Length = 213 Score = 36.5 bits (83), Expect = 2.5, Method: Composition-based stats. Identities = 14/63 (22%), Positives = 28/63 (44%), Gaps = 2/63 (3%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 MK + +LI F + S + FA SA+ + + ++G + + ++A L I Sbjct: 1 MKKFITILIGFLLLSFSAFAASAQISVPGKNIPNEENVSG--VRLSLLHGETAIVKGLDI 58 Query: 61 KPM 63 + Sbjct: 59 SVL 61 >gi|315634087|ref|ZP_07889376.1| hypothetical protein HMPREF9064_0743 [Aggregatibacter segnis ATCC 33393] gi|315477337|gb|EFU68080.1| hypothetical protein HMPREF9064_0743 [Aggregatibacter segnis ATCC 33393] Length = 170 Score = 36.5 bits (83), Expect = 2.7, Method: Composition-based stats. Identities = 13/100 (13%), Positives = 31/100 (31%), Gaps = 9/100 (9%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEF-AGMDKITGRVLTFDVEINQSAQFGSLI 59 MK V + + F + A + +DK + + + F + + Sbjct: 1 MKLFVFIFLSFIFSCNTVVAAEKNIQGIQNQLEQQVDKKNSNAQSVSLGV-----FQNYV 55 Query: 60 IKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSI-FSGW 98 + + + A+V + ++ + I F W Sbjct: 56 VVGFEGREIAVDDNNQ--AYVLVKFTVENKSNKPIRFLQW 93 >gi|294656912|ref|XP_002770330.1| DEHA2D17314p [Debaryomyces hansenii CBS767] gi|199431834|emb|CAR65684.1| DEHA2D17314p [Debaryomyces hansenii] Length = 1309 Score = 35.7 bits (81), Expect = 3.8, Method: Composition-based stats. Identities = 25/94 (26%), Positives = 40/94 (42%), Gaps = 4/94 (4%) Query: 105 AMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSEYSSTDITSQGSEKSSGSSSNKT 164 A+N++ + IYDI ++ K+ IND SN + + K T + S N Sbjct: 787 ALNSLKNPIYDI--VRIKNDIND--SNRQIEALKDELSEYGVSKTPLDELQQLQQSKNME 842 Query: 165 LEKESSQPLENNLSMDLKGRPIQELGNNLSDSGL 198 ++ Q E N K + + L NN+ D L Sbjct: 843 IKDLRIQINEINELKFTKQKELARLENNIKDKQL 876 >gi|294781899|ref|ZP_06747231.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA] gi|294481710|gb|EFG29479.1| conserved hypothetical protein [Fusobacterium sp. 1_1_41FAA] Length = 272 Score = 35.7 bits (81), Expect = 3.9, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 38/94 (40%), Gaps = 8/94 (8%) Query: 2 KYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIK 61 ++ +LL LF +F++ + + NK +D+ + +TF ++N++ +L Sbjct: 3 RFLILLFSLFSIFTYGANSVNEVEVNKYIREK-LDR--DKTITFTTKLNKTNN--TLEGY 57 Query: 62 PMV---CYSRDDREAQRIDAFVSISEIFTDRIVR 92 C + + + + +++ + Sbjct: 58 SDEGVLCAITPLDKQPDMINLLQVKSTISEKNGK 91 >gi|15602053|ref|NP_245125.1| hypothetical protein PM0188 [Pasteurella multocida subsp. multocida str. Pm70] gi|12720409|gb|AAK02272.1| unknown [Pasteurella multocida subsp. multocida str. Pm70] Length = 412 Score = 35.7 bits (81), Expect = 4.1, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 36/108 (33%), Gaps = 21/108 (19%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGM-----------DKITGRV---LTFD 46 + +++ LI+F +FS ++ + A + DK R+ F Sbjct: 6 LNFKLFFLIIFSLFSTLSWSKTITLYLDPASLPALNQLMDFTQNNEDKTHPRIFGLSRFK 65 Query: 47 VEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSI 94 + N Q+ ++ + +A +I + + I I Sbjct: 66 IPDNIITQYQNIHFVELK------DNRP-TEALFTILDQYPGNIELDI 106 >gi|68271071|gb|AAY89061.1| alpha-2,3/2,6-sialyltransferase/sialidase [Pasteurella multocida] Length = 412 Score = 35.7 bits (81), Expect = 4.2, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 37/108 (34%), Gaps = 21/108 (19%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGM-----------DKITGRV---LTFD 46 + +++ LI+F +FS ++ + A + DK R+ F Sbjct: 6 LNFKLFFLIIFSLFSTLSWSKTITLYLDPASLPALNQLMDFTQNNEDKTHPRIFGLSRFK 65 Query: 47 VEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSI 94 + N Q+ ++ + +A +I + + I +I Sbjct: 66 IPDNIITQYQNIHFVELK------DNRP-TEALFTILDQYPGNIELNI 106 >gi|88606817|ref|YP_504682.1| hypothetical protein APH_0049 [Anaplasma phagocytophilum HZ] gi|88597880|gb|ABD43350.1| hypothetical protein APH_0049 [Anaplasma phagocytophilum HZ] Length = 2269 Score = 35.3 bits (80), Expect = 5.2, Method: Composition-based stats. Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 3/83 (3%) Query: 131 NSESISKKALSEYSSTDITSQGSEKSSGSSSNKTLEKESSQPLENNLSMD---LKGRPIQ 187 N+ +SKK + ++T + EKS SN T E ++ L ++D + Sbjct: 621 NTVHLSKKDAVDKPHVNVTQKAEEKSDSHDSNNTSENRNTVHLSKKDAVDEPYVHTTQKA 680 Query: 188 ELGNNLSDSGLNEQDHNDVQISK 210 E +N DS ++ N V +SK Sbjct: 681 EEKSNSHDSNNTSENRNTVHLSK 703 >gi|253581818|ref|ZP_04859042.1| peptidase M23B [Fusobacterium varium ATCC 27725] gi|251836167|gb|EES64704.1| peptidase M23B [Fusobacterium varium ATCC 27725] Length = 279 Score = 35.3 bits (80), Expect = 5.8, Method: Composition-based stats. Identities = 18/92 (19%), Positives = 34/92 (36%), Gaps = 15/92 (16%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKIT-GRVLTFDVEINQ--SAQFGS 57 MK + LLL +FF + FA+ + DK+ G + N+ F + Sbjct: 1 MKMKKLLLFIFFTLTILSFASENIIFSD-------DKVNQGGFFYIEYPANKNYEITFKN 53 Query: 58 LIIKPMVCYSRDDREAQRIDAFVSISEIFTDR 89 +IK ++ + AF+ + + Sbjct: 54 SLIKIKS-----FKDNNKKIAFIPVHYSTPEG 80 >gi|225850195|ref|YP_002730429.1| thermonuclease [Persephonella marina EX-H1] gi|225646183|gb|ACO04369.1| thermonuclease (TNase) (Micrococcal nuclease)(Staphylococcal nuclease) [Persephonella marina EX-H1] Length = 192 Score = 35.3 bits (80), Expect = 6.2, Method: Composition-based stats. Identities = 11/79 (13%), Positives = 20/79 (25%), Gaps = 4/79 (5%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFAN----KVAEFAGMDKITGRVLTFDVEINQSAQFG 56 MK ++L + F K +D T V +V+ N + Sbjct: 1 MKVKILFFLCFLAIITLSEGKEVWKPPKEFVKAKVLRVIDGDTVVVSIPEVKFNNRKKLK 60 Query: 57 SLIIKPMVCYSRDDREAQR 75 +L + Sbjct: 61 NLRFTVRLIGIDTPESRPN 79 >gi|296125540|ref|YP_003632792.1| nitrate/sulfonate/bicarbonate ABC transporter periplasmic protein [Brachyspira murdochii DSM 12563] gi|296017356|gb|ADG70593.1| ABC-type nitrate/sulfonate/bicarbonate transport system, periplasmic component [Brachyspira murdochii DSM 12563] Length = 299 Score = 34.5 bits (78), Expect = 8.7, Method: Composition-based stats. Identities = 15/56 (26%), Positives = 25/56 (44%), Gaps = 8/56 (14%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGM------DKITGRVLTFDVEIN 50 MK L+LI+ FVF ++ ++ N G+ +KIT + T + N Sbjct: 1 MKNLKLILIILFVFINSLYSQKMYLLNGPTSIGGLKMMKEYNKIT--INTVNAPNN 54 >gi|225621010|ref|YP_002722268.1| hypothetical protein BHWA1_02106 [Brachyspira hyodysenteriae WA1] gi|225215830|gb|ACN84564.1| hypothetical protein BHWA1_02106 [Brachyspira hyodysenteriae WA1] Length = 105 Score = 34.5 bits (78), Expect = 9.0, Method: Composition-based stats. Identities = 14/104 (13%), Positives = 32/104 (30%), Gaps = 11/104 (10%) Query: 1 MKYRVLLLILFFVFSH-------AKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSA 53 ++ +L L +F +F +++ +K F K++G++ + Sbjct: 2 IRLFILFLSIFCIFILGCNHKILNPNIDNSNNKSKTMYFKF--KVSGKLSLNKFKYGNEF 59 Query: 54 QFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSG 97 + D A +SI + + F G Sbjct: 60 FRNNFSCTIP--LKDYDESKNNNTALISIVLYPDNNTYKFTFDG 101 >gi|297545482|ref|YP_003677784.1| hypothetical protein Tmath_2099 [Thermoanaerobacter mathranii subsp. mathranii str. A3] gi|296843257|gb|ADH61773.1| hypothetical protein Tmath_2099 [Thermoanaerobacter mathranii subsp. mathranii str. A3] Length = 262 Score = 34.5 bits (78), Expect = 9.0, Method: Composition-based stats. Identities = 15/95 (15%), Positives = 31/95 (32%), Gaps = 13/95 (13%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLII 60 +K +LLL++ F S + + + K T + + IN ++ Sbjct: 22 VKKLILLLVIAFFLSISILSLAFATDLK----------TSSLNDKKLMINSPQEYE--SY 69 Query: 61 KPMVCYSRDDREAQRID-AFVSISEIFTDRIVRSI 94 + + E I A + D+ + I Sbjct: 70 LIQKANNSNSEEKSAIVNALYKYKSLTRDKQEKFI 104 >gi|256545071|ref|ZP_05472438.1| aminoacyl-histidine dipeptidase [Anaerococcus vaginalis ATCC 51170] gi|256399274|gb|EEU12884.1| aminoacyl-histidine dipeptidase [Anaerococcus vaginalis ATCC 51170] Length = 466 Score = 34.5 bits (78), Expect = 9.2, Method: Composition-based stats. Identities = 30/148 (20%), Positives = 52/148 (35%), Gaps = 6/148 (4%) Query: 36 DKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIF 95 D IT I + F I + E + I ++ + + SI+ Sbjct: 321 DGITVESSDNLALIKEEDGFIKSEISLRSSDNDALEELSKKIR-TVIEDLGINYKIDSIY 379 Query: 96 SGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSISNSESISKKALSE-YSSTDITSQGSE 154 GW + + + + +Y + + D+I + A E Y + DI S G Sbjct: 380 PGWEYKEDSKLRPLAQKVY----KEFEGKEFDTIVIHAGLECGAFYEKYPNLDIISIGPN 435 Query: 155 KSSGSSSNKTLEKESSQPLENNLSMDLK 182 + S + +E ES Q + L LK Sbjct: 436 ITGAHSPEEKVEIESVQRVYAYLKQLLK 463 >gi|126663909|ref|ZP_01734904.1| hypothetical protein FBBAL38_10517 [Flavobacteria bacterium BAL38] gi|126624173|gb|EAZ94866.1| hypothetical protein FBBAL38_10517 [Flavobacteria bacterium BAL38] Length = 206 Score = 34.5 bits (78), Expect = 9.2, Method: Composition-based stats. Identities = 14/78 (17%), Positives = 28/78 (35%), Gaps = 6/78 (7%) Query: 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMDKITGRVL------TFDVEINQSAQ 54 MK L L+LF + A+ A K A +D ++ V+ + + + Sbjct: 1 MKKLFLFLVLFVSTISFAQKSKAKPAPKNIILATVDNVSAEVISEKSGKRVVLFVKNEGK 60 Query: 55 FGSLIIKPMVCYSRDDRE 72 +L +K + + Sbjct: 61 IDTLEVKKLDKITFKPTN 78 >gi|9634815|ref|NP_039108.1| Molluscum contagiosum virus MC089L homolog [Fowlpox virus] gi|7271643|gb|AAF44489.1|AF198100_136 ORF FPV145 Molluscum contagiosum virus MC089L homolog [Fowlpox virus] Length = 103 Score = 34.5 bits (78), Expect = 9.8, Method: Composition-based stats. Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 2/40 (5%) Query: 11 FFVFSHAKFANSARFANKVAEFAGM--DKITGRVLTFDVE 48 FF+F K A S R +G+ DKIT + ++ Sbjct: 14 FFLFMLTKKATSVRLDKDNMILSGLYKDKITAQNTLVKLQ 53 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.306 0.146 0.437 Lambda K H 0.267 0.0447 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,706,426,134 Number of Sequences: 14124377 Number of extensions: 61810509 Number of successful extensions: 303731 Number of sequences better than 10.0: 289 Number of HSP's better than 10.0 without gapping: 198 Number of HSP's successfully gapped in prelim test: 91 Number of HSP's that attempted gapping in prelim test: 303344 Number of HSP's gapped (non-prelim): 334 length of query: 210 length of database: 4,842,793,630 effective HSP length: 133 effective length of query: 77 effective length of database: 2,964,251,489 effective search space: 228247364653 effective search space used: 228247364653 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.9 bits) S2: 78 (34.5 bits)