BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781135|ref|YP_003065548.1| hypothetical protein CLIBASIA_05185 [Candidatus Liberibacter asiaticus str. psy62] (83 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781135|ref|YP_003065548.1| hypothetical protein CLIBASIA_05185 [Candidatus Liberibacter asiaticus str. psy62] gi|254040812|gb|ACT57608.1| hypothetical protein CLIBASIA_05185 [Candidatus Liberibacter asiaticus str. psy62] Length = 83 Score = 177 bits (449), Expect = 4e-43, Method: Compositional matrix adjust. Identities = 83/83 (100%), Positives = 83/83 (100%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW Sbjct: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 Query: 61 FFKNRQWYIASDSMDAWTCQLKL 83 FFKNRQWYIASDSMDAWTCQLKL Sbjct: 61 FFKNRQWYIASDSMDAWTCQLKL 83 >gi|315122528|ref|YP_004063017.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495930|gb|ADR52529.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 532 Score = 132 bits (333), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 58/82 (70%), Positives = 69/82 (84%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 M+SG+W TF YTTP+KTFLIALNGQDERQIY+GN+ KS PI+C GH+ NP+YFSYGW Sbjct: 117 MQSGDWSTFHYTTPEKTFLIALNGQDERQIYDGNDWKSDYPPIVCEGHSDINPIYFSYGW 176 Query: 61 FFKNRQWYIASDSMDAWTCQLK 82 FKNRQWYIA+DSMDAW ++ Sbjct: 177 LFKNRQWYIAADSMDAWYLPVR 198 >gi|227822437|ref|YP_002826409.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] gi|227341438|gb|ACP25656.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] Length = 536 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 9/80 (11%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNG---NNCKSSLLPIICAGHNQNNPVYFS 57 + G+W FQ+TT +FL+ LNG D R+IY+G + P A N Sbjct: 123 LAGGDWCAFQHTTAGGSFLVCLNGADTRRIYDGTAWTTTPTITFPDATAMAELN------ 176 Query: 58 YGWFFKNRQWYIASDSMDAW 77 YGW FKNR++++ + ++DA+ Sbjct: 177 YGWLFKNREFFLKNATLDAY 196 >gi|150397022|ref|YP_001327489.1| hypothetical protein Smed_1819 [Sinorhizobium medicae WSM419] gi|150028537|gb|ABR60654.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 538 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 23/77 (29%), Positives = 44/77 (57%), Gaps = 3/77 (3%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 M G+W FQ++ ++L+ LNG ++R++YNG ++ G +YGW Sbjct: 125 MSGGDWCAFQHSNAGTSWLVCLNGANDRRLYNGTTWTTAPAITFTDGTTMAQ---LNYGW 181 Query: 61 FFKNRQWYIASDSMDAW 77 FKNR++++ + ++DA+ Sbjct: 182 LFKNREFFLKNGTLDAY 198 >gi|13470677|ref|NP_102246.1| hypothetical protein mll0452 [Mesorhizobium loti MAFF303099] gi|14021419|dbj|BAB48032.1| mll0452 [Mesorhizobium loti MAFF303099] Length = 528 Score = 39.3 bits (90), Expect = 0.18, Method: Composition-based stats. Identities = 22/79 (27%), Positives = 40/79 (50%), Gaps = 9/79 (11%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKS--SLLPIICAGHNQNNPVYFSY 58 + SG + Q TP +L ALNG D + ++G + +L P I N S+ Sbjct: 116 LTSGYFSYVQMPTPGGYYLSALNGTDSMRQFDGTTWTTVAALGPSI-------NTNQLSF 168 Query: 59 GWFFKNRQWYIASDSMDAW 77 W ++NRQ+++ ++++ W Sbjct: 169 IWTYRNRQFFVQKNTLNVW 187 >gi|316933870|ref|YP_004108852.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] gi|315601584|gb|ADU44119.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] Length = 558 Score = 38.5 bits (88), Expect = 0.27, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 37/77 (48%), Gaps = 6/77 (7%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 + G WV Q+ TP FL A+NG D +++G++ + P I + S+ W Sbjct: 148 LTGGNWVAVQFATPGGVFLRAVNGVDAPLVFDGSDWAET--PAITG----VDAASLSHVW 201 Query: 61 FFKNRQWYIASDSMDAW 77 + R +++ S+ A+ Sbjct: 202 LHQRRLFFVKGGSLSAY 218 >gi|152982947|ref|YP_001353888.1| hypothetical protein mma_2198 [Janthinobacterium sp. Marseille] gi|151283024|gb|ABR91434.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 519 Score = 37.7 bits (86), Expect = 0.53, Method: Compositional matrix adjust. Identities = 22/78 (28%), Positives = 35/78 (44%), Gaps = 6/78 (7%) Query: 3 SGEWVTFQYTTPDKTFLIALNGQDERQIYNGNN---CKSSLLPIICAGHNQNNPVYFSYG 59 S +W + T F+ +NG D Q++NG + P+ G +N + + Sbjct: 112 SDKWQHVNFGTAGGNFMYLVNGSDAPQLFNGTTWQAVTAVSAPVSITGVTASNLINVTA- 170 Query: 60 WFFKNRQWYIASDSMDAW 77 FK R W+I +SM AW Sbjct: 171 --FKARLWFIERNSMKAW 186 Searching..................................................done Results from round 2 >gi|315122528|ref|YP_004063017.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495930|gb|ADR52529.1| hypothetical protein CKC_03900 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 532 Score = 141 bits (356), Expect = 2e-32, Method: Composition-based stats. Identities = 58/82 (70%), Positives = 69/82 (84%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 M+SG+W TF YTTP+KTFLIALNGQDERQIY+GN+ KS PI+C GH+ NP+YFSYGW Sbjct: 117 MQSGDWSTFHYTTPEKTFLIALNGQDERQIYDGNDWKSDYPPIVCEGHSDINPIYFSYGW 176 Query: 61 FFKNRQWYIASDSMDAWTCQLK 82 FKNRQWYIA+DSMDAW ++ Sbjct: 177 LFKNRQWYIAADSMDAWYLPVR 198 >gi|254781135|ref|YP_003065548.1| hypothetical protein CLIBASIA_05185 [Candidatus Liberibacter asiaticus str. psy62] gi|254040812|gb|ACT57608.1| hypothetical protein CLIBASIA_05185 [Candidatus Liberibacter asiaticus str. psy62] Length = 83 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. Identities = 83/83 (100%), Positives = 83/83 (100%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW Sbjct: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 Query: 61 FFKNRQWYIASDSMDAWTCQLKL 83 FFKNRQWYIASDSMDAWTCQLKL Sbjct: 61 FFKNRQWYIASDSMDAWTCQLKL 83 >gi|150397022|ref|YP_001327489.1| hypothetical protein Smed_1819 [Sinorhizobium medicae WSM419] gi|150028537|gb|ABR60654.1| conserved hypothetical protein [Sinorhizobium medicae WSM419] Length = 538 Score = 123 bits (309), Expect = 7e-27, Method: Composition-based stats. Identities = 23/81 (28%), Positives = 45/81 (55%), Gaps = 3/81 (3%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 M G+W FQ++ ++L+ LNG ++R++YNG ++ G +YGW Sbjct: 125 MSGGDWCAFQHSNAGTSWLVCLNGANDRRLYNGTTWTTAPAITFTDGTTMAQ---LNYGW 181 Query: 61 FFKNRQWYIASDSMDAWTCQL 81 FKNR++++ + ++DA+ + Sbjct: 182 LFKNREFFLKNGTLDAYYLPV 202 >gi|227822437|ref|YP_002826409.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] gi|227341438|gb|ACP25656.1| hypothetical protein NGR_c18920 [Sinorhizobium fredii NGR234] Length = 536 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 25/81 (30%), Positives = 44/81 (54%), Gaps = 3/81 (3%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 + G+W FQ+TT +FL+ LNG D R+IY+G ++ +YGW Sbjct: 123 LAGGDWCAFQHTTAGGSFLVCLNGADTRRIYDGTAWTTTPTITFPDATAMAE---LNYGW 179 Query: 61 FFKNRQWYIASDSMDAWTCQL 81 FKNR++++ + ++DA+ + Sbjct: 180 LFKNREFFLKNATLDAYYLPV 200 >gi|316933870|ref|YP_004108852.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] gi|315601584|gb|ADU44119.1| hypothetical protein Rpdx1_2528 [Rhodopseudomonas palustris DX-1] Length = 558 Score = 66.6 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 18/80 (22%), Positives = 35/80 (43%), Gaps = 6/80 (7%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 + G WV Q+ TP FL A+NG D +++G++ + + S+ W Sbjct: 148 LTGGNWVAVQFATPGGVFLRAVNGVDAPLVFDGSDWAETPAITGVDAAS------LSHVW 201 Query: 61 FFKNRQWYIASDSMDAWTCQ 80 + R +++ S+ A+ Sbjct: 202 LHQRRLFFVKGGSLSAYYLP 221 >gi|13470677|ref|NP_102246.1| hypothetical protein mll0452 [Mesorhizobium loti MAFF303099] gi|14021419|dbj|BAB48032.1| mll0452 [Mesorhizobium loti MAFF303099] Length = 528 Score = 61.2 bits (147), Expect = 4e-08, Method: Composition-based stats. Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 5/81 (6%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYGW 60 + SG + Q TP +L ALNG D + ++G + + A N S+ W Sbjct: 116 LTSGYFSYVQMPTPGGYYLSALNGTDSMRQFDGTTWTT-----VAALGPSINTNQLSFIW 170 Query: 61 FFKNRQWYIASDSMDAWTCQL 81 ++NRQ+++ ++++ W + Sbjct: 171 TYRNRQFFVQKNTLNVWFLPV 191 >gi|254720058|ref|ZP_05181869.1| hypothetical protein Bru83_11056 [Brucella sp. 83/13] gi|265985065|ref|ZP_06097800.1| conserved hypothetical protein [Brucella sp. 83/13] gi|264663657|gb|EEZ33918.1| conserved hypothetical protein [Brucella sp. 83/13] Length = 481 Score = 54.3 bits (129), Expect = 6e-06, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 2/81 (2%) Query: 1 MKSGEWVTFQYTT-PDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYG 59 + G+W + + LI +NG D + +NG + + G + S Sbjct: 115 LGGGKWSSVMFANSAGDISLIGVNGADYMKRFNGTAW-TDISDTTTPGVTGVDTRKLSAI 173 Query: 60 WFFKNRQWYIASDSMDAWTCQ 80 W ++NR +++ D+M AW Sbjct: 174 WAYQNRLFFVQQDTMSAWYLP 194 >gi|306839390|ref|ZP_07472204.1| Hypothetical protein BROD_2246 [Brucella sp. NF 2653] gi|306405513|gb|EFM61778.1| Hypothetical protein BROD_2246 [Brucella sp. NF 2653] Length = 481 Score = 53.5 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 2/81 (2%) Query: 1 MKSGEWVTFQYTT-PDKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQNNPVYFSYG 59 + G+W + + LI +NG D + +NG + + G + S Sbjct: 115 LGGGKWSSVIFANSAGDISLIGVNGADYMKRFNGTAW-TDISDTTTPGVTGVDTRKLSAI 173 Query: 60 WFFKNRQWYIASDSMDAWTCQ 80 W ++NR +++ D+M AW Sbjct: 174 WAYQNRLFFVQQDTMSAWYLP 194 >gi|152982947|ref|YP_001353888.1| hypothetical protein mma_2198 [Janthinobacterium sp. Marseille] gi|151283024|gb|ABR91434.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille] Length = 519 Score = 51.2 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 6/82 (7%) Query: 2 KSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCKSSL---LPIICAGHNQNNPVYFSY 58 S +W + T F+ +NG D Q++NG ++ P+ G +N + + Sbjct: 111 TSDKWQHVNFGTAGGNFMYLVNGSDAPQLFNGTTWQAVTAVSAPVSITGVTASNLINVT- 169 Query: 59 GWFFKNRQWYIASDSMDAWTCQ 80 FK R W+I +SM AW Sbjct: 170 --AFKARLWFIERNSMKAWYLP 189 >gi|209548341|ref|YP_002280258.1| hypothetical protein Rleg2_0736 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534097|gb|ACI54032.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 607 Score = 39.3 bits (90), Expect = 0.16, Method: Composition-based stats. Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 1/52 (1%) Query: 1 MKSGEWVTFQYTTP-DKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQN 51 M S W QYT P + FL +NG D ++NG + ++ L A QN Sbjct: 110 MASAHWSVQQYTNPAGQEFLRLVNGLDTPLLFNGTSWTNNFLVGTAALATQN 161 >gi|110632595|ref|YP_672803.1| hypothetical protein Meso_0234 [Mesorhizobium sp. BNC1] gi|110283579|gb|ABG61638.1| hypothetical protein Meso_0234 [Chelativorans sp. BNC1] Length = 632 Score = 38.5 bits (88), Expect = 0.29, Method: Composition-based stats. Identities = 13/32 (40%), Positives = 18/32 (56%) Query: 2 KSGEWVTFQYTTPDKTFLIALNGQDERQIYNG 33 G+W T Q+ T FL+ +NGQ IY+G Sbjct: 140 TGGDWFTVQFATTGGVFLVGVNGQSTGFIYDG 171 >gi|327191475|gb|EGE58495.1| hypothetical protein RHECNPAF_300005 [Rhizobium etli CNPAF512] Length = 607 Score = 38.1 bits (87), Expect = 0.38, Method: Composition-based stats. Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 1/52 (1%) Query: 1 MKSGEWVTFQYTTP-DKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQN 51 + S W QYT P + FL +NG D ++NG + ++ L A QN Sbjct: 110 LASAHWSVQQYTNPAGQEFLRLVNGLDTPLLFNGTSWTNNFLVGTAALATQN 161 >gi|218674218|ref|ZP_03523887.1| hypothetical protein RetlG_23247 [Rhizobium etli GR56] Length = 457 Score = 38.1 bits (87), Expect = 0.41, Method: Composition-based stats. Identities = 18/52 (34%), Positives = 25/52 (48%), Gaps = 1/52 (1%) Query: 1 MKSGEWVTFQYTTP-DKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQN 51 + S W QYT P + FL +NG D I+NG + ++ L A QN Sbjct: 110 LASAHWSVQQYTNPAGQEFLRLVNGLDTPLIFNGTSWTNNFLVGTAALATQN 161 >gi|116253670|ref|YP_769508.1| hypothetical protein RL3930 [Rhizobium leguminosarum bv. viciae 3841] gi|115258318|emb|CAK09420.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 607 Score = 36.6 bits (83), Expect = 1.2, Method: Composition-based stats. Identities = 16/52 (30%), Positives = 24/52 (46%), Gaps = 1/52 (1%) Query: 1 MKSGEWVTFQYTTP-DKTFLIALNGQDERQIYNGNNCKSSLLPIICAGHNQN 51 + S W QYT P + +L +NG D +YNG +++L QN Sbjct: 110 LSSARWSVQQYTNPAGQEYLRLVNGLDLPLLYNGTTWTNNILVGTATLATQN 161 >gi|126444876|ref|YP_001063338.1| hypothetical protein BURPS668_A2344 [Burkholderia pseudomallei 668] gi|126224367|gb|ABN87872.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 590 Score = 35.4 bits (80), Expect = 2.5, Method: Composition-based stats. Identities = 10/37 (27%), Positives = 18/37 (48%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCK 37 + + W +T TFL+ +NG D +Y+G + Sbjct: 106 LTNSRWQWVNFTNAGLTFLVMVNGVDAPLLYDGTTWR 142 >gi|167907341|ref|ZP_02494546.1| hypothetical protein BpseN_34245 [Burkholderia pseudomallei NCTC 13177] Length = 590 Score = 35.4 bits (80), Expect = 2.6, Method: Composition-based stats. Identities = 10/37 (27%), Positives = 18/37 (48%) Query: 1 MKSGEWVTFQYTTPDKTFLIALNGQDERQIYNGNNCK 37 + + W +T TFL+ +NG D +Y+G + Sbjct: 106 LTNSRWQWVNFTNAGLTFLVMVNGVDAPLLYDGTTWR 142 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.316 0.133 0.410 Lambda K H 0.267 0.0415 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,656,189,124 Number of Sequences: 14124377 Number of extensions: 55518737 Number of successful extensions: 94053 Number of sequences better than 10.0: 18 Number of HSP's better than 10.0 without gapping: 19 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 94015 Number of HSP's gapped (non-prelim): 32 length of query: 83 length of database: 4,842,793,630 effective HSP length: 54 effective length of query: 29 effective length of database: 4,080,077,272 effective search space: 118322240888 effective search space used: 118322240888 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 75 (33.5 bits)