BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781139|ref|YP_003065552.1| hypothetical protein CLIBASIA_05205 [Candidatus Liberibacter asiaticus str. psy62] (65 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781139|ref|YP_003065552.1| hypothetical protein CLIBASIA_05205 [Candidatus Liberibacter asiaticus str. psy62] gi|254040816|gb|ACT57612.1| hypothetical protein CLIBASIA_05205 [Candidatus Liberibacter asiaticus str. psy62] Length = 65 Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 65/65 (100%), Positives = 65/65 (100%) Query: 1 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT 60 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT Sbjct: 1 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT 60 Query: 61 SEGYV 65 SEGYV Sbjct: 61 SEGYV 65 >gi|315122532|ref|YP_004063021.1| hypothetical protein CKC_03920 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495934|gb|ADR52533.1| hypothetical protein CKC_03920 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 118 Score = 105 bits (263), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 46/65 (70%), Positives = 59/65 (90%) Query: 1 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT 60 +++NQACA+++ KK DWQRIAS+P+S+++ SHLLQAH+EGDD WV+KWLNNRDNASWRT Sbjct: 54 LDLNQACAVSSGKKREDWQRIASVPLSVVRSSHLLQAHSEGDDQWVSKWLNNRDNASWRT 113 Query: 61 SEGYV 65 SEG V Sbjct: 114 SEGCV 118 >gi|150397026|ref|YP_001327493.1| hypothetical protein Smed_1823 [Sinorhizobium medicae WSM419] gi|150028541|gb|ABR60658.1| hypothetical protein Smed_1823 [Sinorhizobium medicae WSM419] Length = 107 Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 28/56 (50%), Positives = 41/56 (73%) Query: 10 ATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRTSEGYV 65 A R DW R+ASIP+++ + L+QAH+EGDD +V ++LNN DN +WRT EG++ Sbjct: 52 AERAWKGDWHRVASIPLNVAHGAGLVQAHSEGDDRFVKRFLNNSDNRAWRTKEGHL 107 >gi|227822440|ref|YP_002826412.1| hypothetical protein NGR_c18950 [Sinorhizobium fredii NGR234] gi|227341441|gb|ACP25659.1| hypothetical protein NGR_c18950 [Sinorhizobium fredii NGR234] Length = 107 Score = 57.0 bits (136), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 21/49 (42%), Positives = 39/49 (79%) Query: 17 DWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRTSEGYV 65 +W ++AS+P+++ +L++AH+EGDD +V +WLN+ DN +WR+ EG++ Sbjct: 59 EWTKVASVPLNVAHSENLVRAHSEGDDRYVKRWLNDGDNRAWRSFEGHL 107 Searching..................................................done Results from round 2 CONVERGED! >gi|315122532|ref|YP_004063021.1| hypothetical protein CKC_03920 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495934|gb|ADR52533.1| hypothetical protein CKC_03920 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 118 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 46/65 (70%), Positives = 59/65 (90%) Query: 1 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT 60 +++NQACA+++ KK DWQRIAS+P+S+++ SHLLQAH+EGDD WV+KWLNNRDNASWRT Sbjct: 54 LDLNQACAVSSGKKREDWQRIASVPLSVVRSSHLLQAHSEGDDQWVSKWLNNRDNASWRT 113 Query: 61 SEGYV 65 SEG V Sbjct: 114 SEGCV 118 >gi|254781139|ref|YP_003065552.1| hypothetical protein CLIBASIA_05205 [Candidatus Liberibacter asiaticus str. psy62] gi|254040816|gb|ACT57612.1| hypothetical protein CLIBASIA_05205 [Candidatus Liberibacter asiaticus str. psy62] Length = 65 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. Identities = 65/65 (100%), Positives = 65/65 (100%) Query: 1 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT 60 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT Sbjct: 1 MEINQACAMATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRT 60 Query: 61 SEGYV 65 SEGYV Sbjct: 61 SEGYV 65 >gi|150397026|ref|YP_001327493.1| hypothetical protein Smed_1823 [Sinorhizobium medicae WSM419] gi|150028541|gb|ABR60658.1| hypothetical protein Smed_1823 [Sinorhizobium medicae WSM419] Length = 107 Score = 92.9 bits (229), Expect = 1e-17, Method: Composition-based stats. Identities = 28/56 (50%), Positives = 41/56 (73%) Query: 10 ATRKKNRDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRTSEGYV 65 A R DW R+ASIP+++ + L+QAH+EGDD +V ++LNN DN +WRT EG++ Sbjct: 52 AERAWKGDWHRVASIPLNVAHGAGLVQAHSEGDDRFVKRFLNNSDNRAWRTKEGHL 107 >gi|227822440|ref|YP_002826412.1| hypothetical protein NGR_c18950 [Sinorhizobium fredii NGR234] gi|227341441|gb|ACP25659.1| hypothetical protein NGR_c18950 [Sinorhizobium fredii NGR234] Length = 107 Score = 82.5 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 21/50 (42%), Positives = 39/50 (78%) Query: 16 RDWQRIASIPVSILKDSHLLQAHTEGDDVWVNKWLNNRDNASWRTSEGYV 65 +W ++AS+P+++ +L++AH+EGDD +V +WLN+ DN +WR+ EG++ Sbjct: 58 GEWTKVASVPLNVAHSENLVRAHSEGDDRYVKRWLNDGDNRAWRSFEGHL 107 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.310 0.131 0.405 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,315,682,811 Number of Sequences: 14124377 Number of extensions: 39196108 Number of successful extensions: 91727 Number of sequences better than 10.0: 10 Number of HSP's better than 10.0 without gapping: 8 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 91719 Number of HSP's gapped (non-prelim): 14 length of query: 65 length of database: 4,842,793,630 effective HSP length: 37 effective length of query: 28 effective length of database: 4,320,191,681 effective search space: 120965367068 effective search space used: 120965367068 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 75 (33.5 bits)