BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics: Schaffer, Alejandro A.,
L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of
PSI-BLAST protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.

Query= gi|254780207|ref|YP_003064620.1| hypothetical protein CLIBASIA_00460
         [Candidatus Liberibacter asiaticus str. psy62]
         (98 letters)

Database: nr
           13,984,884 sequences; 4,792,584,752 total letters

Searching..................................................done

>gi|254780207|ref|YP_003064620.1| hypothetical protein CLIBASIA_00460 [Candidatus Liberibacter asiaticus str. psy62]
 gi|254039884|gb|ACT56680.1| hypothetical protein CLIBASIA_00460 [Candidatus Liberibacter asiaticus str. psy62]
          Length = 98

 Score =  184 bits (466), Expect = 4e-45, Method: Composition-based stats.
 Identities = 98/98 (100%), Positives = 98/98 (100%)

Query: 1   MRHLILIMLLSILTTNIARAQVYHIHSPRIATKSSIHIKCHSCTLNKHHINKTPSSSSAV 60
           MRHLILIMLLSILTTNIARAQVYHIHSPRIATKSSIHIKCHSCTLNKHHINKTPSSSSAV
Sbjct: 1   MRHLILIMLLSILTTNIARAQVYHIHSPRIATKSSIHIKCHSCTLNKHHINKTPSSSSAV 60

Query: 61  YTKKEELIDGKKAMITTDNFMGGEPITFIKYLFEEDKK 98
           YTKKEELIDGKKAMITTDNFMGGEPITFIKYLFEEDKK
Sbjct: 61  YTKKEELIDGKKAMITTDNFMGGEPITFIKYLFEEDKK 98

>gi|315122632|ref|YP_004063121.1| hypothetical protein CKC_04420 [Candidatus Liberibacter solanacearum CLso-ZC1]
 gi|313496034|gb|ADR52633.1| hypothetical protein CKC_04420 [Candidatus Liberibacter solanacearum CLso-ZC1]
          Length = 119

 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats.
 Identities = 41/100 (41%), Positives = 60/100 (60%), Gaps = 2/100 (2%)

Query: 1   MRHLILIMLLSILTTNIARAQ-VYHIHSPRIATKSSIHIKCHSCTLNKHHINKTPSS-SS 58
           M  L+L ++L   +  + +AQ VY IH P      SI++KC SC L K   +   +  +
Sbjct: 1   MNFLMLAVILFAFSAQVGKAQSVYEIHHPIRTQDFSIYVKCLSCELPKQSKSNIQNQLND 60

Query: 59  AVYTKKEELIDGKKAMITTDNFMGGEPITFIKYLFEEDKK 98
            + T+KEE+IDGK A++ TDN MG EP TFIK+   E++K
Sbjct: 61  VISTQKEEIIDGKNAIVMTDNLMGDEPETFIKHFKHEEQK 100

>gi|159185417|ref|NP_353669.2| hypothetical protein Atu8084 [Agrobacterium tumefaciens str. C58]
 gi|159140665|gb|AAK86454.2| conserved hypothetical protein [Agrobacterium tumefaciens str. C58]
          Length = 169

 Score = 39.3 bits (90), Expect = 0.18, Method: Composition-based stats.
 Identities = 23/97 (23%), Positives = 49/97 (50%), Gaps = 12/97 (12%)

Query: 1   MRHLILIMLLSILTTNIARAQVYHIHSPRIATKSSIHIKCHSCTLNKHHINKTPSSSSAV 60
           M+ L L  ++ + + +   + + +I+  ++   S + + C  C   K   NK  +   AV
Sbjct: 1   MQRLFLTAIVVLTSGSAMASSIEYINDVQVTNGSFVRLNCAGCQPLK---NKPAAQGYAV 57

Query: 61  --------YTKKEELIDGKKAMITTDNFMGGEPITFI 89
                   +T+  E +DGK+ ++ T+ ++GG P+TF+
Sbjct: 58  PSIEPGTQHTEMHE-VDGKRTLVRTEAWLGGAPVTFV 93

>gi|325292028|ref|YP_004277892.1| hypothetical protein AGROH133_04143 [Agrobacterium sp. H13-3]
 gi|325059881|gb|ADY63572.1| hypothetical protein AGROH133_04143 [Agrobacterium sp. H13-3]
          Length = 170

 Score = 37.0 bits (84), Expect = 0.85, Method: Composition-based stats.
 Identities = 19/93 (20%), Positives = 44/93 (47%), Gaps = 4/93 (4%)

Query: 1   MRHLILIMLLSILTTNIARAQVYHIHSPRIATKSSIHIKCHSCTLNKHHIN----KTPSS 56
           MR L+L  ++++   +   + + +++    +  S + + C  C   K +        PS
Sbjct: 1   MRRLLLTAVVALTGGSAMASSIEYVNGTHTSNGSFVRLDCARCQPVKDNPAPEGFAIPSI 60

Query: 57  SSAVYTKKEELIDGKKAMITTDNFMGGEPITFI 89
                  +   I+GK+ ++ T+ ++GG P+TF+
Sbjct: 61  EPGTQHTEMREIEGKRTLVRTEAWLGGAPVTFV 93

  Database: nr
    Posted date:  May 13, 2011  4:10 AM
  Number of letters in database: 999,999,932
  Number of sequences in database:  2,987,209

  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 13, 2011  4:17 AM
  Number of letters in database: 999,998,956
  Number of sequences in database:  2,896,973

  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 13, 2011  4:23 AM
  Number of letters in database: 999,999,979
  Number of sequences in database:  2,907,862

  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 13, 2011  4:29 AM
  Number of letters in database: 999,999,513
  Number of sequences in database:  2,932,190

  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 13, 2011  4:33 AM
  Number of letters in database: 792,586,372
  Number of sequences in database:  2,260,650

Lambda     K      H
   0.321    0.132    0.381

Lambda     K      H
   0.267   0.0436    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,019,230,228
Number of Sequences: 13984884
Number of extensions: 39050483
Number of successful extensions: 84907
Number of sequences better than 10.0: 4
Number of HSP's better than 10.0 without gapping: 2
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 84903
Number of HSP's gapped (non-prelim): 4
length of query: 98
length of database: 4,792,584,752
effective HSP length: 67
effective length of query: 31
effective length of database: 3,855,597,524
effective search space: 119523523244
effective search space used: 119523523244
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.8 bits)
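
Note on the footer arithmetic: the effective lengths reported above look consistent with subtracting the effective HSP length once from the query and once per database sequence, and the five database volumes appear to sum to the totals in the header. The following is a minimal sketch that rechecks those numbers, assuming that relationship holds for this run. The function name `effective_search_space` is hypothetical and is not part of BLAST; every constant is copied from the report.

```python
# Values copied from the report above; nothing here is computed by BLAST itself.
QUERY_LENGTH = 98
EFFECTIVE_HSP_LENGTH = 67

# (letters, sequences) for each database volume listed in the footer.
VOLUMES = [
    (999_999_932, 2_987_209),  # nr
    (999_998_956, 2_896_973),  # nr.01
    (999_999_979, 2_907_862),  # nr.02
    (999_999_513, 2_932_190),  # nr.03
    (792_586_372, 2_260_650),  # nr.04
]


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Hypothetical helper: assumes the HSP length is subtracted once
    from the query and once from every database sequence."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


db_letters = sum(letters for letters, _ in VOLUMES)
db_seqs = sum(seqs for _, seqs in VOLUMES)
eff_query, eff_db, search_space = effective_search_space(
    QUERY_LENGTH, db_letters, db_seqs, EFFECTIVE_HSP_LENGTH
)

print("total letters:           ", db_letters)
print("total sequences:         ", db_seqs)
print("effective query length:  ", eff_query)
print("effective database length:", eff_db)
print("effective search space:  ", search_space)
```

If the assumption is right, the printed values can be compared line by line with "length of database", "Number of Sequences", "effective length of query", "effective length of database", and "effective search space" in the report.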