BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780912|ref|YP_003065325.1| hypothetical protein CLIBASIA_04055 [Candidatus Liberibacter asiaticus str. psy62] (68 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780912|ref|YP_003065325.1| hypothetical protein CLIBASIA_04055 [Candidatus Liberibacter asiaticus str. psy62] gi|254040589|gb|ACT57385.1| hypothetical protein CLIBASIA_04055 [Candidatus Liberibacter asiaticus str. psy62] Length = 68 Score = 134 bits (336), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 68/68 (100%), Positives = 68/68 (100%) Query: 1 MINNPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKYVI 60 MINNPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKYVI Sbjct: 1 MINNPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKYVI 60 Query: 61 SGLLEGLL 68 SGLLEGLL Sbjct: 61 SGLLEGLL 68 >gi|289549173|ref|YP_003474161.1| hypothetical protein Thal_1405 [Thermocrinis albus DSM 14484] gi|289182790|gb|ADC90034.1| Tetratricopeptide TPR_2 repeat protein [Thermocrinis albus DSM 14484] Length = 388 Score = 34.3 bits (77), Expect = 5.2, Method: Composition-based stats. Identities = 18/55 (32%), Positives = 33/55 (60%) Query: 4 NPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKY 58 N L L + +K+LL++LEL ++++C +T KA V A NA+++ G++ Sbjct: 288 NALVYLPTANPLKNLLKALELYEEVLRCKESLGDTEGKARVLANMGNALAHLGRF 342 Searching..................................................done Results from round 2 CONVERGED! >gi|254780912|ref|YP_003065325.1| hypothetical protein CLIBASIA_04055 [Candidatus Liberibacter asiaticus str. psy62] gi|254040589|gb|ACT57385.1| hypothetical protein CLIBASIA_04055 [Candidatus Liberibacter asiaticus str. psy62] Length = 68 Score = 120 bits (300), Expect = 7e-26, Method: Composition-based stats. Identities = 68/68 (100%), Positives = 68/68 (100%) Query: 1 MINNPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKYVI 60 MINNPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKYVI Sbjct: 1 MINNPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKYVI 60 Query: 61 SGLLEGLL 68 SGLLEGLL Sbjct: 61 SGLLEGLL 68 >gi|289549173|ref|YP_003474161.1| hypothetical protein Thal_1405 [Thermocrinis albus DSM 14484] gi|289182790|gb|ADC90034.1| Tetratricopeptide TPR_2 repeat protein [Thermocrinis albus DSM 14484] Length = 388 Score = 35.5 bits (80), Expect = 2.3, Method: Composition-based stats. Identities = 18/55 (32%), Positives = 33/55 (60%) Query: 4 NPLSLLSLPSFMKSLLRSLELCSQIMQCVSKEKNTINKASVRAGARNAISNPGKY 58 N L L + +K+LL++LEL ++++C +T KA V A NA+++ G++ Sbjct: 288 NALVYLPTANPLKNLLKALELYEEVLRCKESLGDTEGKARVLANMGNALAHLGRF 342 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.317 0.131 0.357 Lambda K H 0.267 0.0406 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,010,719,267 Number of Sequences: 14124377 Number of extensions: 22982916 Number of successful extensions: 57701 Number of sequences better than 10.0: 2 Number of HSP's better than 10.0 without gapping: 4 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 57697 Number of HSP's gapped (non-prelim): 4 length of query: 68 length of database: 4,842,793,630 effective HSP length: 40 effective length of query: 28 effective length of database: 4,277,818,550 effective search space: 119778919400 effective search space used: 119778919400 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 75 (33.5 bits)