BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781002|ref|YP_003065415.1| hypothetical protein CLIBASIA_04515 [Candidatus Liberibacter asiaticus str. psy62] (108 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254781002|ref|YP_003065415.1| hypothetical protein CLIBASIA_04515 [Candidatus Liberibacter asiaticus str. psy62] gi|254040679|gb|ACT57475.1| hypothetical protein CLIBASIA_04515 [Candidatus Liberibacter asiaticus str. psy62] Length = 108 Score = 215 bits (547), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 108/108 (100%), Positives = 108/108 (100%) Query: 1 MMEMIFFAFVFTCILLLWCVFIGLIACWIVDRWEYFFIRRKYVLTAILLNYQFFYSLQTI 60 MMEMIFFAFVFTCILLLWCVFIGLIACWIVDRWEYFFIRRKYVLTAILLNYQFFYSLQTI Sbjct: 1 MMEMIFFAFVFTCILLLWCVFIGLIACWIVDRWEYFFIRRKYVLTAILLNYQFFYSLQTI 60 Query: 61 FSVEHYTNPHSIILAFVWSILSFFITFWIFVASVSMGRRVIQYFKRSR 108 FSVEHYTNPHSIILAFVWSILSFFITFWIFVASVSMGRRVIQYFKRSR Sbjct: 61 FSVEHYTNPHSIILAFVWSILSFFITFWIFVASVSMGRRVIQYFKRSR 108 >gi|195554718|ref|XP_002076949.1| GD24543 [Drosophila simulans] gi|194202967|gb|EDX16543.1| GD24543 [Drosophila simulans] Length = 344 Score = 36.2 bits (82), Expect = 1.6, Method: Composition-based stats. Identities = 22/95 (23%), Positives = 45/95 (47%), Gaps = 14/95 (14%) Query: 15 LLLW------CVFIGLIACWIVDRWEYFFIRRKYVLTAILLNYQFFYSLQTIFSVEHYTN 68 L+LW + GL ++ W ++++ +L +++ L +FSV + + Sbjct: 124 LVLWRRPLQTTKYCGLELFTLLRTWSTRLVQQRVLLATLIV-------LSIVFSVIYKID 176 Query: 69 -PHSIILAFVWSILSFFITFWIFVASVSMGRRVIQ 102 PH + FV++ F + FW F + +G+ VI+ Sbjct: 177 GPHQFAIEFVFTCGHFLVPFWTFFGATLIGKAVIK 211 >gi|311029778|ref|ZP_07707868.1| hypothetical protein Bm3-1_04349 [Bacillus sp. m3-13] Length = 110 Score = 33.5 bits (75), Expect = 8.8, Method: Compositional matrix adjust. Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 1/66 (1%) Query: 16 LLWCVFIGLIACWIVDRWEYFFIRRKYVLT-AILLNYQFFYSLQTIFSVEHYTNPHSIIL 74 L+ V IG++ +IV+R++ F RR+Y++ I Y TI +++ P +I Sbjct: 28 LIVAVMIGIMFAFIVERFQLFDKRRQYLVAIGITSPTILLYFPLTILAIKETPAPTDVIA 87 Query: 75 AFVWSI 80 +WSI Sbjct: 88 ILLWSI 93 >gi|305664907|ref|YP_003861194.1| hypothetical protein FB2170_01347 [Maribacter sp. HTCC2170] gi|88707737|gb|EAQ99977.1| hypothetical protein FB2170_01347 [Maribacter sp. HTCC2170] Length = 570 Score = 33.5 bits (75), Expect = 8.8, Method: Composition-based stats. Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 6/63 (9%) Query: 36 FFIRRKYVLTAILLNYQFFYSLQTIFSVEHYTNPHSIILAFVWSILSFFITF--WIFVAS 93 + ++ +LT L YQ F +T FSV N H +IL + ++L FIT ++ ++S Sbjct: 7 YLLKASGILTIFYLAYQVFLKRETFFSV----NRHFLILGILTALLLPFITITNYVEISS 62 Query: 94 VSM 96 V M Sbjct: 63 VPM 65 Searching..................................................done Results from round 2 CONVERGED! >gi|254781002|ref|YP_003065415.1| hypothetical protein CLIBASIA_04515 [Candidatus Liberibacter asiaticus str. psy62] gi|254040679|gb|ACT57475.1| hypothetical protein CLIBASIA_04515 [Candidatus Liberibacter asiaticus str. psy62] Length = 108 Score = 136 bits (344), Expect = 8e-31, Method: Composition-based stats. Identities = 108/108 (100%), Positives = 108/108 (100%) Query: 1 MMEMIFFAFVFTCILLLWCVFIGLIACWIVDRWEYFFIRRKYVLTAILLNYQFFYSLQTI 60 MMEMIFFAFVFTCILLLWCVFIGLIACWIVDRWEYFFIRRKYVLTAILLNYQFFYSLQTI Sbjct: 1 MMEMIFFAFVFTCILLLWCVFIGLIACWIVDRWEYFFIRRKYVLTAILLNYQFFYSLQTI 60 Query: 61 FSVEHYTNPHSIILAFVWSILSFFITFWIFVASVSMGRRVIQYFKRSR 108 FSVEHYTNPHSIILAFVWSILSFFITFWIFVASVSMGRRVIQYFKRSR Sbjct: 61 FSVEHYTNPHSIILAFVWSILSFFITFWIFVASVSMGRRVIQYFKRSR 108 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.344 0.148 0.521 Lambda K H 0.267 0.0474 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,652,701,650 Number of Sequences: 14124377 Number of extensions: 131059012 Number of successful extensions: 869001 Number of sequences better than 10.0: 502 Number of HSP's better than 10.0 without gapping: 36 Number of HSP's successfully gapped in prelim test: 558 Number of HSP's that attempted gapping in prelim test: 868527 Number of HSP's gapped (non-prelim): 837 length of query: 108 length of database: 4,842,793,630 effective HSP length: 76 effective length of query: 32 effective length of database: 3,769,340,978 effective search space: 120618911296 effective search space used: 120618911296 T: 11 A: 40 X1: 16 ( 7.9 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 38 (21.6 bits) S2: 76 (33.7 bits)