BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780996|ref|YP_003065409.1| hypothetical protein CLIBASIA_04485 [Candidatus Liberibacter asiaticus str. psy62] (109 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254780996|ref|YP_003065409.1| hypothetical protein CLIBASIA_04485 [Candidatus Liberibacter asiaticus str. psy62] gi|254040673|gb|ACT57469.1| hypothetical protein CLIBASIA_04485 [Candidatus Liberibacter asiaticus str. psy62] Length = 109 Score = 170 bits (431), Expect = 5e-41, Method: Composition-based stats. Identities = 109/109 (100%), Positives = 109/109 (100%) Query: 1 MPEDKWYIFYSGCGKNPVWDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQ 60 MPEDKWYIFYSGCGKNPVWDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQ Sbjct: 1 MPEDKWYIFYSGCGKNPVWDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQ 60 Query: 61 VSYPLPAPQEITPRMGNRKTVELLIEIDDQKVWLLNVHLKSSCVVKKIQ 109 VSYPLPAPQEITPRMGNRKTVELLIEIDDQKVWLLNVHLKSSCVVKKIQ Sbjct: 61 VSYPLPAPQEITPRMGNRKTVELLIEIDDQKVWLLNVHLKSSCVVKKIQ 109 >gi|254780797|ref|YP_003065210.1| hypothetical protein CLIBASIA_03440 [Candidatus Liberibacter asiaticus str. psy62] gi|254040474|gb|ACT57270.1| hypothetical protein CLIBASIA_03440 [Candidatus Liberibacter asiaticus str. psy62] Length = 231 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 31/91 (34%), Positives = 53/91 (58%) Query: 19 WDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQVSYPLPAPQEITPRMGNR 78 ++++K +D +TD+ ++TAI IRK +LQ SY + + + G R Sbjct: 60 YEAIKRVFPNDKWDILYSGSNTDKHAMHTAIVIRKGAIHLLQKSYLPMDTEGLDSKAGKR 119 Query: 79 KTVELLIEIDDQKVWLLNVHLKSSCVVKKIQ 109 + VE+L E+D +K+WLL++HLKS C + I+ Sbjct: 120 RAVEILFEVDGRKIWLLDIHLKSFCFLDSIE 150 >gi|254781003|ref|YP_003065416.1| hypothetical protein CLIBASIA_04520 [Candidatus Liberibacter asiaticus str. psy62] gi|254040680|gb|ACT57476.1| hypothetical protein CLIBASIA_04520 [Candidatus Liberibacter asiaticus str. psy62] Length = 304 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 44/109 (40%), Positives = 64/109 (58%), Gaps = 18/109 (16%) Query: 1 MPEDKWYIFYSGCGKNPVWDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQ 60 P++ W IFY S + +N S D N +DI+TAIA+RK RVLQ Sbjct: 84 FPKNTWCIFY----------STERLINHSKRDSN--------NDIHTAIAVRKKNVRVLQ 125 Query: 61 VSYPLPAPQEITPRMGNRKTVELLIEIDDQKVWLLNVHLKSSCVVKKIQ 109 SYPL ++ R GNR+ VELL+EI+ +K+W+L++HLKS C + ++ Sbjct: 126 QSYPLLGAKDSFSRAGNRRAVELLVEINGKKIWVLDIHLKSFCFLDSLE 174 >gi|315122019|ref|YP_004062508.1| hypothetical protein CKC_01345 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495421|gb|ADR52020.1| hypothetical protein CKC_01345 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 315 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 29/91 (31%), Positives = 52/91 (57%) Query: 19 WDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQVSYPLPAPQEITPRMGNR 78 + ++K +++ D+DE ++TAI RK VL+ SY ++ + G R Sbjct: 92 YAAIKRVFPEDTWEILYSGNDSDEHTVHTAIVARKGTVHVLEKSYLSMDTNKLDSKAGKR 151 Query: 79 KTVELLIEIDDQKVWLLNVHLKSSCVVKKIQ 109 ++VE+L E++ K+WLL++HLKS C V ++ Sbjct: 152 RSVEILFEVNGIKIWLLDIHLKSFCFVDSLK 182 >gi|190892922|ref|YP_001979464.1| hypothetical protein RHECIAT_CH0003338 [Rhizobium etli CIAT 652] gi|190698201|gb|ACE92286.1| hypothetical protein RHECIAT_CH0003338 [Rhizobium etli CIAT 652] Length = 357 Score = 38.2 bits (87), Expect = 0.38, Method: Composition-based stats. Identities = 25/86 (29%), Positives = 44/86 (51%), Gaps = 11/86 (12%) Query: 23 KGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQVSYPLPAPQEITPRMGN----- 77 + C+N SS+ + D DI TAIA +K + ++ +V P + + Sbjct: 93 ERCMNNSSH------CNEDIDDIFTAIAYKKSLGQLTEVQVPQLSVMHTSECANETPRPV 146 Query: 78 RKTVELLIEIDDQKVWLLNVHLKSSC 103 R V + I+ D++ + +L+VHLK+SC Sbjct: 147 RGGVGIQIQRDNETIVVLSVHLKASC 172 >gi|88860700|ref|ZP_01135337.1| hypothetical protein PTD2_05565 [Pseudoalteromonas tunicata D2] gi|88817295|gb|EAR27113.1| hypothetical protein PTD2_05565 [Pseudoalteromonas tunicata D2] Length = 316 Score = 37.0 bits (84), Expect = 0.79, Method: Composition-based stats. Identities = 29/113 (25%), Positives = 42/113 (37%), Gaps = 26/113 (23%) Query: 1 MPEDKWYIFYSGCGKNPVWDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQ 60 PED+W I S + V+ + SGN T + A A+RK Sbjct: 102 FPEDQWQIIISKRADSEVYSCRE-----------SGNTSTQQK---VAFAVRK------- 140 Query: 61 VSYPLPAPQEITPRM----GNRKTVELLIEIDDQKVWLLNVHLKSSCVVKKIQ 109 S P+ G R + + ++ D +LNVHLKS C V + Sbjct: 141 -SIPVLNTHHYDELALGLNGLRYGLSVTVDTADGATEVLNVHLKSGCFVDDYE 192 >gi|109896633|ref|YP_659888.1| endonuclease/exonuclease/phosphatase [Pseudoalteromonas atlantica T6c] gi|109698914|gb|ABG38834.1| Endonuclease/exonuclease/phosphatase [Pseudoalteromonas atlantica T6c] Length = 331 Score = 35.1 bits (79), Expect = 3.5, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 39/106 (36%), Gaps = 18/106 (16%) Query: 1 MPEDKWYIFYSGCGKNPVWDSMKGCLNFSSYDDNSGNIDTDESDINTAIAIRKDVARVLQ 60 PE W + S +P ++ + + A AI+K ++ + Sbjct: 122 FPESDWTLVLSARADSPAYECRESGFTSTQQKV--------------AFAIKKGISILQV 167 Query: 61 VSYPLPAPQEITPRMGNRKTVELLIEIDDQKVWLLNVHLKSSCVVK 106 Q+I R G TV+ + D +LNVHLKS C V Sbjct: 168 QQNAQLGLQKIGLRFGLAVTVDTPLGPTD----ILNVHLKSGCFVD 209 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.322 0.131 0.369 Lambda K H 0.267 0.0394 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 830,195,974 Number of Sequences: 14124377 Number of extensions: 21964667 Number of successful extensions: 68353 Number of sequences better than 10.0: 16 Number of HSP's better than 10.0 without gapping: 5 Number of HSP's successfully gapped in prelim test: 11 Number of HSP's that attempted gapping in prelim test: 68342 Number of HSP's gapped (non-prelim): 18 length of query: 109 length of database: 4,842,793,630 effective HSP length: 77 effective length of query: 32 effective length of database: 3,755,216,601 effective search space: 120166931232 effective search space used: 120166931232 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 75 (33.6 bits)