BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254780830|ref|YP_003065243.1| hypothetical protein CLIBASIA_03615 [Candidatus Liberibacter asiaticus str. psy62] (40 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|254780492|ref|YP_003064905.1| hypothetical protein CLIBASIA_01895 [Candidatus Liberibacter asiaticus str. psy62] gi|254780830|ref|YP_003065243.1| hypothetical protein CLIBASIA_03615 [Candidatus Liberibacter asiaticus str. psy62] gi|254780880|ref|YP_003065293.1| hypothetical protein CLIBASIA_03885 [Candidatus Liberibacter asiaticus str. psy62] gi|254040169|gb|ACT56965.1| hypothetical protein CLIBASIA_01895 [Candidatus Liberibacter asiaticus str. psy62] gi|254040507|gb|ACT57303.1| hypothetical protein CLIBASIA_03615 [Candidatus Liberibacter asiaticus str. psy62] gi|254040557|gb|ACT57353.1| hypothetical protein CLIBASIA_03885 [Candidatus Liberibacter asiaticus str. psy62] Length = 40 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 40/40 (100%), Positives = 40/40 (100%) Query: 1 MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40 MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH Sbjct: 1 MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40 >gi|297182451|gb|ADI18614.1| hypothetical protein [uncultured Rhodospirillales bacterium HF4000_24M03] Length = 60 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 21/30 (70%), Positives = 25/30 (83%) Query: 11 RKGIWWMPWHAQAMKDVICCDKLWGAANKH 40 +KGIWWMPW +AMKDVI C+K GAAN+H Sbjct: 31 KKGIWWMPWRQEAMKDVIGCEKPRGAANRH 60 >gi|210623970|ref|ZP_03294136.1| hypothetical protein CLOHIR_02088 [Clostridium hiranonis DSM 13275] gi|210153265|gb|EEA84271.1| hypothetical protein CLOHIR_02088 [Clostridium hiranonis DSM 13275] Length = 49 Score = 45.8 bits (107), Expect = 0.002, Method: Compositional matrix adjust. Identities = 20/33 (60%), Positives = 22/33 (66%) Query: 7 SNQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39 + Q KG MPWH + MKDVI CDKLWG A K Sbjct: 16 TGQVIKGAGRMPWHREPMKDVISCDKLWGVARK 48 >gi|51557703|gb|AAU06491.1| putative salivary protein [Culicoides sonorensis] Length = 46 Score = 43.5 bits (101), Expect = 0.011, Method: Compositional matrix adjust. Identities = 17/32 (53%), Positives = 22/32 (68%) Query: 8 NQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39 Q +K I WMPW ++AMKDV+ CDKL G + Sbjct: 1 GQVKKRIRWMPWQSEAMKDVVACDKLRGVGKQ 32 >gi|297184727|gb|ADI20838.1| hypothetical protein [uncultured alpha proteobacterium EF100_102A06] Length = 83 Score = 42.0 bits (97), Expect = 0.031, Method: Compositional matrix adjust. Identities = 20/34 (58%), Positives = 25/34 (73%) Query: 6 ESNQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39 E + +KGIWWMPW +AMKDV C+K GAA+K Sbjct: 49 ELIKRQKGIWWMPWCQEAMKDVTRCEKPGGAASK 82 >gi|297180501|gb|ADI16715.1| hypothetical protein [uncultured gamma proteobacterium HF0010_05D02] Length = 77 Score = 41.6 bits (96), Expect = 0.035, Method: Compositional matrix adjust. Identities = 16/28 (57%), Positives = 20/28 (71%) Query: 12 KGIWWMPWHAQAMKDVICCDKLWGAANK 39 K I WMPW ++AMKDV+ CDKL G + Sbjct: 3 KRIRWMPWQSEAMKDVVACDKLRGVGKQ 30 >gi|297182112|gb|ADI18285.1| hypothetical protein [uncultured Chromatiales bacterium HF0200_41F04] Length = 56 Score = 40.4 bits (93), Expect = 0.077, Method: Compositional matrix adjust. Identities = 18/32 (56%), Positives = 22/32 (68%) Query: 8 NQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39 QA K +WWMP +AMKDV+ C+K GA NK Sbjct: 24 GQANKRMWWMPRRQEAMKDVVVCEKPRGAGNK 55 >gi|297183883|gb|ADI20005.1| hypothetical protein [uncultured gamma proteobacterium EB000_65A11] Length = 57 Score = 39.7 bits (91), Expect = 0.13, Method: Compositional matrix adjust. Identities = 17/29 (58%), Positives = 20/29 (68%) Query: 8 NQARKGIWWMPWHAQAMKDVICCDKLWGA 36 QA K IWWMP +AMKDV+ C+KL G Sbjct: 6 GQANKRIWWMPRQLEAMKDVVVCEKLGGG 34 >gi|297182912|gb|ADI19062.1| hypothetical protein [uncultured delta proteobacterium HF0070_07E19] Length = 69 Score = 38.1 bits (87), Expect = 0.37, Method: Compositional matrix adjust. Identities = 19/36 (52%), Positives = 23/36 (63%), Gaps = 2/36 (5%) Query: 3 IDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAAN 38 I+ +SN+ KG MPW +AMKDV CDK G AN Sbjct: 34 IEKKSNE--KGTGRMPWLQEAMKDVASCDKPRGEAN 67 >gi|255325736|ref|ZP_05366831.1| putative salivary protein [Corynebacterium tuberculostearicum SK141] gi|255297202|gb|EET76524.1| putative salivary protein [Corynebacterium tuberculostearicum SK141] Length = 58 Score = 37.4 bits (85), Expect = 0.68, Method: Compositional matrix adjust. Identities = 17/27 (62%), Positives = 18/27 (66%) Query: 12 KGIWWMPWHAQAMKDVICCDKLWGAAN 38 KG WWMPWHA+ MKDV C K G N Sbjct: 32 KGAWWMPWHAEPMKDVKGCVKPRGVVN 58 >gi|297182737|gb|ADI18892.1| hypothetical protein [uncultured delta proteobacterium HF0010_08B07] Length = 68 Score = 35.8 bits (81), Expect = 2.1, Method: Compositional matrix adjust. Identities = 15/23 (65%), Positives = 17/23 (73%) Query: 17 MPWHAQAMKDVICCDKLWGAANK 39 MPWH +AMKDV CDKL GA + Sbjct: 1 MPWHLEAMKDVGNCDKLRGAVTQ 23 Searching..................................................done Results from round 2 CONVERGED! >gi|254780492|ref|YP_003064905.1| hypothetical protein CLIBASIA_01895 [Candidatus Liberibacter asiaticus str. psy62] gi|254780830|ref|YP_003065243.1| hypothetical protein CLIBASIA_03615 [Candidatus Liberibacter asiaticus str. psy62] gi|254780880|ref|YP_003065293.1| hypothetical protein CLIBASIA_03885 [Candidatus Liberibacter asiaticus str. psy62] gi|254040169|gb|ACT56965.1| hypothetical protein CLIBASIA_01895 [Candidatus Liberibacter asiaticus str. psy62] gi|254040507|gb|ACT57303.1| hypothetical protein CLIBASIA_03615 [Candidatus Liberibacter asiaticus str. psy62] gi|254040557|gb|ACT57353.1| hypothetical protein CLIBASIA_03885 [Candidatus Liberibacter asiaticus str. psy62] Length = 40 Score = 64.7 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 40/40 (100%), Positives = 40/40 (100%) Query: 1 MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40 MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH Sbjct: 1 MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40 >gi|210623970|ref|ZP_03294136.1| hypothetical protein CLOHIR_02088 [Clostridium hiranonis DSM 13275] gi|210153265|gb|EEA84271.1| hypothetical protein CLOHIR_02088 [Clostridium hiranonis DSM 13275] Length = 49 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 20/33 (60%), Positives = 22/33 (66%) Query: 7 SNQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39 + Q KG MPWH + MKDVI CDKLWG A K Sbjct: 16 TGQVIKGAGRMPWHREPMKDVISCDKLWGVARK 48 >gi|297182451|gb|ADI18614.1| hypothetical protein [uncultured Rhodospirillales bacterium HF4000_24M03] Length = 60 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 21/30 (70%), Positives = 25/30 (83%) Query: 11 RKGIWWMPWHAQAMKDVICCDKLWGAANKH 40 +KGIWWMPW +AMKDVI C+K GAAN+H Sbjct: 31 KKGIWWMPWRQEAMKDVIGCEKPRGAANRH 60 >gi|297184727|gb|ADI20838.1| hypothetical protein [uncultured alpha proteobacterium EF100_102A06] Length = 83 Score = 44.7 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 19/29 (65%), Positives = 23/29 (79%) Query: 11 RKGIWWMPWHAQAMKDVICCDKLWGAANK 39 +KGIWWMPW +AMKDV C+K GAA+K Sbjct: 54 QKGIWWMPWCQEAMKDVTRCEKPGGAASK 82 >gi|255325736|ref|ZP_05366831.1| putative salivary protein [Corynebacterium tuberculostearicum SK141] gi|255297202|gb|EET76524.1| putative salivary protein [Corynebacterium tuberculostearicum SK141] Length = 58 Score = 43.1 bits (100), Expect = 0.012, Method: Composition-based stats. Identities = 17/28 (60%), Positives = 18/28 (64%) Query: 11 RKGIWWMPWHAQAMKDVICCDKLWGAAN 38 KG WWMPWHA+ MKDV C K G N Sbjct: 31 IKGAWWMPWHAEPMKDVKGCVKPRGVVN 58 >gi|297182112|gb|ADI18285.1| hypothetical protein [uncultured Chromatiales bacterium HF0200_41F04] Length = 56 Score = 43.1 bits (100), Expect = 0.013, Method: Composition-based stats. Identities = 18/32 (56%), Positives = 22/32 (68%) Query: 8 NQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39 QA K +WWMP +AMKDV+ C+K GA NK Sbjct: 24 GQANKRMWWMPRRQEAMKDVVVCEKPRGAGNK 55 >gi|51557703|gb|AAU06491.1| putative salivary protein [Culicoides sonorensis] Length = 46 Score = 43.1 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 17/32 (53%), Positives = 22/32 (68%) Query: 8 NQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39 Q +K I WMPW ++AMKDV+ CDKL G + Sbjct: 1 GQVKKRIRWMPWQSEAMKDVVACDKLRGVGKQ 32 >gi|297182912|gb|ADI19062.1| hypothetical protein [uncultured delta proteobacterium HF0070_07E19] Length = 69 Score = 42.8 bits (99), Expect = 0.018, Method: Composition-based stats. Identities = 17/33 (51%), Positives = 18/33 (54%) Query: 6 ESNQARKGIWWMPWHAQAMKDVICCDKLWGAAN 38 E KG MPW +AMKDV CDK G AN Sbjct: 35 EKKSNEKGTGRMPWLQEAMKDVASCDKPRGEAN 67 >gi|297180501|gb|ADI16715.1| hypothetical protein [uncultured gamma proteobacterium HF0010_05D02] Length = 77 Score = 41.6 bits (96), Expect = 0.035, Method: Composition-based stats. Identities = 16/28 (57%), Positives = 20/28 (71%) Query: 12 KGIWWMPWHAQAMKDVICCDKLWGAANK 39 K I WMPW ++AMKDV+ CDKL G + Sbjct: 3 KRIRWMPWQSEAMKDVVACDKLRGVGKQ 30 >gi|269968958|ref|ZP_06182898.1| hypothetical protein VMC_43280 [Vibrio alginolyticus 40B] gi|269826431|gb|EEZ80825.1| hypothetical protein VMC_43280 [Vibrio alginolyticus 40B] Length = 71 Score = 37.7 bits (86), Expect = 0.53, Method: Composition-based stats. Identities = 12/24 (50%), Positives = 16/24 (66%) Query: 17 MPWHAQAMKDVICCDKLWGAANKH 40 MPW ++AMKDV+ CDK + H Sbjct: 1 MPWQSEAMKDVLTCDKPRLGSKNH 24 >gi|297183883|gb|ADI20005.1| hypothetical protein [uncultured gamma proteobacterium EB000_65A11] Length = 57 Score = 37.4 bits (85), Expect = 0.68, Method: Composition-based stats. Identities = 17/28 (60%), Positives = 20/28 (71%) Query: 8 NQARKGIWWMPWHAQAMKDVICCDKLWG 35 QA K IWWMP +AMKDV+ C+KL G Sbjct: 6 GQANKRIWWMPRQLEAMKDVVVCEKLGG 33 >gi|297182737|gb|ADI18892.1| hypothetical protein [uncultured delta proteobacterium HF0010_08B07] Length = 68 Score = 37.4 bits (85), Expect = 0.73, Method: Composition-based stats. Identities = 15/23 (65%), Positives = 17/23 (73%) Query: 17 MPWHAQAMKDVICCDKLWGAANK 39 MPWH +AMKDV CDKL GA + Sbjct: 1 MPWHLEAMKDVGNCDKLRGAVTQ 23 >gi|297180674|gb|ADI16883.1| hypothetical protein [uncultured gamma proteobacterium HF0010_16J05] Length = 71 Score = 35.0 bits (79), Expect = 3.2, Method: Composition-based stats. Identities = 11/23 (47%), Positives = 14/23 (60%) Query: 17 MPWHAQAMKDVICCDKLWGAANK 39 MPW +AMKDV+ CD G + Sbjct: 1 MPWQLKAMKDVVACDMPRGVGKQ 23 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.316 0.133 0.459 Lambda K H 0.267 0.0404 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 751,410,531 Number of Sequences: 14124377 Number of extensions: 14854867 Number of successful extensions: 32541 Number of sequences better than 10.0: 14 Number of HSP's better than 10.0 without gapping: 25 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 32516 Number of HSP's gapped (non-prelim): 25 length of query: 40 length of database: 4,842,793,630 effective HSP length: 14 effective length of query: 26 effective length of database: 4,645,052,352 effective search space: 120771361152 effective search space used: 120771361152 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 75 (33.5 bits)