BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= 537021.9.peg.1140_1 (90 letters) Database: nr 13,984,884 sequences; 4,792,584,752 total letters Searching..................................................done Results from round 1 >gi|317120665|gb|ADV02488.1| hypothetical protein SC1_gp010 [Liberibacter phage SC1] gi|317120809|gb|ADV02630.1| hypothetical protein SC1_gp010 [Candidatus Liberibacter asiaticus] Length = 90 Score = 184 bits (467), Expect = 3e-45, Method: Compositional matrix adjust. 
Identities = 89/90 (98%), Positives = 90/90 (100%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL Sbjct: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90 >gi|317120706|gb|ADV02528.1| hypothetical protein SC2_gp010 [Liberibacter phage SC2] gi|317120767|gb|ADV02588.1| hypothetical protein SC2_gp010 [Candidatus Liberibacter asiaticus] Length = 90 Score = 181 bits (460), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 88/90 (97%), Positives = 89/90 (98%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQED YITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL Sbjct: 1 MTKRQEDRYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90 >gi|317120749|gb|ADV02571.1| hypothetical protein SC2_gp255 [Liberibacter phage SC2] gi|317120763|gb|ADV02584.1| hypothetical protein SC2_gp255 [Candidatus Liberibacter asiaticus] Length = 90 Score = 175 bits (443), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 85/90 (94%), Positives = 87/90 (96%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 M KRQED+YITREEFIEFCTNSNSKQDCLIS KLFEKHYREQQKGVNEILDILKSVKWL Sbjct: 1 MAKRQEDYYITREEFIEFCTNSNSKQDCLISHCKLFEKHYREQQKGVNEILDILKSVKWL 60 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90 >gi|315121925|ref|YP_004062414.1| hypothetical protein CKC_00875 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495327|gb|ADR51926.1| hypothetical protein CKC_00875 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 85 Score = 101 bits (252), Expect = 3e-20, Method: Compositional matrix adjust. 
Identities = 49/90 (54%), Positives = 70/90 (77%), Gaps = 5/90 (5%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQE Y+T++EF E ++K DC+I+ K+ E+ R+QQ+G+ EIL+IL+ +KW Sbjct: 1 MTKRQE-QYVTKQEFNEL----SAKVDCIITHLKVCERSERKQQQGIEEILNILQGLKWF 55 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 F+++KNIAI VTSL+AI+YG+ NIKGW KQ Sbjct: 56 FASIKNIAIIVTSLSAILYGVFNIKGWLKQ 85 >gi|315122887|ref|YP_004063376.1| hypothetical protein CKC_05715 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496289|gb|ADR52888.1| hypothetical protein CKC_05715 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 85 Score = 84.3 bits (207), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 45/90 (50%), Positives = 57/90 (63%), Gaps = 5/90 (5%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQE Y+T++EF E N+K DCLI+ K+FE+HY EQQ + IL IL + K L Sbjct: 1 MTKRQE-QYVTKQEFNEL----NAKVDCLITHCKVFERHYNEQQNDIKSILQILNTSKGL 55 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 S +K SL+AIIY L N+K W KQ Sbjct: 56 ASFIKTSGAITASLSAIIYALYNLKAWLKQ 85 >gi|315122304|ref|YP_004062793.1| hypothetical protein CKC_02780 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495706|gb|ADR52305.1| hypothetical protein CKC_02780 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 92 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 43/97 (44%), Positives = 55/97 (56%), Gaps = 12/97 (12%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFE-----KHYRE--QQKGVNEILDI 53 M KRQ D +TR+EF +SK D + QFK E K R+ QQK + EIL+I Sbjct: 1 MAKRQ-DQTVTRQEFKAL----DSKVDHIDKQFKALEARDKKKQARDEKQQKYIEEILNI 55 Query: 54 LKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 L + K L S +K I SL+AIIY + N+KGW KQ Sbjct: 56 LNTSKGLASFIKMIGAITASLSAIIYAIYNLKGWLKQ 92 >gi|315122320|ref|YP_004062809.1| hypothetical protein CKC_02860 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495722|gb|ADR52321.1| hypothetical protein CKC_02860 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 55 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 24/50 (48%), Positives = 31/50 (62%), Gaps = 1/50 (2%) Query: 40 YREQQKGVNEILDILKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFK 89 YR QQKG+ EI ++L + K L S +K SL+AIIY L N+K W K Sbjct: 6 YR-QQKGIEEIFNLLNTSKGLASFIKTSGAITASLSAIIYALYNLKNWIK 54 >gi|168186942|ref|ZP_02621577.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund] gi|169295241|gb|EDS77374.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund] Length = 970 Score = 34.3 bits (77), Expect = 6.5, Method: Compositional matrix adjust. Identities = 16/34 (47%), Positives = 22/34 (64%) Query: 32 QFKLFEKHYREQQKGVNEILDILKSVKWLFSALK 65 + K+FEK YRE Q+ +N + IL S K LF +K Sbjct: 448 EIKVFEKEYRELQRRINALTKILNSEKELFKVIK 481 >gi|167757744|ref|ZP_02429871.1| hypothetical protein CLOSCI_00075 [Clostridium scindens ATCC 35704] gi|167664626|gb|EDS08756.1| hypothetical protein CLOSCI_00075 [Clostridium scindens ATCC 35704] Length = 519 Score = 33.5 bits (75), Expect = 8.8, Method: Compositional matrix adjust. 
Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 2/58 (3%) Query: 13 EEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWLFSALKNIAIA 70 E+ IEFC N +++ L+S +EK E KG+ E+ + VK L S +A++ Sbjct: 385 EQNIEFCDNLQGQKNRLLSDIAAYEKKVAEYSKGIRELY--MDKVKGLLSESDYVAMS 440 Searching..................................................done Results from round 2 >gi|317120749|gb|ADV02571.1| hypothetical protein SC2_gp255 [Liberibacter phage SC2] gi|317120763|gb|ADV02584.1| hypothetical protein SC2_gp255 [Candidatus Liberibacter asiaticus] Length = 90 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 85/90 (94%), Positives = 87/90 (96%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 M KRQED+YITREEFIEFCTNSNSKQDCLIS KLFEKHYREQQKGVNEILDILKSVKWL Sbjct: 1 MAKRQEDYYITREEFIEFCTNSNSKQDCLISHCKLFEKHYREQQKGVNEILDILKSVKWL 60 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90 >gi|317120665|gb|ADV02488.1| hypothetical protein SC1_gp010 [Liberibacter phage SC1] gi|317120809|gb|ADV02630.1| hypothetical protein SC1_gp010 [Candidatus Liberibacter asiaticus] Length = 90 Score = 135 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 89/90 (98%), Positives = 90/90 (100%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL Sbjct: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90 >gi|317120706|gb|ADV02528.1| hypothetical protein SC2_gp010 [Liberibacter phage SC2] gi|317120767|gb|ADV02588.1| hypothetical protein SC2_gp010 [Candidatus Liberibacter asiaticus] Length = 90 Score = 133 bits (334), Expect = 8e-30, Method: Composition-based stats. 
Identities = 88/90 (97%), Positives = 89/90 (98%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQED YITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL Sbjct: 1 MTKRQEDRYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90 >gi|315121925|ref|YP_004062414.1| hypothetical protein CKC_00875 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495327|gb|ADR51926.1| hypothetical protein CKC_00875 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 85 Score = 110 bits (275), Expect = 7e-23, Method: Composition-based stats. Identities = 49/90 (54%), Positives = 70/90 (77%), Gaps = 5/90 (5%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQE Y+T++EF E ++K DC+I+ K+ E+ R+QQ+G+ EIL+IL+ +KW Sbjct: 1 MTKRQE-QYVTKQEFNEL----SAKVDCIITHLKVCERSERKQQQGIEEILNILQGLKWF 55 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 F+++KNIAI VTSL+AI+YG+ NIKGW KQ Sbjct: 56 FASIKNIAIIVTSLSAILYGVFNIKGWLKQ 85 >gi|315122887|ref|YP_004063376.1| hypothetical protein CKC_05715 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313496289|gb|ADR52888.1| hypothetical protein CKC_05715 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 85 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. 
Identities = 45/90 (50%), Positives = 57/90 (63%), Gaps = 5/90 (5%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60 MTKRQE Y+T++EF E N+K DCLI+ K+FE+HY EQQ + IL IL + K L Sbjct: 1 MTKRQE-QYVTKQEFNEL----NAKVDCLITHCKVFERHYNEQQNDIKSILQILNTSKGL 55 Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 S +K SL+AIIY L N+K W KQ Sbjct: 56 ASFIKTSGAITASLSAIIYALYNLKAWLKQ 85 >gi|315122304|ref|YP_004062793.1| hypothetical protein CKC_02780 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495706|gb|ADR52305.1| hypothetical protein CKC_02780 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 92 Score = 89.8 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 43/97 (44%), Positives = 55/97 (56%), Gaps = 12/97 (12%) Query: 1 MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFE-----KHYRE--QQKGVNEILDI 53 M KRQ D +TR+EF +SK D + QFK E K R+ QQK + EIL+I Sbjct: 1 MAKRQ-DQTVTRQEFKAL----DSKVDHIDKQFKALEARDKKKQARDEKQQKYIEEILNI 55 Query: 54 LKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90 L + K L S +K I SL+AIIY + N+KGW KQ Sbjct: 56 LNTSKGLASFIKMIGAITASLSAIIYAIYNLKGWLKQ 92 >gi|315122320|ref|YP_004062809.1| hypothetical protein CKC_02860 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495722|gb|ADR52321.1| hypothetical protein CKC_02860 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 55 Score = 68.2 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 22/48 (45%), Positives = 29/48 (60%) Query: 42 EQQKGVNEILDILKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFK 89 QQKG+ EI ++L + K L S +K SL+AIIY L N+K W K Sbjct: 7 RQQKGIEEIFNLLNTSKGLASFIKTSGAITASLSAIIYALYNLKNWIK 54 >gi|254781119|ref|YP_003065532.1| hypothetical protein CLIBASIA_05105 [Candidatus Liberibacter asiaticus str. psy62] gi|254040796|gb|ACT57592.1| hypothetical protein CLIBASIA_05105 [Candidatus Liberibacter asiaticus str. psy62] Length = 85 Score = 39.7 bits (91), Expect = 0.12, Method: Composition-based stats. 
Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 14/88 (15%) Query: 10 ITREEFIEF---CTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWLFSALK- 65 +TR EF+E T K DCLI+QF +QQ ++E IL + K + +K Sbjct: 1 MTRVEFVEMKGEVTLLKQKVDCLIAQF-------NKQQSVIDEFFTILTTAKGFTAFIKG 53 Query: 66 --NIAIAVTSLTAI-IYGLLNIKGWFKQ 90 +IA+ + S A+ + + ++ G K+ Sbjct: 54 FISIALPIGSFPALRTWIIHHVVGLLKK 81 >gi|168186942|ref|ZP_02621577.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund] gi|169295241|gb|EDS77374.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund] Length = 970 Score = 33.5 bits (75), Expect = 9.9, Method: Composition-based stats. Identities = 16/34 (47%), Positives = 22/34 (64%) Query: 32 QFKLFEKHYREQQKGVNEILDILKSVKWLFSALK 65 + K+FEK YRE Q+ +N + IL S K LF +K Sbjct: 448 EIKVFEKEYRELQRRINALTKILNSEKELFKVIK 481 Database: nr Posted date: May 13, 2011 4:10 AM Number of letters in database: 999,999,932 Number of sequences in database: 2,987,209 Database: /data/usr2/db/fasta/nr.01 Posted date: May 13, 2011 4:17 AM Number of letters in database: 999,998,956 Number of sequences in database: 2,896,973 Database: /data/usr2/db/fasta/nr.02 Posted date: May 13, 2011 4:23 AM Number of letters in database: 999,999,979 Number of sequences in database: 2,907,862 Database: /data/usr2/db/fasta/nr.03 Posted date: May 13, 2011 4:29 AM Number of letters in database: 999,999,513 Number of sequences in database: 2,932,190 Database: /data/usr2/db/fasta/nr.04 Posted date: May 13, 2011 4:33 AM Number of letters in database: 792,586,372 Number of sequences in database: 2,260,650 Lambda K H 0.317 0.132 0.352 Lambda K H 0.267 0.0402 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,333,824,318 Number of Sequences: 13984884 Number of extensions: 36820572 Number of successful extensions: 153181 Number of sequences better than 10.0: 23 Number of HSP's better than 10.0 without gapping: 21 Number of HSP's successfully gapped in prelim test: 18 Number of HSP's that 
attempted gapping in prelim test: 153147
Number of HSP's gapped (non-prelim): 40
length of query: 90
length of database: 4,792,584,752
effective HSP length: 60
effective length of query: 30
effective length of database: 3,953,491,712
effective search space: 118604751360
effective search space used: 118604751360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)
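
The search-space and bit-score figures in the statistics block above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the usual edge-effect correction (the effective HSP length is subtracted from the query once and from the database once per sequence) and the standard Karlin-Altschul conversion from raw score S to bits, (lambda * S - ln K) / ln 2. It also assumes that the first Lambda/K pair in the report is the ungapped one (used for S1) and the second is the gapped one (used for S2). The function names are hypothetical, and because the report prints Lambda and K rounded, the bit scores agree only to about one decimal place.

```python
import math


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumes the effective HSP length is removed once from the query
    and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits using the Karlin-Altschul parameters."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the report; the ungapped/gapped pairing is an assumption.
eff_query, eff_db, space = effective_search_space(
    query_len=90,
    db_len=4_792_584_752,
    n_seqs=13_984_884,
    hsp_len=60,
)
s1_bits = bit_score(41, lam=0.317, k=0.132)   # S1, first Lambda/K pair
s2_bits = bit_score(75, lam=0.267, k=0.0402)  # S2, second Lambda/K pair

print(eff_query, eff_db, space)
print(round(s1_bits, 1), round(s2_bits, 1))
```

Under these assumptions the computed effective lengths and search space match the reported values exactly, and the S1 and S2 bit scores match the reported 21.7 and 33.5 after rounding.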