BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= 537021.9.peg.1140_1
         (90 letters)

Database: nr 
           13,984,884 sequences; 4,792,584,752 total letters

Searching..................................................done


Results from round 1


>gi|317120665|gb|ADV02488.1| hypothetical protein SC1_gp010 [Liberibacter phage SC1]
 gi|317120809|gb|ADV02630.1| hypothetical protein SC1_gp010 [Candidatus Liberibacter
          asiaticus]
          Length = 90

 Score =  184 bits (467), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 89/90 (98%), Positives = 90/90 (100%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL
Sbjct: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ
Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90


>gi|317120706|gb|ADV02528.1| hypothetical protein SC2_gp010 [Liberibacter phage SC2]
 gi|317120767|gb|ADV02588.1| hypothetical protein SC2_gp010 [Candidatus Liberibacter
          asiaticus]
          Length = 90

 Score =  181 bits (460), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 88/90 (97%), Positives = 89/90 (98%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQED YITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL
Sbjct: 1  MTKRQEDRYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ
Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90


>gi|317120749|gb|ADV02571.1| hypothetical protein SC2_gp255 [Liberibacter phage SC2]
 gi|317120763|gb|ADV02584.1| hypothetical protein SC2_gp255 [Candidatus Liberibacter
          asiaticus]
          Length = 90

 Score =  175 bits (443), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 85/90 (94%), Positives = 87/90 (96%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          M KRQED+YITREEFIEFCTNSNSKQDCLIS  KLFEKHYREQQKGVNEILDILKSVKWL
Sbjct: 1  MAKRQEDYYITREEFIEFCTNSNSKQDCLISHCKLFEKHYREQQKGVNEILDILKSVKWL 60

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ
Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90


>gi|315121925|ref|YP_004062414.1| hypothetical protein CKC_00875 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313495327|gb|ADR51926.1| hypothetical protein CKC_00875 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 85

 Score =  101 bits (252), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 49/90 (54%), Positives = 70/90 (77%), Gaps = 5/90 (5%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQE  Y+T++EF E     ++K DC+I+  K+ E+  R+QQ+G+ EIL+IL+ +KW 
Sbjct: 1  MTKRQE-QYVTKQEFNEL----SAKVDCIITHLKVCERSERKQQQGIEEILNILQGLKWF 55

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          F+++KNIAI VTSL+AI+YG+ NIKGW KQ
Sbjct: 56 FASIKNIAIIVTSLSAILYGVFNIKGWLKQ 85


>gi|315122887|ref|YP_004063376.1| hypothetical protein CKC_05715 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313496289|gb|ADR52888.1| hypothetical protein CKC_05715 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 85

 Score = 84.3 bits (207), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 57/90 (63%), Gaps = 5/90 (5%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQE  Y+T++EF E     N+K DCLI+  K+FE+HY EQQ  +  IL IL + K L
Sbjct: 1  MTKRQE-QYVTKQEFNEL----NAKVDCLITHCKVFERHYNEQQNDIKSILQILNTSKGL 55

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
           S +K       SL+AIIY L N+K W KQ
Sbjct: 56 ASFIKTSGAITASLSAIIYALYNLKAWLKQ 85


>gi|315122304|ref|YP_004062793.1| hypothetical protein CKC_02780 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313495706|gb|ADR52305.1| hypothetical protein CKC_02780 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 92

 Score = 63.2 bits (152), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 43/97 (44%), Positives = 55/97 (56%), Gaps = 12/97 (12%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFE-----KHYRE--QQKGVNEILDI 53
          M KRQ D  +TR+EF       +SK D +  QFK  E     K  R+  QQK + EIL+I
Sbjct: 1  MAKRQ-DQTVTRQEFKAL----DSKVDHIDKQFKALEARDKKKQARDEKQQKYIEEILNI 55

Query: 54 LKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          L + K L S +K I     SL+AIIY + N+KGW KQ
Sbjct: 56 LNTSKGLASFIKMIGAITASLSAIIYAIYNLKGWLKQ 92


>gi|315122320|ref|YP_004062809.1| hypothetical protein CKC_02860 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313495722|gb|ADR52321.1| hypothetical protein CKC_02860 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 55

 Score = 43.9 bits (102), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 40 YREQQKGVNEILDILKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFK 89
          YR QQKG+ EI ++L + K L S +K       SL+AIIY L N+K W K
Sbjct: 6  YR-QQKGIEEIFNLLNTSKGLASFIKTSGAITASLSAIIYALYNLKNWIK 54


>gi|168186942|ref|ZP_02621577.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund]
 gi|169295241|gb|EDS77374.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund]
          Length = 970

 Score = 34.3 bits (77), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 32  QFKLFEKHYREQQKGVNEILDILKSVKWLFSALK 65
           + K+FEK YRE Q+ +N +  IL S K LF  +K
Sbjct: 448 EIKVFEKEYRELQRRINALTKILNSEKELFKVIK 481


>gi|167757744|ref|ZP_02429871.1| hypothetical protein CLOSCI_00075 [Clostridium scindens ATCC 35704]
 gi|167664626|gb|EDS08756.1| hypothetical protein CLOSCI_00075 [Clostridium scindens ATCC 35704]
          Length = 519

 Score = 33.5 bits (75), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 13  EEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWLFSALKNIAIA 70
           E+ IEFC N   +++ L+S    +EK   E  KG+ E+   +  VK L S    +A++
Sbjct: 385 EQNIEFCDNLQGQKNRLLSDIAAYEKKVAEYSKGIRELY--MDKVKGLLSESDYVAMS 440


Searching..................................................done


Results from round 2




>gi|317120749|gb|ADV02571.1| hypothetical protein SC2_gp255 [Liberibacter phage SC2]
 gi|317120763|gb|ADV02584.1| hypothetical protein SC2_gp255 [Candidatus Liberibacter
          asiaticus]
          Length = 90

 Score =  135 bits (339), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 85/90 (94%), Positives = 87/90 (96%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          M KRQED+YITREEFIEFCTNSNSKQDCLIS  KLFEKHYREQQKGVNEILDILKSVKWL
Sbjct: 1  MAKRQEDYYITREEFIEFCTNSNSKQDCLISHCKLFEKHYREQQKGVNEILDILKSVKWL 60

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ
Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90


>gi|317120665|gb|ADV02488.1| hypothetical protein SC1_gp010 [Liberibacter phage SC1]
 gi|317120809|gb|ADV02630.1| hypothetical protein SC1_gp010 [Candidatus Liberibacter
          asiaticus]
          Length = 90

 Score =  135 bits (339), Expect = 2e-30,   Method: Composition-based stats.
 Identities = 89/90 (98%), Positives = 90/90 (100%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL
Sbjct: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ
Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90


>gi|317120706|gb|ADV02528.1| hypothetical protein SC2_gp010 [Liberibacter phage SC2]
 gi|317120767|gb|ADV02588.1| hypothetical protein SC2_gp010 [Candidatus Liberibacter
          asiaticus]
          Length = 90

 Score =  133 bits (334), Expect = 8e-30,   Method: Composition-based stats.
 Identities = 88/90 (97%), Positives = 89/90 (98%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQED YITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL
Sbjct: 1  MTKRQEDRYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          FSALKNIAIAVTSLTAIIYG+LNIKGWFKQ
Sbjct: 61 FSALKNIAIAVTSLTAIIYGILNIKGWFKQ 90


>gi|315121925|ref|YP_004062414.1| hypothetical protein CKC_00875 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313495327|gb|ADR51926.1| hypothetical protein CKC_00875 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 85

 Score =  110 bits (275), Expect = 7e-23,   Method: Composition-based stats.
 Identities = 49/90 (54%), Positives = 70/90 (77%), Gaps = 5/90 (5%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQE  Y+T++EF E     ++K DC+I+  K+ E+  R+QQ+G+ EIL+IL+ +KW 
Sbjct: 1  MTKRQE-QYVTKQEFNEL----SAKVDCIITHLKVCERSERKQQQGIEEILNILQGLKWF 55

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          F+++KNIAI VTSL+AI+YG+ NIKGW KQ
Sbjct: 56 FASIKNIAIIVTSLSAILYGVFNIKGWLKQ 85


>gi|315122887|ref|YP_004063376.1| hypothetical protein CKC_05715 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313496289|gb|ADR52888.1| hypothetical protein CKC_05715 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 85

 Score =  109 bits (272), Expect = 1e-22,   Method: Composition-based stats.
 Identities = 45/90 (50%), Positives = 57/90 (63%), Gaps = 5/90 (5%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWL 60
          MTKRQE  Y+T++EF E     N+K DCLI+  K+FE+HY EQQ  +  IL IL + K L
Sbjct: 1  MTKRQE-QYVTKQEFNEL----NAKVDCLITHCKVFERHYNEQQNDIKSILQILNTSKGL 55

Query: 61 FSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
           S +K       SL+AIIY L N+K W KQ
Sbjct: 56 ASFIKTSGAITASLSAIIYALYNLKAWLKQ 85


>gi|315122304|ref|YP_004062793.1| hypothetical protein CKC_02780 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313495706|gb|ADR52305.1| hypothetical protein CKC_02780 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 92

 Score = 89.8 bits (221), Expect = 1e-16,   Method: Composition-based stats.
 Identities = 43/97 (44%), Positives = 55/97 (56%), Gaps = 12/97 (12%)

Query: 1  MTKRQEDHYITREEFIEFCTNSNSKQDCLISQFKLFE-----KHYRE--QQKGVNEILDI 53
          M KRQ D  +TR+EF       +SK D +  QFK  E     K  R+  QQK + EIL+I
Sbjct: 1  MAKRQ-DQTVTRQEFKAL----DSKVDHIDKQFKALEARDKKKQARDEKQQKYIEEILNI 55

Query: 54 LKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFKQ 90
          L + K L S +K I     SL+AIIY + N+KGW KQ
Sbjct: 56 LNTSKGLASFIKMIGAITASLSAIIYAIYNLKGWLKQ 92


>gi|315122320|ref|YP_004062809.1| hypothetical protein CKC_02860 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313495722|gb|ADR52321.1| hypothetical protein CKC_02860 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 55

 Score = 68.2 bits (165), Expect = 3e-10,   Method: Composition-based stats.
 Identities = 22/48 (45%), Positives = 29/48 (60%)

Query: 42 EQQKGVNEILDILKSVKWLFSALKNIAIAVTSLTAIIYGLLNIKGWFK 89
           QQKG+ EI ++L + K L S +K       SL+AIIY L N+K W K
Sbjct: 7  RQQKGIEEIFNLLNTSKGLASFIKTSGAITASLSAIIYALYNLKNWIK 54


>gi|254781119|ref|YP_003065532.1| hypothetical protein CLIBASIA_05105 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254040796|gb|ACT57592.1| hypothetical protein CLIBASIA_05105 [Candidatus Liberibacter
          asiaticus str. psy62]
          Length = 85

 Score = 39.7 bits (91), Expect = 0.12,   Method: Composition-based stats.
 Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 14/88 (15%)

Query: 10 ITREEFIEF---CTNSNSKQDCLISQFKLFEKHYREQQKGVNEILDILKSVKWLFSALK- 65
          +TR EF+E     T    K DCLI+QF        +QQ  ++E   IL + K   + +K 
Sbjct: 1  MTRVEFVEMKGEVTLLKQKVDCLIAQF-------NKQQSVIDEFFTILTTAKGFTAFIKG 53

Query: 66 --NIAIAVTSLTAI-IYGLLNIKGWFKQ 90
            +IA+ + S  A+  + + ++ G  K+
Sbjct: 54 FISIALPIGSFPALRTWIIHHVVGLLKK 81


>gi|168186942|ref|ZP_02621577.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund]
 gi|169295241|gb|EDS77374.1| DNA gyrase subunit A [Clostridium botulinum C str. Eklund]
          Length = 970

 Score = 33.5 bits (75), Expect = 9.9,   Method: Composition-based stats.
 Identities = 16/34 (47%), Positives = 22/34 (64%)

Query: 32  QFKLFEKHYREQQKGVNEILDILKSVKWLFSALK 65
           + K+FEK YRE Q+ +N +  IL S K LF  +K
Sbjct: 448 EIKVFEKEYRELQRRINALTKILNSEKELFKVIK 481


  Database: nr
    Posted date:  May 13, 2011  4:10 AM
  Number of letters in database: 999,999,932
  Number of sequences in database:  2,987,209
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 13, 2011  4:17 AM
  Number of letters in database: 999,998,956
  Number of sequences in database:  2,896,973
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 13, 2011  4:23 AM
  Number of letters in database: 999,999,979
  Number of sequences in database:  2,907,862
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 13, 2011  4:29 AM
  Number of letters in database: 999,999,513
  Number of sequences in database:  2,932,190
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 13, 2011  4:33 AM
  Number of letters in database: 792,586,372
  Number of sequences in database:  2,260,650
  
Lambda     K      H
   0.317    0.132    0.352 

Lambda     K      H
   0.267   0.0402    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,333,824,318
Number of Sequences: 13984884
Number of extensions: 36820572
Number of successful extensions: 153181
Number of sequences better than 10.0: 23
Number of HSP's better than 10.0 without gapping: 21
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 153147
Number of HSP's gapped (non-prelim): 40
length of query: 90
length of database: 4,792,584,752
effective HSP length: 60
effective length of query: 30
effective length of database: 3,953,491,712
effective search space: 118604751360
effective search space used: 118604751360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)