BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254780492|ref|YP_003064905.1| hypothetical protein
CLIBASIA_01895 [Candidatus Liberibacter asiaticus str. psy62]
         (40 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done



>gi|254780492|ref|YP_003064905.1| hypothetical protein CLIBASIA_01895 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254780830|ref|YP_003065243.1| hypothetical protein CLIBASIA_03615 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254780880|ref|YP_003065293.1| hypothetical protein CLIBASIA_03885 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254040169|gb|ACT56965.1| hypothetical protein CLIBASIA_01895 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254040507|gb|ACT57303.1| hypothetical protein CLIBASIA_03615 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254040557|gb|ACT57353.1| hypothetical protein CLIBASIA_03885 [Candidatus Liberibacter
          asiaticus str. psy62]
          Length = 40

 Score = 64.4 bits (155), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 40/40 (100%), Positives = 40/40 (100%)

Query: 1  MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40
          MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH
Sbjct: 1  MTIDNESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40


>gi|297182451|gb|ADI18614.1| hypothetical protein [uncultured Rhodospirillales bacterium
          HF4000_24M03]
          Length = 60

 Score = 60.1 bits (144), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 21/32 (65%), Positives = 26/32 (81%)

Query: 9  QARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40
          + +KGIWWMPW  +AMKDVI C+K  GAAN+H
Sbjct: 29 KYKKGIWWMPWRQEAMKDVIGCEKPRGAANRH 60


>gi|297180501|gb|ADI16715.1| hypothetical protein [uncultured gamma proteobacterium
          HF0010_05D02]
          Length = 77

 Score = 53.2 bits (126), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 11 RKGIWWMPWHAQAMKDVICCDKLWGAANK 39
           K I WMPW ++AMKDV+ CDKL G   +
Sbjct: 2  TKRIRWMPWQSEAMKDVVACDKLRGVGKQ 30


>gi|297184727|gb|ADI20838.1| hypothetical protein [uncultured alpha proteobacterium
          EF100_102A06]
          Length = 83

 Score = 53.2 bits (126), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 19/29 (65%), Positives = 23/29 (79%)

Query: 11 RKGIWWMPWHAQAMKDVICCDKLWGAANK 39
          +KGIWWMPW  +AMKDV  C+K  GAA+K
Sbjct: 54 QKGIWWMPWCQEAMKDVTRCEKPGGAASK 82


>gi|297182112|gb|ADI18285.1| hypothetical protein [uncultured Chromatiales bacterium
          HF0200_41F04]
          Length = 56

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 9  QARKGIWWMPWHAQAMKDVICCDKLWGAANK 39
          QA K +WWMP   +AMKDV+ C+K  GA NK
Sbjct: 25 QANKRMWWMPRRQEAMKDVVVCEKPRGAGNK 55


>gi|297183883|gb|ADI20005.1| hypothetical protein [uncultured gamma proteobacterium
          EB000_65A11]
          Length = 57

 Score = 50.9 bits (120), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query: 9  QARKGIWWMPWHAQAMKDVICCDKLWGAANK 39
          QA K IWWMP   +AMKDV+ C+KL G   +
Sbjct: 7  QANKRIWWMPRQLEAMKDVVVCEKLGGGDKQ 37


>gi|51557703|gb|AAU06491.1| putative salivary protein [Culicoides sonorensis]
          Length = 46

 Score = 50.5 bits (119), Expect = 7e-05,   Method: Composition-based stats.
 Identities = 17/31 (54%), Positives = 22/31 (70%)

Query: 9  QARKGIWWMPWHAQAMKDVICCDKLWGAANK 39
          Q +K I WMPW ++AMKDV+ CDKL G   +
Sbjct: 2  QVKKRIRWMPWQSEAMKDVVACDKLRGVGKQ 32


>gi|255325736|ref|ZP_05366831.1| putative salivary protein [Corynebacterium tuberculostearicum
          SK141]
 gi|255297202|gb|EET76524.1| putative salivary protein [Corynebacterium tuberculostearicum
          SK141]
          Length = 58

 Score = 47.4 bits (111), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 17/27 (62%), Positives = 18/27 (66%)

Query: 12 KGIWWMPWHAQAMKDVICCDKLWGAAN 38
          KG WWMPWHA+ MKDV  C K  G  N
Sbjct: 32 KGAWWMPWHAEPMKDVKGCVKPRGVVN 58


>gi|297182737|gb|ADI18892.1| hypothetical protein [uncultured delta proteobacterium
          HF0010_08B07]
          Length = 68

 Score = 45.5 bits (106), Expect = 0.002,   Method: Composition-based stats.
 Identities = 15/23 (65%), Positives = 17/23 (73%)

Query: 17 MPWHAQAMKDVICCDKLWGAANK 39
          MPWH +AMKDV  CDKL GA  +
Sbjct: 1  MPWHLEAMKDVGNCDKLRGAVTQ 23


>gi|210623970|ref|ZP_03294136.1| hypothetical protein CLOHIR_02088 [Clostridium hiranonis DSM
          13275]
 gi|210153265|gb|EEA84271.1| hypothetical protein CLOHIR_02088 [Clostridium hiranonis DSM
          13275]
          Length = 49

 Score = 45.1 bits (105), Expect = 0.003,   Method: Composition-based stats.
 Identities = 20/33 (60%), Positives = 22/33 (66%)

Query: 7  SNQARKGIWWMPWHAQAMKDVICCDKLWGAANK 39
          + Q  KG   MPWH + MKDVI CDKLWG A K
Sbjct: 16 TGQVIKGAGRMPWHREPMKDVISCDKLWGVARK 48


>gi|297180674|gb|ADI16883.1| hypothetical protein [uncultured gamma proteobacterium
          HF0010_16J05]
          Length = 71

 Score = 43.2 bits (100), Expect = 0.011,   Method: Composition-based stats.
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query: 17 MPWHAQAMKDVICCDKLWGAANK 39
          MPW  +AMKDV+ CD   G   +
Sbjct: 1  MPWQLKAMKDVVACDMPRGVGKQ 23


>gi|269968958|ref|ZP_06182898.1| hypothetical protein VMC_43280 [Vibrio alginolyticus 40B]
 gi|269826431|gb|EEZ80825.1| hypothetical protein VMC_43280 [Vibrio alginolyticus 40B]
          Length = 71

 Score = 43.2 bits (100), Expect = 0.012,   Method: Composition-based stats.
 Identities = 12/24 (50%), Positives = 16/24 (66%)

Query: 17 MPWHAQAMKDVICCDKLWGAANKH 40
          MPW ++AMKDV+ CDK    +  H
Sbjct: 1  MPWQSEAMKDVLTCDKPRLGSKNH 24


>gi|297182912|gb|ADI19062.1| hypothetical protein [uncultured delta proteobacterium
          HF0070_07E19]
          Length = 69

 Score = 40.9 bits (94), Expect = 0.064,   Method: Composition-based stats.
 Identities = 17/33 (51%), Positives = 18/33 (54%)

Query: 6  ESNQARKGIWWMPWHAQAMKDVICCDKLWGAAN 38
          E     KG   MPW  +AMKDV  CDK  G AN
Sbjct: 35 EKKSNEKGTGRMPWLQEAMKDVASCDKPRGEAN 67


>gi|94502636|ref|YP_588293.1| chloroplast hypothetical protein [Zea mays subsp. mays]
 gi|40795075|gb|AAR91119.1| chloroplast hypothetical protein [Zea mays]
          Length = 121

 Score = 39.3 bits (90), Expect = 0.15,   Method: Composition-based stats.
 Identities = 15/36 (41%), Positives = 19/36 (52%)

Query: 5  NESNQARKGIWWMPWHAQAMKDVICCDKLWGAANKH 40
          N   Q RKG+ W+P H +  K V   + L G  NKH
Sbjct: 56 NGEVQKRKGLRWIPRHPETRKGVASDEMLRGVENKH 91


>gi|222874672|gb|EEF11803.1| predicted protein [Populus trichocarpa]
          Length = 64

 Score = 34.7 bits (78), Expect = 4.0,   Method: Composition-based stats.
 Identities = 13/31 (41%), Positives = 18/31 (58%)

Query: 9  QARKGIWWMPWHAQAMKDVICCDKLWGAANK 39
          Q RKG+ W+P H +  K V+  + L G  NK
Sbjct: 25 QTRKGLRWIPRHPETRKGVVSDEMLRGVENK 55


>gi|222836335|gb|EEE74742.1| predicted protein [Populus trichocarpa]
          Length = 61

 Score = 33.6 bits (75), Expect = 8.4,   Method: Composition-based stats.
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 9  QARKGIWWMPWHAQAMKDVICCDKLWGAANK 39
          Q RKG+ W+P H +  K ++  + L G  NK
Sbjct: 25 QTRKGLRWIPRHPETRKGIVSDEMLRGVENK 55


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.306    0.128    0.440 

Lambda     K      H
   0.267   0.0385    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 398,052,681
Number of Sequences: 14124377
Number of extensions: 7782552
Number of successful extensions: 17889
Number of sequences better than 10.0: 16
Number of HSP's better than 10.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 17873
Number of HSP's gapped (non-prelim): 16
length of query: 40
length of database: 4,842,793,630
effective HSP length: 14
effective length of query: 26
effective length of database: 4,645,052,352
effective search space: 120771361152
effective search space used: 120771361152
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.1 bits)
S2: 75 (33.6 bits)