BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254781214|ref|YP_003065627.1| hypothetical protein
CLIBASIA_05605 [Candidatus Liberibacter asiaticus str. psy62]
         (110 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done



>gi|254781214|ref|YP_003065627.1| hypothetical protein CLIBASIA_05605 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|254040891|gb|ACT57687.1| hypothetical protein CLIBASIA_05605 [Candidatus Liberibacter
           asiaticus str. psy62]
 gi|317120679|gb|ADV02502.1| hypothetical protein SC1_gp110 [Liberibacter phage SC1]
 gi|317120823|gb|ADV02644.1| hypothetical protein SC1_gp110 [Candidatus Liberibacter asiaticus]
          Length = 110

 Score =  204 bits (519), Expect = 3e-51,   Method: Composition-based stats.
 Identities = 110/110 (100%), Positives = 110/110 (100%)

Query: 1   MSFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGK 60
           MSFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGK
Sbjct: 1   MSFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGK 60

Query: 61  KEEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFTSSSLLSGVHVR 110
           KEEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFTSSSLLSGVHVR
Sbjct: 61  KEEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFTSSSLLSGVHVR 110


>gi|197294457|ref|YP_001798998.1| hypothetical protein PAa_0382 [Candidatus Phytoplasma australiense]
 gi|171853784|emb|CAM11717.1| Conserved hypothetical protein [Candidatus Phytoplasma australiense]
          Length = 1164

 Score = 38.5 bits (88), Expect = 0.31,   Method: Composition-based stats.
 Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 8/56 (14%)

Query: 6    SSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAG--------LYSEWDTIGGFFG 53
            +++  G S  +G  +G+  G   G IG  +GGAAG        + S+WD +  +FG
Sbjct: 1108 AAINIGVSTKAGVMVGATLGSAAGPIGTFLGGAAGGVISAAVAVASKWDVVKTWFG 1163


>gi|159043458|ref|YP_001532252.1| binding-protein-dependent transport systems inner membrane
           component [Dinoroseobacter shibae DFL 12]
 gi|157911218|gb|ABV92651.1| binding-protein-dependent transport systems inner membrane
           component [Dinoroseobacter shibae DFL 12]
          Length = 244

 Score = 38.1 bits (87), Expect = 0.40,   Method: Composition-based stats.
 Identities = 15/29 (51%), Positives = 21/29 (72%)

Query: 13  SLSSGFKLGSIFGPVGGVIGGLMGGAAGL 41
           SL SGFKL +++ P+G VIG  +G + GL
Sbjct: 170 SLGSGFKLAAVYAPIGAVIGEWVGSSQGL 198


>gi|162452218|ref|YP_001614585.1| hypothetical protein sce3945 [Sorangium cellulosum 'So ce 56']
 gi|161162800|emb|CAN94105.1| putative membrane protein [Sorangium cellulosum 'So ce 56']
          Length = 87

 Score = 37.7 bits (86), Expect = 0.53,   Method: Composition-based stats.
 Identities = 16/38 (42%), Positives = 24/38 (63%)

Query: 17 GFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGS 54
          GF + ++ G  G ++GG +G A GLY  + + GGFF S
Sbjct: 28 GFVVTALLGVAGSLLGGFIGRALGLYPSYQSSGGFFTS 65


>gi|119382790|ref|YP_913846.1| binding-protein-dependent transport systems inner membrane
           component [Paracoccus denitrificans PD1222]
 gi|119372557|gb|ABL68150.1| binding-protein-dependent transport systems inner membrane
           component [Paracoccus denitrificans PD1222]
          Length = 243

 Score = 37.7 bits (86), Expect = 0.55,   Method: Composition-based stats.
 Identities = 14/29 (48%), Positives = 22/29 (75%)

Query: 13  SLSSGFKLGSIFGPVGGVIGGLMGGAAGL 41
           SL++G +L +++ P+G VIG  +GGA GL
Sbjct: 168 SLATGLRLAAVYAPIGAVIGEWVGGARGL 196


>gi|293382313|ref|ZP_06628252.1| putative M23 peptidase domain protein [Enterococcus faecalis R712]
 gi|293389425|ref|ZP_06633883.1| putative M23 peptidase domain protein [Enterococcus faecalis S613]
 gi|312907445|ref|ZP_07766436.1| peptidase, M23 family [Enterococcus faecalis DAPTO 512]
 gi|312910063|ref|ZP_07768910.1| peptidase, M23 family [Enterococcus faecalis DAPTO 516]
 gi|291080258|gb|EFE17622.1| putative M23 peptidase domain protein [Enterococcus faecalis R712]
 gi|291081312|gb|EFE18275.1| putative M23 peptidase domain protein [Enterococcus faecalis S613]
 gi|310626473|gb|EFQ09756.1| peptidase, M23 family [Enterococcus faecalis DAPTO 512]
 gi|311289336|gb|EFQ67892.1| peptidase, M23 family [Enterococcus faecalis DAPTO 516]
          Length = 1722

 Score = 37.7 bits (86), Expect = 0.57,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 28/51 (54%), Gaps = 9/51 (17%)

Query: 16  SGFKLGSI----FGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKKE 62
           +G KLG++    FGP+G VIGG++G AAG      + G   G   KE   E
Sbjct: 671 AGAKLGAVAGTTFGPIGTVIGGILGTAAG-----SSFGSVLGKKLKENWPE 716


>gi|317407404|gb|EFV87367.1| hypothetical protein HMPREF0005_05331 [Achromobacter xylosoxidans
           C54]
          Length = 1092

 Score = 37.3 bits (85), Expect = 0.65,   Method: Composition-based stats.
 Identities = 22/49 (44%), Positives = 29/49 (59%), Gaps = 3/49 (6%)

Query: 15  SSGFKLGSIFGPVGGVIGGLMGGAAG-LYSEWDTIGGFFGSSQKEGKKE 62
           S+G   GS FGP+G VIG  +G A G + SE ++ G     SQ EG+ E
Sbjct: 301 SAGVLAGSFFGPLGVVIGSQVGAAIGTVLSEQESTGT--SRSQAEGRSE 347


>gi|323969108|gb|EGB64413.1| hypothetical protein ERHG_04812 [Escherichia coli TA007]
          Length = 571

 Score = 36.9 bits (84), Expect = 0.84,   Method: Composition-based stats.
 Identities = 30/77 (38%), Positives = 38/77 (49%), Gaps = 9/77 (11%)

Query: 3   FLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAG-LYSEWDTIGGFFGSSQKEGKK 61
           F   ++K GS   SG  +GSIF     VIG L+G A G L     T+ G F S   + +K
Sbjct: 432 FSSMALKVGSYAMSGSAIGSIFP----VIGSLVGSALGALVGIIMTVAGIFTSKASKIRK 487

Query: 62  EEEGVRPLEGDELAEVR 78
            +  VR    D L EVR
Sbjct: 488 AQGKVR----DRLEEVR 500


>gi|77458888|ref|YP_348394.1| hypothetical protein Pfl01_2663 [Pseudomonas fluorescens Pf0-1]
 gi|77382891|gb|ABA74404.1| conserved hypothetical protein [Pseudomonas fluorescens Pf0-1]
          Length = 543

 Score = 36.2 bits (82), Expect = 1.5,   Method: Composition-based stats.
 Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 5/65 (7%)

Query: 2   SFLDSSVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKK 61
           SFL   +  G    SGF +G+ F  +G ++G ++GG  G+++     G F G   K+  +
Sbjct: 410 SFL---LSVGGYALSGFSIGTFFPVIGNIVGAIVGGVIGVFAA--AFGYFLGGRGKKIAE 464

Query: 62  EEEGV 66
            +E V
Sbjct: 465 AQEQV 469


>gi|271965069|ref|YP_003339265.1| hypothetical protein Sros_3588 [Streptosporangium roseum DSM 43021]
 gi|270508244|gb|ACZ86522.1| hypothetical protein Sros_3588 [Streptosporangium roseum DSM 43021]
          Length = 610

 Score = 36.2 bits (82), Expect = 1.5,   Method: Composition-based stats.
 Identities = 29/90 (32%), Positives = 38/90 (42%), Gaps = 23/90 (25%)

Query: 26  PVGGVIGGLMGGAAGLYSEWDTI-GGFFGSSQKEGKKEEEGVRPLEGDELAEVRRQESLR 84
           P GG  G L GG  G     D + GG  G +   G+  EE + PL+      +RRQ++LR
Sbjct: 29  PPGGFRGALPGGEPG-----DGLTGGEPGGAPAAGESGEEALAPLQRPAALRLRRQQALR 83

Query: 85  AY-----------------EMNRIPIPARR 97
            +                 E  R PI ARR
Sbjct: 84  QWPQQGQRGVGRGHRRDPIETGRDPIEARR 113


>gi|311105269|ref|YP_003978122.1| tape measure domain-containing protein [Achromobacter xylosoxidans
           A8]
 gi|310759958|gb|ADP15407.1| tape measure domain protein [Achromobacter xylosoxidans A8]
          Length = 782

 Score = 36.2 bits (82), Expect = 1.7,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 27/46 (58%), Gaps = 2/46 (4%)

Query: 26  PVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKKEEEGVRPLEG 71
           P+G +IG ++ G   ++  WD+I  FFG +  E KK  +G   L+G
Sbjct: 590 PLGRIIGLVLAGLLLIWQNWDSISKFFGETWDEFKK--KGKEALDG 633


>gi|156100839|ref|XP_001616113.1| hypothetical protein [Plasmodium vivax SaI-1]
 gi|148804987|gb|EDL46386.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 5707

 Score = 35.4 bits (80), Expect = 2.5,   Method: Composition-based stats.
 Identities = 22/55 (40%), Positives = 29/55 (52%), Gaps = 8/55 (14%)

Query: 10   FGSSLSSGFKLGSIFGPV----GGVIGGLMGGAAGLY-SEWDTIGGFFGSSQKEG 59
            FGS+  SG   G +FG      GG+ GG      GL+ S   + GG FGS+Q+ G
Sbjct: 2487 FGSAQQSG---GGLFGGAQQSGGGLFGGAQQSGGGLFGSAQQSGGGLFGSAQQSG 2538


>gi|78186455|ref|YP_374498.1| hypothetical protein Plut_0572 [Chlorobium luteolum DSM 273]
 gi|78166357|gb|ABB23455.1| conserved hypothetical protein [Chlorobium luteolum DSM 273]
          Length = 163

 Score = 35.4 bits (80), Expect = 2.7,   Method: Composition-based stats.
 Identities = 18/44 (40%), Positives = 26/44 (59%)

Query: 12 SSLSSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSS 55
          S+++ G   G+ FG +GG++ GLM G  GL+    T GGF   S
Sbjct: 43 SAVTLGRNAGTGFGFLGGMLCGLMTGDMGLHMLAGTTGGFMAGS 86


>gi|117920477|ref|YP_869669.1| hypothetical protein Shewana3_2032 [Shewanella sp. ANA-3]
 gi|117612809|gb|ABK48263.1| conserved hypothetical protein [Shewanella sp. ANA-3]
          Length = 542

 Score = 35.0 bits (79), Expect = 3.5,   Method: Composition-based stats.
 Identities = 37/109 (33%), Positives = 47/109 (43%), Gaps = 20/109 (18%)

Query: 7   SVKFGSSLSSGFKLGSIFGPVGGVIGGLMGGAA-----GLYSEWDTIGGFFGSSQKEGKK 61
           + + GS   SGF +GS F PV G I G +GGA      GL+       G F S +K  + 
Sbjct: 409 AFQVGSYALSGFTIGSAF-PVIGNIAGAIGGAVLGIVMGLF-------GLFTSKEKRIRN 460

Query: 62  EEEGVRPLEGDELAEVRRQESLRAYEMNRIPIPARRFT--SSSLLSGVH 108
            +  VR     E  E    E L   E N   I AR  +   S  LS V+
Sbjct: 461 AKSEVR-----EKIEAVCDEKLATIESNVAKIVARVHSEVESKALSAVN 504


>gi|317500213|ref|ZP_07958444.1| hypothetical protein HMPREF1026_00386 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|331087559|ref|ZP_08336490.1| hypothetical protein HMPREF1025_00073 [Lachnospiraceae bacterium
           3_1_46FAA]
 gi|316898384|gb|EFV20424.1| hypothetical protein HMPREF1026_00386 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|330400699|gb|EGG80303.1| hypothetical protein HMPREF1025_00073 [Lachnospiraceae bacterium
           3_1_46FAA]
          Length = 561

 Score = 35.0 bits (79), Expect = 3.7,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 9/73 (12%)

Query: 1   MSFLDSSVKFGSSLSSGFK-----LGSIFGPVGGVIGGLMGGAAG----LYSEWDTIGGF 51
           +S + +SV FG    +G K     LG  +G  GG +G ++G AAG    L+  W   GG 
Sbjct: 178 VSIVGASVLFGMGSRAGEKAGEELLGPAYGAAGGTLGTVIGAAAGLLFLLFVLWIFKGGI 237

Query: 52  FGSSQKEGKKEEE 64
               Q++  K +E
Sbjct: 238 RRQIQRDHTKGKE 250


>gi|153816464|ref|ZP_01969132.1| hypothetical protein RUMTOR_02717 [Ruminococcus torques ATCC 27756]
 gi|145846206|gb|EDK23124.1| hypothetical protein RUMTOR_02717 [Ruminococcus torques ATCC 27756]
          Length = 575

 Score = 35.0 bits (79), Expect = 3.7,   Method: Composition-based stats.
 Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 9/73 (12%)

Query: 1   MSFLDSSVKFGSSLSSGFK-----LGSIFGPVGGVIGGLMGGAAG----LYSEWDTIGGF 51
           +S + +SV FG    +G K     LG  +G  GG +G ++G AAG    L+  W   GG 
Sbjct: 192 VSIVGASVLFGMGSRAGEKAGEELLGPAYGAAGGTLGTVIGAAAGLLFLLFVLWIFKGGI 251

Query: 52  FGSSQKEGKKEEE 64
               Q++  K +E
Sbjct: 252 RRQIQRDHTKGKE 264


>gi|330983522|gb|EGH81625.1| hypothetical protein PLA107_00730 [Pseudomonas syringae pv.
           lachrymans str. M301315]
          Length = 124

 Score = 33.9 bits (76), Expect = 6.9,   Method: Composition-based stats.
 Identities = 12/28 (42%), Positives = 21/28 (75%)

Query: 16  SGFKLGSIFGPVGGVIGGLMGGAAGLYS 43
           +G  +G++ GP G  +GGL+GGA G+++
Sbjct: 86  AGVAMGAVAGPAGMAVGGLVGGAIGVFA 113


>gi|302693961|ref|XP_003036659.1| hypothetical protein SCHCODRAFT_80348 [Schizophyllum commune H4-8]
 gi|300110356|gb|EFJ01757.1| hypothetical protein SCHCODRAFT_80348 [Schizophyllum commune H4-8]
          Length = 557

 Score = 33.9 bits (76), Expect = 7.8,   Method: Composition-based stats.
 Identities = 18/40 (45%), Positives = 21/40 (52%)

Query: 15  SSGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGS 54
           + G   GS FG  G   GGL GG A   +  +T GG FGS
Sbjct: 64  NQGASTGSAFGQTGATSGGLFGGGAANSNNTNTGGGLFGS 103


>gi|91976470|ref|YP_569129.1| gene transfer agent (GTA) orfg15 [Rhodopseudomonas palustris
          BisB5]
 gi|91682926|gb|ABE39228.1| gene transfer agent (GTA) orfg15 [Rhodopseudomonas palustris
          BisB5]
          Length = 1283

 Score = 33.9 bits (76), Expect = 8.0,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 34/76 (44%), Gaps = 8/76 (10%)

Query: 16 SGFKLGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGFFGSSQKEGKKEEEGVRPLEGDELA 75
          +G  +G++FGPVG + G + G  AG   +   +GG        G +  EG R  + D +A
Sbjct: 9  AGGAVGALFGPVGAIAGRIAGALAGSMLDHALLGG--------GDRNVEGPRLADLDIMA 60

Query: 76 EVRRQESLRAYEMNRI 91
                  R Y   R+
Sbjct: 61 STEGAPIPRVYGRARL 76


>gi|167588612|ref|ZP_02381000.1| hypothetical protein BuboB_24974 [Burkholderia ubonensis Bu]
          Length = 217

 Score = 33.9 bits (76), Expect = 8.8,   Method: Composition-based stats.
 Identities = 18/43 (41%), Positives = 24/43 (55%), Gaps = 2/43 (4%)

Query: 20 LGSIFGPVGGVIGGLMGGAAGLYSEWDTIGGF-FGSSQKEGKK 61
          LG + G +G V+ GL GG AG + E+D   GF F     EG +
Sbjct: 50 LGRVLGGIGNVVAGLTGGGAG-FDEYDEGDGFEFADEADEGDE 91


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.315    0.137    0.390 

Lambda     K      H
   0.267   0.0423    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,000,104,939
Number of Sequences: 14124377
Number of extensions: 39259546
Number of successful extensions: 143221
Number of sequences better than 10.0: 138
Number of HSP's better than 10.0 without gapping: 74
Number of HSP's successfully gapped in prelim test: 64
Number of HSP's that attempted gapping in prelim test: 142807
Number of HSP's gapped (non-prelim): 374
length of query: 110
length of database: 4,842,793,630
effective HSP length: 78
effective length of query: 32
effective length of database: 3,741,092,224
effective search space: 119714951168
effective search space used: 119714951168
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 76 (33.8 bits)