BLASTP 2.2.22 [Sep-27-2009]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= gi|254780699|ref|YP_003065112.1| hypothetical protein
CLIBASIA_02930 [Candidatus Liberibacter asiaticus str. psy62]
         (97 letters)

Database: nr 
           14,124,377 sequences; 4,842,793,630 total letters

Searching..................................................done



>gi|254780699|ref|YP_003065112.1| hypothetical protein CLIBASIA_02930 [Candidatus Liberibacter
          asiaticus str. psy62]
 gi|254040376|gb|ACT57172.1| hypothetical protein CLIBASIA_02930 [Candidatus Liberibacter
          asiaticus str. psy62]
          Length = 97

 Score =  174 bits (442), Expect = 3e-42,   Method: Composition-based stats.
 Identities = 97/97 (100%), Positives = 97/97 (100%)

Query: 1  MLFDNQFIENILKEDREDNNIKSLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSR 60
          MLFDNQFIENILKEDREDNNIKSLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSR
Sbjct: 1  MLFDNQFIENILKEDREDNNIKSLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSR 60

Query: 61 AWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKNKKI 97
          AWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKNKKI
Sbjct: 61 AWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKNKKI 97


>gi|315121864|ref|YP_004062353.1| hypothetical protein CKC_00570 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
 gi|313495266|gb|ADR51865.1| hypothetical protein CKC_00570 [Candidatus Liberibacter
          solanacearum CLso-ZC1]
          Length = 101

 Score =  116 bits (291), Expect = 9e-25,   Method: Composition-based stats.
 Identities = 61/95 (64%), Positives = 81/95 (85%), Gaps = 1/95 (1%)

Query: 1  MLFDNQFIENILKEDREDNNIKSLAQ-LQRIAIRTVLANRTKNIKSITKEFMEYWIAHSS 59
          M FDNQF+ENILK+ R++N  KSL+Q LQ IA+  +LANRTKNIKSITKEF+EYW+ ++S
Sbjct: 1  MAFDNQFLENILKKSRKENLNKSLSQALQGIALHAILANRTKNIKSITKEFIEYWMTYNS 60

Query: 60 RAWRTSKPRTYLNLHIASQSKKIPVSIYIGNIQKN 94
          RAWR+S+ R YLN+HI+S+ KK+P+ I +GNI+KN
Sbjct: 61 RAWRSSQARAYLNIHISSRGKKVPIYISLGNIEKN 95


>gi|222148620|ref|YP_002549577.1| hypothetical protein Avi_2192 [Agrobacterium vitis S4]
 gi|221735606|gb|ACM36569.1| conserved hypothetical protein [Agrobacterium vitis S4]
          Length = 95

 Score = 43.9 bits (102), Expect = 0.008,   Method: Composition-based stats.
 Identities = 22/57 (38%), Positives = 32/57 (56%), Gaps = 2/57 (3%)

Query: 23 SLAQLQRIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQS 79
          SL  L   A+R  LA+     ++ TKE M   + H +RAWR S+P   ++LH+A  S
Sbjct: 31 SLNPLHEAAMR--LADGINKPRARTKELMSLLLCHGARAWRYSQPEANIHLHVAPHS 85


>gi|15888831|ref|NP_354512.1| hypothetical protein Atu1506 [Agrobacterium tumefaciens str. C58]
 gi|15156591|gb|AAK87297.1| hypothetical protein Atu1506 [Agrobacterium tumefaciens str. C58]
          Length = 86

 Score = 43.2 bits (100), Expect = 0.014,   Method: Composition-based stats.
 Identities = 15/45 (33%), Positives = 30/45 (66%)

Query: 44 KSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSIYI 88
          K+ T++ +   ++H +RAWR ++P   ++LH+A +S + P+ I I
Sbjct: 41 KAKTRDLVSLLLSHGARAWRANQPEARIHLHVAHRSGRAPIHIRI 85


>gi|222086065|ref|YP_002544597.1| hypothetical protein Arad_2493 [Agrobacterium radiobacter K84]
 gi|221723513|gb|ACM26669.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
          Length = 91

 Score = 42.0 bits (97), Expect = 0.025,   Method: Composition-based stats.
 Identities = 18/51 (35%), Positives = 32/51 (62%), Gaps = 1/51 (1%)

Query: 38 NRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSIYI 88
          NR+++ +S TK+ +   + H +RAWR S+P   + LHI S + + PV + +
Sbjct: 41 NRSRS-RSKTKDLIGILLCHGARAWRYSQPEANIRLHITSPNGQAPVQLRV 90


>gi|325292860|ref|YP_004278724.1| hypothetical protein AGROH133_06134 [Agrobacterium sp. H13-3]
 gi|325060713|gb|ADY64404.1| hypothetical protein AGROH133_06134 [Agrobacterium sp. H13-3]
          Length = 86

 Score = 42.0 bits (97), Expect = 0.031,   Method: Composition-based stats.
 Identities = 15/45 (33%), Positives = 29/45 (64%)

Query: 44 KSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSIYI 88
          K+ T++ +   ++H +RAWR ++P   ++LH+  +S + PV I I
Sbjct: 41 KAKTRDLVSLLLSHGARAWRANQPEANIHLHVGRRSGRAPVHIRI 85


>gi|15965163|ref|NP_385516.1| hypothetical protein SMc01068 [Sinorhizobium meliloti 1021]
 gi|307309176|ref|ZP_07588847.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C]
 gi|307321506|ref|ZP_07600901.1| conserved hypothetical protein [Sinorhizobium meliloti AK83]
 gi|15074343|emb|CAC45989.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021]
 gi|306892813|gb|EFN23604.1| conserved hypothetical protein [Sinorhizobium meliloti AK83]
 gi|306900322|gb|EFN30938.1| conserved hypothetical protein [Sinorhizobium meliloti BL225C]
          Length = 89

 Score = 40.9 bits (94), Expect = 0.065,   Method: Composition-based stats.
 Identities = 14/43 (32%), Positives = 29/43 (67%)

Query: 44 KSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86
          K+ T++ +   + H +RAWR+++PR  ++LH+A+  +  P+ I
Sbjct: 44 KAKTRDLVGMLLTHGARAWRSTQPRAGIHLHVAAPGRSRPLRI 86


>gi|150396260|ref|YP_001326727.1| hypothetical protein Smed_1039 [Sinorhizobium medicae WSM419]
 gi|150027775|gb|ABR59892.1| conserved hypothetical protein [Sinorhizobium medicae WSM419]
          Length = 89

 Score = 39.3 bits (90), Expect = 0.18,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 6/75 (8%)

Query: 15 DREDNNIK-SLAQLQRIAIRT--VLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYL 71
          DR+ N  +  L  L   A+R   +  NR+K   + T++ +   + H +RAWR+++PR  +
Sbjct: 15 DRKANPARVRLHPLHEAAMRIADIGLNRSK---AKTRDLVGMLLTHGARAWRSTQPRADI 71

Query: 72 NLHIASQSKKIPVSI 86
          +LH+ +  +  P+ I
Sbjct: 72 HLHVTALGRSHPLRI 86


>gi|209549454|ref|YP_002281371.1| hypothetical protein Rleg2_1860 [Rhizobium leguminosarum bv.
          trifolii WSM2304]
 gi|241204781|ref|YP_002975877.1| hypothetical protein Rleg_2060 [Rhizobium leguminosarum bv.
          trifolii WSM1325]
 gi|209535210|gb|ACI55145.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
          trifolii WSM2304]
 gi|240858671|gb|ACS56338.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
          trifolii WSM1325]
          Length = 91

 Score = 39.3 bits (90), Expect = 0.20,   Method: Composition-based stats.
 Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 2/66 (3%)

Query: 23 SLAQLQRIAIR--TVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSK 80
          SL  L   A+R   +   R K     T++ +   + H +RAWR S+P   ++LH+ S   
Sbjct: 23 SLHPLHEAAMRLAEIGLQRPKAKSPKTRDLINLLLCHGARAWRYSQPEARIHLHVTSPDG 82

Query: 81 KIPVSI 86
            PV +
Sbjct: 83 SAPVQL 88


>gi|86357828|ref|YP_469720.1| hypothetical protein RHE_CH02212 [Rhizobium etli CFN 42]
 gi|86281930|gb|ABC90993.1| hypothetical conserved protein [Rhizobium etli CFN 42]
          Length = 91

 Score = 38.9 bits (89), Expect = 0.21,   Method: Composition-based stats.
 Identities = 19/66 (28%), Positives = 33/66 (50%), Gaps = 2/66 (3%)

Query: 23 SLAQLQRIAIR--TVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSK 80
          SL  L   A+R   +   R +   + T++ +   + H +RAWR S+P   ++LH+ S   
Sbjct: 23 SLHPLHEAALRLAEIGLQRPRAKSAKTRDLINLLLCHGARAWRYSQPEARIHLHVTSPDG 82

Query: 81 KIPVSI 86
            PV +
Sbjct: 83 SAPVQL 88


>gi|116252288|ref|YP_768126.1| hypothetical protein RL2542 [Rhizobium leguminosarum bv. viciae
          3841]
 gi|115256936|emb|CAK08030.1| conserved hypothetical protein [Rhizobium leguminosarum bv.
          viciae 3841]
          Length = 91

 Score = 38.5 bits (88), Expect = 0.32,   Method: Composition-based stats.
 Identities = 20/66 (30%), Positives = 32/66 (48%), Gaps = 2/66 (3%)

Query: 23 SLAQLQRIAIR--TVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSK 80
          SL  L   A+R   +   R K     T++ +   + H +RAWR S+P   ++LH+ S   
Sbjct: 23 SLHPLHEAALRLAEIGLPRPKAKSPKTRDLINLLLCHGARAWRYSQPEARIHLHVTSPDG 82

Query: 81 KIPVSI 86
            PV +
Sbjct: 83 SAPVQL 88


>gi|218515844|ref|ZP_03512684.1| hypothetical protein Retl8_20236 [Rhizobium etli 8C-3]
          Length = 82

 Score = 38.2 bits (87), Expect = 0.37,   Method: Composition-based stats.
 Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 29 RIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86
          R+A   +   R K+ K  T++ +   + H +RAWR S+P   ++LH+ S     PV +
Sbjct: 24 RLAEMGLQRPRAKSAK--TRDLINLLLCHGARAWRYSQPEARIHLHVTSPDGSAPVQL 79


>gi|218462488|ref|ZP_03502579.1| hypothetical protein RetlK5_24823 [Rhizobium etli Kim 5]
 gi|218674959|ref|ZP_03524628.1| hypothetical protein RetlG_27770 [Rhizobium etli GR56]
          Length = 91

 Score = 38.2 bits (87), Expect = 0.37,   Method: Composition-based stats.
 Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 29 RIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86
          R+A   +   R K+ K  T++ +   + H +RAWR S+P   ++LH+ S     PV +
Sbjct: 33 RLAEMGLQRPRAKSAK--TRDLINLLLCHGARAWRYSQPEARIHLHVTSPDGSAPVQL 88


>gi|190891892|ref|YP_001978434.1| hypothetical protein RHECIAT_CH0002301 [Rhizobium etli CIAT 652]
 gi|190697171|gb|ACE91256.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
          Length = 91

 Score = 38.2 bits (87), Expect = 0.37,   Method: Composition-based stats.
 Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 2/58 (3%)

Query: 29 RIAIRTVLANRTKNIKSITKEFMEYWIAHSSRAWRTSKPRTYLNLHIASQSKKIPVSI 86
          R+A   +   R K+ K  T++ +   + H +RAWR S+P   ++LH+ S     PV +
Sbjct: 33 RLAEMGLQRPRAKSAK--TRDLINLLLCHGARAWRYSQPEARIHLHVTSPDGSAPVQL 88


>gi|292615491|ref|XP_686042.3| PREDICTED: c-type mannose receptor 2-like isoform 1 [Danio rerio]
          Length = 368

 Score = 35.5 bits (80), Expect = 2.3,   Method: Composition-based stats.
 Identities = 18/40 (45%), Positives = 26/40 (65%), Gaps = 2/40 (5%)

Query: 58  SSRAWRTSKPRTYLNLHIAS--QSKKIPVSIYIGNIQKNK 95
           S R WRT +PR+  N H A+  Q+ +   SIY+ NI+KN+
Sbjct: 202 SFRYWRTGEPRSQTNNHRAAVEQTSQGQWSIYMYNIEKNR 241


  Database: nr
    Posted date:  May 22, 2011 12:22 AM
  Number of letters in database: 999,999,966
  Number of sequences in database:  2,987,313
  
  Database: /data/usr2/db/fasta/nr.01
    Posted date:  May 22, 2011 12:30 AM
  Number of letters in database: 999,999,796
  Number of sequences in database:  2,903,041
  
  Database: /data/usr2/db/fasta/nr.02
    Posted date:  May 22, 2011 12:36 AM
  Number of letters in database: 999,999,281
  Number of sequences in database:  2,904,016
  
  Database: /data/usr2/db/fasta/nr.03
    Posted date:  May 22, 2011 12:41 AM
  Number of letters in database: 999,999,960
  Number of sequences in database:  2,935,328
  
  Database: /data/usr2/db/fasta/nr.04
    Posted date:  May 22, 2011 12:46 AM
  Number of letters in database: 842,794,627
  Number of sequences in database:  2,394,679
  
Lambda     K      H
   0.319    0.131    0.367 

Lambda     K      H
   0.267   0.0402    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 779,133,230
Number of Sequences: 14124377
Number of extensions: 22359681
Number of successful extensions: 63904
Number of sequences better than 10.0: 16
Number of HSP's better than 10.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 63889
Number of HSP's gapped (non-prelim): 16
length of query: 97
length of database: 4,842,793,630
effective HSP length: 66
effective length of query: 31
effective length of database: 3,910,584,748
effective search space: 121228127188
effective search space used: 121228127188
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 75 (33.5 bits)
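
Note on the statistics footer: the effective lengths and the bit values printed beside X1, X2, X3, S1 and S2 can be re-derived from the other numbers in the footer. The sketch below is a minimal check, not BLAST's own code. It assumes the first Lambda/K/H block holds the ungapped parameters and the second the gapped ones, that X1 and S1 use the ungapped set while X2, X3 and S2 use the gapped set, and that bits = (Lambda * raw - ln K) / ln 2 for score thresholds and Lambda * raw / ln 2 for drop-off values. All function and constant names are hypothetical. Lambda and K are used as printed (rounded), so results are compared to one decimal place only.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 97
DB_LETTERS = 4_842_793_630
DB_SEQUENCES = 14_124_377
EFFECTIVE_HSP_LENGTH = 66

# Assumption: first Lambda/K/H block is ungapped, second is gapped.
UNGAPPED_LAMBDA, UNGAPPED_K = 0.319, 0.131
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0402


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # One HSP length is subtracted per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def threshold_bits(raw_score, lam, k):
    """Convert a raw score threshold (S1, S2) to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert a raw X-dropoff value (X1, X2, X3) to bits."""
    return lam * raw_dropoff / math.log(2)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print("effective length of query:", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:", space)
    print("X1 bits:", round(dropoff_bits(16, UNGAPPED_LAMBDA), 1))
    print("X2 bits:", round(dropoff_bits(38, GAPPED_LAMBDA), 1))
    print("X3 bits:", round(dropoff_bits(64, GAPPED_LAMBDA), 1))
    print("S1 bits:", round(threshold_bits(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1))
    print("S2 bits:", round(threshold_bits(75, GAPPED_LAMBDA, GAPPED_K), 1))
```

Under these assumptions the script reproduces the footer values (31, 3,910,584,748 and 121228127188, plus the five bit values). The per-alignment bit scores are not checked here: with composition-based statistics the scoring parameters may be adjusted per subject sequence, so the rounded footer constants need not reproduce them exactly.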