BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 034135
         (103 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225452450|ref|XP_002274298.1| PREDICTED: uncharacterized protein LOC100252584 [Vitis vinifera]
 gi|296087660|emb|CBI34916.3| unnamed protein product [Vitis vinifera]
          Length = 583

 Score = 88.6 bits (218), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 46/72 (63%), Positives = 55/72 (76%), Gaps = 8/72 (11%)

Query: 36 SFGRD-------RVKVRFSKK-IVRVHALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTV 87
          S+GRD          V  SKK I RV+A+SS+++S  KMNLNEYMVTLEKPLGIRFAL+ 
Sbjct: 27 SWGRDFLCFPNATAAVTSSKKAIARVYAMSSDTSSSFKMNLNEYMVTLEKPLGIRFALSA 86

Query: 88 DGKIFVHALRKG 99
          DGK+FVHAL+KG
Sbjct: 87 DGKVFVHALKKG 98


>gi|449446666|ref|XP_004141092.1| PREDICTED: phosphoglucan phosphatase LSF1, chloroplastic-like
           [Cucumis sativus]
 gi|449489459|ref|XP_004158318.1| PREDICTED: LOW QUALITY PROTEIN: phosphoglucan phosphatase LSF1,
           chloroplastic-like [Cucumis sativus]
          Length = 589

 Score = 87.4 bits (215), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/86 (55%), Positives = 62/86 (72%), Gaps = 10/86 (11%)

Query: 17  SSSFLEKNL-FSINGVPKQLSFGRD--RVKVRFSKKIVRVHALSSNSNSYLKMNLNEYMV 73
           SSSF  ++L F   GV       RD  R +  F+ ++V+V A+S  S+S  KMNLNEYMV
Sbjct: 25  SSSFWGRDLCFGNGGV-------RDGPRPRASFNGRLVKVFAMSDTSSSSFKMNLNEYMV 77

Query: 74  TLEKPLGIRFALTVDGKIFVHALRKG 99
           TLEKPLGIRFA++VDG+IFVH+L+KG
Sbjct: 78  TLEKPLGIRFAISVDGRIFVHSLKKG 103


>gi|147821809|emb|CAN61666.1| hypothetical protein VITISV_037832 [Vitis vinifera]
          Length = 538

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/50 (76%), Positives = 46/50 (92%)

Query: 50 IVRVHALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
          I RV+A+SS+++S  KMNLNEYMVTLEKPLGIRFAL+ DGK+FVHAL+KG
Sbjct: 4  IARVYAMSSDTSSSFKMNLNEYMVTLEKPLGIRFALSADGKVFVHALKKG 53


>gi|255567329|ref|XP_002524644.1| protein tyrosine, putative [Ricinus communis]
 gi|223536005|gb|EEF37663.1| protein tyrosine, putative [Ricinus communis]
          Length = 536

 Score = 82.4 bits (202), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 40/56 (71%), Positives = 46/56 (82%), Gaps = 2/56 (3%)

Query: 46  FSKKIVRVHALSSNSN--SYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
            SK+ ++VHA S N+N  S LKMNLNEYMV L+KP GIRFAL+VDGKI VHALRKG
Sbjct: 57  LSKRTLKVHATSDNTNNNSSLKMNLNEYMVNLDKPFGIRFALSVDGKIIVHALRKG 112


>gi|297828590|ref|XP_002882177.1| hypothetical protein ARALYDRAFT_340294 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328017|gb|EFH58436.1| hypothetical protein ARALYDRAFT_340294 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 765

 Score = 81.3 bits (199), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 41/62 (66%), Positives = 53/62 (85%), Gaps = 4/62 (6%)

Query: 41  RVKVRFSKK---IVRVHALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALR 97
           R KV F+++   ++RV A+SS+S+ + KMNLNEYMVTLEKPLGIRFAL+ DGKIFVHA++
Sbjct: 220 REKVGFNRRSRLVLRVVAMSSSSSPF-KMNLNEYMVTLEKPLGIRFALSADGKIFVHAIK 278

Query: 98  KG 99
           KG
Sbjct: 279 KG 280


>gi|6016710|gb|AAF01536.1|AC009325_6 unknown protein [Arabidopsis thaliana]
          Length = 716

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/45 (77%), Positives = 41/45 (91%), Gaps = 1/45 (2%)

Query: 55  ALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           A+SS+S  + KMNLNEYMVTLEKPLGIRFAL+ DGKIFVHA++KG
Sbjct: 270 AMSSSSTPF-KMNLNEYMVTLEKPLGIRFALSADGKIFVHAIKKG 313


>gi|18395843|ref|NP_566139.1| protein like SEX4 1 [Arabidopsis thaliana]
 gi|387942514|sp|F4J117.1|LSF1_ARATH RecName: Full=Phosphoglucan phosphatase LSF1, chloroplastic;
           AltName: Full=Phosphoglucan phosphatase like sex Four1;
           AltName: Full=Protein LIKE SEX4 1; Flags: Precursor
 gi|332640157|gb|AEE73678.1| protein like SEX4 1 [Arabidopsis thaliana]
          Length = 591

 Score = 74.7 bits (182), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/45 (77%), Positives = 41/45 (91%), Gaps = 1/45 (2%)

Query: 55  ALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           A+SS+S  + KMNLNEYMVTLEKPLGIRFAL+ DGKIFVHA++KG
Sbjct: 62  AMSSSSTPF-KMNLNEYMVTLEKPLGIRFALSADGKIFVHAIKKG 105


>gi|21592520|gb|AAM64470.1| unknown [Arabidopsis thaliana]
          Length = 591

 Score = 74.3 bits (181), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 35/45 (77%), Positives = 41/45 (91%), Gaps = 1/45 (2%)

Query: 55  ALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           A+SS+S  + KMNLNEYMVTLEKPLGIRFAL+ DGKIFVHA++KG
Sbjct: 62  AMSSSSTPF-KMNLNEYMVTLEKPLGIRFALSADGKIFVHAVKKG 105


>gi|356573817|ref|XP_003555052.1| PREDICTED: uncharacterized protein LOC100814503 [Glycine max]
          Length = 589

 Score = 69.3 bits (168), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 35/49 (71%), Positives = 40/49 (81%), Gaps = 2/49 (4%)

Query: 51  VRVHALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           +RV A+S +S    KMNLNEY+VTLEKPLGIRFALT DGKI VH+L KG
Sbjct: 57  LRVFAVSGSS--AFKMNLNEYLVTLEKPLGIRFALTSDGKIIVHSLTKG 103


>gi|356560597|ref|XP_003548577.1| PREDICTED: uncharacterized protein LOC100806392 [Glycine max]
          Length = 585

 Score = 68.2 bits (165), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 34/49 (69%), Positives = 40/49 (81%), Gaps = 2/49 (4%)

Query: 51 VRVHALSSNSNSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
          +RV A+S NS    KMNL+EY+VTLEKPLGIRFALT DG+I VH+L KG
Sbjct: 53 LRVFAVSGNS--AFKMNLSEYLVTLEKPLGIRFALTSDGRIIVHSLTKG 99


>gi|222640469|gb|EEE68601.1| hypothetical protein OsJ_27133 [Oryza sativa Japonica Group]
          Length = 593

 Score = 63.5 bits (153), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 24/35 (68%), Positives = 33/35 (94%)

Query: 65  KMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           +MNLNEYMV +++PLG+RFAL VDG++FVH+L+KG
Sbjct: 72  RMNLNEYMVAVDRPLGVRFALAVDGRVFVHSLKKG 106


>gi|115476188|ref|NP_001061690.1| Os08g0379300 [Oryza sativa Japonica Group]
 gi|40253458|dbj|BAD05409.1| unknown protein [Oryza sativa Japonica Group]
 gi|40253686|dbj|BAD05629.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623659|dbj|BAF23604.1| Os08g0379300 [Oryza sativa Japonica Group]
          Length = 593

 Score = 63.2 bits (152), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 24/35 (68%), Positives = 33/35 (94%)

Query: 65  KMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           +MNLNEYMV +++PLG+RFAL VDG++FVH+L+KG
Sbjct: 72  RMNLNEYMVAVDRPLGVRFALAVDGRVFVHSLKKG 106


>gi|125561375|gb|EAZ06823.1| hypothetical protein OsI_29062 [Oryza sativa Indica Group]
          Length = 598

 Score = 62.8 bits (151), Expect = 2e-08,   Method: Composition-based stats.
 Identities = 24/35 (68%), Positives = 33/35 (94%)

Query: 65  KMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           +MNLNEYMV +++PLG+RFAL VDG++FVH+L+KG
Sbjct: 77  RMNLNEYMVAVDRPLGVRFALAVDGRVFVHSLKKG 111


>gi|242078999|ref|XP_002444268.1| hypothetical protein SORBIDRAFT_07g019250 [Sorghum bicolor]
 gi|241940618|gb|EES13763.1| hypothetical protein SORBIDRAFT_07g019250 [Sorghum bicolor]
          Length = 602

 Score = 60.5 bits (145), Expect = 1e-07,   Method: Composition-based stats.
 Identities = 24/34 (70%), Positives = 31/34 (91%)

Query: 66  MNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           M LNEYMV +++PLG+RFAL VDG++FVH+LRKG
Sbjct: 83  MRLNEYMVAVDRPLGVRFALGVDGRVFVHSLRKG 116


>gi|357141264|ref|XP_003572159.1| PREDICTED: uncharacterized protein LOC100836564 [Brachypodium
           distachyon]
          Length = 603

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 22/36 (61%), Positives = 34/36 (94%)

Query: 64  LKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           ++MNLNEYMV +++PLG+RFAL V+G++FVH+L++G
Sbjct: 83  VRMNLNEYMVAVDRPLGVRFALAVNGRVFVHSLKRG 118


>gi|414870598|tpg|DAA49155.1| TPA: hypothetical protein ZEAMMB73_095755 [Zea mays]
          Length = 596

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 24/34 (70%), Positives = 31/34 (91%)

Query: 66  MNLNEYMVTLEKPLGIRFALTVDGKIFVHALRKG 99
           M LNEYMV +++PLG+RFAL VDG++FVH+LRKG
Sbjct: 77  MRLNEYMVAVDRPLGVRFALGVDGRVFVHSLRKG 110


>gi|168061634|ref|XP_001782792.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162665694|gb|EDQ52369.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 577

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Composition-based stats.
 Identities = 25/38 (65%), Positives = 32/38 (84%)

Query: 61 NSYLKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRK 98
          N+  K+NL+EYMVTLEKP+GIRFA T+ GK++V AL K
Sbjct: 46 NNKFKINLDEYMVTLEKPIGIRFAQTLSGKVYVEALAK 83


>gi|302769638|ref|XP_002968238.1| hypothetical protein SELMODRAFT_89622 [Selaginella
          moellendorffii]
 gi|300163882|gb|EFJ30492.1| hypothetical protein SELMODRAFT_89622 [Selaginella
          moellendorffii]
          Length = 581

 Score = 56.2 bits (134), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query: 64 LKMNLNEYMVTLEKPLGIRFALTVDGKIFVHALRK 98
           KMNLNEY+VT+ KPLGIRFA T+ GK+FV AL K
Sbjct: 54 FKMNLNEYLVTIHKPLGIRFAQTLSGKVFVDALAK 88


>gi|302788746|ref|XP_002976142.1| hypothetical protein SELMODRAFT_151004 [Selaginella
          moellendorffii]
 gi|300156418|gb|EFJ23047.1| hypothetical protein SELMODRAFT_151004 [Selaginella
          moellendorffii]
          Length = 547

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 24/33 (72%), Positives = 28/33 (84%)

Query: 66 MNLNEYMVTLEKPLGIRFALTVDGKIFVHALRK 98
          MNLNEY+VT+ KPLGIRFA T+ GK+FV AL K
Sbjct: 1  MNLNEYLVTIHKPLGIRFAQTLSGKVFVDALAK 33


>gi|404450148|ref|ZP_11015134.1| hypothetical protein A33Q_12515 [Indibacter alkaliphilus LW1]
 gi|403764347|gb|EJZ25252.1| hypothetical protein A33Q_12515 [Indibacter alkaliphilus LW1]
          Length = 1786

 Score = 35.4 bits (80), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 44/94 (46%), Gaps = 10/94 (10%)

Query: 7   SSRRTTSFPSSSSFLEKNLFSINGVPKQLSFGRDRVKVRFSKKIVRVHALSSNSNSYLKM 66
             RRT SF + +  LE  LFS++ V   + FG   V   ++K I         S+++L+ 
Sbjct: 513 GERRTPSFKTKAIALENGLFSVSPVNACIIFG--EVNQEWNKII--------ESHTFLQF 562

Query: 67  NLNEYMVTLEKPLGIRFALTVDGKIFVHALRKGV 100
            L  YM TL  P  I +    D ++ +    +G+
Sbjct: 563 GLFGYMPTLPDPYLINYKWRRDTRVNIQKGVEGI 596


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.133    0.368 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,403,317,136
Number of Sequences: 23463169
Number of extensions: 46001106
Number of successful extensions: 110819
Number of sequences better than 100.0: 20
Number of HSP's better than 100.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 110799
Number of HSP's gapped (non-prelim): 20
length of query: 103
length of database: 8,064,228,071
effective HSP length: 72
effective length of query: 31
effective length of database: 6,374,879,903
effective search space: 197621276993
effective search space used: 197621276993
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 69 (31.2 bits)