BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 035469
         (62 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255581387|ref|XP_002531502.1| conserved hypothetical protein [Ricinus communis]
 gi|223528889|gb|EEF30889.1| conserved hypothetical protein [Ricinus communis]
          Length = 56

 Score = 54.7 bits (130), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/57 (52%), Positives = 35/57 (61%), Gaps = 3/57 (5%)

Query: 1  MMDHEHEEAQRKGGKRKCSMCARGF--ARTKKKGNDRSFIEGCLFALCCCWICDACF 55
          M   E +E Q   G    +  A+      TKKKG DR FIEGCLFALCCCW+C+ACF
Sbjct: 1  MPKSEKQENQPPPGYPTDTPTAKKKCCPNTKKKG-DRGFIEGCLFALCCCWLCEACF 56


>gi|297806383|ref|XP_002871075.1| hypothetical protein ARALYDRAFT_908293 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297316912|gb|EFH47334.1| hypothetical protein ARALYDRAFT_908293 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 64

 Score = 53.5 bits (127), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/31 (80%), Positives = 26/31 (83%), Gaps = 1/31 (3%)

Query: 25 FARTKKKGNDRSFIEGCLFALCCCWICDACF 55
          F  TKKKG DR FIEGCLFALCCCWIC+ CF
Sbjct: 35 FFETKKKG-DRGFIEGCLFALCCCWICEMCF 64


>gi|297739898|emb|CBI30080.3| unnamed protein product [Vitis vinifera]
          Length = 56

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/50 (56%), Positives = 31/50 (62%), Gaps = 8/50 (16%)

Query: 6  HEEAQRKGGKRKCSMCARGFARTKKKGNDRSFIEGCLFALCCCWICDACF 55
          +  A    GK  C        R+K KG DR FIEGCLFALCCCWIC+ACF
Sbjct: 15 YPPANPPTGKNCC-------PRSKSKG-DRGFIEGCLFALCCCWICEACF 56


>gi|30680382|ref|NP_196028.2| uncharacterized protein [Arabidopsis thaliana]
 gi|38603900|gb|AAR24695.1| At5g04080 [Arabidopsis thaliana]
 gi|41349906|gb|AAS00338.1| At5g04080 [Arabidopsis thaliana]
 gi|332003311|gb|AED90694.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 63

 Score = 52.4 bits (124), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/49 (57%), Positives = 33/49 (67%), Gaps = 2/49 (4%)

Query: 7  EEAQRKGGKRKCSMCARGFARTKKKGNDRSFIEGCLFALCCCWICDACF 55
          E+   + G+ K     R F  TK+KG DR FIEGCLFALCCCWIC+ CF
Sbjct: 17 EQVSEQPGQDKKKKKPR-FFETKQKG-DRGFIEGCLFALCCCWICEMCF 63


>gi|116783083|gb|ABK22787.1| unknown [Picea sitchensis]
 gi|224285591|gb|ACN40514.1| unknown [Picea sitchensis]
          Length = 66

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 27/34 (79%), Gaps = 1/34 (2%)

Query: 22 ARGFARTKKKGNDRSFIEGCLFALCCCWICDACF 55
          + G  +T+++G D+ F+EGCL ALCCCW+C+ CF
Sbjct: 34 SSGPVKTQQRG-DKGFVEGCLAALCCCWLCETCF 66


>gi|116782188|gb|ABK22401.1| unknown [Picea sitchensis]
          Length = 66

 Score = 46.6 bits (109), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 27/34 (79%), Gaps = 1/34 (2%)

Query: 22 ARGFARTKKKGNDRSFIEGCLFALCCCWICDACF 55
          + G  +T+++G D+ F+EGCL ALCCCW+C+ CF
Sbjct: 34 SSGPVKTQQRG-DKGFVEGCLAALCCCWLCETCF 66


>gi|6723429|emb|CAB66922.1| putative protein [Arabidopsis thaliana]
          Length = 651

 Score = 35.8 bits (81), Expect = 3.6,   Method: Composition-based stats.
 Identities = 12/19 (63%), Positives = 15/19 (78%)

Query: 37  FIEGCLFALCCCWICDACF 55
           F+EGCL  LCCC + +ACF
Sbjct: 633 FMEGCLAMLCCCVLLEACF 651


>gi|18402895|ref|NP_565739.1| uncharacterized protein [Arabidopsis thaliana]
 gi|4263700|gb|AAD15386.1| expressed protein [Arabidopsis thaliana]
 gi|21593591|gb|AAM65558.1| unknown [Arabidopsis thaliana]
 gi|26451539|dbj|BAC42867.1| unknown protein [Arabidopsis thaliana]
 gi|28827664|gb|AAO50676.1| unknown protein [Arabidopsis thaliana]
 gi|330253553|gb|AEC08647.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 71

 Score = 35.0 bits (79), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 21/29 (72%), Gaps = 2/29 (6%)

Query: 27 RTKKKGNDRSFIEGCLFALCCCWICDACF 55
           TK KG+   F++GCL A+CCC + DACF
Sbjct: 45 ETKSKGD--GFLKGCLAAMCCCCVLDACF 71


>gi|18402900|ref|NP_565740.1| uncharacterized protein [Arabidopsis thaliana]
 gi|13272417|gb|AAK17147.1|AF325079_1 unknown protein [Arabidopsis thaliana]
 gi|4263698|gb|AAD15384.1| expressed protein [Arabidopsis thaliana]
 gi|18389222|gb|AAL67054.1| unknown protein [Arabidopsis thaliana]
 gi|20465359|gb|AAM20083.1| unknown protein [Arabidopsis thaliana]
 gi|21618272|gb|AAM67322.1| unknown [Arabidopsis thaliana]
 gi|330253556|gb|AEC08650.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 71

 Score = 35.0 bits (79), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 21/29 (72%), Gaps = 2/29 (6%)

Query: 27 RTKKKGNDRSFIEGCLFALCCCWICDACF 55
           TK KG+   F++GCL A+CCC + DACF
Sbjct: 45 ETKSKGD--GFLKGCLAAMCCCCVLDACF 71


>gi|297822979|ref|XP_002879372.1| hypothetical protein ARALYDRAFT_482146 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297325211|gb|EFH55631.1| hypothetical protein ARALYDRAFT_482146 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 71

 Score = 35.0 bits (79), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 21/29 (72%), Gaps = 2/29 (6%)

Query: 27 RTKKKGNDRSFIEGCLFALCCCWICDACF 55
           TK KG+   F++GCL A+CCC + DACF
Sbjct: 45 ETKSKGD--GFLKGCLAAMCCCCVLDACF 71


>gi|224090007|ref|XP_002308904.1| predicted protein [Populus trichocarpa]
 gi|222854880|gb|EEE92427.1| predicted protein [Populus trichocarpa]
          Length = 56

 Score = 35.0 bits (79), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 18/22 (81%), Positives = 19/22 (86%), Gaps = 1/22 (4%)

Query: 25 FARTKKKGNDRSFIEGCLFALC 46
          F RTKKKG +R FIEGCLFALC
Sbjct: 27 FPRTKKKG-ERGFIEGCLFALC 47


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.330    0.139    0.497 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 925,653,241
Number of Sequences: 23463169
Number of extensions: 25766082
Number of successful extensions: 82164
Number of sequences better than 100.0: 145
Number of HSP's better than 100.0 without gapping: 139
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 81995
Number of HSP's gapped (non-prelim): 166
length of query: 62
length of database: 8,064,228,071
effective HSP length: 34
effective length of query: 28
effective length of database: 7,266,480,325
effective search space: 203461449100
effective search space used: 203461449100
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 69 (31.2 bits)