BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 048001
         (48 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|147767796|emb|CAN76054.1| hypothetical protein VITISV_036406 [Vitis vinifera]
          Length = 289

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Composition-based stats.
 Identities = 32/36 (88%), Positives = 36/36 (100%)

Query: 1  MYIAITLKRYRAVKEVGKIKMSVGVIAYYQLVQVCQ 36
          MYIA+TLKRYRAVKEVGKIKMSVG+IA+YQL+QVCQ
Sbjct: 1  MYIAVTLKRYRAVKEVGKIKMSVGIIAHYQLMQVCQ 36


>gi|186479051|ref|NP_001117386.1| conserved peptide upstream open reading frame 35 [Arabidopsis
          thaliana]
 gi|332193034|gb|AEE31155.1| conserved peptide upstream open reading frame 35 [Arabidopsis
          thaliana]
          Length = 48

 Score = 58.2 bits (139), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/48 (60%), Positives = 33/48 (68%)

Query: 1  MYIAITLKRYRAVKEVGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          M I    K  R++KE+G   M+   IA YQ VQVCQAEYFRQLLKPVT
Sbjct: 1  MCIVGNKKGNRSLKEIGTFMMTTCFIANYQSVQVCQAEYFRQLLKPVT 48


>gi|356524812|ref|XP_003531022.1| PREDICTED: uncharacterized protein LOC100817942 [Glycine max]
          Length = 53

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/49 (51%), Positives = 31/49 (63%), Gaps = 4/49 (8%)

Query: 4  AITLKRYRAVKE----VGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          A +  R+RA+K      GK  + V VIA +Q +  CQAEYFR LLKPVT
Sbjct: 5  AASQTRFRALKHEYGIDGKATIIVRVIACFQPLHFCQAEYFRHLLKPVT 53


>gi|357489915|ref|XP_003615245.1| Transcription factor bHLH143 [Medicago truncatula]
 gi|355516580|gb|AES98203.1| Transcription factor bHLH143 [Medicago truncatula]
          Length = 431

 Score = 45.4 bits (106), Expect = 0.004,   Method: Composition-based stats.
 Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)

Query: 4  AITLKRYRAVKE----VGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          A +  ++RA+K      G   + V VIA +Q ++ CQAEYFRQLLKPVT
Sbjct: 5  AASQTKFRALKHENGTAGCATIIVRVIACFQTLRECQAEYFRQLLKPVT 53


>gi|356557577|ref|XP_003547092.1| PREDICTED: transcription factor bHLH143-like [Glycine max]
          Length = 406

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 25/49 (51%), Positives = 31/49 (63%), Gaps = 4/49 (8%)

Query: 4  AITLKRYRAVKEVGKIK----MSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          A +  R+RA+K    IK    + V VIA +Q +  CQAEYFR LLKPVT
Sbjct: 5  AASQTRFRALKHESGIKGEPAIIVRVIACFQPLHDCQAEYFRHLLKPVT 53


>gi|356518416|ref|XP_003527875.1| PREDICTED: transcription factor SAC51-like [Glycine max]
          Length = 370

 Score = 45.4 bits (106), Expect = 0.005,   Method: Composition-based stats.
 Identities = 24/49 (48%), Positives = 31/49 (63%), Gaps = 4/49 (8%)

Query: 4  AITLKRYRAVKE----VGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          A +  R+RA+K      G   + V VIA +Q ++ CQAEYFR LLKPVT
Sbjct: 5  AASQTRFRALKHENGIAGSSTIIVKVIACFQPLRECQAEYFRHLLKPVT 53


>gi|186530845|ref|NP_001119403.1| conserved peptide upstream open reading frame 37 [Arabidopsis
          thaliana]
 gi|332008501|gb|AED95884.1| conserved peptide upstream open reading frame 37 [Arabidopsis
          thaliana]
          Length = 51

 Score = 44.3 bits (103), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 24/42 (57%), Positives = 29/42 (69%), Gaps = 2/42 (4%)

Query: 9  RYRAVKEVGKI--KMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          R+R +K    I   + V VIA +Q +Q CQAEYFRQLLKPVT
Sbjct: 10 RFRTLKHEHGITGNIVVRVIACFQPLQDCQAEYFRQLLKPVT 51


>gi|356498829|ref|XP_003518251.1| PREDICTED: transcription factor bHLH145-like [Glycine max]
          Length = 429

 Score = 43.9 bits (102), Expect = 0.012,   Method: Composition-based stats.
 Identities = 24/49 (48%), Positives = 30/49 (61%), Gaps = 4/49 (8%)

Query: 4  AITLKRYRAVKE----VGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          A +  R+RA+K      G   + V VIA +Q +  CQAEYFR LLKPVT
Sbjct: 5  AASQTRFRALKHENGYAGSSTIIVRVIACFQPLTDCQAEYFRHLLKPVT 53


>gi|356521225|ref|XP_003529257.1| PREDICTED: uncharacterized protein LOC100790978 [Glycine max]
          Length = 53

 Score = 43.9 bits (102), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 24/49 (48%), Positives = 30/49 (61%), Gaps = 4/49 (8%)

Query: 4  AITLKRYRAVKE----VGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          A +  R+RA+K      G   + V VIA +Q +  CQAEYFR LLKPVT
Sbjct: 5  AASQTRFRALKHEYGIDGAATIIVRVIACFQPLHFCQAEYFRHLLKPVT 53


>gi|186532727|ref|NP_001119490.1| conserved peptide upstream open reading frame 40 [Arabidopsis
          thaliana]
 gi|332010490|gb|AED97873.1| conserved peptide upstream open reading frame 40 [Arabidopsis
          thaliana]
          Length = 53

 Score = 40.8 bits (94), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/44 (54%), Positives = 27/44 (61%), Gaps = 4/44 (9%)

Query: 9  RYRAVK-EVGKIKMS---VGVIAYYQLVQVCQAEYFRQLLKPVT 48
          R+R +K E G    S   V VI  YQ +  CQAEYFR LLKPVT
Sbjct: 10 RFRGLKYETGNANESTIVVRVIECYQPMDNCQAEYFRLLLKPVT 53


>gi|186521555|ref|NP_001119196.1| conserved peptide upstream open reading frame 43 [Arabidopsis
          thaliana]
 gi|332004014|gb|AED91397.1| conserved peptide upstream open reading frame 43 [Arabidopsis
          thaliana]
          Length = 54

 Score = 39.7 bits (91), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 25/40 (62%)

Query: 9  RYRAVKEVGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          +Y    +  +  + V VIA +Q +  CQAEYFR +LKPVT
Sbjct: 15 KYENNGDSSRPTIVVRVIACFQPMDNCQAEYFRHILKPVT 54


>gi|297811037|ref|XP_002873402.1| expressed protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319239|gb|EFH49661.1| expressed protein [Arabidopsis lyrata subsp. lyrata]
          Length = 54

 Score = 38.9 bits (89), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 25/40 (62%)

Query: 9  RYRAVKEVGKIKMSVGVIAYYQLVQVCQAEYFRQLLKPVT 48
          +Y    +  +  + V VIA +Q +  CQAEYFR +LKPVT
Sbjct: 15 KYENNGDSSRQTIVVRVIACFQPMDNCQAEYFRHILKPVT 54


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.329    0.140    0.393 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 590,970,928
Number of Sequences: 23463169
Number of extensions: 12284938
Number of successful extensions: 28824
Number of sequences better than 100.0: 12
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 28812
Number of HSP's gapped (non-prelim): 12
length of query: 48
length of database: 8,064,228,071
effective HSP length: 21
effective length of query: 27
effective length of database: 7,571,501,522
effective search space: 204430541094
effective search space used: 204430541094
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 69 (31.2 bits)