BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 034916
         (79 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255541566|ref|XP_002511847.1| conserved hypothetical protein [Ricinus communis]
 gi|223549027|gb|EEF50516.1| conserved hypothetical protein [Ricinus communis]
          Length = 61

 Score =  117 bits (294), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 55/61 (90%), Positives = 60/61 (98%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNP+TDKIVRRTTMVATAVASYFLLTADYGP+PNA+DP+KKAILSAE+SVKEFIFGSKK 
Sbjct: 1  MNPKTDKIVRRTTMVATAVASYFLLTADYGPQPNAVDPVKKAILSAENSVKEFIFGSKKG 60

Query: 61 P 61
          P
Sbjct: 61 P 61


>gi|224127652|ref|XP_002320127.1| predicted protein [Populus trichocarpa]
 gi|222860900|gb|EEE98442.1| predicted protein [Populus trichocarpa]
          Length = 61

 Score =  114 bits (285), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 53/60 (88%), Positives = 59/60 (98%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNP+TDKIVRRTTMVATAVASYFLLTADYGP+PNA++PIK+AILSA+SSVKEFIFGS KE
Sbjct: 1  MNPRTDKIVRRTTMVATAVASYFLLTADYGPQPNAVEPIKRAILSAQSSVKEFIFGSGKE 60


>gi|334186244|ref|NP_191962.2| uncharacterized protein [Arabidopsis thaliana]
 gi|49617787|gb|AAT67589.1| hypothetical protein At4G00530 [Arabidopsis thaliana]
 gi|60547841|gb|AAX23884.1| hypothetical protein At4g00530 [Arabidopsis thaliana]
 gi|332656495|gb|AEE81895.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 71

 Score =  114 bits (284), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 55/71 (77%), Positives = 61/71 (85%), Gaps = 2/71 (2%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNPQTDK+VRRTTMVATAVASYFLLT DYGPEPN LDPIK+ IL+A+ SVK+FIFG K E
Sbjct: 1  MNPQTDKVVRRTTMVATAVASYFLLTGDYGPEPNVLDPIKRKILAAQDSVKQFIFGPKGE 60

Query: 61 P--REESQTAK 69
          P  +E S TAK
Sbjct: 61 PSGKEPSDTAK 71


>gi|297810127|ref|XP_002872947.1| hypothetical protein ARALYDRAFT_327720 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297318784|gb|EFH49206.1| hypothetical protein ARALYDRAFT_327720 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 71

 Score =  113 bits (282), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 55/76 (72%), Positives = 61/76 (80%), Gaps = 5/76 (6%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNPQTDK+VRRTTMVATAVASYFLLT DYGPEPN LDPIK+ ILSA+ SVK+FIFG K  
Sbjct: 1  MNPQTDKVVRRTTMVATAVASYFLLTGDYGPEPNVLDPIKRKILSAQDSVKQFIFGPKGG 60

Query: 61 PREESQTAKLSSDNSK 76
          P  + Q     SDN+K
Sbjct: 61 PSGKEQ-----SDNAK 71


>gi|225454192|ref|XP_002272715.1| PREDICTED: uncharacterized protein LOC100267043 [Vitis vinifera]
          Length = 80

 Score =  107 bits (266), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 51/60 (85%), Positives = 55/60 (91%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNP+TDK+VRRTTMVATA ASYFLLTADYG EPNALDPIK AI SAE SVKEFIFGS+K+
Sbjct: 17 MNPRTDKLVRRTTMVATATASYFLLTADYGAEPNALDPIKNAIHSAERSVKEFIFGSEKD 76


>gi|297745269|emb|CBI40349.3| unnamed protein product [Vitis vinifera]
          Length = 64

 Score =  107 bits (266), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 51/60 (85%), Positives = 55/60 (91%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNP+TDK+VRRTTMVATA ASYFLLTADYG EPNALDPIK AI SAE SVKEFIFGS+K+
Sbjct: 1  MNPRTDKLVRRTTMVATATASYFLLTADYGAEPNALDPIKNAIHSAERSVKEFIFGSEKD 60


>gi|388501504|gb|AFK38818.1| unknown [Medicago truncatula]
          Length = 79

 Score =  104 bits (260), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 52/79 (65%), Positives = 62/79 (78%), Gaps = 3/79 (3%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNP+TDK V+R T+VAT  ASYF LTADYGP+PNALDPIKK +LSA+S+VKEF  GSK E
Sbjct: 4  MNPKTDKFVKRLTIVATITASYFFLTADYGPKPNALDPIKKQLLSAQSTVKEFFLGSKSE 63

Query: 61 PREESQTAKLSSDNSKEHP 79
            EE    K  SD++K+HP
Sbjct: 64 -SEEKHIGK--SDSNKDHP 79


>gi|356568382|ref|XP_003552390.1| PREDICTED: uncharacterized protein LOC100804482 [Glycine max]
          Length = 74

 Score =  104 bits (259), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 52/79 (65%), Positives = 64/79 (81%), Gaps = 5/79 (6%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNPQ D++VRR TM+AT  ASYFLLTADY  EP  LDPIKK +LSAES+VKE++FGSKK+
Sbjct: 1  MNPQADRVVRRVTMIATITASYFLLTADY--EPTVLDPIKKGLLSAESTVKEYVFGSKKQ 58

Query: 61 PREESQTAKLSSDNSKEHP 79
            +E+Q  KL  D++KEHP
Sbjct: 59 S-QENQMEKL--DSNKEHP 74


>gi|356504525|ref|XP_003521046.1| PREDICTED: uncharacterized protein LOC100306571 [Glycine max]
 gi|255628915|gb|ACU14802.1| unknown [Glycine max]
          Length = 79

 Score =  103 bits (256), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/79 (64%), Positives = 65/79 (82%), Gaps = 3/79 (3%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          M+P+T+K+VRR T V T  ASYFLLTADYGP+PNALDPIKK ILSA+S+VKE+I GSKKE
Sbjct: 4  MSPRTEKLVRRLTFVTTVAASYFLLTADYGPKPNALDPIKKQILSAQSTVKEYIVGSKKE 63

Query: 61 PREESQTAKLSSDNSKEHP 79
            +ES   +L  D++K++P
Sbjct: 64 -SQESHVGRL--DSNKDNP 79


>gi|356522628|ref|XP_003529948.1| PREDICTED: uncharacterized protein LOC100787462 [Glycine max]
          Length = 72

 Score = 98.6 bits (244), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 47/73 (64%), Positives = 58/73 (79%), Gaps = 1/73 (1%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
          MNP+T  ++RR T V TA ASYFLLTADYG +PNALDPIKK  LSA+S+VKE+I+GSKKE
Sbjct: 1  MNPRTSSVLRRLTFVTTAAASYFLLTADYGSQPNALDPIKKQKLSAQSTVKEYIWGSKKE 60

Query: 61 PREESQTAKLSSD 73
            +E+   KL S+
Sbjct: 61 S-QENHVGKLDSN 72


>gi|18378874|ref|NP_563633.1| uncharacterized protein [Arabidopsis thaliana]
 gi|8671841|gb|AAF78404.1|AC009273_10 Contains similarity to an unknown protein F6N23.20 gi|7486554
          from Arabidopsis thaliana BAC F6N23 gb|AF058919. EST
          gb|AI995757 comes from this gene [Arabidopsis thaliana]
 gi|12083288|gb|AAG48803.1|AF332440_1 unknown protein [Arabidopsis thaliana]
 gi|21593917|gb|AAM65882.1| unknown [Arabidopsis thaliana]
 gi|332189206|gb|AEE27327.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 62

 Score = 82.0 bits (201), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/59 (76%), Positives = 53/59 (89%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKK 59
          MNPQ +K+V+ T++VATAV SYFLLTADYGPEPNALDPI++ ILSA+ SVKEFIF SKK
Sbjct: 1  MNPQIEKVVKVTSVVATAVVSYFLLTADYGPEPNALDPIRQRILSAQDSVKEFIFPSKK 59


>gi|297842946|ref|XP_002889354.1| hypothetical protein ARALYDRAFT_470102 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297335196|gb|EFH65613.1| hypothetical protein ARALYDRAFT_470102 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 59

 Score = 81.6 bits (200), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/59 (76%), Positives = 52/59 (88%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKK 59
          MNPQ +K+VR T++VATAV SYFLLTADYGPEPNALDPI++ ILSA+ SVKE IF SKK
Sbjct: 1  MNPQIEKVVRVTSVVATAVVSYFLLTADYGPEPNALDPIRQRILSAQDSVKELIFPSKK 59


>gi|3047110|gb|AAC13621.1| F6N23.20 gene product [Arabidopsis thaliana]
 gi|7267392|emb|CAB80862.1| hypothetical protein [Arabidopsis thaliana]
          Length = 94

 Score = 70.1 bits (170), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 48/94 (51%), Gaps = 38/94 (40%)

Query: 14 MVATAVASYFLLTADYGPEPNALDP----------------------------------- 38
          MVATAVASYFLLT DYGPEPN LDP                                   
Sbjct: 1  MVATAVASYFLLTGDYGPEPNVLDPYLSSSMILLLVEVLVIRMFSYWWVMLHLFMLVIEH 60

Query: 39 -IKKAILSAESSVKEFIFGSKKEP--REESQTAK 69
           IK+ IL+A+ SVK+FIFG K EP  +E S TAK
Sbjct: 61 WIKRKILAAQDSVKQFIFGPKGEPSGKEPSDTAK 94


>gi|194466191|gb|ACF74326.1| unknown [Arachis hypogaea]
          Length = 36

 Score = 54.7 bits (130), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 22/33 (66%), Positives = 30/33 (90%)

Query: 1  MNPQTDKIVRRTTMVATAVASYFLLTADYGPEP 33
          ++P+T+K+V+R T+VAT  ASYFLLTADYGP+P
Sbjct: 4  LSPKTEKLVKRVTIVATVTASYFLLTADYGPQP 36


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.308    0.123    0.330 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,050,326,434
Number of Sequences: 23463169
Number of extensions: 29947893
Number of successful extensions: 69014
Number of sequences better than 100.0: 19
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 68995
Number of HSP's gapped (non-prelim): 20
length of query: 79
length of database: 8,064,228,071
effective HSP length: 50
effective length of query: 29
effective length of database: 6,891,069,621
effective search space: 199841019009
effective search space used: 199841019009
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 69 (31.2 bits)