BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 034916
(79 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255541566|ref|XP_002511847.1| conserved hypothetical protein [Ricinus communis]
gi|223549027|gb|EEF50516.1| conserved hypothetical protein [Ricinus communis]
Length = 61
Score = 117 bits (294), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 55/61 (90%), Positives = 60/61 (98%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNP+TDKIVRRTTMVATAVASYFLLTADYGP+PNA+DP+KKAILSAE+SVKEFIFGSKK
Sbjct: 1 MNPKTDKIVRRTTMVATAVASYFLLTADYGPQPNAVDPVKKAILSAENSVKEFIFGSKKG 60
Query: 61 P 61
P
Sbjct: 61 P 61
>gi|224127652|ref|XP_002320127.1| predicted protein [Populus trichocarpa]
gi|222860900|gb|EEE98442.1| predicted protein [Populus trichocarpa]
Length = 61
Score = 114 bits (285), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 53/60 (88%), Positives = 59/60 (98%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNP+TDKIVRRTTMVATAVASYFLLTADYGP+PNA++PIK+AILSA+SSVKEFIFGS KE
Sbjct: 1 MNPRTDKIVRRTTMVATAVASYFLLTADYGPQPNAVEPIKRAILSAQSSVKEFIFGSGKE 60
>gi|334186244|ref|NP_191962.2| uncharacterized protein [Arabidopsis thaliana]
gi|49617787|gb|AAT67589.1| hypothetical protein At4G00530 [Arabidopsis thaliana]
gi|60547841|gb|AAX23884.1| hypothetical protein At4g00530 [Arabidopsis thaliana]
gi|332656495|gb|AEE81895.1| uncharacterized protein [Arabidopsis thaliana]
Length = 71
Score = 114 bits (284), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 55/71 (77%), Positives = 61/71 (85%), Gaps = 2/71 (2%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNPQTDK+VRRTTMVATAVASYFLLT DYGPEPN LDPIK+ IL+A+ SVK+FIFG K E
Sbjct: 1 MNPQTDKVVRRTTMVATAVASYFLLTGDYGPEPNVLDPIKRKILAAQDSVKQFIFGPKGE 60
Query: 61 P--REESQTAK 69
P +E S TAK
Sbjct: 61 PSGKEPSDTAK 71
>gi|297810127|ref|XP_002872947.1| hypothetical protein ARALYDRAFT_327720 [Arabidopsis lyrata subsp.
lyrata]
gi|297318784|gb|EFH49206.1| hypothetical protein ARALYDRAFT_327720 [Arabidopsis lyrata subsp.
lyrata]
Length = 71
Score = 113 bits (282), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 55/76 (72%), Positives = 61/76 (80%), Gaps = 5/76 (6%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNPQTDK+VRRTTMVATAVASYFLLT DYGPEPN LDPIK+ ILSA+ SVK+FIFG K
Sbjct: 1 MNPQTDKVVRRTTMVATAVASYFLLTGDYGPEPNVLDPIKRKILSAQDSVKQFIFGPKGG 60
Query: 61 PREESQTAKLSSDNSK 76
P + Q SDN+K
Sbjct: 61 PSGKEQ-----SDNAK 71
>gi|225454192|ref|XP_002272715.1| PREDICTED: uncharacterized protein LOC100267043 [Vitis vinifera]
Length = 80
Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/60 (85%), Positives = 55/60 (91%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNP+TDK+VRRTTMVATA ASYFLLTADYG EPNALDPIK AI SAE SVKEFIFGS+K+
Sbjct: 17 MNPRTDKLVRRTTMVATATASYFLLTADYGAEPNALDPIKNAIHSAERSVKEFIFGSEKD 76
>gi|297745269|emb|CBI40349.3| unnamed protein product [Vitis vinifera]
Length = 64
Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/60 (85%), Positives = 55/60 (91%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNP+TDK+VRRTTMVATA ASYFLLTADYG EPNALDPIK AI SAE SVKEFIFGS+K+
Sbjct: 1 MNPRTDKLVRRTTMVATATASYFLLTADYGAEPNALDPIKNAIHSAERSVKEFIFGSEKD 60
>gi|388501504|gb|AFK38818.1| unknown [Medicago truncatula]
Length = 79
Score = 104 bits (260), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 52/79 (65%), Positives = 62/79 (78%), Gaps = 3/79 (3%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNP+TDK V+R T+VAT ASYF LTADYGP+PNALDPIKK +LSA+S+VKEF GSK E
Sbjct: 4 MNPKTDKFVKRLTIVATITASYFFLTADYGPKPNALDPIKKQLLSAQSTVKEFFLGSKSE 63
Query: 61 PREESQTAKLSSDNSKEHP 79
EE K SD++K+HP
Sbjct: 64 -SEEKHIGK--SDSNKDHP 79
>gi|356568382|ref|XP_003552390.1| PREDICTED: uncharacterized protein LOC100804482 [Glycine max]
Length = 74
Score = 104 bits (259), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 52/79 (65%), Positives = 64/79 (81%), Gaps = 5/79 (6%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNPQ D++VRR TM+AT ASYFLLTADY EP LDPIKK +LSAES+VKE++FGSKK+
Sbjct: 1 MNPQADRVVRRVTMIATITASYFLLTADY--EPTVLDPIKKGLLSAESTVKEYVFGSKKQ 58
Query: 61 PREESQTAKLSSDNSKEHP 79
+E+Q KL D++KEHP
Sbjct: 59 S-QENQMEKL--DSNKEHP 74
>gi|356504525|ref|XP_003521046.1| PREDICTED: uncharacterized protein LOC100306571 [Glycine max]
gi|255628915|gb|ACU14802.1| unknown [Glycine max]
Length = 79
Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/79 (64%), Positives = 65/79 (82%), Gaps = 3/79 (3%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
M+P+T+K+VRR T V T ASYFLLTADYGP+PNALDPIKK ILSA+S+VKE+I GSKKE
Sbjct: 4 MSPRTEKLVRRLTFVTTVAASYFLLTADYGPKPNALDPIKKQILSAQSTVKEYIVGSKKE 63
Query: 61 PREESQTAKLSSDNSKEHP 79
+ES +L D++K++P
Sbjct: 64 -SQESHVGRL--DSNKDNP 79
>gi|356522628|ref|XP_003529948.1| PREDICTED: uncharacterized protein LOC100787462 [Glycine max]
Length = 72
Score = 98.6 bits (244), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/73 (64%), Positives = 58/73 (79%), Gaps = 1/73 (1%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKKE 60
MNP+T ++RR T V TA ASYFLLTADYG +PNALDPIKK LSA+S+VKE+I+GSKKE
Sbjct: 1 MNPRTSSVLRRLTFVTTAAASYFLLTADYGSQPNALDPIKKQKLSAQSTVKEYIWGSKKE 60
Query: 61 PREESQTAKLSSD 73
+E+ KL S+
Sbjct: 61 S-QENHVGKLDSN 72
>gi|18378874|ref|NP_563633.1| uncharacterized protein [Arabidopsis thaliana]
gi|8671841|gb|AAF78404.1|AC009273_10 Contains similarity to an unknown protein F6N23.20 gi|7486554
from Arabidopsis thaliana BAC F6N23 gb|AF058919. EST
gb|AI995757 comes from this gene [Arabidopsis thaliana]
gi|12083288|gb|AAG48803.1|AF332440_1 unknown protein [Arabidopsis thaliana]
gi|21593917|gb|AAM65882.1| unknown [Arabidopsis thaliana]
gi|332189206|gb|AEE27327.1| uncharacterized protein [Arabidopsis thaliana]
Length = 62
Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 53/59 (89%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKK 59
MNPQ +K+V+ T++VATAV SYFLLTADYGPEPNALDPI++ ILSA+ SVKEFIF SKK
Sbjct: 1 MNPQIEKVVKVTSVVATAVVSYFLLTADYGPEPNALDPIRQRILSAQDSVKEFIFPSKK 59
>gi|297842946|ref|XP_002889354.1| hypothetical protein ARALYDRAFT_470102 [Arabidopsis lyrata subsp.
lyrata]
gi|297335196|gb|EFH65613.1| hypothetical protein ARALYDRAFT_470102 [Arabidopsis lyrata subsp.
lyrata]
Length = 59
Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/59 (76%), Positives = 52/59 (88%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEPNALDPIKKAILSAESSVKEFIFGSKK 59
MNPQ +K+VR T++VATAV SYFLLTADYGPEPNALDPI++ ILSA+ SVKE IF SKK
Sbjct: 1 MNPQIEKVVRVTSVVATAVVSYFLLTADYGPEPNALDPIRQRILSAQDSVKELIFPSKK 59
>gi|3047110|gb|AAC13621.1| F6N23.20 gene product [Arabidopsis thaliana]
gi|7267392|emb|CAB80862.1| hypothetical protein [Arabidopsis thaliana]
Length = 94
Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 48/94 (51%), Gaps = 38/94 (40%)
Query: 14 MVATAVASYFLLTADYGPEPNALDP----------------------------------- 38
MVATAVASYFLLT DYGPEPN LDP
Sbjct: 1 MVATAVASYFLLTGDYGPEPNVLDPYLSSSMILLLVEVLVIRMFSYWWVMLHLFMLVIEH 60
Query: 39 -IKKAILSAESSVKEFIFGSKKEP--REESQTAK 69
IK+ IL+A+ SVK+FIFG K EP +E S TAK
Sbjct: 61 WIKRKILAAQDSVKQFIFGPKGEPSGKEPSDTAK 94
>gi|194466191|gb|ACF74326.1| unknown [Arachis hypogaea]
Length = 36
Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 22/33 (66%), Positives = 30/33 (90%)
Query: 1 MNPQTDKIVRRTTMVATAVASYFLLTADYGPEP 33
++P+T+K+V+R T+VAT ASYFLLTADYGP+P
Sbjct: 4 LSPKTEKLVKRVTIVATVTASYFLLTADYGPQP 36
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.308 0.123 0.330
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,050,326,434
Number of Sequences: 23463169
Number of extensions: 29947893
Number of successful extensions: 69014
Number of sequences better than 100.0: 19
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 68995
Number of HSP's gapped (non-prelim): 20
length of query: 79
length of database: 8,064,228,071
effective HSP length: 50
effective length of query: 29
effective length of database: 6,891,069,621
effective search space: 199841019009
effective search space used: 199841019009
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 69 (31.2 bits)