BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 040697
(61 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|22331822|ref|NP_191190.2| alpha-galactosidase [Arabidopsis thaliana]
gi|18377735|gb|AAL67017.1| putative alpha-galactosidase [Arabidopsis thaliana]
gi|21281028|gb|AAM45068.1| putative alpha-galactosidase [Arabidopsis thaliana]
gi|332645987|gb|AEE79508.1| alpha-galactosidase [Arabidopsis thaliana]
Length = 437
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 22/30 (73%), Positives = 23/30 (76%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
FSKSF I+ TS YG LQL NGLA TPQMG
Sbjct: 47 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 76
>gi|7572929|emb|CAB87430.1| alpha-galactosidase-like protein [Arabidopsis thaliana]
Length = 434
Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats.
Identities = 22/30 (73%), Positives = 23/30 (76%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
FSKSF I+ TS YG LQL NGLA TPQMG
Sbjct: 44 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 73
>gi|42572703|ref|NP_974447.1| alpha-galactosidase [Arabidopsis thaliana]
gi|332645986|gb|AEE79507.1| alpha-galactosidase [Arabidopsis thaliana]
Length = 413
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 22/30 (73%), Positives = 23/30 (76%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
FSKSF I+ TS YG LQL NGLA TPQMG
Sbjct: 47 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 76
>gi|297816956|ref|XP_002876361.1| hypothetical protein ARALYDRAFT_486071 [Arabidopsis lyrata subsp.
lyrata]
gi|297322199|gb|EFH52620.1| hypothetical protein ARALYDRAFT_486071 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 47.0 bits (110), Expect = 0.002, Method: Composition-based stats.
Identities = 22/30 (73%), Positives = 23/30 (76%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
FSKSF I+ TS YG LQL NGLA TPQMG
Sbjct: 40 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 69
>gi|52699549|gb|AAU86897.1| glycosyl hydrolase family-like protein [Salvia miltiorrhiza]
Length = 431
Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 22/34 (64%)
Query: 16 YSRGFSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
Y G K I+ TSNYG LQL NGLA TPQMG
Sbjct: 38 YLNGQEKPISNIYDTSNYGILQLNNGLAKTPQMG 71
>gi|449457411|ref|XP_004146442.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
gi|449491655|ref|XP_004158965.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
Length = 430
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/30 (70%), Positives = 21/30 (70%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
F SF IF TS YG LQL NGLA TPQMG
Sbjct: 40 FKSSFHRIFDTSKYGILQLQNGLARTPQMG 69
>gi|353441550|gb|AEQ94270.1| alpha galacdosidase 3 [Cucumis sativus]
Length = 430
Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats.
Identities = 21/30 (70%), Positives = 21/30 (70%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
F SF IF TS YG LQL NGLA TPQMG
Sbjct: 40 FKSSFHRIFDTSKYGILQLQNGLARTPQMG 69
>gi|224144974|ref|XP_002325481.1| predicted protein [Populus trichocarpa]
gi|222862356|gb|EEE99862.1| predicted protein [Populus trichocarpa]
Length = 380
Score = 43.5 bits (101), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 19/27 (70%), Positives = 20/27 (74%)
Query: 23 SFGEIFYTSNYGTLQLTNGLASTPQMG 49
SF +F TSNYG QL NGLA TPQMG
Sbjct: 1 SFNNVFSTSNYGIFQLNNGLARTPQMG 27
>gi|356516253|ref|XP_003526810.1| PREDICTED: alpha-galactosidase-like isoform 1 [Glycine max]
Length = 418
Score = 40.0 bits (92), Expect = 0.17, Method: Composition-based stats.
Identities = 17/30 (56%), Positives = 20/30 (66%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
F +F I+ TS YG QL+NGL TPQMG
Sbjct: 41 FRSNFHSIYDTSKYGVFQLSNGLGKTPQMG 70
>gi|356516255|ref|XP_003526811.1| PREDICTED: alpha-galactosidase-like isoform 2 [Glycine max]
Length = 407
Score = 39.7 bits (91), Expect = 0.22, Method: Composition-based stats.
Identities = 17/30 (56%), Positives = 20/30 (66%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
F +F I+ TS YG QL+NGL TPQMG
Sbjct: 41 FRSNFHSIYDTSKYGVFQLSNGLGKTPQMG 70
>gi|255640756|gb|ACU20662.1| unknown [Glycine max]
Length = 374
Score = 39.7 bits (91), Expect = 0.22, Method: Composition-based stats.
Identities = 17/30 (56%), Positives = 20/30 (66%)
Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
F +F I+ TS YG QL+NGL TPQMG
Sbjct: 41 FRSNFHSIYDTSKYGVFQLSNGLGKTPQMG 70
>gi|218200260|gb|EEC82687.1| hypothetical protein OsI_27343 [Oryza sativa Indica Group]
Length = 427
Score = 39.7 bits (91), Expect = 0.26, Method: Composition-based stats.
Identities = 17/23 (73%), Positives = 19/23 (82%)
Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
++ TSNYG LQL NGLA TPQMG
Sbjct: 45 LYDTSNYGKLQLNNGLALTPQMG 67
>gi|115474081|ref|NP_001060639.1| Os07g0679300 [Oryza sativa Japonica Group]
gi|33146441|dbj|BAC79549.1| putative alpha-galactosidase [Oryza sativa Japonica Group]
gi|50509523|dbj|BAD31216.1| putative alpha-galactosidase [Oryza sativa Japonica Group]
gi|113612175|dbj|BAF22553.1| Os07g0679300 [Oryza sativa Japonica Group]
gi|215708737|dbj|BAG94006.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 425
Score = 39.3 bits (90), Expect = 0.28, Method: Composition-based stats.
Identities = 17/23 (73%), Positives = 19/23 (82%)
Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
++ TSNYG LQL NGLA TPQMG
Sbjct: 43 LYDTSNYGKLQLNNGLALTPQMG 65
>gi|147780179|emb|CAN75523.1| hypothetical protein VITISV_043596 [Vitis vinifera]
Length = 449
Score = 39.3 bits (90), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 20/29 (68%), Positives = 20/29 (68%)
Query: 27 IFYTSNYGTLQLTNGLASTPQMGGILDYS 55
IF S YG LQL NGLA TPQMG L YS
Sbjct: 36 IFDNSKYGILQLNNGLARTPQMGYGLSYS 64
>gi|212274867|ref|NP_001130661.1| hypothetical protein precursor [Zea mays]
gi|194689772|gb|ACF78970.1| unknown [Zea mays]
gi|194708268|gb|ACF88218.1| unknown [Zea mays]
gi|224031287|gb|ACN34719.1| unknown [Zea mays]
gi|414888111|tpg|DAA64125.1| TPA: hypothetical protein ZEAMMB73_968166 [Zea mays]
Length = 425
Score = 39.3 bits (90), Expect = 0.29, Method: Composition-based stats.
Identities = 17/29 (58%), Positives = 21/29 (72%)
Query: 21 SKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
+ + ++ TSNYG LQL NGLA TPQMG
Sbjct: 37 TAALSRLYDTSNYGRLQLNNGLALTPQMG 65
>gi|414888110|tpg|DAA64124.1| TPA: hypothetical protein ZEAMMB73_968166 [Zea mays]
Length = 231
Score = 39.3 bits (90), Expect = 0.32, Method: Composition-based stats.
Identities = 17/29 (58%), Positives = 21/29 (72%)
Query: 21 SKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
+ + ++ TSNYG LQL NGLA TPQMG
Sbjct: 37 TAALSRLYDTSNYGRLQLNNGLALTPQMG 65
>gi|414888112|tpg|DAA64126.1| TPA: hypothetical protein ZEAMMB73_968166, partial [Zea mays]
Length = 294
Score = 38.9 bits (89), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 17/29 (58%), Positives = 21/29 (72%)
Query: 21 SKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
+ + ++ TSNYG LQL NGLA TPQMG
Sbjct: 37 TAALSRLYDTSNYGRLQLNNGLALTPQMG 65
>gi|242046940|ref|XP_002461216.1| hypothetical protein SORBIDRAFT_02g042960 [Sorghum bicolor]
gi|241924593|gb|EER97737.1| hypothetical protein SORBIDRAFT_02g042960 [Sorghum bicolor]
Length = 431
Score = 38.5 bits (88), Expect = 0.47, Method: Composition-based stats.
Identities = 17/23 (73%), Positives = 19/23 (82%)
Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
++ TSNYG LQL NGLA TPQMG
Sbjct: 49 LYDTSNYGRLQLNNGLALTPQMG 71
>gi|225448651|ref|XP_002279730.1| PREDICTED: alpha-galactosidase [Vitis vinifera]
gi|297736508|emb|CBI25379.3| unnamed protein product [Vitis vinifera]
Length = 427
Score = 37.4 bits (85), Expect = 1.0, Method: Composition-based stats.
Identities = 17/23 (73%), Positives = 17/23 (73%)
Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
IF S YG LQL NGLA TPQMG
Sbjct: 44 IFDNSKYGILQLNNGLARTPQMG 66
>gi|326508514|dbj|BAJ95779.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 431
Score = 37.4 bits (85), Expect = 1.2, Method: Composition-based stats.
Identities = 16/23 (69%), Positives = 18/23 (78%)
Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
++ TSNYG LQL NGLA PQMG
Sbjct: 48 LYETSNYGMLQLNNGLALAPQMG 70
>gi|357121485|ref|XP_003562450.1| PREDICTED: alpha-galactosidase-like [Brachypodium distachyon]
Length = 428
Score = 36.6 bits (83), Expect = 1.9, Method: Composition-based stats.
Identities = 16/23 (69%), Positives = 18/23 (78%)
Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
++ TSNYG LQL NGLA PQMG
Sbjct: 46 LYDTSNYGRLQLNNGLALVPQMG 68
>gi|116787336|gb|ABK24468.1| unknown [Picea sitchensis]
Length = 377
Score = 35.4 bits (80), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 7 GSLSYEKASYSRGFSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
+++ + +S K++ ++ SNYG QL NG+ TPQMG
Sbjct: 20 ATVTGRQTPWSIQGPKAWSVLYNNSNYGKFQLNNGVGQTPQMG 62
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 979,167,395
Number of Sequences: 23463169
Number of extensions: 29192909
Number of successful extensions: 64105
Number of sequences better than 100.0: 23
Number of HSP's better than 100.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 64082
Number of HSP's gapped (non-prelim): 23
length of query: 61
length of database: 8,064,228,071
effective HSP length: 33
effective length of query: 28
effective length of database: 7,289,943,494
effective search space: 204118417832
effective search space used: 204118417832
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 69 (31.2 bits)