BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 040697
         (61 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|22331822|ref|NP_191190.2| alpha-galactosidase [Arabidopsis thaliana]
 gi|18377735|gb|AAL67017.1| putative alpha-galactosidase [Arabidopsis thaliana]
 gi|21281028|gb|AAM45068.1| putative alpha-galactosidase [Arabidopsis thaliana]
 gi|332645987|gb|AEE79508.1| alpha-galactosidase [Arabidopsis thaliana]
          Length = 437

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/30 (73%), Positives = 23/30 (76%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          FSKSF  I+ TS YG LQL NGLA TPQMG
Sbjct: 47 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 76


>gi|7572929|emb|CAB87430.1| alpha-galactosidase-like protein [Arabidopsis thaliana]
          Length = 434

 Score = 47.0 bits (110), Expect = 0.001,   Method: Composition-based stats.
 Identities = 22/30 (73%), Positives = 23/30 (76%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          FSKSF  I+ TS YG LQL NGLA TPQMG
Sbjct: 44 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 73


>gi|42572703|ref|NP_974447.1| alpha-galactosidase [Arabidopsis thaliana]
 gi|332645986|gb|AEE79507.1| alpha-galactosidase [Arabidopsis thaliana]
          Length = 413

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/30 (73%), Positives = 23/30 (76%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          FSKSF  I+ TS YG LQL NGLA TPQMG
Sbjct: 47 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 76


>gi|297816956|ref|XP_002876361.1| hypothetical protein ARALYDRAFT_486071 [Arabidopsis lyrata subsp.
          lyrata]
 gi|297322199|gb|EFH52620.1| hypothetical protein ARALYDRAFT_486071 [Arabidopsis lyrata subsp.
          lyrata]
          Length = 430

 Score = 47.0 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/30 (73%), Positives = 23/30 (76%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          FSKSF  I+ TS YG LQL NGLA TPQMG
Sbjct: 40 FSKSFNSIYDTSMYGRLQLNNGLARTPQMG 69


>gi|52699549|gb|AAU86897.1| glycosyl hydrolase family-like protein [Salvia miltiorrhiza]
          Length = 431

 Score = 44.7 bits (104), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 22/34 (64%)

Query: 16 YSRGFSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          Y  G  K    I+ TSNYG LQL NGLA TPQMG
Sbjct: 38 YLNGQEKPISNIYDTSNYGILQLNNGLAKTPQMG 71


>gi|449457411|ref|XP_004146442.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
 gi|449491655|ref|XP_004158965.1| PREDICTED: alpha-galactosidase-like [Cucumis sativus]
          Length = 430

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/30 (70%), Positives = 21/30 (70%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          F  SF  IF TS YG LQL NGLA TPQMG
Sbjct: 40 FKSSFHRIFDTSKYGILQLQNGLARTPQMG 69


>gi|353441550|gb|AEQ94270.1| alpha galacdosidase 3 [Cucumis sativus]
          Length = 430

 Score = 44.7 bits (104), Expect = 0.007,   Method: Composition-based stats.
 Identities = 21/30 (70%), Positives = 21/30 (70%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          F  SF  IF TS YG LQL NGLA TPQMG
Sbjct: 40 FKSSFHRIFDTSKYGILQLQNGLARTPQMG 69


>gi|224144974|ref|XP_002325481.1| predicted protein [Populus trichocarpa]
 gi|222862356|gb|EEE99862.1| predicted protein [Populus trichocarpa]
          Length = 380

 Score = 43.5 bits (101), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 20/27 (74%)

Query: 23 SFGEIFYTSNYGTLQLTNGLASTPQMG 49
          SF  +F TSNYG  QL NGLA TPQMG
Sbjct: 1  SFNNVFSTSNYGIFQLNNGLARTPQMG 27


>gi|356516253|ref|XP_003526810.1| PREDICTED: alpha-galactosidase-like isoform 1 [Glycine max]
          Length = 418

 Score = 40.0 bits (92), Expect = 0.17,   Method: Composition-based stats.
 Identities = 17/30 (56%), Positives = 20/30 (66%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          F  +F  I+ TS YG  QL+NGL  TPQMG
Sbjct: 41 FRSNFHSIYDTSKYGVFQLSNGLGKTPQMG 70


>gi|356516255|ref|XP_003526811.1| PREDICTED: alpha-galactosidase-like isoform 2 [Glycine max]
          Length = 407

 Score = 39.7 bits (91), Expect = 0.22,   Method: Composition-based stats.
 Identities = 17/30 (56%), Positives = 20/30 (66%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          F  +F  I+ TS YG  QL+NGL  TPQMG
Sbjct: 41 FRSNFHSIYDTSKYGVFQLSNGLGKTPQMG 70


>gi|255640756|gb|ACU20662.1| unknown [Glycine max]
          Length = 374

 Score = 39.7 bits (91), Expect = 0.22,   Method: Composition-based stats.
 Identities = 17/30 (56%), Positives = 20/30 (66%)

Query: 20 FSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          F  +F  I+ TS YG  QL+NGL  TPQMG
Sbjct: 41 FRSNFHSIYDTSKYGVFQLSNGLGKTPQMG 70


>gi|218200260|gb|EEC82687.1| hypothetical protein OsI_27343 [Oryza sativa Indica Group]
          Length = 427

 Score = 39.7 bits (91), Expect = 0.26,   Method: Composition-based stats.
 Identities = 17/23 (73%), Positives = 19/23 (82%)

Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
          ++ TSNYG LQL NGLA TPQMG
Sbjct: 45 LYDTSNYGKLQLNNGLALTPQMG 67


>gi|115474081|ref|NP_001060639.1| Os07g0679300 [Oryza sativa Japonica Group]
 gi|33146441|dbj|BAC79549.1| putative alpha-galactosidase [Oryza sativa Japonica Group]
 gi|50509523|dbj|BAD31216.1| putative alpha-galactosidase [Oryza sativa Japonica Group]
 gi|113612175|dbj|BAF22553.1| Os07g0679300 [Oryza sativa Japonica Group]
 gi|215708737|dbj|BAG94006.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 425

 Score = 39.3 bits (90), Expect = 0.28,   Method: Composition-based stats.
 Identities = 17/23 (73%), Positives = 19/23 (82%)

Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
          ++ TSNYG LQL NGLA TPQMG
Sbjct: 43 LYDTSNYGKLQLNNGLALTPQMG 65


>gi|147780179|emb|CAN75523.1| hypothetical protein VITISV_043596 [Vitis vinifera]
          Length = 449

 Score = 39.3 bits (90), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 20/29 (68%), Positives = 20/29 (68%)

Query: 27 IFYTSNYGTLQLTNGLASTPQMGGILDYS 55
          IF  S YG LQL NGLA TPQMG  L YS
Sbjct: 36 IFDNSKYGILQLNNGLARTPQMGYGLSYS 64


>gi|212274867|ref|NP_001130661.1| hypothetical protein precursor [Zea mays]
 gi|194689772|gb|ACF78970.1| unknown [Zea mays]
 gi|194708268|gb|ACF88218.1| unknown [Zea mays]
 gi|224031287|gb|ACN34719.1| unknown [Zea mays]
 gi|414888111|tpg|DAA64125.1| TPA: hypothetical protein ZEAMMB73_968166 [Zea mays]
          Length = 425

 Score = 39.3 bits (90), Expect = 0.29,   Method: Composition-based stats.
 Identities = 17/29 (58%), Positives = 21/29 (72%)

Query: 21 SKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          + +   ++ TSNYG LQL NGLA TPQMG
Sbjct: 37 TAALSRLYDTSNYGRLQLNNGLALTPQMG 65


>gi|414888110|tpg|DAA64124.1| TPA: hypothetical protein ZEAMMB73_968166 [Zea mays]
          Length = 231

 Score = 39.3 bits (90), Expect = 0.32,   Method: Composition-based stats.
 Identities = 17/29 (58%), Positives = 21/29 (72%)

Query: 21 SKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          + +   ++ TSNYG LQL NGLA TPQMG
Sbjct: 37 TAALSRLYDTSNYGRLQLNNGLALTPQMG 65


>gi|414888112|tpg|DAA64126.1| TPA: hypothetical protein ZEAMMB73_968166, partial [Zea mays]
          Length = 294

 Score = 38.9 bits (89), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 21/29 (72%)

Query: 21 SKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
          + +   ++ TSNYG LQL NGLA TPQMG
Sbjct: 37 TAALSRLYDTSNYGRLQLNNGLALTPQMG 65


>gi|242046940|ref|XP_002461216.1| hypothetical protein SORBIDRAFT_02g042960 [Sorghum bicolor]
 gi|241924593|gb|EER97737.1| hypothetical protein SORBIDRAFT_02g042960 [Sorghum bicolor]
          Length = 431

 Score = 38.5 bits (88), Expect = 0.47,   Method: Composition-based stats.
 Identities = 17/23 (73%), Positives = 19/23 (82%)

Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
          ++ TSNYG LQL NGLA TPQMG
Sbjct: 49 LYDTSNYGRLQLNNGLALTPQMG 71


>gi|225448651|ref|XP_002279730.1| PREDICTED: alpha-galactosidase [Vitis vinifera]
 gi|297736508|emb|CBI25379.3| unnamed protein product [Vitis vinifera]
          Length = 427

 Score = 37.4 bits (85), Expect = 1.0,   Method: Composition-based stats.
 Identities = 17/23 (73%), Positives = 17/23 (73%)

Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
          IF  S YG LQL NGLA TPQMG
Sbjct: 44 IFDNSKYGILQLNNGLARTPQMG 66


>gi|326508514|dbj|BAJ95779.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 431

 Score = 37.4 bits (85), Expect = 1.2,   Method: Composition-based stats.
 Identities = 16/23 (69%), Positives = 18/23 (78%)

Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
          ++ TSNYG LQL NGLA  PQMG
Sbjct: 48 LYETSNYGMLQLNNGLALAPQMG 70


>gi|357121485|ref|XP_003562450.1| PREDICTED: alpha-galactosidase-like [Brachypodium distachyon]
          Length = 428

 Score = 36.6 bits (83), Expect = 1.9,   Method: Composition-based stats.
 Identities = 16/23 (69%), Positives = 18/23 (78%)

Query: 27 IFYTSNYGTLQLTNGLASTPQMG 49
          ++ TSNYG LQL NGLA  PQMG
Sbjct: 46 LYDTSNYGRLQLNNGLALVPQMG 68


>gi|116787336|gb|ABK24468.1| unknown [Picea sitchensis]
          Length = 377

 Score = 35.4 bits (80), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 15/43 (34%), Positives = 25/43 (58%)

Query: 7  GSLSYEKASYSRGFSKSFGEIFYTSNYGTLQLTNGLASTPQMG 49
           +++  +  +S    K++  ++  SNYG  QL NG+  TPQMG
Sbjct: 20 ATVTGRQTPWSIQGPKAWSVLYNNSNYGKFQLNNGVGQTPQMG 62


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.413 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 979,167,395
Number of Sequences: 23463169
Number of extensions: 29192909
Number of successful extensions: 64105
Number of sequences better than 100.0: 23
Number of HSP's better than 100.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 64082
Number of HSP's gapped (non-prelim): 23
length of query: 61
length of database: 8,064,228,071
effective HSP length: 33
effective length of query: 28
effective length of database: 7,289,943,494
effective search space: 204118417832
effective search space used: 204118417832
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 69 (31.2 bits)