BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 044145
         (69 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224057098|ref|XP_002299126.1| predicted protein [Populus trichocarpa]
 gi|222846384|gb|EEE83931.1| predicted protein [Populus trichocarpa]
          Length = 176

 Score = 61.6 bits (148), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 27/43 (62%), Positives = 32/43 (74%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  VDEI+++ P SD I FQVG INK+LDA +FKTVHSC 
Sbjct: 103 FFLWLDVDEIKVDLPPSDSIYFQVGWINKKLDADQFKTVHSCR 145


>gi|297812169|ref|XP_002873968.1| hypothetical protein ARALYDRAFT_326382 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319805|gb|EFH50227.1| hypothetical protein ARALYDRAFT_326382 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 181

 Score = 60.8 bits (146), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 26/43 (60%), Positives = 32/43 (74%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  VDEI+++ P SD I F+VG INK+LD  +FKTVHSCH
Sbjct: 107 FFIWLDVDEIKVDLPPSDSIYFKVGFINKKLDIDQFKTVHSCH 149


>gi|18420052|ref|NP_568383.1| uncharacterized protein [Arabidopsis thaliana]
 gi|33589810|gb|AAQ22671.1| At5g19860 [Arabidopsis thaliana]
 gi|110740914|dbj|BAE98553.1| hypothetical protein [Arabidopsis thaliana]
 gi|332005375|gb|AED92758.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 181

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 32/43 (74%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  VDEI+++ P SD I F+VG INK+LD  +FKT+HSCH
Sbjct: 107 FFIWLDVDEIKVDLPPSDSIYFKVGFINKKLDIDQFKTIHSCH 149


>gi|194466185|gb|ACF74323.1| unknown [Arachis hypogaea]
          Length = 181

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 26/43 (60%), Positives = 30/43 (69%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  VDEI ++ P SD I FQVGLINK+LD  +FKTV SC 
Sbjct: 106 FFIWFNVDEIRVDLPPSDNIYFQVGLINKKLDVDQFKTVRSCR 148


>gi|330318680|gb|AEC11000.1| hypothetical protein [Camellia sinensis]
          Length = 173

 Score = 57.4 bits (137), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 30/43 (69%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
            L W  VDEI+++ P SD I F VG INK+LD  +FKTVHSC 
Sbjct: 106 LLLWFDVDEIKVDLPPSDSIYFLVGFINKKLDVDQFKTVHSCR 148


>gi|356512477|ref|XP_003524945.1| PREDICTED: uncharacterized protein LOC100780285 [Glycine max]
          Length = 175

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 28/49 (57%), Positives = 33/49 (67%), Gaps = 3/49 (6%)

Query: 14  WCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCHLIL---PFL 59
           W  VDEI ++ P SD I FQVG+INK+LD  +FKTV SC   L   PFL
Sbjct: 108 WFNVDEIRVDLPPSDSIYFQVGIINKKLDVDQFKTVRSCRKSLSCPPFL 156


>gi|224076008|ref|XP_002304870.1| predicted protein [Populus trichocarpa]
 gi|222842302|gb|EEE79849.1| predicted protein [Populus trichocarpa]
          Length = 177

 Score = 57.0 bits (136), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 29/43 (67%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  VD I ++ P SD I FQVG INK+LD  +FKTVHSC 
Sbjct: 103 FFLWLGVDNIRVDLPPSDSIYFQVGWINKKLDVDQFKTVHSCR 145


>gi|255580086|ref|XP_002530876.1| conserved hypothetical protein [Ricinus communis]
 gi|223529565|gb|EEF31516.1| conserved hypothetical protein [Ricinus communis]
          Length = 165

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 31/43 (72%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  VDEI+++ P SD I F+VG+INK+LD  +F TVHSC 
Sbjct: 106 FFLWFDVDEIKVDLPPSDSIYFKVGIINKKLDVDQFMTVHSCR 148


>gi|357518823|ref|XP_003629700.1| hypothetical protein MTR_8g085580 [Medicago truncatula]
 gi|355523722|gb|AET04176.1| hypothetical protein MTR_8g085580 [Medicago truncatula]
 gi|388521935|gb|AFK49029.1| unknown [Medicago truncatula]
          Length = 172

 Score = 54.7 bits (130), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 23/40 (57%), Positives = 30/40 (75%)

Query: 14  WCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           W  VDEI ++ P S+ I FQVG+INK+L+  +FKTVHSC 
Sbjct: 104 WFNVDEIRVDLPPSNSIYFQVGIINKKLNVDQFKTVHSCR 143


>gi|400131571|emb|CCH50971.1| T4.10 [Malus x robusta]
          Length = 220

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 23/42 (54%), Positives = 29/42 (69%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           FLFW  V EI ++ P SD I F VG+INK+LD G+F+ V  C
Sbjct: 107 FLFWLGVGEIRVDLPPSDNIYFTVGIINKKLDIGQFQNVRPC 148


>gi|449448818|ref|XP_004142162.1| PREDICTED: uncharacterized protein LOC101215998 [Cucumis sativus]
 gi|449530349|ref|XP_004172158.1| PREDICTED: uncharacterized protein LOC101227250 [Cucumis sativus]
          Length = 176

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 26/44 (59%), Positives = 31/44 (70%), Gaps = 1/44 (2%)

Query: 10  LFLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           LFL W  V EI ++ P SD I FQVG INK+LD  +FKT+HSC 
Sbjct: 104 LFL-WFDVKEIRVDLPPSDNIYFQVGFINKKLDIDQFKTIHSCQ 146


>gi|225441545|ref|XP_002281080.1| PREDICTED: uncharacterized protein LOC100260585 [Vitis vinifera]
 gi|297739791|emb|CBI29973.3| unnamed protein product [Vitis vinifera]
          Length = 165

 Score = 54.3 bits (129), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/43 (53%), Positives = 28/43 (65%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           FL W  VDEI ++ P SD I F VG+INK+LD  +FK   SC 
Sbjct: 99  FLLWLNVDEIRVDLPPSDYIYFHVGMINKKLDVKQFKKARSCR 141


>gi|356525241|ref|XP_003531235.1| PREDICTED: uncharacterized protein LOC100797337 [Glycine max]
          Length = 175

 Score = 53.9 bits (128), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 30/41 (73%)

Query: 13  FWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
            W  VDEI+++ P S+ I FQVG+INK+LD  +FKTV SC 
Sbjct: 107 IWFNVDEIKVDLPPSNSIYFQVGIINKKLDVHQFKTVRSCR 147


>gi|359478446|ref|XP_002283456.2| PREDICTED: uncharacterized protein LOC100253539 [Vitis vinifera]
          Length = 180

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 29/42 (69%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           F  W  VDEI+++ P S+ I F VG INK+LD  +F+T+ SC
Sbjct: 108 FFLWFDVDEIKVDLPPSNSIYFLVGFINKKLDVQQFQTIRSC 149


>gi|18419639|gb|AAL69386.1|AF462223_1 unknown [Narcissus pseudonarcissus]
          Length = 91

 Score = 43.9 bits (102), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 27/43 (62%)

Query: 11 FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
          FL W  V+ I+++ P SD + F+VG I K+L   +F+ V SC 
Sbjct: 28 FLMWFDVEAIKVDLPPSDFVYFEVGWITKKLRVEQFQIVRSCE 70


>gi|224284712|gb|ACN40087.1| unknown [Picea sitchensis]
          Length = 168

 Score = 43.5 bits (101), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 29/43 (67%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  V  +E++ P+SD I F+VG+I+K+++   F+TV +C 
Sbjct: 105 FFIWVDVTGMEMDLPASDYIYFKVGIISKKIEISWFETVPTCK 147


>gi|116782165|gb|ABK22392.1| unknown [Picea sitchensis]
          Length = 172

 Score = 43.5 bits (101), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 29/43 (67%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCH 53
           F  W  V  +E++ P+SD I F+VG+I+K+++   F+TV +C 
Sbjct: 109 FFIWVDVTGMEMDLPASDYIYFKVGIISKKIEISWFETVPTCK 151


>gi|297746437|emb|CBI16493.3| unnamed protein product [Vitis vinifera]
          Length = 130

 Score = 37.4 bits (85), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 23/33 (69%)

Query: 20 IELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
          + ++ P S+ I F VG INK+LD  +F+T+ SC
Sbjct: 67 VVIDLPPSNSIYFLVGFINKKLDVQQFQTIRSC 99


>gi|148908750|gb|ABR17482.1| unknown [Picea sitchensis]
          Length = 142

 Score = 36.2 bits (82), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 13/32 (40%), Positives = 23/32 (71%)

Query: 11  FLFWCKVDEIELNFPSSDMINFQVGLINKRLD 42
           F  W  V  +E++ P+SD I F+VG+I+K+++
Sbjct: 109 FFIWVDVTGMEMDLPASDYIYFKVGIISKKIE 140


>gi|62867582|emb|CAI84656.1| hypothetical protein [Nicotiana tabacum]
          Length = 220

 Score = 36.2 bits (82), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)

Query: 10  LFLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSCHLI 55
           LFL W  V +I ++ PSS +I F VG+++K+     F+T   C ++
Sbjct: 91  LFL-WFPVKDIRVDIPSSGLIYFNVGVVSKQFSLSSFETPRDCTVV 135


>gi|413925494|gb|AFW65426.1| hypothetical protein ZEAMMB73_279812 [Zea mays]
          Length = 280

 Score = 35.8 bits (81), Expect = 3.2,   Method: Composition-based stats.
 Identities = 17/54 (31%), Positives = 27/54 (50%), Gaps = 5/54 (9%)

Query: 4   HSSSHILF-----LFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           H+S H+++       W  +  I++N PSS +I F V ++ K L    F    SC
Sbjct: 51  HNSEHVIYDIPSTFMWFPIHNIKVNIPSSGVIYFNVSVVKKHLSLTVFDVPLSC 104


>gi|21592465|gb|AAM64416.1| unknown [Arabidopsis thaliana]
          Length = 177

 Score = 34.7 bits (78), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 6   SSHILFLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           S+  LFL W +V  I ++ PSS +I F VG++ K+     F+T   C
Sbjct: 98  SAQELFL-WLQVKGIRVDVPSSGLIFFDVGVLRKQYSLSLFETPRDC 143


>gi|22330911|ref|NP_683539.1| uncharacterized protein [Arabidopsis thaliana]
 gi|22136054|gb|AAM91609.1| unknown protein [Arabidopsis thaliana]
 gi|23197758|gb|AAN15406.1| unknown protein [Arabidopsis thaliana]
 gi|332641024|gb|AEE74545.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 177

 Score = 34.3 bits (77), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 6   SSHILFLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           S+  LFL W +V  I ++ PSS +I F VG++ K+     F+T   C
Sbjct: 98  SAQELFL-WLQVKGIRVDVPSSGLIFFDVGVLRKQYSLSLFETPRDC 143


>gi|6041844|gb|AAF02153.1|AC009853_13 unknown protein [Arabidopsis thaliana]
          Length = 158

 Score = 34.3 bits (77), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 1/47 (2%)

Query: 6   SSHILFLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           S+  LFL W +V  I ++ PSS +I F VG++ K+     F+T   C
Sbjct: 98  SAQELFL-WLQVKGIRVDVPSSGLIFFDVGVLRKQYSLSLFETPRDC 143


>gi|18397959|ref|NP_566308.1| uncharacterized protein [Arabidopsis thaliana]
 gi|15081622|gb|AAK82466.1| AT3g07470/F21O3_18 [Arabidopsis thaliana]
 gi|27363316|gb|AAO11577.1| At3g07470/F21O3_18 [Arabidopsis thaliana]
 gi|332641026|gb|AEE74547.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 169

 Score = 34.3 bits (77), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 6   SSHILFLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           S+  LFL W  V  I ++ PSS +I F VG++ K+     F+T   C
Sbjct: 99  SAQELFL-WFPVKGIRVDVPSSGLIYFDVGVVRKQYSLSLFETPRDC 144


>gi|6041854|gb|AAF02163.1|AC009853_23 unknown protein [Arabidopsis thaliana]
          Length = 160

 Score = 34.3 bits (77), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 26/47 (55%), Gaps = 1/47 (2%)

Query: 6   SSHILFLFWCKVDEIELNFPSSDMINFQVGLINKRLDAGRFKTVHSC 52
           S+  LFL W  V  I ++ PSS +I F VG++ K+     F+T   C
Sbjct: 99  SAQELFL-WFPVKGIRVDVPSSGLIYFDVGVVRKQYSLSLFETPRDC 144


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.331    0.144    0.477 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,086,011,383
Number of Sequences: 23463169
Number of extensions: 34354459
Number of successful extensions: 77391
Number of sequences better than 100.0: 29
Number of HSP's better than 100.0 without gapping: 20
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 77371
Number of HSP's gapped (non-prelim): 29
length of query: 69
length of database: 8,064,228,071
effective HSP length: 41
effective length of query: 28
effective length of database: 7,102,238,142
effective search space: 198862667976
effective search space used: 198862667976
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 69 (31.2 bits)