BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037826
         (105 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|356557219|ref|XP_003546915.1| PREDICTED: uncharacterized protein LOC100809039 [Glycine max]
          Length = 108

 Score =  105 bits (261), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 55/91 (60%), Positives = 69/91 (75%), Gaps = 1/91 (1%)

Query: 2   DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
           DKLYHVL C +LTF F A AS++ Y + R ++I +GS+LSLLAGAAKEAAD LG F S+G
Sbjct: 12  DKLYHVLMCFSLTFLFYALASLTPYPY-RRYAISIGSVLSLLAGAAKEAADHLGYFRSSG 70

Query: 62  ASFKDAIADVIGVLIASSALSLWRICSSSRH 92
           AS +DA+AD++GV IAS ALSL+R      H
Sbjct: 71  ASLRDALADILGVCIASFALSLFRSPPPPPH 101


>gi|15218367|ref|NP_173042.1| uncharacterized protein [Arabidopsis thaliana]
 gi|332191260|gb|AEE29381.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 113

 Score =  103 bits (256), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/105 (58%), Positives = 77/105 (73%), Gaps = 8/105 (7%)

Query: 1   KDKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSA 60
           +DKLYHV+FC +++  FS  AS SRYSF+R HSI +GS  SL AGAAKEAADQ+G+FPSA
Sbjct: 17  RDKLYHVIFCFSISLIFSTLASFSRYSFLRRHSIWIGSAFSLAAGAAKEAADQIGIFPSA 76

Query: 61  GASFKDAIADVIGVLIASSALSLWRICSSSRHGSDSGRTRRVLPV 105
           GAS +DA+AD IGV+IA+  L LW+        S   RTR +LP+
Sbjct: 77  GASVRDAVADAIGVVIAALVLLLWK--------SRRSRTRPILPI 113


>gi|449484799|ref|XP_004156983.1| PREDICTED: uncharacterized protein LOC101230248 [Cucumis sativus]
          Length = 122

 Score =  101 bits (252), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 52/84 (61%), Positives = 62/84 (73%)

Query: 2  DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
          DK YH L C +LT  F+++A  +RY FIR HSI +GS+LSL AGAAKE AD+LG F SAG
Sbjct: 16 DKFYHFLLCFSLTILFASFALHTRYPFIRRHSILIGSVLSLFAGAAKEVADELGFFKSAG 75

Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
          AS +DA+AD IGVLIAS  L   R
Sbjct: 76 ASTRDAVADFIGVLIASFLLHALR 99


>gi|225465512|ref|XP_002273014.1| PREDICTED: uncharacterized protein LOC100253325 [Vitis vinifera]
          Length = 111

 Score =  101 bits (251), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 52/82 (63%), Positives = 64/82 (78%)

Query: 2  DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
          DK+YH+LFC  LT  FSA A+ +RY F+R +S+ V SILSL  GAAKE AD++G F SAG
Sbjct: 11 DKVYHILFCSALTLIFSALANRTRYPFLRRYSVWVASILSLATGAAKEVADEIGFFKSAG 70

Query: 62 ASFKDAIADVIGVLIASSALSL 83
          AS KDA+AD++GVLIA  ALSL
Sbjct: 71 ASPKDALADLLGVLIACLALSL 92


>gi|297844508|ref|XP_002890135.1| hypothetical protein ARALYDRAFT_312580 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335977|gb|EFH66394.1| hypothetical protein ARALYDRAFT_312580 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 113

 Score = 99.8 bits (247), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 59/104 (56%), Positives = 76/104 (73%), Gaps = 8/104 (7%)

Query: 2   DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
           DK+YHV+FC +++  FS  AS+SRYSF+R HSI +GS  S+ AGAAKEAADQ G+FPSAG
Sbjct: 18  DKVYHVVFCFSISLLFSTLASLSRYSFLRRHSIWIGSAFSVAAGAAKEAADQFGIFPSAG 77

Query: 62  ASFKDAIADVIGVLIASSALSLWRICSSSRHGSDSGRTRRVLPV 105
           AS +DA+AD IGV+IA+  L LW+        S   RTR +LP+
Sbjct: 78  ASARDAVADAIGVVIAALVLFLWK--------SRRSRTRPILPI 113


>gi|255572753|ref|XP_002527309.1| conserved hypothetical protein [Ricinus communis]
 gi|223533309|gb|EEF35061.1| conserved hypothetical protein [Ricinus communis]
          Length = 115

 Score = 99.0 bits (245), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/107 (67%), Positives = 86/107 (80%), Gaps = 6/107 (5%)

Query: 1   KDKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSA 60
            DKLYH+LFCL LT FFS  AS++R+SF+R HSIR+GSILSL AGAAKEAAD  GLFPSA
Sbjct: 13  PDKLYHILFCLFLTLFFSKLASLTRHSFLRNHSIRIGSILSLAAGAAKEAADHFGLFPSA 72

Query: 61  GASFKDAIADVIGVLIASSALSLWRICSSSRHGSDS--GRTRRVLPV 105
           GAS KDA+AD++GV IA+ ALS   IC +  +GSD+  G+ RRVLPV
Sbjct: 73  GASVKDAVADLLGVFIAAFALS---ICKNP-NGSDTGPGQNRRVLPV 115


>gi|294461516|gb|ADE76319.1| unknown [Picea sitchensis]
          Length = 114

 Score = 61.2 bits (147), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 48/82 (58%), Gaps = 2/82 (2%)

Query: 2  DKLYHVLFC--LTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPS 59
          DK  H+  C  + +   ++   S +    +   +  VG ILSL  GA KE  DQ+GL+PS
Sbjct: 9  DKFEHLAMCFFIVIMCVYTIRLSFTPSKLLYRCATPVGCILSLTIGAVKELGDQMGLWPS 68

Query: 60 AGASFKDAIADVIGVLIASSAL 81
          AG S KD +ADV GVL+A++ L
Sbjct: 69 AGGSLKDGLADVGGVLLAATLL 90


>gi|357118496|ref|XP_003560990.1| PREDICTED: uncharacterized protein LOC100822918 [Brachypodium
          distachyon]
          Length = 116

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 51/78 (65%)

Query: 2  DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
          DKL HVL CL +    +  A  S   F+R H++ +G   SL  GAAKEA D+ GLF S+G
Sbjct: 11 DKLQHVLACLLIALAAAGLAGRSSRPFLRRHALALGCAASLAVGAAKEAVDEAGLFGSSG 70

Query: 62 ASFKDAIADVIGVLIASS 79
          AS +DA AD++GV +A+S
Sbjct: 71 ASLRDAAADLLGVSLAAS 88


>gi|297724881|ref|NP_001174804.1| Os06g0497600 [Oryza sativa Japonica Group]
 gi|222635635|gb|EEE65767.1| hypothetical protein OsJ_21442 [Oryza sativa Japonica Group]
 gi|255677071|dbj|BAH93532.1| Os06g0497600 [Oryza sativa Japonica Group]
          Length = 165

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 51/84 (60%)

Query: 2  DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
          DKL HVL C  +    +A A  S  S +R     +G   SL AGAAKE AD+ G F S+G
Sbjct: 11 DKLQHVLACFLIALAGAALAGRSSRSPLRRRPAALGCAASLAAGAAKEIADEAGFFGSSG 70

Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
          AS +DA AD++G  +A+ AL++ R
Sbjct: 71 ASLRDAAADLVGATLAAVALAILR 94


>gi|125555444|gb|EAZ01050.1| hypothetical protein OsI_23082 [Oryza sativa Indica Group]
          Length = 183

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 51/84 (60%)

Query: 2  DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
          DKL HVL C  +    +A A  S  S +R     +G   SL AGAAKE AD+ G F S+G
Sbjct: 11 DKLQHVLACFLIALAGAALAGRSSRSPLRRRPAALGCAASLAAGAAKEIADEAGFFGSSG 70

Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
          AS +DA AD++G  +A+ AL++ R
Sbjct: 71 ASLRDAAADLVGATLAAVALAILR 94


>gi|52076839|dbj|BAD45781.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 117

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 51/84 (60%)

Query: 2  DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
          DKL HVL C  +    +A A  S  S +R     +G   SL AGAAKE AD+ G F S+G
Sbjct: 11 DKLQHVLACFLIALAGAALAGRSSRSPLRRRPAALGCAASLAAGAAKEIADEAGFFGSSG 70

Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
          AS +DA AD++G  +A+ AL++ R
Sbjct: 71 ASLRDAAADLVGATLAAVALAILR 94


>gi|242094970|ref|XP_002437975.1| hypothetical protein SORBIDRAFT_10g005750 [Sorghum bicolor]
 gi|241916198|gb|EER89342.1| hypothetical protein SORBIDRAFT_10g005750 [Sorghum bicolor]
          Length = 119

 Score = 36.6 bits (83), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 36/69 (52%), Positives = 43/69 (62%)

Query: 2  DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
          DKL H + CL +    +A A  S    +R  ++ VGS  SL AGAAKE AD+ GLF SAG
Sbjct: 11 DKLQHAVACLLIALLAAALAGRSARPGLRRRAVAVGSAASLAAGAAKEVADEAGLFGSAG 70

Query: 62 ASFKDAIAD 70
          AS KDA AD
Sbjct: 71 ASPKDAAAD 79


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.326    0.135    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,501,150,015
Number of Sequences: 23463169
Number of extensions: 47833328
Number of successful extensions: 179538
Number of sequences better than 100.0: 141
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 128
Number of HSP's that attempted gapping in prelim test: 179522
Number of HSP's gapped (non-prelim): 141
length of query: 105
length of database: 8,064,228,071
effective HSP length: 73
effective length of query: 32
effective length of database: 6,351,416,734
effective search space: 203245335488
effective search space used: 203245335488
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 69 (31.2 bits)