BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 037826
(105 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356557219|ref|XP_003546915.1| PREDICTED: uncharacterized protein LOC100809039 [Glycine max]
Length = 108
Score = 105 bits (261), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/91 (60%), Positives = 69/91 (75%), Gaps = 1/91 (1%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DKLYHVL C +LTF F A AS++ Y + R ++I +GS+LSLLAGAAKEAAD LG F S+G
Sbjct: 12 DKLYHVLMCFSLTFLFYALASLTPYPY-RRYAISIGSVLSLLAGAAKEAADHLGYFRSSG 70
Query: 62 ASFKDAIADVIGVLIASSALSLWRICSSSRH 92
AS +DA+AD++GV IAS ALSL+R H
Sbjct: 71 ASLRDALADILGVCIASFALSLFRSPPPPPH 101
>gi|15218367|ref|NP_173042.1| uncharacterized protein [Arabidopsis thaliana]
gi|332191260|gb|AEE29381.1| uncharacterized protein [Arabidopsis thaliana]
Length = 113
Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/105 (58%), Positives = 77/105 (73%), Gaps = 8/105 (7%)
Query: 1 KDKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSA 60
+DKLYHV+FC +++ FS AS SRYSF+R HSI +GS SL AGAAKEAADQ+G+FPSA
Sbjct: 17 RDKLYHVIFCFSISLIFSTLASFSRYSFLRRHSIWIGSAFSLAAGAAKEAADQIGIFPSA 76
Query: 61 GASFKDAIADVIGVLIASSALSLWRICSSSRHGSDSGRTRRVLPV 105
GAS +DA+AD IGV+IA+ L LW+ S RTR +LP+
Sbjct: 77 GASVRDAVADAIGVVIAALVLLLWK--------SRRSRTRPILPI 113
>gi|449484799|ref|XP_004156983.1| PREDICTED: uncharacterized protein LOC101230248 [Cucumis sativus]
Length = 122
Score = 101 bits (252), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/84 (61%), Positives = 62/84 (73%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DK YH L C +LT F+++A +RY FIR HSI +GS+LSL AGAAKE AD+LG F SAG
Sbjct: 16 DKFYHFLLCFSLTILFASFALHTRYPFIRRHSILIGSVLSLFAGAAKEVADELGFFKSAG 75
Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
AS +DA+AD IGVLIAS L R
Sbjct: 76 ASTRDAVADFIGVLIASFLLHALR 99
>gi|225465512|ref|XP_002273014.1| PREDICTED: uncharacterized protein LOC100253325 [Vitis vinifera]
Length = 111
Score = 101 bits (251), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 52/82 (63%), Positives = 64/82 (78%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DK+YH+LFC LT FSA A+ +RY F+R +S+ V SILSL GAAKE AD++G F SAG
Sbjct: 11 DKVYHILFCSALTLIFSALANRTRYPFLRRYSVWVASILSLATGAAKEVADEIGFFKSAG 70
Query: 62 ASFKDAIADVIGVLIASSALSL 83
AS KDA+AD++GVLIA ALSL
Sbjct: 71 ASPKDALADLLGVLIACLALSL 92
>gi|297844508|ref|XP_002890135.1| hypothetical protein ARALYDRAFT_312580 [Arabidopsis lyrata subsp.
lyrata]
gi|297335977|gb|EFH66394.1| hypothetical protein ARALYDRAFT_312580 [Arabidopsis lyrata subsp.
lyrata]
Length = 113
Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/104 (56%), Positives = 76/104 (73%), Gaps = 8/104 (7%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DK+YHV+FC +++ FS AS+SRYSF+R HSI +GS S+ AGAAKEAADQ G+FPSAG
Sbjct: 18 DKVYHVVFCFSISLLFSTLASLSRYSFLRRHSIWIGSAFSVAAGAAKEAADQFGIFPSAG 77
Query: 62 ASFKDAIADVIGVLIASSALSLWRICSSSRHGSDSGRTRRVLPV 105
AS +DA+AD IGV+IA+ L LW+ S RTR +LP+
Sbjct: 78 ASARDAVADAIGVVIAALVLFLWK--------SRRSRTRPILPI 113
>gi|255572753|ref|XP_002527309.1| conserved hypothetical protein [Ricinus communis]
gi|223533309|gb|EEF35061.1| conserved hypothetical protein [Ricinus communis]
Length = 115
Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/107 (67%), Positives = 86/107 (80%), Gaps = 6/107 (5%)
Query: 1 KDKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSA 60
DKLYH+LFCL LT FFS AS++R+SF+R HSIR+GSILSL AGAAKEAAD GLFPSA
Sbjct: 13 PDKLYHILFCLFLTLFFSKLASLTRHSFLRNHSIRIGSILSLAAGAAKEAADHFGLFPSA 72
Query: 61 GASFKDAIADVIGVLIASSALSLWRICSSSRHGSDS--GRTRRVLPV 105
GAS KDA+AD++GV IA+ ALS IC + +GSD+ G+ RRVLPV
Sbjct: 73 GASVKDAVADLLGVFIAAFALS---ICKNP-NGSDTGPGQNRRVLPV 115
>gi|294461516|gb|ADE76319.1| unknown [Picea sitchensis]
Length = 114
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 48/82 (58%), Gaps = 2/82 (2%)
Query: 2 DKLYHVLFC--LTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPS 59
DK H+ C + + ++ S + + + VG ILSL GA KE DQ+GL+PS
Sbjct: 9 DKFEHLAMCFFIVIMCVYTIRLSFTPSKLLYRCATPVGCILSLTIGAVKELGDQMGLWPS 68
Query: 60 AGASFKDAIADVIGVLIASSAL 81
AG S KD +ADV GVL+A++ L
Sbjct: 69 AGGSLKDGLADVGGVLLAATLL 90
>gi|357118496|ref|XP_003560990.1| PREDICTED: uncharacterized protein LOC100822918 [Brachypodium
distachyon]
Length = 116
Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/78 (48%), Positives = 51/78 (65%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DKL HVL CL + + A S F+R H++ +G SL GAAKEA D+ GLF S+G
Sbjct: 11 DKLQHVLACLLIALAAAGLAGRSSRPFLRRHALALGCAASLAVGAAKEAVDEAGLFGSSG 70
Query: 62 ASFKDAIADVIGVLIASS 79
AS +DA AD++GV +A+S
Sbjct: 71 ASLRDAAADLLGVSLAAS 88
>gi|297724881|ref|NP_001174804.1| Os06g0497600 [Oryza sativa Japonica Group]
gi|222635635|gb|EEE65767.1| hypothetical protein OsJ_21442 [Oryza sativa Japonica Group]
gi|255677071|dbj|BAH93532.1| Os06g0497600 [Oryza sativa Japonica Group]
Length = 165
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 51/84 (60%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DKL HVL C + +A A S S +R +G SL AGAAKE AD+ G F S+G
Sbjct: 11 DKLQHVLACFLIALAGAALAGRSSRSPLRRRPAALGCAASLAAGAAKEIADEAGFFGSSG 70
Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
AS +DA AD++G +A+ AL++ R
Sbjct: 71 ASLRDAAADLVGATLAAVALAILR 94
>gi|125555444|gb|EAZ01050.1| hypothetical protein OsI_23082 [Oryza sativa Indica Group]
Length = 183
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 51/84 (60%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DKL HVL C + +A A S S +R +G SL AGAAKE AD+ G F S+G
Sbjct: 11 DKLQHVLACFLIALAGAALAGRSSRSPLRRRPAALGCAASLAAGAAKEIADEAGFFGSSG 70
Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
AS +DA AD++G +A+ AL++ R
Sbjct: 71 ASLRDAAADLVGATLAAVALAILR 94
>gi|52076839|dbj|BAD45781.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 117
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 51/84 (60%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DKL HVL C + +A A S S +R +G SL AGAAKE AD+ G F S+G
Sbjct: 11 DKLQHVLACFLIALAGAALAGRSSRSPLRRRPAALGCAASLAAGAAKEIADEAGFFGSSG 70
Query: 62 ASFKDAIADVIGVLIASSALSLWR 85
AS +DA AD++G +A+ AL++ R
Sbjct: 71 ASLRDAAADLVGATLAAVALAILR 94
>gi|242094970|ref|XP_002437975.1| hypothetical protein SORBIDRAFT_10g005750 [Sorghum bicolor]
gi|241916198|gb|EER89342.1| hypothetical protein SORBIDRAFT_10g005750 [Sorghum bicolor]
Length = 119
Score = 36.6 bits (83), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/69 (52%), Positives = 43/69 (62%)
Query: 2 DKLYHVLFCLTLTFFFSAWASVSRYSFIRTHSIRVGSILSLLAGAAKEAADQLGLFPSAG 61
DKL H + CL + +A A S +R ++ VGS SL AGAAKE AD+ GLF SAG
Sbjct: 11 DKLQHAVACLLIALLAAALAGRSARPGLRRRAVAVGSAASLAAGAAKEVADEAGLFGSAG 70
Query: 62 ASFKDAIAD 70
AS KDA AD
Sbjct: 71 ASPKDAAAD 79
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.326 0.135 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,501,150,015
Number of Sequences: 23463169
Number of extensions: 47833328
Number of successful extensions: 179538
Number of sequences better than 100.0: 141
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 128
Number of HSP's that attempted gapping in prelim test: 179522
Number of HSP's gapped (non-prelim): 141
length of query: 105
length of database: 8,064,228,071
effective HSP length: 73
effective length of query: 32
effective length of database: 6,351,416,734
effective search space: 203245335488
effective search space used: 203245335488
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 69 (31.2 bits)