BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 034721
(86 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|356573772|ref|XP_003555030.1| PREDICTED: probable galacturonosyltransferase 9-like [Glycine
max]
Length = 529
Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats.
Identities = 40/66 (60%), Positives = 52/66 (78%), Gaps = 4/66 (6%)
Query: 1 MAVAIRGGRGGTAVGPGLRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSPPP 60
MAVA RG RGG++ RSLFS+RIF+SAMFSLLF+ATLSV+LT++PSTS+ DP P
Sbjct: 1 MAVATRGARGGSS----FRSLFSFRIFISAMFSLLFVATLSVLLTTNPSTSNDDPDLPTT 56
Query: 61 LHSFFH 66
+++ H
Sbjct: 57 GNAYVH 62
>gi|449438749|ref|XP_004137150.1| PREDICTED: probable galacturonosyltransferase 9-like [Cucumis
sativus]
gi|449476425|ref|XP_004154733.1| PREDICTED: probable galacturonosyltransferase 9-like [Cucumis
sativus]
Length = 550
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 37/59 (62%), Positives = 45/59 (76%), Gaps = 1/59 (1%)
Query: 1 MAVAIRGGRGGTAV-GPGLRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSP 58
MAVA+RG RG + G R+ FSYRIF+SA+FSLLF+ATLSVI T++PST HHD P
Sbjct: 1 MAVAVRGSRGASNFSGFNPRNFFSYRIFISALFSLLFIATLSVIFTTNPSTPHHDSALP 59
>gi|356545523|ref|XP_003541190.1| PREDICTED: probable galacturonosyltransferase 9-like [Glycine
max]
Length = 529
Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats.
Identities = 37/66 (56%), Positives = 50/66 (75%), Gaps = 4/66 (6%)
Query: 1 MAVAIRGGRGGTAVGPGLRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSPPP 60
MAVA RG RGG++ RSLFS+RIF+SA+FSLLF+ATLSV+ T++PSTS+ D P
Sbjct: 1 MAVATRGARGGSS----FRSLFSFRIFISAIFSLLFVATLSVLFTTNPSTSNDDSDLPTT 56
Query: 61 LHSFFH 66
+++ H
Sbjct: 57 GNAYVH 62
>gi|358343636|ref|XP_003635905.1| Glycosyltransferase [Medicago truncatula]
gi|355501840|gb|AES83043.1| Glycosyltransferase [Medicago truncatula]
Length = 555
Score = 73.2 bits (178), Expect = 2e-11, Method: Composition-based stats.
Identities = 36/66 (54%), Positives = 47/66 (71%), Gaps = 4/66 (6%)
Query: 1 MAVAIRGGRGGTAVGPGLRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSPPP 60
MAV+ RG RGG+ LR FSYRIF+SAMFSLLF+ATLSV+ T++PST + D P
Sbjct: 1 MAVSGRGSRGGST----LRGFFSYRIFISAMFSLLFIATLSVLFTTNPSTENDDSDLPTT 56
Query: 61 LHSFFH 66
+++ H
Sbjct: 57 GNAYVH 62
>gi|255579934|ref|XP_002530802.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
gi|223529623|gb|EEF31570.1| Glycosyltransferase QUASIMODO1, putative [Ricinus communis]
Length = 563
Score = 64.3 bits (155), Expect = 9e-09, Method: Composition-based stats.
Identities = 30/41 (73%), Positives = 34/41 (82%)
Query: 18 LRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSP 58
LRS FSYRI VSAMF+LLFLATLSV+LT+HP TS H+ P
Sbjct: 22 LRSFFSYRILVSAMFTLLFLATLSVLLTTHPPTSPHESSLP 62
>gi|90657560|gb|ABD96860.1| hypothetical protein [Cleome spinosa]
Length = 556
Score = 61.2 bits (147), Expect = 8e-08, Method: Composition-based stats.
Identities = 31/49 (63%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 18 LRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSPPPLHSFFH 66
LR LFSYRIFVSAMFSLLFLAT SV+L S S H DP P ++ H
Sbjct: 17 LRGLFSYRIFVSAMFSLLFLATFSVVLNS--SRQHQDPTLPNTGSAYMH 63
>gi|297828684|ref|XP_002882224.1| hypothetical protein ARALYDRAFT_477468 [Arabidopsis lyrata subsp.
lyrata]
gi|297328064|gb|EFH58483.1| hypothetical protein ARALYDRAFT_477468 [Arabidopsis lyrata subsp.
lyrata]
Length = 561
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/42 (61%), Positives = 29/42 (69%)
Query: 17 GLRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSP 58
GLRS FSYRIF+SA+FS LFLAT SV+L S H D P
Sbjct: 19 GLRSFFSYRIFISALFSFLFLATFSVVLNSSRHQPHQDHTLP 60
>gi|18396159|ref|NP_566170.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
gi|26394254|sp|Q9FWA4.1|GAUT9_ARATH RecName: Full=Probable galacturonosyltransferase 9
gi|10092184|gb|AAG12603.1|AC068900_9 unknown protein; 9779-11709 [Arabidopsis thaliana]
gi|19310441|gb|AAL84957.1| AT3g02350/F11A12_103 [Arabidopsis thaliana]
gi|21536764|gb|AAM61096.1| glycosyl transferase, putative [Arabidopsis thaliana]
gi|28416491|gb|AAO42776.1| At3g02350/F11A12_103 [Arabidopsis thaliana]
gi|332640274|gb|AEE73795.1| alpha-1,4-galacturonosyltransferase [Arabidopsis thaliana]
Length = 561
Score = 56.2 bits (134), Expect = 2e-06, Method: Composition-based stats.
Identities = 26/42 (61%), Positives = 29/42 (69%)
Query: 17 GLRSLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSP 58
GLRS FSYRIF+SA+FS LFLAT SV+L S H D P
Sbjct: 19 GLRSFFSYRIFISALFSFLFLATFSVVLNSSRHQPHQDHTLP 60
>gi|298204422|emb|CBI16902.3| unnamed protein product [Vitis vinifera]
Length = 543
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 20/39 (51%), Positives = 31/39 (79%)
Query: 20 SLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSP 58
++ SYRIF SA+F++LF+AT+SV+L ++P+ HD V P
Sbjct: 19 NIVSYRIFASALFTILFIATISVLLNTNPAPPSHDSVIP 57
>gi|147794385|emb|CAN73730.1| hypothetical protein VITISV_022574 [Vitis vinifera]
Length = 543
Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats.
Identities = 20/39 (51%), Positives = 31/39 (79%)
Query: 20 SLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSP 58
++ SYRIF SA+F++LF+AT+SV+L ++P+ HD V P
Sbjct: 19 NIVSYRIFASALFTILFIATISVLLNTNPAPPSHDSVIP 57
>gi|359488313|ref|XP_002282423.2| PREDICTED: LOW QUALITY PROTEIN: probable galacturonosyltransferase
9, partial [Vitis vinifera]
Length = 595
Score = 50.1 bits (118), Expect = 2e-04, Method: Composition-based stats.
Identities = 20/39 (51%), Positives = 31/39 (79%)
Query: 20 SLFSYRIFVSAMFSLLFLATLSVILTSHPSTSHHDPVSP 58
++ SYRIF SA+F++LF+AT+SV+L ++P+ HD V P
Sbjct: 71 NIVSYRIFASALFTILFIATISVLLNTNPAPPSHDSVIP 109
>gi|224112273|ref|XP_002332802.1| glycosyltransferase [Populus trichocarpa]
gi|222834237|gb|EEE72714.1| glycosyltransferase [Populus trichocarpa]
Length = 564
Score = 46.2 bits (108), Expect = 0.003, Method: Composition-based stats.
Identities = 19/29 (65%), Positives = 25/29 (86%)
Query: 18 LRSLFSYRIFVSAMFSLLFLATLSVILTS 46
LR FSYRIF+SA+F+LLFLAT S++ +S
Sbjct: 20 LRPFFSYRIFISAIFTLLFLATFSILFSS 48
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.327 0.141 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,392,489,418
Number of Sequences: 23463169
Number of extensions: 45577596
Number of successful extensions: 215465
Number of sequences better than 100.0: 19
Number of HSP's better than 100.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 215437
Number of HSP's gapped (non-prelim): 19
length of query: 86
length of database: 8,064,228,071
effective HSP length: 56
effective length of query: 30
effective length of database: 6,750,290,607
effective search space: 202508718210
effective search space used: 202508718210
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 69 (31.2 bits)