BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 034694
(87 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255543645|ref|XP_002512885.1| conserved hypothetical protein [Ricinus communis]
gi|223547896|gb|EEF49388.1| conserved hypothetical protein [Ricinus communis]
Length = 81
Score = 131 bits (329), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 65/80 (81%), Positives = 69/80 (86%), Gaps = 1/80 (1%)
Query: 3 MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
M GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+AC+EPPPS
Sbjct: 1 MLGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACAEPPPS 60
Query: 63 KTSEFS-ADGGGQGDSGGSQ 81
KTS S A G + D +Q
Sbjct: 61 KTSALSDATGVAEKDGLSTQ 80
>gi|225425170|ref|XP_002264458.1| PREDICTED: uncharacterized protein LOC100264285 [Vitis vinifera]
Length = 75
Score = 128 bits (321), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 61/73 (83%), Positives = 64/73 (87%)
Query: 3 MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
M GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ CSEPPPS
Sbjct: 1 MAGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKVCSEPPPS 60
Query: 63 KTSEFSADGGGQG 75
KTS + G+
Sbjct: 61 KTSAPGTNVTGEA 73
>gi|449445578|ref|XP_004140549.1| PREDICTED: uncharacterized protein LOC101220585 [Cucumis sativus]
gi|449519856|ref|XP_004166950.1| PREDICTED: uncharacterized protein LOC101231766 [Cucumis sativus]
Length = 74
Score = 122 bits (306), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 57/63 (90%), Positives = 60/63 (95%)
Query: 3 MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
M GAFWGTRV+EIVKKHDSGGLVWKRIKLTTTRKANAKKRL RVWQNEAVL+ACSEPPP
Sbjct: 1 MAGAFWGTRVLEIVKKHDSGGLVWKRIKLTTTRKANAKKRLHRVWQNEAVLKACSEPPPP 60
Query: 63 KTS 65
K+S
Sbjct: 61 KSS 63
>gi|224060305|ref|XP_002300133.1| predicted protein [Populus trichocarpa]
gi|222847391|gb|EEE84938.1| predicted protein [Populus trichocarpa]
Length = 82
Score = 120 bits (302), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 58/68 (85%), Positives = 60/68 (88%)
Query: 4 GGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSK 63
GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ACS+ PSK
Sbjct: 4 AGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACSQSQPSK 63
Query: 64 TSEFSADG 71
S A G
Sbjct: 64 MSAAGASG 71
>gi|296088720|emb|CBI38170.3| unnamed protein product [Vitis vinifera]
Length = 65
Score = 108 bits (271), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 52/63 (82%), Positives = 55/63 (87%)
Query: 13 MEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSEFSADGG 72
MEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ CSEPPPSKTS +
Sbjct: 1 MEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKVCSEPPPSKTSAPGTNVT 60
Query: 73 GQG 75
G+
Sbjct: 61 GEA 63
>gi|356569111|ref|XP_003552749.1| PREDICTED: uncharacterized protein LOC100819437 [Glycine max]
Length = 70
Score = 108 bits (269), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 51/54 (94%), Positives = 52/54 (96%)
Query: 7 FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPP 60
FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+ACS P
Sbjct: 3 FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACSATP 56
>gi|356527453|ref|XP_003532325.1| PREDICTED: uncharacterized protein LOC100804262 [Glycine max]
Length = 76
Score = 107 bits (267), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 51/56 (91%), Positives = 52/56 (92%)
Query: 6 AFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPP 61
AFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+AC P
Sbjct: 2 AFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKACVAQVP 57
>gi|388499974|gb|AFK38053.1| unknown [Lotus japonicus]
Length = 78
Score = 105 bits (261), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 59/68 (86%), Positives = 62/68 (91%)
Query: 3 MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
M GAFWGTRVMEIVKK DSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVL+A S+ PPS
Sbjct: 1 MSGAFWGTRVMEIVKKRDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLKATSQSPPS 60
Query: 63 KTSEFSAD 70
+S FSAD
Sbjct: 61 MSSSFSAD 68
>gi|18396334|ref|NP_564283.1| uncharacterized protein [Arabidopsis thaliana]
gi|14423400|gb|AAK62382.1|AF386937_1 Unknown protein [Arabidopsis thaliana]
gi|24899683|gb|AAN65056.1| Unknown protein [Arabidopsis thaliana]
gi|26452099|dbj|BAC43139.1| unknown protein [Arabidopsis thaliana]
gi|332192708|gb|AEE30829.1| uncharacterized protein [Arabidopsis thaliana]
Length = 81
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 51/71 (71%), Positives = 58/71 (81%), Gaps = 5/71 (7%)
Query: 5 GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE-----P 59
GAFWGTRVME+VKKHDSGGL+WKRIKLT TRKANAK RL RVWQNEAVLRAC E P
Sbjct: 6 GAFWGTRVMELVKKHDSGGLLWKRIKLTPTRKANAKTRLRRVWQNEAVLRACGESDAPNP 65
Query: 60 PPSKTSEFSAD 70
P + ++ S++
Sbjct: 66 PGASNTKSSSN 76
>gi|297851172|ref|XP_002893467.1| hypothetical protein ARALYDRAFT_890267 [Arabidopsis lyrata subsp.
lyrata]
gi|297339309|gb|EFH69726.1| hypothetical protein ARALYDRAFT_890267 [Arabidopsis lyrata subsp.
lyrata]
Length = 81
Score = 102 bits (254), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/54 (88%), Positives = 50/54 (92%)
Query: 5 GAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE 58
GAFWGTRVME+VKKHDSGGL+WKRIKLT TRKANAK RL RVWQNEAVLRAC E
Sbjct: 6 GAFWGTRVMELVKKHDSGGLLWKRIKLTPTRKANAKTRLRRVWQNEAVLRACGE 59
>gi|414877799|tpg|DAA54930.1| TPA: hypothetical protein ZEAMMB73_276313 [Zea mays]
Length = 79
Score = 94.7 bits (234), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/72 (61%), Positives = 54/72 (75%)
Query: 3 MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
+GGAFW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E
Sbjct: 7 LGGAFWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGESESR 66
Query: 63 KTSEFSADGGGQ 74
S ++ Q
Sbjct: 67 SASNVASTTEKQ 78
>gi|326524972|dbj|BAK04422.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 81
Score = 94.4 bits (233), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 43/68 (63%), Positives = 53/68 (77%)
Query: 7 FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSE 66
FW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC++ S TS
Sbjct: 13 FWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACTQAESSSTSN 72
Query: 67 FSADGGGQ 74
+A Q
Sbjct: 73 TAATSEKQ 80
>gi|388521793|gb|AFK48958.1| unknown [Medicago truncatula]
Length = 72
Score = 93.6 bits (231), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 47/64 (73%), Positives = 51/64 (79%), Gaps = 1/64 (1%)
Query: 3 MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPS 62
MGGAFWGTRVME+VKKHDS GL+WKRIKLTTTRKANAKKRL RVWQ + R P PS
Sbjct: 1 MGGAFWGTRVMEVVKKHDSSGLLWKRIKLTTTRKANAKKRLRRVWQMKLSSR-LFRPAPS 59
Query: 63 KTSE 66
+S
Sbjct: 60 ASSS 63
>gi|242083390|ref|XP_002442120.1| hypothetical protein SORBIDRAFT_08g014050 [Sorghum bicolor]
gi|241942813|gb|EES15958.1| hypothetical protein SORBIDRAFT_08g014050 [Sorghum bicolor]
Length = 79
Score = 88.6 bits (218), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/68 (60%), Positives = 50/68 (73%)
Query: 7 FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSE 66
FW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E S
Sbjct: 11 FWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGESESVSASN 70
Query: 67 FSADGGGQ 74
++ Q
Sbjct: 71 TASTAEKQ 78
>gi|116784377|gb|ABK23320.1| unknown [Picea sitchensis]
Length = 80
Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/78 (56%), Positives = 53/78 (67%)
Query: 7 FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSEPPPSKTSE 66
F TR ++IVK++DS GL+WK+IKLT TRKANAKKRL RVWQNEAVL+AC+ P P +
Sbjct: 3 FVPTRFLQIVKRNDSPGLLWKKIKLTPTRKANAKKRLRRVWQNEAVLKACAGPLPEGVTP 62
Query: 67 FSADGGGQGDSGGSQAKG 84
GD S AK
Sbjct: 63 PPESKKKSGDVKDSVAKS 80
>gi|224035127|gb|ACN36639.1| unknown [Zea mays]
gi|413916390|gb|AFW56322.1| hypothetical protein ZEAMMB73_173232 [Zea mays]
Length = 75
Score = 84.7 bits (208), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 38/50 (76%), Positives = 45/50 (90%)
Query: 7 FWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRAC 56
FW TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC
Sbjct: 11 FWATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRAC 60
>gi|357501347|ref|XP_003620962.1| hypothetical protein MTR_7g005370 [Medicago truncatula]
gi|355495977|gb|AES77180.1| hypothetical protein MTR_7g005370 [Medicago truncatula]
Length = 254
Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 39/46 (84%), Positives = 42/46 (91%)
Query: 3 MGGAFWGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQ 48
MGGAFWGTRVM++VKKHDS L+WKRIKLTTT KANAKKRL RVWQ
Sbjct: 168 MGGAFWGTRVMQVVKKHDSPRLLWKRIKLTTTCKANAKKRLHRVWQ 213
>gi|115488542|ref|NP_001066758.1| Os12g0477600 [Oryza sativa Japonica Group]
gi|113649265|dbj|BAF29777.1| Os12g0477600 [Oryza sativa Japonica Group]
gi|222617065|gb|EEE53197.1| hypothetical protein OsJ_36069 [Oryza sativa Japonica Group]
Length = 83
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/51 (74%), Positives = 45/51 (88%)
Query: 8 WGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE 58
W TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E
Sbjct: 14 WATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGE 64
>gi|218186839|gb|EEC69266.1| hypothetical protein OsI_38307 [Oryza sativa Indica Group]
Length = 83
Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 38/51 (74%), Positives = 45/51 (88%)
Query: 8 WGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQNEAVLRACSE 58
W TR +E+VK++DS GL+WKRIKLTTTRK NAKKRL R+WQNEAV+RAC E
Sbjct: 14 WATRALEVVKRNDSPGLLWKRIKLTTTRKNNAKKRLKRLWQNEAVIRACGE 64
>gi|357150639|ref|XP_003575527.1| PREDICTED: uncharacterized protein LOC100837101 [Brachypodium
distachyon]
Length = 68
Score = 64.3 bits (155), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 36/41 (87%)
Query: 8 WGTRVMEIVKKHDSGGLVWKRIKLTTTRKANAKKRLLRVWQ 48
W TR +E+VK++D+ GL+WKRIKLTTTRK NAKKRL R+WQ
Sbjct: 14 WATRALEVVKRNDAPGLLWKRIKLTTTRKNNAKKRLKRLWQ 54
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.312 0.129 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,527,968,358
Number of Sequences: 23463169
Number of extensions: 49534394
Number of successful extensions: 152956
Number of sequences better than 100.0: 25
Number of HSP's better than 100.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 152930
Number of HSP's gapped (non-prelim): 28
length of query: 87
length of database: 8,064,228,071
effective HSP length: 57
effective length of query: 30
effective length of database: 6,726,827,438
effective search space: 201804823140
effective search space used: 201804823140
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 69 (31.2 bits)