BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 046761
(109 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449443231|ref|XP_004139383.1| PREDICTED: uncharacterized protein LOC101219713 [Cucumis sativus]
Length = 545
Score = 97.8 bits (242), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/121 (50%), Positives = 79/121 (65%), Gaps = 17/121 (14%)
Query: 1 MMTTFISFASSSSLILQPLKPPKSSRLSKAKTLTST----IKATSSQPKPNN-------- 48
+ T FIS SSLILQPLK + + S +K ++T + A+ SQPKPN+
Sbjct: 4 LFTPFIS----SSLILQPLKFSQLTIFS-SKPYSNTNFIPVVASHSQPKPNSKHNSRKPT 58
Query: 49 ANAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLV 108
+ KPTS+ RR+SSYG+SRRS LKKTF QEQVTFT+ +S DP + IIGGGMA + +L
Sbjct: 59 MDGSKPTSKFRRKSSYGTSRRSILKKTFNQEQVTFTSALSDDPLIAIIGGGMAGIMCALS 118
Query: 109 L 109
L
Sbjct: 119 L 119
>gi|449483109|ref|XP_004156495.1| PREDICTED: uncharacterized protein LOC101226129 [Cucumis sativus]
Length = 536
Score = 97.8 bits (242), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/121 (50%), Positives = 79/121 (65%), Gaps = 17/121 (14%)
Query: 1 MMTTFISFASSSSLILQPLKPPKSSRLSKAKTLTST----IKATSSQPKPNN-------- 48
+ T FIS SSLILQPLK + + S +K ++T + A+ SQPKPN+
Sbjct: 4 LFTPFIS----SSLILQPLKFSQLTIFS-SKPYSNTNFIPVVASHSQPKPNSKHNSRKPT 58
Query: 49 ANAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLV 108
+ KPTS+ RR+SSYG+SRRS LKKTF QEQVTFT+ +S DP + IIGGGMA + +L
Sbjct: 59 MDGSKPTSKFRRKSSYGTSRRSILKKTFNQEQVTFTSALSDDPLIAIIGGGMAGIMCALS 118
Query: 109 L 109
L
Sbjct: 119 L 119
>gi|224130614|ref|XP_002328333.1| predicted protein [Populus trichocarpa]
gi|222838048|gb|EEE76413.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 95.5 bits (236), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 63/114 (55%), Positives = 72/114 (63%), Gaps = 18/114 (15%)
Query: 6 ISFASSSSLILQPLKPP------KSSRLSKAKTLTSTIKATSSQPKPNN----ANAKKPT 55
ISF SS L LQPLK +SR KTLT T KPNN + +KPT
Sbjct: 11 ISFLSS--LTLQPLKIKTHKKSLSTSRSEPIKTLTCTTS------KPNNKSPTMDGRKPT 62
Query: 56 SRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
S+ R+SSYG+SRRS LKKTF QEQVTFT+ +SSDP VGIIGGGMA L +L L
Sbjct: 63 SKKTRKSSYGTSRRSVLKKTFIQEQVTFTSQLSSDPHVGIIGGGMAGLLCALSL 116
>gi|255580905|ref|XP_002531271.1| conserved hypothetical protein [Ricinus communis]
gi|223529104|gb|EEF31084.1| conserved hypothetical protein [Ricinus communis]
Length = 467
Score = 88.2 bits (217), Expect = 5e-16, Method: Composition-based stats.
Identities = 55/102 (53%), Positives = 67/102 (65%), Gaps = 15/102 (14%)
Query: 14 LILQPLKPPKSSRLSKAK------TLTSTIKATSSQPKPNNANAKKPTSRNRRRSSYGSS 67
+ QP+K P S S A TLT T T+SQP K+PTS+NR++SSYG+S
Sbjct: 17 ITFQPIKIPLKSSTSTATKSSKALTLTCT---TTSQPN------KRPTSKNRKKSSYGTS 67
Query: 68 RRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
RRS LKKTF+QEQV F+ PVS DP VGIIGGGMA L ++ L
Sbjct: 68 RRSVLKKTFSQEQVNFSFPVSDDPHVGIIGGGMAGLVCAVNL 109
>gi|225463946|ref|XP_002268010.1| PREDICTED: uncharacterized protein LOC100241002 [Vitis vinifera]
Length = 467
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/76 (53%), Positives = 53/76 (69%), Gaps = 3/76 (3%)
Query: 37 IKATSSQPKPNNA---NAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCV 93
+ + S+P P A +KPTSR R++SYG+SR+S LKK+F QEQV FT P+S DP V
Sbjct: 17 VSGSLSKPNPKKASSMQGQKPTSRKPRKASYGASRKSILKKSFNQEQVMFTAPLSDDPAV 76
Query: 94 GIIGGGMARLALSLVL 109
IIGGGMA L+ +L L
Sbjct: 77 AIIGGGMAGLSCALYL 92
>gi|296087896|emb|CBI35179.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/76 (53%), Positives = 53/76 (69%), Gaps = 3/76 (3%)
Query: 37 IKATSSQPKPNNA---NAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCV 93
+ + S+P P A +KPTSR R++SYG+SR+S LKK+F QEQV FT P+S DP V
Sbjct: 17 VSGSLSKPNPKKASSMQGQKPTSRKPRKASYGASRKSILKKSFNQEQVMFTAPLSDDPAV 76
Query: 94 GIIGGGMARLALSLVL 109
IIGGGMA L+ +L L
Sbjct: 77 AIIGGGMAGLSCALYL 92
>gi|18396966|ref|NP_566235.1| FAD/NAD(P)-binding oxidoreductase domain-containing protein
[Arabidopsis thaliana]
gi|6175169|gb|AAF04895.1|AC011437_10 unknown protein [Arabidopsis thaliana]
gi|15215628|gb|AAK91359.1| AT3g04650/F7O18_13 [Arabidopsis thaliana]
gi|28416461|gb|AAO42761.1| At3g04650/F7O18_13 [Arabidopsis thaliana]
gi|332640593|gb|AEE74114.1| FAD/NAD(P)-binding oxidoreductase domain-containing protein
[Arabidopsis thaliana]
Length = 486
Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 64/97 (65%), Gaps = 2/97 (2%)
Query: 14 LILQPLKPPKSSRLSKAKTLTSTIKATSSQPKPNNANAKKPTSRNRRRSS-YGSSRRSAL 72
L L+ L P+++ LS + + +T + ++ K +N +++ P + RR SS YG+SRRS L
Sbjct: 16 LKLKTLAQPRNTDLSTTRKIITTCDSQNNTGK-SNPSSRNPNFQKRRNSSKYGTSRRSIL 74
Query: 73 KKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
KK+F QEQVTFT VS DP V IIGGGMA L +L L
Sbjct: 75 KKSFLQEQVTFTARVSDDPHVAIIGGGMAGLVCALNL 111
>gi|297828968|ref|XP_002882366.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297328206|gb|EFH58625.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/97 (46%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 14 LILQPLKPPKSSRLSKAKTLTSTIKATSSQPKPNNANAKKPTSRNRRRSS-YGSSRRSAL 72
L L+ L P+++ LS + + +T + + K +N +++ P + RR SS YG+SRRS L
Sbjct: 16 LKLKTLSKPRNTDLSTTRKIITTCDSQKNTGK-SNPSSRNPNFQKRRNSSKYGTSRRSIL 74
Query: 73 KKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
KK+F QEQVTFT +S DP V IIGGGMA L +L L
Sbjct: 75 KKSFLQEQVTFTARISDDPHVAIIGGGMAGLVCALNL 111
>gi|226530158|ref|NP_001140386.1| uncharacterized protein LOC100272440 [Zea mays]
gi|223973745|gb|ACN31060.1| unknown [Zea mays]
Length = 505
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 37/50 (74%)
Query: 57 RNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALS 106
R RR G SRRSA+KK+F QEQV F+TPVS+DP V +IGGG + L+ +
Sbjct: 38 RRGRRGKPGFSRRSAIKKSFHQEQVVFSTPVSTDPTVAVIGGGASGLSCA 87
>gi|414868816|tpg|DAA47373.1| TPA: hypothetical protein ZEAMMB73_111446, partial [Zea mays]
Length = 384
Score = 61.2 bits (147), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/50 (58%), Positives = 37/50 (74%)
Query: 57 RNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALS 106
R RR G SRRSA+KK+F QEQV F+TPVS+DP V +IGGG + L+ +
Sbjct: 38 RRGRRGKPGFSRRSAIKKSFHQEQVVFSTPVSTDPTVAVIGGGASGLSCA 87
>gi|242086110|ref|XP_002443480.1| hypothetical protein SORBIDRAFT_08g020230 [Sorghum bicolor]
gi|241944173|gb|EES17318.1| hypothetical protein SORBIDRAFT_08g020230 [Sorghum bicolor]
Length = 507
Score = 61.2 bits (147), Expect = 7e-08, Method: Composition-based stats.
Identities = 31/61 (50%), Positives = 40/61 (65%)
Query: 49 ANAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLV 108
AN + R RR G SRRSA+KK+F QEQV F+TPV +DP V +IGGG + L+ +
Sbjct: 30 ANPPRTAPRRGRRGKPGFSRRSAIKKSFHQEQVVFSTPVPADPTVAVIGGGASGLSCASA 89
Query: 109 L 109
L
Sbjct: 90 L 90
>gi|414868815|tpg|DAA47372.1| TPA: hypothetical protein ZEAMMB73_111446 [Zea mays]
Length = 210
Score = 61.2 bits (147), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 40/60 (66%)
Query: 47 NNANAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALS 106
A + + R RR G SRRSA+KK+F QEQV F+TPVS+DP V +IGGG + L+ +
Sbjct: 28 GGATPPRASPRRGRRGKPGFSRRSAIKKSFHQEQVVFSTPVSTDPTVAVIGGGASGLSCA 87
>gi|414868817|tpg|DAA47374.1| TPA: hypothetical protein ZEAMMB73_111446, partial [Zea mays]
Length = 266
Score = 60.8 bits (146), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 30/60 (50%), Positives = 40/60 (66%)
Query: 47 NNANAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALS 106
A + + R RR G SRRSA+KK+F QEQV F+TPVS+DP V +IGGG + L+ +
Sbjct: 28 GGATPPRASPRRGRRGKPGFSRRSAIKKSFHQEQVVFSTPVSTDPTVAVIGGGASGLSCA 87
>gi|357116570|ref|XP_003560053.1| PREDICTED: uncharacterized protein LOC100824134 [Brachypodium
distachyon]
Length = 477
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 33/43 (76%)
Query: 67 SRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
SR+SA++K+F QEQV F+TPVS DP V +IGGG + LA + L
Sbjct: 51 SRQSAIRKSFQQEQVVFSTPVSPDPTVAVIGGGASGLACASAL 93
>gi|326493132|dbj|BAJ85027.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 469
Score = 52.8 bits (125), Expect = 2e-05, Method: Composition-based stats.
Identities = 25/43 (58%), Positives = 33/43 (76%)
Query: 67 SRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
SR+SA+KK+F QEQV F+TPV +DP V +IGGG + LA + L
Sbjct: 44 SRQSAIKKSFQQEQVVFSTPVPADPTVAVIGGGASGLACASAL 86
>gi|115489414|ref|NP_001067194.1| Os12g0597400 [Oryza sativa Japonica Group]
gi|77556443|gb|ABA99239.1| expressed protein [Oryza sativa Japonica Group]
gi|113649701|dbj|BAF30213.1| Os12g0597400 [Oryza sativa Japonica Group]
gi|215701342|dbj|BAG92766.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 497
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 25/45 (55%), Positives = 33/45 (73%)
Query: 65 GSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
G SR+SA+KK+F QEQV F+TPV DP V ++GGG + LA + L
Sbjct: 63 GFSRQSAIKKSFHQEQVVFSTPVPPDPSVAVVGGGASGLACAASL 107
>gi|125537277|gb|EAY83765.1| hypothetical protein OsI_38983 [Oryza sativa Indica Group]
Length = 497
Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats.
Identities = 25/45 (55%), Positives = 33/45 (73%)
Query: 65 GSSRRSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
G SR+SA+KK+F QEQV F+TPV DP V ++GGG + LA + L
Sbjct: 63 GFSRQSAIKKSFHQEQVVFSTPVPPDPSVAVVGGGASGLACAASL 107
>gi|443716122|gb|ELU07798.1| hypothetical protein CAPTEDRAFT_193169 [Capitella teleta]
Length = 480
Score = 36.2 bits (82), Expect = 2.8, Method: Composition-based stats.
Identities = 24/60 (40%), Positives = 31/60 (51%)
Query: 31 KTLTSTIKATSSQPKPNNANAKKPTSRNRRRSSYGSSRRSALKKTFAQEQVTFTTPVSSD 90
+TLTS + + + K AK PT+ RR SS G+S SA KK E+ T VS D
Sbjct: 411 RTLTSVVALSRQETKSIMDIAKTPTAMKRRHSSTGASCSSANKKLRPTEKSASDTSVSCD 470
>gi|327271103|ref|XP_003220327.1| PREDICTED: MAP kinase-interacting serine/threonine-protein kinase
1-like, partial [Anolis carolinensis]
Length = 422
Score = 35.8 bits (81), Expect = 3.2, Method: Composition-based stats.
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 11 SSSLILQPLKPPKSSRLSKAKTLTSTIKATSSQPKPNNA 49
S ++ L PP SRL+K + +T IK++ QP P++A
Sbjct: 383 SQAMCSMKLSPPSKSRLAKRRAMTQAIKSSDFQPAPHSA 421
>gi|367043218|ref|XP_003651989.1| glycosyltransferase family 32 protein [Thielavia terrestris NRRL
8126]
gi|346999251|gb|AEO65653.1| glycosyltransferase family 32 protein [Thielavia terrestris NRRL
8126]
Length = 380
Score = 35.4 bits (80), Expect = 4.0, Method: Composition-based stats.
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 3/84 (3%)
Query: 5 FISFASSSSLILQPLKPPKSSRLSKAKTLTSTIKATSSQPKPNNANAKKPTSRNRRR--S 62
F + ++S + L ++ PK+ S++K + T +S +P NN N P RNR R S
Sbjct: 20 FTTRSTSPAASLSAIEFPKTQSGSESKANSQT-TTSSDRPTSNNDNDAVPAGRNRNRPSS 78
Query: 63 SYGSSRRSALKKTFAQEQVTFTTP 86
G + K QE++ + P
Sbjct: 79 PQGQQPMQDMSKMTLQEKLAYQFP 102
>gi|170583274|ref|XP_001896506.1| Cullin family protein [Brugia malayi]
gi|158596270|gb|EDP34648.1| Cullin family protein [Brugia malayi]
Length = 876
Score = 35.0 bits (79), Expect = 5.1, Method: Composition-based stats.
Identities = 28/85 (32%), Positives = 40/85 (47%), Gaps = 2/85 (2%)
Query: 10 SSSSLILQPLKPPKSSRLSKAKTLTSTIKATSSQ-PKPNNANAKKPTSRNRRRSSYGSSR 68
SSSS+ + P ++ S A + K +SS +P N + KP R RSS SS
Sbjct: 2 SSSSVPVNSFIPGAETQHSSADRKEAIRKTSSSLLAEPKNLDEFKPPESKRHRSSLSSST 61
Query: 69 RSALKKTFAQEQVTFTTPVSSDPCV 93
R+ LK + T T+ V+ D V
Sbjct: 62 RAGLKTELGKGATT-TSVVAMDETV 85
>gi|168008463|ref|XP_001756926.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691797|gb|EDQ78157.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 439
Score = 35.0 bits (79), Expect = 5.9, Method: Composition-based stats.
Identities = 17/41 (41%), Positives = 24/41 (58%)
Query: 69 RSALKKTFAQEQVTFTTPVSSDPCVGIIGGGMARLALSLVL 109
R A ++ QV F +P+S P V I+GGGM+ L +L L
Sbjct: 17 RKAKREPAVAPQVHFDSPMSKQPHVAILGGGMSGLVCALTL 57
>gi|312865550|ref|ZP_07725777.1| translation initiation factor IF-2 [Streptococcus downei F0415]
gi|311099068|gb|EFQ57285.1| translation initiation factor IF-2 [Streptococcus downei F0415]
Length = 921
Score = 35.0 bits (79), Expect = 6.4, Method: Composition-based stats.
Identities = 20/72 (27%), Positives = 36/72 (50%)
Query: 9 ASSSSLILQPLKPPKSSRLSKAKTLTSTIKATSSQPKPNNANAKKPTSRNRRRSSYGSSR 68
AS SS +P + PK+ + +KA + A +SQ P+ A KP ++ +++ +
Sbjct: 67 ASESSAPAKPAQAPKAEQEAKASSTQEVKPAQASQSVPSQATQAKPATKPKQKIRNFKAE 126
Query: 69 RSALKKTFAQEQ 80
R A K A+ +
Sbjct: 127 REAKAKAEAERR 138
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.122 0.325
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,401,594,088
Number of Sequences: 23463169
Number of extensions: 43010261
Number of successful extensions: 232514
Number of sequences better than 100.0: 213
Number of HSP's better than 100.0 without gapping: 67
Number of HSP's successfully gapped in prelim test: 146
Number of HSP's that attempted gapping in prelim test: 232081
Number of HSP's gapped (non-prelim): 516
length of query: 109
length of database: 8,064,228,071
effective HSP length: 77
effective length of query: 32
effective length of database: 6,257,564,058
effective search space: 200242049856
effective search space used: 200242049856
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 69 (31.2 bits)