BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036594
(214 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|222422959|dbj|BAH19464.1| AT5G35970 [Arabidopsis thaliana]
Length = 626
Score = 261 bits (666), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 123/205 (60%), Positives = 160/205 (78%), Gaps = 13/205 (6%)
Query: 10 STTFKKKPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAV 69
S+ KKKPRRKS+ K + +K + + S ++V +E+PK +DK +
Sbjct: 56 SSVTKKKPRRKSNVSDKLRFKKIEKRNDNTESESLSV------------VEEPK-NDKEL 102
Query: 70 NVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMGPGLTFVI 129
+++AL+QNG+PLGRRDLG+ VV+WI Q M+AMASDFA+AE+QGEFSELRQ +G GLTFVI
Sbjct: 103 SLRALNQNGDPLGRRDLGRNVVKWISQAMKAMASDFATAEVQGEFSELRQNVGSGLTFVI 162
Query: 130 EAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWK 189
+AQPYLNAIPMP+G E +CLKA THYPTLFDHFQRELRDVLQ+L++K++++ W E+ESWK
Sbjct: 163 QAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIMESWKESESWK 222
Query: 190 LLKELANSAQHRAIVRRVTQPKPVQ 214
LLKE+ANSAQHR + R+ Q KPVQ
Sbjct: 223 LLKEIANSAQHREVARKAAQAKPVQ 247
>gi|30692868|ref|NP_198446.3| putative DNA-binding protein [Arabidopsis thaliana]
gi|332006651|gb|AED94034.1| putative DNA-binding protein [Arabidopsis thaliana]
Length = 961
Score = 260 bits (664), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 124/212 (58%), Positives = 161/212 (75%), Gaps = 13/212 (6%)
Query: 3 SSSSTKSSTTFKKKPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQP 62
S S+ KKKPRRKS+ K + +K + + S ++V +E+P
Sbjct: 49 CSGVNGGSSVTKKKPRRKSNVSDKLRFKKIEKRNDNTESESLSV------------VEEP 96
Query: 63 KKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG 122
K +DK ++++AL+QNG+PLGRRDLG+ VV+WI Q M+AMASDFA+AE+QGEFSELRQ +G
Sbjct: 97 K-NDKELSLRALNQNGDPLGRRDLGRNVVKWISQAMKAMASDFATAEVQGEFSELRQNVG 155
Query: 123 PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDW 182
GLTFVI+AQPYLNAIPMP+G E +CLKA THYPTLFDHFQRELRDVLQ+L++K++++ W
Sbjct: 156 SGLTFVIQAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIMESW 215
Query: 183 HETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
E+ESWKLLKE+ANSAQHR + R+ Q KPVQ
Sbjct: 216 KESESWKLLKEIANSAQHREVARKAAQAKPVQ 247
>gi|449451781|ref|XP_004143639.1| PREDICTED: DNA-binding protein SMUBP-2-like [Cucumis sativus]
gi|449527761|ref|XP_004170878.1| PREDICTED: DNA-binding protein SMUBP-2-like [Cucumis sativus]
Length = 957
Score = 259 bits (662), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 115/158 (72%), Positives = 140/158 (88%)
Query: 57 KQLEQPKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSE 116
++LE+ KK D+ VNVQ + QNG+PLGRR+LGK VVRWI MRAMASDFA+AE+QG+F E
Sbjct: 86 RELEEKKKKDREVNVQGIYQNGDPLGRRELGKSVVRWIGLAMRAMASDFAAAEVQGDFPE 145
Query: 117 LRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQK 176
L+QRMG GLTFVI+AQPYLNA+PMP+GLEAVCLKA THYPTLFDHFQRELRDVLQ+LQ++
Sbjct: 146 LQQRMGQGLTFVIQAQPYLNAVPMPLGLEAVCLKASTHYPTLFDHFQRELRDVLQDLQRQ 205
Query: 177 SLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
SL DW ET+SWKLLK+LA+S QH+AI R++++PK VQ
Sbjct: 206 SLFLDWRETQSWKLLKKLAHSVQHKAIARKISEPKVVQ 243
>gi|297805152|ref|XP_002870460.1| hypothetical protein ARALYDRAFT_493645 [Arabidopsis lyrata subsp.
lyrata]
gi|297316296|gb|EFH46719.1| hypothetical protein ARALYDRAFT_493645 [Arabidopsis lyrata subsp.
lyrata]
Length = 979
Score = 257 bits (656), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 125/201 (62%), Positives = 162/201 (80%), Gaps = 3/201 (1%)
Query: 14 KKKPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNVQA 73
KKKPRRKS+ K++ K +S V++SS + +++ E+ KSDK ++++A
Sbjct: 68 KKKPRRKSNV--SDKLRSKKVEKRNDNSESVSLSSEIVVEEVKEEDEK-PKSDKELSLRA 124
Query: 74 LSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMGPGLTFVIEAQP 133
L+QNG+PLGRRDLG+ VV+WI Q M+AMASDFA+AE+QGEFSELRQ +G GLTFVI+AQP
Sbjct: 125 LNQNGDPLGRRDLGRNVVKWISQAMKAMASDFANAEVQGEFSELRQNVGSGLTFVIQAQP 184
Query: 134 YLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKE 193
YLNAIPMP+G E +CLKA THYPTLFDHFQRELRDVLQ+L++K+++++W ETESWKLLKE
Sbjct: 185 YLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIMENWKETESWKLLKE 244
Query: 194 LANSAQHRAIVRRVTQPKPVQ 214
+ANSAQHR + R+ Q KPVQ
Sbjct: 245 IANSAQHREVARKAAQAKPVQ 265
>gi|225454589|ref|XP_002264216.1| PREDICTED: DNA-binding protein SMUBP-2 [Vitis vinifera]
Length = 953
Score = 253 bits (647), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 115/152 (75%), Positives = 135/152 (88%)
Query: 63 KKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG 122
K +K V+V+ L QNG+PLGRR+L + VVRWI QGMR MA DFASAE+QGEF+ELRQRMG
Sbjct: 88 KSKNKPVSVRTLYQNGDPLGRRELRRCVVRWISQGMRGMALDFASAELQGEFAELRQRMG 147
Query: 123 PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDW 182
PGL+FVI+AQPYLNAIPMP+G EA+CLKA THYPTLFDHFQRELRDVLQ+ Q+KS QDW
Sbjct: 148 PGLSFVIQAQPYLNAIPMPLGHEAICLKACTHYPTLFDHFQRELRDVLQDHQRKSQFQDW 207
Query: 183 HETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
ET+SW+LLKELANSAQHRAI R+V+QPKP++
Sbjct: 208 RETQSWQLLKELANSAQHRAISRKVSQPKPLK 239
>gi|255566048|ref|XP_002524012.1| DNA-binding protein smubp-2, putative [Ricinus communis]
gi|223536739|gb|EEF38380.1| DNA-binding protein smubp-2, putative [Ricinus communis]
Length = 989
Score = 249 bits (637), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 121/201 (60%), Positives = 152/201 (75%), Gaps = 4/201 (1%)
Query: 16 KPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNVQALS 75
KPR ++ +K + K +G +TK ++ + E+ +K VNV++L
Sbjct: 77 KPRTATT--TKKRKAHVKKSTGMSKNTKKYTAANTDGGKLAVSEEREEKVKMKVNVKSLH 134
Query: 76 QNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG--PGLTFVIEAQP 133
QNG+PLG++DLGK VV+WI QGMRAMA+DFASAE QGEF ELRQRM GLTFVI+AQP
Sbjct: 135 QNGDPLGKKDLGKTVVKWISQGMRAMAADFASAETQGEFLELRQRMDLEAGLTFVIQAQP 194
Query: 134 YLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKE 193
Y+NA+P+P+G EA+CLKA HYPTLFDHFQRELRDVLQ+LQ+K LVQDW TESWKLLKE
Sbjct: 195 YINAVPIPLGFEALCLKACIHYPTLFDHFQRELRDVLQDLQRKGLVQDWQNTESWKLLKE 254
Query: 194 LANSAQHRAIVRRVTQPKPVQ 214
LANS QHRA+ R+V++PKP+Q
Sbjct: 255 LANSVQHRAVARKVSKPKPLQ 275
>gi|147867288|emb|CAN81193.1| hypothetical protein VITISV_022848 [Vitis vinifera]
Length = 806
Score = 227 bits (578), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 106/151 (70%), Positives = 125/151 (82%), Gaps = 4/151 (2%)
Query: 63 KKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG 122
K +K V+V+ L QNG+PLGRR+L + VVRWI QGMR MA DFASAE+QGEF+ELRQRMG
Sbjct: 88 KPKNKPVSVRTLYQNGDPLGRRELRRCVVRWISQGMRGMALDFASAELQGEFAELRQRMG 147
Query: 123 PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDW 182
PGL+FVI+AQPYLNAIPMP+G EA+CLKA THYPTLFDHFQRELRDVLQ+ Q+KS QDW
Sbjct: 148 PGLSFVIQAQPYLNAIPMPLGHEAICLKACTHYPTLFDHFQRELRDVLQDHQRKSQFQDW 207
Query: 183 HETESWKLLKELANSAQH----RAIVRRVTQ 209
ET+SW+LLKELANS +AI R+ +
Sbjct: 208 RETQSWQLLKELANSGMELDKAKAIQSRIDE 238
>gi|356536591|ref|XP_003536820.1| PREDICTED: DNA-binding protein SMUBP-2-like [Glycine max]
Length = 949
Score = 205 bits (521), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 107/193 (55%), Positives = 139/193 (72%), Gaps = 15/193 (7%)
Query: 27 SKIQKTKTLSGTG--SSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNVQALSQNGNPLGRR 84
+ +TK +S T S + S V+K+ +++E+ L QNG+P+G++
Sbjct: 53 TNTNETKVVSSTNVKSRRRRRRSGFVDKTCQTREVEE----------GILHQNGDPIGKK 102
Query: 85 DLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQR--MGPGLTFVIEAQPYLNAIPMPV 142
DLGK V+RWI MRAMASD A+AE++G E MGPGLTF++ AQPYLNA+PMP+
Sbjct: 103 DLGKSVIRWIRDSMRAMASDLAAAELEGGEGEFELWELMGPGLTFIMLAQPYLNAVPMPI 162
Query: 143 GLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQ-KSLVQDWHETESWKLLKELANSAQHR 201
GLE +CLKA THYPTLFDHFQRELR VL++LQQ S +QDW +T+SWKLLK+LANSAQHR
Sbjct: 163 GLEGLCLKACTHYPTLFDHFQRELRQVLRDLQQSNSFIQDWRDTKSWKLLKDLANSAQHR 222
Query: 202 AIVRRVTQPKPVQ 214
A+VR++TQPK VQ
Sbjct: 223 AVVRKITQPKSVQ 235
>gi|222623514|gb|EEE57646.1| hypothetical protein OsJ_08074 [Oryza sativa Japonica Group]
Length = 980
Score = 194 bits (494), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 95/159 (59%), Positives = 118/159 (74%), Gaps = 9/159 (5%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELR--- 118
P + ++ V L QNG+PLGR++LG+ VV W+ QGM++MAS FASAE+QG+ ++L
Sbjct: 101 PSMEEASIRVGTLYQNGDPLGRKELGRCVVEWLRQGMQSMASKFASAELQGDMADLDAAA 160
Query: 119 --QRMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
Q G L FVI+AQPY++AIPMP GLEA+CLKA THYPTLFDHFQRELRDVLQ
Sbjct: 161 LAQEWGSADGRLGFVIQAQPYMSAIPMPKGLEALCLKACTHYPTLFDHFQRELRDVLQSC 220
Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKP 212
Q + L+ DW T+SWKLLKE+ANS QHR VRR T P+P
Sbjct: 221 QNQGLISDWRSTQSWKLLKEMANSVQHREAVRR-TVPRP 258
>gi|218191424|gb|EEC73851.1| hypothetical protein OsI_08612 [Oryza sativa Indica Group]
Length = 979
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 95/159 (59%), Positives = 118/159 (74%), Gaps = 9/159 (5%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELR--- 118
P + ++ V L QNG+PLGR++LG+ VV W+ QGM++MAS FASAE+QG+ ++L
Sbjct: 100 PSMEEASIRVGTLYQNGDPLGRKELGRCVVEWLRQGMQSMASKFASAELQGDMADLDAAA 159
Query: 119 --QRMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
Q G L FVI+AQPY++AIPMP GLEA+CLKA THYPTLFDHFQRELRDVLQ
Sbjct: 160 LAQEWGSADGRLGFVIQAQPYMSAIPMPKGLEALCLKACTHYPTLFDHFQRELRDVLQSC 219
Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKP 212
Q + L+ DW T+SWKLLKE+ANS QHR VRR T P+P
Sbjct: 220 QNQGLISDWRSTQSWKLLKEMANSVQHREAVRR-TVPRP 257
>gi|356501783|ref|XP_003519703.1| PREDICTED: DNA-binding protein SMUBP-2-like [Glycine max]
Length = 928
Score = 194 bits (493), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 94/143 (65%), Positives = 115/143 (80%), Gaps = 6/143 (4%)
Query: 74 LSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQ--RMGPGLTFVIEA 131
L QNG+P G++DLGK V+ WI MRAMASD A+AE++G E RMGPGLTF++ A
Sbjct: 76 LHQNGDPFGKKDLGKSVMSWIRDSMRAMASDLAAAELEGGEGEFELWERMGPGLTFIMLA 135
Query: 132 QPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLL 191
QPYLNA+PMP+GLE +CLK THYPTLFDHFQRELR VL++ S +QDW +T+SWKLL
Sbjct: 136 QPYLNAVPMPIGLEGLCLKVCTHYPTLFDHFQRELRQVLRD----SFIQDWRDTKSWKLL 191
Query: 192 KELANSAQHRAIVRRVTQPKPVQ 214
K+LANSAQHRA+VR++TQPK VQ
Sbjct: 192 KDLANSAQHRAVVRKITQPKSVQ 214
>gi|357137188|ref|XP_003570183.1| PREDICTED: DNA-binding protein SMUBP-2-like [Brachypodium
distachyon]
Length = 975
Score = 187 bits (474), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 89/155 (57%), Positives = 111/155 (71%), Gaps = 8/155 (5%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSE----- 116
P + ++ V L +NG+PLGR++LG+ VV W+ QGM++MAS FASAE+QG+ +E
Sbjct: 96 PSVEEASIRVATLYENGDPLGRKELGRCVVEWLKQGMQSMASKFASAELQGDMAEPNTAA 155
Query: 117 LRQRMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
L G L FVI+AQPYL+AIPMP GLEA+C KA THYPTLFDHFQRELRDVL
Sbjct: 156 LTLDWGSAEGQLGFVIQAQPYLSAIPMPKGLEALCFKACTHYPTLFDHFQRELRDVLISY 215
Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVT 208
Q + L+ +W T +WKLLKE+ANS HRA VRR T
Sbjct: 216 QNQGLISEWRSTHTWKLLKEMANSTHHRAAVRRTT 250
>gi|326490191|dbj|BAJ94169.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 786
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 87/155 (56%), Positives = 111/155 (71%), Gaps = 8/155 (5%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQ-- 119
P + ++ V L +NG+PLGR++LG+ VV W+ QGM++MAS FASAE+QGE ++L
Sbjct: 97 PSVEEASIRVGTLYENGDPLGRKELGRCVVEWLRQGMQSMASKFASAELQGEMADLTTAA 156
Query: 120 -RMGPG-----LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
+ G L FVI+AQPYL+ +PMP GLEA+C KA THYPTLFDHFQRELR VL
Sbjct: 157 LTLNWGSAEGQLGFVIQAQPYLSPVPMPKGLEALCFKACTHYPTLFDHFQRELRGVLLSY 216
Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVT 208
Q + L+ DW T++WKLLKE+ANS HRA VRR T
Sbjct: 217 QNQGLIYDWRSTQTWKLLKEMANSTHHRAAVRRTT 251
>gi|218201695|gb|EEC84122.1| hypothetical protein OsI_30457 [Oryza sativa Indica Group]
Length = 981
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 81/152 (53%), Positives = 107/152 (70%), Gaps = 3/152 (1%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRM 121
P + ++ V L ++G+P+G+++LG+ VV+W+ QGM +MA +AS E+Q + +
Sbjct: 106 PSAEEVSIRVNTLYESGDPIGKKELGRCVVQWLKQGMHSMAIKYASTEMQNDGATFLLDG 165
Query: 122 G---PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSL 178
G L FV+ AQPYL+AIPMP G EA+CLKA THYPTLFDHFQRELRDVL + Q + L
Sbjct: 166 GSSEDNLGFVMLAQPYLSAIPMPKGQEALCLKASTHYPTLFDHFQRELRDVLLKQQNQGL 225
Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQP 210
+ DW T+SW LLKELANSAQHRA R+ P
Sbjct: 226 ISDWRTTQSWMLLKELANSAQHRAAARKPKAP 257
>gi|115478052|ref|NP_001062621.1| Os09g0130800 [Oryza sativa Japonica Group]
gi|113630854|dbj|BAF24535.1| Os09g0130800 [Oryza sativa Japonica Group]
gi|215694894|dbj|BAG90085.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641090|gb|EEE69222.1| hypothetical protein OsJ_28445 [Oryza sativa Japonica Group]
Length = 981
Score = 172 bits (436), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 81/152 (53%), Positives = 107/152 (70%), Gaps = 3/152 (1%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRM 121
P + ++ V L ++G+P+G+++LG+ VV+W+ QGM +MA +AS E+Q + +
Sbjct: 106 PSAEEVSIRVNTLYESGDPIGKKELGRCVVQWLKQGMHSMAIKYASTEMQNDGATFLLDG 165
Query: 122 G---PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSL 178
G L FV+ AQPYL+AIPMP G EA+CLKA THYPTLFDHFQRELRDVL + Q + L
Sbjct: 166 GSSEDNLGFVMLAQPYLSAIPMPKGQEALCLKASTHYPTLFDHFQRELRDVLLKQQNQGL 225
Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQP 210
+ DW T+SW LLKELANSAQHRA R+ P
Sbjct: 226 ISDWRTTQSWMLLKELANSAQHRAAARKPKAP 257
>gi|242062790|ref|XP_002452684.1| hypothetical protein SORBIDRAFT_04g030630 [Sorghum bicolor]
gi|241932515|gb|EES05660.1| hypothetical protein SORBIDRAFT_04g030630 [Sorghum bicolor]
Length = 981
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 82/156 (52%), Positives = 109/156 (69%), Gaps = 5/156 (3%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEF--SELRQ 119
P + ++V L QNG+PLGR +LG+ VV W+ QGMR+MAS A+AE+ G+ + L
Sbjct: 104 PSMEEAPISVATLYQNGDPLGRNELGRCVVEWLRQGMRSMASKLAAAEMAGDLVPTALAL 163
Query: 120 RMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQK 176
G L FVI+AQPYL+A PMP GLEA+CLKA THYPTLFDHFQREL DVL Q +
Sbjct: 164 EWGSAEGRLAFVIQAQPYLSAAPMPQGLEALCLKACTHYPTLFDHFQRELCDVLLSYQHQ 223
Query: 177 SLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKP 212
L+ D T+SW++L+E+ANS++H+A VR + +P
Sbjct: 224 GLISDLRATQSWRVLEEMANSSEHQAAVRTTSPRRP 259
>gi|357155053|ref|XP_003576992.1| PREDICTED: DNA-binding protein SMUBP-2-like [Brachypodium
distachyon]
Length = 980
Score = 161 bits (408), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/151 (51%), Positives = 100/151 (66%), Gaps = 2/151 (1%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQ--GEFSELRQ 119
P + + AL ++G+PLGR++LG+ VV+W+ GM +MAS FAS E+Q G +
Sbjct: 106 PSAEEVFTRIGALYESGDPLGRKELGRCVVQWLKLGMHSMASAFASTEVQNDGAAFSMEA 165
Query: 120 RMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLV 179
L F+ AQPYL A PMP G EA+CLKA THYPTLFDHFQRELRDVL + Q + L+
Sbjct: 166 SFEGNLGFINLAQPYLAATPMPKGQEALCLKASTHYPTLFDHFQRELRDVLLKQQNQGLI 225
Query: 180 QDWHETESWKLLKELANSAQHRAIVRRVTQP 210
DW T+SW +LKEL NS+QHR R+ P
Sbjct: 226 SDWRSTQSWMILKELPNSSQHRKGTRKPKSP 256
>gi|326507228|dbj|BAJ95691.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 980
Score = 156 bits (395), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 76/152 (50%), Positives = 102/152 (67%), Gaps = 3/152 (1%)
Query: 62 PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRM 121
P + ++ + ++G+PLGR++LG+ V++W+ GM +MAS FAS E+Q + +
Sbjct: 105 PSAEEVSIRIAEAYESGDPLGRKELGRCVMQWLKLGMCSMASTFASTEVQNDGAAFSLDG 164
Query: 122 GPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSL 178
G L F+I AQPYL+A PMP G EA+CLKA THYPTLFDHFQRELR+VL + Q +
Sbjct: 165 GSSEGSLEFMILAQPYLSATPMPKGHEALCLKASTHYPTLFDHFQRELREVLLKHQNQGR 224
Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQP 210
+ DW TESW LLK+LA SAQHR R+ P
Sbjct: 225 ISDWRSTESWMLLKKLAKSAQHRQGARKPKVP 256
>gi|242048802|ref|XP_002462147.1| hypothetical protein SORBIDRAFT_02g020475 [Sorghum bicolor]
gi|241925524|gb|EER98668.1| hypothetical protein SORBIDRAFT_02g020475 [Sorghum bicolor]
Length = 862
Score = 154 bits (389), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 79/145 (54%), Positives = 100/145 (68%), Gaps = 10/145 (6%)
Query: 71 VQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSE-----LRQRMGPGL 125
V L Q+G+PLGR++LG+ VV+W+ QGM MAS FAS++ Q E + MG
Sbjct: 2 VNNLYQSGDPLGRKELGQRVVQWLKQGMHLMASKFASSD-QSETQDNDPLCFHGHMG--- 57
Query: 126 TFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHET 185
FV++AQ YL+A MP G EA+CLKA THYPTLFD+FQRELRDVL + Q++ LV DW T
Sbjct: 58 -FVMQAQAYLSATAMPRGQEALCLKASTHYPTLFDNFQRELRDVLLQQQEEGLVTDWRST 116
Query: 186 ESWKLLKELANSAQHRAIVRRVTQP 210
SW L+KE+A SA+HR RR T P
Sbjct: 117 RSWMLMKEMARSAEHRTAARRPTAP 141
>gi|224124044|ref|XP_002319231.1| predicted protein [Populus trichocarpa]
gi|222857607|gb|EEE95154.1| predicted protein [Populus trichocarpa]
Length = 788
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 60/75 (80%), Positives = 69/75 (92%)
Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
MP+GLEA+CLKA THYPTLFDHFQRELR+VLQ+L++K LVQDW +TESWKLLKELANSAQ
Sbjct: 1 MPLGLEAICLKACTHYPTLFDHFQRELREVLQDLKRKGLVQDWQKTESWKLLKELANSAQ 60
Query: 200 HRAIVRRVTQPKPVQ 214
HRAI R+ TQ KP+Q
Sbjct: 61 HRAIARKATQSKPLQ 75
>gi|297737213|emb|CBI26414.3| unnamed protein product [Vitis vinifera]
Length = 789
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 58/75 (77%), Positives = 68/75 (90%)
Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
MP+G EA+CLKA THYPTLFDHFQRELRDVLQ+ Q+KS QDW ET+SW+LLKELANSAQ
Sbjct: 1 MPLGHEAICLKACTHYPTLFDHFQRELRDVLQDHQRKSQFQDWRETQSWQLLKELANSAQ 60
Query: 200 HRAIVRRVTQPKPVQ 214
HRAI R+V+QPKP++
Sbjct: 61 HRAISRKVSQPKPLK 75
>gi|41053136|dbj|BAD08079.1| putative DNA helicase [Oryza sativa Japonica Group]
Length = 794
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 53/73 (72%), Positives = 59/73 (80%), Gaps = 1/73 (1%)
Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
MP GLEA+CLKA THYPTLFDHFQRELRDVLQ Q + L+ DW T+SWKLLKE+ANS Q
Sbjct: 1 MPKGLEALCLKACTHYPTLFDHFQRELRDVLQSCQNQGLISDWRSTQSWKLLKEMANSVQ 60
Query: 200 HRAIVRRVTQPKP 212
HR VRR T P+P
Sbjct: 61 HREAVRR-TVPRP 72
>gi|302771055|ref|XP_002968946.1| hypothetical protein SELMODRAFT_90782 [Selaginella moellendorffii]
gi|300163451|gb|EFJ30062.1| hypothetical protein SELMODRAFT_90782 [Selaginella moellendorffii]
Length = 827
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 64/93 (68%)
Query: 113 EFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQE 172
+ + + +R+G G V QP+L+ + +P GLE C +A HYPTL+DHFQREL++ L +
Sbjct: 13 DLAGVEERVGVGEELVRLVQPFLDRVELPEGLEDECQRARNHYPTLYDHFQRELKNSLVD 72
Query: 173 LQQKSLVQDWHETESWKLLKELANSAQHRAIVR 205
LQ+K V++W ET+SWKL K+ + S QHR+ R
Sbjct: 73 LQEKGSVEEWQETDSWKLFKKRSKSGQHRSSAR 105
>gi|302816629|ref|XP_002989993.1| hypothetical protein SELMODRAFT_130836 [Selaginella moellendorffii]
gi|300142304|gb|EFJ09006.1| hypothetical protein SELMODRAFT_130836 [Selaginella moellendorffii]
Length = 826
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 64/93 (68%)
Query: 113 EFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQE 172
+ + + +R+G G V QP+L+ + +P GLE C +A HYPTL+DHFQREL++ L +
Sbjct: 13 DLAGVEERVGVGEELVRLVQPFLDRVELPEGLEDECQRARNHYPTLYDHFQRELKNSLVD 72
Query: 173 LQQKSLVQDWHETESWKLLKELANSAQHRAIVR 205
LQ++ V++W ET+SWKL K+ + S QHR+ R
Sbjct: 73 LQEQGSVEEWQETDSWKLFKKRSKSGQHRSSAR 105
>gi|168051009|ref|XP_001777949.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670709|gb|EDQ57273.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 791
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 40/75 (53%), Positives = 53/75 (70%)
Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
MP GLE +CL+A HYPT+FDH+QRELR LQ Q++ + +W TESWKLLK+ A S +
Sbjct: 1 MPEGLEPLCLRASLHYPTIFDHYQRELRAALQASQERGKLDNWQNTESWKLLKKFAKSGE 60
Query: 200 HRAIVRRVTQPKPVQ 214
HRA+ R+V P +
Sbjct: 61 HRAVARKVKDPNQIH 75
>gi|9758800|dbj|BAB09253.1| DNA helicase-like [Arabidopsis thaliana]
Length = 750
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 23/36 (63%), Positives = 29/36 (80%)
Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
++ W E+ESWKLLKE+ANSAQHR + R+ Q KPVQ
Sbjct: 1 MESWKESESWKLLKEIANSAQHREVARKAAQAKPVQ 36
>gi|409078635|gb|EKM78998.1| hypothetical protein AGABI1DRAFT_129263 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 781
Score = 37.4 bits (85), Expect = 3.9, Method: Composition-based stats.
Identities = 31/121 (25%), Positives = 51/121 (42%), Gaps = 20/121 (16%)
Query: 85 DLGKGVVRWIC-QGMRAMASDFASAEIQGEFSELRQRMGPGLTFVIEAQ----PYLNAIP 139
DL + ++ W+C G R+ A+IQ +F+ R + +A P +
Sbjct: 81 DLTREIISWLCDSGRRSNVYLIQEAQIQTDFAHEIWRANAFASICYDAHRFGLPLRGTLD 140
Query: 140 MPVGLEAVCLKAGTHYP--------TLFDHFQRELRDVLQELQQKSLVQDWHETESWKLL 191
P + +C THYP LFD+ E+R++ + Q K L+ E WK L
Sbjct: 141 FPPDMFRICAALATHYPPFRRILTKILFDN--TEIRNMPKRTQFKKLIY-----EPWKSL 193
Query: 192 K 192
+
Sbjct: 194 Q 194
>gi|393238363|gb|EJD45900.1| hypothetical protein AURDEDRAFT_164960 [Auricularia delicata
TFB-10046 SS5]
Length = 410
Score = 37.0 bits (84), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 8/101 (7%)
Query: 26 KSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNV----QALSQNGNPL 81
K+K T L G +TK + +V +G K+LE P +S+KA + QA++Q G+ L
Sbjct: 240 KAKKPPTYRLRGGWKATKSGLLPLVLARNGLKKLEYPTQSEKAKRIPRMFQAMAQKGDTL 299
Query: 82 GRRDLGKGVVRWICQG----MRAMASDFASAEIQGEFSELR 118
R G ++ +G MR SD +I G E+R
Sbjct: 300 ASRADGWAFCLFMPKGGAGIMRTWHSDQVQQDIPGLPDEVR 340
>gi|393241780|gb|EJD49300.1| hypothetical protein AURDEDRAFT_161397 [Auricularia delicata
TFB-10046 SS5]
Length = 374
Score = 37.0 bits (84), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 8/101 (7%)
Query: 26 KSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNV----QALSQNGNPL 81
K+K T L G +TK + +V +G K+LE P +S+KA + QA++Q G+ L
Sbjct: 204 KAKKPPTYRLRGGWKATKSGLLPLVLARNGLKKLEYPTQSEKAKRIPRMFQAMAQKGDTL 263
Query: 82 GRRDLGKGVVRWICQG----MRAMASDFASAEIQGEFSELR 118
R G ++ +G MR SD +I G E+R
Sbjct: 264 ASRADGWAFCLFMPKGGAGIMRTWHSDQVQQDIPGLPDEVR 304
>gi|393229750|gb|EJD37367.1| hypothetical protein AURDEDRAFT_173528 [Auricularia delicata
TFB-10046 SS5]
Length = 412
Score = 36.6 bits (83), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 8/101 (7%)
Query: 26 KSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNV----QALSQNGNPL 81
K+K T L G +TK + +V +G K+LE P +S+KA + QA++Q G+ L
Sbjct: 240 KAKKPPTYRLRGGWKATKSGLLPLVLARNGLKKLEYPTQSEKAKCIPRMFQAMAQKGDTL 299
Query: 82 GRRDLGKGVVRWICQG----MRAMASDFASAEIQGEFSELR 118
R G ++ +G MR SD +I G E+R
Sbjct: 300 ASRADGWAFCLFMPKGGAGIMRTWHSDQVQQDIPGLPDEVR 340
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.312 0.125 0.350
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,065,332,121
Number of Sequences: 23463169
Number of extensions: 117715924
Number of successful extensions: 374787
Number of sequences better than 100.0: 157
Number of HSP's better than 100.0 without gapping: 57
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 374096
Number of HSP's gapped (non-prelim): 683
length of query: 214
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 78
effective length of database: 9,168,204,383
effective search space: 715119941874
effective search space used: 715119941874
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 74 (33.1 bits)