BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 036594
         (214 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|222422959|dbj|BAH19464.1| AT5G35970 [Arabidopsis thaliana]
          Length = 626

 Score =  261 bits (666), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 123/205 (60%), Positives = 160/205 (78%), Gaps = 13/205 (6%)

Query: 10  STTFKKKPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAV 69
           S+  KKKPRRKS+   K + +K +  +    S  ++V            +E+PK +DK +
Sbjct: 56  SSVTKKKPRRKSNVSDKLRFKKIEKRNDNTESESLSV------------VEEPK-NDKEL 102

Query: 70  NVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMGPGLTFVI 129
           +++AL+QNG+PLGRRDLG+ VV+WI Q M+AMASDFA+AE+QGEFSELRQ +G GLTFVI
Sbjct: 103 SLRALNQNGDPLGRRDLGRNVVKWISQAMKAMASDFATAEVQGEFSELRQNVGSGLTFVI 162

Query: 130 EAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWK 189
           +AQPYLNAIPMP+G E +CLKA THYPTLFDHFQRELRDVLQ+L++K++++ W E+ESWK
Sbjct: 163 QAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIMESWKESESWK 222

Query: 190 LLKELANSAQHRAIVRRVTQPKPVQ 214
           LLKE+ANSAQHR + R+  Q KPVQ
Sbjct: 223 LLKEIANSAQHREVARKAAQAKPVQ 247


>gi|30692868|ref|NP_198446.3| putative DNA-binding protein [Arabidopsis thaliana]
 gi|332006651|gb|AED94034.1| putative DNA-binding protein [Arabidopsis thaliana]
          Length = 961

 Score =  260 bits (664), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 124/212 (58%), Positives = 161/212 (75%), Gaps = 13/212 (6%)

Query: 3   SSSSTKSSTTFKKKPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQP 62
            S     S+  KKKPRRKS+   K + +K +  +    S  ++V            +E+P
Sbjct: 49  CSGVNGGSSVTKKKPRRKSNVSDKLRFKKIEKRNDNTESESLSV------------VEEP 96

Query: 63  KKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG 122
           K +DK ++++AL+QNG+PLGRRDLG+ VV+WI Q M+AMASDFA+AE+QGEFSELRQ +G
Sbjct: 97  K-NDKELSLRALNQNGDPLGRRDLGRNVVKWISQAMKAMASDFATAEVQGEFSELRQNVG 155

Query: 123 PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDW 182
            GLTFVI+AQPYLNAIPMP+G E +CLKA THYPTLFDHFQRELRDVLQ+L++K++++ W
Sbjct: 156 SGLTFVIQAQPYLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIMESW 215

Query: 183 HETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
            E+ESWKLLKE+ANSAQHR + R+  Q KPVQ
Sbjct: 216 KESESWKLLKEIANSAQHREVARKAAQAKPVQ 247


>gi|449451781|ref|XP_004143639.1| PREDICTED: DNA-binding protein SMUBP-2-like [Cucumis sativus]
 gi|449527761|ref|XP_004170878.1| PREDICTED: DNA-binding protein SMUBP-2-like [Cucumis sativus]
          Length = 957

 Score =  259 bits (662), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 115/158 (72%), Positives = 140/158 (88%)

Query: 57  KQLEQPKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSE 116
           ++LE+ KK D+ VNVQ + QNG+PLGRR+LGK VVRWI   MRAMASDFA+AE+QG+F E
Sbjct: 86  RELEEKKKKDREVNVQGIYQNGDPLGRRELGKSVVRWIGLAMRAMASDFAAAEVQGDFPE 145

Query: 117 LRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQK 176
           L+QRMG GLTFVI+AQPYLNA+PMP+GLEAVCLKA THYPTLFDHFQRELRDVLQ+LQ++
Sbjct: 146 LQQRMGQGLTFVIQAQPYLNAVPMPLGLEAVCLKASTHYPTLFDHFQRELRDVLQDLQRQ 205

Query: 177 SLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
           SL  DW ET+SWKLLK+LA+S QH+AI R++++PK VQ
Sbjct: 206 SLFLDWRETQSWKLLKKLAHSVQHKAIARKISEPKVVQ 243


>gi|297805152|ref|XP_002870460.1| hypothetical protein ARALYDRAFT_493645 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316296|gb|EFH46719.1| hypothetical protein ARALYDRAFT_493645 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 979

 Score =  257 bits (656), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 125/201 (62%), Positives = 162/201 (80%), Gaps = 3/201 (1%)

Query: 14  KKKPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNVQA 73
           KKKPRRKS+     K++  K      +S  V++SS +     +++ E+  KSDK ++++A
Sbjct: 68  KKKPRRKSNV--SDKLRSKKVEKRNDNSESVSLSSEIVVEEVKEEDEK-PKSDKELSLRA 124

Query: 74  LSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMGPGLTFVIEAQP 133
           L+QNG+PLGRRDLG+ VV+WI Q M+AMASDFA+AE+QGEFSELRQ +G GLTFVI+AQP
Sbjct: 125 LNQNGDPLGRRDLGRNVVKWISQAMKAMASDFANAEVQGEFSELRQNVGSGLTFVIQAQP 184

Query: 134 YLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKE 193
           YLNAIPMP+G E +CLKA THYPTLFDHFQRELRDVLQ+L++K+++++W ETESWKLLKE
Sbjct: 185 YLNAIPMPLGSEVICLKACTHYPTLFDHFQRELRDVLQDLERKNIMENWKETESWKLLKE 244

Query: 194 LANSAQHRAIVRRVTQPKPVQ 214
           +ANSAQHR + R+  Q KPVQ
Sbjct: 245 IANSAQHREVARKAAQAKPVQ 265


>gi|225454589|ref|XP_002264216.1| PREDICTED: DNA-binding protein SMUBP-2 [Vitis vinifera]
          Length = 953

 Score =  253 bits (647), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 115/152 (75%), Positives = 135/152 (88%)

Query: 63  KKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG 122
           K  +K V+V+ L QNG+PLGRR+L + VVRWI QGMR MA DFASAE+QGEF+ELRQRMG
Sbjct: 88  KSKNKPVSVRTLYQNGDPLGRRELRRCVVRWISQGMRGMALDFASAELQGEFAELRQRMG 147

Query: 123 PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDW 182
           PGL+FVI+AQPYLNAIPMP+G EA+CLKA THYPTLFDHFQRELRDVLQ+ Q+KS  QDW
Sbjct: 148 PGLSFVIQAQPYLNAIPMPLGHEAICLKACTHYPTLFDHFQRELRDVLQDHQRKSQFQDW 207

Query: 183 HETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
            ET+SW+LLKELANSAQHRAI R+V+QPKP++
Sbjct: 208 RETQSWQLLKELANSAQHRAISRKVSQPKPLK 239


>gi|255566048|ref|XP_002524012.1| DNA-binding protein smubp-2, putative [Ricinus communis]
 gi|223536739|gb|EEF38380.1| DNA-binding protein smubp-2, putative [Ricinus communis]
          Length = 989

 Score =  249 bits (637), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 121/201 (60%), Positives = 152/201 (75%), Gaps = 4/201 (1%)

Query: 16  KPRRKSSAFSKSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNVQALS 75
           KPR  ++  +K +    K  +G   +TK   ++  +        E+ +K    VNV++L 
Sbjct: 77  KPRTATT--TKKRKAHVKKSTGMSKNTKKYTAANTDGGKLAVSEEREEKVKMKVNVKSLH 134

Query: 76  QNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG--PGLTFVIEAQP 133
           QNG+PLG++DLGK VV+WI QGMRAMA+DFASAE QGEF ELRQRM    GLTFVI+AQP
Sbjct: 135 QNGDPLGKKDLGKTVVKWISQGMRAMAADFASAETQGEFLELRQRMDLEAGLTFVIQAQP 194

Query: 134 YLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKE 193
           Y+NA+P+P+G EA+CLKA  HYPTLFDHFQRELRDVLQ+LQ+K LVQDW  TESWKLLKE
Sbjct: 195 YINAVPIPLGFEALCLKACIHYPTLFDHFQRELRDVLQDLQRKGLVQDWQNTESWKLLKE 254

Query: 194 LANSAQHRAIVRRVTQPKPVQ 214
           LANS QHRA+ R+V++PKP+Q
Sbjct: 255 LANSVQHRAVARKVSKPKPLQ 275


>gi|147867288|emb|CAN81193.1| hypothetical protein VITISV_022848 [Vitis vinifera]
          Length = 806

 Score =  227 bits (578), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 106/151 (70%), Positives = 125/151 (82%), Gaps = 4/151 (2%)

Query: 63  KKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRMG 122
           K  +K V+V+ L QNG+PLGRR+L + VVRWI QGMR MA DFASAE+QGEF+ELRQRMG
Sbjct: 88  KPKNKPVSVRTLYQNGDPLGRRELRRCVVRWISQGMRGMALDFASAELQGEFAELRQRMG 147

Query: 123 PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDW 182
           PGL+FVI+AQPYLNAIPMP+G EA+CLKA THYPTLFDHFQRELRDVLQ+ Q+KS  QDW
Sbjct: 148 PGLSFVIQAQPYLNAIPMPLGHEAICLKACTHYPTLFDHFQRELRDVLQDHQRKSQFQDW 207

Query: 183 HETESWKLLKELANSAQH----RAIVRRVTQ 209
            ET+SW+LLKELANS       +AI  R+ +
Sbjct: 208 RETQSWQLLKELANSGMELDKAKAIQSRIDE 238


>gi|356536591|ref|XP_003536820.1| PREDICTED: DNA-binding protein SMUBP-2-like [Glycine max]
          Length = 949

 Score =  205 bits (521), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 107/193 (55%), Positives = 139/193 (72%), Gaps = 15/193 (7%)

Query: 27  SKIQKTKTLSGTG--SSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNVQALSQNGNPLGRR 84
           +   +TK +S T   S  +   S  V+K+   +++E+            L QNG+P+G++
Sbjct: 53  TNTNETKVVSSTNVKSRRRRRRSGFVDKTCQTREVEE----------GILHQNGDPIGKK 102

Query: 85  DLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQR--MGPGLTFVIEAQPYLNAIPMPV 142
           DLGK V+RWI   MRAMASD A+AE++G   E      MGPGLTF++ AQPYLNA+PMP+
Sbjct: 103 DLGKSVIRWIRDSMRAMASDLAAAELEGGEGEFELWELMGPGLTFIMLAQPYLNAVPMPI 162

Query: 143 GLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQ-KSLVQDWHETESWKLLKELANSAQHR 201
           GLE +CLKA THYPTLFDHFQRELR VL++LQQ  S +QDW +T+SWKLLK+LANSAQHR
Sbjct: 163 GLEGLCLKACTHYPTLFDHFQRELRQVLRDLQQSNSFIQDWRDTKSWKLLKDLANSAQHR 222

Query: 202 AIVRRVTQPKPVQ 214
           A+VR++TQPK VQ
Sbjct: 223 AVVRKITQPKSVQ 235


>gi|222623514|gb|EEE57646.1| hypothetical protein OsJ_08074 [Oryza sativa Japonica Group]
          Length = 980

 Score =  194 bits (494), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 95/159 (59%), Positives = 118/159 (74%), Gaps = 9/159 (5%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELR--- 118
           P   + ++ V  L QNG+PLGR++LG+ VV W+ QGM++MAS FASAE+QG+ ++L    
Sbjct: 101 PSMEEASIRVGTLYQNGDPLGRKELGRCVVEWLRQGMQSMASKFASAELQGDMADLDAAA 160

Query: 119 --QRMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
             Q  G     L FVI+AQPY++AIPMP GLEA+CLKA THYPTLFDHFQRELRDVLQ  
Sbjct: 161 LAQEWGSADGRLGFVIQAQPYMSAIPMPKGLEALCLKACTHYPTLFDHFQRELRDVLQSC 220

Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKP 212
           Q + L+ DW  T+SWKLLKE+ANS QHR  VRR T P+P
Sbjct: 221 QNQGLISDWRSTQSWKLLKEMANSVQHREAVRR-TVPRP 258


>gi|218191424|gb|EEC73851.1| hypothetical protein OsI_08612 [Oryza sativa Indica Group]
          Length = 979

 Score =  194 bits (494), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 95/159 (59%), Positives = 118/159 (74%), Gaps = 9/159 (5%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELR--- 118
           P   + ++ V  L QNG+PLGR++LG+ VV W+ QGM++MAS FASAE+QG+ ++L    
Sbjct: 100 PSMEEASIRVGTLYQNGDPLGRKELGRCVVEWLRQGMQSMASKFASAELQGDMADLDAAA 159

Query: 119 --QRMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
             Q  G     L FVI+AQPY++AIPMP GLEA+CLKA THYPTLFDHFQRELRDVLQ  
Sbjct: 160 LAQEWGSADGRLGFVIQAQPYMSAIPMPKGLEALCLKACTHYPTLFDHFQRELRDVLQSC 219

Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKP 212
           Q + L+ DW  T+SWKLLKE+ANS QHR  VRR T P+P
Sbjct: 220 QNQGLISDWRSTQSWKLLKEMANSVQHREAVRR-TVPRP 257


>gi|356501783|ref|XP_003519703.1| PREDICTED: DNA-binding protein SMUBP-2-like [Glycine max]
          Length = 928

 Score =  194 bits (493), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 94/143 (65%), Positives = 115/143 (80%), Gaps = 6/143 (4%)

Query: 74  LSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQ--RMGPGLTFVIEA 131
           L QNG+P G++DLGK V+ WI   MRAMASD A+AE++G   E     RMGPGLTF++ A
Sbjct: 76  LHQNGDPFGKKDLGKSVMSWIRDSMRAMASDLAAAELEGGEGEFELWERMGPGLTFIMLA 135

Query: 132 QPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLL 191
           QPYLNA+PMP+GLE +CLK  THYPTLFDHFQRELR VL++    S +QDW +T+SWKLL
Sbjct: 136 QPYLNAVPMPIGLEGLCLKVCTHYPTLFDHFQRELRQVLRD----SFIQDWRDTKSWKLL 191

Query: 192 KELANSAQHRAIVRRVTQPKPVQ 214
           K+LANSAQHRA+VR++TQPK VQ
Sbjct: 192 KDLANSAQHRAVVRKITQPKSVQ 214


>gi|357137188|ref|XP_003570183.1| PREDICTED: DNA-binding protein SMUBP-2-like [Brachypodium
           distachyon]
          Length = 975

 Score =  187 bits (474), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 89/155 (57%), Positives = 111/155 (71%), Gaps = 8/155 (5%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSE----- 116
           P   + ++ V  L +NG+PLGR++LG+ VV W+ QGM++MAS FASAE+QG+ +E     
Sbjct: 96  PSVEEASIRVATLYENGDPLGRKELGRCVVEWLKQGMQSMASKFASAELQGDMAEPNTAA 155

Query: 117 LRQRMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
           L    G     L FVI+AQPYL+AIPMP GLEA+C KA THYPTLFDHFQRELRDVL   
Sbjct: 156 LTLDWGSAEGQLGFVIQAQPYLSAIPMPKGLEALCFKACTHYPTLFDHFQRELRDVLISY 215

Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVT 208
           Q + L+ +W  T +WKLLKE+ANS  HRA VRR T
Sbjct: 216 QNQGLISEWRSTHTWKLLKEMANSTHHRAAVRRTT 250


>gi|326490191|dbj|BAJ94169.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 786

 Score =  184 bits (468), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 87/155 (56%), Positives = 111/155 (71%), Gaps = 8/155 (5%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQ-- 119
           P   + ++ V  L +NG+PLGR++LG+ VV W+ QGM++MAS FASAE+QGE ++L    
Sbjct: 97  PSVEEASIRVGTLYENGDPLGRKELGRCVVEWLRQGMQSMASKFASAELQGEMADLTTAA 156

Query: 120 -RMGPG-----LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQEL 173
             +  G     L FVI+AQPYL+ +PMP GLEA+C KA THYPTLFDHFQRELR VL   
Sbjct: 157 LTLNWGSAEGQLGFVIQAQPYLSPVPMPKGLEALCFKACTHYPTLFDHFQRELRGVLLSY 216

Query: 174 QQKSLVQDWHETESWKLLKELANSAQHRAIVRRVT 208
           Q + L+ DW  T++WKLLKE+ANS  HRA VRR T
Sbjct: 217 QNQGLIYDWRSTQTWKLLKEMANSTHHRAAVRRTT 251


>gi|218201695|gb|EEC84122.1| hypothetical protein OsI_30457 [Oryza sativa Indica Group]
          Length = 981

 Score =  172 bits (436), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 81/152 (53%), Positives = 107/152 (70%), Gaps = 3/152 (1%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRM 121
           P   + ++ V  L ++G+P+G+++LG+ VV+W+ QGM +MA  +AS E+Q + +      
Sbjct: 106 PSAEEVSIRVNTLYESGDPIGKKELGRCVVQWLKQGMHSMAIKYASTEMQNDGATFLLDG 165

Query: 122 G---PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSL 178
           G     L FV+ AQPYL+AIPMP G EA+CLKA THYPTLFDHFQRELRDVL + Q + L
Sbjct: 166 GSSEDNLGFVMLAQPYLSAIPMPKGQEALCLKASTHYPTLFDHFQRELRDVLLKQQNQGL 225

Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQP 210
           + DW  T+SW LLKELANSAQHRA  R+   P
Sbjct: 226 ISDWRTTQSWMLLKELANSAQHRAAARKPKAP 257


>gi|115478052|ref|NP_001062621.1| Os09g0130800 [Oryza sativa Japonica Group]
 gi|113630854|dbj|BAF24535.1| Os09g0130800 [Oryza sativa Japonica Group]
 gi|215694894|dbj|BAG90085.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641090|gb|EEE69222.1| hypothetical protein OsJ_28445 [Oryza sativa Japonica Group]
          Length = 981

 Score =  172 bits (436), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 81/152 (53%), Positives = 107/152 (70%), Gaps = 3/152 (1%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRM 121
           P   + ++ V  L ++G+P+G+++LG+ VV+W+ QGM +MA  +AS E+Q + +      
Sbjct: 106 PSAEEVSIRVNTLYESGDPIGKKELGRCVVQWLKQGMHSMAIKYASTEMQNDGATFLLDG 165

Query: 122 G---PGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSL 178
           G     L FV+ AQPYL+AIPMP G EA+CLKA THYPTLFDHFQRELRDVL + Q + L
Sbjct: 166 GSSEDNLGFVMLAQPYLSAIPMPKGQEALCLKASTHYPTLFDHFQRELRDVLLKQQNQGL 225

Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQP 210
           + DW  T+SW LLKELANSAQHRA  R+   P
Sbjct: 226 ISDWRTTQSWMLLKELANSAQHRAAARKPKAP 257


>gi|242062790|ref|XP_002452684.1| hypothetical protein SORBIDRAFT_04g030630 [Sorghum bicolor]
 gi|241932515|gb|EES05660.1| hypothetical protein SORBIDRAFT_04g030630 [Sorghum bicolor]
          Length = 981

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 82/156 (52%), Positives = 109/156 (69%), Gaps = 5/156 (3%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEF--SELRQ 119
           P   +  ++V  L QNG+PLGR +LG+ VV W+ QGMR+MAS  A+AE+ G+   + L  
Sbjct: 104 PSMEEAPISVATLYQNGDPLGRNELGRCVVEWLRQGMRSMASKLAAAEMAGDLVPTALAL 163

Query: 120 RMGPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQK 176
             G     L FVI+AQPYL+A PMP GLEA+CLKA THYPTLFDHFQREL DVL   Q +
Sbjct: 164 EWGSAEGRLAFVIQAQPYLSAAPMPQGLEALCLKACTHYPTLFDHFQRELCDVLLSYQHQ 223

Query: 177 SLVQDWHETESWKLLKELANSAQHRAIVRRVTQPKP 212
            L+ D   T+SW++L+E+ANS++H+A VR  +  +P
Sbjct: 224 GLISDLRATQSWRVLEEMANSSEHQAAVRTTSPRRP 259


>gi|357155053|ref|XP_003576992.1| PREDICTED: DNA-binding protein SMUBP-2-like [Brachypodium
           distachyon]
          Length = 980

 Score =  161 bits (408), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 78/151 (51%), Positives = 100/151 (66%), Gaps = 2/151 (1%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQ--GEFSELRQ 119
           P   +    + AL ++G+PLGR++LG+ VV+W+  GM +MAS FAS E+Q  G    +  
Sbjct: 106 PSAEEVFTRIGALYESGDPLGRKELGRCVVQWLKLGMHSMASAFASTEVQNDGAAFSMEA 165

Query: 120 RMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLV 179
                L F+  AQPYL A PMP G EA+CLKA THYPTLFDHFQRELRDVL + Q + L+
Sbjct: 166 SFEGNLGFINLAQPYLAATPMPKGQEALCLKASTHYPTLFDHFQRELRDVLLKQQNQGLI 225

Query: 180 QDWHETESWKLLKELANSAQHRAIVRRVTQP 210
            DW  T+SW +LKEL NS+QHR   R+   P
Sbjct: 226 SDWRSTQSWMILKELPNSSQHRKGTRKPKSP 256


>gi|326507228|dbj|BAJ95691.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 980

 Score =  156 bits (395), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 76/152 (50%), Positives = 102/152 (67%), Gaps = 3/152 (1%)

Query: 62  PKKSDKAVNVQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSELRQRM 121
           P   + ++ +    ++G+PLGR++LG+ V++W+  GM +MAS FAS E+Q + +      
Sbjct: 105 PSAEEVSIRIAEAYESGDPLGRKELGRCVMQWLKLGMCSMASTFASTEVQNDGAAFSLDG 164

Query: 122 GPG---LTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSL 178
           G     L F+I AQPYL+A PMP G EA+CLKA THYPTLFDHFQRELR+VL + Q +  
Sbjct: 165 GSSEGSLEFMILAQPYLSATPMPKGHEALCLKASTHYPTLFDHFQRELREVLLKHQNQGR 224

Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQP 210
           + DW  TESW LLK+LA SAQHR   R+   P
Sbjct: 225 ISDWRSTESWMLLKKLAKSAQHRQGARKPKVP 256


>gi|242048802|ref|XP_002462147.1| hypothetical protein SORBIDRAFT_02g020475 [Sorghum bicolor]
 gi|241925524|gb|EER98668.1| hypothetical protein SORBIDRAFT_02g020475 [Sorghum bicolor]
          Length = 862

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 79/145 (54%), Positives = 100/145 (68%), Gaps = 10/145 (6%)

Query: 71  VQALSQNGNPLGRRDLGKGVVRWICQGMRAMASDFASAEIQGEFSE-----LRQRMGPGL 125
           V  L Q+G+PLGR++LG+ VV+W+ QGM  MAS FAS++ Q E  +         MG   
Sbjct: 2   VNNLYQSGDPLGRKELGQRVVQWLKQGMHLMASKFASSD-QSETQDNDPLCFHGHMG--- 57

Query: 126 TFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHET 185
            FV++AQ YL+A  MP G EA+CLKA THYPTLFD+FQRELRDVL + Q++ LV DW  T
Sbjct: 58  -FVMQAQAYLSATAMPRGQEALCLKASTHYPTLFDNFQRELRDVLLQQQEEGLVTDWRST 116

Query: 186 ESWKLLKELANSAQHRAIVRRVTQP 210
            SW L+KE+A SA+HR   RR T P
Sbjct: 117 RSWMLMKEMARSAEHRTAARRPTAP 141


>gi|224124044|ref|XP_002319231.1| predicted protein [Populus trichocarpa]
 gi|222857607|gb|EEE95154.1| predicted protein [Populus trichocarpa]
          Length = 788

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 60/75 (80%), Positives = 69/75 (92%)

Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
           MP+GLEA+CLKA THYPTLFDHFQRELR+VLQ+L++K LVQDW +TESWKLLKELANSAQ
Sbjct: 1   MPLGLEAICLKACTHYPTLFDHFQRELREVLQDLKRKGLVQDWQKTESWKLLKELANSAQ 60

Query: 200 HRAIVRRVTQPKPVQ 214
           HRAI R+ TQ KP+Q
Sbjct: 61  HRAIARKATQSKPLQ 75


>gi|297737213|emb|CBI26414.3| unnamed protein product [Vitis vinifera]
          Length = 789

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 58/75 (77%), Positives = 68/75 (90%)

Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
           MP+G EA+CLKA THYPTLFDHFQRELRDVLQ+ Q+KS  QDW ET+SW+LLKELANSAQ
Sbjct: 1   MPLGHEAICLKACTHYPTLFDHFQRELRDVLQDHQRKSQFQDWRETQSWQLLKELANSAQ 60

Query: 200 HRAIVRRVTQPKPVQ 214
           HRAI R+V+QPKP++
Sbjct: 61  HRAISRKVSQPKPLK 75


>gi|41053136|dbj|BAD08079.1| putative DNA helicase [Oryza sativa Japonica Group]
          Length = 794

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 53/73 (72%), Positives = 59/73 (80%), Gaps = 1/73 (1%)

Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
           MP GLEA+CLKA THYPTLFDHFQRELRDVLQ  Q + L+ DW  T+SWKLLKE+ANS Q
Sbjct: 1   MPKGLEALCLKACTHYPTLFDHFQRELRDVLQSCQNQGLISDWRSTQSWKLLKEMANSVQ 60

Query: 200 HRAIVRRVTQPKP 212
           HR  VRR T P+P
Sbjct: 61  HREAVRR-TVPRP 72


>gi|302771055|ref|XP_002968946.1| hypothetical protein SELMODRAFT_90782 [Selaginella moellendorffii]
 gi|300163451|gb|EFJ30062.1| hypothetical protein SELMODRAFT_90782 [Selaginella moellendorffii]
          Length = 827

 Score =  100 bits (249), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 64/93 (68%)

Query: 113 EFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQE 172
           + + + +R+G G   V   QP+L+ + +P GLE  C +A  HYPTL+DHFQREL++ L +
Sbjct: 13  DLAGVEERVGVGEELVRLVQPFLDRVELPEGLEDECQRARNHYPTLYDHFQRELKNSLVD 72

Query: 173 LQQKSLVQDWHETESWKLLKELANSAQHRAIVR 205
           LQ+K  V++W ET+SWKL K+ + S QHR+  R
Sbjct: 73  LQEKGSVEEWQETDSWKLFKKRSKSGQHRSSAR 105


>gi|302816629|ref|XP_002989993.1| hypothetical protein SELMODRAFT_130836 [Selaginella moellendorffii]
 gi|300142304|gb|EFJ09006.1| hypothetical protein SELMODRAFT_130836 [Selaginella moellendorffii]
          Length = 826

 Score = 99.4 bits (246), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 42/93 (45%), Positives = 64/93 (68%)

Query: 113 EFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQE 172
           + + + +R+G G   V   QP+L+ + +P GLE  C +A  HYPTL+DHFQREL++ L +
Sbjct: 13  DLAGVEERVGVGEELVRLVQPFLDRVELPEGLEDECQRARNHYPTLYDHFQRELKNSLVD 72

Query: 173 LQQKSLVQDWHETESWKLLKELANSAQHRAIVR 205
           LQ++  V++W ET+SWKL K+ + S QHR+  R
Sbjct: 73  LQEQGSVEEWQETDSWKLFKKRSKSGQHRSSAR 105


>gi|168051009|ref|XP_001777949.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670709|gb|EDQ57273.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 791

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 40/75 (53%), Positives = 53/75 (70%)

Query: 140 MPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQELQQKSLVQDWHETESWKLLKELANSAQ 199
           MP GLE +CL+A  HYPT+FDH+QRELR  LQ  Q++  + +W  TESWKLLK+ A S +
Sbjct: 1   MPEGLEPLCLRASLHYPTIFDHYQRELRAALQASQERGKLDNWQNTESWKLLKKFAKSGE 60

Query: 200 HRAIVRRVTQPKPVQ 214
           HRA+ R+V  P  + 
Sbjct: 61  HRAVARKVKDPNQIH 75


>gi|9758800|dbj|BAB09253.1| DNA helicase-like [Arabidopsis thaliana]
          Length = 750

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 29/36 (80%)

Query: 179 VQDWHETESWKLLKELANSAQHRAIVRRVTQPKPVQ 214
           ++ W E+ESWKLLKE+ANSAQHR + R+  Q KPVQ
Sbjct: 1   MESWKESESWKLLKEIANSAQHREVARKAAQAKPVQ 36


>gi|409078635|gb|EKM78998.1| hypothetical protein AGABI1DRAFT_129263 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 781

 Score = 37.4 bits (85), Expect = 3.9,   Method: Composition-based stats.
 Identities = 31/121 (25%), Positives = 51/121 (42%), Gaps = 20/121 (16%)

Query: 85  DLGKGVVRWIC-QGMRAMASDFASAEIQGEFSELRQRMGPGLTFVIEAQ----PYLNAIP 139
           DL + ++ W+C  G R+       A+IQ +F+    R     +   +A     P    + 
Sbjct: 81  DLTREIISWLCDSGRRSNVYLIQEAQIQTDFAHEIWRANAFASICYDAHRFGLPLRGTLD 140

Query: 140 MPVGLEAVCLKAGTHYP--------TLFDHFQRELRDVLQELQQKSLVQDWHETESWKLL 191
            P  +  +C    THYP         LFD+   E+R++ +  Q K L+      E WK L
Sbjct: 141 FPPDMFRICAALATHYPPFRRILTKILFDN--TEIRNMPKRTQFKKLIY-----EPWKSL 193

Query: 192 K 192
           +
Sbjct: 194 Q 194


>gi|393238363|gb|EJD45900.1| hypothetical protein AURDEDRAFT_164960 [Auricularia delicata
           TFB-10046 SS5]
          Length = 410

 Score = 37.0 bits (84), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 8/101 (7%)

Query: 26  KSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNV----QALSQNGNPL 81
           K+K   T  L G   +TK  +  +V   +G K+LE P +S+KA  +    QA++Q G+ L
Sbjct: 240 KAKKPPTYRLRGGWKATKSGLLPLVLARNGLKKLEYPTQSEKAKRIPRMFQAMAQKGDTL 299

Query: 82  GRRDLGKGVVRWICQG----MRAMASDFASAEIQGEFSELR 118
             R  G     ++ +G    MR   SD    +I G   E+R
Sbjct: 300 ASRADGWAFCLFMPKGGAGIMRTWHSDQVQQDIPGLPDEVR 340


>gi|393241780|gb|EJD49300.1| hypothetical protein AURDEDRAFT_161397 [Auricularia delicata
           TFB-10046 SS5]
          Length = 374

 Score = 37.0 bits (84), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 8/101 (7%)

Query: 26  KSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNV----QALSQNGNPL 81
           K+K   T  L G   +TK  +  +V   +G K+LE P +S+KA  +    QA++Q G+ L
Sbjct: 204 KAKKPPTYRLRGGWKATKSGLLPLVLARNGLKKLEYPTQSEKAKRIPRMFQAMAQKGDTL 263

Query: 82  GRRDLGKGVVRWICQG----MRAMASDFASAEIQGEFSELR 118
             R  G     ++ +G    MR   SD    +I G   E+R
Sbjct: 264 ASRADGWAFCLFMPKGGAGIMRTWHSDQVQQDIPGLPDEVR 304


>gi|393229750|gb|EJD37367.1| hypothetical protein AURDEDRAFT_173528 [Auricularia delicata
           TFB-10046 SS5]
          Length = 412

 Score = 36.6 bits (83), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 49/101 (48%), Gaps = 8/101 (7%)

Query: 26  KSKIQKTKTLSGTGSSTKVNVSSVVEKSSGEKQLEQPKKSDKAVNV----QALSQNGNPL 81
           K+K   T  L G   +TK  +  +V   +G K+LE P +S+KA  +    QA++Q G+ L
Sbjct: 240 KAKKPPTYRLRGGWKATKSGLLPLVLARNGLKKLEYPTQSEKAKCIPRMFQAMAQKGDTL 299

Query: 82  GRRDLGKGVVRWICQG----MRAMASDFASAEIQGEFSELR 118
             R  G     ++ +G    MR   SD    +I G   E+R
Sbjct: 300 ASRADGWAFCLFMPKGGAGIMRTWHSDQVQQDIPGLPDEVR 340


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.312    0.125    0.350 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,065,332,121
Number of Sequences: 23463169
Number of extensions: 117715924
Number of successful extensions: 374787
Number of sequences better than 100.0: 157
Number of HSP's better than 100.0 without gapping: 57
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 374096
Number of HSP's gapped (non-prelim): 683
length of query: 214
length of database: 8,064,228,071
effective HSP length: 136
effective length of query: 78
effective length of database: 9,168,204,383
effective search space: 715119941874
effective search space used: 715119941874
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 74 (33.1 bits)