BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029330
         (195 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359490779|ref|XP_002269798.2| PREDICTED: uncharacterized protein LOC100252112 [Vitis vinifera]
 gi|302144012|emb|CBI23117.3| unnamed protein product [Vitis vinifera]
          Length = 397

 Score =  142 bits (359), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 93/222 (41%), Positives = 113/222 (50%), Gaps = 69/222 (31%)

Query: 9   QAPKSSILALSSDSESCKDSSLLREE----------KEFQFQGKEMDEDAALTGRD---- 54
            AP  S+  LSSDSESC D+S +RE+          K  QF  KE D   +  G++    
Sbjct: 100 NAPNHSVWKLSSDSESCPDNSSMREDYIHHEESSVHKTTQFPDKEKDAVLSENGQESLLN 159

Query: 55  ----EKTSVKKVS-----------------------------------------TLTLPL 69
               EK+  K+V                                          +  LPL
Sbjct: 160 EASKEKSPKKRVKVEDHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEPQVSSGLPL 219

Query: 70  VISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPS 129
           V+SEK+  +KA VECEGESIDLSGDMGAVGRI++  T   NH MFLDLKGTIYKTT+VPS
Sbjct: 220 VLSEKVHRSKALVECEGESIDLSGDMGAVGRIVILDTPSRNHEMFLDLKGTIYKTTIVPS 279

Query: 130 RTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           RTFCIVSFG  EAKV         ++ DF   I L  +S +Y
Sbjct: 280 RTFCIVSFGQSEAKV-------EAVMNDF---IQLKPQSNVY 311


>gi|224121130|ref|XP_002318505.1| predicted protein [Populus trichocarpa]
 gi|222859178|gb|EEE96725.1| predicted protein [Populus trichocarpa]
          Length = 187

 Score =  136 bits (342), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 66/108 (61%), Positives = 81/108 (75%), Gaps = 10/108 (9%)

Query: 64  TLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYK 123
           T  LPLV+SEK+Q +KA VECEGESIDLSGDMGAVGR+++P T  GN  M+LDLKGTIY+
Sbjct: 1   TSRLPLVLSEKVQRSKALVECEGESIDLSGDMGAVGRVVIPDTPSGNSEMYLDLKGTIYR 60

Query: 124 TTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           TT+VPSRTFC+VSFG  EAK+         I+ DF   I L ++S +Y
Sbjct: 61  TTIVPSRTFCVVSFGQSEAKI-------EAIMNDF---IQLKTQSNVY 98


>gi|357464307|ref|XP_003602435.1| DNA-binding protein BIN4 [Medicago truncatula]
 gi|355491483|gb|AES72686.1| DNA-binding protein BIN4 [Medicago truncatula]
          Length = 416

 Score =  122 bits (305), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 68/136 (50%), Positives = 90/136 (66%), Gaps = 16/136 (11%)

Query: 40  QGKEMDEDAALTGRDEKTSVK----KVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDM 95
           +GK  D D  +  +DE+T  K     VS+  LPL++SEK+  TKA VEC+G+SIDLSGDM
Sbjct: 207 KGKGGDGDVEV--KDEETFEKPVEPNVSSSILPLMLSEKVHRTKALVECQGDSIDLSGDM 264

Query: 96  GAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLIL 155
           GAVGR+++  +  G+  M LDLKGTIYKT+++PSRTFC+VSFG  EAK+         I+
Sbjct: 265 GAVGRVIISDSPSGDQEMCLDLKGTIYKTSIIPSRTFCVVSFGQTEAKI-------EAIM 317

Query: 156 VDFPILILLLSKSVIY 171
            DF   I L   S +Y
Sbjct: 318 NDF---IQLNPPSDVY 330


>gi|356507532|ref|XP_003522518.1| PREDICTED: uncharacterized protein LOC100795660 [Glycine max]
          Length = 380

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 63/120 (52%), Positives = 81/120 (67%), Gaps = 16/120 (13%)

Query: 58  SVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDL 117
           +V+ VS+  LPL++SEK+Q TKA +EC+G+SIDLSGDMGAVGRI++  +  G+  M LDL
Sbjct: 185 NVRMVSSARLPLMLSEKVQRTKALIECQGDSIDLSGDMGAVGRIIISDSPSGDQEMCLDL 244

Query: 118 K------GTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           K      GTIYKT++VP RTFC+VSFG  EAKV         I+ DF   I L   S +Y
Sbjct: 245 KDFAVISGTIYKTSIVPCRTFCVVSFGQSEAKV-------EAIMNDF---IQLKPHSNVY 294


>gi|240256338|ref|NP_197851.5| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005954|gb|AED93337.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 451

 Score =  116 bits (291), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 81/118 (68%), Gaps = 13/118 (11%)

Query: 54  DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVM 113
           D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAVGR++V  T  G+  M
Sbjct: 259 DQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT-GD--M 315

Query: 114 FLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           +LDLKGTIYK+T++PSRTFC+V+ G  EAK+         I+ DF   I L+ +S +Y
Sbjct: 316 YLDLKGTIYKSTIIPSRTFCVVNVGQTEAKI-------EAIMNDF---IQLIPQSNVY 363


>gi|238481371|ref|NP_001154738.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005958|gb|AED93341.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 446

 Score =  116 bits (291), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 81/118 (68%), Gaps = 13/118 (11%)

Query: 54  DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVM 113
           D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAVGR++V  T  G+  M
Sbjct: 254 DQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT-GD--M 310

Query: 114 FLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           +LDLKGTIYK+T++PSRTFC+V+ G  EAK+         I+ DF   I L+ +S +Y
Sbjct: 311 YLDLKGTIYKSTIIPSRTFCVVNVGQTEAKI-------EAIMNDF---IQLIPQSNVY 358


>gi|238481373|ref|NP_001154739.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|257050999|sp|Q9FLU1.2|BIN4_ARATH RecName: Full=DNA-binding protein BIN4; AltName: Full=Protein
           BRASSINOSTEROID INSENSITIVE 4; AltName: Full=Protein
           MIDGET
 gi|332005959|gb|AED93342.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 454

 Score =  116 bits (291), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 81/118 (68%), Gaps = 13/118 (11%)

Query: 54  DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVM 113
           D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAVGR++V  T  G+  M
Sbjct: 262 DQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT-GD--M 318

Query: 114 FLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           +LDLKGTIYK+T++PSRTFC+V+ G  EAK+         I+ DF   I L+ +S +Y
Sbjct: 319 YLDLKGTIYKSTIIPSRTFCVVNVGQTEAKI-------EAIMNDF---IQLIPQSNVY 366


>gi|238481367|ref|NP_001154736.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|238481369|ref|NP_001154737.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005956|gb|AED93339.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005957|gb|AED93340.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 450

 Score =  116 bits (291), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 81/118 (68%), Gaps = 13/118 (11%)

Query: 54  DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVM 113
           D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAVGR++V  T  G+  M
Sbjct: 258 DQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT-GD--M 314

Query: 114 FLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           +LDLKGTIYK+T++PSRTFC+V+ G  EAK+         I+ DF   I L+ +S +Y
Sbjct: 315 YLDLKGTIYKSTIIPSRTFCVVNVGQTEAKI-------EAIMNDF---IQLIPQSNVY 362


>gi|45736025|dbj|BAD13052.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736067|dbj|BAD13093.1| unknown protein [Oryza sativa Japonica Group]
          Length = 320

 Score =  116 bits (290), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 49/78 (62%), Positives = 65/78 (83%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           LPL+I +K+Q +KA +EC+G+SIDLSGD+GAVGRI++  +  GN  + LDLKGTIYK+T+
Sbjct: 140 LPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTI 199

Query: 127 VPSRTFCIVSFGHPEAKV 144
           VPSRTFC+VS G  EAK+
Sbjct: 200 VPSRTFCVVSVGQTEAKI 217


>gi|238481365|ref|NP_001154735.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005955|gb|AED93338.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 432

 Score =  116 bits (290), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 62/118 (52%), Positives = 81/118 (68%), Gaps = 13/118 (11%)

Query: 54  DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVM 113
           D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAVGR++V  T  G+  M
Sbjct: 240 DQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT-GD--M 296

Query: 114 FLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSKSVIY 171
           +LDLKGTIYK+T++PSRTFC+V+ G  EAK+         I+ DF   I L+ +S +Y
Sbjct: 297 YLDLKGTIYKSTIIPSRTFCVVNVGQTEAKI-------EAIMNDF---IQLIPQSNVY 344


>gi|115444213|ref|NP_001045886.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|113535417|dbj|BAF07800.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|215765268|dbj|BAG86965.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222622166|gb|EEE56298.1| hypothetical protein OsJ_05371 [Oryza sativa Japonica Group]
          Length = 342

 Score =  116 bits (290), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 49/78 (62%), Positives = 65/78 (83%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           LPL+I +K+Q +KA +EC+G+SIDLSGD+GAVGRI++  +  GN  + LDLKGTIYK+T+
Sbjct: 162 LPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTI 221

Query: 127 VPSRTFCIVSFGHPEAKV 144
           VPSRTFC+VS G  EAK+
Sbjct: 222 VPSRTFCVVSVGQTEAKI 239


>gi|218190056|gb|EEC72483.1| hypothetical protein OsI_05847 [Oryza sativa Indica Group]
          Length = 342

 Score =  116 bits (290), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 49/78 (62%), Positives = 65/78 (83%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           LPL+I +K+Q +KA +EC+G+SIDLSGD+GAVGRI++  +  GN  + LDLKGTIYK+T+
Sbjct: 162 LPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTI 221

Query: 127 VPSRTFCIVSFGHPEAKV 144
           VPSRTFC+VS G  EAK+
Sbjct: 222 VPSRTFCVVSVGQTEAKI 239


>gi|45736061|dbj|BAD13087.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 189

 Score =  115 bits (288), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 50/78 (64%), Positives = 65/78 (83%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           LPL+I +K+Q +KA +EC+G+SIDLSGD+GAVGRI++  +  GN  + LDLKGTIYK+T+
Sbjct: 104 LPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTI 163

Query: 127 VPSRTFCIVSFGHPEAKV 144
           VPSRTFC+VS G  EAKV
Sbjct: 164 VPSRTFCVVSVGQTEAKV 181


>gi|357138685|ref|XP_003570920.1| PREDICTED: uncharacterized protein LOC100821932 [Brachypodium
           distachyon]
          Length = 358

 Score =  114 bits (285), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 52/92 (56%), Positives = 69/92 (75%), Gaps = 7/92 (7%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           LPL+I++K+Q +KA VEC+G+SIDLSGD+GAVGRI++     GN  + LDLKGT+YK+T+
Sbjct: 178 LPLMITDKVQRSKALVECDGDSIDLSGDIGAVGRIVISNGPTGNQDLLLDLKGTVYKSTI 237

Query: 127 VPSRTFCIVSFGHPEAKVCCKSSPFSLILVDF 158
           VPSRTFC+VS G  EAK+         I+ DF
Sbjct: 238 VPSRTFCVVSVGQTEAKI-------EAIMNDF 262


>gi|218189675|gb|EEC72102.1| hypothetical protein OsI_05068 [Oryza sativa Indica Group]
          Length = 473

 Score =  113 bits (283), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 49/78 (62%), Positives = 65/78 (83%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           LPL+I +K+Q +KA +EC+G+SIDLSGD+GAVGRI++  +  GN  + LDLKGTIYK+T+
Sbjct: 293 LPLIIPDKVQRSKALIECDGDSIDLSGDVGAVGRIVISNSPNGNQDLLLDLKGTIYKSTI 352

Query: 127 VPSRTFCIVSFGHPEAKV 144
           VPSRTFC+VS G  EAK+
Sbjct: 353 VPSRTFCVVSVGQTEAKI 370


>gi|449459184|ref|XP_004147326.1| PREDICTED: uncharacterized protein LOC101208657 [Cucumis sativus]
          Length = 391

 Score =  112 bits (279), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 58/106 (54%), Positives = 73/106 (68%), Gaps = 8/106 (7%)

Query: 62  VSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTI 121
           VS+  LPLV+S+K    KA VECEG SIDLSGDMGAVGR++V  ++   + + LDLKGT+
Sbjct: 207 VSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVVVSDSSSAKNELCLDLKGTL 266

Query: 122 YKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDFPILILLLSK 167
           Y+  +VPSRTFCIVSFG  EAK+         I+ DF I +  LSK
Sbjct: 267 YRAVIVPSRTFCIVSFGQSEAKI-------ESIMNDF-IQLKALSK 304


>gi|413935594|gb|AFW70145.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 354

 Score =  109 bits (273), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 51/92 (55%), Positives = 65/92 (70%), Gaps = 7/92 (7%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           LPL+  +K+Q +KA +EC+G+SIDLSGD+GAVGRI+V         + LDLKGTIYKTT+
Sbjct: 183 LPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGPTAKQDLLLDLKGTIYKTTI 242

Query: 127 VPSRTFCIVSFGHPEAKVCCKSSPFSLILVDF 158
           VPSRTFC+VS G  EAK+         I+ DF
Sbjct: 243 VPSRTFCVVSVGQSEAKI-------EAIMNDF 267


>gi|195625200|gb|ACG34430.1| hypothetical protein [Zea mays]
 gi|413935593|gb|AFW70144.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 364

 Score =  109 bits (272), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 68/175 (38%), Positives = 94/175 (53%), Gaps = 35/175 (20%)

Query: 11  PKSSILALSSDSESCKDSSLLREEKEFQFQG---------------KEMDEDAALTGRDE 55
           PK  ++ LSS S++   +SL R  ++   +G               K+  +    TG D+
Sbjct: 101 PKKDMVTLSSGSDASPGNSLSRAHEDNHEEGSLSTAKRKNAQQTKTKKTKDAGTKTGADQ 160

Query: 56  ------------KTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILV 103
                       K +   VS   LPL+  +K+Q +KA +EC+G+SIDLSGD+GAVGRI+V
Sbjct: 161 AGNGDAEDDVQDKLTGNSVSQR-LPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVV 219

Query: 104 PGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVDF 158
                    + LDLKGTIYKTT+VPSRTFC+VS G  EAK+         I+ DF
Sbjct: 220 SNGPTAKQDLLLDLKGTIYKTTIVPSRTFCVVSVGQSEAKIEA-------IMNDF 267


>gi|297812657|ref|XP_002874212.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297320049|gb|EFH50471.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 492

 Score =  108 bits (270), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/149 (45%), Positives = 90/149 (60%), Gaps = 24/149 (16%)

Query: 34  EKEFQFQGKEMDEDAALTGR-DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLS 92
           +K F+ + K+ D   A  G  D+K      S+  LPLV+SEK+  TK  VECEG+SIDLS
Sbjct: 269 QKIFKSEDKDTDTTIAEEGTMDQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLS 328

Query: 93  GDMGAVGRILVPGTAEGNHVMFLDLK----------GTIYKTTLVPSRTFCIVSFGHPEA 142
           GDMGAVGR++V  T  G+  M+LDLK          GTIYK+T++PSRTFC+V+ G  EA
Sbjct: 329 GDMGAVGRVVVSDTT-GD--MYLDLKGINTFSFHVSGTIYKSTIIPSRTFCVVNVGQTEA 385

Query: 143 KVCCKSSPFSLILVDFPILILLLSKSVIY 171
           K+         I+ DF   I L  +S +Y
Sbjct: 386 KIEA-------IMNDF---IQLTPQSNVY 404


>gi|297598587|ref|NP_001045881.2| Os02g0147100 [Oryza sativa Japonica Group]
 gi|255670601|dbj|BAF07795.2| Os02g0147100, partial [Oryza sativa Japonica Group]
          Length = 126

 Score = 99.0 bits (245), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 42/65 (64%), Positives = 54/65 (83%)

Query: 80  AFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGH 139
           A +EC+G+SIDLSGD+GAVGRI++  +  GN  + LDLKGTIYK+T+VPSRTFC+VS G 
Sbjct: 1   ALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQ 60

Query: 140 PEAKV 144
            EAK+
Sbjct: 61  TEAKI 65


>gi|168010889|ref|XP_001758136.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690592|gb|EDQ76958.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 373

 Score = 96.3 bits (238), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 53/120 (44%), Positives = 77/120 (64%), Gaps = 5/120 (4%)

Query: 40  QGKEMDEDAALTGRDEKTSVKKVSTL--TLPLVISEKLQLTKAFVECEGESIDLSGDMGA 97
           Q +E+DE  AL  ++++  +K  ++   +LP+V  EK+  TK  +ECEG+++DLSGDMGA
Sbjct: 147 QTQEVDEGDALETQEQERRIKPRASAASSLPIVFGEKVNKTKVLLECEGDALDLSGDMGA 206

Query: 98  VGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILVD 157
           VGR  V      ++ + LDLKG IYKTT+VPS TF +V+ G  +AKV    S F  +  D
Sbjct: 207 VGRFTV---NRRDNELLLDLKGVIYKTTIVPSNTFFLVNVGQTDAKVESIMSDFVQLRAD 263


>gi|168037576|ref|XP_001771279.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677368|gb|EDQ63839.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 379

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 52/122 (42%), Positives = 75/122 (61%), Gaps = 12/122 (9%)

Query: 39  FQGKEMDEDAALTGRDEKTSVKK--VSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMG 96
           F+ +E DE  AL  ++++   K   ++  +LP+V  EK+  TK  +ECEG+++DLSGDMG
Sbjct: 154 FRAQETDEGDALEVQEQERHSKPRVIAASSLPIVFGEKVNRTKVLLECEGDALDLSGDMG 213

Query: 97  AVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLILV 156
           AVGR  V      +  + LDLKG +YKTT+VPS T+ +V+ G  EAKV         I+ 
Sbjct: 214 AVGRFTV---NRPDDELLLDLKGVVYKTTIVPSNTYFVVNVGQMEAKVES-------IMT 263

Query: 157 DF 158
           DF
Sbjct: 264 DF 265


>gi|255544304|ref|XP_002513214.1| conserved hypothetical protein [Ricinus communis]
 gi|223547712|gb|EEF49205.1| conserved hypothetical protein [Ricinus communis]
          Length = 155

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/77 (53%), Positives = 54/77 (70%), Gaps = 10/77 (12%)

Query: 95  MGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKVCCKSSPFSLI 154
           MGAVGR+++P  + GNH M+LDLKGTIY+T +VPSRTFC+VSFG  EAK+         I
Sbjct: 1   MGAVGRVVIPDGSSGNHEMYLDLKGTIYRTKIVPSRTFCVVSFGQSEAKI-------EAI 53

Query: 155 LVDFPILILLLSKSVIY 171
           + DF   I L ++S +Y
Sbjct: 54  MNDF---IQLKAQSNVY 67


>gi|410129774|dbj|BAM64852.1| hypothetical protein [Beta vulgaris]
          Length = 467

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 60/144 (41%), Positives = 75/144 (52%), Gaps = 35/144 (24%)

Query: 10  APKSSILALSSDSESCKDSSLLR-EEK-----------------------EFQFQGKEMD 45
           AP  SI  LSSDSE+  D+++ + EEK                       E     K  +
Sbjct: 242 APNHSICELSSDSETKSDATIRKIEEKIDNEESAPRSAVKRKSPKKRVKQEDDATEKRTN 301

Query: 46  EDAALTG-----------RDEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGD 94
           E A + G           + EK S   VS+ T+PLV+ EK+Q TKA VECEGESIDLSGD
Sbjct: 302 EPAKIEGEVENMDVAEEVKPEKQSEPAVSSSTMPLVLPEKVQRTKALVECEGESIDLSGD 361

Query: 95  MGAVGRILVPGTAEGNHVMFLDLK 118
           MGAVGR+++  T      MFLDLK
Sbjct: 362 MGAVGRVIISETPSKESEMFLDLK 385


>gi|449532507|ref|XP_004173222.1| PREDICTED: DNA-binding protein BIN4-like, partial [Cucumis sativus]
          Length = 198

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/133 (38%), Positives = 72/133 (54%), Gaps = 23/133 (17%)

Query: 9   QAPKSSILALSSDSESCKDSSLLREEKEF----------QFQGKEMDEDAALTGRDEKTS 58
           +  K S+  LS DSESC D++ ++E+  +          + QG+  DE+A   G      
Sbjct: 66  ENSKHSVWMLSLDSESCSDNNFIKEDYSYHEELAELATSEIQGRRKDENAGSEGHVRNGD 125

Query: 59  VK-------------KVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPG 105
           V+              VS+  LPLV+S+K    KA VECEG SIDLSGDMGAVGR++V  
Sbjct: 126 VEILEKDALDDCIGPPVSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVVVSD 185

Query: 106 TAEGNHVMFLDLK 118
           ++   + + LDLK
Sbjct: 186 SSSAKNELCLDLK 198


>gi|224132538|ref|XP_002321341.1| predicted protein [Populus trichocarpa]
 gi|222868337|gb|EEF05468.1| predicted protein [Populus trichocarpa]
          Length = 159

 Score = 83.2 bits (204), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 40/60 (66%), Positives = 48/60 (80%)

Query: 62  VSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTI 121
           VS   LPLV+SEK+Q +KA VECEGESIDLSG+MGAVGR ++P T  G+  M LDLKGT+
Sbjct: 39  VSISRLPLVLSEKVQRSKALVECEGESIDLSGEMGAVGRPVIPDTPSGDSEMHLDLKGTL 98


>gi|45736024|dbj|BAD13051.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736066|dbj|BAD13092.1| unknown protein [Oryza sativa Japonica Group]
          Length = 193

 Score = 78.6 bits (192), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 33/52 (63%), Positives = 42/52 (80%)

Query: 93  GDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKV 144
           GD+GAVGRI++  +  GN  + LDLKGTIYK+T+VPSRTFC+VS G  EAK+
Sbjct: 39  GDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKI 90


>gi|10177863|dbj|BAB11215.1| unnamed protein product [Arabidopsis thaliana]
          Length = 531

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/65 (58%), Positives = 47/65 (72%), Gaps = 3/65 (4%)

Query: 54  DEKTSVKKVSTLTLPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVM 113
           D+K      S+  LPLV+SEK+  TK  VECEG+SIDLSGDMGAVGR++V  T  G+  M
Sbjct: 322 DQKIKPSSGSSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT-GD--M 378

Query: 114 FLDLK 118
           +LDLK
Sbjct: 379 YLDLK 383


>gi|384247484|gb|EIE20971.1| hypothetical protein COCSUDRAFT_83530 [Coccomyxa subellipsoidea
           C-169]
          Length = 378

 Score = 66.6 bits (161), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 41/98 (41%), Positives = 59/98 (60%), Gaps = 10/98 (10%)

Query: 56  KTSVKKVSTLT-----LPLVISEKLQLTKAFVECEG---ESIDLSGDMGAVGRILVPGTA 107
           K+ V K++ L      +P+++ EKL +TK  VE EG    S+DL+GD G VGR LV G  
Sbjct: 180 KSPVSKIAVLAPVAGEIPVMMPEKLPVTKFIVELEGGNDSSVDLTGDAGVVGRWLVEGD- 238

Query: 108 EGNHVMFLDLKGTIYKTTLVP-SRTFCIVSFGHPEAKV 144
           + N  + +DLKG +Y  T VP + T C+V+    EA+V
Sbjct: 239 DSNPELRMDLKGVVYAATTVPCATTLCVVNLSATEARV 276


>gi|307107138|gb|EFN55382.1| hypothetical protein CHLNCDRAFT_52595 [Chlorella variabilis]
          Length = 402

 Score = 63.2 bits (152), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 52/96 (54%), Gaps = 8/96 (8%)

Query: 58  SVKKVSTLTLPLVISEKLQLTKAFVECEGES------IDLSGDMGAVGRILVPGTAEGNH 111
            +    T  +P+V+ EKL   K  VE E          DLSGD GA+GR+++ GT E   
Sbjct: 172 GLPHAPTSQMPVVLPEKLPQMKMLVELESNPDDAHHVTDLSGDSGAIGRLVIAGTKEAPE 231

Query: 112 VMFLDLKGTIYKTTLVPSR-TFCIVSFGHPEAKVCC 146
            M +DLKG +Y  T+VP   +  +++ G  EAKV C
Sbjct: 232 -MQIDLKGVLYNATIVPCPVSIAVINMGQTEAKVEC 266


>gi|147811095|emb|CAN76881.1| hypothetical protein VITISV_037245 [Vitis vinifera]
          Length = 693

 Score = 60.5 bits (145), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 76/197 (38%), Gaps = 81/197 (41%)

Query: 9   QAPKSSILALSSDSESCKDSSLLREE----------KEFQFQGKEMD------------- 45
            AP  S+  LSSDSESC D+S +RE+          K  QF  KE D             
Sbjct: 130 NAPNHSVWXLSSDSESCPDNSSMREDYIHHEESSVHKTTQFPDKEKDAVLSENGQESLLN 189

Query: 46  --------------------------EDAALTGRDEKTSV-----------KKVSTLTLP 68
                                      D    G D+ T +            +VS+  LP
Sbjct: 190 EASKEKSPKKRVKVEDHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEPQVSSSGLP 249

Query: 69  LVISEKLQLTKAF---------------VECEGESIDLSGDMGAVGRILVPGTAEG---- 109
           LV+SEK+  +K F               +  +GESIDLSGDMGAVGRI+  G +E     
Sbjct: 250 LVLSEKVHRSKGFRFSIEKAASLEIRPLLSVKGESIDLSGDMGAVGRIVSFGQSEAKVEA 309

Query: 110 --NHVMFLDLKGTIYKT 124
             N  + L  +  +Y+ 
Sbjct: 310 VMNDFIQLKPQSNVYEA 326


>gi|303275097|ref|XP_003056848.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
 gi|226461200|gb|EEH58493.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
          Length = 392

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 50/83 (60%), Gaps = 3/83 (3%)

Query: 67  LPLVISEKLQLTKAFVECEG--ESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKT 124
           +PL++   +   K F ECEG  ES+DL GD+G VGR+L   +A+    + +DLKG IY  
Sbjct: 195 IPLLMPATVNRNKIFFECEGAGESVDLEGDVGVVGRLLSDPSAKYTSGVKVDLKGVIYDA 254

Query: 125 TLVPSR-TFCIVSFGHPEAKVCC 146
            ++P+  +  +++    EAKV C
Sbjct: 255 KILPTPASIVVLAVNQTEAKVEC 277


>gi|320166916|gb|EFW43815.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 236

 Score = 54.7 bits (130), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 48/78 (61%), Gaps = 3/78 (3%)

Query: 67  LPLVISEKLQLTKAFVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTL 126
           +PLV+ +    T   +E     ++LSGD G+VGR++V    +    + LDLKG I++ T+
Sbjct: 53  IPLVLGKPA--TAFLMETTEPQLNLSGDTGSVGRLVV-NAQDAAAPLVLDLKGLIFQGTV 109

Query: 127 VPSRTFCIVSFGHPEAKV 144
           VPS TF +V+ G  EA++
Sbjct: 110 VPSNTFMLVTIGPNEARI 127


>gi|299471419|emb|CBN79372.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 240

 Score = 51.6 bits (122), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 37/64 (57%), Gaps = 1/64 (1%)

Query: 81  FVECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHP 140
            V    E+++ +GD GA+GRI V G  +G  + F DLKG  +   ++PS T  +V  G  
Sbjct: 76  LVHLRDEALNFAGDTGAIGRISVGGKKKGRGLTF-DLKGQQFSGQIIPSATVMVVGIGKT 134

Query: 141 EAKV 144
           EAKV
Sbjct: 135 EAKV 138


>gi|255084389|ref|XP_002508769.1| predicted protein [Micromonas sp. RCC299]
 gi|226524046|gb|ACO70027.1| predicted protein [Micromonas sp. RCC299]
          Length = 376

 Score = 50.4 bits (119), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 46/81 (56%), Gaps = 3/81 (3%)

Query: 67  LPLVISEKLQLTKAFVECEG--ESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKT 124
           +PL +   +   K F E EG  E++DL GD+G VGR+L   + +    + +D+KG IY  
Sbjct: 178 VPLFMPSAVNRNKVFFELEGTGEAVDLEGDVGVVGRLLSNASEKHGSGLQMDMKGVIYNA 237

Query: 125 TLVPS-RTFCIVSFGHPEAKV 144
            ++P+  +  I++    EAKV
Sbjct: 238 RILPTPASVVILAVNANEAKV 258


>gi|428174918|gb|EKX43811.1| hypothetical protein GUITHDRAFT_140250 [Guillardia theta CCMP2712]
          Length = 309

 Score = 48.1 bits (113), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 56/114 (49%), Gaps = 13/114 (11%)

Query: 39  FQGKEMDEDAALTGRDEKTSVKKVST------LTLPLVISEKLQLTKAF--VECEGESID 90
            Q K + E+A     D KT   K+ T       ++P+++    ++ K    V+CE +++D
Sbjct: 114 LQNKIVAEEADEKTSDVKTGTAKIETSKIRFSKSVPVLVPNIKRVCKNMFVVQCEDQNLD 173

Query: 91  LSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKV 144
             GD G +G I       G   M LDLKG +Y+ +++P  T  ++  G  E K+
Sbjct: 174 FEGDSGVIGTIKT-----GERSMELDLKGQLYEGSILPCCTMMVMDMGATEVKI 222


>gi|308804497|ref|XP_003079561.1| unnamed protein product [Ostreococcus tauri]
 gi|116058016|emb|CAL54219.1| unnamed protein product [Ostreococcus tauri]
          Length = 229

 Score = 45.4 bits (106), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 21/47 (44%), Positives = 29/47 (61%), Gaps = 4/47 (8%)

Query: 82  VECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVP 128
           +E  GE++DL GD GAVGR L    AE    + +D+KG +Y   +VP
Sbjct: 85  IEGNGEAVDLDGDTGAVGRWL----AESARSLKVDMKGLMYNARVVP 127


>gi|412989095|emb|CCO15686.1| hypothetical protein VOLCADRAFT_101408 [Bathycoccus prasinos]
          Length = 422

 Score = 42.7 bits (99), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 54/101 (53%), Gaps = 24/101 (23%)

Query: 67  LPLVIS----EKLQLTKAFVECEG--ESIDLSGDMGAVGRILV-----PGTAEGNHV--- 112
           +PL+I+    +K   +    EC+G  E++DLSGD+G VGR+LV     P    G+     
Sbjct: 242 MPLLINHGALQKTANSTILFECKGSGEAVDLSGDVGVVGRLLVERESQPTGGGGSKASTS 301

Query: 113 -------MFLDLKGTIY--KTTLVPSRTFCIVSFGHPEAKV 144
                  + +DLKG +Y  +T L+ S T  IV+ G  +AK+
Sbjct: 302 KQQPHASVKVDLKGVMYLARTHLL-SSTVAIVNVGAEDAKI 341


>gi|145347277|ref|XP_001418100.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578328|gb|ABO96393.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 999

 Score = 42.4 bits (98), Expect = 0.10,   Method: Composition-based stats.
 Identities = 21/48 (43%), Positives = 30/48 (62%), Gaps = 4/48 (8%)

Query: 82  VECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPS 129
           +E  GE++DL GD GAVGR L    AE +  + +D+KG +Y   +V S
Sbjct: 55  IEGAGEAVDLDGDTGAVGRWL----AESSRALKVDMKGVMYNARVVSS 98


>gi|255544302|ref|XP_002513213.1| conserved hypothetical protein [Ricinus communis]
 gi|223547711|gb|EEF49204.1| conserved hypothetical protein [Ricinus communis]
          Length = 137

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 33/94 (35%), Positives = 47/94 (50%), Gaps = 4/94 (4%)

Query: 5   FIVEQAPKSSILALSSDSESCKDSSLLREEKEFQFQGKEMDEDAALTGRDEKTSVKKVST 64
            +V+QA K SI ALSSDSESC D S  R+ K  Q +    ++DA L     + ++K  S 
Sbjct: 12  LVVKQAAKRSIWALSSDSESCPDDSSQRKTKSNQPKESRKEKDAILVDNSVEKALKGKSP 71

Query: 65  LTLPLVISEKLQLTKAFVECEGESIDLSGDMGAV 98
             L  V   KL+      E + + +  +GD   V
Sbjct: 72  KKLLKVEGRKLKKK----ENKNDDVPKTGDNDDV 101


>gi|387199325|gb|AFJ68895.1| hypothetical protein NGATSA_2017510 [Nannochloropsis gaditana
           CCMP526]
          Length = 226

 Score = 39.7 bits (91), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 87  ESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKV 144
           ++ DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAKV
Sbjct: 25  KAQDLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKV 78


>gi|422294628|gb|EKU21928.1| hypothetical protein NGA_2017520, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 238

 Score = 39.3 bits (90), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 25/58 (43%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 87  ESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVPSRTFCIVSFGHPEAKV 144
           ++ DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAKV
Sbjct: 37  KAQDLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKV 90


>gi|302857504|ref|XP_002959890.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
 gi|300254022|gb|EFJ39045.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
          Length = 340

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 34/118 (28%), Positives = 50/118 (42%), Gaps = 35/118 (29%)

Query: 62  VSTLTLPLVISEKLQLTKAFVE-----CEGE----SIDLSGDMGAVGRILVPGTAEGNH- 111
           V    LP+++ EKL   K  +E       GE    + DLSGD GAVGR++V      N  
Sbjct: 38  VEAGKLPIMLPEKLNRNKVLLELPPAAMSGEHYHGAADLSGDSGAVGRLVVGKAPAANSK 97

Query: 112 ------------------------VMFLDLKGTIYKTTLVP-SRTFCIVSFGHPEAKV 144
                                    + +DLKG +Y   ++P + T  +++ G  EAKV
Sbjct: 98  GTAAAAAAAAAGGSEGGGSGGDSVQLQVDLKGLLYNAWVLPLAGTAMVLNIGPSEAKV 155


>gi|307109224|gb|EFN57462.1| hypothetical protein CHLNCDRAFT_142962 [Chlorella variabilis]
          Length = 642

 Score = 37.0 bits (84), Expect = 4.2,   Method: Composition-based stats.
 Identities = 17/46 (36%), Positives = 25/46 (54%)

Query: 83  ECEGESIDLSGDMGAVGRILVPGTAEGNHVMFLDLKGTIYKTTLVP 128
           E  G  ++L GD G +GR+ V   +     + LDL G +Y+  LVP
Sbjct: 443 EVPGHLLNLEGDSGTIGRVAVERGSGKEAALQLDLMGIVYQAELVP 488


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.373 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,639,213,718
Number of Sequences: 23463169
Number of extensions: 99551547
Number of successful extensions: 245714
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 35
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 245637
Number of HSP's gapped (non-prelim): 66
length of query: 195
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 60
effective length of database: 9,191,667,552
effective search space: 551500053120
effective search space used: 551500053120
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)