BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 026010
         (245 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359490779|ref|XP_002269798.2| PREDICTED: uncharacterized protein LOC100252112 [Vitis vinifera]
 gi|302144012|emb|CBI23117.3| unnamed protein product [Vitis vinifera]
          Length = 397

 Score =  263 bits (672), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 143/232 (61%), Positives = 176/232 (75%), Gaps = 18/232 (7%)

Query: 2   DEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNND-----HEDA--- 53
           ++DA L+   +++ + + SKEKSPKK +K     +D I++ K+K N+D     H+D    
Sbjct: 144 EKDAVLSENGQESLLNEASKEKSPKKRVKV----EDHISSKKKKANSDMKGKGHDDGTEL 199

Query: 54  -EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
            EE+ ++K +   VS S LPLV+SEK+ R+KALVECEGES+DLSGDMGAVGRI++  T  
Sbjct: 200 LEEEASDKHTEPQVS-SGLPLVLSEKVHRSKALVECEGESIDLSGDMGAVGRIVILDTPS 258

Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 172
            NHEMFLDLKGTIYKTT+VPSRTFCIVSFG SEAK+EA+MNDFIQLKPQSNVYEAETMVE
Sbjct: 259 RNHEMFLDLKGTIYKTTIVPSRTFCIVSFGQSEAKVEAVMNDFIQLKPQSNVYEAETMVE 318

Query: 173 GTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRK 220
           G L+GF FDS+DE+DK+ KA   Q DQN+G EEQTN KTK K    SG  RK
Sbjct: 319 GILEGFSFDSDDEADKMPKASTRQTDQNEGDEEQTNGKTKGKAEKASGVVRK 370


>gi|224121130|ref|XP_002318505.1| predicted protein [Populus trichocarpa]
 gi|222859178|gb|EEE96725.1| predicted protein [Populus trichocarpa]
          Length = 187

 Score =  243 bits (620), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 118/159 (74%), Positives = 135/159 (84%), Gaps = 5/159 (3%)

Query: 68  TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
           TS LPLV+SEK+QR+KALVECEGES+DLSGDMGAVGR+++P T  GN EM+LDLKGTIY+
Sbjct: 1   TSRLPLVLSEKVQRSKALVECEGESIDLSGDMGAVGRVVIPDTPSGNSEMYLDLKGTIYR 60

Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE-GTLDGFLFDSEDES 186
           TT+VPSRTFC+VSFG SEAKIEAIMNDFIQLK QSNVYEAETMVE GTL+GF FDSEDE+
Sbjct: 61  TTIVPSRTFCVVSFGQSEAKIEAIMNDFIQLKTQSNVYEAETMVEVGTLEGFSFDSEDET 120

Query: 187 DKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRKE 221
           DK+ KA   Q DQN+G+EE  N KTK K    SG  RK+
Sbjct: 121 DKITKATALQTDQNEGVEEPANGKTKRKPVKSSGVARKK 159


>gi|356507532|ref|XP_003522518.1| PREDICTED: uncharacterized protein LOC100795660 [Glycine max]
          Length = 380

 Score =  224 bits (571), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 115/194 (59%), Positives = 147/194 (75%), Gaps = 10/194 (5%)

Query: 31  SPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNV---STSTLPLVMSEKLQRTKALVE 87
           +P K K++  + K KG  D +  EE+  EK +  NV   S++ LPL++SEK+QRTKAL+E
Sbjct: 151 TPVKGKEIKASAKGKGIGDLKVEEEETCEKPAEPNVRMVSSARLPLMLSEKVQRTKALIE 210

Query: 88  CEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK------GTIYKTTLVPSRTFCIVSF 141
           C+G+S+DLSGDMGAVGRI++  +  G+ EM LDLK      GTIYKT++VP RTFC+VSF
Sbjct: 211 CQGDSIDLSGDMGAVGRIIISDSPSGDQEMCLDLKDFAVISGTIYKTSIVPCRTFCVVSF 270

Query: 142 GHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 201
           G SEAK+EAIMNDFIQLKP SNVYEAETMVEGTLDGF FDS++ES K+ K+  +Q DQN+
Sbjct: 271 GQSEAKVEAIMNDFIQLKPHSNVYEAETMVEGTLDGFFFDSDEESGKIQKST-NQTDQNE 329

Query: 202 GIEEQTNEKTKVKS 215
             EEQ N  +K K+
Sbjct: 330 NAEEQANGNSKGKA 343


>gi|357464307|ref|XP_003602435.1| DNA-binding protein BIN4 [Medicago truncatula]
 gi|355491483|gb|AES72686.1| DNA-binding protein BIN4 [Medicago truncatula]
          Length = 416

 Score =  224 bits (570), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 111/183 (60%), Positives = 140/183 (76%), Gaps = 1/183 (0%)

Query: 33  KKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES 92
           K +K  +N   + G+ D E  +E+  EK    NVS+S LPL++SEK+ RTKALVEC+G+S
Sbjct: 198 KGKKTKVNAKGKGGDGDVEVKDEETFEKPVEPNVSSSILPLMLSEKVHRTKALVECQGDS 257

Query: 93  VDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIM 152
           +DLSGDMGAVGR+++  +  G+ EM LDLKGTIYKT+++PSRTFC+VSFG +EAKIEAIM
Sbjct: 258 IDLSGDMGAVGRVIISDSPSGDQEMCLDLKGTIYKTSIIPSRTFCVVSFGQTEAKIEAIM 317

Query: 153 NDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTK 212
           NDFIQL P S+VYEAETMVEGTLDGF FDS++E  K  K   H  DQN+  EEQT  K+K
Sbjct: 318 NDFIQLNPPSDVYEAETMVEGTLDGFSFDSDEEDGKRQKTT-HPSDQNEHAEEQTPGKSK 376

Query: 213 VKS 215
            K+
Sbjct: 377 RKA 379


>gi|218190056|gb|EEC72483.1| hypothetical protein OsI_05847 [Oryza sativa Indica Group]
          Length = 342

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 113/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)

Query: 18  KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 67
           K  K  +PKK L  P    +       K   D  D          AEED+ +K +  +VS
Sbjct: 100 KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEDMQDKLTEHSVS 159

Query: 68  TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
              LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK
Sbjct: 160 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 218

Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 187
           +T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 219 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 278

Query: 188 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 233
           KL +    Q DQN+   +Q   KTK K+  P  K + +      ATK
Sbjct: 279 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325


>gi|45736025|dbj|BAD13052.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736067|dbj|BAD13093.1| unknown protein [Oryza sativa Japonica Group]
          Length = 320

 Score =  209 bits (532), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 102/182 (56%), Positives = 139/182 (76%), Gaps = 2/182 (1%)

Query: 53  AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
           AEE++ +K +  +VS   LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  
Sbjct: 123 AEEEMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 181

Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 172
           GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+E
Sbjct: 182 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMME 241

Query: 173 GTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATA 231
           GTLDGF FDS++E DKL +    Q DQN+   +Q   KTK K+  P  K + +      A
Sbjct: 242 GTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKA 301

Query: 232 TK 233
           TK
Sbjct: 302 TK 303


>gi|115444213|ref|NP_001045886.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|113535417|dbj|BAF07800.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|215765268|dbj|BAG86965.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222622166|gb|EEE56298.1| hypothetical protein OsJ_05371 [Oryza sativa Japonica Group]
          Length = 342

 Score =  209 bits (531), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 112/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)

Query: 18  KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 67
           K  K  +PKK L  P    +       K   D  D          AEE++ +K +  +VS
Sbjct: 100 KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEEMQDKLTEHSVS 159

Query: 68  TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
              LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK
Sbjct: 160 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 218

Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 187
           +T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 219 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 278

Query: 188 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 233
           KL +    Q DQN+   +Q   KTK K+  P  K + +      ATK
Sbjct: 279 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325


>gi|218189675|gb|EEC72102.1| hypothetical protein OsI_05068 [Oryza sativa Indica Group]
          Length = 473

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 110/220 (50%), Positives = 154/220 (70%), Gaps = 14/220 (6%)

Query: 9   GREEKTSVK----KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED---------AEE 55
           G++EK +++    K +K  +PKK +K P           + G +  +D         AE 
Sbjct: 219 GKDEKPTMESNQDKPAKRSTPKKLVKFPSSSNASKVTGPKAGPDQIDDTLEHREEGVAEV 278

Query: 56  DIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNH 115
           D+ +K +  +VS   LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN 
Sbjct: 279 DMQDKLTEHSVS-QRLPLIIPDKVQRSKALIECDGDSIDLSGDVGAVGRIVISNSPNGNQ 337

Query: 116 EMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTL 175
           ++ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IMNDFIQL+PQSN++EAETMVEGTL
Sbjct: 338 DLLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMNDFIQLEPQSNLFEAETMVEGTL 397

Query: 176 DGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
           DGF FDS++E DKL +    Q DQN+   +Q   KTK K+
Sbjct: 398 DGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKA 437


>gi|297812657|ref|XP_002874212.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297320049|gb|EFH50471.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 492

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 126/265 (47%), Positives = 162/265 (61%), Gaps = 61/265 (23%)

Query: 2   DEDAALTGREEKTSVKKVSKEKSPK----KGLKSPKK----------EKDVIN------- 40
           DED  L   EE+ +VKKV KEKSPK     G K+PK+          E  VIN       
Sbjct: 186 DEDFVLDATEEEPAVKKVGKEKSPKTKSKSGRKTPKEGNGAQKIFKSEDKVINTTIAEEV 245

Query: 41  ---------------------NVKRKGNNDHEDAEEDIAEKRSM------LNVSTSTLPL 73
                                + ++   ++ +D +  IAE+ +M       + S+S LPL
Sbjct: 246 TMEKSPETKSKSSRKTPKEENSAQKIFKSEDKDTDTTIAEEGTMDQKIKPSSGSSSRLPL 305

Query: 74  VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG---------- 123
           V+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKG          
Sbjct: 306 VLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGINTFSFHVSG 362

Query: 124 TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSE 183
           TIYK+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+
Sbjct: 363 TIYKSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLTPQSNVYEAETMVEGTLEGFSFESD 422

Query: 184 DESDKLAKAMPHQIDQNDGIEEQTN 208
           DE++K AK      DQ+ G EE+TN
Sbjct: 423 DETNKNAKTASKPADQSGGTEEETN 447


>gi|195625200|gb|ACG34430.1| hypothetical protein [Zea mays]
 gi|413935593|gb|AFW70144.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 364

 Score =  208 bits (530), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 101/165 (61%), Positives = 130/165 (78%), Gaps = 2/165 (1%)

Query: 52  DAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTA 111
           DAE+D+ +K +  +VS   LPL+  +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V    
Sbjct: 165 DAEDDVQDKLTGNSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGP 223

Query: 112 EGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMV 171
               ++ LDLKGTIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMV
Sbjct: 224 TAKQDLLLDLKGTIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMV 283

Query: 172 EGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 215
           EGTLDGF FDS++E DKL +   +Q D N+  +E Q   KTK K+
Sbjct: 284 EGTLDGFTFDSDEEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 328


>gi|238481371|ref|NP_001154738.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005958|gb|AED93341.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 446

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 67  STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 263 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 319

Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 320 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 379

Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
           +K AK      DQ+ G EE+TN
Sbjct: 380 NKNAKTAVKPADQSVGTEEETN 401


>gi|240256338|ref|NP_197851.5| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005954|gb|AED93337.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 451

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 67  STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 268 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 324

Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 325 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 384

Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
           +K AK      DQ+ G EE+TN
Sbjct: 385 NKNAKTAVKPADQSVGTEEETN 406


>gi|238481367|ref|NP_001154736.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|238481369|ref|NP_001154737.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005956|gb|AED93339.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005957|gb|AED93340.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 450

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 67  STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 267 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 323

Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 324 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 383

Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
           +K AK      DQ+ G EE+TN
Sbjct: 384 NKNAKTAVKPADQSVGTEEETN 405


>gi|238481373|ref|NP_001154739.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|257050999|sp|Q9FLU1.2|BIN4_ARATH RecName: Full=DNA-binding protein BIN4; AltName: Full=Protein
           BRASSINOSTEROID INSENSITIVE 4; AltName: Full=Protein
           MIDGET
 gi|332005959|gb|AED93342.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 454

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 67  STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 271 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 327

Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 328 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 387

Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
           +K AK      DQ+ G EE+TN
Sbjct: 388 NKNAKTAVKPADQSVGTEEETN 409


>gi|449459184|ref|XP_004147326.1| PREDICTED: uncharacterized protein LOC101208657 [Cucumis sativus]
          Length = 391

 Score =  207 bits (527), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 120/240 (50%), Positives = 162/240 (67%), Gaps = 26/240 (10%)

Query: 2   DEDAALTGREEKTSVKKVSKEKSPKKGLKS---PKKEKDVINNVKRKGNNDHEDAEE--- 55
           DE+A     E K+  +KVS E SPKK +KS      +++++N+   KG +  E +E    
Sbjct: 128 DENAGRRFTEGKSKSRKVSNEMSPKKKVKSEVCTSAKENIMNSGTNKGGSTMEGSEGHVR 187

Query: 56  ----DIAEKRSMLN-----VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRIL 106
               +I EK ++ +     VS+S LPLV+S+K  R KALVECEG S+DLSGDMGAVGR++
Sbjct: 188 NGDVEILEKDALDDCIGPPVSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVV 247

Query: 107 VPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYE 166
           V  ++   +E+ LDLKGT+Y+  +VPSRTFCIVSFG SEAKIE+IMNDFIQLK  S V E
Sbjct: 248 VSDSSSAKNELCLDLKGTLYRAVIVPSRTFCIVSFGQSEAKIESIMNDFIQLKALSKVDE 307

Query: 167 AETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE----------EQTNEKTKVKSG 216
           AETMVEGTLDGF FDSED+++K+ KA     DQN+ +E          E+++ + +VK+G
Sbjct: 308 AETMVEGTLDGFSFDSEDDAEKITKAAS-PADQNEPVEGLNTKSKNKAEKSSGRKRVKTG 366


>gi|238481365|ref|NP_001154735.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005955|gb|AED93338.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 432

 Score =  207 bits (526), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 67  STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 249 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 305

Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 306 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 365

Query: 187 DKLAKAMPHQIDQNDGIEEQTN 208
           +K AK      DQ+ G EE+TN
Sbjct: 366 NKNAKTAVKPADQSVGTEEETN 387


>gi|357138685|ref|XP_003570920.1| PREDICTED: uncharacterized protein LOC100821932 [Brachypodium
           distachyon]
          Length = 358

 Score =  193 bits (491), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 86/137 (62%), Positives = 115/137 (83%), Gaps = 1/137 (0%)

Query: 53  AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
           AEE + +K    ++S   LPL++++K+QR+KALVEC+G+S+DLSGD+GAVGRI++     
Sbjct: 161 AEEQMQDKLPGNSIS-QRLPLMITDKVQRSKALVECDGDSIDLSGDIGAVGRIVISNGPT 219

Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 172
           GN ++ LDLKGT+YK+T+VPSRTFC+VS G +EAKIEAIMNDFIQL+P SN++EAETM+E
Sbjct: 220 GNQDLLLDLKGTVYKSTIVPSRTFCVVSVGQTEAKIEAIMNDFIQLEPHSNLFEAETMME 279

Query: 173 GTLDGFLFDSEDESDKL 189
           GTLDGF FDS+ E D+L
Sbjct: 280 GTLDGFTFDSDGEGDRL 296


>gi|255544304|ref|XP_002513214.1| conserved hypothetical protein [Ricinus communis]
 gi|223547712|gb|EEF49205.1| conserved hypothetical protein [Ricinus communis]
          Length = 155

 Score =  192 bits (489), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 88/117 (75%), Positives = 102/117 (87%)

Query: 99  MGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 158
           MGAVGR+++P  + GNHEM+LDLKGTIY+T +VPSRTFC+VSFG SEAKIEAIMNDFIQL
Sbjct: 1   MGAVGRVVIPDGSSGNHEMYLDLKGTIYRTKIVPSRTFCVVSFGQSEAKIEAIMNDFIQL 60

Query: 159 KPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
           K QSNVYEAETMVEGTL+GF FDSEDE++K+ K + HQ DQNDG EEQ N KTK K+
Sbjct: 61  KAQSNVYEAETMVEGTLEGFSFDSEDEAEKMPKPVSHQTDQNDGNEEQMNGKTKRKA 117


>gi|413935594|gb|AFW70145.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 354

 Score =  180 bits (456), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 92/165 (55%), Positives = 121/165 (73%), Gaps = 12/165 (7%)

Query: 52  DAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTA 111
           DAE+D+ +K +  +VS   LPL+  +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V    
Sbjct: 165 DAEDDVQDKLTGNSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGP 223

Query: 112 EGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMV 171
               ++ LDLKGTIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMV
Sbjct: 224 TAKQDLLLDLKGTIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMV 283

Query: 172 EGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 215
           E          ++E DKL +   +Q D N+  +E Q   KTK K+
Sbjct: 284 E----------DEEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 318


>gi|297598587|ref|NP_001045881.2| Os02g0147100 [Oryza sativa Japonica Group]
 gi|255670601|dbj|BAF07795.2| Os02g0147100, partial [Oryza sativa Japonica Group]
          Length = 126

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 79/118 (66%), Positives = 102/118 (86%)

Query: 84  ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 143
           AL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPSRTFC+VS G 
Sbjct: 1   ALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQ 60

Query: 144 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 201
           +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL +    Q DQN+
Sbjct: 61  TEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNN 118


>gi|147811095|emb|CAN76881.1| hypothetical protein VITISV_037245 [Vitis vinifera]
          Length = 693

 Score =  173 bits (438), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 109/238 (45%), Positives = 143/238 (60%), Gaps = 61/238 (25%)

Query: 2   DEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNND-----HEDA--- 53
           ++DA L+   +++ + + SKEKSPKK +K     +D I++ K+K N+D     H+D    
Sbjct: 174 EKDAVLSENGQESLLNEASKEKSPKKRVKV----EDHISSKKKKANSDMKGKGHDDGTEL 229

Query: 54  -EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA---------------LVECEGESVDLSG 97
            EE+ ++K +   VS+S LPLV+SEK+ R+K                L+  +GES+DLSG
Sbjct: 230 LEEEASDKHTEPQVSSSGLPLVLSEKVHRSKGFRFSIEKAASLEIRPLLSVKGESIDLSG 289

Query: 98  DMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQ 157
           DMGAVGRI                                 VSFG SEAK+EA+MNDFIQ
Sbjct: 290 DMGAVGRI---------------------------------VSFGQSEAKVEAVMNDFIQ 316

Query: 158 LKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
           LKPQSNVYEAETMVEG L+GF FDS+DE+DK+ KA   Q DQN+G EEQTN KTK K+
Sbjct: 317 LKPQSNVYEAETMVEGILEGFSFDSDDEADKMPKASTRQTDQNEGDEEQTNGKTKGKA 374


>gi|410129774|dbj|BAM64852.1| hypothetical protein [Beta vulgaris]
          Length = 467

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 99/198 (50%), Positives = 131/198 (66%), Gaps = 25/198 (12%)

Query: 12  EKTSVKKVSKEKSPKKGLKSP--KKEKDVINNVKRKGNNDHEDAEEDI-AEKRSMLNVST 68
           E+++ +   K KSPKK +K      EK      K +G  ++ D  E++  EK+S   VS+
Sbjct: 272 EESAPRSAVKRKSPKKRVKQEDDATEKRTNEPAKIEGEVENMDVAEEVKPEKQSEPAVSS 331

Query: 69  STLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 128
           ST+PLV+ EK+QRTKALVECEGES+DLSGDMGAVGR+++  T     EMFLDLK ++   
Sbjct: 332 STMPLVLPEKVQRTKALVECEGESIDLSGDMGAVGRVIISETPSKESEMFLDLKVSLE-- 389

Query: 129 TLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDK 188
                             +IEAIMNDFIQLKPQSNVY AETMVEGTL+GF F+S+DE+DK
Sbjct: 390 ------------------QIEAIMNDFIQLKPQSNVYNAETMVEGTLEGFTFESDDEADK 431

Query: 189 LAKAMPHQIDQNDGIEEQ 206
           + K +  + DQ++G EE+
Sbjct: 432 MTKDV--KGDQSEGAEEK 447


>gi|45736024|dbj|BAD13051.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736066|dbj|BAD13092.1| unknown protein [Oryza sativa Japonica Group]
          Length = 193

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 75/119 (63%), Positives = 96/119 (80%)

Query: 97  GDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFI 156
           GD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFI
Sbjct: 39  GDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFI 98

Query: 157 QLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
           QL+PQSN++EAETM+EGTLDGF FDS++E DKL +    Q DQN+   +Q   KTK K+
Sbjct: 99  QLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKA 157


>gi|10177863|dbj|BAB11215.1| unnamed protein product [Arabidopsis thaliana]
          Length = 531

 Score =  160 bits (404), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 90/159 (56%), Positives = 107/159 (67%), Gaps = 20/159 (12%)

Query: 67  STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLK    
Sbjct: 331 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKEHFA 387

Query: 127 KTTLVPSRTF------C-----------IVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET 169
                 S TF      C            V+ G +EAKIEAIMNDFIQL PQSNVYEAET
Sbjct: 388 LYVFSSSCTFSDCKGTCGTIIHTSDLVFQVNVGQTEAKIEAIMNDFIQLIPQSNVYEAET 447

Query: 170 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTN 208
           MVEGTL+GF F+S+DES+K AK      DQ+ G EE+TN
Sbjct: 448 MVEGTLEGFTFESDDESNKNAKTAVKPADQSVGTEEETN 486


>gi|168037576|ref|XP_001771279.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677368|gb|EDQ63839.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 379

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 66/125 (52%), Positives = 93/125 (74%), Gaps = 3/125 (2%)

Query: 66  VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTI 125
           ++ S+LP+V  EK+ RTK L+ECEG+++DLSGDMGAVGR  V      + E+ LDLKG +
Sbjct: 179 IAASSLPIVFGEKVNRTKVLLECEGDALDLSGDMGAVGRFTV---NRPDDELLLDLKGVV 235

Query: 126 YKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDE 185
           YKTT+VPS T+ +V+ G  EAK+E+IM DF+QL+  ++  E ETMVEGTL  F F+S++E
Sbjct: 236 YKTTIVPSNTYFVVNVGQMEAKVESIMTDFMQLRADTSWNENETMVEGTLKDFAFESDEE 295

Query: 186 SDKLA 190
            ++ A
Sbjct: 296 GERPA 300


>gi|168010889|ref|XP_001758136.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690592|gb|EDQ76958.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 373

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 64/122 (52%), Positives = 93/122 (76%), Gaps = 3/122 (2%)

Query: 67  STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 126
           + S+LP+V  EK+ +TK L+ECEG+++DLSGDMGAVGR  V      ++E+ LDLKG IY
Sbjct: 172 AASSLPIVFGEKVNKTKVLLECEGDALDLSGDMGAVGRFTV---NRRDNELLLDLKGVIY 228

Query: 127 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 186
           KTT+VPS TF +V+ G ++AK+E+IM+DF+QL+  +   E ET+VEGTL  F F+S++E 
Sbjct: 229 KTTIVPSNTFFLVNVGQTDAKVESIMSDFVQLRADTIGKENETVVEGTLQDFTFESDEEG 288

Query: 187 DK 188
           ++
Sbjct: 289 ER 290


>gi|45736061|dbj|BAD13087.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 189

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/106 (56%), Positives = 85/106 (80%), Gaps = 4/106 (3%)

Query: 53  AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 112
           AEED+ +K +  +VS   LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  
Sbjct: 87  AEEDMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 145

Query: 113 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 158
           GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAK   +++D I L
Sbjct: 146 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAK---VVDDNINL 188


>gi|224132538|ref|XP_002321341.1| predicted protein [Populus trichocarpa]
 gi|222868337|gb|EEF05468.1| predicted protein [Populus trichocarpa]
          Length = 159

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 91/178 (51%), Gaps = 54/178 (30%)

Query: 47  NNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRIL 106
           + D E  EE  +EK    +VS S LPLV+SEK+QR+KALVECEGES+DLSG+MGAVGR +
Sbjct: 20  HGDVEITEEVTSEKLINAHVSISRLPLVLSEKVQRSKALVECEGESIDLSGEMGAVGRPV 79

Query: 107 VPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYE 166
           +P T  G+ EM LDLKGT                                          
Sbjct: 80  IPDTPSGDSEMHLDLKGT------------------------------------------ 97

Query: 167 AETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRK 220
                   L+GF F SEDE+DK+ KA  HQ DQN  +EE +N   K K    SG  RK
Sbjct: 98  --------LEGFSFFSEDETDKITKATAHQTDQNMDVEEPSNGMIKRKAEKSSGVARK 147


>gi|384247484|gb|EIE20971.1| hypothetical protein COCSUDRAFT_83530 [Coccomyxa subellipsoidea
           C-169]
          Length = 378

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 68/105 (64%), Gaps = 5/105 (4%)

Query: 60  KRSMLNVSTSTLPLVMSEKLQRTKALVECEG---ESVDLSGDMGAVGRILVPGTAEGNHE 116
           K ++L      +P++M EKL  TK +VE EG    SVDL+GD G VGR LV G  + N E
Sbjct: 185 KIAVLAPVAGEIPVMMPEKLPVTKFIVELEGGNDSSVDLTGDAGVVGRWLVEGD-DSNPE 243

Query: 117 MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKP 160
           + +DLKG +Y  T VP + T C+V+   +EA++EA+ + +IQL+P
Sbjct: 244 LRMDLKGVVYAATTVPCATTLCVVNLSATEARVEALGHQYIQLRP 288


>gi|303275097|ref|XP_003056848.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
 gi|226461200|gb|EEH58493.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
          Length = 392

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/119 (39%), Positives = 72/119 (60%), Gaps = 8/119 (6%)

Query: 71  LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 128
           +PL+M   + R K   ECEG  ESVDL GD+G VGR+L   +A+    + +DLKG IY  
Sbjct: 195 IPLLMPATVNRNKIFFECEGAGESVDLEGDVGVVGRLLSDPSAKYTSGVKVDLKGVIYDA 254

Query: 129 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFL-FDSEDE 185
            ++P+  +  +++   +EAK+E I ND++QL+           V GTL+G+L  DS+D+
Sbjct: 255 KILPTPASIVVLAVNQTEAKVECIANDYVQLRRDVTANN----VGGTLEGYLGVDSDDD 309


>gi|307107138|gb|EFN55382.1| hypothetical protein CHLNCDRAFT_52595 [Chlorella variabilis]
          Length = 402

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 73/125 (58%), Gaps = 8/125 (6%)

Query: 58  AEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES------VDLSGDMGAVGRILVPGTA 111
           A    + +  TS +P+V+ EKL + K LVE E          DLSGD GA+GR+++ GT 
Sbjct: 168 AAGLGLPHAPTSQMPVVLPEKLPQMKMLVELESNPDDAHHVTDLSGDSGAIGRLVIAGTK 227

Query: 112 EGNHEMFLDLKGTIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETM 170
           E   EM +DLKG +Y  T+VP   +  +++ G +EAK+E + ++F+QL+  +     +T 
Sbjct: 228 EAP-EMQIDLKGVLYNATIVPCPVSIAVINMGQTEAKVECLFSEFVQLREDTRFSFVDTE 286

Query: 171 VEGTL 175
           V G +
Sbjct: 287 VLGGM 291


>gi|255084389|ref|XP_002508769.1| predicted protein [Micromonas sp. RCC299]
 gi|226524046|gb|ACO70027.1| predicted protein [Micromonas sp. RCC299]
          Length = 376

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 72/121 (59%), Gaps = 11/121 (9%)

Query: 71  LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 128
           +PL M   + R K   E EG  E+VDL GD+G VGR+L   + +    + +D+KG IY  
Sbjct: 178 VPLFMPSAVNRNKVFFELEGTGEAVDLEGDVGVVGRLLSNASEKHGSGLQMDMKGVIYNA 237

Query: 129 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLK--PQSNVYEAETMVEGTLDGFL-FDSED 184
            ++P+  +  I++   +EAK+E+I +DF+QL+  P +N       +  TLDG+L  DS+D
Sbjct: 238 RILPTPASVVILAVNANEAKVESITHDFVQLRRDPVAN-----GGLGATLDGYLGVDSDD 292

Query: 185 E 185
           E
Sbjct: 293 E 293


>gi|449532507|ref|XP_004173222.1| PREDICTED: DNA-binding protein BIN4-like, partial [Cucumis sativus]
          Length = 198

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 36/57 (63%), Positives = 46/57 (80%)

Query: 66  VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK 122
           VS+S LPLV+S+K  R KALVECEG S+DLSGDMGAVGR++V  ++   +E+ LDLK
Sbjct: 142 VSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVVVSDSSSAKNELCLDLK 198


>gi|320166916|gb|EFW43815.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 236

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 68/110 (61%), Gaps = 3/110 (2%)

Query: 71  LPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTL 130
           +PLV+ +    T  L+E     ++LSGD G+VGR++V    +    + LDLKG I++ T+
Sbjct: 53  IPLVLGKP--ATAFLMETTEPQLNLSGDTGSVGRLVV-NAQDAAAPLVLDLKGLIFQGTV 109

Query: 131 VPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLF 180
           VPS TF +V+ G +EA+I+A +N+ I      +  ++ T+VEG++D   F
Sbjct: 110 VPSNTFMLVTIGPNEARIDATVNNVIHTTCLGDALDSMTVVEGSIDADAF 159


>gi|428174918|gb|EKX43811.1| hypothetical protein GUITHDRAFT_140250 [Guillardia theta CCMP2712]
          Length = 309

 Score = 70.5 bits (171), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 52/82 (63%), Gaps = 5/82 (6%)

Query: 85  LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHS 144
           +V+CE +++D  GD G +G I       G   M LDLKG +Y+ +++P  T  ++  G +
Sbjct: 164 VVQCEDQNLDFEGDSGVIGTI-----KTGERSMELDLKGQLYEGSILPCCTMMVMDMGAT 218

Query: 145 EAKIEAIMNDFIQLKPQSNVYE 166
           E KIE++MNDF+++ P +N ++
Sbjct: 219 EVKIESVMNDFVKVHPIANPHD 240


>gi|449526337|ref|XP_004170170.1| PREDICTED: DNA-binding protein BIN4-like [Cucumis sativus]
          Length = 71

 Score = 67.0 bits (162), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 35/53 (66%), Positives = 41/53 (77%), Gaps = 1/53 (1%)

Query: 152 MNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE 204
           MNDFIQLK  S V EAETMVEGTLDGF FDSED+++K+ KA     DQN+ +E
Sbjct: 1   MNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDDAEKITKA-ASPADQNEPVE 52


>gi|302857504|ref|XP_002959890.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
 gi|300254022|gb|EFJ39045.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
          Length = 340

 Score = 67.0 bits (162), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 35/129 (27%)

Query: 66  VSTSTLPLVMSEKLQRTKALVEC-----EGE----SVDLSGDMGAVGRILVPGTAEGNH- 115
           V    LP+++ EKL R K L+E       GE    + DLSGD GAVGR++V      N  
Sbjct: 38  VEAGKLPIMLPEKLNRNKVLLELPPAAMSGEHYHGAADLSGDSGAVGRLVVGKAPAANSK 97

Query: 116 ------------------------EMFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEA 150
                                   ++ +DLKG +Y   ++P + T  +++ G SEAK+E+
Sbjct: 98  GTAAAAAAAAAGGSEGGGSGGDSVQLQVDLKGLLYNAWVLPLAGTAMVLNIGPSEAKVES 157

Query: 151 IMNDFIQLK 159
           + NDFI+L+
Sbjct: 158 LFNDFIRLR 166


>gi|299471419|emb|CBN79372.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 240

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 68/120 (56%), Gaps = 7/120 (5%)

Query: 71  LPLVMSEKL---QRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 127
           +PL++   +    +   LV    E+++ +GD GA+GRI V G  +G    F DLKG  + 
Sbjct: 59  VPLIVPPAVCKDAKASLLVHLRDEALNFAGDTGAIGRISVGGKKKGRGLTF-DLKGQQFS 117

Query: 128 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET--MVEGTL-DGFLFDSED 184
             ++PS T  +V  G +EAK+E + ++  +L  + NV++A    +++G + D + F+ ED
Sbjct: 118 GQIIPSATVMVVGIGKTEAKVEGLFSEVCRLDYKGNVFDAMGGEVLKGEMDDSYKFEEED 177


>gi|412989095|emb|CCO15686.1| hypothetical protein VOLCADRAFT_101408 [Bathycoccus prasinos]
          Length = 422

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 24/117 (20%)

Query: 66  VSTSTLPLVMSE-KLQRTK---ALVECEG--ESVDLSGDMGAVGRILV-----PGTAEG- 113
           V  S +PL+++   LQ+T     L EC+G  E+VDLSGD+G VGR+LV     P    G 
Sbjct: 237 VPKSKMPLLINHGALQKTANSTILFECKGSGEAVDLSGDVGVVGRLLVERESQPTGGGGS 296

Query: 114 ---------NHEMFLDLKGTIY--KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLK 159
                    +  + +DLKG +Y  +T L+ S T  IV+ G  +AKIE   ++++QL+
Sbjct: 297 KASTSKQQPHASVKVDLKGVMYLARTHLL-SSTVAIVNVGAEDAKIETFFDEYVQLR 352


>gi|308804497|ref|XP_003079561.1| unnamed protein product [Ostreococcus tauri]
 gi|116058016|emb|CAL54219.1| unnamed protein product [Ostreococcus tauri]
          Length = 229

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 70/124 (56%), Gaps = 11/124 (8%)

Query: 67  STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 123
           S++ + LV S  L  ++K L E EG  E+VDL GD GAVGR L    AE    + +D+KG
Sbjct: 63  SSTRVDLVASAHLGGKSKVLFEIEGNGEAVDLDGDTGAVGRWL----AESARSLKVDMKG 118

Query: 124 TIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDS 182
            +Y   +VP + T  +V+     AKI+++  +F+QL+ +  V  A T   G   G LF+ 
Sbjct: 119 LMYNARVVPLAGTVVVVAVNGDVAKIDSVHREFVQLR-EDPVANAGTENYGV--GSLFEM 175

Query: 183 EDES 186
            DE+
Sbjct: 176 TDEA 179


>gi|356516190|ref|XP_003526779.1| PREDICTED: uncharacterized protein LOC100812582 [Glycine max]
          Length = 82

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/46 (56%), Positives = 36/46 (78%), Gaps = 1/46 (2%)

Query: 170 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 215
           MVEGTLDGF FDS++E+ K+ K+  +Q DQN+ +EEQ N K+K K+
Sbjct: 1   MVEGTLDGFFFDSDEEAGKMQKST-NQTDQNENVEEQANGKSKGKT 45


>gi|159483423|ref|XP_001699760.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281702|gb|EDP07456.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 308

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 64/138 (46%), Gaps = 41/138 (29%)

Query: 69  STLPLVMSEKLQRTKALVECEGE---------------SVDLSGDMGAVGRILVPGTAEG 113
            +LPL++ +K+ R K L E                   + DLSGD GAVGR++V   A  
Sbjct: 140 GSLPLMLPDKINRNKVLFELPPPPASSSNADGYGHGHGATDLSGDAGAVGRLIVLAAAPT 199

Query: 114 NHE-------------------------MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAK 147
             +                         + +DLKG IY    +P + T  +++ G +EAK
Sbjct: 200 GGKGAAAGGGEPGPSSQGGGGPGGAEVCLQVDLKGVIYNAWALPLAGTAMVLNVGPTEAK 259

Query: 148 IEAIMNDFIQLKPQSNVY 165
           +E++ NDF++L+  ++V+
Sbjct: 260 VESLFNDFVRLREAADVH 277


>gi|145347277|ref|XP_001418100.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578328|gb|ABO96393.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 999

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 58/97 (59%), Gaps = 8/97 (8%)

Query: 67  STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 123
           S++ + LV +  L  ++K L E EG  E+VDL GD GAVGR L    AE +  + +D+KG
Sbjct: 33  SSTAVELVATANLGGKSKVLFEIEGAGEAVDLDGDTGAVGRWL----AESSRALKVDMKG 88

Query: 124 TIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLK 159
            +Y   +V S  T  +V+     AKIE++  +F+QL+
Sbjct: 89  VMYNARVVSSAGTVVVVAVNADVAKIESVHREFVQLR 125


>gi|255544302|ref|XP_002513213.1| conserved hypothetical protein [Ricinus communis]
 gi|223547711|gb|EEF49204.1| conserved hypothetical protein [Ricinus communis]
          Length = 137

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/77 (44%), Positives = 52/77 (67%), Gaps = 5/77 (6%)

Query: 15  SVKKVSKEKSPKKGLK----SPKKEKDVINNVKRKGNNDH-EDAEEDIAEKRSMLNVSTS 69
           SV+K  K KSPKK LK      KK+++  ++V + G+ND  E A+ED++EK +   VSTS
Sbjct: 61  SVEKALKGKSPKKLLKVEGRKLKKKENKNDDVPKTGDNDDVEVADEDVSEKHTGPQVSTS 120

Query: 70  TLPLVMSEKLQRTKALV 86
            +PL++S+K+ RTK  +
Sbjct: 121 MIPLLLSDKVSRTKMFL 137


>gi|397569025|gb|EJK46493.1| hypothetical protein THAOC_34834 [Thalassiosira oceanica]
          Length = 315

 Score = 47.8 bits (112), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 85  LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH- 143
           LV+   + +DL G  GAVGR+           + LDLKG  Y+ T+ P  T  +V+    
Sbjct: 175 LVQMNDDKLDLHGQSGAVGRLEA-----DERGVILDLKGYQYRGTIRPGPTAMVVALTRE 229

Query: 144 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSED 184
            + K+E+I ++F+ L  +   +  +++ +   +GF+   ED
Sbjct: 230 GKFKVESITDEFVTLDAK-GAHVMDSLGDNMREGFMVREED 269


>gi|422294628|gb|EKU21928.1| hypothetical protein NGA_2017520, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 238

 Score = 45.8 bits (107), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 94  DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 153
           DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAK+E +++
Sbjct: 40  DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVEDVVD 95

Query: 154 DFIQLKPQSNVYEA 167
               L    +  EA
Sbjct: 96  TLCVLTHHHDELEA 109


>gi|219116542|ref|XP_002179066.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409833|gb|EEC49764.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 289

 Score = 44.7 bits (104), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 14/105 (13%)

Query: 85  LVECEGE---SVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSF 141
           LV+ + E   S+D +G +GA+GR+     AEG   + LD KG  Y+  + P  T  ++S 
Sbjct: 150 LVQIDPEDSASLDFTGAVGAIGRLET--VAEG---VVLDFKGCQYRGRIYPGSTAVVLST 204

Query: 142 GHSEA-----KIEAIMNDFIQLKPQSNVYE-AETMVEGTLDGFLF 180
              +A     K+E++ ++F +L  + N  E    +V G + G+ +
Sbjct: 205 NGKKADTQLLKVESMTDEFCRLTKEGNAMEKLNAIVRGDMVGYQY 249


>gi|387199325|gb|AFJ68895.1| hypothetical protein NGATSA_2017510 [Nannochloropsis gaditana
           CCMP526]
          Length = 226

 Score = 44.7 bits (104), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 94  DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 153
           DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAK++ +++
Sbjct: 28  DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVKDVVD 83

Query: 154 DFIQLKPQSNVYEA 167
               L    +  EA
Sbjct: 84  TLCVLTHHHDELEA 97


>gi|449018377|dbj|BAM81779.1| hypothetical protein CYME_CMP364C [Cyanidioschyzon merolae strain
           10D]
          Length = 263

 Score = 43.9 bits (102), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 51/100 (51%), Gaps = 10/100 (10%)

Query: 84  ALVECEGESVDLSGDMGAVGRILVPGTAE-GNHEMFLDLKGTIYKTTLVPSRTFCI---- 138
           AL+  E +  DL+GD G +G +     A+ G   + LDLKG +Y+T     RTF +    
Sbjct: 110 ALLVFENDEFDLAGDTGTIGCVQWSHNADSGAVRLCLDLKGVVYET-----RTFSLCGSA 164

Query: 139 VSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGF 178
           ++   +  ++EA+++D     P +   E   ++ G ++ +
Sbjct: 165 LAVDPTSQRVEAVLHDITYSIPTNVADERAQLIRGKIEPY 204


>gi|293394374|ref|ZP_06638674.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
 gi|291423352|gb|EFE96581.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
          Length = 392

 Score = 39.3 bits (90), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 65/138 (47%), Gaps = 14/138 (10%)

Query: 30  KSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA-LVEC 88
           + P+++   +N    +G+     + +DI  KR+  +  TS  P +  E L+   + L + 
Sbjct: 21  RDPEQQSGFVNAPVYRGSTVVYKSCDDIRHKRARFSYGTSGTPTI--ENLENAWSTLCDA 78

Query: 89  EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFC---IVSFGHSE 145
            G  +  SG +GA+   L   T  G+H +  D   ++Y+    P+R FC   +V +G S 
Sbjct: 79  AGTVISPSG-LGALALALFSVTRAGDHILVTD---SVYR----PTRKFCNTLLVKYGVSV 130

Query: 146 AKIEAIMNDFIQLKPQSN 163
              + ++   I+   Q+N
Sbjct: 131 TYYDPLIGGDIERLLQNN 148


>gi|307109224|gb|EFN57462.1| hypothetical protein CHLNCDRAFT_142962 [Chlorella variabilis]
          Length = 642

 Score = 38.5 bits (88), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 85  LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVP-SRTFCIV 139
           L E  G  ++L GD G +GR+ V   +     + LDL G +Y+  LVP + T  +V
Sbjct: 441 LCEVPGHLLNLEGDSGTIGRVAVERGSGKEAALQLDLMGIVYQAELVPLAATAAVV 496


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.308    0.126    0.344 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,560,197,107
Number of Sequences: 23463169
Number of extensions: 143378815
Number of successful extensions: 542441
Number of sequences better than 100.0: 159
Number of HSP's better than 100.0 without gapping: 59
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 541765
Number of HSP's gapped (non-prelim): 628
length of query: 245
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 106
effective length of database: 9,097,814,876
effective search space: 964368376856
effective search space used: 964368376856
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 75 (33.5 bits)