BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023380
         (283 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359490779|ref|XP_002269798.2| PREDICTED: uncharacterized protein LOC100252112 [Vitis vinifera]
 gi|302144012|emb|CBI23117.3| unnamed protein product [Vitis vinifera]
          Length = 397

 Score =  288 bits (736), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 163/280 (58%), Positives = 201/280 (71%), Gaps = 30/280 (10%)

Query: 2   EPFAPKSSILASSSDSESCQNSSLLREE----------KAFQFQEKEMDEDAALTGREEK 51
           EP AP  S+   SSDSESC ++S +RE+          K  QF +KE  +DA L+   ++
Sbjct: 98  EPNAPNHSVWKLSSDSESCPDNSSMREDYIHHEESSVHKTTQFPDKE--KDAVLSENGQE 155

Query: 52  TSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNND-----HEDA----EEDIAEKRSML 102
           + + + SKEKSPKK +K     +D I++ K+K N+D     H+D     EE+ ++K +  
Sbjct: 156 SLLNEASKEKSPKKRVKV----EDHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEP 211

Query: 103 NVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGT 162
            VS S LPLV+SEK+ R+KALVECEGES+DLSGDMGAVGRI++  T   NHEMFLDLKGT
Sbjct: 212 QVS-SGLPLVLSEKVHRSKALVECEGESIDLSGDMGAVGRIVILDTPSRNHEMFLDLKGT 270

Query: 163 IYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSED 222
           IYKTT+VPSRTFCIVSFG SEAK+EA+MNDFIQLKPQSNVYEAETMVEG L+GF FDS+D
Sbjct: 271 IYKTTIVPSRTFCIVSFGQSEAKVEAVMNDFIQLKPQSNVYEAETMVEGILEGFSFDSDD 330

Query: 223 ESDKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRK 258
           E+DK+ KA   Q DQN+G EEQTN KTK K    SG  RK
Sbjct: 331 EADKMPKASTRQTDQNEGDEEQTNGKTKGKAEKASGVVRK 370


>gi|224121130|ref|XP_002318505.1| predicted protein [Populus trichocarpa]
 gi|222859178|gb|EEE96725.1| predicted protein [Populus trichocarpa]
          Length = 187

 Score =  243 bits (620), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 118/159 (74%), Positives = 135/159 (84%), Gaps = 5/159 (3%)

Query: 106 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 165
           TS LPLV+SEK+QR+KALVECEGES+DLSGDMGAVGR+++P T  GN EM+LDLKGTIY+
Sbjct: 1   TSRLPLVLSEKVQRSKALVECEGESIDLSGDMGAVGRVVIPDTPSGNSEMYLDLKGTIYR 60

Query: 166 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE-GTLDGFLFDSEDES 224
           TT+VPSRTFC+VSFG SEAKIEAIMNDFIQLK QSNVYEAETMVE GTL+GF FDSEDE+
Sbjct: 61  TTIVPSRTFCVVSFGQSEAKIEAIMNDFIQLKTQSNVYEAETMVEVGTLEGFSFDSEDET 120

Query: 225 DKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRKE 259
           DK+ KA   Q DQN+G+EE  N KTK K    SG  RK+
Sbjct: 121 DKITKATALQTDQNEGVEEPANGKTKRKPVKSSGVARKK 159


>gi|357464307|ref|XP_003602435.1| DNA-binding protein BIN4 [Medicago truncatula]
 gi|355491483|gb|AES72686.1| DNA-binding protein BIN4 [Medicago truncatula]
          Length = 416

 Score =  224 bits (572), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 134/261 (51%), Positives = 172/261 (65%), Gaps = 23/261 (8%)

Query: 9   SILASSSDSESCQNSSLLREEKAFQFQEKEM-----DEDAALTGREEKTSVKKVSKEK-- 61
           S+   SSDSESC++ S   E++  Q + K        ED  L   + K+  KK SK K  
Sbjct: 126 SMWELSSDSESCRDHSPKTEDQIDQVETKPQTSQPPGEDGLLRDNDGKSPSKKASKAKAS 185

Query: 62  ---------SPKKGLKSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLV 112
                    +P KG K+       +N   + G+ D E  +E+  EK    NVS+S LPL+
Sbjct: 186 QKQTDLEDCTPVKGKKT------KVNAKGKGGDGDVEVKDEETFEKPVEPNVSSSILPLM 239

Query: 113 MSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSR 172
           +SEK+ RTKALVEC+G+S+DLSGDMGAVGR+++  +  G+ EM LDLKGTIYKT+++PSR
Sbjct: 240 LSEKVHRTKALVECQGDSIDLSGDMGAVGRVIISDSPSGDQEMCLDLKGTIYKTSIIPSR 299

Query: 173 TFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMP 232
           TFC+VSFG +EAKIEAIMNDFIQL P S+VYEAETMVEGTLDGF FDS++E  K  K   
Sbjct: 300 TFCVVSFGQTEAKIEAIMNDFIQLNPPSDVYEAETMVEGTLDGFSFDSDEEDGKRQKTT- 358

Query: 233 HQIDQNDGIEEQTNEKTKVKS 253
           H  DQN+  EEQT  K+K K+
Sbjct: 359 HPSDQNEHAEEQTPGKSKRKA 379


>gi|356507532|ref|XP_003522518.1| PREDICTED: uncharacterized protein LOC100795660 [Glycine max]
          Length = 380

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 115/194 (59%), Positives = 147/194 (75%), Gaps = 10/194 (5%)

Query: 69  SPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNV---STSTLPLVMSEKLQRTKALVE 125
           +P K K++  + K KG  D +  EE+  EK +  NV   S++ LPL++SEK+QRTKAL+E
Sbjct: 151 TPVKGKEIKASAKGKGIGDLKVEEEETCEKPAEPNVRMVSSARLPLMLSEKVQRTKALIE 210

Query: 126 CEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK------GTIYKTTLVPSRTFCIVSF 179
           C+G+S+DLSGDMGAVGRI++  +  G+ EM LDLK      GTIYKT++VP RTFC+VSF
Sbjct: 211 CQGDSIDLSGDMGAVGRIIISDSPSGDQEMCLDLKDFAVISGTIYKTSIVPCRTFCVVSF 270

Query: 180 GHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 239
           G SEAK+EAIMNDFIQLKP SNVYEAETMVEGTLDGF FDS++ES K+ K+  +Q DQN+
Sbjct: 271 GQSEAKVEAIMNDFIQLKPHSNVYEAETMVEGTLDGFFFDSDEESGKIQKST-NQTDQNE 329

Query: 240 GIEEQTNEKTKVKS 253
             EEQ N  +K K+
Sbjct: 330 NAEEQANGNSKGKA 343


>gi|449459184|ref|XP_004147326.1| PREDICTED: uncharacterized protein LOC101208657 [Cucumis sativus]
          Length = 391

 Score =  220 bits (560), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 130/283 (45%), Positives = 183/283 (64%), Gaps = 36/283 (12%)

Query: 7   KSSILASSSDSESCQNSSLLREEKAF----------QFQEKEMDEDAALTGREEKTSVKK 56
           K S+   S DSESC +++ ++E+ ++          + Q +  DE+A     E K+  +K
Sbjct: 85  KHSVWMLSLDSESCSDNNFIKEDYSYHEELAELATSEIQGRRKDENAGRRFTEGKSKSRK 144

Query: 57  VSKEKSPKKGLKS---PKKEKDVINNVKRKGNNDHEDAEE-------DIAEKRSMLN--- 103
           VS E SPKK +KS      +++++N+   KG +  E +E        +I EK ++ +   
Sbjct: 145 VSNEMSPKKKVKSEVCTSAKENIMNSGTNKGGSTMEGSEGHVRNGDVEILEKDALDDCIG 204

Query: 104 --VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 161
             VS+S LPLV+S+K  R KALVECEG S+DLSGDMGAVGR++V  ++   +E+ LDLKG
Sbjct: 205 PPVSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVVVSDSSSAKNELCLDLKG 264

Query: 162 TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSE 221
           T+Y+  +VPSRTFCIVSFG SEAKIE+IMNDFIQLK  S V EAETMVEGTLDGF FDSE
Sbjct: 265 TLYRAVIVPSRTFCIVSFGQSEAKIESIMNDFIQLKALSKVDEAETMVEGTLDGFSFDSE 324

Query: 222 DESDKLAKAMPHQIDQNDGIE----------EQTNEKTKVKSG 254
           D+++K+ KA     DQN+ +E          E+++ + +VK+G
Sbjct: 325 DDAEKITKAAS-PADQNEPVEGLNTKSKNKAEKSSGRKRVKTG 366


>gi|218190056|gb|EEC72483.1| hypothetical protein OsI_05847 [Oryza sativa Indica Group]
          Length = 342

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 113/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)

Query: 56  KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 105
           K  K  +PKK L  P    +       K   D  D          AEED+ +K +  +VS
Sbjct: 100 KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEDMQDKLTEHSVS 159

Query: 106 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 165
              LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK
Sbjct: 160 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 218

Query: 166 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 225
           +T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 219 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 278

Query: 226 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 271
           KL +    Q DQN+   +Q   KTK K+  P  K + +      ATK
Sbjct: 279 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325


>gi|297812657|ref|XP_002874212.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297320049|gb|EFH50471.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 492

 Score =  210 bits (535), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 127/267 (47%), Positives = 163/267 (61%), Gaps = 61/267 (22%)

Query: 38  EMDEDAALTGREEKTSVKKVSKEKSPK----KGLKSPKK----------EKDVIN----- 78
           E DED  L   EE+ +VKKV KEKSPK     G K+PK+          E  VIN     
Sbjct: 184 EKDEDFVLDATEEEPAVKKVGKEKSPKTKSKSGRKTPKEGNGAQKIFKSEDKVINTTIAE 243

Query: 79  -----------------------NVKRKGNNDHEDAEEDIAEKRSM------LNVSTSTL 109
                                  + ++   ++ +D +  IAE+ +M       + S+S L
Sbjct: 244 EVTMEKSPETKSKSSRKTPKEENSAQKIFKSEDKDTDTTIAEEGTMDQKIKPSSGSSSRL 303

Query: 110 PLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG-------- 161
           PLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKG        
Sbjct: 304 PLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGINTFSFHV 360

Query: 162 --TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFD 219
             TIYK+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+
Sbjct: 361 SGTIYKSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLTPQSNVYEAETMVEGTLEGFSFE 420

Query: 220 SEDESDKLAKAMPHQIDQNDGIEEQTN 246
           S+DE++K AK      DQ+ G EE+TN
Sbjct: 421 SDDETNKNAKTASKPADQSGGTEEETN 447


>gi|45736025|dbj|BAD13052.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736067|dbj|BAD13093.1| unknown protein [Oryza sativa Japonica Group]
          Length = 320

 Score =  210 bits (534), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 102/182 (56%), Positives = 139/182 (76%), Gaps = 2/182 (1%)

Query: 91  AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 150
           AEE++ +K +  +VS   LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  
Sbjct: 123 AEEEMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 181

Query: 151 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 210
           GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+E
Sbjct: 182 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMME 241

Query: 211 GTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATA 269
           GTLDGF FDS++E DKL +    Q DQN+   +Q   KTK K+  P  K + +      A
Sbjct: 242 GTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKA 301

Query: 270 TK 271
           TK
Sbjct: 302 TK 303


>gi|115444213|ref|NP_001045886.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|113535417|dbj|BAF07800.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|215765268|dbj|BAG86965.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222622166|gb|EEE56298.1| hypothetical protein OsJ_05371 [Oryza sativa Japonica Group]
          Length = 342

 Score =  209 bits (533), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 112/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)

Query: 56  KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 105
           K  K  +PKK L  P    +       K   D  D          AEE++ +K +  +VS
Sbjct: 100 KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEEMQDKLTEHSVS 159

Query: 106 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 165
              LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK
Sbjct: 160 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 218

Query: 166 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 225
           +T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 219 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 278

Query: 226 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 271
           KL +    Q DQN+   +Q   KTK K+  P  K + +      ATK
Sbjct: 279 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325


>gi|218189675|gb|EEC72102.1| hypothetical protein OsI_05068 [Oryza sativa Indica Group]
          Length = 473

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 110/220 (50%), Positives = 154/220 (70%), Gaps = 14/220 (6%)

Query: 47  GREEKTSVK----KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED---------AEE 93
           G++EK +++    K +K  +PKK +K P           + G +  +D         AE 
Sbjct: 219 GKDEKPTMESNQDKPAKRSTPKKLVKFPSSSNASKVTGPKAGPDQIDDTLEHREEGVAEV 278

Query: 94  DIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNH 153
           D+ +K +  +VS   LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN 
Sbjct: 279 DMQDKLTEHSVS-QRLPLIIPDKVQRSKALIECDGDSIDLSGDVGAVGRIVISNSPNGNQ 337

Query: 154 EMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTL 213
           ++ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IMNDFIQL+PQSN++EAETMVEGTL
Sbjct: 338 DLLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMNDFIQLEPQSNLFEAETMVEGTL 397

Query: 214 DGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 253
           DGF FDS++E DKL +    Q DQN+   +Q   KTK K+
Sbjct: 398 DGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKA 437


>gi|195625200|gb|ACG34430.1| hypothetical protein [Zea mays]
 gi|413935593|gb|AFW70144.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 364

 Score =  209 bits (531), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 101/165 (61%), Positives = 130/165 (78%), Gaps = 2/165 (1%)

Query: 90  DAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTA 149
           DAE+D+ +K +  +VS   LPL+  +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V    
Sbjct: 165 DAEDDVQDKLTGNSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGP 223

Query: 150 EGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMV 209
               ++ LDLKGTIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMV
Sbjct: 224 TAKQDLLLDLKGTIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMV 283

Query: 210 EGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 253
           EGTLDGF FDS++E DKL +   +Q D N+  +E Q   KTK K+
Sbjct: 284 EGTLDGFTFDSDEEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 328


>gi|238481371|ref|NP_001154738.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005958|gb|AED93341.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 446

 Score =  208 bits (529), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 105 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 164
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 263 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 319

Query: 165 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 224
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 320 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 379

Query: 225 DKLAKAMPHQIDQNDGIEEQTN 246
           +K AK      DQ+ G EE+TN
Sbjct: 380 NKNAKTAVKPADQSVGTEEETN 401


>gi|238481367|ref|NP_001154736.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|238481369|ref|NP_001154737.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005956|gb|AED93339.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005957|gb|AED93340.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 450

 Score =  208 bits (529), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 105 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 164
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 267 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 323

Query: 165 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 224
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 324 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 383

Query: 225 DKLAKAMPHQIDQNDGIEEQTN 246
           +K AK      DQ+ G EE+TN
Sbjct: 384 NKNAKTAVKPADQSVGTEEETN 405


>gi|240256338|ref|NP_197851.5| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005954|gb|AED93337.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 451

 Score =  208 bits (529), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 105 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 164
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 268 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 324

Query: 165 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 224
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 325 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 384

Query: 225 DKLAKAMPHQIDQNDGIEEQTN 246
           +K AK      DQ+ G EE+TN
Sbjct: 385 NKNAKTAVKPADQSVGTEEETN 406


>gi|238481373|ref|NP_001154739.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|257050999|sp|Q9FLU1.2|BIN4_ARATH RecName: Full=DNA-binding protein BIN4; AltName: Full=Protein
           BRASSINOSTEROID INSENSITIVE 4; AltName: Full=Protein
           MIDGET
 gi|332005959|gb|AED93342.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 454

 Score =  207 bits (528), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 105 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 164
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 271 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 327

Query: 165 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 224
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 328 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 387

Query: 225 DKLAKAMPHQIDQNDGIEEQTN 246
           +K AK      DQ+ G EE+TN
Sbjct: 388 NKNAKTAVKPADQSVGTEEETN 409


>gi|238481365|ref|NP_001154735.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005955|gb|AED93338.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 432

 Score =  207 bits (528), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 105 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 164
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 249 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 305

Query: 165 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 224
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 306 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 365

Query: 225 DKLAKAMPHQIDQNDGIEEQTN 246
           +K AK      DQ+ G EE+TN
Sbjct: 366 NKNAKTAVKPADQSVGTEEETN 387


>gi|147811095|emb|CAN76881.1| hypothetical protein VITISV_037245 [Vitis vinifera]
          Length = 693

 Score =  196 bits (499), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 129/286 (45%), Positives = 168/286 (58%), Gaps = 73/286 (25%)

Query: 2   EPFAPKSSILASSSDSESCQNSSLLREE----------KAFQFQEKEMDEDAALTGREEK 51
           EP AP  S+   SSDSESC ++S +RE+          K  QF +KE  +DA L+   ++
Sbjct: 128 EPNAPNHSVWXLSSDSESCPDNSSMREDYIHHEESSVHKTTQFPDKE--KDAVLSENGQE 185

Query: 52  TSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNND-----HEDA----EEDIAEKRSML 102
           + + + SKEKSPKK +K     +D I++ K+K N+D     H+D     EE+ ++K +  
Sbjct: 186 SLLNEASKEKSPKKRVKV----EDHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEP 241

Query: 103 NVSTSTLPLVMSEKLQRTKA---------------LVECEGESVDLSGDMGAVGRILVPG 147
            VS+S LPLV+SEK+ R+K                L+  +GES+DLSGDMGAVGRI    
Sbjct: 242 QVSSSGLPLVLSEKVHRSKGFRFSIEKAASLEIRPLLSVKGESIDLSGDMGAVGRI---- 297

Query: 148 TAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET 207
                                        VSFG SEAK+EA+MNDFIQLKPQSNVYEAET
Sbjct: 298 -----------------------------VSFGQSEAKVEAVMNDFIQLKPQSNVYEAET 328

Query: 208 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 253
           MVEG L+GF FDS+DE+DK+ KA   Q DQN+G EEQTN KTK K+
Sbjct: 329 MVEGILEGFSFDSDDEADKMPKASTRQTDQNEGDEEQTNGKTKGKA 374


>gi|357138685|ref|XP_003570920.1| PREDICTED: uncharacterized protein LOC100821932 [Brachypodium
           distachyon]
          Length = 358

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 86/137 (62%), Positives = 115/137 (83%), Gaps = 1/137 (0%)

Query: 91  AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 150
           AEE + +K    ++S   LPL++++K+QR+KALVEC+G+S+DLSGD+GAVGRI++     
Sbjct: 161 AEEQMQDKLPGNSIS-QRLPLMITDKVQRSKALVECDGDSIDLSGDIGAVGRIVISNGPT 219

Query: 151 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE 210
           GN ++ LDLKGT+YK+T+VPSRTFC+VS G +EAKIEAIMNDFIQL+P SN++EAETM+E
Sbjct: 220 GNQDLLLDLKGTVYKSTIVPSRTFCVVSVGQTEAKIEAIMNDFIQLEPHSNLFEAETMME 279

Query: 211 GTLDGFLFDSEDESDKL 227
           GTLDGF FDS+ E D+L
Sbjct: 280 GTLDGFTFDSDGEGDRL 296


>gi|255544304|ref|XP_002513214.1| conserved hypothetical protein [Ricinus communis]
 gi|223547712|gb|EEF49205.1| conserved hypothetical protein [Ricinus communis]
          Length = 155

 Score =  193 bits (491), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 88/117 (75%), Positives = 102/117 (87%)

Query: 137 MGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 196
           MGAVGR+++P  + GNHEM+LDLKGTIY+T +VPSRTFC+VSFG SEAKIEAIMNDFIQL
Sbjct: 1   MGAVGRVVIPDGSSGNHEMYLDLKGTIYRTKIVPSRTFCVVSFGQSEAKIEAIMNDFIQL 60

Query: 197 KPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 253
           K QSNVYEAETMVEGTL+GF FDSEDE++K+ K + HQ DQNDG EEQ N KTK K+
Sbjct: 61  KAQSNVYEAETMVEGTLEGFSFDSEDEAEKMPKPVSHQTDQNDGNEEQMNGKTKRKA 117


>gi|413935594|gb|AFW70145.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 354

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 92/165 (55%), Positives = 121/165 (73%), Gaps = 12/165 (7%)

Query: 90  DAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTA 149
           DAE+D+ +K +  +VS   LPL+  +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V    
Sbjct: 165 DAEDDVQDKLTGNSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGP 223

Query: 150 EGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMV 209
               ++ LDLKGTIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMV
Sbjct: 224 TAKQDLLLDLKGTIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMV 283

Query: 210 EGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 253
           E          ++E DKL +   +Q D N+  +E Q   KTK K+
Sbjct: 284 E----------DEEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 318


>gi|297598587|ref|NP_001045881.2| Os02g0147100 [Oryza sativa Japonica Group]
 gi|255670601|dbj|BAF07795.2| Os02g0147100, partial [Oryza sativa Japonica Group]
          Length = 126

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 79/118 (66%), Positives = 102/118 (86%)

Query: 122 ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 181
           AL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPSRTFC+VS G 
Sbjct: 1   ALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQ 60

Query: 182 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 239
           +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL +    Q DQN+
Sbjct: 61  TEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNN 118


>gi|410129774|dbj|BAM64852.1| hypothetical protein [Beta vulgaris]
          Length = 467

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 112/243 (46%), Positives = 154/243 (63%), Gaps = 40/243 (16%)

Query: 5   APKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPK 64
           AP  SI   SSDSE+  ++++ +       +EK  +E++A      +++VK+    KSPK
Sbjct: 242 APNHSICELSSDSETKSDATIRK------IEEKIDNEESA-----PRSAVKR----KSPK 286

Query: 65  KGLKS--PKKEKDVINNVKRKGNNDHEDAEEDI-AEKRSMLNVSTSTLPLVMSEKLQRTK 121
           K +K      EK      K +G  ++ D  E++  EK+S   VS+ST+PLV+ EK+QRTK
Sbjct: 287 KRVKQEDDATEKRTNEPAKIEGEVENMDVAEEVKPEKQSEPAVSSSTMPLVLPEKVQRTK 346

Query: 122 ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 181
           ALVECEGES+DLSGDMGAVGR+++  T     EMFLDLK ++                  
Sbjct: 347 ALVECEGESIDLSGDMGAVGRVIISETPSKESEMFLDLKVSLE----------------- 389

Query: 182 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGI 241
              +IEAIMNDFIQLKPQSNVY AETMVEGTL+GF F+S+DE+DK+ K +  + DQ++G 
Sbjct: 390 ---QIEAIMNDFIQLKPQSNVYNAETMVEGTLEGFTFESDDEADKMTKDV--KGDQSEGA 444

Query: 242 EEQ 244
           EE+
Sbjct: 445 EEK 447


>gi|45736024|dbj|BAD13051.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736066|dbj|BAD13092.1| unknown protein [Oryza sativa Japonica Group]
          Length = 193

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 80/138 (57%), Positives = 103/138 (74%), Gaps = 1/138 (0%)

Query: 135 GDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFI 194
           GD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFI
Sbjct: 39  GDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFI 98

Query: 195 QLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS- 253
           QL+PQSN++EAETM+EGTLDGF FDS++E DKL +    Q DQN+   +Q   KTK K+ 
Sbjct: 99  QLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKAE 158

Query: 254 GPCRKERYECWRTATATK 271
            P  K + +      ATK
Sbjct: 159 KPAGKGQKKAKVAGKATK 176


>gi|10177863|dbj|BAB11215.1| unnamed protein product [Arabidopsis thaliana]
          Length = 531

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 91/163 (55%), Positives = 108/163 (66%), Gaps = 28/163 (17%)

Query: 105 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK---- 160
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLK    
Sbjct: 331 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKEHFA 387

Query: 161 -----------------GTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVY 203
                            GTI  T    S     V+ G +EAKIEAIMNDFIQL PQSNVY
Sbjct: 388 LYVFSSSCTFSDCKGTCGTIIHT----SDLVFQVNVGQTEAKIEAIMNDFIQLIPQSNVY 443

Query: 204 EAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTN 246
           EAETMVEGTL+GF F+S+DES+K AK      DQ+ G EE+TN
Sbjct: 444 EAETMVEGTLEGFTFESDDESNKNAKTAVKPADQSVGTEEETN 486


>gi|168037576|ref|XP_001771279.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677368|gb|EDQ63839.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 379

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 66/125 (52%), Positives = 93/125 (74%), Gaps = 3/125 (2%)

Query: 104 VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTI 163
           ++ S+LP+V  EK+ RTK L+ECEG+++DLSGDMGAVGR  V      + E+ LDLKG +
Sbjct: 179 IAASSLPIVFGEKVNRTKVLLECEGDALDLSGDMGAVGRFTV---NRPDDELLLDLKGVV 235

Query: 164 YKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDE 223
           YKTT+VPS T+ +V+ G  EAK+E+IM DF+QL+  ++  E ETMVEGTL  F F+S++E
Sbjct: 236 YKTTIVPSNTYFVVNVGQMEAKVESIMTDFMQLRADTSWNENETMVEGTLKDFAFESDEE 295

Query: 224 SDKLA 228
            ++ A
Sbjct: 296 GERPA 300


>gi|168010889|ref|XP_001758136.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690592|gb|EDQ76958.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 373

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 64/122 (52%), Positives = 93/122 (76%), Gaps = 3/122 (2%)

Query: 105 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 164
           + S+LP+V  EK+ +TK L+ECEG+++DLSGDMGAVGR  V      ++E+ LDLKG IY
Sbjct: 172 AASSLPIVFGEKVNKTKVLLECEGDALDLSGDMGAVGRFTV---NRRDNELLLDLKGVIY 228

Query: 165 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 224
           KTT+VPS TF +V+ G ++AK+E+IM+DF+QL+  +   E ET+VEGTL  F F+S++E 
Sbjct: 229 KTTIVPSNTFFLVNVGQTDAKVESIMSDFVQLRADTIGKENETVVEGTLQDFTFESDEEG 288

Query: 225 DK 226
           ++
Sbjct: 289 ER 290


>gi|45736061|dbj|BAD13087.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 189

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 60/106 (56%), Positives = 85/106 (80%), Gaps = 4/106 (3%)

Query: 91  AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 150
           AEED+ +K +  +VS   LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  
Sbjct: 87  AEEDMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 145

Query: 151 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 196
           GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAK   +++D I L
Sbjct: 146 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAK---VVDDNINL 188


>gi|224132538|ref|XP_002321341.1| predicted protein [Populus trichocarpa]
 gi|222868337|gb|EEF05468.1| predicted protein [Populus trichocarpa]
          Length = 159

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 75/178 (42%), Positives = 91/178 (51%), Gaps = 54/178 (30%)

Query: 85  NNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRIL 144
           + D E  EE  +EK    +VS S LPLV+SEK+QR+KALVECEGES+DLSG+MGAVGR +
Sbjct: 20  HGDVEITEEVTSEKLINAHVSISRLPLVLSEKVQRSKALVECEGESIDLSGEMGAVGRPV 79

Query: 145 VPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYE 204
           +P T  G+ EM LDLKGT                                          
Sbjct: 80  IPDTPSGDSEMHLDLKGT------------------------------------------ 97

Query: 205 AETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRK 258
                   L+GF F SEDE+DK+ KA  HQ DQN  +EE +N   K K    SG  RK
Sbjct: 98  --------LEGFSFFSEDETDKITKATAHQTDQNMDVEEPSNGMIKRKAEKSSGVARK 147


>gi|384247484|gb|EIE20971.1| hypothetical protein COCSUDRAFT_83530 [Coccomyxa subellipsoidea
           C-169]
          Length = 378

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 47/105 (44%), Positives = 68/105 (64%), Gaps = 5/105 (4%)

Query: 98  KRSMLNVSTSTLPLVMSEKLQRTKALVECEG---ESVDLSGDMGAVGRILVPGTAEGNHE 154
           K ++L      +P++M EKL  TK +VE EG    SVDL+GD G VGR LV G  + N E
Sbjct: 185 KIAVLAPVAGEIPVMMPEKLPVTKFIVELEGGNDSSVDLTGDAGVVGRWLVEGD-DSNPE 243

Query: 155 MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKP 198
           + +DLKG +Y  T VP + T C+V+   +EA++EA+ + +IQL+P
Sbjct: 244 LRMDLKGVVYAATTVPCATTLCVVNLSATEARVEALGHQYIQLRP 288


>gi|303275097|ref|XP_003056848.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
 gi|226461200|gb|EEH58493.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
          Length = 392

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/119 (39%), Positives = 72/119 (60%), Gaps = 8/119 (6%)

Query: 109 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 166
           +PL+M   + R K   ECEG  ESVDL GD+G VGR+L   +A+    + +DLKG IY  
Sbjct: 195 IPLLMPATVNRNKIFFECEGAGESVDLEGDVGVVGRLLSDPSAKYTSGVKVDLKGVIYDA 254

Query: 167 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFL-FDSEDE 223
            ++P+  +  +++   +EAK+E I ND++QL+           V GTL+G+L  DS+D+
Sbjct: 255 KILPTPASIVVLAVNQTEAKVECIANDYVQLRRDVTANN----VGGTLEGYLGVDSDDD 309


>gi|307107138|gb|EFN55382.1| hypothetical protein CHLNCDRAFT_52595 [Chlorella variabilis]
          Length = 402

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 73/125 (58%), Gaps = 8/125 (6%)

Query: 96  AEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES------VDLSGDMGAVGRILVPGTA 149
           A    + +  TS +P+V+ EKL + K LVE E          DLSGD GA+GR+++ GT 
Sbjct: 168 AAGLGLPHAPTSQMPVVLPEKLPQMKMLVELESNPDDAHHVTDLSGDSGAIGRLVIAGTK 227

Query: 150 EGNHEMFLDLKGTIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETM 208
           E   EM +DLKG +Y  T+VP   +  +++ G +EAK+E + ++F+QL+  +     +T 
Sbjct: 228 EAP-EMQIDLKGVLYNATIVPCPVSIAVINMGQTEAKVECLFSEFVQLREDTRFSFVDTE 286

Query: 209 VEGTL 213
           V G +
Sbjct: 287 VLGGM 291


>gi|449532507|ref|XP_004173222.1| PREDICTED: DNA-binding protein BIN4-like, partial [Cucumis sativus]
          Length = 198

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 56/154 (36%), Positives = 86/154 (55%), Gaps = 24/154 (15%)

Query: 7   KSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKG 66
           K S+   S DSESC +++ ++E+ ++  +  E+   + + GR +         E +  +G
Sbjct: 69  KHSVWMLSLDSESCSDNNFIKEDYSYHEELAEL-ATSEIQGRRKD--------ENAGSEG 119

Query: 67  LKSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVEC 126
                       +V+   N D E  E+D  +      VS+S LPLV+S+K  R KALVEC
Sbjct: 120 ------------HVR---NGDVEILEKDALDDCIGPPVSSSRLPLVLSDKAHRLKALVEC 164

Query: 127 EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK 160
           EG S+DLSGDMGAVGR++V  ++   +E+ LDLK
Sbjct: 165 EGTSIDLSGDMGAVGRVVVSDSSSAKNELCLDLK 198


>gi|255084389|ref|XP_002508769.1| predicted protein [Micromonas sp. RCC299]
 gi|226524046|gb|ACO70027.1| predicted protein [Micromonas sp. RCC299]
          Length = 376

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 72/121 (59%), Gaps = 11/121 (9%)

Query: 109 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 166
           +PL M   + R K   E EG  E+VDL GD+G VGR+L   + +    + +D+KG IY  
Sbjct: 178 VPLFMPSAVNRNKVFFELEGTGEAVDLEGDVGVVGRLLSNASEKHGSGLQMDMKGVIYNA 237

Query: 167 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLK--PQSNVYEAETMVEGTLDGFL-FDSED 222
            ++P+  +  I++   +EAK+E+I +DF+QL+  P +N       +  TLDG+L  DS+D
Sbjct: 238 RILPTPASVVILAVNANEAKVESITHDFVQLRRDPVAN-----GGLGATLDGYLGVDSDD 292

Query: 223 E 223
           E
Sbjct: 293 E 293


>gi|320166916|gb|EFW43815.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 236

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 68/110 (61%), Gaps = 3/110 (2%)

Query: 109 LPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTL 168
           +PLV+ +    T  L+E     ++LSGD G+VGR++V    +    + LDLKG I++ T+
Sbjct: 53  IPLVLGKP--ATAFLMETTEPQLNLSGDTGSVGRLVV-NAQDAAAPLVLDLKGLIFQGTV 109

Query: 169 VPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLF 218
           VPS TF +V+ G +EA+I+A +N+ I      +  ++ T+VEG++D   F
Sbjct: 110 VPSNTFMLVTIGPNEARIDATVNNVIHTTCLGDALDSMTVVEGSIDADAF 159


>gi|255544302|ref|XP_002513213.1| conserved hypothetical protein [Ricinus communis]
 gi|223547711|gb|EEF49204.1| conserved hypothetical protein [Ricinus communis]
          Length = 137

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 9/125 (7%)

Query: 5   APKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPK 64
           A K SI A SSDSESC + S  R+ K+ Q +E   ++DA L       SV+K  K KSPK
Sbjct: 17  AAKRSIWALSSDSESCPDDSSQRKTKSNQPKESRKEKDAILVD----NSVEKALKGKSPK 72

Query: 65  KGLK----SPKKEKDVINNVKRKGNNDH-EDAEEDIAEKRSMLNVSTSTLPLVMSEKLQR 119
           K LK      KK+++  ++V + G+ND  E A+ED++EK +   VSTS +PL++S+K+ R
Sbjct: 73  KLLKVEGRKLKKKENKNDDVPKTGDNDDVEVADEDVSEKHTGPQVSTSMIPLLLSDKVSR 132

Query: 120 TKALV 124
           TK  +
Sbjct: 133 TKMFL 137


>gi|428174918|gb|EKX43811.1| hypothetical protein GUITHDRAFT_140250 [Guillardia theta CCMP2712]
          Length = 309

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 52/82 (63%), Gaps = 5/82 (6%)

Query: 123 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHS 182
           +V+CE +++D  GD G +G I       G   M LDLKG +Y+ +++P  T  ++  G +
Sbjct: 164 VVQCEDQNLDFEGDSGVIGTI-----KTGERSMELDLKGQLYEGSILPCCTMMVMDMGAT 218

Query: 183 EAKIEAIMNDFIQLKPQSNVYE 204
           E KIE++MNDF+++ P +N ++
Sbjct: 219 EVKIESVMNDFVKVHPIANPHD 240


>gi|302857504|ref|XP_002959890.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
 gi|300254022|gb|EFJ39045.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
          Length = 340

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 35/129 (27%)

Query: 104 VSTSTLPLVMSEKLQRTKALVEC-----EGE----SVDLSGDMGAVGRILVPGTAEGNH- 153
           V    LP+++ EKL R K L+E       GE    + DLSGD GAVGR++V      N  
Sbjct: 38  VEAGKLPIMLPEKLNRNKVLLELPPAAMSGEHYHGAADLSGDSGAVGRLVVGKAPAANSK 97

Query: 154 ------------------------EMFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEA 188
                                   ++ +DLKG +Y   ++P + T  +++ G SEAK+E+
Sbjct: 98  GTAAAAAAAAAGGSEGGGSGGDSVQLQVDLKGLLYNAWVLPLAGTAMVLNIGPSEAKVES 157

Query: 189 IMNDFIQLK 197
           + NDFI+L+
Sbjct: 158 LFNDFIRLR 166


>gi|449526337|ref|XP_004170170.1| PREDICTED: DNA-binding protein BIN4-like [Cucumis sativus]
          Length = 71

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 35/53 (66%), Positives = 41/53 (77%), Gaps = 1/53 (1%)

Query: 190 MNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE 242
           MNDFIQLK  S V EAETMVEGTLDGF FDSED+++K+ KA     DQN+ +E
Sbjct: 1   MNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDDAEKITKA-ASPADQNEPVE 52


>gi|299471419|emb|CBN79372.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 240

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 68/120 (56%), Gaps = 7/120 (5%)

Query: 109 LPLVMSEKL---QRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 165
           +PL++   +    +   LV    E+++ +GD GA+GRI V G  +G    F DLKG  + 
Sbjct: 59  VPLIVPPAVCKDAKASLLVHLRDEALNFAGDTGAIGRISVGGKKKGRGLTF-DLKGQQFS 117

Query: 166 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET--MVEGTL-DGFLFDSED 222
             ++PS T  +V  G +EAK+E + ++  +L  + NV++A    +++G + D + F+ ED
Sbjct: 118 GQIIPSATVMVVGIGKTEAKVEGLFSEVCRLDYKGNVFDAMGGEVLKGEMDDSYKFEEED 177


>gi|412989095|emb|CCO15686.1| hypothetical protein VOLCADRAFT_101408 [Bathycoccus prasinos]
          Length = 422

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 24/117 (20%)

Query: 104 VSTSTLPLVMSE-KLQRTK---ALVECEG--ESVDLSGDMGAVGRILV-----PGTAEG- 151
           V  S +PL+++   LQ+T     L EC+G  E+VDLSGD+G VGR+LV     P    G 
Sbjct: 237 VPKSKMPLLINHGALQKTANSTILFECKGSGEAVDLSGDVGVVGRLLVERESQPTGGGGS 296

Query: 152 ---------NHEMFLDLKGTIY--KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLK 197
                    +  + +DLKG +Y  +T L+ S T  IV+ G  +AKIE   ++++QL+
Sbjct: 297 KASTSKQQPHASVKVDLKGVMYLARTHLL-SSTVAIVNVGAEDAKIETFFDEYVQLR 352


>gi|308804497|ref|XP_003079561.1| unnamed protein product [Ostreococcus tauri]
 gi|116058016|emb|CAL54219.1| unnamed protein product [Ostreococcus tauri]
          Length = 229

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 70/124 (56%), Gaps = 11/124 (8%)

Query: 105 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 161
           S++ + LV S  L  ++K L E EG  E+VDL GD GAVGR L    AE    + +D+KG
Sbjct: 63  SSTRVDLVASAHLGGKSKVLFEIEGNGEAVDLDGDTGAVGRWL----AESARSLKVDMKG 118

Query: 162 TIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDS 220
            +Y   +VP + T  +V+     AKI+++  +F+QL+ +  V  A T   G   G LF+ 
Sbjct: 119 LMYNARVVPLAGTVVVVAVNGDVAKIDSVHREFVQLR-EDPVANAGTENYGV--GSLFEM 175

Query: 221 EDES 224
            DE+
Sbjct: 176 TDEA 179


>gi|159483423|ref|XP_001699760.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281702|gb|EDP07456.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 308

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 64/138 (46%), Gaps = 41/138 (29%)

Query: 107 STLPLVMSEKLQRTKALVECEGE---------------SVDLSGDMGAVGRILVPGTAEG 151
            +LPL++ +K+ R K L E                   + DLSGD GAVGR++V   A  
Sbjct: 140 GSLPLMLPDKINRNKVLFELPPPPASSSNADGYGHGHGATDLSGDAGAVGRLIVLAAAPT 199

Query: 152 NHE-------------------------MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAK 185
             +                         + +DLKG IY    +P + T  +++ G +EAK
Sbjct: 200 GGKGAAAGGGEPGPSSQGGGGPGGAEVCLQVDLKGVIYNAWALPLAGTAMVLNVGPTEAK 259

Query: 186 IEAIMNDFIQLKPQSNVY 203
           +E++ NDF++L+  ++V+
Sbjct: 260 VESLFNDFVRLREAADVH 277


>gi|356516190|ref|XP_003526779.1| PREDICTED: uncharacterized protein LOC100812582 [Glycine max]
          Length = 82

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/46 (56%), Positives = 36/46 (78%), Gaps = 1/46 (2%)

Query: 208 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 253
           MVEGTLDGF FDS++E+ K+ K+  +Q DQN+ +EEQ N K+K K+
Sbjct: 1   MVEGTLDGFFFDSDEEAGKMQKST-NQTDQNENVEEQANGKSKGKT 45


>gi|145347277|ref|XP_001418100.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578328|gb|ABO96393.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 999

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 58/97 (59%), Gaps = 8/97 (8%)

Query: 105 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 161
           S++ + LV +  L  ++K L E EG  E+VDL GD GAVGR L    AE +  + +D+KG
Sbjct: 33  SSTAVELVATANLGGKSKVLFEIEGAGEAVDLDGDTGAVGRWL----AESSRALKVDMKG 88

Query: 162 TIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLK 197
            +Y   +V S  T  +V+     AKIE++  +F+QL+
Sbjct: 89  VMYNARVVSSAGTVVVVAVNADVAKIESVHREFVQLR 125


>gi|397569025|gb|EJK46493.1| hypothetical protein THAOC_34834 [Thalassiosira oceanica]
          Length = 315

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 123 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH- 181
           LV+   + +DL G  GAVGR+           + LDLKG  Y+ T+ P  T  +V+    
Sbjct: 175 LVQMNDDKLDLHGQSGAVGRLEA-----DERGVILDLKGYQYRGTIRPGPTAMVVALTRE 229

Query: 182 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSED 222
            + K+E+I ++F+ L  +   +  +++ +   +GF+   ED
Sbjct: 230 GKFKVESITDEFVTLDAK-GAHVMDSLGDNMREGFMVREED 269


>gi|422294628|gb|EKU21928.1| hypothetical protein NGA_2017520, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 238

 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 132 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 191
           DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAK+E +++
Sbjct: 40  DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVEDVVD 95

Query: 192 DFIQLKPQSNVYEA 205
               L    +  EA
Sbjct: 96  TLCVLTHHHDELEA 109


>gi|219116542|ref|XP_002179066.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409833|gb|EEC49764.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 289

 Score = 45.4 bits (106), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 14/105 (13%)

Query: 123 LVECEGE---SVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSF 179
           LV+ + E   S+D +G +GA+GR+     AEG   + LD KG  Y+  + P  T  ++S 
Sbjct: 150 LVQIDPEDSASLDFTGAVGAIGRLET--VAEG---VVLDFKGCQYRGRIYPGSTAVVLST 204

Query: 180 GHSEA-----KIEAIMNDFIQLKPQSNVYE-AETMVEGTLDGFLF 218
              +A     K+E++ ++F +L  + N  E    +V G + G+ +
Sbjct: 205 NGKKADTQLLKVESMTDEFCRLTKEGNAMEKLNAIVRGDMVGYQY 249


>gi|387199325|gb|AFJ68895.1| hypothetical protein NGATSA_2017510 [Nannochloropsis gaditana
           CCMP526]
          Length = 226

 Score = 44.7 bits (104), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 132 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 191
           DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAK++ +++
Sbjct: 28  DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVKDVVD 83

Query: 192 DFIQLKPQSNVYEA 205
               L    +  EA
Sbjct: 84  TLCVLTHHHDELEA 97


>gi|449018377|dbj|BAM81779.1| hypothetical protein CYME_CMP364C [Cyanidioschyzon merolae strain
           10D]
          Length = 263

 Score = 43.9 bits (102), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 51/100 (51%), Gaps = 10/100 (10%)

Query: 122 ALVECEGESVDLSGDMGAVGRILVPGTAE-GNHEMFLDLKGTIYKTTLVPSRTFCI---- 176
           AL+  E +  DL+GD G +G +     A+ G   + LDLKG +Y+T     RTF +    
Sbjct: 110 ALLVFENDEFDLAGDTGTIGCVQWSHNADSGAVRLCLDLKGVVYET-----RTFSLCGSA 164

Query: 177 VSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGF 216
           ++   +  ++EA+++D     P +   E   ++ G ++ +
Sbjct: 165 LAVDPTSQRVEAVLHDITYSIPTNVADERAQLIRGKIEPY 204


>gi|293394374|ref|ZP_06638674.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
 gi|291423352|gb|EFE96581.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
          Length = 392

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 65/138 (47%), Gaps = 14/138 (10%)

Query: 68  KSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA-LVEC 126
           + P+++   +N    +G+     + +DI  KR+  +  TS  P +  E L+   + L + 
Sbjct: 21  RDPEQQSGFVNAPVYRGSTVVYKSCDDIRHKRARFSYGTSGTPTI--ENLENAWSTLCDA 78

Query: 127 EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFC---IVSFGHSE 183
            G  +  SG +GA+   L   T  G+H +  D   ++Y+    P+R FC   +V +G S 
Sbjct: 79  AGTVISPSG-LGALALALFSVTRAGDHILVTD---SVYR----PTRKFCNTLLVKYGVSV 130

Query: 184 AKIEAIMNDFIQLKPQSN 201
              + ++   I+   Q+N
Sbjct: 131 TYYDPLIGGDIERLLQNN 148


>gi|307109224|gb|EFN57462.1| hypothetical protein CHLNCDRAFT_142962 [Chlorella variabilis]
          Length = 642

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 123 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVP-SRTFCIV 177
           L E  G  ++L GD G +GR+ V   +     + LDL G +Y+  LVP + T  +V
Sbjct: 441 LCEVPGHLLNLEGDSGTIGRVAVERGSGKEAALQLDLMGIVYQAELVPLAATAAVV 496


>gi|70946242|ref|XP_742856.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56522065|emb|CAH75695.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 925

 Score = 38.1 bits (87), Expect = 4.7,   Method: Composition-based stats.
 Identities = 29/99 (29%), Positives = 49/99 (49%), Gaps = 20/99 (20%)

Query: 8   SSILASSSDSE-SCQN--------------SSLLREEKAFQFQEKEMDEDAALTGREEKT 52
           S+IL S  +++ SCQN               +L++ + AF   EK+ DED A    + KT
Sbjct: 450 STILNSYEEADYSCQNELNMGVPNFIEENKQNLIKNDTAFSKNEKKSDEDIAPFIEDTKT 509

Query: 53  SVKKVS-----KEKSPKKGLKSPKKEKDVINNVKRKGNN 86
           ++  +S      E+S KK  K+  +  D+ N + ++ NN
Sbjct: 510 AINDMSITNNISEQSEKKITKNNNEINDMYNGLSKRKNN 548


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.307    0.125    0.338 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,038,417,431
Number of Sequences: 23463169
Number of extensions: 163709775
Number of successful extensions: 762419
Number of sequences better than 100.0: 998
Number of HSP's better than 100.0 without gapping: 177
Number of HSP's successfully gapped in prelim test: 821
Number of HSP's that attempted gapping in prelim test: 751130
Number of HSP's gapped (non-prelim): 9864
length of query: 283
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 142
effective length of database: 9,050,888,538
effective search space: 1285226172396
effective search space used: 1285226172396
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 76 (33.9 bits)