BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019746
         (336 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359490779|ref|XP_002269798.2| PREDICTED: uncharacterized protein LOC100252112 [Vitis vinifera]
 gi|302144012|emb|CBI23117.3| unnamed protein product [Vitis vinifera]
          Length = 397

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 194/377 (51%), Positives = 235/377 (62%), Gaps = 73/377 (19%)

Query: 1   MSGSREESPDWLRSFQAPTHSVLTLSSDSDS-------------DSSHN----------- 36
           M  SREESPDWL  FQAPT S LTLSSDSDS             DS  N           
Sbjct: 1   MGSSREESPDWLHCFQAPTRSALTLSSDSDSPPEDSPLREDKDADSPFNETSKAKSPRKR 60

Query: 37  -------------------GNGDGGKNIAELTFEKHMEPFAPKSSILASSSDSESCQNSS 77
                                G+ G+  A +      EP AP  S+   SSDSESC ++S
Sbjct: 61  LKAEDPKSTKKQKVNNHKKKEGNEGRKKASVEKPSDGEPNAPNHSVWKLSSDSESCPDNS 120

Query: 78  LLREE----------KAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEK 127
            +RE+          K  QF +KE  +DA L+   +++ + + SKEKSPKK +K     +
Sbjct: 121 SMREDYIHHEESSVHKTTQFPDKE--KDAVLSENGQESLLNEASKEKSPKKRVKV----E 174

Query: 128 DVINNVKRKGNND-----HEDA----EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVE 178
           D I++ K+K N+D     H+D     EE+ ++K +   VS S LPLV+SEK+ R+KALVE
Sbjct: 175 DHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEPQVS-SGLPLVLSEKVHRSKALVE 233

Query: 179 CEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAK 238
           CEGES+DLSGDMGAVGRI++  T   NHEMFLDLKGTIYKTT+VPSRTFCIVSFG SEAK
Sbjct: 234 CEGESIDLSGDMGAVGRIVILDTPSRNHEMFLDLKGTIYKTTIVPSRTFCIVSFGQSEAK 293

Query: 239 IEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQT 298
           +EA+MNDFIQLKPQSNVYEAETMVEG L+GF FDS+DE+DK+ KA   Q DQN+G EEQT
Sbjct: 294 VEAVMNDFIQLKPQSNVYEAETMVEGILEGFSFDSDDEADKMPKASTRQTDQNEGDEEQT 353

Query: 299 NEKTKVK----SGPCRK 311
           N KTK K    SG  RK
Sbjct: 354 NGKTKGKAEKASGVVRK 370


>gi|449459184|ref|XP_004147326.1| PREDICTED: uncharacterized protein LOC101208657 [Cucumis sativus]
          Length = 391

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 158/367 (43%), Positives = 219/367 (59%), Gaps = 61/367 (16%)

Query: 1   MSGSREESPDWLRSFQAPTHSVLTLSS--------------DSDSDSSHNGNGD------ 40
           MS SRE+SPDW+RSFQAPT   L+ +S              D    SSH    D      
Sbjct: 1   MSSSREQSPDWMRSFQAPTGVALSSNSGSSKNGSSSMDNAIDQRDPSSHKTTQDLDGDQI 60

Query: 41  ----GGKNIA-ELTFEKHMEPFAPKSSILASSSDSESCQNSSLLREEKAF---------- 85
               G  N+A E+  ++H      K S+   S DSESC +++ ++E+ ++          
Sbjct: 61  QGDCGNHNLAKEVKLDRHTGHENSKHSVWMLSLDSESCSDNNFIKEDYSYHEELAELATS 120

Query: 86  QFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKS---PKKEKDVINNVKRKGNNDHE 142
           + Q +  DE+A     E K+  +KVS E SPKK +KS      +++++N+   KG +  E
Sbjct: 121 EIQGRRKDENAGRRFTEGKSKSRKVSNEMSPKKKVKSEVCTSAKENIMNSGTNKGGSTME 180

Query: 143 DAEE-------DIAEKRSMLN-----VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDM 190
            +E        +I EK ++ +     VS+S LPLV+S+K  R KALVECEG S+DLSGDM
Sbjct: 181 GSEGHVRNGDVEILEKDALDDCIGPPVSSSRLPLVLSDKAHRLKALVECEGTSIDLSGDM 240

Query: 191 GAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLK 250
           GAVGR++V  ++   +E+ LDLKGT+Y+  +VPSRTFCIVSFG SEAKIE+IMNDFIQLK
Sbjct: 241 GAVGRVVVSDSSSAKNELCLDLKGTLYRAVIVPSRTFCIVSFGQSEAKIESIMNDFIQLK 300

Query: 251 PQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE----------EQTNE 300
             S V EAETMVEGTLDGF FDSED+++K+ KA     DQN+ +E          E+++ 
Sbjct: 301 ALSKVDEAETMVEGTLDGFSFDSEDDAEKITKA-ASPADQNEPVEGLNTKSKNKAEKSSG 359

Query: 301 KTKVKSG 307
           + +VK+G
Sbjct: 360 RKRVKTG 366


>gi|224121130|ref|XP_002318505.1| predicted protein [Populus trichocarpa]
 gi|222859178|gb|EEE96725.1| predicted protein [Populus trichocarpa]
          Length = 187

 Score =  243 bits (621), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 118/159 (74%), Positives = 135/159 (84%), Gaps = 5/159 (3%)

Query: 159 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 218
           TS LPLV+SEK+QR+KALVECEGES+DLSGDMGAVGR+++P T  GN EM+LDLKGTIY+
Sbjct: 1   TSRLPLVLSEKVQRSKALVECEGESIDLSGDMGAVGRVVIPDTPSGNSEMYLDLKGTIYR 60

Query: 219 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVE-GTLDGFLFDSEDES 277
           TT+VPSRTFC+VSFG SEAKIEAIMNDFIQLK QSNVYEAETMVE GTL+GF FDSEDE+
Sbjct: 61  TTIVPSRTFCVVSFGQSEAKIEAIMNDFIQLKTQSNVYEAETMVEVGTLEGFSFDSEDET 120

Query: 278 DKLAKAMPHQIDQNDGIEEQTNEKTKVK----SGPCRKE 312
           DK+ KA   Q DQN+G+EE  N KTK K    SG  RK+
Sbjct: 121 DKITKATALQTDQNEGVEEPANGKTKRKPVKSSGVARKK 159


>gi|218190056|gb|EEC72483.1| hypothetical protein OsI_05847 [Oryza sativa Indica Group]
          Length = 342

 Score =  233 bits (595), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 145/341 (42%), Positives = 195/341 (57%), Gaps = 36/341 (10%)

Query: 3   GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHN--------GNGDGGKNIAELTFEKHM 54
           G  EE PDWLR+FQ PT S + LSS SD DS  N        G    G+N A        
Sbjct: 2   GEEEEDPDWLRAFQPPTTSTVMLSSGSD-DSPENSPTRTTPSGEEQKGENKASSDHAGDG 60

Query: 55  EPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEK 114
           +     ++ L     +   +  +   +E AF   EK   E          ++  K  K  
Sbjct: 61  D-----AAALNKGKKATPTRRKTPTSQEDAFDKDEKPTME----------SNQDKPPKRS 105

Query: 115 SPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVSTSTLPL 164
           +PKK L  P    +       K   D  D          AEED+ +K +  +VS   LPL
Sbjct: 106 TPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEDMQDKLTEHSVS-QRLPL 164

Query: 165 VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPS 224
           ++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPS
Sbjct: 165 IIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPS 224

Query: 225 RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAM 284
           RTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL +  
Sbjct: 225 RTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPH 284

Query: 285 PHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 324
             Q DQN+   +Q   KTK K+  P  K + +      ATK
Sbjct: 285 ASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325


>gi|115444213|ref|NP_001045886.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|113535417|dbj|BAF07800.1| Os02g0147700 [Oryza sativa Japonica Group]
 gi|215765268|dbj|BAG86965.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222622166|gb|EEE56298.1| hypothetical protein OsJ_05371 [Oryza sativa Japonica Group]
          Length = 342

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 144/341 (42%), Positives = 195/341 (57%), Gaps = 36/341 (10%)

Query: 3   GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHN--------GNGDGGKNIAELTFEKHM 54
           G  EE PDWLR+FQ PT S + LSS SD DS  N        G    G+N A        
Sbjct: 2   GEEEEDPDWLRAFQPPTTSTVMLSSGSD-DSPENSPTRTTPSGEEQKGENKASSDHAGDG 60

Query: 55  EPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEK 114
           +     ++ L     +   +  +   +E AF   EK   E          ++  K  K  
Sbjct: 61  D-----AAALNKGKKATPTRRKTPTSQEDAFDKDEKPTME----------SNQDKPPKRS 105

Query: 115 SPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVSTSTLPL 164
           +PKK L  P    +       K   D  D          AEE++ +K +  +VS   LPL
Sbjct: 106 TPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEEMQDKLTEHSVS-QRLPL 164

Query: 165 VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPS 224
           ++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPS
Sbjct: 165 IIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPS 224

Query: 225 RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAM 284
           RTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL +  
Sbjct: 225 RTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPH 284

Query: 285 PHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 324
             Q DQN+   +Q   KTK K+  P  K + +      ATK
Sbjct: 285 ASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 325


>gi|195625200|gb|ACG34430.1| hypothetical protein [Zea mays]
 gi|413935593|gb|AFW70144.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 364

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 141/333 (42%), Positives = 198/333 (59%), Gaps = 35/333 (10%)

Query: 3   GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHNGNGDGGKNIAELTFEKHMEPFAPKSS 62
           G  E+ PDWLR+FQAPT + + LSS SD+ S          +  +   EKH  P   +  
Sbjct: 2   GDEEDDPDWLRAFQAPTVAPVMLSSGSDT-SPEASPTRTSTSRKQNKGEKHASPDHARDR 60

Query: 63  ILASSSD---SESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKG 119
             AS +    S + +  +L+ +++     EK+ +    LT +++  ++   S + SP   
Sbjct: 61  DGASQNKNKISAATRRKNLVSKKEGSTMDEKQANTPRRLTPKKDMVTLSSGS-DASPGNS 119

Query: 120 LKSPKK---EKDVINNVKRK----------------------GNNDHEDAEEDIAEKRSM 154
           L    +   E+  ++  KRK                      GN D   AE+D+ +K + 
Sbjct: 120 LSRAHEDNHEEGSLSTAKRKNAQQTKTKKTKDAGTKTGADQAGNGD---AEDDVQDKLTG 176

Query: 155 LNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
            +VS   LPL+  +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V        ++ LDLKG
Sbjct: 177 NSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGPTAKQDLLLDLKG 235

Query: 215 TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSE 274
           TIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMVEGTLDGF FDS+
Sbjct: 236 TIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMVEGTLDGFTFDSD 295

Query: 275 DESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 306
           +E DKL +   +Q D N+  +E Q   KTK K+
Sbjct: 296 EEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 328


>gi|147811095|emb|CAN76881.1| hypothetical protein VITISV_037245 [Vitis vinifera]
          Length = 693

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 160/383 (41%), Positives = 202/383 (52%), Gaps = 116/383 (30%)

Query: 1   MSGSREESPDWLRSFQAPTHSVLTLSSDSDS-------------DSSHN----------- 36
           M  SREESPDWL  FQAPT S LTLSSDSDS             DS  N           
Sbjct: 31  MGSSREESPDWLHCFQAPTRSALTLSSDSDSPPEDSPLREDKDADSPFNETSKAKSPRKR 90

Query: 37  -------------------GNGDGGKNIAELTFEKHMEPFAPKSSILASSSDSESCQNSS 77
                                G+ G+  A +      EP AP  S+   SSDSESC ++S
Sbjct: 91  LKAEXPKSTKKQKVNNHKKKEGNEGRKKASVEKPSDGEPNAPNHSVWXLSSDSESCPDNS 150

Query: 78  LLREE----------KAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEK 127
            +RE+          K  QF +KE  +DA L+   +++ + + SKEKSPKK +K     +
Sbjct: 151 SMREDYIHHEESSVHKTTQFPDKE--KDAVLSENGQESLLNEASKEKSPKKRVKV----E 204

Query: 128 DVINNVKRKGNND-----HEDA----EEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA--- 175
           D I++ K+K N+D     H+D     EE+ ++K +   VS+S LPLV+SEK+ R+K    
Sbjct: 205 DHISSKKKKANSDMKGKGHDDGTELLEEEASDKHTEPQVSSSGLPLVLSEKVHRSKGFRF 264

Query: 176 ------------LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVP 223
                       L+  +GES+DLSGDMGAVGRI                           
Sbjct: 265 SIEKAASLEIRPLLSVKGESIDLSGDMGAVGRI--------------------------- 297

Query: 224 SRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKA 283
                 VSFG SEAK+EA+MNDFIQLKPQSNVYEAETMVEG L+GF FDS+DE+DK+ KA
Sbjct: 298 ------VSFGQSEAKVEAVMNDFIQLKPQSNVYEAETMVEGILEGFSFDSDDEADKMPKA 351

Query: 284 MPHQIDQNDGIEEQTNEKTKVKS 306
              Q DQN+G EEQTN KTK K+
Sbjct: 352 STRQTDQNEGDEEQTNGKTKGKA 374


>gi|356507532|ref|XP_003522518.1| PREDICTED: uncharacterized protein LOC100795660 [Glycine max]
          Length = 380

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 115/194 (59%), Positives = 147/194 (75%), Gaps = 10/194 (5%)

Query: 122 SPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNV---STSTLPLVMSEKLQRTKALVE 178
           +P K K++  + K KG  D +  EE+  EK +  NV   S++ LPL++SEK+QRTKAL+E
Sbjct: 151 TPVKGKEIKASAKGKGIGDLKVEEEETCEKPAEPNVRMVSSARLPLMLSEKVQRTKALIE 210

Query: 179 CEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK------GTIYKTTLVPSRTFCIVSF 232
           C+G+S+DLSGDMGAVGRI++  +  G+ EM LDLK      GTIYKT++VP RTFC+VSF
Sbjct: 211 CQGDSIDLSGDMGAVGRIIISDSPSGDQEMCLDLKDFAVISGTIYKTSIVPCRTFCVVSF 270

Query: 233 GHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 292
           G SEAK+EAIMNDFIQLKP SNVYEAETMVEGTLDGF FDS++ES K+ K+  +Q DQN+
Sbjct: 271 GQSEAKVEAIMNDFIQLKPHSNVYEAETMVEGTLDGFFFDSDEESGKIQKST-NQTDQNE 329

Query: 293 GIEEQTNEKTKVKS 306
             EEQ N  +K K+
Sbjct: 330 NAEEQANGNSKGKA 343


>gi|357464307|ref|XP_003602435.1| DNA-binding protein BIN4 [Medicago truncatula]
 gi|355491483|gb|AES72686.1| DNA-binding protein BIN4 [Medicago truncatula]
          Length = 416

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 111/183 (60%), Positives = 140/183 (76%), Gaps = 1/183 (0%)

Query: 124 KKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES 183
           K +K  +N   + G+ D E  +E+  EK    NVS+S LPL++SEK+ RTKALVEC+G+S
Sbjct: 198 KGKKTKVNAKGKGGDGDVEVKDEETFEKPVEPNVSSSILPLMLSEKVHRTKALVECQGDS 257

Query: 184 VDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIM 243
           +DLSGDMGAVGR+++  +  G+ EM LDLKGTIYKT+++PSRTFC+VSFG +EAKIEAIM
Sbjct: 258 IDLSGDMGAVGRVIISDSPSGDQEMCLDLKGTIYKTSIIPSRTFCVVSFGQTEAKIEAIM 317

Query: 244 NDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTK 303
           NDFIQL P S+VYEAETMVEGTLDGF FDS++E  K  K   H  DQN+  EEQT  K+K
Sbjct: 318 NDFIQLNPPSDVYEAETMVEGTLDGFSFDSDEEDGKRQKTT-HPSDQNEHAEEQTPGKSK 376

Query: 304 VKS 306
            K+
Sbjct: 377 RKA 379



 Score = 41.6 bits (96), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 20/31 (64%), Positives = 25/31 (80%), Gaps = 1/31 (3%)

Query: 1  MSGSREESPDWLRSFQAPT-HSVLTLSSDSD 30
          MS S + SPDWLRS+Q PT +S +TLSSDS+
Sbjct: 1  MSSSGDSSPDWLRSYQVPTPNSPVTLSSDSE 31


>gi|218189675|gb|EEC72102.1| hypothetical protein OsI_05068 [Oryza sativa Indica Group]
          Length = 473

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 137/311 (44%), Positives = 188/311 (60%), Gaps = 26/311 (8%)

Query: 9   PDWLRSFQAPTHSVLTLSSDSDSDSSHNGNGDGGKNIAELTFEKHMEPFAPKSSILASSS 68
           PDWLR+FQ PT S + LSSDS  DS  N       +  E   E             ASS 
Sbjct: 140 PDWLRAFQPPTTSTVMLSSDS-YDSPENSPTRTTPSGEEQKGENK-----------ASSD 187

Query: 69  DSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVK----KVSKEKSPKKGLKSPK 124
                  +S  + +KA   + K         G++EK +++    K +K  +PKK +K P 
Sbjct: 188 HVGDGDVASPNKGKKATPTRRKTPTSQEDAFGKDEKPTMESNQDKPAKRSTPKKLVKFPS 247

Query: 125 KEKDVINNVKRKGNNDHED---------AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA 175
                     + G +  +D         AE D+ +K +  +VS   LPL++ +K+QR+KA
Sbjct: 248 SSNASKVTGPKAGPDQIDDTLEHREEGVAEVDMQDKLTEHSVS-QRLPLIIPDKVQRSKA 306

Query: 176 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHS 235
           L+EC+G+S+DLSGD+GAVGRI++  +  GN ++ LDLKGTIYK+T+VPSRTFC+VS G +
Sbjct: 307 LIECDGDSIDLSGDVGAVGRIVISNSPNGNQDLLLDLKGTIYKSTIVPSRTFCVVSVGQT 366

Query: 236 EAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE 295
           EAKIE+IMNDFIQL+PQSN++EAETMVEGTLDGF FDS++E DKL +    Q DQN+   
Sbjct: 367 EAKIESIMNDFIQLEPQSNLFEAETMVEGTLDGFTFDSDEEGDKLPEPHASQNDQNNEDG 426

Query: 296 EQTNEKTKVKS 306
           +Q   KTK K+
Sbjct: 427 DQPKAKTKRKA 437


>gi|357138685|ref|XP_003570920.1| PREDICTED: uncharacterized protein LOC100821932 [Brachypodium
           distachyon]
          Length = 358

 Score =  213 bits (542), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 123/296 (41%), Positives = 178/296 (60%), Gaps = 16/296 (5%)

Query: 1   MSGSREESPDWLRSFQAPTHSVLTLSSDSDS---------DSSHNGNGDGGKNIAELTFE 51
           M+G  E  PDWL +FQAP+ + + LSS SDS         D      G G  + A  T +
Sbjct: 1   MAGQEEGDPDWLTAFQAPSLAPVMLSSGSDSSRENSPTRADQEEKEAGKGNSDTAGRTDD 60

Query: 52  --KHMEPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDED-AALTGREEKTSVK 108
             K  +P A +   LAS   +      S + E++    + +   ++  AL+      S  
Sbjct: 61  ATKGKQPTATRRKALASKKGALGKHEGSSMEEQEDIPPKRRTPKKNLVALSDSSAGNSPS 120

Query: 109 KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSM----LNVSTSTLPL 164
           + S+    +  LK    +     +   + ++  E  EE +AE++       N  +  LPL
Sbjct: 121 RASEANEEEDSLKDKNAKVVGTESCPDQRDDTLEQLEEGVAEEQMQDKLPGNSISQRLPL 180

Query: 165 VMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPS 224
           ++++K+QR+KALVEC+G+S+DLSGD+GAVGRI++     GN ++ LDLKGT+YK+T+VPS
Sbjct: 181 MITDKVQRSKALVECDGDSIDLSGDIGAVGRIVISNGPTGNQDLLLDLKGTVYKSTIVPS 240

Query: 225 RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKL 280
           RTFC+VS G +EAKIEAIMNDFIQL+P SN++EAETM+EGTLDGF FDS+ E D+L
Sbjct: 241 RTFCVVSVGQTEAKIEAIMNDFIQLEPHSNLFEAETMMEGTLDGFTFDSDGEGDRL 296


>gi|297812657|ref|XP_002874212.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297320049|gb|EFH50471.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 492

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 127/267 (47%), Positives = 163/267 (61%), Gaps = 61/267 (22%)

Query: 91  EMDEDAALTGREEKTSVKKVSKEKSPK----KGLKSPKK----------EKDVIN----- 131
           E DED  L   EE+ +VKKV KEKSPK     G K+PK+          E  VIN     
Sbjct: 184 EKDEDFVLDATEEEPAVKKVGKEKSPKTKSKSGRKTPKEGNGAQKIFKSEDKVINTTIAE 243

Query: 132 -----------------------NVKRKGNNDHEDAEEDIAEKRSM------LNVSTSTL 162
                                  + ++   ++ +D +  IAE+ +M       + S+S L
Sbjct: 244 EVTMEKSPETKSKSSRKTPKEENSAQKIFKSEDKDTDTTIAEEGTMDQKIKPSSGSSSRL 303

Query: 163 PLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK--------- 213
           PLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLK         
Sbjct: 304 PLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGINTFSFHV 360

Query: 214 -GTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFD 272
            GTIYK+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+
Sbjct: 361 SGTIYKSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLTPQSNVYEAETMVEGTLEGFSFE 420

Query: 273 SEDESDKLAKAMPHQIDQNDGIEEQTN 299
           S+DE++K AK      DQ+ G EE+TN
Sbjct: 421 SDDETNKNAKTASKPADQSGGTEEETN 447


>gi|45736025|dbj|BAD13052.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736067|dbj|BAD13093.1| unknown protein [Oryza sativa Japonica Group]
          Length = 320

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 112/227 (49%), Positives = 151/227 (66%), Gaps = 12/227 (5%)

Query: 109 KVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHED----------AEEDIAEKRSMLNVS 158
           K  K  +PKK L  P    +       K   D  D          AEE++ +K +  +VS
Sbjct: 78  KPPKRSTPKKKLVKPPSGSNASKVTGPKAGPDQIDDTLEHQEEGVAEEEMQDKLTEHSVS 137

Query: 159 TSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 218
              LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK
Sbjct: 138 -QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYK 196

Query: 219 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESD 278
           +T+VPSRTFC+VS G +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E D
Sbjct: 197 STIVPSRTFCVVSVGQTEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGD 256

Query: 279 KLAKAMPHQIDQNDGIEEQTNEKTKVKS-GPCRKERYECWRTATATK 324
           KL +    Q DQN+   +Q   KTK K+  P  K + +      ATK
Sbjct: 257 KLPEPHASQNDQNNEDGDQPKAKTKRKAEKPAGKGQKKAKVAGKATK 303


>gi|238481371|ref|NP_001154738.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005958|gb|AED93341.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 446

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 263 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDTT---GDMYLDLKGTIY 319

Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 320 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 379

Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
           +K AK      DQ+ G EE+TN
Sbjct: 380 NKNAKTAVKPADQSVGTEEETN 401


>gi|238481367|ref|NP_001154736.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|238481369|ref|NP_001154737.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005956|gb|AED93339.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005957|gb|AED93340.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 450

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 267 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 323

Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 324 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 383

Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
           +K AK      DQ+ G EE+TN
Sbjct: 384 NKNAKTAVKPADQSVGTEEETN 405


>gi|240256338|ref|NP_197851.5| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005954|gb|AED93337.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 451

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 268 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 324

Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 325 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 384

Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
           +K AK      DQ+ G EE+TN
Sbjct: 385 NKNAKTAVKPADQSVGTEEETN 406


>gi|238481365|ref|NP_001154735.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|332005955|gb|AED93338.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 432

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 249 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 305

Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 306 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 365

Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
           +K AK      DQ+ G EE+TN
Sbjct: 366 NKNAKTAVKPADQSVGTEEETN 387


>gi|238481373|ref|NP_001154739.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
 gi|257050999|sp|Q9FLU1.2|BIN4_ARATH RecName: Full=DNA-binding protein BIN4; AltName: Full=Protein
           BRASSINOSTEROID INSENSITIVE 4; AltName: Full=Protein
           MIDGET
 gi|332005959|gb|AED93342.1| DNA-binding protein BIN4 [Arabidopsis thaliana]
          Length = 454

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 98/142 (69%), Positives = 119/142 (83%), Gaps = 3/142 (2%)

Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLKGTIY
Sbjct: 271 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKGTIY 327

Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
           K+T++PSRTFC+V+ G +EAKIEAIMNDFIQL PQSNVYEAETMVEGTL+GF F+S+DES
Sbjct: 328 KSTIIPSRTFCVVNVGQTEAKIEAIMNDFIQLIPQSNVYEAETMVEGTLEGFTFESDDES 387

Query: 278 DKLAKAMPHQIDQNDGIEEQTN 299
           +K AK      DQ+ G EE+TN
Sbjct: 388 NKNAKTAVKPADQSVGTEEETN 409


>gi|413935594|gb|AFW70145.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 354

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 132/333 (39%), Positives = 189/333 (56%), Gaps = 45/333 (13%)

Query: 3   GSREESPDWLRSFQAPTHSVLTLSSDSDSDSSHNGNGDGGKNIAELTFEKHMEPFAPKSS 62
           G  E+ PDWLR+FQAPT + + LSS SD+ S          +  +   EKH  P   +  
Sbjct: 2   GDEEDDPDWLRAFQAPTVAPVMLSSGSDT-SPEASPTRTSTSRKQNKGEKHASPDHARDR 60

Query: 63  ILASSSD---SESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKG 119
             AS +    S + +  +L+ +++     EK+ +    LT +++  ++   S + SP   
Sbjct: 61  DGASQNKNKISAATRRKNLVSKKEGSTMDEKQANTPRRLTPKKDMVTLSSGS-DASPGNS 119

Query: 120 LKSPKK---EKDVINNVKRK----------------------GNNDHEDAEEDIAEKRSM 154
           L    +   E+  ++  KRK                      GN D   AE+D+ +K + 
Sbjct: 120 LSRAHEDNHEEGSLSTAKRKNAQQTKTKKTKDAGTKTGADQAGNGD---AEDDVQDKLTG 176

Query: 155 LNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
            +VS   LPL+  +K+QR+KAL+EC+G+S+DLSGD+GAVGRI+V        ++ LDLKG
Sbjct: 177 NSVS-QRLPLIFPDKVQRSKALIECDGDSIDLSGDIGAVGRIVVSNGPTAKQDLLLDLKG 235

Query: 215 TIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSE 274
           TIYKTT+VPSRTFC+VS G SEAKIEAIMNDFIQL+PQSN++EAETMVE          +
Sbjct: 236 TIYKTTIVPSRTFCVVSVGQSEAKIEAIMNDFIQLEPQSNLFEAETMVE----------D 285

Query: 275 DESDKLAKAMPHQIDQNDGIEE-QTNEKTKVKS 306
           +E DKL +   +Q D N+  +E Q   KTK K+
Sbjct: 286 EEGDKLYEPQANQNDLNNNEDEGQPKAKTKRKA 318


>gi|255544304|ref|XP_002513214.1| conserved hypothetical protein [Ricinus communis]
 gi|223547712|gb|EEF49205.1| conserved hypothetical protein [Ricinus communis]
          Length = 155

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 88/117 (75%), Positives = 102/117 (87%)

Query: 190 MGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 249
           MGAVGR+++P  + GNHEM+LDLKGTIY+T +VPSRTFC+VSFG SEAKIEAIMNDFIQL
Sbjct: 1   MGAVGRVVIPDGSSGNHEMYLDLKGTIYRTKIVPSRTFCVVSFGQSEAKIEAIMNDFIQL 60

Query: 250 KPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 306
           K QSNVYEAETMVEGTL+GF FDSEDE++K+ K + HQ DQNDG EEQ N KTK K+
Sbjct: 61  KAQSNVYEAETMVEGTLEGFSFDSEDEAEKMPKPVSHQTDQNDGNEEQMNGKTKRKA 117


>gi|297598587|ref|NP_001045881.2| Os02g0147100 [Oryza sativa Japonica Group]
 gi|255670601|dbj|BAF07795.2| Os02g0147100, partial [Oryza sativa Japonica Group]
          Length = 126

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 79/118 (66%), Positives = 102/118 (86%)

Query: 175 ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 234
           AL+EC+G+S+DLSGD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPSRTFC+VS G 
Sbjct: 1   ALIECDGDSIDLSGDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQ 60

Query: 235 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQND 292
           +EAKIE+IM+DFIQL+PQSN++EAETM+EGTLDGF FDS++E DKL +    Q DQN+
Sbjct: 61  TEAKIESIMDDFIQLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNN 118


>gi|410129774|dbj|BAM64852.1| hypothetical protein [Beta vulgaris]
          Length = 467

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 112/243 (46%), Positives = 154/243 (63%), Gaps = 40/243 (16%)

Query: 58  APKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPK 117
           AP  SI   SSDSE+  ++++ +       +EK  +E++A      +++VK+    KSPK
Sbjct: 242 APNHSICELSSDSETKSDATIRK------IEEKIDNEESA-----PRSAVKR----KSPK 286

Query: 118 KGLKSP--KKEKDVINNVKRKGNNDHEDAEEDI-AEKRSMLNVSTSTLPLVMSEKLQRTK 174
           K +K      EK      K +G  ++ D  E++  EK+S   VS+ST+PLV+ EK+QRTK
Sbjct: 287 KRVKQEDDATEKRTNEPAKIEGEVENMDVAEEVKPEKQSEPAVSSSTMPLVLPEKVQRTK 346

Query: 175 ALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGH 234
           ALVECEGES+DLSGDMGAVGR+++  T     EMFLDLK ++                  
Sbjct: 347 ALVECEGESIDLSGDMGAVGRVIISETPSKESEMFLDLKVSLE----------------- 389

Query: 235 SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGI 294
              +IEAIMNDFIQLKPQSNVY AETMVEGTL+GF F+S+DE+DK+ K +  + DQ++G 
Sbjct: 390 ---QIEAIMNDFIQLKPQSNVYNAETMVEGTLEGFTFESDDEADKMTKDV--KGDQSEGA 444

Query: 295 EEQ 297
           EE+
Sbjct: 445 EEK 447



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 20/28 (71%), Positives = 23/28 (82%)

Query: 1   MSGSREESPDWLRSFQAPTHSVLTLSSD 28
           M  SRE SPDWLR+FQAP+ S+ TLSSD
Sbjct: 115 MGDSREASPDWLRTFQAPSQSITTLSSD 142


>gi|45736024|dbj|BAD13051.1| unknown protein [Oryza sativa Japonica Group]
 gi|45736066|dbj|BAD13092.1| unknown protein [Oryza sativa Japonica Group]
          Length = 193

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 80/138 (57%), Positives = 103/138 (74%), Gaps = 1/138 (0%)

Query: 188 GDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFI 247
           GD+GAVGRI++  +  GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAKIE+IM+DFI
Sbjct: 39  GDVGAVGRIIISNSPNGNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAKIESIMDDFI 98

Query: 248 QLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS- 306
           QL+PQSN++EAETM+EGTLDGF FDS++E DKL +    Q DQN+   +Q   KTK K+ 
Sbjct: 99  QLEPQSNLFEAETMMEGTLDGFTFDSDEEGDKLPEPHASQNDQNNEDGDQPKAKTKRKAE 158

Query: 307 GPCRKERYECWRTATATK 324
            P  K + +      ATK
Sbjct: 159 KPAGKGQKKAKVAGKATK 176


>gi|10177863|dbj|BAB11215.1| unnamed protein product [Arabidopsis thaliana]
          Length = 531

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 91/163 (55%), Positives = 108/163 (66%), Gaps = 28/163 (17%)

Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK---- 213
           S+S LPLV+SEK+ RTK LVECEG+S+DLSGDMGAVGR++V  T     +M+LDLK    
Sbjct: 331 SSSRLPLVLSEKVNRTKVLVECEGDSIDLSGDMGAVGRVVVSDT---TGDMYLDLKEHFA 387

Query: 214 -----------------GTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVY 256
                            GTI  T    S     V+ G +EAKIEAIMNDFIQL PQSNVY
Sbjct: 388 LYVFSSSCTFSDCKGTCGTIIHT----SDLVFQVNVGQTEAKIEAIMNDFIQLIPQSNVY 443

Query: 257 EAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTN 299
           EAETMVEGTL+GF F+S+DES+K AK      DQ+ G EE+TN
Sbjct: 444 EAETMVEGTLEGFTFESDDESNKNAKTAVKPADQSVGTEEETN 486


>gi|168037576|ref|XP_001771279.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677368|gb|EDQ63839.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 379

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 66/125 (52%), Positives = 93/125 (74%), Gaps = 3/125 (2%)

Query: 157 VSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTI 216
           ++ S+LP+V  EK+ RTK L+ECEG+++DLSGDMGAVGR  V      + E+ LDLKG +
Sbjct: 179 IAASSLPIVFGEKVNRTKVLLECEGDALDLSGDMGAVGRFTV---NRPDDELLLDLKGVV 235

Query: 217 YKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDE 276
           YKTT+VPS T+ +V+ G  EAK+E+IM DF+QL+  ++  E ETMVEGTL  F F+S++E
Sbjct: 236 YKTTIVPSNTYFVVNVGQMEAKVESIMTDFMQLRADTSWNENETMVEGTLKDFAFESDEE 295

Query: 277 SDKLA 281
            ++ A
Sbjct: 296 GERPA 300


>gi|168010889|ref|XP_001758136.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690592|gb|EDQ76958.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 373

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 64/122 (52%), Positives = 93/122 (76%), Gaps = 3/122 (2%)

Query: 158 STSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIY 217
           + S+LP+V  EK+ +TK L+ECEG+++DLSGDMGAVGR  V      ++E+ LDLKG IY
Sbjct: 172 AASSLPIVFGEKVNKTKVLLECEGDALDLSGDMGAVGRFTV---NRRDNELLLDLKGVIY 228

Query: 218 KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDES 277
           KTT+VPS TF +V+ G ++AK+E+IM+DF+QL+  +   E ET+VEGTL  F F+S++E 
Sbjct: 229 KTTIVPSNTFFLVNVGQTDAKVESIMSDFVQLRADTIGKENETVVEGTLQDFTFESDEEG 288

Query: 278 DK 279
           ++
Sbjct: 289 ER 290


>gi|45736061|dbj|BAD13087.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 189

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 60/106 (56%), Positives = 85/106 (80%), Gaps = 4/106 (3%)

Query: 144 AEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAE 203
           AEED+ +K +  +VS   LPL++ +K+QR+KAL+EC+G+S+DLSGD+GAVGRI++  +  
Sbjct: 87  AEEDMQDKLTEHSVS-QRLPLIIPDKIQRSKALIECDGDSIDLSGDVGAVGRIIISNSPN 145

Query: 204 GNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQL 249
           GN E+ LDLKGTIYK+T+VPSRTFC+VS G +EAK   +++D I L
Sbjct: 146 GNQELLLDLKGTIYKSTIVPSRTFCVVSVGQTEAK---VVDDNINL 188


>gi|224132538|ref|XP_002321341.1| predicted protein [Populus trichocarpa]
 gi|222868337|gb|EEF05468.1| predicted protein [Populus trichocarpa]
          Length = 159

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/196 (40%), Positives = 98/196 (50%), Gaps = 61/196 (31%)

Query: 127 KDVINNVKRKG-------NNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKALVEC 179
           K ++  VK +G       + D E  EE  +EK    +VS S LPLV+SEK+QR+KALVEC
Sbjct: 2   KAIMCKVKVQGEEGTCGNHGDVEITEEVTSEKLINAHVSISRLPLVLSEKVQRSKALVEC 61

Query: 180 EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKI 239
           EGES+DLSG+MGAVGR ++P T  G+ EM LDLKGT                        
Sbjct: 62  EGESIDLSGEMGAVGRPVIPDTPSGDSEMHLDLKGT------------------------ 97

Query: 240 EAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTN 299
                                     L+GF F SEDE+DK+ KA  HQ DQN  +EE +N
Sbjct: 98  --------------------------LEGFSFFSEDETDKITKATAHQTDQNMDVEEPSN 131

Query: 300 EKTKVK----SGPCRK 311
              K K    SG  RK
Sbjct: 132 GMIKRKAEKSSGVARK 147


>gi|384247484|gb|EIE20971.1| hypothetical protein COCSUDRAFT_83530 [Coccomyxa subellipsoidea
           C-169]
          Length = 378

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 84/276 (30%), Positives = 130/276 (47%), Gaps = 36/276 (13%)

Query: 9   PDWLRSFQAPTHSVLTLSSDSD--------SDSSHNGNGDGGKNIAEL--TFEKHMEPFA 58
           PDWL    +P  + L L SDSD         D +  G G G  + AE      K     A
Sbjct: 16  PDWLSENNSPVKA-LELLSDSDEVLEVASSDDGAEEGAGPGPGSAAEQKEAPRKRGRAAA 74

Query: 59  PKSS-----ILASSSDSES--------CQNSSLLREEKAFQFQEKEMDEDAALTGREEKT 105
           P++      +L SS +S+S         Q  +  R++     Q+ +        G  +K 
Sbjct: 75  PQAKRNLNRVLLSSENSDSDDVEGSGKGQQPARKRQKTMVATQQSQRFSSQDSAGPSQKA 134

Query: 106 SVKKVSKEKSPKKGLKSPKKEKDVINN-VKRKG-----NNDHEDAEEDIAEKRSMLNVST 159
             +K +     K  L+  ++E +  +N V  +G      +      +    K ++L    
Sbjct: 135 PSRKAAGSGG-KAALRRIQEEDEQPDNDVGGEGTQAIVGSQAAKGPKSPVSKIAVLAPVA 193

Query: 160 STLPLVMSEKLQRTKALVECEG---ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTI 216
             +P++M EKL  TK +VE EG    SVDL+GD G VGR LV G  + N E+ +DLKG +
Sbjct: 194 GEIPVMMPEKLPVTKFIVELEGGNDSSVDLTGDAGVVGRWLVEGD-DSNPELRMDLKGVV 252

Query: 217 YKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKP 251
           Y  T VP + T C+V+   +EA++EA+ + +IQL+P
Sbjct: 253 YAATTVPCATTLCVVNLSATEARVEALGHQYIQLRP 288


>gi|303275097|ref|XP_003056848.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
 gi|226461200|gb|EEH58493.1| hypothetical protein MICPUCDRAFT_62312 [Micromonas pusilla
           CCMP1545]
          Length = 392

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/119 (39%), Positives = 72/119 (60%), Gaps = 8/119 (6%)

Query: 162 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 219
           +PL+M   + R K   ECEG  ESVDL GD+G VGR+L   +A+    + +DLKG IY  
Sbjct: 195 IPLLMPATVNRNKIFFECEGAGESVDLEGDVGVVGRLLSDPSAKYTSGVKVDLKGVIYDA 254

Query: 220 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFL-FDSEDE 276
            ++P+  +  +++   +EAK+E I ND++QL+           V GTL+G+L  DS+D+
Sbjct: 255 KILPTPASIVVLAVNQTEAKVECIANDYVQLRRDVTANN----VGGTLEGYLGVDSDDD 309


>gi|449532507|ref|XP_004173222.1| PREDICTED: DNA-binding protein BIN4-like, partial [Cucumis sativus]
          Length = 198

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 61/174 (35%), Positives = 95/174 (54%), Gaps = 25/174 (14%)

Query: 41  GGKNIA-ELTFEKHMEPFAPKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALT 99
           G  N+A E+  ++H      K S+   S DSESC +++ ++E+ ++  +  E+   + + 
Sbjct: 49  GNHNLAKEVKLDRHTGHENSKHSVWMLSLDSESCSDNNFIKEDYSYHEELAEL-ATSEIQ 107

Query: 100 GREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVST 159
           GR +         E +  +G            +V+   N D E  E+D  +      VS+
Sbjct: 108 GRRKD--------ENAGSEG------------HVR---NGDVEILEKDALDDCIGPPVSS 144

Query: 160 STLPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLK 213
           S LPLV+S+K  R KALVECEG S+DLSGDMGAVGR++V  ++   +E+ LDLK
Sbjct: 145 SRLPLVLSDKAHRLKALVECEGTSIDLSGDMGAVGRVVVSDSSSAKNELCLDLK 198


>gi|307107138|gb|EFN55382.1| hypothetical protein CHLNCDRAFT_52595 [Chlorella variabilis]
          Length = 402

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 46/125 (36%), Positives = 73/125 (58%), Gaps = 8/125 (6%)

Query: 149 AEKRSMLNVSTSTLPLVMSEKLQRTKALVECEGES------VDLSGDMGAVGRILVPGTA 202
           A    + +  TS +P+V+ EKL + K LVE E          DLSGD GA+GR+++ GT 
Sbjct: 168 AAGLGLPHAPTSQMPVVLPEKLPQMKMLVELESNPDDAHHVTDLSGDSGAIGRLVIAGTK 227

Query: 203 EGNHEMFLDLKGTIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETM 261
           E   EM +DLKG +Y  T+VP   +  +++ G +EAK+E + ++F+QL+  +     +T 
Sbjct: 228 EAP-EMQIDLKGVLYNATIVPCPVSIAVINMGQTEAKVECLFSEFVQLREDTRFSFVDTE 286

Query: 262 VEGTL 266
           V G +
Sbjct: 287 VLGGM 291


>gi|255084389|ref|XP_002508769.1| predicted protein [Micromonas sp. RCC299]
 gi|226524046|gb|ACO70027.1| predicted protein [Micromonas sp. RCC299]
          Length = 376

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/121 (38%), Positives = 72/121 (59%), Gaps = 11/121 (9%)

Query: 162 LPLVMSEKLQRTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKT 219
           +PL M   + R K   E EG  E+VDL GD+G VGR+L   + +    + +D+KG IY  
Sbjct: 178 VPLFMPSAVNRNKVFFELEGTGEAVDLEGDVGVVGRLLSNASEKHGSGLQMDMKGVIYNA 237

Query: 220 TLVPS-RTFCIVSFGHSEAKIEAIMNDFIQLK--PQSNVYEAETMVEGTLDGFL-FDSED 275
            ++P+  +  I++   +EAK+E+I +DF+QL+  P +N       +  TLDG+L  DS+D
Sbjct: 238 RILPTPASVVILAVNANEAKVESITHDFVQLRRDPVAN-----GGLGATLDGYLGVDSDD 292

Query: 276 E 276
           E
Sbjct: 293 E 293


>gi|320166916|gb|EFW43815.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 236

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 68/110 (61%), Gaps = 3/110 (2%)

Query: 162 LPLVMSEKLQRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTL 221
           +PLV+ +    T  L+E     ++LSGD G+VGR++V    +    + LDLKG I++ T+
Sbjct: 53  IPLVLGKP--ATAFLMETTEPQLNLSGDTGSVGRLVV-NAQDAAAPLVLDLKGLIFQGTV 109

Query: 222 VPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLF 271
           VPS TF +V+ G +EA+I+A +N+ I      +  ++ T+VEG++D   F
Sbjct: 110 VPSNTFMLVTIGPNEARIDATVNNVIHTTCLGDALDSMTVVEGSIDADAF 159


>gi|255544302|ref|XP_002513213.1| conserved hypothetical protein [Ricinus communis]
 gi|223547711|gb|EEF49204.1| conserved hypothetical protein [Ricinus communis]
          Length = 137

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 54/125 (43%), Positives = 78/125 (62%), Gaps = 9/125 (7%)

Query: 58  APKSSILASSSDSESCQNSSLLREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPK 117
           A K SI A SSDSESC + S  R+ K+ Q +E   ++DA L       SV+K  K KSPK
Sbjct: 17  AAKRSIWALSSDSESCPDDSSQRKTKSNQPKESRKEKDAILVD----NSVEKALKGKSPK 72

Query: 118 KGLK----SPKKEKDVINNVKRKGNNDH-EDAEEDIAEKRSMLNVSTSTLPLVMSEKLQR 172
           K LK      KK+++  ++V + G+ND  E A+ED++EK +   VSTS +PL++S+K+ R
Sbjct: 73  KLLKVEGRKLKKKENKNDDVPKTGDNDDVEVADEDVSEKHTGPQVSTSMIPLLLSDKVSR 132

Query: 173 TKALV 177
           TK  +
Sbjct: 133 TKMFL 137


>gi|428174918|gb|EKX43811.1| hypothetical protein GUITHDRAFT_140250 [Guillardia theta CCMP2712]
          Length = 309

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 52/82 (63%), Gaps = 5/82 (6%)

Query: 176 LVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHS 235
           +V+CE +++D  GD G +G I       G   M LDLKG +Y+ +++P  T  ++  G +
Sbjct: 164 VVQCEDQNLDFEGDSGVIGTI-----KTGERSMELDLKGQLYEGSILPCCTMMVMDMGAT 218

Query: 236 EAKIEAIMNDFIQLKPQSNVYE 257
           E KIE++MNDF+++ P +N ++
Sbjct: 219 EVKIESVMNDFVKVHPIANPHD 240


>gi|302857504|ref|XP_002959890.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
 gi|300254022|gb|EFJ39045.1| hypothetical protein VOLCADRAFT_101408 [Volvox carteri f.
           nagariensis]
          Length = 340

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 35/129 (27%)

Query: 157 VSTSTLPLVMSEKLQRTKALVEC-----EGE----SVDLSGDMGAVGRILVPGTAEGNH- 206
           V    LP+++ EKL R K L+E       GE    + DLSGD GAVGR++V      N  
Sbjct: 38  VEAGKLPIMLPEKLNRNKVLLELPPAAMSGEHYHGAADLSGDSGAVGRLVVGKAPAANSK 97

Query: 207 ------------------------EMFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAKIEA 241
                                   ++ +DLKG +Y   ++P + T  +++ G SEAK+E+
Sbjct: 98  GTAAAAAAAAAGGSEGGGSGGDSVQLQVDLKGLLYNAWVLPLAGTAMVLNIGPSEAKVES 157

Query: 242 IMNDFIQLK 250
           + NDFI+L+
Sbjct: 158 LFNDFIRLR 166


>gi|449526337|ref|XP_004170170.1| PREDICTED: DNA-binding protein BIN4-like [Cucumis sativus]
          Length = 71

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/53 (66%), Positives = 41/53 (77%), Gaps = 1/53 (1%)

Query: 243 MNDFIQLKPQSNVYEAETMVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIE 295
           MNDFIQLK  S V EAETMVEGTLDGF FDSED+++K+ KA     DQN+ +E
Sbjct: 1   MNDFIQLKALSKVDEAETMVEGTLDGFSFDSEDDAEKITKA-ASPADQNEPVE 52


>gi|299471419|emb|CBN79372.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 240

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 68/120 (56%), Gaps = 7/120 (5%)

Query: 162 LPLVMSEKL---QRTKALVECEGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYK 218
           +PL++   +    +   LV    E+++ +GD GA+GRI V G  +G    F DLKG  + 
Sbjct: 59  VPLIVPPAVCKDAKASLLVHLRDEALNFAGDTGAIGRISVGGKKKGRGLTF-DLKGQQFS 117

Query: 219 TTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAET--MVEGTL-DGFLFDSED 275
             ++PS T  +V  G +EAK+E + ++  +L  + NV++A    +++G + D + F+ ED
Sbjct: 118 GQIIPSATVMVVGIGKTEAKVEGLFSEVCRLDYKGNVFDAMGGEVLKGEMDDSYKFEEED 177


>gi|412989095|emb|CCO15686.1| hypothetical protein VOLCADRAFT_101408 [Bathycoccus prasinos]
          Length = 422

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 24/117 (20%)

Query: 157 VSTSTLPLVMSE-KLQRTK---ALVECEG--ESVDLSGDMGAVGRILV-----PGTAEGN 205
           V  S +PL+++   LQ+T     L EC+G  E+VDLSGD+G VGR+LV     P    G+
Sbjct: 237 VPKSKMPLLINHGALQKTANSTILFECKGSGEAVDLSGDVGVVGRLLVERESQPTGGGGS 296

Query: 206 H----------EMFLDLKGTIY--KTTLVPSRTFCIVSFGHSEAKIEAIMNDFIQLK 250
                       + +DLKG +Y  +T L+ S T  IV+ G  +AKIE   ++++QL+
Sbjct: 297 KASTSKQQPHASVKVDLKGVMYLARTHLL-SSTVAIVNVGAEDAKIETFFDEYVQLR 352


>gi|308804497|ref|XP_003079561.1| unnamed protein product [Ostreococcus tauri]
 gi|116058016|emb|CAL54219.1| unnamed protein product [Ostreococcus tauri]
          Length = 229

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 70/124 (56%), Gaps = 11/124 (8%)

Query: 158 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
           S++ + LV S  L  ++K L E EG  E+VDL GD GAVGR L    AE    + +D+KG
Sbjct: 63  SSTRVDLVASAHLGGKSKVLFEIEGNGEAVDLDGDTGAVGRWL----AESARSLKVDMKG 118

Query: 215 TIYKTTLVP-SRTFCIVSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDS 273
            +Y   +VP + T  +V+     AKI+++  +F+QL+ +  V  A T   G   G LF+ 
Sbjct: 119 LMYNARVVPLAGTVVVVAVNGDVAKIDSVHREFVQLR-EDPVANAGTENYGV--GSLFEM 175

Query: 274 EDES 277
            DE+
Sbjct: 176 TDEA 179


>gi|159483423|ref|XP_001699760.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281702|gb|EDP07456.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 308

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 64/138 (46%), Gaps = 41/138 (29%)

Query: 160 STLPLVMSEKLQRTKALVECEGE---------------SVDLSGDMGAVGRILVPGTAEG 204
            +LPL++ +K+ R K L E                   + DLSGD GAVGR++V   A  
Sbjct: 140 GSLPLMLPDKINRNKVLFELPPPPASSSNADGYGHGHGATDLSGDAGAVGRLIVLAAAPT 199

Query: 205 NHE-------------------------MFLDLKGTIYKTTLVP-SRTFCIVSFGHSEAK 238
             +                         + +DLKG IY    +P + T  +++ G +EAK
Sbjct: 200 GGKGAAAGGGEPGPSSQGGGGPGGAEVCLQVDLKGVIYNAWALPLAGTAMVLNVGPTEAK 259

Query: 239 IEAIMNDFIQLKPQSNVY 256
           +E++ NDF++L+  ++V+
Sbjct: 260 VESLFNDFVRLREAADVH 277


>gi|356516190|ref|XP_003526779.1| PREDICTED: uncharacterized protein LOC100812582 [Glycine max]
          Length = 82

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 26/46 (56%), Positives = 36/46 (78%), Gaps = 1/46 (2%)

Query: 261 MVEGTLDGFLFDSEDESDKLAKAMPHQIDQNDGIEEQTNEKTKVKS 306
           MVEGTLDGF FDS++E+ K+ K+  +Q DQN+ +EEQ N K+K K+
Sbjct: 1   MVEGTLDGFFFDSDEEAGKMQKST-NQTDQNENVEEQANGKSKGKT 45


>gi|255544300|ref|XP_002513212.1| conserved hypothetical protein [Ricinus communis]
 gi|223547710|gb|EEF49203.1| conserved hypothetical protein [Ricinus communis]
          Length = 119

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/31 (77%), Positives = 25/31 (80%)

Query: 1  MSGSREESPDWLRSFQAPTHSVLTLSSDSDS 31
          MS SRE SPDWL  FQAP HS +TLSSDSDS
Sbjct: 1  MSSSRENSPDWLHCFQAPIHSTVTLSSDSDS 31


>gi|145347277|ref|XP_001418100.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578328|gb|ABO96393.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 999

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/97 (39%), Positives = 58/97 (59%), Gaps = 8/97 (8%)

Query: 158 STSTLPLVMSEKLQ-RTKALVECEG--ESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKG 214
           S++ + LV +  L  ++K L E EG  E+VDL GD GAVGR L    AE +  + +D+KG
Sbjct: 33  SSTAVELVATANLGGKSKVLFEIEGAGEAVDLDGDTGAVGRWL----AESSRALKVDMKG 88

Query: 215 TIYKTTLVPSR-TFCIVSFGHSEAKIEAIMNDFIQLK 250
            +Y   +V S  T  +V+     AKIE++  +F+QL+
Sbjct: 89  VMYNARVVSSAGTVVVVAVNADVAKIESVHREFVQLR 125


>gi|397569025|gb|EJK46493.1| hypothetical protein THAOC_34834 [Thalassiosira oceanica]
          Length = 315

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 54/102 (52%), Gaps = 9/102 (8%)

Query: 176 LVECEGESVDLSGDMGAVGRILVPGTAEGNHE-MFLDLKGTIYKTTLVPSRTFCIVSFGH 234
           LV+   + +DL G  GAVGR+      E +   + LDLKG  Y+ T+ P  T  +V+   
Sbjct: 175 LVQMNDDKLDLHGQSGAVGRL------EADERGVILDLKGYQYRGTIRPGPTAMVVALTR 228

Query: 235 -SEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGFLFDSED 275
             + K+E+I ++F+ L  +   +  +++ +   +GF+   ED
Sbjct: 229 EGKFKVESITDEFVTLDAK-GAHVMDSLGDNMREGFMVREED 269


>gi|422294628|gb|EKU21928.1| hypothetical protein NGA_2017520, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 238

 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 185 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 244
           DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAK+E +++
Sbjct: 40  DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVEDVVD 95

Query: 245 DFIQLKPQSNVYEA 258
               L    +  EA
Sbjct: 96  TLCVLTHHHDELEA 109


>gi|219116542|ref|XP_002179066.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409833|gb|EEC49764.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 289

 Score = 45.4 bits (106), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 55/105 (52%), Gaps = 14/105 (13%)

Query: 176 LVECEGE---SVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSF 232
           LV+ + E   S+D +G +GA+GR+     AEG   + LD KG  Y+  + P  T  ++S 
Sbjct: 150 LVQIDPEDSASLDFTGAVGAIGRLET--VAEG---VVLDFKGCQYRGRIYPGSTAVVLST 204

Query: 233 GHSEA-----KIEAIMNDFIQLKPQSNVYE-AETMVEGTLDGFLF 271
              +A     K+E++ ++F +L  + N  E    +V G + G+ +
Sbjct: 205 NGKKADTQLLKVESMTDEFCRLTKEGNAMEKLNAIVRGDMVGYQY 249


>gi|387199325|gb|AFJ68895.1| hypothetical protein NGATSA_2017510 [Nannochloropsis gaditana
           CCMP526]
          Length = 226

 Score = 45.1 bits (105), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 39/74 (52%), Gaps = 4/74 (5%)

Query: 185 DLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFCIVSFGHSEAKIEAIMN 244
           DL+GD GAVGR+ V    + +  + LDLKG  Y   L    T  +VS G  EAK++ +++
Sbjct: 28  DLTGDSGAVGRVHV----KKDRGVVLDLKGHQYLAELCRCPTMLLVSLGEEEAKVKDVVD 83

Query: 245 DFIQLKPQSNVYEA 258
               L    +  EA
Sbjct: 84  TLCVLTHHHDELEA 97


>gi|449018377|dbj|BAM81779.1| hypothetical protein CYME_CMP364C [Cyanidioschyzon merolae strain
           10D]
          Length = 263

 Score = 44.7 bits (104), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 51/100 (51%), Gaps = 10/100 (10%)

Query: 175 ALVECEGESVDLSGDMGAVGRILVPGTAE-GNHEMFLDLKGTIYKTTLVPSRTFCI---- 229
           AL+  E +  DL+GD G +G +     A+ G   + LDLKG +Y+T     RTF +    
Sbjct: 110 ALLVFENDEFDLAGDTGTIGCVQWSHNADSGAVRLCLDLKGVVYET-----RTFSLCGSA 164

Query: 230 VSFGHSEAKIEAIMNDFIQLKPQSNVYEAETMVEGTLDGF 269
           ++   +  ++EA+++D     P +   E   ++ G ++ +
Sbjct: 165 LAVDPTSQRVEAVLHDITYSIPTNVADERAQLIRGKIEPY 204


>gi|413935595|gb|AFW70146.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 186

 Score = 40.8 bits (94), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 3  GSREESPDWLRSFQAPTHSVLTLSSDSDS 31
          G  E+ PDWLR+FQAPT + + LSS SD+
Sbjct: 2  GDEEDDPDWLRAFQAPTVAPVMLSSGSDT 30


>gi|413935596|gb|AFW70147.1| hypothetical protein ZEAMMB73_185491 [Zea mays]
          Length = 198

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 3  GSREESPDWLRSFQAPTHSVLTLSSDSDS 31
          G  E+ PDWLR+FQAPT + + LSS SD+
Sbjct: 2  GDEEDDPDWLRAFQAPTVAPVMLSSGSDT 30


>gi|293394374|ref|ZP_06638674.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
 gi|291423352|gb|EFE96581.1| cystathionine beta-lyase [Serratia odorifera DSM 4582]
          Length = 392

 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 65/138 (47%), Gaps = 14/138 (10%)

Query: 121 KSPKKEKDVINNVKRKGNNDHEDAEEDIAEKRSMLNVSTSTLPLVMSEKLQRTKA-LVEC 179
           + P+++   +N    +G+     + +DI  KR+  +  TS  P +  E L+   + L + 
Sbjct: 21  RDPEQQSGFVNAPVYRGSTVVYKSCDDIRHKRARFSYGTSGTPTI--ENLENAWSTLCDA 78

Query: 180 EGESVDLSGDMGAVGRILVPGTAEGNHEMFLDLKGTIYKTTLVPSRTFC---IVSFGHSE 236
            G  +  SG +GA+   L   T  G+H +  D   ++Y+    P+R FC   +V +G S 
Sbjct: 79  AGTVISPSG-LGALALALFSVTRAGDHILVTD---SVYR----PTRKFCNTLLVKYGVSV 130

Query: 237 AKIEAIMNDFIQLKPQSN 254
              + ++   I+   Q+N
Sbjct: 131 TYYDPLIGGDIERLLQNN 148


>gi|358336321|dbj|GAA54864.1| midasin [Clonorchis sinensis]
          Length = 5090

 Score = 38.9 bits (89), Expect = 3.9,   Method: Composition-based stats.
 Identities = 37/144 (25%), Positives = 59/144 (40%), Gaps = 24/144 (16%)

Query: 25   LSSDSDSDSSHNGNGDGGKNIAELTFEKHM----EPFAPKSSILASSSDSESCQNSSL-- 78
            LSS  ++    N  GDG    ++  F++      E   P S+I  + +D+   Q+ +   
Sbjct: 4249 LSSADETQDLENRMGDGASGASDEVFKEMWASDNEEEEPASTIPPAENDTGGVQDQTNGK 4308

Query: 79   -----LREEKAFQFQEKEMDEDAALTGREEKTSVKKVSKEKSPKKGLKSPKKEKDVINNV 133
                  R+    Q Q+K +D D     + E+ ++ +      PKKG   PK  KD  N  
Sbjct: 4309 QDEKGTRDSSDVQRQKKSLDTDQPAQEKNEEANLGEAESSPPPKKG---PKSAKDFQNKT 4365

Query: 134  KRKG-------NNDHE---DAEED 147
               G       NN+ E   DAE +
Sbjct: 4366 TVAGDEGQMLPNNEQELTPDAESE 4389


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.306    0.124    0.340 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,002,068,949
Number of Sequences: 23463169
Number of extensions: 207109386
Number of successful extensions: 918536
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 204
Number of HSP's successfully gapped in prelim test: 1645
Number of HSP's that attempted gapping in prelim test: 900332
Number of HSP's gapped (non-prelim): 15018
length of query: 336
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 193
effective length of database: 9,003,962,200
effective search space: 1737764704600
effective search space used: 1737764704600
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 77 (34.3 bits)