BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025542
         (251 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225451265|ref|XP_002272524.1| PREDICTED: uncharacterized protein LOC100264176 [Vitis vinifera]
          Length = 242

 Score =  260 bits (664), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 138/233 (59%), Positives = 164/233 (70%), Gaps = 35/233 (15%)

Query: 16  RRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKV 75
           R+ +   +HG++CCS T        +NK  TPQ+L+ AV GVTELLRL S F + RLD +
Sbjct: 39  RQHLQKRVHGIRCCSTT-------PDNKEKTPQLLRIAVGGVTELLRLVS-FGQNRLDSM 90

Query: 76  SYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSR 135
           SYKQ  E+LVS IDD++ IL+SDYENAYFVTGIFTS IY EDCIFEDPTI+FRG +LYSR
Sbjct: 91  SYKQGDELLVSCIDDVLLILKSDYENAYFVTGIFTSAIYDEDCIFEDPTIKFRGKDLYSR 150

Query: 136 NLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRIS 195
           NL+LLVPFF++PSI LQ IEK                 G ++ T      F+L      S
Sbjct: 151 NLKLLVPFFDHPSIALQKIEK-----------------GSNSETK-----FVLA-----S 183

Query: 196 LICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
              RTYLKLPW+PLISI GSTVY+LNDE KI RHAESWN+SAL+AVGQIFTP 
Sbjct: 184 WKLRTYLKLPWRPLISIAGSTVYDLNDEFKIVRHAESWNISALQAVGQIFTPS 236


>gi|298204898|emb|CBI34205.3| unnamed protein product [Vitis vinifera]
          Length = 216

 Score =  260 bits (664), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 144/248 (58%), Positives = 169/248 (68%), Gaps = 38/248 (15%)

Query: 1   MAGIHWFGQTHQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTEL 60
           MAGI W   T  S     +   +HG++CCS T        +NK  TPQ+L+ AV GVTEL
Sbjct: 1   MAGI-WCSSTVSST--LHLRKRVHGIRCCSTT-------PDNKEKTPQLLRIAVGGVTEL 50

Query: 61  LRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIF 120
           LRL S F + RLD +SYKQ  E+LVS IDD++ IL+SDYENAYFVTGIFTS IY EDCIF
Sbjct: 51  LRLVS-FGQNRLDSMSYKQGDELLVSCIDDVLLILKSDYENAYFVTGIFTSAIYDEDCIF 109

Query: 121 EDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTA 180
           EDPTI+FRG +LYSRNL+LLVPFF++PSI LQ IEK                 G ++ T 
Sbjct: 110 EDPTIKFRGKDLYSRNLKLLVPFFDHPSIALQKIEK-----------------GSNSETK 152

Query: 181 LGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEA 240
                F+L      S   RTYLKLPW+PLISI GSTVY+LNDE KI RHAESWN+SAL+A
Sbjct: 153 -----FVLA-----SWKLRTYLKLPWRPLISIAGSTVYDLNDEFKIVRHAESWNISALQA 202

Query: 241 VGQIFTPG 248
           VGQIFTP 
Sbjct: 203 VGQIFTPS 210


>gi|449442907|ref|XP_004139222.1| PREDICTED: uncharacterized protein LOC101220191 [Cucumis sativus]
 gi|449482948|ref|XP_004156451.1| PREDICTED: uncharacterized LOC101220191 [Cucumis sativus]
          Length = 217

 Score =  227 bits (579), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 120/227 (52%), Positives = 149/227 (65%), Gaps = 29/227 (12%)

Query: 22  SIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKY 81
           ++  ++C S +    N N + K     I+K  VSG+TELLRLFSS    R+D++   Q  
Sbjct: 18  TVSVIRCFSSSPE--NSNGSRKKEASAIVKITVSGITELLRLFSSPISKRVDEIRDNQGE 75

Query: 82  EILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
           E +V+G+D++V IL+SDYENAYFVTGIFTS IY +DC+FEDPTIRFRG ELYSRNL+LLV
Sbjct: 76  EFVVTGVDEVVNILKSDYENAYFVTGIFTSAIYTDDCLFEDPTIRFRGKELYSRNLKLLV 135

Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
           PFF+ PSI LQ I K                     S   G    L  + L      RTY
Sbjct: 136 PFFDCPSIQLQTINK---------------------SNKSGVEFVLAAWKL------RTY 168

Query: 202 LKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
           LKLPW+PLISIDG+T+YEL++E KI RHAESW+VSALEA+ QIF P 
Sbjct: 169 LKLPWRPLISIDGNTLYELDEEFKIVRHAESWSVSALEAITQIFIPS 215


>gi|224125592|ref|XP_002319624.1| predicted protein [Populus trichocarpa]
 gi|222858000|gb|EEE95547.1| predicted protein [Populus trichocarpa]
          Length = 199

 Score =  221 bits (564), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 122/226 (53%), Positives = 147/226 (65%), Gaps = 50/226 (22%)

Query: 22  SIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKY 81
           +IHG++CCSG   D  K +  +T TPQILK AVSGVTELLR+FS   K RL+KV+ K + 
Sbjct: 17  TIHGIRCCSG-ATDNEKKSQTRTKTPQILKLAVSGVTELLRVFSFSGKERLEKVNNKDRD 75

Query: 82  EILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
           EI VSGIDD++ IL+SDYENAYFVT                      GT+LYSRNL+LLV
Sbjct: 76  EISVSGIDDVIMILKSDYENAYFVT----------------------GTKLYSRNLKLLV 113

Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
           PFF+ PSIGLQ+IEK                 G+++ T          Y +      RTY
Sbjct: 114 PFFDCPSIGLQDIEK-----------------GVNSET----------YFVLARWKLRTY 146

Query: 202 LKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           LK PW+PLISIDGSTVYEL+++LKI RHAESWNVSALEA+GQIFTP
Sbjct: 147 LKFPWRPLISIDGSTVYELDNKLKIVRHAESWNVSALEAIGQIFTP 192


>gi|351720803|ref|NP_001235141.1| uncharacterized protein LOC100306700 [Glycine max]
 gi|255629317|gb|ACU15003.1| unknown [Glycine max]
          Length = 255

 Score =  211 bits (537), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 121/231 (52%), Positives = 150/231 (64%), Gaps = 38/231 (16%)

Query: 24  HGVKCCSGTVNDGNK-------NTNNKTATPQILKFAVSGVTELLRLFS-SFDKGRLDKV 75
           HGV+ CS T N  N+       N   ++ TPQILK AVSGVTELLRLFS S  +  L+K 
Sbjct: 24  HGVQNCSNT-NTLNRRKIYACLNREKESETPQILKIAVSGVTELLRLFSPSSHQTSLEK- 81

Query: 76  SYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSR 135
             + + E   S +DD++ I++SDY+NAYFVTG F+S IYAE+CIFEDPTI+FRG ELY+R
Sbjct: 82  -QRDQIEFPASSVDDVLRIIKSDYDNAYFVTGNFSSSIYAENCIFEDPTIKFRGRELYAR 140

Query: 136 NLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRIS 195
           NL+LLVPFF+  SI LQ I+K V                         +  L  + L   
Sbjct: 141 NLKLLVPFFDSASIILQKIDKDV---------------------DSDTNFVLASWKL--- 176

Query: 196 LICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFT 246
              RT LKLPW+PLISIDGSTVYELN++ KI RH ESWN+SA+EAV QIF+
Sbjct: 177 ---RTNLKLPWRPLISIDGSTVYELNEDYKIVRHVESWNISAVEAVLQIFS 224


>gi|297828994|ref|XP_002882379.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328219|gb|EFH58638.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 216

 Score =  207 bits (526), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 124/248 (50%), Positives = 155/248 (62%), Gaps = 42/248 (16%)

Query: 1   MAGIHWFGQTHQSVNRRSILGSI-HGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTE 59
           MAG+   GQ   S++R S L  + H  +C SG+         NKT  P +LK+AV GVTE
Sbjct: 1   MAGVA--GQI--SISRSSTLRRVNHRPRCFSGS-------PENKT--PAVLKWAVGGVTE 47

Query: 60  LLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDC 118
           LLRLFS +     +     + +YE+    ++D++ ILRSDY NAYFVTGI TS IY+++C
Sbjct: 48  LLRLFSGAPSSSSIPTNKDRSRYELSAGNVEDVMEILRSDYRNAYFVTGILTSSIYSDEC 107

Query: 119 IFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNS 178
           IFEDPTI F+GTELY RNL+LLVPF E  SI LQN++K            E   R     
Sbjct: 108 IFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMDK-----------SESSERN---- 152

Query: 179 TALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSAL 238
                      Y+L  +   RTYLKLPW+PLISI+GSTVY+L+ + KI RH ESWNVSAL
Sbjct: 153 -----------YIL-ATWKLRTYLKLPWRPLISINGSTVYDLDRDFKIVRHVESWNVSAL 200

Query: 239 EAVGQIFT 246
           EAVGQIFT
Sbjct: 201 EAVGQIFT 208


>gi|357441471|ref|XP_003591013.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
 gi|355480061|gb|AES61264.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
          Length = 238

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/226 (50%), Positives = 139/226 (61%), Gaps = 37/226 (16%)

Query: 29  CSGTVNDGNKNT-------NNKTATPQILKFAVSGVTELLRLFSSFDKGRL--DKVSYKQ 79
           CS +    ++NT         K  TPQ+LK AVSGVTELLRLFS   +  +  D +  KQ
Sbjct: 34  CSNSYTPSSRNTCAIYNMVREKNETPQVLKIAVSGVTELLRLFSPPQQTSVLSDDIE-KQ 92

Query: 80  KYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRL 139
             +  VS ++D++ I++SDY+N YFVTG FTS IY E+CIFEDPTI+F G +LY+RNL+L
Sbjct: 93  NNDSTVSSVEDVLIIIKSDYDNDYFVTGNFTSSIYTENCIFEDPTIKFSGRDLYARNLKL 152

Query: 140 LVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICR 199
           LVPFF+  SI L  IEK V          E                      LR S   R
Sbjct: 153 LVPFFDCASIKLLKIEKEV----------ESDTN-----------------FLRASWKLR 185

Query: 200 TYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           T LKLPW+PLI+IDGST YELN++ KI RH ESWNVSALEAV QIF
Sbjct: 186 TNLKLPWRPLIAIDGSTSYELNEDFKIVRHVESWNVSALEAVLQIF 231


>gi|21537187|gb|AAM61528.1| unknown [Arabidopsis thaliana]
          Length = 216

 Score =  197 bits (502), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 107/202 (52%), Positives = 131/202 (64%), Gaps = 28/202 (13%)

Query: 46  TPQILKFAVSGVTELLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYF 104
           TP +LK+AVSGVTE LRL S +     +     + K E+    +DD++ ILRSDY N YF
Sbjct: 34  TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRSKNEVTAGDVDDVMGILRSDYRNFYF 93

Query: 105 VTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLM 164
           VTG+ TS IY++DCIFEDPTI F+GTELY RNL+LLVPF E  SI LQN+EK        
Sbjct: 94  VTGVLTSAIYSDDCIFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMEK-------- 145

Query: 165 VGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL 224
               E   R                Y+L  +   RTYLKLPW+PLISI+G+TVY+L+ + 
Sbjct: 146 ---SESSQRN---------------YIL-ATWKLRTYLKLPWRPLISINGNTVYDLDKDF 186

Query: 225 KITRHAESWNVSALEAVGQIFT 246
           KI RH ESWNVSALEA+GQIFT
Sbjct: 187 KIVRHVESWNVSALEAIGQIFT 208


>gi|18397041|ref|NP_566242.1| uncharacterized protein [Arabidopsis thaliana]
 gi|13878017|gb|AAK44086.1|AF370271_1 unknown protein [Arabidopsis thaliana]
 gi|17104639|gb|AAL34208.1| unknown protein [Arabidopsis thaliana]
 gi|332640630|gb|AEE74151.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 216

 Score =  197 bits (501), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 107/202 (52%), Positives = 131/202 (64%), Gaps = 28/202 (13%)

Query: 46  TPQILKFAVSGVTELLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYF 104
           TP +LK+AVSGVTE LRL S +     +     + K E+    +DD++ ILRSDY N YF
Sbjct: 34  TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRSKNEVTAGDVDDVMGILRSDYRNFYF 93

Query: 105 VTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLM 164
           VTG+ TS IY++DCIFEDPTI F+GTELY RNL+LLVPF E  SI LQN+EK        
Sbjct: 94  VTGVLTSAIYSDDCIFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMEK-------- 145

Query: 165 VGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL 224
               E   R                Y+L  +   RTYLKLPW+PLISI+G+TVY+L+ + 
Sbjct: 146 ---SESSQRN---------------YIL-ATWKLRTYLKLPWRPLISINGNTVYDLDKDF 186

Query: 225 KITRHAESWNVSALEAVGQIFT 246
           KI RH ESWNVSALEA+GQIFT
Sbjct: 187 KIVRHVESWNVSALEAIGQIFT 208


>gi|218199281|gb|EEC81708.1| hypothetical protein OsI_25317 [Oryza sativa Indica Group]
          Length = 246

 Score =  177 bits (449), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 95/212 (44%), Positives = 129/212 (60%), Gaps = 33/212 (15%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQK-----YEI-LVSGIDDIVTILRSDY 99
            P +++ AVS VTELLR+ +  +   +D    K        E+     +DD+V +L +DY
Sbjct: 58  APPLVRAAVSAVTELLRVLTPKNPRVVDDDLQKGDEGGGGAELDSPRSVDDVVAVLEADY 117

Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVF 159
           + AYF+TG FT +IY EDC+FEDPTI+FRG   YS+NL LLVPFF+ PS+ L+NIEK   
Sbjct: 118 QRAYFLTGNFTPDIYTEDCLFEDPTIKFRGRSRYSQNLDLLVPFFDSPSLELENIEK--- 174

Query: 160 LVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYE 219
                         GL   T            +  +   RTYL+LPW+PLI+I G+T Y+
Sbjct: 175 --------------GLRVETK----------FIMATWTLRTYLRLPWRPLIAIRGNTTYD 210

Query: 220 LNDELKITRHAESWNVSALEAVGQIFTPGDRN 251
           L++E K+TRHAESW+VSALEA+GQIF P  + 
Sbjct: 211 LDEEYKVTRHAESWDVSALEAIGQIFVPAPKQ 242


>gi|357111214|ref|XP_003557409.1| PREDICTED: uncharacterized protein LOC100837510 [Brachypodium
           distachyon]
          Length = 231

 Score =  176 bits (447), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 93/223 (41%), Positives = 132/223 (59%), Gaps = 33/223 (14%)

Query: 23  IHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYE 82
           +  V+C S +   G++        P +++ AVS VTELLR  S   K R    +   + E
Sbjct: 33  LPSVRCSSSSPGSGDQG-----GAPPVVRAAVSAVTELLRALSPSKKPRQMGEAVDTELE 87

Query: 83  ILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVP 142
            L   ++D+V +L +DY+ AYF+TG FT  IYAEDC++EDPTI+FRG   Y++NL LLVP
Sbjct: 88  -LPRSVEDVVAVLEADYQRAYFLTGNFTLGIYAEDCLYEDPTIKFRGRSKYAQNLDLLVP 146

Query: 143 FFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
           F + PS+ L++IEK                 GL   T            +  +   RTYL
Sbjct: 147 FLDSPSLELESIEK-----------------GLRAETKS----------IMATWTLRTYL 179

Query: 203 KLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           +LPW+PLI+I G+T Y+L++E K+ RHAESW+VSALEA+ Q+F
Sbjct: 180 RLPWRPLIAIRGNTTYDLDEEYKVVRHAESWDVSALEAIAQLF 222


>gi|357441473|ref|XP_003591014.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
 gi|355480062|gb|AES61265.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
          Length = 252

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 103/213 (48%), Positives = 137/213 (64%), Gaps = 22/213 (10%)

Query: 29  CSGTVNDGNKNT-------NNKTATPQILKFAVSGVTELLRLFSSFDKGRL--DKVSYKQ 79
           CS +    ++NT         K  TPQ+LK AVSGVTELLRLFS   +  +  D +  KQ
Sbjct: 34  CSNSYTPSSRNTCAIYNMVREKNETPQVLKIAVSGVTELLRLFSPPQQTSVLSDDIE-KQ 92

Query: 80  KYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRL 139
             +  VS ++D++ I++SDY+N YFVTG FTS IY E+CIFEDPTI+F G +LY+RNL+L
Sbjct: 93  NNDSTVSSVEDVLIIIKSDYDNDYFVTGNFTSSIYTENCIFEDPTIKFSGRDLYARNLKL 152

Query: 140 LVPFFEYPSIGLQNIEKFV-----FLVSLMVGRREGRMRGLHNS-TALGGHIFLLPYLLR 193
           LVPFF+  SI L  IEK V     FL      R   ++R L +S + L  +I++  +   
Sbjct: 153 LVPFFDCASIKLLKIEKEVESDTNFL------RASWKLRCLPSSLSMLICNIWIYFFKHF 206

Query: 194 ISLICRTYLKLPWKPLISIDGSTVYELNDELKI 226
           I   CRT LKLPW+PLI+IDGST YELN++ K+
Sbjct: 207 IIPQCRTNLKLPWRPLIAIDGSTSYELNEDFKV 239


>gi|116780121|gb|ABK21558.1| unknown [Picea sitchensis]
          Length = 239

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 95/208 (45%), Positives = 123/208 (59%), Gaps = 27/208 (12%)

Query: 40  TNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDY 99
              K  +P  LK AV  VTELLR+F++  +            EI V+ IDD+V +L SDY
Sbjct: 51  AKKKKESPLFLKLAVGLVTELLRIFTTSQERSSFVSGSTTPNEISVADIDDVVAVLESDY 110

Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVF 159
           + +YF+TG FT  IYAEDC F DPTI+F+G +LY RNL+LLVPFF  PS+ L  IE+   
Sbjct: 111 KRSYFLTGEFTRSIYAEDCWFIDPTIKFQGRDLYQRNLQLLVPFFVNPSLLLCGIEQ--- 167

Query: 160 LVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYE 219
                         G    T            ++     RTYLKLPW+PLIS++GST+Y+
Sbjct: 168 --------------GFKGDTKF----------IKAKWYLRTYLKLPWRPLISLEGSTIYD 203

Query: 220 LNDELKITRHAESWNVSALEAVGQIFTP 247
           L+D+LKI  H ESWN+SA EA+ QIF P
Sbjct: 204 LDDDLKIAYHVESWNISAFEAITQIFLP 231


>gi|414883940|tpg|DAA59954.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
          Length = 243

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 89/204 (43%), Positives = 120/204 (58%), Gaps = 31/204 (15%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEI----LVSGIDDIVTILRSDYEN 101
            P +++ AVS VTELLR  S     R      +Q  E         ++D++ +LR DY  
Sbjct: 56  APPVVRAAVSAVTELLRALSPNKALRGAATQQQQGEEPDPDPTCGSVEDVLAVLRDDYRR 115

Query: 102 AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLV 161
           AYF+TG FT  IY EDC+FEDPTI+FRG   YS+NL LLVPFF+ PS+ L+NIEK     
Sbjct: 116 AYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSPSLELENIEK----- 170

Query: 162 SLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELN 221
                       G    T            ++ +   RTYL+LPW+PL++I GST Y+L+
Sbjct: 171 ------------GSRAET----------KFVKATWKLRTYLRLPWRPLVAIGGSTTYDLD 208

Query: 222 DELKITRHAESWNVSALEAVGQIF 245
            + K+TRH+ESW+VSAL+A+ QIF
Sbjct: 209 GDFKVTRHSESWDVSALQAIAQIF 232


>gi|242043308|ref|XP_002459525.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
 gi|241922902|gb|EER96046.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
          Length = 245

 Score =  167 bits (423), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 124/205 (60%), Gaps = 33/205 (16%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVS-----GIDDIVTILRSDYE 100
            P +++ AVS VTELLR  S  +K   D  + +Q  E   S      ++D++ +L+ DY 
Sbjct: 59  APPVVRAAVSAVTELLRALSP-NKNLRDAAAQQQGEEPDSSPPRCGSVEDVLAVLQDDYR 117

Query: 101 NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFL 160
            AYF+TG FT  IY  DC+FEDPTI+FRG   YS+NL LLVPFF+ PS+ L++IEK    
Sbjct: 118 RAYFLTGDFTPGIYTADCLFEDPTIKFRGLSRYSQNLYLLVPFFDSPSLELESIEK---- 173

Query: 161 VSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYEL 220
                        GL   T            ++ +   RTYL+LPW+PLI+I G+T Y+L
Sbjct: 174 -------------GLRVETK----------FVKATWKLRTYLRLPWRPLIAIRGNTTYDL 210

Query: 221 NDELKITRHAESWNVSALEAVGQIF 245
           N++ K+ RH+ESW+VSA+EA+ QIF
Sbjct: 211 NEDFKVIRHSESWDVSAVEAIAQIF 235


>gi|186509771|ref|NP_001118575.1| uncharacterized protein [Arabidopsis thaliana]
 gi|227202522|dbj|BAH56734.1| AT3G04890 [Arabidopsis thaliana]
 gi|332640631|gb|AEE74152.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 221

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/181 (49%), Positives = 112/181 (61%), Gaps = 28/181 (15%)

Query: 46  TPQILKFAVSGVTELLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYF 104
           TP +LK+AVSGVTE LRL S +     +     + K E+    +DD++ ILRSDY N YF
Sbjct: 34  TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRSKNEVTAGDVDDVMGILRSDYRNFYF 93

Query: 105 VTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLM 164
           VTG+ TS IY++DCIFEDPTI F+GTELY RNL+LLVPF E  SI LQN+EK        
Sbjct: 94  VTGVLTSAIYSDDCIFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMEK-------- 145

Query: 165 VGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL 224
               E   R                Y+L    + RTYLKLPW+PLISI+G+TVY+L+ + 
Sbjct: 146 ---SESSQRN---------------YILATWKL-RTYLKLPWRPLISINGNTVYDLDKDF 186

Query: 225 K 225
           K
Sbjct: 187 K 187


>gi|326508892|dbj|BAJ86839.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 194

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 87/202 (43%), Positives = 115/202 (56%), Gaps = 35/202 (17%)

Query: 25  GVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL 84
           GV+C SG+   G+        +P +++ AVS VTELLR FS   K     V         
Sbjct: 28  GVRCGSGSPGSGDD------GSPPVVRAAVSAVTELLRAFSP--KKPSPAVEAVDAESES 79

Query: 85  VSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF 144
              +DD+V +L +DY  AYF+TG FT  IYAEDC+FEDPTI+FRG   YS+NL LLVPFF
Sbjct: 80  PRSVDDVVAVLEADYRRAYFLTGNFTLGIYAEDCLFEDPTIKFRGRSRYSQNLDLLVPFF 139

Query: 145 EYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKL 204
           + PS+ L+NI+K                 GL   T      F++      S   RTYL+L
Sbjct: 140 DSPSLQLENIDK-----------------GLRVDTK-----FII-----ASWTLRTYLRL 172

Query: 205 PWKPLISIDGSTVYELNDELKI 226
           PW+PLI+I G+T Y+L++E KI
Sbjct: 173 PWRPLIAIRGNTTYDLDEEYKI 194


>gi|168004834|ref|XP_001755116.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693709|gb|EDQ80060.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 140

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 72/160 (45%), Positives = 97/160 (60%), Gaps = 25/160 (15%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           ++ +LR DYE AYF+TG FT  +YA+DC+F DPTI+FRG + Y RNL+LLVPFFE P + 
Sbjct: 1   VLEVLREDYERAYFLTGDFTPTLYADDCLFADPTIQFRGRDRYQRNLKLLVPFFEEPKLI 60

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           L +I++            E      H S     ++             RTYL+LPWKPLI
Sbjct: 61  LFDIQE------------EDSKSRQHKSINANWNL-------------RTYLRLPWKPLI 95

Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
            +DGST+Y L+   KI +H ESW++S  +AV Q+F P DR
Sbjct: 96  DVDGSTLYVLDANSKIVKHIESWSISGWQAVQQLFVPSDR 135


>gi|222636639|gb|EEE66771.1| hypothetical protein OsJ_23496 [Oryza sativa Japonica Group]
          Length = 249

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 86/215 (40%), Positives = 115/215 (53%), Gaps = 36/215 (16%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYK-QKYEILVSG--------IDDIVTILR 96
            P +++ AVS VTELLR+ +  +      V    QK +    G        +DD++ +L 
Sbjct: 58  APPLVRAAVSAVTELLRVLTPKNPRHAKVVDDDLQKGDEGGGGAELDSPRSVDDVIAVLE 117

Query: 97  SDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
           +DY+ AYF+TG FT +IY EDC+FEDPTI+FRG   YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 118 ADYQRAYFLTGNFTPDIYTEDCLFEDPTIKFRGRSRYSQNLDLLVPFFDSPSLELENIEK 177

Query: 157 FVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGST 216
                            GL   T       +  + LR        L      L+    S 
Sbjct: 178 -----------------GLRVET----KFIMATWTLRSRYSQNLDL------LVPFFDSP 210

Query: 217 VYELNDELKITRHAESWNVSALEAVGQIFTPGDRN 251
             EL +  K+TRHAESW+VSALEA+GQIF P  + 
Sbjct: 211 SLELENIEKVTRHAESWDVSALEAIGQIFVPAPKQ 245


>gi|147812119|emb|CAN65783.1| hypothetical protein VITISV_010563 [Vitis vinifera]
          Length = 170

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/143 (50%), Positives = 91/143 (63%), Gaps = 30/143 (20%)

Query: 16  RRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKV 75
           R+ +   +HG++CCS T        +NK  TPQ+L+ AV GVTELLRL S F + RLD +
Sbjct: 39  RQHLQKRVHGIRCCSTT-------PDNKEKTPQLLRIAVGGVTELLRLVS-FGQNRLDSM 90

Query: 76  SYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSR 135
           SYKQ  E+LVS IDD++ IL+SDYENAYFVT                      G +LYSR
Sbjct: 91  SYKQGDELLVSCIDDVLLILKSDYENAYFVT----------------------GKDLYSR 128

Query: 136 NLRLLVPFFEYPSIGLQNIEKFV 158
           NL+LLVPFF++PSI LQ IEK +
Sbjct: 129 NLKLLVPFFDHPSIALQKIEKMM 151


>gi|302836648|ref|XP_002949884.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
           nagariensis]
 gi|300264793|gb|EFJ48987.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
           nagariensis]
          Length = 237

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 79/218 (36%), Positives = 117/218 (53%), Gaps = 33/218 (15%)

Query: 37  NKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQ-KYEILVSGIDDIVTIL 95
           N  T ++     +   AV+ +TE LRLF    + R + V+ +  +   L  G  D+  ++
Sbjct: 44  NPKTRSRQKDNPLTVGAVTVMTETLRLFG-VGRERYEAVAAEPPRNRPLRRG--DVAGLM 100

Query: 96  R---SDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
           R   +D++ AY VTGI    IY  DC F DPT+ FRG +L+ RNL LLVPF + P++ L+
Sbjct: 101 RRITTDFKQAYLVTGILDDSIYDPDCFFADPTVAFRGVDLWKRNLALLVPFLDQPAVQLK 160

Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
            +++         GR E            G  +      +R     RT+L+LPW+PLI I
Sbjct: 161 RVQRL--------GRDEQ-----------GAEV------VRAEWRLRTFLRLPWRPLIDI 195

Query: 213 DGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPGD 249
           DG+T Y LN+E  +I RH ESW VS  +A+ Q+F P +
Sbjct: 196 DGATEYTLNEESNRIVRHVESWGVSGTQALLQMFRPSN 233


>gi|255542358|ref|XP_002512242.1| hypothetical protein RCOM_1428170 [Ricinus communis]
 gi|223548203|gb|EEF49694.1| hypothetical protein RCOM_1428170 [Ricinus communis]
          Length = 153

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/139 (51%), Positives = 86/139 (61%), Gaps = 39/139 (28%)

Query: 122 DPTIRFR------------GTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
           +PT+ FR            GTELYSRNL+LLVPFFE PSIGL+N+EK             
Sbjct: 8   NPTLSFRRVNGIRCCLGAPGTELYSRNLKLLVPFFESPSIGLENMEK------------- 54

Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
               G++  +      FLL      +   RTYLK PW+P ISIDGST+YELND+ K+ RH
Sbjct: 55  ----GVNTESN-----FLL-----ATWKLRTYLKFPWRPFISIDGSTIYELNDKFKVVRH 100

Query: 230 AESWNVSALEAVGQIFTPG 248
           AESWNVSALEA+GQIFTP 
Sbjct: 101 AESWNVSALEAIGQIFTPS 119


>gi|159481098|ref|XP_001698619.1| SOUL3-like protein [Chlamydomonas reinhardtii]
 gi|158282359|gb|EDP08112.1| SOUL3-like protein [Chlamydomonas reinhardtii]
          Length = 232

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 71/198 (35%), Positives = 104/198 (52%), Gaps = 31/198 (15%)

Query: 54  VSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILR---SDYENAYFVTGIFT 110
           V+ +TE LR+     +  ++  +   +   +  G  D+  ++R   +D++ AYFVTG+  
Sbjct: 56  VTVLTEALRVLGVGKERYVEVAAAPSRAAPVRRG--DVAGLMRRLTADFKQAYFVTGVLD 113

Query: 111 SEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREG 170
             IY  DC F DPT+ FRGT+L+ RNL LL PF E P++ L  + +         GR E 
Sbjct: 114 DSIYEPDCYFADPTVAFRGTDLWKRNLALLTPFLEAPNVQLYGMRQL--------GRDED 165

Query: 171 RMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRH 229
                      G  +      +R      T LKLPW+PLI +DG+T Y LN+E  +I RH
Sbjct: 166 -----------GAEV------VRAEWRLTTILKLPWRPLIDLDGATEYTLNEESNRIVRH 208

Query: 230 AESWNVSALEAVGQIFTP 247
            E W++S  EAV Q+F P
Sbjct: 209 VEFWSISGTEAVLQMFRP 226


>gi|223948397|gb|ACN28282.1| unknown [Zea mays]
 gi|414883941|tpg|DAA59955.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
          Length = 214

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 57/115 (49%), Positives = 73/115 (63%), Gaps = 4/115 (3%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEI----LVSGIDDIVTILRSDYEN 101
            P +++ AVS VTELLR  S     R      +Q  E         ++D++ +LR DY  
Sbjct: 56  APPVVRAAVSAVTELLRALSPNKALRGAATQQQQGEEPDPDPTCGSVEDVLAVLRDDYRR 115

Query: 102 AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
           AYF+TG FT  IY EDC+FEDPTI+FRG   YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 116 AYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSPSLELENIEK 170


>gi|223942399|gb|ACN25283.1| unknown [Zea mays]
 gi|414883939|tpg|DAA59953.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
          Length = 216

 Score =  113 bits (282), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 57/119 (47%), Positives = 76/119 (63%), Gaps = 8/119 (6%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDK-GRLDKVSYKQKYE-------ILVSGIDDIVTILRS 97
            P +++ AVS VTELLR  S      RL + +  Q+ +            ++D++ +LR 
Sbjct: 56  APPVVRAAVSAVTELLRALSPNKALSRLCRGAATQQQQGEEPDPDPTCGSVEDVLAVLRD 115

Query: 98  DYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
           DY  AYF+TG FT  IY EDC+FEDPTI+FRG   YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 116 DYRRAYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSPSLELENIEK 174


>gi|307108290|gb|EFN56530.1| hypothetical protein CHLNCDRAFT_144158 [Chlorella variabilis]
          Length = 172

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 60/157 (38%), Positives = 82/157 (52%), Gaps = 25/157 (15%)

Query: 95  LRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNI 154
           LR DY   YFVT + T  IY   C F DPT+ FRG +LY RNL LLVPF   P+I L+++
Sbjct: 32  LRRDYARQYFVTALVTDAIYDPGCYFADPTVSFRGRDLYKRNLALLVPFLWEPAIQLRSL 91

Query: 155 EKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDG 214
            +                     + A GG   L        L C  +++LPW P + I+G
Sbjct: 92  RRL-------------------PAPAPGGSAQL---FAEWRLSC--WVRLPWAPYVDING 127

Query: 215 STVYELN-DELKITRHAESWNVSALEAVGQIFTPGDR 250
           +T Y LN D  +I RH E W+VSA++A+  +  P +R
Sbjct: 128 TTTYTLNADGNQIVRHVEQWDVSAVQALLLLLRPSER 164


>gi|414883944|tpg|DAA59958.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
          Length = 148

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 43/69 (62%), Positives = 55/69 (79%)

Query: 88  IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
           ++D++ +LR DY  AYF+TG FT  IY EDC+FEDPTI+FRG   YS+NL LLVPFF+ P
Sbjct: 36  VEDVLAVLRDDYRRAYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSP 95

Query: 148 SIGLQNIEK 156
           S+ L+NIEK
Sbjct: 96  SLELENIEK 104


>gi|239050713|ref|NP_001141609.2| hypothetical protein [Zea mays]
 gi|238908815|gb|ACF86714.2| unknown [Zea mays]
 gi|414883943|tpg|DAA59957.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
          Length = 146

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 43/69 (62%), Positives = 55/69 (79%)

Query: 88  IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
           ++D++ +LR DY  AYF+TG FT  IY EDC+FEDPTI+FRG   YS+NL LLVPFF+ P
Sbjct: 36  VEDVLAVLRDDYRRAYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSP 95

Query: 148 SIGLQNIEK 156
           S+ L+NIEK
Sbjct: 96  SLELENIEK 104


>gi|302799533|ref|XP_002981525.1| hypothetical protein SELMODRAFT_58216 [Selaginella moellendorffii]
 gi|300150691|gb|EFJ17340.1| hypothetical protein SELMODRAFT_58216 [Selaginella moellendorffii]
          Length = 178

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 55/156 (35%), Positives = 77/156 (49%), Gaps = 33/156 (21%)

Query: 94  ILRSDYENAYFVTGIF-TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
           ++R DYE +YFVTG F T+ IY  DC F DP + F+G + + +N+  L  F E  ++ + 
Sbjct: 52  LIRKDYEQSYFVTGTFMTTGIYKSDCEFADPFVSFKGLKRFKQNVSNLGAFMEKSTLKIA 111

Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
           + ++     SL VG R                              R  L LPW+P IS 
Sbjct: 112 SWDEKE--DSLKVGWR-----------------------------FRCVLALPWRPAISA 140

Query: 213 DGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
            GST Y  +DE  KI +H ESWN+S  + V Q+F P
Sbjct: 141 SGSTEYFFDDESGKICKHVESWNISPADGVRQLFKP 176


>gi|302760261|ref|XP_002963553.1| hypothetical protein SELMODRAFT_68183 [Selaginella moellendorffii]
 gi|300168821|gb|EFJ35424.1| hypothetical protein SELMODRAFT_68183 [Selaginella moellendorffii]
          Length = 178

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 77/156 (49%), Gaps = 33/156 (21%)

Query: 94  ILRSDYENAYFVTGIF-TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
           ++R DYE +YFVTG F T+ IY  DC F DP + F+G + + +N+  L  F +  ++ + 
Sbjct: 52  LIRKDYEQSYFVTGTFMTTGIYKSDCEFADPFVSFKGLKRFKQNVSNLGAFMKKSTLKIA 111

Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
           + ++     SL VG R                              R  L LPW+P IS 
Sbjct: 112 SWDEKE--DSLKVGWR-----------------------------FRCVLALPWRPAISA 140

Query: 213 DGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
            GST Y  +DE  KI +H ESWN+S  + V Q+F P
Sbjct: 141 SGSTEYFFDDESGKICKHVESWNISPADGVRQLFKP 176


>gi|224125600|ref|XP_002319626.1| predicted protein [Populus trichocarpa]
 gi|222858002|gb|EEE95549.1| predicted protein [Populus trichocarpa]
          Length = 150

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 37/49 (75%), Positives = 44/49 (89%)

Query: 108 IFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
           +FTS IY EDCIFEDPT++F+GT+LYSRNL LLVPFF+ PS GLQ+IEK
Sbjct: 35  VFTSTIYDEDCIFEDPTVKFQGTKLYSRNLNLLVPFFDCPSTGLQDIEK 83


>gi|116791970|gb|ABK26182.1| unknown [Picea sitchensis]
          Length = 247

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/222 (31%), Positives = 96/222 (43%), Gaps = 40/222 (18%)

Query: 30  SGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI- 88
           S +  + N N+NN      +LK A  G   L    S F      + S +Q  +  V+G  
Sbjct: 55  SPSPEESNSNSNN-----ILLKLAWYGSEALGIAASVFRPSTAVQPSGEQSSQNEVAGSI 109

Query: 89  --DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
             D +V  ++ D+E +YFVTG  T+ IY EDC F DP   FRG + + RN        E 
Sbjct: 110 PKDAVVDAIKKDFERSYFVTGNLTTIIYEEDCEFADPAGSFRGLKRFKRNCSNFGSLLEK 169

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
            ++ L + E F               RG+       GH            IC   L  PW
Sbjct: 170 SNMKLMDWEDF-------------EDRGI-------GH---------WRFIC--VLAFPW 198

Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           +P++S  G T Y  + +  KI RH E WNV  +  + QIF P
Sbjct: 199 RPILSATGYTEYYYDAQSGKICRHVEHWNVPKMVLLKQIFKP 240


>gi|12322860|gb|AAG51422.1|AC009465_22 hypothetical protein; 57237-56085 [Arabidopsis thaliana]
          Length = 154

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 65/124 (52%), Gaps = 35/124 (28%)

Query: 46  TPQILKFAVSGVTELLRLFS--------SFDKGRLD-----KVSYKQKYEILVSGIDDIV 92
           TP +LK+AVSGVTE LRL S        + +K RL          + K E+    +DD++
Sbjct: 34  TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRLVVEIDLNTGCRSKNEVTAGDVDDVM 93

Query: 93  TILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
            ILRSDY N YFVTG                      TELY RNL+LLVPF E  SI LQ
Sbjct: 94  GILRSDYRNFYFVTG----------------------TELYERNLKLLVPFLEDASIELQ 131

Query: 153 NIEK 156
           N+EK
Sbjct: 132 NMEK 135


>gi|168003427|ref|XP_001754414.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694516|gb|EDQ80864.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 197

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 59/226 (26%), Positives = 99/226 (43%), Gaps = 40/226 (17%)

Query: 25  GVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL 84
           GV  C     +  K      A+P ++K    G     +  ++F   R       +  E+ 
Sbjct: 8   GVVACEA--KNETKTPPTADASP-LVKMVWYGSEAFGKFVAAF---RPSATVEPEDEEMF 61

Query: 85  VSGI--DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVP 142
           V  +   ++V +++ DYE +YFVTG  T  IY  DC F DP + F+G   + +N+  L  
Sbjct: 62  VGPVPRSEVVDLIKKDYERSYFVTGNMTMGIYEADCEFADPFVAFKGLRRFKQNVSNLGS 121

Query: 143 FFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
           F E  S+ + + +++           E R+                    R    C   L
Sbjct: 122 FMEESSLKITDWQEY-----------EDRV------------------YARWRFNC--IL 150

Query: 203 KLPWKPLISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
            LPW+P+++  GST Y  + +  KI +H E+W++S  + V Q+F P
Sbjct: 151 GLPWRPILAATGSTEYFFDSNSGKICKHVENWDISPADGVRQLFKP 196


>gi|357504325|ref|XP_003622451.1| hypothetical protein MTR_7g037920 [Medicago truncatula]
 gi|355497466|gb|AES78669.1| hypothetical protein MTR_7g037920 [Medicago truncatula]
          Length = 243

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 95/223 (42%), Gaps = 40/223 (17%)

Query: 29  CSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI 88
           C+GT    N+N  ++  T   LK A    +ELL + +S  +   D+        +L + I
Sbjct: 41  CNGT----NQNQESQPQTNAFLKVAWYS-SELLGIAASAFRSPSDEEEASPPQRLLQT-I 94

Query: 89  D--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
           D   +V  ++ D++ +YFVTG  T   Y EDC F DP   F+G + + RN        E 
Sbjct: 95  DRASVVDTIKQDFQRSYFVTGDLTLNAYEEDCEFADPAGSFKGLQRFKRNCTNFGSLLEK 154

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
            ++ L   E F               +G+       GH        R S I    L  PW
Sbjct: 155 STMNLMKWEDF-------------EDKGI-------GH-------WRFSCI----LSFPW 183

Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPG 248
           +P++S  G T Y  + +  K+ RH E WNV  +    QI  P 
Sbjct: 184 RPILSATGYTEYYFDTQSGKVYRHVEHWNVPKMALFKQILRPS 226


>gi|224125602|ref|XP_002319627.1| predicted protein [Populus trichocarpa]
 gi|222858003|gb|EEE95550.1| predicted protein [Populus trichocarpa]
          Length = 122

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/156 (36%), Positives = 71/156 (45%), Gaps = 60/156 (38%)

Query: 1   MAGIHWFGQTHQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTEL 60
           MAGI     T     +R     IH ++CCSG  ++ N N+  +T TPQILK AVSGV EL
Sbjct: 1   MAGICSLNPTTLMTFKR-----IHRIRCCSGATDNEN-NSQTRTKTPQILKLAVSGVIEL 54

Query: 61  LRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIF 120
           LR+FS                    SG D + +I         F+               
Sbjct: 55  LRVFS-------------------FSGKDSLYSI---------FIP-------------- 72

Query: 121 EDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
                       +SRNL+LLVPFF+ PSIGLQNIEK
Sbjct: 73  ------------HSRNLKLLVPFFDCPSIGLQNIEK 96


>gi|388502340|gb|AFK39236.1| unknown [Medicago truncatula]
          Length = 230

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/221 (28%), Positives = 95/221 (42%), Gaps = 40/221 (18%)

Query: 29  CSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI 88
           C+GT    N+N  ++  T   LK A    +ELL + +S  +   D+        +L + I
Sbjct: 41  CNGT----NQNQESQPQTNAFLKVAWYS-SELLGIAASAFRSPSDEEEASPPQRLLQT-I 94

Query: 89  D--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
           D   +V  ++ D++ +YFVTG  T   Y EDC F DP   F+G + + RN        E 
Sbjct: 95  DRASVVDTIKQDFQRSYFVTGDLTLNAYEEDCEFADPAGSFKGLQRFKRNCTNFGSLLEK 154

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
            ++ L   E F               +G+       GH        R S I    L  PW
Sbjct: 155 STMNLMKWEDF-------------EDKGI-------GH-------WRFSCI----LSFPW 183

Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFT 246
           +P++S  G T Y  + +  K+ RH E WNV  +    QI +
Sbjct: 184 RPILSATGYTEYYFDTQSGKVYRHVEHWNVPKMALFKQILS 224


>gi|356560693|ref|XP_003548624.1| PREDICTED: uncharacterized protein LOC100801965 [Glycine max]
          Length = 238

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 94/223 (42%), Gaps = 42/223 (18%)

Query: 29  CSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI 88
           CSGT    N N  +++    ILK A    +ELL + +S  +   ++    Q+   L+  I
Sbjct: 35  CSGT----NPNKESESQNNAILKLAWYS-SELLGIAASVFRSPSNEEVPPQR---LLQTI 86

Query: 89  D--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
           D   +V  ++ D+E +YFVTG  T   Y EDC F DP   F+G + + RN        E 
Sbjct: 87  DRAAVVDTIKQDFERSYFVTGDLTLNAYEEDCEFADPAGSFKGLQRFKRNCTNFGSLLEK 146

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
            ++ L   E F               +G+       GH        R + I    L  PW
Sbjct: 147 SNMKLMKWEDF-------------EDKGI-------GH-------WRFNCI----LSFPW 175

Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPG 248
           +P++S  G T Y  +    K+ RH E WNV       QI  P 
Sbjct: 176 RPILSATGYTEYYFDARSGKVCRHVEHWNVPKKALFKQILRPS 218


>gi|312283147|dbj|BAJ34439.1| unnamed protein product [Thellungiella halophila]
          Length = 242

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 63/227 (27%), Positives = 91/227 (40%), Gaps = 40/227 (17%)

Query: 29  CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVS 86
           C G  N  N+   +K A P+  +LKFA  G   L    S+F             +E+ V 
Sbjct: 39  CRGQ-NPANEPQTSKGAEPENVLLKFAWYGSELLGIAASAFRSPASPPPPTVTGFEVPVD 97

Query: 87  GIDDIVTI-----LRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
                V +     ++ D++ +YFVTG  T E+Y E C F DP   F+G   + RN     
Sbjct: 98  CSGRAVRVAVVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLTRFKRNCTNFG 157

Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
              E  ++ L   E F               +G+       GH        + S +    
Sbjct: 158 TLIEKSNMKLMKWENF-------------EDKGV-------GH-------WKFSCV---- 186

Query: 202 LKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           +  PWKP++S  G T Y  + E  KI RH E WNV  +    Q+  P
Sbjct: 187 MSFPWKPILSATGYTEYYFDTESGKICRHVEHWNVPKIALFKQLLRP 233


>gi|255541694|ref|XP_002511911.1| conserved hypothetical protein [Ricinus communis]
 gi|223549091|gb|EEF50580.1| conserved hypothetical protein [Ricinus communis]
          Length = 235

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 63/246 (25%), Positives = 99/246 (40%), Gaps = 43/246 (17%)

Query: 11  HQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKG 70
           HQ++    I   I+  K      N  N +T  ++     L       +ELL + +SF   
Sbjct: 18  HQNI----IFSPIYPRKFNRIRCNGENPSTRKESEPENALLKVAWYSSELLGIAASF--F 71

Query: 71  RLDKVSYKQKYEILVSGIDDI-----VTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTI 125
           R   V+ +  +++ +  +  I     V  ++ D++ +YFVTG  TS+ Y EDC F DP  
Sbjct: 72  RSPSVTSEANFKLSIDDLGSIDRATVVQSIKDDFQRSYFVTGNLTSDAYEEDCEFADPAG 131

Query: 126 RFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHI 185
            F+G   + RN        E  ++ L   E F                       +G   
Sbjct: 132 SFKGLRRFKRNCTNFGLLLEKSNMKLMKWEDF-------------------EDKGIG--- 169

Query: 186 FLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQI 244
                  R S I    +  PWKP++S  G T Y  + +  ++ RH E WNV  +  + QI
Sbjct: 170 -----YWRFSCI----MSFPWKPILSATGYTEYYFDVQSGRVCRHVEHWNVPKMALLKQI 220

Query: 245 FTPGDR 250
             P  R
Sbjct: 221 LRPSQR 226


>gi|224067926|ref|XP_002302602.1| predicted protein [Populus trichocarpa]
 gi|222844328|gb|EEE81875.1| predicted protein [Populus trichocarpa]
          Length = 245

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 89/198 (44%), Gaps = 39/198 (19%)

Query: 58  TELLRLFSSFDKGRLDKVSYKQKYEIL----VSGIDD---IVTILRSDYENAYFVTGIFT 110
           +ELL + +SF +   +  +   + +I     VSG+ D   ++  ++ D++ +YFVTG  T
Sbjct: 67  SELLGIAASFFRSPSNSSTEASEKDIKLGKDVSGVIDRAVVMETIKEDFQKSYFVTGSLT 126

Query: 111 SEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREG 170
            E Y E+C F DP   F+G + + RN        E  ++ L   E F             
Sbjct: 127 LEAYEENCEFADPAGSFKGLQRFKRNCTNFGLLIEKSNMKLTKWEDF------------- 173

Query: 171 RMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRH 229
             +G+       GH        R S +    +  PWKP++S  G T Y  +++  ++ RH
Sbjct: 174 EDKGI-------GH-------WRFSCV----MSFPWKPILSATGYTEYYFDEQSGRVCRH 215

Query: 230 AESWNVSALEAVGQIFTP 247
            E WNV  +  + Q+  P
Sbjct: 216 VEHWNVPKMALLKQLLKP 233


>gi|323455378|gb|EGB11246.1| hypothetical protein AURANDRAFT_61587 [Aureococcus anophagefferens]
          Length = 492

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 95/207 (45%), Gaps = 33/207 (15%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFV 105
           T    K  +S +T+++ L     +G  D     ++++ L      +   L  D+ N Y  
Sbjct: 24  TSSTTKNIISTLTKVVNLQGG--RGEADLAPRARQFDTLSGAA--VKEGLAVDFGNEYLW 79

Query: 106 TGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE--YPSIGLQNIEKF-VFLVS 162
           +G  T E+Y EDC+F DPT+ F G   +  NL  L  + E   P  G + ++ F + LV 
Sbjct: 80  SGKITPELYDEDCVFTDPTLSFAGLRTFEANLANLDFWIESLVPE-GRRRVDLFDLALVP 138

Query: 163 LMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYEL-N 221
               R   + R L                          L+LPW P++ ++G+T Y L  
Sbjct: 139 GATDRVRAKWRMLG------------------------ALRLPWGPVLDLNGTTTYTLGG 174

Query: 222 DELKITRHAESWNVSALEAVGQIFTPG 248
           D  +I+ + E+W+++A +A+GQ+  PG
Sbjct: 175 DGGRISSYDETWDMTAGDALGQLLRPG 201


>gi|225454326|ref|XP_002277369.1| PREDICTED: uncharacterized protein LOC100258452 [Vitis vinifera]
          Length = 256

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 103/237 (43%), Gaps = 50/237 (21%)

Query: 23  IHGVKCCSGTVNDG-NKNTNNKTAT---PQ--ILKFAVSGVTELLRLFSSFDKGRLDKVS 76
           IH  +C      DG N  +N+ TA    P+  +LK A  G +ELL + +SF +      +
Sbjct: 47  IHRFRC------DGENPRSNSSTAQESEPENALLKVAWYG-SELLGIAASFFRSPSSVEA 99

Query: 77  YKQKYEILVSG---ID--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTE 131
            ++  ++   G   +D   +V  ++ D++ +YFVTG  T   Y +DC F DP   FRG  
Sbjct: 100 PERAIDLAGDGSGAVDRAALVETIKEDFQRSYFVTGNLTLSAYEDDCEFADPAGSFRGLR 159

Query: 132 LYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYL 191
            + RN        +  ++ L   E F               +G+       GH       
Sbjct: 160 RFKRNCTNFGSLIQKSNMKLMKWEDF-------------EDKGI-------GH------- 192

Query: 192 LRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
            R S +    L  PWKP++S  G T Y  + +  K+ RH E WNV  +  + QI  P
Sbjct: 193 WRFSCV----LSFPWKPILSATGYTEYYFDSQSGKVCRHVEHWNVPKMALLKQILRP 245


>gi|297745341|emb|CBI40421.3| unnamed protein product [Vitis vinifera]
          Length = 240

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 103/237 (43%), Gaps = 50/237 (21%)

Query: 23  IHGVKCCSGTVNDG-NKNTNNKTAT---PQ--ILKFAVSGVTELLRLFSSFDKGRLDKVS 76
           IH  +C      DG N  +N+ TA    P+  +LK A  G +ELL + +SF +      +
Sbjct: 31  IHRFRC------DGENPRSNSSTAQESEPENALLKVAWYG-SELLGIAASFFRSPSSVEA 83

Query: 77  YKQKYEILVSG---ID--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTE 131
            ++  ++   G   +D   +V  ++ D++ +YFVTG  T   Y +DC F DP   FRG  
Sbjct: 84  PERAIDLAGDGSGAVDRAALVETIKEDFQRSYFVTGNLTLSAYEDDCEFADPAGSFRGLR 143

Query: 132 LYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYL 191
            + RN        +  ++ L   E F               +G+       GH       
Sbjct: 144 RFKRNCTNFGSLIQKSNMKLMKWEDF-------------EDKGI-------GH------- 176

Query: 192 LRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
            R S +    L  PWKP++S  G T Y  + +  K+ RH E WNV  +  + QI  P
Sbjct: 177 WRFSCV----LSFPWKPILSATGYTEYYFDSQSGKVCRHVEHWNVPKMALLKQILRP 229


>gi|449441428|ref|XP_004138484.1| PREDICTED: uncharacterized protein LOC101218208 [Cucumis sativus]
          Length = 239

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 65/227 (28%), Positives = 98/227 (43%), Gaps = 41/227 (18%)

Query: 29  CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL-- 84
           C G  N    + NN+ + P+  +LK A  G +ELL + +S+ +  LD  +  +  E+   
Sbjct: 33  CQGN-NPTTDSPNNQESKPENAVLKVAWYG-SELLGIAASYLRPPLDVQTPLRAQELTTD 90

Query: 85  VSGIDD---IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
           VSG      IV  ++ D+  +YFVTG  T + Y E C F DP   F+G   + RN     
Sbjct: 91  VSGSIPRPLIVETIKEDFRRSYFVTGNLTLQAYEEQCEFADPAGSFKGLRRFKRNCTNFG 150

Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
              +  ++ L   E F               +G+       GH        + S I    
Sbjct: 151 SLVDKSNMKLTKWEGF-------------EDKGI-------GH-------WKFSCI---- 179

Query: 202 LKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           L  PW+P++S  G T Y  +    K+ RH E WNV  +  + QI  P
Sbjct: 180 LSFPWRPILSATGYTEYYFDARSGKVCRHVEHWNVPKMALLKQILRP 226


>gi|449495217|ref|XP_004159768.1| PREDICTED: uncharacterized LOC101218208 [Cucumis sativus]
          Length = 248

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 65/227 (28%), Positives = 98/227 (43%), Gaps = 41/227 (18%)

Query: 29  CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL-- 84
           C G  N    + NN+ + P+  +LK A  G +ELL + +S+ +  LD  +  +  E+   
Sbjct: 42  CQGN-NPTTDSPNNQESKPENAVLKVAWYG-SELLGIAASYLRPPLDVQTPLRAQELTTD 99

Query: 85  VSGIDD---IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
           VSG      IV  ++ D+  +YFVTG  T + Y E C F DP   F+G   + RN     
Sbjct: 100 VSGSIPRPLIVETIKEDFRRSYFVTGNLTLQAYEEQCEFADPAGSFKGLRRFKRNCTNFG 159

Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
              +  ++ L   E F               +G+       GH        + S I    
Sbjct: 160 SLVDKSNMKLTKWEGF-------------EDKGI-------GH-------WKFSCI---- 188

Query: 202 LKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           L  PW+P++S  G T Y  +    K+ RH E WNV  +  + QI  P
Sbjct: 189 LSFPWRPILSATGYTEYYFDARSGKVCRHVEHWNVPKMALLKQILRP 235


>gi|302783284|ref|XP_002973415.1| hypothetical protein SELMODRAFT_413750 [Selaginella moellendorffii]
 gi|300159168|gb|EFJ25789.1| hypothetical protein SELMODRAFT_413750 [Selaginella moellendorffii]
          Length = 170

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/166 (31%), Positives = 80/166 (48%), Gaps = 33/166 (19%)

Query: 87  GIDDIVTILRSDYEN-AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
            +D+I  I+R D E+  Y+VTG  T  IY+EDC F DPT   +G + Y   ++ L   F+
Sbjct: 36  AVDEIEQIIRQDIEDRQYYVTGDLTRSIYSEDCRFIDPTTNVQGVDSYVAAVKSL---FD 92

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
            PS     + + + + S      + R++G                           LKLP
Sbjct: 93  -PSDSKHELLE-IHVTSPTTIEAKWRLQG--------------------------SLKLP 124

Query: 206 WKPLISI-DGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
           WKP I+  +G+T Y LND+  +  H E+W++S L A  ++ TP  R
Sbjct: 125 WKPHIAAYEGTTTYTLNDDGFVASHKETWDISLLTAFIELLTPSFR 170


>gi|242074082|ref|XP_002446977.1| hypothetical protein SORBIDRAFT_06g026170 [Sorghum bicolor]
 gi|241938160|gb|EES11305.1| hypothetical protein SORBIDRAFT_06g026170 [Sorghum bicolor]
          Length = 262

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 68/158 (43%), Gaps = 32/158 (20%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           +V  +++D+  +YFVTG  T   Y EDC F DP   FRG   + RN        E  ++ 
Sbjct: 128 VVEAVKADFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLLEKSNMK 187

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           L   E                   + + +   GH        R S +    +  PW+P++
Sbjct: 188 LTKWED------------------MEDKSI--GH-------WRFSCV----MSFPWRPIL 216

Query: 211 SIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           S  G T Y  + E  K+ RH E+WNV  +  + QIF P
Sbjct: 217 SATGYTEYYFDAESGKVCRHVENWNVPKMALLRQIFRP 254


>gi|298713681|emb|CBJ48872.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 192

 Score = 70.5 bits (171), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 66/146 (45%), Gaps = 32/146 (21%)

Query: 103 YFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVS 162
           Y+VTG  T  +Y  DC+F DPT R    E Y+  ++LL            N +  V L+S
Sbjct: 67  YYVTGDLTPSLYRPDCLFTDPTTRVESVERYTAAVKLL--------FDPDNAQ--VDLLS 116

Query: 163 LMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS-IDGSTVYELN 221
           L V                  HIF            + YLK PW P +    G+T+Y L+
Sbjct: 117 LAVKD--------------DNHIF-------AEWQLQGYLKFPWHPYVKPYTGTTLYTLD 155

Query: 222 DELKITRHAESWNVSALEAVGQIFTP 247
            +  I+RH E+W++SALEA     TP
Sbjct: 156 SDGLISRHEETWSISALEAARATITP 181


>gi|298708412|emb|CBJ48475.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 252

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 32/163 (19%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           D V  L++D++  YFVTG    ++Y  +C+F DP + F G + +  NL  L    +  S+
Sbjct: 117 DAVARLKTDFDRFYFVTGQMDLDLYEPECVFADPFVAFEGRDRFKNNLDNLGSLMQDVSL 176

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            + + E+                                   ++     R  L LPWKP 
Sbjct: 177 DVTSWEE-------------------------------AEASIKTKWRFRCVLGLPWKPT 205

Query: 210 ISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPGDRN 251
           ++  G T +  N E  +I RH ESW +  ++A+ Q+  PG + 
Sbjct: 206 LAAAGGTEFFFNQESGRIERHVESWEIEPIDALKQLIRPGRKK 248


>gi|194701212|gb|ACF84690.1| unknown [Zea mays]
 gi|195612814|gb|ACG28237.1| hypothetical protein [Zea mays]
 gi|414585788|tpg|DAA36359.1| TPA: hypothetical protein ZEAMMB73_852215 [Zea mays]
 gi|414585789|tpg|DAA36360.1| TPA: hypothetical protein ZEAMMB73_852215 [Zea mays]
          Length = 258

 Score = 70.1 bits (170), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 67/158 (42%), Gaps = 32/158 (20%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           +   +++D+  +YFVTG  T   Y EDC F DP   FRG   + RN        E  ++ 
Sbjct: 124 VAEAIKADFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLLEKSNMK 183

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           L   E                   L + +   GH        R S +    +  PW+P++
Sbjct: 184 LTKWED------------------LEDKSI--GH-------WRFSCV----MSFPWRPIL 212

Query: 211 SIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           S  G T Y  + E  K+ RH E+WNV  +  + QIF P
Sbjct: 213 SATGYTEYYFDAESGKVCRHVENWNVPKMALLRQIFRP 250


>gi|30690220|ref|NP_182134.2| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
           thaliana]
 gi|48310194|gb|AAT41772.1| At2g46100 [Arabidopsis thaliana]
 gi|50198900|gb|AAT70469.1| At2g46100 [Arabidopsis thaliana]
 gi|330255549|gb|AEC10643.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
           thaliana]
          Length = 240

 Score = 70.1 bits (170), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 97/245 (39%), Gaps = 37/245 (15%)

Query: 8   GQTHQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSS- 66
           G   +S+ R +      GV C      D  + +        +LK A  G +ELL + +S 
Sbjct: 19  GDGGRSIFRTNRRFEATGVSCRGQNPTDEPQTSKGPEPDNVLLKIAWYG-SELLGIAASV 77

Query: 67  FDKGRLDKVSYKQKYEILVSGID---DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDP 123
           F       +    +  +  SG      +V  ++ D++ +YFVTG  T E+Y E C F DP
Sbjct: 78  FRSPETSPIVTGFEVPVDCSGRAVRVAVVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADP 137

Query: 124 TIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGG 183
              F+G   + RN        E  ++ L   E F               +G+       G
Sbjct: 138 AGSFKGLARFKRNCTNFGSLIEKSNMKLMKWENF-------------EDKGI-------G 177

Query: 184 HIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVG 242
           H        + S +    +  PWKP++S  G T Y  + E  KI RH E WNV  +    
Sbjct: 178 H-------WKFSCV----MSFPWKPILSATGYTEYYFDTESGKICRHVEHWNVPKIALFK 226

Query: 243 QIFTP 247
           Q+  P
Sbjct: 227 QLLRP 231


>gi|302789448|ref|XP_002976492.1| hypothetical protein SELMODRAFT_416514 [Selaginella moellendorffii]
 gi|300155530|gb|EFJ22161.1| hypothetical protein SELMODRAFT_416514 [Selaginella moellendorffii]
          Length = 170

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 79/166 (47%), Gaps = 33/166 (19%)

Query: 87  GIDDIVTILRSDYEN-AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
            +D+I  I+R D E+  Y+VTG  T  IY++DC F DPT   +G + Y   ++ L   F+
Sbjct: 36  AVDEIEQIIRQDIEDRQYYVTGDLTRSIYSDDCRFIDPTTNVQGVDSYVAAVKSL---FD 92

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
            PS     + + + + S      + R++G                           LKLP
Sbjct: 93  -PSDSKHELLE-IRVTSPTTIEAKWRLQG--------------------------SLKLP 124

Query: 206 WKPLISI-DGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
           WKP I+  +G+T Y LND+  +  H E+W++S L A  +  TP  R
Sbjct: 125 WKPHIAAYEGTTTYTLNDDGFVASHKETWDISLLTAFIEFLTPSFR 170


>gi|297824667|ref|XP_002880216.1| hypothetical protein ARALYDRAFT_483757 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326055|gb|EFH56475.1| hypothetical protein ARALYDRAFT_483757 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 241

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 94/226 (41%), Gaps = 40/226 (17%)

Query: 29  CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSS-FDKGRLDKVSYKQKYEILV 85
           C G  N  N+   +K   P+  +LK A  G +ELL + +S F       +    +  +  
Sbjct: 39  CRGQ-NPTNEPQTSKAPEPENVVLKIAWYG-SELLGIAASVFRSPETSPIVTGFEVPVDC 96

Query: 86  SGID---DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVP 142
           SG      +V  ++ D++ +YFVTG  T E+Y + C F DP   F+G   + RN      
Sbjct: 97  SGRAVRVAVVDSIKQDFKRSYFVTGNLTPEVYEDKCEFADPAGSFKGLARFKRNCTNFGS 156

Query: 143 FFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
             E  ++ L   E F               +G+       GH        + S +    +
Sbjct: 157 LIEKSNMKLMKWENF-------------EDKGV-------GH-------WKFSCV----M 185

Query: 203 KLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
             PWKP++S  G T Y  + E  KI RH E WNV  +    Q+  P
Sbjct: 186 SFPWKPILSATGYTEYFFDPESGKICRHVEHWNVPKIALFKQLLRP 231


>gi|168007562|ref|XP_001756477.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692516|gb|EDQ78873.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 324

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 59/182 (32%), Positives = 85/182 (46%), Gaps = 42/182 (23%)

Query: 66  SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTI 125
           +F +G+   +S +Q  EI+ + I           +  Y+VTG  T  IY ++C F DPT 
Sbjct: 178 AFTEGKQTNLSVEQVKEIIENDI----------RKGQYYVTGNLTPSIYRDNCKFTDPTT 227

Query: 126 RFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHI 185
              G   Y   +RLL   F+ P +  Q       L+S+ V       R +  +  LGG  
Sbjct: 228 VVTGVNKYVAAVRLL---FD-PDLSQQE------LLSIDVTGP----RTIEVNWRLGG-- 271

Query: 186 FLLPYLLRISLICRTYLKLPWKPLIS-IDGSTVYELNDELKITRHAESWNVSALEAVGQI 244
                          YLKLPWKP I+  +GST Y L D+  I  H E+WN+S L A+ ++
Sbjct: 272 ---------------YLKLPWKPHITPYEGSTRYTLGDDGLIESHDETWNISLLTALLEL 316

Query: 245 FT 246
           FT
Sbjct: 317 FT 318


>gi|357168254|ref|XP_003581559.1| PREDICTED: uncharacterized protein LOC100843329 [Brachypodium
           distachyon]
          Length = 249

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/162 (29%), Positives = 65/162 (40%), Gaps = 32/162 (19%)

Query: 87  GIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
           G   +   ++ D+  +YFVTG  T   Y EDC F DP   FRG   + RN        E 
Sbjct: 111 GPAQVAQAIKDDFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLGRFKRNCTNFGSLLEK 170

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
             + L   E                   L + +   GH        R S +    +  PW
Sbjct: 171 SDMKLTKWED------------------LQDKSV--GH-------WRFSCV----MSFPW 199

Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           +P++S  G T Y  + E  K+ RH E WNV  +  + QIF P
Sbjct: 200 RPVLSATGYTEYFFDAESGKVCRHVEHWNVPKMALLRQIFRP 241


>gi|384247090|gb|EIE20578.1| hypothetical protein COCSUDRAFT_10876, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 100

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 59/117 (50%), Gaps = 24/117 (20%)

Query: 113 IYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRM 172
           IY+E+C F DPT+ F G + + RNL+LLVPF E P I L ++         +VG RE + 
Sbjct: 7   IYSEECFFGDPTVSFTGLQKWRRNLQLLVPFLEDPQIELFSLG--------VVGSREKQA 58

Query: 173 RGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
                              L+      TYL+LPW+P I + GST Y L++  +  R+
Sbjct: 59  ----------------AVRLKADWRLSTYLRLPWRPFIDVLGSTEYTLDEASQQARN 99


>gi|326500042|dbj|BAJ90856.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326510695|dbj|BAJ87564.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 239

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/163 (28%), Positives = 66/163 (40%), Gaps = 32/163 (19%)

Query: 86  SGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
           SG   +   ++ D+  +YFVTG  T   Y +DC F DP   FRG   + RN        E
Sbjct: 100 SGRFQVAEAIKDDFARSYFVTGNLTLGAYEDDCEFADPAGSFRGLGRFKRNCTNFGSLLE 159

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
             ++ L   E    L    +G                          R S +    +  P
Sbjct: 160 KSNMKLTKWED---LEEKSIGH------------------------WRFSCV----MSFP 188

Query: 206 WKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           W+P++S  G T Y  + E  K+ RH E+WNV  +  + QIF P
Sbjct: 189 WRPILSATGYTEYYFDGESGKVCRHVENWNVPKMALLRQIFRP 231


>gi|308804017|ref|XP_003079321.1| unnamed protein product [Ostreococcus tauri]
 gi|116057776|emb|CAL53979.1| unnamed protein product [Ostreococcus tauri]
          Length = 232

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 31/161 (19%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           + +  +R DY+  YFV+G      Y + C F DP   FRG E + RN             
Sbjct: 23  EAIEAIREDYDENYFVSGAGKMRAYDDQCEFADPFASFRGVERFKRN------------- 69

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            + N+   +  V L +   E    GL                 + S +    L LPW+PL
Sbjct: 70  -VSNLGGLMRDVDLKITSFEETKDGLQTE-------------WKFSCV----LDLPWRPL 111

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
           ++  G T + LND+  + RH E W+V   + +  +  P  +
Sbjct: 112 LAASGGTTHTLNDDHLVVRHYERWDVEPGKVLKALLRPASK 152


>gi|125591376|gb|EAZ31726.1| hypothetical protein OsJ_15875 [Oryza sativa Japonica Group]
          Length = 243

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 89/221 (40%), Gaps = 38/221 (17%)

Query: 33  VNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRL-----DKVSYKQKYEILVSG 87
            + G  N  + ++   +L+ A  G +ELL + +SF +        D     ++      G
Sbjct: 44  ASPGAGNETSSSSENAVLRAAWYG-SELLGIAASFFRPSQPPTEGDSAGAVEEAASEPQG 102

Query: 88  IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
              +   ++ D+  +YFVTG  T + Y EDC F DP   F G + + RN        E  
Sbjct: 103 RAQVAEAVKDDFARSYFVTGNLTLKAYEEDCEFADPAGSFNGLQRFKRNCTNFGSLLEKS 162

Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
           ++ L   E                   L + +   GH        R S +    +  PW+
Sbjct: 163 NMKLTKWED------------------LEDKSI--GH-------WRFSCV----MSFPWR 191

Query: 208 PLISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
           P++S  G T Y  +    K+ RH E WNV  +  + QIF P
Sbjct: 192 PILSATGYTEYYFDAGSGKVCRHVEHWNVPKMALLRQIFRP 232


>gi|125549434|gb|EAY95256.1| hypothetical protein OsI_17076 [Oryza sativa Indica Group]
          Length = 243

 Score = 67.4 bits (163), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 89/221 (40%), Gaps = 38/221 (17%)

Query: 33  VNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRL-----DKVSYKQKYEILVSG 87
            + G  N  + ++   +L+ A  G +ELL + +SF +        D     ++      G
Sbjct: 44  ASPGAGNETSSSSENAVLRAAWYG-SELLGIAASFFRPSQPPTEGDGAGAVEEAASEPQG 102

Query: 88  IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
              +   ++ D+  +YFVTG  T + Y EDC F DP   F G + + RN        E  
Sbjct: 103 RAQVAEAVKDDFARSYFVTGNLTLKAYEEDCEFADPAGSFNGLQRFKRNCTNFGSLLEKA 162

Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
           ++ L   E                   L + +   GH        R S +    +  PW+
Sbjct: 163 NMKLTKWED------------------LEDKSI--GH-------WRFSCV----MSFPWR 191

Query: 208 PLISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
           P++S  G T Y  +    K+ RH E WNV  +  + QIF P
Sbjct: 192 PILSATGYTEYYFDAGSGKVCRHVEHWNVPKMALLRQIFRP 232


>gi|302844526|ref|XP_002953803.1| hypothetical protein VOLCADRAFT_94648 [Volvox carteri f.
           nagariensis]
 gi|300260911|gb|EFJ45127.1| hypothetical protein VOLCADRAFT_94648 [Volvox carteri f.
           nagariensis]
          Length = 344

 Score = 66.6 bits (161), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 72/160 (45%), Gaps = 32/160 (20%)

Query: 89  DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPS 148
           +DI+  +R DY+  YFV G+   E YA DC+F DP + F GT+ + +N+  L        
Sbjct: 120 EDILAAIRRDYDVQYFVRGLADMEAYAPDCVFADPFVSFSGTQRFKQNVSNLGGLMS--D 177

Query: 149 IGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
           I L ++ ++V   S +  R                         R S +    L LPW+P
Sbjct: 178 IRL-DVYEWVEGTSTLETR------------------------WRFSAL----LDLPWRP 208

Query: 209 LISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
            ++  G T + ++ D   + RH E W+V   + V Q+  P
Sbjct: 209 RLAAAGGTTHIIDLDRGLVVRHEERWDVEPAKVVAQLLQP 248


>gi|303276212|ref|XP_003057400.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461752|gb|EEH59045.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 186

 Score = 66.6 bits (161), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 70/157 (44%), Gaps = 32/157 (20%)

Query: 92  VTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGL 151
           + +LR DY+++YFV+G+     YA DC F DP + F+G + + +N              +
Sbjct: 60  IALLREDYDSSYFVSGVGELAAYAPDCEFSDPFVAFKGVDRFKQN--------------V 105

Query: 152 QNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS 211
            N+   +  V L +   E     L  S              R S +    L LPWKP ++
Sbjct: 106 GNLGGMMRDVDLKITGWEESESDLVTS-------------WRFSCV----LDLPWKPKLA 148

Query: 212 IDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
             G T +  + D  K+ +H E W+V   + V Q+ +P
Sbjct: 149 AAGGTTHVFDPDTGKVVKHIERWDVEPKKVVKQLLSP 185


>gi|223998977|ref|XP_002289161.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974369|gb|EED92698.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 324

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 81/173 (46%), Gaps = 27/173 (15%)

Query: 86  SGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
           S + + +  ++ D + +YF++G     IY EDC+F DP + F G        +  V ++E
Sbjct: 154 SSLQETLARVKLDNDRSYFLSGEVDRLIYDEDCVFADPFVAFNG--------KNFVVWYE 205

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIF-----LLPYLLR------- 193
              + ++ +  F  L+       EGR R + N   LG  I      +L Y +        
Sbjct: 206 CMYM-IRTLSSFNLLLV-----TEGRDRFIDNLANLGSFITNYDAKMLKYDVEKDGQEIV 259

Query: 194 ISLICRTYLKLPWKPLISIDGSTVYELNDELK-ITRHAESWNVSALEAVGQIF 245
             ++ +  L LPW P+++      YE++ +   IT H ESW++ A E V QIF
Sbjct: 260 TKVMVKLQLNLPWAPVLAWPWGVTYEVDPQTYLITSHVESWDIEAWEGVKQIF 312


>gi|302850531|ref|XP_002956792.1| hypothetical protein VOLCADRAFT_97844 [Volvox carteri f.
           nagariensis]
 gi|300257852|gb|EFJ42095.1| hypothetical protein VOLCADRAFT_97844 [Volvox carteri f.
           nagariensis]
          Length = 179

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 73/157 (46%), Gaps = 35/157 (22%)

Query: 94  ILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV-PFFEYPSIGL 151
           ILR D  E  YFVTG  T EI+A+DC F DPT    G   Y   L +L  P  +Y  + L
Sbjct: 46  ILRHDLAEGQYFVTGKLTREIFADDCRFVDPTNDVTGLSKYLTALGVLFDP--QYSKVEL 103

Query: 152 QNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS 211
            +I         +VG R+     +     LGG                 YL  PW+PL+ 
Sbjct: 104 LDIR--------VVGPRQ-----VEADWRLGG-----------------YLVFPWRPLVQ 133

Query: 212 -IDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
              G TVY LN+   ++   ++W++SA  A+ + FTP
Sbjct: 134 PFTGHTVYTLNESGLVSEQRQTWSISAATALLESFTP 170


>gi|159486318|ref|XP_001701188.1| SOUL3-like protein [Chlamydomonas reinhardtii]
 gi|158271888|gb|EDO97698.1| SOUL3-like protein [Chlamydomonas reinhardtii]
          Length = 263

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 70/163 (42%), Gaps = 37/163 (22%)

Query: 89  DDIVTILRSDYE-NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE-- 145
           +++  +LR D E   YFVTG  T EI+A+DC F DPT    G   Y   L++L   F+  
Sbjct: 125 EEVAAVLRRDLEVGQYFVTGDLTREIFADDCRFIDPTNDVTGLSKYLTALKVL---FDPA 181

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
           Y  + L +I                  R +     LGG                 YL  P
Sbjct: 182 YSKVQLLSISA-------------SGPRTVEADWRLGG-----------------YLVFP 211

Query: 206 WKPLI-SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           W P +    G TVY L+DE  I    ++W  SA EA+ + FTP
Sbjct: 212 WNPRVEPFKGHTVYTLSDEGLIVEQRQTWEKSAAEALRESFTP 254


>gi|116792207|gb|ABK26275.1| unknown [Picea sitchensis]
          Length = 222

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 87/212 (41%), Gaps = 38/212 (17%)

Query: 41  NNKTATPQILKFAVSGVTELLRLFSSFDKGRL---DKVSYKQKYEILVSGIDDIVTILRS 97
           N   A P   KF    V + + L S F  G       V+ + K   L   I++I +I+ +
Sbjct: 39  NCTGAIPGAGKFGRRTVLQKMALLSVFIAGNNACGGGVASQNKKTNL--SIEEIKSIIEN 96

Query: 98  DYENA-YFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
           D     Y+VTG  T EIY  +C F DPT    G E Y   ++ L      P+   Q    
Sbjct: 97  DISKGQYYVTGDMTKEIYENNCRFRDPTTDLTGLEKYISAVKFLF----NPNTSKQE--- 149

Query: 157 FVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP-LISIDGS 215
              L+S+ V                     + P+ +        YLK PW P ++  +GS
Sbjct: 150 ---LLSIAV---------------------VDPHTIEAKWRLEGYLKFPWNPHILPYEGS 185

Query: 216 TVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           T Y L+D   I  H E+W++S   A+ +   P
Sbjct: 186 TRYVLDDRGLIMSHEETWDISVWTALQEFLLP 217


>gi|434388028|ref|YP_007098639.1| hypothetical protein Cha6605_4167 [Chamaesiphon minutus PCC 6605]
 gi|428019018|gb|AFY95112.1| hypothetical protein Cha6605_4167 [Chamaesiphon minutus PCC 6605]
          Length = 127

 Score = 64.7 bits (156), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 68/156 (43%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+T LR DY    F     + EIYAED  F+DP  +FRG   Y + +  +  +FE+P +
Sbjct: 2   DIITTLRQDYAKFPFDQ---SYEIYAEDVYFKDPLTQFRGLPRYRKTIEFIQKWFEHPHL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  I++    V  ++  R                     +LL  +        LPW P 
Sbjct: 59  EMYEIDR----VDRLITTR---------------------WLLSWN------TPLPWHPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           I I G +   LND   I  H + W+ S L+ + Q F
Sbjct: 88  IEIPGKSELTLNDADLIISHIDYWDRSPLDVLKQHF 123


>gi|412989127|emb|CCO15718.1| predicted protein [Bathycoccus prasinos]
          Length = 288

 Score = 63.9 bits (154), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 40/173 (23%), Positives = 80/173 (46%), Gaps = 30/173 (17%)

Query: 78  KQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNL 137
           KQ+  ++ +  + I ++ + DYE  YF+ G    + Y+  C+F DP + F G + + +N+
Sbjct: 68  KQEENVMTTRAETIDSLAK-DYEKNYFIGGESEMKAYSSSCVFADPFVSFTGLDRFKQNV 126

Query: 138 RLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLI 197
             L          L+++E  V             ++ + N   +GG IF   +   +   
Sbjct: 127 GNL-------GTSLRDVECKV-------------LKTVDN--GVGGVIFYWKFSAVVD-- 162

Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
                 LPW+P ++  G+T + L+D  K+ +H E+W+V     + ++  P  +
Sbjct: 163 -----ALPWRPKLAASGNTTHVLDDANKVVKHIEAWDVDPWVVLKKLLVPASK 210


>gi|302840933|ref|XP_002952012.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
           nagariensis]
 gi|300262598|gb|EFJ46803.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
           nagariensis]
          Length = 200

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 74/158 (46%), Gaps = 37/158 (23%)

Query: 86  SGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
           S +  I  I R   E  Y+VTG  T +++A DC+F+DPT+   G E Y R L+ L   F+
Sbjct: 52  SALKAIDAIRRDFVERQYYVTGNLTPQLFAPDCVFKDPTVEVVGVEPYVRALQAL---FD 108

Query: 146 YPSIG---LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
            PS     L +I        ++  R EG ++       L G + + PY            
Sbjct: 109 -PSTSRADLISIRATAPSTVVLRWRLEGSLK-------LAG-LKIKPY------------ 147

Query: 203 KLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEA 240
                      G+TVY L+D+ K+ RH E+W++S ++A
Sbjct: 148 ----------TGTTVYTLSDDGKVIRHEETWDISTVDA 175


>gi|354564738|ref|ZP_08983914.1| Protein of unknown function DUF2358 [Fischerella sp. JSC-11]
 gi|353549864|gb|EHC19303.1| Protein of unknown function DUF2358 [Fischerella sp. JSC-11]
          Length = 130

 Score = 63.5 bits (153), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 65/156 (41%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIV IL+ DY+         T  IYAE+  F+D   +FRG +LY   ++ +  FF  P +
Sbjct: 2   DIVQILKEDYQR---FPANQTYSIYAENVYFQDQVFKFRGIQLYKLMIKFIETFFLNPKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I+K            EG       + +                       LPWKP 
Sbjct: 59  DLHDIQK------------EGDTIKTKWTLSWNS-------------------PLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           ISI G +   LN +  I  H + WN S L+ + Q F
Sbjct: 88  ISIPGWSELRLNCDGLIISHIDYWNCSRLDVIKQHF 123


>gi|186684325|ref|YP_001867521.1| hypothetical protein Npun_R4203 [Nostoc punctiforme PCC 73102]
 gi|186466777|gb|ACC82578.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 123

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 64/156 (41%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T  IYA D  F+DP  +FRG + Y + +  +  +F  P +
Sbjct: 2   DIIEILKKDYQRFPINQ---TYSIYAPDVYFQDPLNKFRGVKRYQKMINFIQTWFLDPKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L NI++                        LG  I       +          LPWKP 
Sbjct: 59  DLHNIQR------------------------LGDTI-------KTEWTLSWNTPLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           ISI G +   LN +  I  H + WN S L+ V Q F
Sbjct: 88  ISIPGWSELGLNSDGLIVSHVDYWNCSPLDVVKQHF 123


>gi|303281246|ref|XP_003059915.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458570|gb|EEH55867.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 266

 Score = 63.2 bits (152), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 51/163 (31%), Positives = 66/163 (40%), Gaps = 34/163 (20%)

Query: 88  IDDIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
           IDDI   L  D  E  YFVTG  T E++A+DC F DPT    G   Y   L LL      
Sbjct: 129 IDDIAKQLEVDLREGQYFVTGKLTRELFADDCRFVDPTNDVVGLSRYLTALDLL------ 182

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
                 N    V L  + V                       P  +      + YL+ PW
Sbjct: 183 ----FDNDTSSVELFDVRVTS---------------------PTTIEADWSLQGYLRFPW 217

Query: 207 KPLI-SIDGSTVYELNDELK-ITRHAESWNVSALEAVGQIFTP 247
           KP +   DG T Y LN E   I    ++W++S  +A+ + FTP
Sbjct: 218 KPRVEPYDGHTTYVLNPENGLIQTQTQTWSISGAKAIAESFTP 260


>gi|255082958|ref|XP_002504465.1| predicted protein [Micromonas sp. RCC299]
 gi|226519733|gb|ACO65723.1| predicted protein [Micromonas sp. RCC299]
          Length = 237

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 73/180 (40%), Gaps = 35/180 (19%)

Query: 70  GRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRG 129
           G + +V   ++ ++    + D   + R   +  YFVTG  T EI+A+DC F DPT    G
Sbjct: 85  GEVKRVGGDKRTDLTAEKVRD--QLERDLRDGQYFVTGNLTREIFADDCRFTDPTNDVVG 142

Query: 130 TELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLP 189
              Y   L LL                 V L  + V                       P
Sbjct: 143 LSRYLTALGLL----------FDPATSSVDLYDIKV---------------------TSP 171

Query: 190 YLLRISLICRTYLKLPWKPLISI-DGSTVYELNDELKITRHA-ESWNVSALEAVGQIFTP 247
             +      + YL+ PWKP +    G TVY L+ E K+ R   ++WN+S  +A+ + FTP
Sbjct: 172 NTVEADWQLQGYLRFPWKPRVDPYSGHTVYSLDPETKLVRSQFQTWNISGAKAIAESFTP 231


>gi|427723122|ref|YP_007070399.1| hypothetical protein Lepto7376_1208 [Leptolyngbya sp. PCC 7376]
 gi|427354842|gb|AFY37565.1| Protein of unknown function DUF2358 [Leptolyngbya sp. PCC 7376]
          Length = 126

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 34/159 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T EIYA+D  F+DP   FRG   Y + +  L  +F+   +
Sbjct: 2   DIIEILQRDYQQ---FPKEQTFEIYADDVFFKDPLNEFRGVTKYKKMIGFLGRWFQNIQL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I++                    N+T              +++ C     LPW+P 
Sbjct: 59  ELHEIQR-------------------DNATICTD--------WTLNMTC----PLPWQPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
           +SI G ++ E+N++  I  H + WN + L+ +GQ+F  G
Sbjct: 88  LSISGYSLLEVNEQDLIISHIDYWNDAPLKVLGQVFKFG 126


>gi|298713054|emb|CBJ48829.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 240

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 89/208 (42%), Gaps = 29/208 (13%)

Query: 46  TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFV 105
           T   +K  V G+T+L   F+  D+   + +S     +  +S  + +   +R +Y   Y  
Sbjct: 55  TSGSVKAFVGGLTDLFVRFAGGDEDS-EALSADTSAKASLSEAE-LEAGIRQEYAKNYLW 112

Query: 106 TGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMV 165
           TG    ++Y EDC F DPT+ F G   Y RN+  L    +   + ++N    ++   L  
Sbjct: 113 TGDINEDLYEEDCNFTDPTLSFSGLSTYKRNVGSLQGVLD---LFVRNSRSVLYSCEL-- 167

Query: 166 GRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL- 224
            R+E                       R  ++    L+LPW P I + G T +  +    
Sbjct: 168 -RQEDSC-----------------VQTRWRMVGD--LRLPWGPTIDVVGRTTFSYDSSRG 207

Query: 225 -KITRHAESWNVSALEAVGQIFTPGDRN 251
            +I  + E W + A EA+ QIF PG R+
Sbjct: 208 NRIFSYDEVWEMEAAEALMQIFRPGTRD 235


>gi|159487172|ref|XP_001701609.1| SOUL heme-binding protein [Chlamydomonas reinhardtii]
 gi|158271550|gb|EDO97367.1| SOUL heme-binding protein [Chlamydomonas reinhardtii]
          Length = 323

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 71/163 (43%), Gaps = 34/163 (20%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           +I+  +R+DY+  YFV G+     Y   C+F DP + F GT+ + +N             
Sbjct: 102 EILASIRADYDEQYFVRGVSEMSAYDPQCVFADPFVSFSGTQRFKQN------------- 148

Query: 150 GLQNIEKFVFLVSLMV-GRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
            + N+   +  + L V   +EG      N+         L    R S +    L LPW+P
Sbjct: 149 -VSNLGGLMSDIKLDVYDWQEG-----DNT---------LTTRWRFSCL----LDLPWRP 189

Query: 209 LISIDGSTVYELNDELK-ITRHAESWNVSALEAVGQIFTPGDR 250
            ++  G T + ++     + RH E W V   + V Q+FTP  +
Sbjct: 190 RLAAAGGTTHVIDPARGLVVRHEERWEVEPAKVVAQLFTPAAK 232


>gi|434402210|ref|YP_007145095.1| hypothetical protein Cylst_0032 [Cylindrospermum stagnale PCC 7417]
 gi|428256465|gb|AFZ22415.1| hypothetical protein Cylst_0032 [Cylindrospermum stagnale PCC 7417]
          Length = 125

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 71/158 (44%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+  + V   ++  IYA D  F+DP  +FRG + Y + +  +  +F  P +
Sbjct: 2   DIIQILKDDYQR-FPVNQTYS--IYAPDVYFQDPLNQFRGVKRYQKMINFIQTWFINPKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L NI++        +G +      +     LG +                   LPWKP 
Sbjct: 59  DLHNIQQ--------LGDK------IKTEWTLGWNT-----------------PLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           ISI G +   LN +  I  H + W+ S L+ + Q F P
Sbjct: 88  ISISGWSELGLNSDGFIVSHIDYWHCSRLDVLKQHFFP 125


>gi|255076935|ref|XP_002502130.1| predicted protein [Micromonas sp. RCC299]
 gi|226517395|gb|ACO63388.1| predicted protein [Micromonas sp. RCC299]
          Length = 176

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 68/160 (42%), Gaps = 32/160 (20%)

Query: 89  DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPS 148
           + +V +L+ DY+ +YFV+G+     Y  DC F DP + F+G + + +N            
Sbjct: 47  EQVVALLKEDYDQSYFVSGVGELAAYDPDCEFADPFVSFKGVDRFKQN------------ 94

Query: 149 IGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
             + N+   +  + L +   + R   L  S              R S +    L LPW+P
Sbjct: 95  --VGNLGGMMRDIDLKITGWDERADELETS-------------WRFSCV----LDLPWRP 135

Query: 209 LISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
            ++  G T +  +    K+ RH E W+V     V Q+  P
Sbjct: 136 KLAAAGGTTHVFDPATGKVIRHVERWDVRPGVVVKQLLLP 175


>gi|428208414|ref|YP_007092767.1| hypothetical protein Chro_3440 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428010335|gb|AFY88898.1| Protein of unknown function DUF2358 [Chroococcidiopsis thermalis
           PCC 7203]
          Length = 126

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 63/156 (40%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ ILR DY+         T  IYA D  F+DP  RFRG E Y + +  +  +F    +
Sbjct: 2   DIIQILRQDYQR---FPDNQTYSIYASDVFFQDPLNRFRGVERYKQMINFINTWFIAVKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I            R+EG                     ++          LPWKP 
Sbjct: 59  DLHDI------------RQEGDT-------------------IKTEWTLSWNTPLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           I+I G +   LN +  I  H + WN S L+ V Q F
Sbjct: 88  IAIPGWSELHLNPQGLIDSHIDYWNCSRLDVVKQHF 123


>gi|427708881|ref|YP_007051258.1| hypothetical protein Nos7107_3539 [Nostoc sp. PCC 7107]
 gi|427361386|gb|AFY44108.1| Protein of unknown function DUF2358 [Nostoc sp. PCC 7107]
          Length = 127

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 67/162 (41%), Gaps = 42/162 (25%)

Query: 90  DIVTILRSDY----ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
           DI+  L++DY    EN        T  IYAE+  F+D   +FRG +LY   ++ +  FF 
Sbjct: 2   DIIETLKNDYQRFPENQ-------TYSIYAENVYFQDMVFKFRGVKLYQWMIKFIKTFFL 54

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
            P + L NI++                        LG  I       +          LP
Sbjct: 55  NPKMDLHNIQR------------------------LGDTI-------KTEWTLSWNSPLP 83

Query: 206 WKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           WKP I+I G +   LN +  I  H + WN S ++ + Q F P
Sbjct: 84  WKPRIAISGWSELLLNADDLIISHIDYWNCSQVDVLKQHFFP 125


>gi|307108501|gb|EFN56741.1| hypothetical protein CHLNCDRAFT_144171 [Chlorella variabilis]
          Length = 202

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 78/179 (43%), Gaps = 36/179 (20%)

Query: 70  GRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRG 129
           GR+++V   +   + +S  +DI  + R+  E  YFVTG  T EI+A+DC F+DPT    G
Sbjct: 45  GRVERVGGDKL--VGLSPEEDI--LARNLREGQYFVTGDLTPEIFADDCRFKDPTNETAG 100

Query: 130 TELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLP 189
              Y + L LL                  F  S             +++  L G     P
Sbjct: 101 LSRYMKALTLL------------------FDAS-------------YSAVQLVGISVTSP 129

Query: 190 YLLRISLICRTYLKLPWKPLIS-IDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
             +        YL+ PW P +    G TVY LN++  I    ++W+++   A+ + FTP
Sbjct: 130 TTIEADWKMGGYLRFPWHPRVEPFLGHTVYHLNEQGLIALQDQTWSITGTTALVESFTP 188


>gi|440679743|ref|YP_007154538.1| Protein of unknown function DUF2358 [Anabaena cylindrica PCC 7122]
 gi|428676862|gb|AFZ55628.1| Protein of unknown function DUF2358 [Anabaena cylindrica PCC 7122]
          Length = 129

 Score = 60.5 bits (145), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/154 (29%), Positives = 67/154 (43%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL++DY+         T  IYA D  F+DP  +FRG ELY   ++ +  FF    +
Sbjct: 2   DIIEILKADYQRFPINQ---TYSIYAPDVYFQDPVFKFRGLELYKWMIKFIQTFFLNLKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L NI+           R+E  ++                     +L   +   LPWKP 
Sbjct: 59  DLHNIQ-----------RQEDTIKS------------------EWTLSWNS--PLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           ISI G +   LN +  I  H + W+ S L+ + Q
Sbjct: 88  ISISGWSELRLNADSLIISHIDYWHGSRLDVLKQ 121


>gi|300863520|ref|ZP_07108472.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300338476|emb|CBN53614.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 129

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 68/158 (43%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T  IYAE   F+DP  RF+G + Y + +  +  FF+   +
Sbjct: 2   DIIEILKEDYKK---FPAAQTYSIYAEKVYFQDPMNRFKGIDRYQQMIGFMSTFFKDIKL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L NI +        VG R      +     L   +                  +PW+P 
Sbjct: 59  DLHNISQ--------VGDR------IETRWTLSWTV-----------------PVPWQPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           I++ G +  +LN E  I+ H + WN S L+ + Q+  P
Sbjct: 88  IAVPGWSELKLNTEGLISSHIDYWNCSRLDVLKQLLFP 125


>gi|307109547|gb|EFN57785.1| hypothetical protein CHLNCDRAFT_143130 [Chlorella variabilis]
          Length = 257

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 44/177 (24%), Positives = 71/177 (40%), Gaps = 33/177 (18%)

Query: 71  RLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGT 130
           +LD  S  ++        +  + ++R D+   YFVTG      Y  DC+F DP   F GT
Sbjct: 29  KLDGPSLAEQQAGAALKREQALALIREDFIQNYFVTGQGALAAYDPDCLFADPFASFNGT 88

Query: 131 ELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPY 190
             + RN+  L          L +I+  + L     G  E R +   ++T  G        
Sbjct: 89  ARFKRNVSNLGGL-------LTDID--LTLTDWQEGEDELRTKWRFSATLSG-------- 131

Query: 191 LLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
                        LPWKPL++  G T +      ++ +H ESW++     + Q+  P
Sbjct: 132 -------------LPWKPLLAAAGGTTFATG---RVCKHIESWDIEPARVLRQLLKP 172


>gi|428303699|ref|YP_007140524.1| hypothetical protein Cri9333_0009 [Crinalium epipsammum PCC 9333]
 gi|428245234|gb|AFZ11014.1| Protein of unknown function DUF2358 [Crinalium epipsammum PCC 9333]
          Length = 127

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 67/154 (43%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+  + V   ++  IYAE+  F+DP  +FRG E Y + +R +  +F  P +
Sbjct: 2   DIIEILKQDYQ-KFPVDQTYS--IYAENVYFQDPLNKFRGVERYKQMIRFINQWFINPQL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
                                    LH    LG  I       R +L   T   LPWKP 
Sbjct: 59  ------------------------DLHEINQLGDTI-----KTRWTLSWTT--PLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   LN E  I  H + W+ S  + + Q
Sbjct: 88  IAIPGWSELHLNAEKLIDSHIDYWDCSRFDVITQ 121


>gi|255089807|ref|XP_002506825.1| predicted protein [Micromonas sp. RCC299]
 gi|226522098|gb|ACO68083.1| predicted protein [Micromonas sp. RCC299]
          Length = 226

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 25/160 (15%)

Query: 94  ILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQN 153
           I R   E  Y  TG    E+Y   C F DPT+ F G E + RNL  L P           
Sbjct: 83  IAREFTEAKYLWTGDINPEMYDLYCTFTDPTLSFAGLETFQRNLANLQP----------- 131

Query: 154 IEKFVFLVSLMVGRREGRMRGLHNST-ALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
                 ++S +V  R+  +  L++   A  G        +R        L++PW+P I +
Sbjct: 132 ------VLSRLV--RDSDVEQLYSCELAGDGKGGGGGGAVRARWRMTGNLRVPWRPRIDL 183

Query: 213 DGSTVYELND-----ELKITRHAESWNVSALEAVGQIFTP 247
           +G T +   D        IT + E W +SA EAV Q+ TP
Sbjct: 184 EGQTTFTFKDYGGDRGCLITAYQEEWGLSAGEAVMQLVTP 223


>gi|397643843|gb|EJK76114.1| hypothetical protein THAOC_02143 [Thalassiosira oceanica]
          Length = 658

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/157 (26%), Positives = 67/157 (42%), Gaps = 28/157 (17%)

Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVF 159
           EN Y  TG   S  +A DC F DPT+ F+G + +  N++ LVP  ++   G  +      
Sbjct: 511 ENNYLWTGNIDSSSFAADCTFTDPTLSFQGVDKFVSNVQNLVPVVDFLLEGKTDENSESV 570

Query: 160 LVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVY- 218
           L+ + +   E  +    N                   +  +  +LPW+P I + G T + 
Sbjct: 571 LLDIRLNEEESYVETRWN-------------------MVGSLKRLPWRPKIDVIGRTKFW 611

Query: 219 ---ELNDELKITR-----HAESWNVSALEAVGQIFTP 247
              EL D+  + R     + E W + A  A+ Q+ TP
Sbjct: 612 YETELRDDDSVARVQVNFYDEQWEIPAGLALMQLITP 648


>gi|428218507|ref|YP_007102972.1| hypothetical protein Pse7367_2281 [Pseudanabaena sp. PCC 7367]
 gi|427990289|gb|AFY70544.1| Protein of unknown function DUF2358 [Pseudanabaena sp. PCC 7367]
          Length = 154

 Score = 59.3 bits (142), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 71/164 (43%), Gaps = 24/164 (14%)

Query: 80  KYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRL 139
           K EI    ID+++ I+ +DY  A F     T  IYAED  F+DP   FRG + Y + +  
Sbjct: 7   KAEIQPKSIDEMIEIVIADY--AKFPEAQ-TYAIYAEDVYFKDPVYEFRGLKQYQKMIGF 63

Query: 140 LVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICR 199
           +  +F   ++ L  IE+       +   +     G+                ++      
Sbjct: 64  ITKWFANLNLALHTIEE-------VETSQSNPTEGVTT--------------IKTEWTMS 102

Query: 200 TYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
               +PWKP IS+DG +   +N + +I  H + W+ +  + V Q
Sbjct: 103 WNSPMPWKPRISVDGWSELGINHQGQIISHVDYWHCTKWDVVKQ 146


>gi|224005118|ref|XP_002296210.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209586242|gb|ACI64927.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 279

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/227 (25%), Positives = 104/227 (45%), Gaps = 19/227 (8%)

Query: 31  GTVNDGNKNTNNKT--ATPQILKFAVSGVTELLRLFS--SFDKGRLDKVSYKQKY-EILV 85
            T ++   +TNN +  ++   +K  VS +T +   FS  S D+      S +Q     L+
Sbjct: 54  STQDNAENDTNNSSYESSASFVKGLVSSLTSVTNYFSLSSQDEPGPSSPSMQQTAIATLL 113

Query: 86  S---GIDDIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
           S      +++  +R+DY EN Y  TG     ++++DC F DPT+ F G + Y  N+  LV
Sbjct: 114 SPPTSPSELLERIRADYAENNYLWTGKLDVSLFSKDCRFTDPTLSFEGIDNYVTNVGNLV 173

Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
           P  E+  +G +   +   L+ +     EG +    N   + G +  +P+  RI +I RT 
Sbjct: 174 PVVEF-LLGKEQSSQSK-LLDISCNEEEGYVETRWN---MIGDLNAIPWSPRIDVIGRTK 228

Query: 202 LKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
               W    + +    ++    L++  + E W + A  A+ Q  TP 
Sbjct: 229 F---WYEATTNEEDKQHQA--ALQVYFYDEKWEIPAGLALLQFITPA 270


>gi|255074113|ref|XP_002500731.1| predicted protein [Micromonas sp. RCC299]
 gi|226515994|gb|ACO61989.1| predicted protein [Micromonas sp. RCC299]
          Length = 297

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 65/159 (40%), Gaps = 24/159 (15%)

Query: 94  ILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQN 153
           I R   E  Y  TG    E+Y   C F DPT+ F G E + RNL  L P           
Sbjct: 155 IAREFTEAKYLWTGDINPEMYDLYCTFTDPTLSFAGLETFQRNLANLQP----------- 203

Query: 154 IEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISID 213
                 ++S +V  R+  +       A  G        +R        L++PW+P I ++
Sbjct: 204 ------VLSRLV--RDSDVELYSCELAGDGKGGGGGGAVRARWRMTGNLRVPWRPRIDLE 255

Query: 214 GSTVYELND-----ELKITRHAESWNVSALEAVGQIFTP 247
           G T +   D        IT + E W +SA EAV Q+ TP
Sbjct: 256 GQTTFTFKDYGGDRGCLITAYQEEWGLSAGEAVMQLVTP 294


>gi|282896015|ref|ZP_06304046.1| conserved hypothetical protein [Raphidiopsis brookii D9]
 gi|281199125|gb|EFA73995.1| conserved hypothetical protein [Raphidiopsis brookii D9]
          Length = 130

 Score = 58.5 bits (140), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 65/154 (42%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           +I+  L++DY+         T  IYA D  F+DP  +FRG ELY   ++ +  FF    +
Sbjct: 3   NIIETLKADYQRFPLDQ---TYSIYALDVYFQDPVFKFRGLELYKWMIKFIHIFFTNLRM 59

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +IE+            +  M     + +                       LPWKP 
Sbjct: 60  DLHHIEQ------------DQNMIKTEWTLSWNA-------------------SLPWKPR 88

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           ISI G T   LN++  I+ H + W+ S L+ + Q
Sbjct: 89  ISISGWTQLSLNNQGLISSHIDYWHCSRLDVLKQ 122


>gi|443315376|ref|ZP_21044870.1| hypothetical protein Lep6406DRAFT_00038910 [Leptolyngbya sp. PCC
           6406]
 gi|442785015|gb|ELR94861.1| hypothetical protein Lep6406DRAFT_00038910 [Leptolyngbya sp. PCC
           6406]
          Length = 137

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/153 (27%), Positives = 64/153 (41%), Gaps = 33/153 (21%)

Query: 95  LRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNI 154
           LR DY          + ++YA D  FEDP  RF+G + Y + +  +  +F  P++ L  +
Sbjct: 16  LRQDYAQ---FPAHQSYDLYAADVQFEDPLNRFQGVQKYQKMIGFIDRWFIDPTLDLHEL 72

Query: 155 EKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDG 214
                                H+   +  H  L       S +      LPWKP I+I G
Sbjct: 73  -------------------ICHDPDQIQTHWTL-------SWVA----PLPWKPGIAISG 102

Query: 215 STVYELNDELKITRHAESWNVSALEAVGQIFTP 247
            T Y LN E +I  H + W+ S L+ + Q+  P
Sbjct: 103 WTDYRLNAEGQICAHIDHWHCSRLDVLKQVLAP 135


>gi|428211396|ref|YP_007084540.1| hypothetical protein Oscil6304_0887 [Oscillatoria acuminata PCC
           6304]
 gi|427999777|gb|AFY80620.1| hypothetical protein Oscil6304_0887 [Oscillatoria acuminata PCC
           6304]
          Length = 132

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 69/158 (43%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T  +Y+E   F+DP   FRG + Y + +  +  +F+   +
Sbjct: 2   DIIEILKQDYQQ---FPENQTYSLYSETVYFKDPMTEFRGCDRYRQMIGFMSTWFKQIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I                  R + N+              R +L   T   LPW+P 
Sbjct: 59  DLHDI------------------RRIDNTIET-----------RWTLNWTT--PLPWQPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           I+I G +  +LN E KI  H + W++S L+ + Q F P
Sbjct: 88  IAIPGRSELQLNPEGKIISHIDYWDISRLDVLKQHFKP 125


>gi|427733624|ref|YP_007053168.1| hypothetical protein Riv7116_0005 [Rivularia sp. PCC 7116]
 gi|427368665|gb|AFY52621.1| hypothetical protein Riv7116_0005 [Rivularia sp. PCC 7116]
          Length = 129

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 44/158 (27%), Positives = 62/158 (39%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIV IL+ DY+         T  IYAED  F+DP   F G E Y   ++ +  +F    +
Sbjct: 2   DIVEILKEDYQR---FPENQTYSIYAEDVYFKDPMNEFNGVERYKLMIKFIQTWFVQTQM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I            RREG                     ++          LPWKP 
Sbjct: 59  DLHDI------------RREGDT-------------------IKTEWTLNWNTPLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           ISI G +   +N +  I  H + W  S  + + Q F+P
Sbjct: 88  ISIPGWSELGVNSDNLIVSHVDYWKCSRFDVLKQHFSP 125


>gi|282899336|ref|ZP_06307305.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281195793|gb|EFA70721.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 130

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 34/156 (21%)

Query: 88  IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
           + +I+  L++DY+         T  IYA D  F+DP  +FRG ELY   ++ +  FF   
Sbjct: 1   MTNIIETLKADYQRFPLEQ---TYSIYALDVYFQDPVFKFRGLELYKWMIKFIHIFFTNL 57

Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
            + L +IE+            +  M                   ++          LPWK
Sbjct: 58  RMDLHHIEQ------------DKNM-------------------IKTEWTLSWSASLPWK 86

Query: 208 PLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           P ISI G T   LN++  I+ H + W+ S L+ + Q
Sbjct: 87  PRISISGWTQLGLNNQGLISSHIDYWHCSRLDVLKQ 122


>gi|434392432|ref|YP_007127379.1| Protein of unknown function DUF2358 [Gloeocapsa sp. PCC 7428]
 gi|428264273|gb|AFZ30219.1| Protein of unknown function DUF2358 [Gloeocapsa sp. PCC 7428]
          Length = 129

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 62/158 (39%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIV  L+ DY+         T  IYA+D  F+DP   FRG E Y   +  +  +F  P +
Sbjct: 2   DIVETLKEDYQR---FPANQTYSIYAQDVYFKDPLNEFRGVERYKAMIGFIETWFIAPKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I            RREG                     ++          +PWKP 
Sbjct: 59  DLHDI------------RREGDT-------------------IKTEWTLSWNTPVPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           I+I G +   +N +  IT H + W  S L+ + Q   P
Sbjct: 88  IAIPGWSELRVNQQDIITSHVDYWKCSRLDVIKQHLFP 125


>gi|397569802|gb|EJK46976.1| hypothetical protein THAOC_34333 [Thalassiosira oceanica]
          Length = 233

 Score = 57.8 bits (138), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 41/159 (25%), Positives = 68/159 (42%), Gaps = 30/159 (18%)

Query: 88  IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
           +D+ +  ++ D + +YF++G     IY E C F DP + F G + +  NL  L  F    
Sbjct: 95  LDETLQRIQLDNDRSYFLSGEIDEMIYDEQCTFADPFVSFDGRQRFVDNLANLGSFITDY 154

Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
           S    N                      ++ +  G  I       +  ++ +  L LPWK
Sbjct: 155 SAKTIN----------------------YDVSENGKQI-------QTKVMVKLELNLPWK 185

Query: 208 PLISIDGSTVYELNDE-LKITRHAESWNVSALEAVGQIF 245
           P+++      Y +++    IT H ESW+VS  E + QIF
Sbjct: 186 PVLAWPWGVKYVIDEATFLITDHIESWDVSPSEGIAQIF 224


>gi|145343266|ref|XP_001416306.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576531|gb|ABO94599.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 136

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/165 (31%), Positives = 70/165 (42%), Gaps = 38/165 (23%)

Query: 92  VTILRSDYEN-AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           +  LR D     YFVTG  T  I+A DC F DPT   RG E Y R +R L  F E  S  
Sbjct: 1   MATLREDVATRQYFVTGELTEAIFARDCRFVDPTTDVRGLERYLRAVRAL--FDERRSS- 57

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
                     V L+     G +R +  +T  G +                YLKLPW P +
Sbjct: 58  ----------VRLI-----GDLRLVDATTIEGDYR------------AEGYLKLPWNPRV 90

Query: 211 S-IDGSTVYELN------DELKITRHAESWNVSALEAVGQIFTPG 248
              +G   ++L       D   I    ++W++S  EA+ + FTPG
Sbjct: 91  PPYEGHITWKLRPNDGSADAGLIVEQRQTWSISGGEALRETFTPG 135


>gi|219109759|ref|XP_002176633.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411168|gb|EEC51096.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 231

 Score = 57.4 bits (137), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 86/212 (40%), Gaps = 38/212 (17%)

Query: 36  GNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTIL 95
             + + N      + K +  GV    ++F S    + D  +Y    E   S +++    L
Sbjct: 49  AQEESKNTKEVDLLTKASWYGVEVFGKVFGS----KRDSSTYAYSLEQPPSSVEETKARL 104

Query: 96  RSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF-EYPSIGLQNI 154
           ++D    YF++G     IY + C+F DP + F+G + +  NL  L  F  +Y +  L   
Sbjct: 105 QADNNREYFLSGEVDKCIYDDQCVFADPFVSFKGRDRFVENLVNLGSFVTKYSAKPLSYY 164

Query: 155 EKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISID- 213
           E+   +V+  + + E                                L LPWKP+++   
Sbjct: 165 EEDEAVVTKFMVKLE--------------------------------LNLPWKPVLAWPW 192

Query: 214 GSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           G       D   IT H ESW+++A E V QIF
Sbjct: 193 GVRCVIDRDTNLITLHEESWDIAAWEGVKQIF 224


>gi|427727295|ref|YP_007073532.1| hypothetical protein Nos7524_0006 [Nostoc sp. PCC 7524]
 gi|427363214|gb|AFY45935.1| hypothetical protein Nos7524_0006 [Nostoc sp. PCC 7524]
          Length = 125

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 63/158 (39%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL++DY+         T  IYAED  F+DP   FRG E Y + ++ +  +F     
Sbjct: 2   DIIAILKADYQR---FPDHQTYNIYAEDVYFQDPLNSFRGVERYKKMIKFIQTWF----- 53

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
                                RM  LH+   LG  I       +          LPWKP 
Sbjct: 54  ------------------LNCRM-DLHHIQQLGDTI-------KTEWTLSWNTPLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           I I G +   LN +  I  H + W+ S L+ + Q   P
Sbjct: 88  IEISGWSELGLNSDGLIVSHIDYWHCSKLDVLKQHIIP 125


>gi|119509169|ref|ZP_01628320.1| hypothetical protein N9414_05190 [Nodularia spumigena CCY9414]
 gi|119466335|gb|EAW47221.1| hypothetical protein N9414_05190 [Nodularia spumigena CCY9414]
          Length = 125

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 41/158 (25%), Positives = 68/158 (43%), Gaps = 35/158 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T  IYAE+  F+DP  +FRG + Y + ++ +  +F  P +
Sbjct: 2   DIIEILKQDYQRFPLNQ---TYSIYAENVYFQDPLTKFRGVKRYQQMIKFIQTWFINPQM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            + +I++                        LG  I       +          +PWKP 
Sbjct: 59  DVHDIQR------------------------LGDTI-------KTEWTLSWNTPIPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ-IFT 246
           ISI G +  ++N    I  H + W+ S L+ + Q +FT
Sbjct: 88  ISISGWSELKINSADLIISHIDYWHCSPLDVIKQHLFT 125


>gi|414077348|ref|YP_006996666.1| hypothetical protein ANA_C12109 [Anabaena sp. 90]
 gi|413970764|gb|AFW94853.1| hypothetical protein ANA_C12109 [Anabaena sp. 90]
          Length = 130

 Score = 57.0 bits (136), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 64/156 (41%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIV  L++DY  A F     T  IYA+D  F+D   +FRG ELY   ++ +  FF    +
Sbjct: 2   DIVETLKADY--ARFPVNQ-TYSIYADDVYFQDAVFKFRGIELYKWMIKFIQTFFSNLKL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I+           ++E                      ++          LPWKP 
Sbjct: 59  DLHTIQ----------SQQEN---------------------IKTEWTMSWNSPLPWKPH 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           IS+ G +   LN +  I  H + W+ S L+ + Q F
Sbjct: 88  ISVSGWSELRLNADGLIVSHIDYWHCSRLDVIKQHF 123


>gi|412989019|emb|CCO15610.1| predicted protein [Bathycoccus prasinos]
          Length = 187

 Score = 57.0 bits (136), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 75/175 (42%), Gaps = 44/175 (25%)

Query: 86  SGIDDIVTILRSD-----------YENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYS 134
           + I++I  ILR D           +  AYFVTG  T EI+ +DC F DPT   +    Y 
Sbjct: 43  ASIEEIKEILRRDLQPVEKNADGTFRGAYFVTGNVTPEIFEDDCRFIDPTNTTKSLSKYV 102

Query: 135 RNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRI 194
             L++L    E  S+ L++IE           + E  + G+++                 
Sbjct: 103 NALKILFD-PENSSVDLRSIEV----------KDENTIVGIYD----------------- 134

Query: 195 SLICRTYLKLPWKPLIS-IDGSTVYELNDELKITRHAESWN-VSALEAVGQIFTP 247
              C  YLKLPW P +    G+  ++ +    I    + W+ VSA  A+ + FTP
Sbjct: 135 ---CEGYLKLPWHPRVKPYTGTVTWKTSKNGLIESQTQEWSGVSAAGAIMESFTP 186


>gi|428165198|gb|EKX34199.1| hypothetical protein GUITHDRAFT_90543 [Guillardia theta CCMP2712]
          Length = 275

 Score = 56.6 bits (135), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 75/165 (45%), Gaps = 31/165 (18%)

Query: 85  VSGIDDIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
           +S   + + ++R+D+ E  Y+VTG  +  IY+ DC F+ P                    
Sbjct: 128 MSSAAEALEVIRNDFLERRYYVTGRLSKAIYSNDCFFDAPD------------------- 168

Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
              P + ++ + K+V   S +   +   +  L   T +     L  + L  +L+      
Sbjct: 169 ---PDMPVRGLRKYVDATSHLFEPKSSSIDLLSIET-VDDSAVLARWRLEATLM------ 218

Query: 204 LPWKPLI-SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           LPW+P + +  G T+YELN+E  ++RH E W+++A +A      P
Sbjct: 219 LPWRPSVKAYTGVTLYELNEEGLVSRHTELWSITAADAFISTLFP 263


>gi|384245965|gb|EIE19457.1| hypothetical protein COCSUDRAFT_31253 [Coccomyxa subellipsoidea
           C-169]
          Length = 386

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 63/163 (38%), Gaps = 33/163 (20%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           D V  +R DY+  YFV+G      Y  DC+F DP   F G E + +N+  L    E   I
Sbjct: 102 DAVAAIRKDYDCNYFVSGRGEMSAYESDCLFADPFAGFNGVERFKKNVSNLGSLME--DI 159

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L            M+  +E                  L    R S +    L LPW+P 
Sbjct: 160 KLD-----------MLDWKETEE--------------ALETKWRFSAV----LSLPWRPR 190

Query: 210 ISIDGST--VYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
           ++  G T  V+      ++ +H ESW+V     V  +  P  R
Sbjct: 191 LAASGGTTHVFSQVRGCQVIKHIESWDVEPAAVVRSLLKPSTR 233


>gi|298491076|ref|YP_003721253.1| hypothetical protein Aazo_2085 ['Nostoc azollae' 0708]
 gi|298232994|gb|ADI64130.1| Protein of unknown function DUF2358 ['Nostoc azollae' 0708]
          Length = 129

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 65/154 (42%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIV IL++DY+  + V   ++  IYA +  F+D   +FRG ELY   ++ +  FF    +
Sbjct: 2   DIVEILKADYQR-FPVNQTYS--IYAPEVYFQDAVFKFRGLELYKWMIKFIQTFFLNLKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L NIE+           +E  ++                              LPWKP 
Sbjct: 59  DLHNIEQ-----------QEDTIKS--------------------DWTLSWNSPLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           ISI G +   LN E  I  H + W+ S L  + Q
Sbjct: 88  ISISGWSELRLNSEGLIISHIDYWHSSRLNLLKQ 121


>gi|411120874|ref|ZP_11393246.1| hypothetical protein OsccyDRAFT_4880 [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410709543|gb|EKQ67058.1| hypothetical protein OsccyDRAFT_4880 [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 128

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/157 (30%), Positives = 65/157 (41%), Gaps = 34/157 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ +L++DY  A F     T EIYAED  F+DP   FRG + Y    R +V F +   I
Sbjct: 2   DILEVLQADY--ARFPQDQ-TYEIYAEDVYFKDPMTEFRGLDRY----RSMVKFIQTWFI 54

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
                                    +H     G  I       R          LPWKP 
Sbjct: 55  NCH--------------------MDVHALQQTGNQI-------RSDWTLTWNTPLPWKPK 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFT 246
           I+I G +   LND+  IT H + W+ S L+ + Q F+
Sbjct: 88  ITISGWSELTLNDKGFITSHIDYWHCSRLDVLKQHFS 124


>gi|428203524|ref|YP_007082113.1| hypothetical protein Ple7327_3338 [Pleurocapsa sp. PCC 7327]
 gi|427980956|gb|AFY78556.1| hypothetical protein Ple7327_3338 [Pleurocapsa sp. PCC 7327]
          Length = 126

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 67/158 (42%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T  IYAE+  F+DP   FRG   Y   +  +  +F+   +
Sbjct: 2   DIIEILKDDYQR---FPAHQTYSIYAENVYFKDPLNEFRGIARYKEMIGFMSNWFQDIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I            RREG    +H    L    +  P              +PWKP 
Sbjct: 59  ELHDI------------RREG--DTIHTEWTLN---WTTP--------------IPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           I+I G +  +L+++  I  H + W+ S L+ + Q   P
Sbjct: 88  ITIPGRSELKLDEQEMIVSHVDYWHCSRLDVIKQHLFP 125


>gi|145346535|ref|XP_001417742.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577970|gb|ABO96035.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 99

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 31/130 (23%)

Query: 114 YAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMR 173
           YA DC F DP + FRG + +  N+               N+   +  V L +   E    
Sbjct: 1   YAGDCEFADPFVSFRGLDRFRANV--------------ANLGGMMREVDLQIKSFEETED 46

Query: 174 GLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESW 233
           G+                 R S +    L LPW+P+++  G T + LNDE ++ RH E W
Sbjct: 47  GVQTE-------------WRFSCV----LDLPWRPMLAASGGTTHVLNDENRVVRHYERW 89

Query: 234 NVSALEAVGQ 243
           +V   + +GQ
Sbjct: 90  DVDPKKVLGQ 99


>gi|219109561|ref|XP_002176535.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411070|gb|EEC50998.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 332

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/170 (22%), Positives = 75/170 (44%), Gaps = 33/170 (19%)

Query: 89  DDIVTILRSDYE----NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF 144
           +D++T+L++D++      Y++TG   + +Y +DC+F+ P                     
Sbjct: 176 EDLMTVLQADWQTDTNKGYYITGRLNTAVYRDDCLFDGPD-------------------- 215

Query: 145 EYPSIGLQNIEKFVFLVSLMVGRREGRMRGL------HNSTALGGHIFLLPYLLRISLIC 198
             P + ++ + K++   S +   R  R   L       +   +   +  LP  +      
Sbjct: 216 --PDMPVRGLRKYLNAASQLFDPRTSRATLLSLEEEKEDGVLVTNDVDKLPTRIVAKWEM 273

Query: 199 RTYLKLPWKP-LISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           R  L+LPWKP L +  G+T Y  + +  I +H E+W++S  EA  +   P
Sbjct: 274 RGILRLPWKPSLPTWTGTTTYVRDADGLIYQHLETWDMSVGEAFLRTLAP 323


>gi|428176663|gb|EKX45546.1| hypothetical protein GUITHDRAFT_71148 [Guillardia theta CCMP2712]
          Length = 122

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 31/136 (22%)

Query: 101 NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFL 160
           + Y+VTG  T  IY +DC F+DPT   RG                        +++++  
Sbjct: 13  SQYYVTGNLTRSIYRDDCRFKDPTTDVRG------------------------LKRYILA 48

Query: 161 VSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS-IDGSTVYE 219
           V  +    +     L++ T +  +  L  + L+ S      LKLPWKP I   DG+T Y 
Sbjct: 49  VQNLFADPKDSKVTLYSVTPIEDNKILAKWSLQGS------LKLPWKPYIEPYDGTTTYW 102

Query: 220 LNDELKITRHAESWNV 235
           +++E  I  H E+W V
Sbjct: 103 IDNEGLIEFHDETWLV 118


>gi|443324513|ref|ZP_21053261.1| hypothetical protein Xen7305DRAFT_00049690 [Xenococcus sp. PCC
           7305]
 gi|442795887|gb|ELS05226.1| hypothetical protein Xen7305DRAFT_00049690 [Xenococcus sp. PCC
           7305]
          Length = 134

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           D+V +++ DY+         T  IYAED  FEDP  +F+G   Y + +  L  FF   +I
Sbjct: 9   DLVAVIKQDYQQ---FPKDQTYSIYAEDVYFEDPLNKFQGIGRYQKMIGFLGRFFG--NI 63

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L+                      LHN T           +++     +    LPWKPL
Sbjct: 64  DLE----------------------LHNITQDKN-------IIKTEWTLKMTSPLPWKPL 94

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           + I G +  E+N    I  H + WN++ ++ + Q F
Sbjct: 95  LVIPGWSELEINQNNLIVAHRDYWNIAPIKVLLQNF 130


>gi|147832280|emb|CAN73278.1| hypothetical protein VITISV_040607 [Vitis vinifera]
          Length = 215

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 81/211 (38%), Gaps = 42/211 (19%)

Query: 38  KNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRS 97
           KN N +T   ++L    S  +EL   FS     R+ K+  +    +   G  D   ++ +
Sbjct: 35  KNPNPRTRCSRLLGTGRS-CSELPLRFSGHR--RVSKLPKELLTSLETVGAVDRAALVET 91

Query: 98  DYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKF 157
             E+       F    Y +DC F DP   FRG   + RN        +  ++ L   E F
Sbjct: 92  IKED-------FQRSAYEDDCEFADPAGSFRGLRRFKRNCTNFGSLIQKSNMKLMKWEDF 144

Query: 158 VFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTV 217
                          +G+       GH        R S +    L  PWKP++S  G T 
Sbjct: 145 -------------EDKGI-------GH-------WRFSCV----LSFPWKPILSATGYTE 173

Query: 218 YELNDEL-KITRHAESWNVSALEAVGQIFTP 247
           Y  + +  K+ RH E WNV  +  + QI  P
Sbjct: 174 YYFDSQSGKVCRHVEHWNVPKMALLKQILRP 204


>gi|428319231|ref|YP_007117113.1| Protein of unknown function DUF2358 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428242911|gb|AFZ08697.1| Protein of unknown function DUF2358 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 130

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/162 (27%), Positives = 65/162 (40%), Gaps = 42/162 (25%)

Query: 90  DIVTILRSDY----ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
           DI+ IL+ DY    EN        T  IYA+D  FEDP  RF G + Y + +  +  FF+
Sbjct: 2   DIIEILKDDYQRFPENQ-------TYNIYAQDVYFEDPVNRFTGVDRYRKMIGFMGTFFQ 54

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
              I L                       LH  +  G  I             R    LP
Sbjct: 55  --DINLD----------------------LHGISQSGDTI-------DTRWTLRWIAPLP 83

Query: 206 WKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           WKP ++I G +  ++N +  I  H + W+ S L+ + Q+  P
Sbjct: 84  WKPKMAIAGRSELKVNSDGLIVSHIDYWDCSRLDVLKQLVFP 125


>gi|113473951|ref|YP_720012.1| hypothetical protein Tery_0014 [Trichodesmium erythraeum IMS101]
 gi|110164999|gb|ABG49539.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 130

 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 68/156 (43%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI++ L+ DY+         T  IYAE+  FEDP  +FRG E Y + +  +  +F    +
Sbjct: 2   DIISRLKQDYQK---FPADQTYSIYAENVYFEDPLNKFRGIERYQKMIGFMNNWFNDIKL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I +                          G++    + L  +        +PWKP 
Sbjct: 59  DLHDISQ-------------------------SGNVIKTRWTLSWT------APVPWKPP 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           ++I G +  +LN +  I  H + W++S L+ + Q+F
Sbjct: 88  MAIPGWSELKLNADGLINSHIDYWDISRLDVLKQLF 123


>gi|86609849|ref|YP_478611.1| hypothetical protein CYB_2413 [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86558391|gb|ABD03348.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 142

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 34/160 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           D+V ILR+DY  A F     T  IY     F DP   FRG + Y R +  +  + +   +
Sbjct: 12  DLVAILRADY--ARFPRDQ-TYAIYDPQVYFRDPLNEFRGLDRYRRMIEGIGRWLQDIRL 68

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            LQ+I +                         G  I       R   I    L LPW+P 
Sbjct: 69  DLQDIRQ------------------------TGNQI-------RTEWILSGALALPWRPR 97

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGD 249
           + I G T  ++N+   I  H + W  S  + + Q+F PGD
Sbjct: 98  LCIPGWTEMQINETGLIGSHVDYWRCSRRQVLLQMFRPGD 137


>gi|409990703|ref|ZP_11274042.1| hypothetical protein APPUASWS_07005 [Arthrospira platensis str.
           Paraca]
 gi|291570101|dbj|BAI92373.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938430|gb|EKN79755.1| hypothetical protein APPUASWS_07005 [Arthrospira platensis str.
           Paraca]
          Length = 134

 Score = 54.3 bits (129), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/156 (25%), Positives = 63/156 (40%), Gaps = 33/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           D++  ++ DY+N        T  IYAE+  F+DP   F G + Y + +  +  +F+   +
Sbjct: 5   DLIETIKQDYQN---FPDNQTYSIYAENVYFKDPVNEFSGCDRYQKMINFMATWFQDIQL 61

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I +                          G      + L  ++       LPW P 
Sbjct: 62  DLHDISE-------------------------SGDTIETQWTLSWTVSV-----LPWAPR 91

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           +SI G +   +N E  I RH + WN S L  V Q+F
Sbjct: 92  LSIPGYSHLHINPEGLIDRHIDYWNCSRLAVVKQLF 127


>gi|218248348|ref|YP_002373719.1| hypothetical protein PCC8801_3601 [Cyanothece sp. PCC 8801]
 gi|218168826|gb|ACK67563.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
          Length = 132

 Score = 53.9 bits (128), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           +I+ I++ DY+N  F T   T  IYAED  F+DP   FRG + Y   ++ +  +F+   +
Sbjct: 2   NILEIIKQDYQN--FPTNP-TYSIYAEDVYFKDPMTEFRGIKRYKSMIQFMATWFKEIKL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I +                  +H    L    ++ P              LPWKP 
Sbjct: 59  DLHKIYQ--------------SKDTIHAEWTLH---WITP--------------LPWKPP 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           I+I G +   +N +  I  H + WN S  + + Q F
Sbjct: 88  IAIAGRSELVINSQNLIISHIDYWNCSRWDVLKQHF 123


>gi|332711323|ref|ZP_08431255.1| uncharacterized conserved protein, DUF2358 [Moorea producens 3L]
 gi|332349872|gb|EGJ29480.1| uncharacterized conserved protein, DUF2358 [Moorea producens 3L]
          Length = 146

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 63/155 (40%), Gaps = 34/155 (21%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           I  I++ DY+         T  IYA D  F+DP  +FRG E Y + +  +  +F  P + 
Sbjct: 21  IAQIIQQDYQR---FPADQTYSIYASDVYFKDPLNQFRGLERYKQMIAFINRWFLEPHLD 77

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           LQNI +                          G      + L  +        LPWKP +
Sbjct: 78  LQNISQ-------------------------SGDTIKTRWTLSWT------TPLPWKPRM 106

Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           +I G +  +LN E  I  H + W+ S L+ + Q+ 
Sbjct: 107 TIPGWSELKLNAEGLIVSHIDYWDCSRLDVLKQLL 141


>gi|170077270|ref|YP_001733908.1| hypothetical protein SYNPCC7002_A0647 [Synechococcus sp. PCC 7002]
 gi|169884939|gb|ACA98652.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
          Length = 127

 Score = 53.9 bits (128), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 39/159 (24%), Positives = 68/159 (42%), Gaps = 34/159 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           D++ ILR DY+         T  IY++D  F+DP  +FRG + Y + +  L  +F    +
Sbjct: 2   DLLDILRQDYQK---FPQNQTFAIYSDDVFFQDPLNQFRGRDRYQKMIGFLGRWFRDIHL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L ++++     +    R E  +                      S+ C     LPW+P 
Sbjct: 59  ELHDLQQ-----TQQTIRSEWTL----------------------SMTC----PLPWQPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
           + I G ++ E+N +  I  H + W    L+ + Q+F P 
Sbjct: 88  LRISGHSLLEINADNLIVSHIDYWQKPPLKILLQVFQPS 126


>gi|334117385|ref|ZP_08491476.1| Protein of unknown function DUF2358 [Microcoleus vaginatus FGP-2]
 gi|333460494|gb|EGK89102.1| Protein of unknown function DUF2358 [Microcoleus vaginatus FGP-2]
          Length = 130

 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 42/158 (26%), Positives = 66/158 (41%), Gaps = 34/158 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T  IYA+D  FEDP  RF G + Y + +  +  FF+  ++
Sbjct: 2   DIIEILKDDYQR---FPDNQTYSIYAKDVYFEDPVNRFTGVDRYRKMIGFMGTFFQDVNL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I +                          G      + L  S I      LPWKP 
Sbjct: 59  DLHGISQ-------------------------SGDTIETRWTL--SWIA----PLPWKPK 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           ++I G +  ++N +  I  H + W+ S L+ + Q+  P
Sbjct: 88  MAIAGRSELKVNSDGLIVSHIDYWDCSRLDVLKQLVFP 125


>gi|427716868|ref|YP_007064862.1| hypothetical protein Cal7507_1569 [Calothrix sp. PCC 7507]
 gi|427349304|gb|AFY32028.1| Protein of unknown function DUF2358 [Calothrix sp. PCC 7507]
          Length = 133

 Score = 53.5 bits (127), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 62/159 (38%), Gaps = 34/159 (21%)

Query: 89  DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPS 148
           ++IV IL+ DY+         T  IYA D  F+DP  +FRG E Y + +  +  +F  P 
Sbjct: 3   NNIVEILKEDYQRFPINQ---TYSIYATDVYFQDPLNKFRGVERYKQTINFIQTWFLNPK 59

Query: 149 IGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
           + L +I+                         LG  I       +          LPWKP
Sbjct: 60  MDLHDIQ------------------------LLGDTI-------KTEWTLSWNTPLPWKP 88

Query: 209 LISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
            ISI G +   LN    I  H + W+ S  + + Q   P
Sbjct: 89  RISISGWSELRLNALGLIVSHIDYWHCSRFDVLKQHLFP 127


>gi|303290268|ref|XP_003064421.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454019|gb|EEH51326.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 299

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 63/159 (39%), Gaps = 17/159 (10%)

Query: 94  ILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQN 153
           I R   E  Y  TG    E+Y   C F DPT+ F G + +  NL+ L P      +   N
Sbjct: 150 IAREFTEAKYLWTGDINPEMYDLFCTFTDPTLSFAGLQTFETNLKNLQPVLRR-LVRNSN 208

Query: 154 IEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISID 213
           +E +   +               +  + GG        +R S      L LPW+P I + 
Sbjct: 209 VELYSCEIV-----------DDGSGGSGGGSGGGGGGSVRASWRMTGDLNLPWRPRIDLT 257

Query: 214 GSTVYELNDE-----LKITRHAESWNVSALEAVGQIFTP 247
           G T +   D        I  + E W +SAL+AV Q+  P
Sbjct: 258 GQTTFGFKDAGVERGCLIVSYLEEWGISALDAVAQLVAP 296


>gi|443309241|ref|ZP_21038980.1| hypothetical protein Syn7509DRAFT_00046060 [Synechocystis sp. PCC
           7509]
 gi|442780710|gb|ELR90864.1| hypothetical protein Syn7509DRAFT_00046060 [Synechocystis sp. PCC
           7509]
          Length = 128

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 64/156 (41%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIV IL  DY   + V   +   +YA+D  F+DP  +FRG   Y   +  +  FF  P +
Sbjct: 2   DIVEILVDDYRR-FPVDQNYN--LYAQDVFFQDPLNKFRGINKYKAMIGFINTFFIEPKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I +                        +G  I       ++  +      LPWKP 
Sbjct: 59  DLLAINQ------------------------VGDTI-------KMEWVLSWNTPLPWKPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           I+I G +  ++N +  I  H + WN S L+ + Q F
Sbjct: 88  IAIPGQSELKINADGLIASHVDYWNCSRLDVIRQHF 123


>gi|75910467|ref|YP_324763.1| hypothetical protein Ava_4269 [Anabaena variabilis ATCC 29413]
 gi|75704192|gb|ABA23868.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 128

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 66/155 (42%), Gaps = 34/155 (21%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           I+ IL+ DY+  + V   ++  IYAED  F+DP  +FRG   Y + +  +  +F    + 
Sbjct: 3   IIEILKQDYQR-FPVNQTYS--IYAEDVYFQDPLNKFRGITRYKQMINFMQTWFLNIKMD 59

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           L +I+             E +++                     +L   T   LPWKP I
Sbjct: 60  LHDIQHL-----------EDKIKT------------------EWTLSWNT--PLPWKPRI 88

Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           SI G +   LN E  I  H + W  S L+ + Q F
Sbjct: 89  SISGWSELGLNCEGLIVSHIDYWQCSPLDVIKQHF 123


>gi|308800990|ref|XP_003075276.1| unnamed protein product [Ostreococcus tauri]
 gi|116061830|emb|CAL52548.1| unnamed protein product [Ostreococcus tauri]
          Length = 228

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 68/162 (41%), Gaps = 33/162 (20%)

Query: 93  TILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
           T+ R     AYFVTG  T   +AEDC F DPT   +G   Y   L  L   F+     ++
Sbjct: 91  TLARDVGVEAYFVTGRLTEAAFAEDCRFVDPTTDVKGLRRY---LDALGALFDPQRSSVE 147

Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS- 211
                      ++G  E R       TA GG      Y L        YL+LPW+P +  
Sbjct: 148 -----------LLGELEAR------ETADGGGEIRGTYAL------EGYLRLPWRPKVPR 184

Query: 212 IDGSTVYEL--NDELK----ITRHAESWNVSALEAVGQIFTP 247
            +G   + +  ND       I    ++W+VS  +A+ + FTP
Sbjct: 185 YEGRITWTVAPNDGGANAGLIVEQRQTWSVSGFDALRETFTP 226


>gi|37522054|ref|NP_925431.1| hypothetical protein glr2485 [Gloeobacter violaceus PCC 7421]
 gi|35213053|dbj|BAC90426.1| glr2485 [Gloeobacter violaceus PCC 7421]
          Length = 141

 Score = 52.0 bits (123), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL++DY+         T  IYA D  F+DP   FRG E Y R +  +  +F+   +
Sbjct: 6   DILEILKADYQR---FPEDQTYAIYANDVYFKDPLNSFRGIERYRRMIGWMHRWFQPIRL 62

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +IE           R++ R                    +  +        LPWKP 
Sbjct: 63  ELHSIE-----------RQQTR--------------------IVTTWTLSWQAPLPWKPH 91

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I IDG +  +L+ + +I  H + W  S  +   Q
Sbjct: 92  IVIDGWSELDLDTQGQIAAHTDHWRCSPNDVAWQ 125


>gi|254413402|ref|ZP_05027173.1| hypothetical protein MC7420_889 [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196180022|gb|EDX75015.1| hypothetical protein MC7420_889 [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 127

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/156 (27%), Positives = 67/156 (42%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+  + V   ++  IYA    F+DP   FRG   Y + +  +  +F  P +
Sbjct: 5   DIIEILKQDYQR-FPVDQTYS--IYANTVFFKDPLNEFRGLNRYKQMIGFIQTWFINPQL 61

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L                        H+ +  G  I       R +L   T   LPWKP 
Sbjct: 62  DL------------------------HDISQSGDTI-----KTRWTLSWTT--PLPWKPR 90

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           ISI G +   LN +  I+ H + W++  L+ + Q+F
Sbjct: 91  ISIPGWSELRLNADGLISSHIDYWDIPRLDVLKQLF 126


>gi|56750302|ref|YP_171003.1| hypothetical protein syc0293_c [Synechococcus elongatus PCC 6301]
 gi|56685261|dbj|BAD78483.1| hypothetical protein [Synechococcus elongatus PCC 6301]
          Length = 137

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 39/160 (24%), Positives = 65/160 (40%), Gaps = 34/160 (21%)

Query: 84  LVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
           +   +DD++  LR DY  A F     + E+Y  D  F+DP  RF+G + Y + +  +  +
Sbjct: 1   MAIALDDLLETLRQDY--ARFPKDQ-SFEVYDPDAFFQDPLTRFQGRDRYQKMIAFIDRW 57

Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
           F  P + L +I                   G+ +   L    +  P+             
Sbjct: 58  FLDPELTLHDI--------------HATETGIESRWTL---TWTSPW------------- 87

Query: 204 LPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
            PW+P   I G T++EL  E  I+ H + W+ S    + Q
Sbjct: 88  -PWRPRSQISGRTLFELTAEGTISSHRDYWDCSPWAVLRQ 126


>gi|428311429|ref|YP_007122406.1| hypothetical protein Mic7113_3262 [Microcoleus sp. PCC 7113]
 gi|428253041|gb|AFZ19000.1| hypothetical protein Mic7113_3262 [Microcoleus sp. PCC 7113]
          Length = 129

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 67/162 (41%), Gaps = 34/162 (20%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T  IYA D  F+DP  +F+G E Y + +  +  +F  P +
Sbjct: 2   DILQILKEDYQR---FPADQTYSIYAPDVFFKDPLNQFQGIERYKQMIGFINTWFGAPKL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I            R E  ++                   R +L   T   LPW+P 
Sbjct: 59  DLHEIH-----------RSEDTIKT------------------RWTLSWTT--PLPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDRN 251
           ISI G +  +LN +  I  H + W+ S  + + Q     ++N
Sbjct: 88  ISIPGWSELKLNVDELIISHIDYWDCSRWDVLKQHLPFQEKN 129


>gi|218439252|ref|YP_002377581.1| hypothetical protein PCC7424_2291 [Cyanothece sp. PCC 7424]
 gi|218171980|gb|ACK70713.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
          Length = 125

 Score = 51.2 bits (121), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 64/154 (41%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+  + V    T EIYA +  F+DP   FRG + Y   +  +  +F+   +
Sbjct: 2   DIIEILKDDYQR-FPVNQ--TYEIYAPNVYFKDPVNEFRGIKRYQEMITFMGTWFKEIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I +   ++             LH +T                        +PW+P 
Sbjct: 59  DLHDIHQKENIIYT--------QWTLHWTTP-----------------------VPWQPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           ISI G +   L+D   I  H + WN S  + V Q
Sbjct: 88  ISIPGRSELTLDDNHLIVSHVDYWNCSRWDVVKQ 121


>gi|257060326|ref|YP_003138214.1| hypothetical protein Cyan8802_2513 [Cyanothece sp. PCC 8802]
 gi|256590492|gb|ACV01379.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
          Length = 132

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 66/156 (42%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           +I+ I++ DY+N + V   ++  IYAED  F+DP   FRG + Y   +  +  +F+   +
Sbjct: 2   NILEIIQQDYQN-FPVNPTYS--IYAEDVYFKDPMTEFRGIKRYKSMIEFMATWFKAIKL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I +                  +H    L    ++ P              LPWKP 
Sbjct: 59  DLHKIYQ--------------SQDTIHAEWTLH---WITP--------------LPWKPP 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           I+I G +   +N +  I  H + WN S  + + Q F
Sbjct: 88  IAIAGRSELVINAQNLIISHIDYWNCSRWDVLKQHF 123


>gi|81300067|ref|YP_400275.1| hypothetical protein Synpcc7942_1258 [Synechococcus elongatus PCC
           7942]
 gi|81168948|gb|ABB57288.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 137

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/160 (24%), Positives = 65/160 (40%), Gaps = 34/160 (21%)

Query: 84  LVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
           +   +DD++  LR DY  A F     + E+Y  D  F+DP  RF+G + Y + +  +  +
Sbjct: 1   MAIALDDLLETLRQDY--ARFPKDQ-SFEVYDPDVFFQDPLTRFQGRDRYQKMIAFIDRW 57

Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
           F  P + L +I                   G+ +   L    +  P+             
Sbjct: 58  FLDPELTLHDI--------------HATETGIESRWTL---TWTSPW------------- 87

Query: 204 LPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
            PW+P   I G T++EL  E  I+ H + W+ S    + Q
Sbjct: 88  -PWRPRSQISGRTLFELTAEGTISSHRDYWDCSPWAVLRQ 126


>gi|219123757|ref|XP_002182185.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406146|gb|EEC46086.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 457

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 76/169 (44%), Gaps = 33/169 (19%)

Query: 88  IDDIVTILRSDYEN------AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
           ID +   L +D  N      +Y VTG     I+ +DC+F+DPT R      Y + L +L 
Sbjct: 114 IDALAQKLTNDLANGATGRGSYIVTGDLNPRIFRDDCVFQDPTNRVASLRQYRQALTIL- 172

Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
             F+     ++ +E    +VS      +G +R                  LR     R Y
Sbjct: 173 --FDPERSVVEVLEPVRRIVS---DTDDGSIR------------------LRGRFRQRGY 209

Query: 202 LK-LPWKPLISIDGSTV-YELN-DELKITRHAESWNVSALEAVGQIFTP 247
           L+ LPW+P ++   ST+ Y ++ D  +I   +++W  SA  A+ + FTP
Sbjct: 210 LQFLPWQPYVTAYESTITYRIDPDTGRIAEQSQTWTKSANRALRESFTP 258


>gi|209527250|ref|ZP_03275761.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|376003800|ref|ZP_09781603.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423066117|ref|ZP_17054907.1| hypothetical protein SPLC1_S410140 [Arthrospira platensis C1]
 gi|209492317|gb|EDZ92661.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|375327831|emb|CCE17356.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406712159|gb|EKD07348.1| hypothetical protein SPLC1_S410140 [Arthrospira platensis C1]
          Length = 134

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 37/155 (23%), Positives = 62/155 (40%), Gaps = 33/155 (21%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           ++  ++ DY+N        T  IYA++  F+DP   F G + Y + +  +  +F+   + 
Sbjct: 6   LIETIKQDYQN---FPDNQTYSIYADNVYFKDPVNEFSGCDRYQKMINFMATWFQDIQLD 62

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           L +I +                          G      + L  ++       LPW P +
Sbjct: 63  LHDISE-------------------------SGDTIETQWTLSWTVSV-----LPWAPRL 92

Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           SI G +   +N E  I RH + WN S L  V Q+F
Sbjct: 93  SIPGYSHLHINPEGLIDRHIDYWNCSRLAVVKQLF 127


>gi|440753389|ref|ZP_20932592.1| hypothetical protein O53_1767 [Microcystis aeruginosa TAIHU98]
 gi|440177882|gb|ELP57155.1| hypothetical protein O53_1767 [Microcystis aeruginosa TAIHU98]
          Length = 128

 Score = 50.8 bits (120), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 64/154 (41%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I GS+   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGSSQLTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|427711855|ref|YP_007060479.1| hypothetical protein Syn6312_0717 [Synechococcus sp. PCC 6312]
 gi|427375984|gb|AFY59936.1| hypothetical protein Syn6312_0717 [Synechococcus sp. PCC 6312]
          Length = 125

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/156 (23%), Positives = 60/156 (38%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           +++ +++ DY          + EIYA D  F+DP   FRG + Y + +  +  +F    +
Sbjct: 2   NLIEVVKQDYSR---FPQAQSYEIYASDVFFKDPVYEFRGIDKYQKMIGFITYWFSNLKL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I                     H   A+          +           LPW+P 
Sbjct: 59  ELHDI---------------------HQKDAI----------IHTQWTMSWNAPLPWQPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           IS+ G +  E+N E  I  H + W  S L+ V Q F
Sbjct: 88  ISVTGRSELEVNAEGLIISHIDYWQCSRLDVVKQHF 123


>gi|428224997|ref|YP_007109094.1| hypothetical protein GEI7407_1551 [Geitlerinema sp. PCC 7407]
 gi|427984898|gb|AFY66042.1| Protein of unknown function DUF2358 [Geitlerinema sp. PCC 7407]
          Length = 134

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 65/154 (42%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIVT+LR DY++          E+Y     F+DP   FRG + + + + L+  +F    +
Sbjct: 8   DIVTLLREDYQH---FPRDQRYELYDPGVYFKDPLSEFRGIKRFRQMVGLMERWFLNARM 64

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I++   L++                              R +L   T   LPWKP 
Sbjct: 65  DLHDIQQRDRLITT-----------------------------RWTLSWNT--PLPWKPR 93

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   LNDE  I  H + W+ S  + V Q
Sbjct: 94  IAISGRSELTLNDEGLIVSHIDWWDCSIWDVVQQ 127


>gi|116309685|emb|CAH66733.1| H0404F02.9 [Oryza sativa Indica Group]
          Length = 247

 Score = 50.8 bits (120), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 44/191 (23%), Positives = 72/191 (37%), Gaps = 37/191 (19%)

Query: 33  VNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRL-----DKVSYKQKYEILVSG 87
            + G  N  + ++   +L+ A  G +ELL + +SF +        D     ++      G
Sbjct: 44  ASPGAGNETSSSSENAVLRAAWYG-SELLGIAASFFRPSQPPTEGDGAGAVEEAASEPQG 102

Query: 88  IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
              +   ++ D+  +YFVTG  T + Y EDC F DP   F G + + RN        E  
Sbjct: 103 RAQVAEAVKDDFARSYFVTGNLTLKAYEEDCEFADPAGSFNGLQRFKRNCTNFGSLLEKA 162

Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
           ++ L   E    L    +G                          R S +    +  PW+
Sbjct: 163 NMKLTKWED---LEDKSIGH------------------------WRFSCV----MSFPWR 191

Query: 208 PLISIDGSTVY 218
           P++S  G T Y
Sbjct: 192 PILSATGYTEY 202


>gi|427419623|ref|ZP_18909806.1| hypothetical protein Lepto7375DRAFT_5471 [Leptolyngbya sp. PCC
           7375]
 gi|425762336|gb|EKV03189.1| hypothetical protein Lepto7375DRAFT_5471 [Leptolyngbya sp. PCC
           7375]
          Length = 126

 Score = 50.4 bits (119), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 30/136 (22%)

Query: 112 EIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGR 171
           +IYA+D  F+DP   F+G E Y R +  +  +F       +NIE                
Sbjct: 21  QIYAKDVYFKDPMSEFQGIERYQRMIGFIAQWF-------RNIE---------------- 57

Query: 172 MRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAE 231
              LH       +  +  + L  +         PW+P ISI G +  ++N E  I  H +
Sbjct: 58  -LELHRVEQPQANTVITRWTLHFTA------PTPWQPRISIPGWSELQINTEGLICAHID 110

Query: 232 SWNVSALEAVGQIFTP 247
            WN S  +   Q+F P
Sbjct: 111 YWNCSRWDVFKQLFAP 126


>gi|425469266|ref|ZP_18848216.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389882034|emb|CCI37462.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 128

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA+D  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADDVYFQDPLNQFRGIKRYREMIGFMGQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSQLTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|219130148|ref|XP_002185234.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403413|gb|EEC43366.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 286

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 27/172 (15%)

Query: 90  DIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE--- 145
           D++  +R+DY +  Y  TG      +   C F DPT+ F GT  + RN+R L P  +   
Sbjct: 118 DLLERIRADYVDKNYLWTGDIDLACFDPHCRFTDPTLSFEGTATFVRNIRNLRPIVDALV 177

Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
            P  G    E       L + +R+     +++S   G       Y+     +     +LP
Sbjct: 178 VPPDGHCRSEL------LEIEQRDA--VDVNDSDGQG-------YIETRWNMVGVLNQLP 222

Query: 206 WKPLISIDGSTVY--------ELNDELKITRHAESWNVSALEAVGQIFTPGD 249
           W+P I + G T +          +  L + R+ E W + A  A+ Q+ TP D
Sbjct: 223 WRPKIDVIGRTQFWYRPCRDNRDDAALHVYRYDEEWEIPAGRALLQLVTPAD 274


>gi|428181968|gb|EKX50830.1| hypothetical protein GUITHDRAFT_103420 [Guillardia theta CCMP2712]
          Length = 139

 Score = 50.1 bits (118), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 56/125 (44%), Gaps = 28/125 (22%)

Query: 87  GIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
           G +D++  L+ +YE  YF++G     +Y  DC+F DP   FRG E + RNL         
Sbjct: 38  GNEDLLQRLQLEYEKNYFLSGDIDVSLYHPDCLFADPFASFRGRERFRRNLS-------- 89

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
            ++GL  +     L+SL     +                   P + R  ++ +  L LPW
Sbjct: 90  -NLGLFVLSYRCRLLSLAQDPVD-------------------PNVFRSCVMVKLQLNLPW 129

Query: 207 KPLIS 211
           +P+++
Sbjct: 130 RPVLA 134


>gi|434398676|ref|YP_007132680.1| Protein of unknown function DUF2358 [Stanieria cyanosphaera PCC
           7437]
 gi|428269773|gb|AFZ35714.1| Protein of unknown function DUF2358 [Stanieria cyanosphaera PCC
           7437]
          Length = 131

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 60/154 (38%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           +I+ I++ DY+N        T  IYA+D  F+DP   FRG + Y   +  L  FF   ++
Sbjct: 2   EIIEIIKQDYQN---FPKNQTYSIYADDVYFKDPLNEFRGIQRYQTMIGFLGNFFREINL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L +I                                L    L+          LPW+P 
Sbjct: 59  DLHDIS-------------------------------LQENCLKTEWTLHLTSPLPWQPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           +SI G +  +LN    I  H + W++S  E + Q
Sbjct: 88  LSIPGWSELKLNQNGLIIAHIDYWHISPWEVLKQ 121


>gi|220906651|ref|YP_002481962.1| hypothetical protein Cyan7425_1222 [Cyanothece sp. PCC 7425]
 gi|219863262|gb|ACL43601.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
          Length = 125

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 62/156 (39%), Gaps = 34/156 (21%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DIV+ILR DY+         T EIYA D  F DP  +FRG + Y                
Sbjct: 2   DIVSILRQDYQR---FPQDQTYEIYAADVYFRDPLTQFRGIKRY---------------- 42

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
             QN+ +F+   +  +  R      LH        I     L  I+        LPW P 
Sbjct: 43  --QNMIQFI--QTWFINTR----MELHEIQQQEQQITTRWTLSWIA-------PLPWHPQ 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           +SI G +   LN    I  H + W+ S  E + Q+F
Sbjct: 88  LSISGRSELTLNQAGLIVSHLDYWDCSRWEVLQQLF 123


>gi|3702337|gb|AAC62894.1| unknown protein [Arabidopsis thaliana]
          Length = 286

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 34/67 (50%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           +V  ++ D++ +YFVTG  T E+Y E C F DP   F+G   + RN        E  ++ 
Sbjct: 105 VVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLARFKRNCTNFGSLIEKSNMK 164

Query: 151 LQNIEKF 157
           L   E F
Sbjct: 165 LMKWENF 171


>gi|428775383|ref|YP_007167170.1| hypothetical protein PCC7418_0737 [Halothece sp. PCC 7418]
 gi|428689662|gb|AFZ42956.1| Protein of unknown function DUF2358 [Halothece sp. PCC 7418]
          Length = 125

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/135 (25%), Positives = 56/135 (41%), Gaps = 31/135 (22%)

Query: 112 EIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGR 171
           +IYAED  F+DP   FRG + Y   +  +  +F+  ++ L +I            R+E  
Sbjct: 21  DIYAEDVYFKDPLTEFRGLQRYQSLIDFISTWFQDINLELHDI------------RQEDN 68

Query: 172 MRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAE 231
                               ++          LPW+P ISI G +  ++N + KI  H +
Sbjct: 69  T-------------------IQTEWTLNWTTPLPWQPRISIPGWSELKVNADHKIISHID 109

Query: 232 SWNVSALEAVGQIFT 246
            W+ S  E + Q F+
Sbjct: 110 YWHCSRWEVIKQHFS 124


>gi|17230193|ref|NP_486741.1| hypothetical protein all2701 [Nostoc sp. PCC 7120]
 gi|17131794|dbj|BAB74400.1| all2701 [Nostoc sp. PCC 7120]
          Length = 128

 Score = 49.3 bits (116), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/155 (25%), Positives = 65/155 (41%), Gaps = 34/155 (21%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           I+ IL+ DY+  + V   ++  IYAED  F+DP  +FRG   Y + +  +  +F    + 
Sbjct: 3   IIEILKQDYQR-FPVNQTYS--IYAEDVYFQDPLNKFRGITRYKQMINFMQTWFLNIKMD 59

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           L +I+             E +++                     +L   T   +PWKP I
Sbjct: 60  LHDIQHL-----------EDKIKT------------------EWTLSWNT--PVPWKPRI 88

Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           SI G +   LN +  I  H + W  S  + + Q F
Sbjct: 89  SISGWSELGLNSKGLIVSHIDYWQCSPFDVIKQHF 123


>gi|425463533|ref|ZP_18842863.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|389831546|emb|CCI25628.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 128

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQKFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSHWDVLRQ 121


>gi|186508518|ref|NP_001118532.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
           thaliana]
 gi|330255550|gb|AEC10644.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
           thaliana]
          Length = 229

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 34/67 (50%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           +V  ++ D++ +YFVTG  T E+Y E C F DP   F+G   + RN        E  ++ 
Sbjct: 105 VVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLARFKRNCTNFGSLIEKSNMK 164

Query: 151 LQNIEKF 157
           L   E F
Sbjct: 165 LMKWENF 171


>gi|67923125|ref|ZP_00516615.1| hypothetical protein CwatDRAFT_3436 [Crocosphaera watsonii WH 8501]
 gi|416395382|ref|ZP_11686323.1| hypothetical protein CWATWH0003_3119 [Crocosphaera watsonii WH
           0003]
 gi|67855023|gb|EAM50292.1| hypothetical protein CwatDRAFT_3436 [Crocosphaera watsonii WH 8501]
 gi|357263116|gb|EHJ12162.1| hypothetical protein CWATWH0003_3119 [Crocosphaera watsonii WH
           0003]
          Length = 128

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 55/139 (39%), Gaps = 31/139 (22%)

Query: 110 TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
           T  IYA++  F+DP   F G + Y   ++ +  +F+   + L NI +   ++        
Sbjct: 19  TYSIYADNVYFKDPMTEFTGVKRYQEMIQFMSTWFKNIDLELHNIYQSDNII-------- 70

Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
                LH +T L                       PWKP ISI G +    N++  I  H
Sbjct: 71  HTQWTLHWTTPL-----------------------PWKPPISIPGRSEMTFNEQNLIISH 107

Query: 230 AESWNVSALEAVGQIFTPG 248
            + WN S  + + Q F P 
Sbjct: 108 IDYWNCSRWDVLKQHFVPN 126


>gi|422301888|ref|ZP_16389253.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389789025|emb|CCI14979.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 128

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQKFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|390440334|ref|ZP_10228673.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|389836261|emb|CCI32799.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 128

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 66/154 (42%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA+D  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADDVYFQDPLNQFRGIKRYREMIGFMGQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+                +G   +T    H ++ P              LPW+P 
Sbjct: 59  DVHAIEQ----------------QGNIINTRWTLH-WITP--------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|126660704|ref|ZP_01731803.1| hypothetical protein CY0110_09390 [Cyanothece sp. CCY0110]
 gi|126618002|gb|EAZ88772.1| hypothetical protein CY0110_09390 [Cyanothece sp. CCY0110]
          Length = 128

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 55/136 (40%), Gaps = 31/136 (22%)

Query: 110 TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
           T  +YA++  F+DP   F G + Y   ++ +  +F+   + L NI +     S  +   E
Sbjct: 19  TYSVYADNVYFKDPVNEFTGVKRYQEMIQFMSTWFKNVDLELHNIYQ-----SNNIIHTE 73

Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
                LH +T L                       PWKP ISI G +    ND+  I  H
Sbjct: 74  W---TLHWTTPL-----------------------PWKPPISIPGRSELTFNDQNLIISH 107

Query: 230 AESWNVSALEAVGQIF 245
            + WN S  + V Q F
Sbjct: 108 VDYWNCSRWDVVKQHF 123


>gi|425440423|ref|ZP_18820725.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389719126|emb|CCH96985.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 128

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/154 (24%), Positives = 65/154 (42%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPIDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+   +++            LH +T                        LPW+P 
Sbjct: 59  DIHAIEQQENIINTRW--------TLHWTTP-----------------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSQLTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|425436057|ref|ZP_18816498.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389679290|emb|CCH91903.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
          Length = 128

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|425451700|ref|ZP_18831520.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|389766861|emb|CCI07601.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
          Length = 128

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|219362933|ref|NP_001136788.1| uncharacterized protein LOC100216931 [Zea mays]
 gi|194697096|gb|ACF82632.1| unknown [Zea mays]
 gi|414585787|tpg|DAA36358.1| TPA: hypothetical protein ZEAMMB73_852215 [Zea mays]
          Length = 206

 Score = 48.5 bits (114), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 31/65 (47%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           +   +++D+  +YFVTG  T   Y EDC F DP   FRG   + RN        E  ++ 
Sbjct: 124 VAEAIKADFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLLEKSNMK 183

Query: 151 LQNIE 155
           L   E
Sbjct: 184 LTKWE 188


>gi|425447601|ref|ZP_18827586.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389731776|emb|CCI04190.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
          Length = 128

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|425459254|ref|ZP_18838740.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|389823076|emb|CCI29008.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
          Length = 128

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVMRQ 121


>gi|411116636|ref|ZP_11389123.1| hypothetical protein OsccyDRAFT_0516 [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410712739|gb|EKQ70240.1| hypothetical protein OsccyDRAFT_0516 [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 150

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/172 (22%), Positives = 69/172 (40%), Gaps = 40/172 (23%)

Query: 85  VSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF 144
            + I + + +L+ +     F T + +  IY++D  F+DP   F+G               
Sbjct: 12  TAKIQEAIAVLKQELPTL-FQTDL-SYHIYSQDIFFQDPVNTFKG--------------- 54

Query: 145 EYPSIGLQNIEKFVFLVSLMVGRREGRMR------GLHNSTALGGHIFLLPYLLRISLIC 198
                      KF + +     R  GR+        LHN      +       +R     
Sbjct: 55  -----------KFNYRIIFWTLRFHGRLFFTELFFDLHNVQQASENT------IRADWTV 97

Query: 199 RTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
           R  L+LPWKP +  +G ++Y LN +  I +H ++W+    E + Q +  G++
Sbjct: 98  RGTLRLPWKPRLLFNGYSIYTLNSDALIFKHIDTWDRPLGEILSQFWRRGEQ 149


>gi|428780825|ref|YP_007172611.1| hypothetical protein Dacsa_2676 [Dactylococcopsis salina PCC 8305]
 gi|428695104|gb|AFZ51254.1| hypothetical protein Dacsa_2676 [Dactylococcopsis salina PCC 8305]
          Length = 125

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/156 (23%), Positives = 65/156 (41%), Gaps = 34/156 (21%)

Query: 91  IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
           I+  + +DY+         +  IYAED  F+DP   F+G + Y   +  +  +F+  ++ 
Sbjct: 3   IIETIEADYQK---FPANQSYNIYAEDVYFKDPLTEFQGIKRYQSMINFISTWFKDINLE 59

Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
           L +I++                  +H    L    +  P              LPW+P I
Sbjct: 60  LHSIKQI--------------ENTIHTEWTLN---WTTP--------------LPWQPRI 88

Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFT 246
           SI G +  ++N + KI  H + W+ S  E + Q F+
Sbjct: 89  SIPGWSELKINTDQKIISHIDYWHCSRWEVLKQHFS 124


>gi|425453289|ref|ZP_18833047.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389801445|emb|CCI19385.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 128

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPIDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|443649469|ref|ZP_21130255.1| hypothetical protein C789_795 [Microcystis aeruginosa DIANCHI905]
 gi|159027936|emb|CAO87099.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443334951|gb|ELS49439.1| hypothetical protein C789_795 [Microcystis aeruginosa DIANCHI905]
          Length = 128

 Score = 48.1 bits (113), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADNVYFKDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+         G        LH +T                        LPW+P 
Sbjct: 59  DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|166367539|ref|YP_001659812.1| hypothetical protein MAE_47980 [Microcystis aeruginosa NIES-843]
 gi|166089912|dbj|BAG04620.1| hypothetical protein MAE_47980 [Microcystis aeruginosa NIES-843]
          Length = 128

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 66/154 (42%), Gaps = 34/154 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           DI+ IL+ DY+         T +IYA++  F+DP  +FRG + Y   +  +  +F+   +
Sbjct: 2   DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            +  IE+                +G   +T    H ++ P              LPW+P 
Sbjct: 59  DVHAIEQ----------------QGNIINTRWTLH-WITP--------------LPWRPR 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
           I+I G +   L+D   I  H + W+ S  + + Q
Sbjct: 88  IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121


>gi|172039389|ref|YP_001805890.1| hypothetical protein cce_4476 [Cyanothece sp. ATCC 51142]
 gi|354552346|ref|ZP_08971654.1| Protein of unknown function DUF2358 [Cyanothece sp. ATCC 51472]
 gi|171700843|gb|ACB53824.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353555668|gb|EHC25056.1| Protein of unknown function DUF2358 [Cyanothece sp. ATCC 51472]
          Length = 128

 Score = 46.2 bits (108), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 33/136 (24%), Positives = 53/136 (38%), Gaps = 31/136 (22%)

Query: 110 TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
           T  +YA++  F+DP   F G + Y   ++ +  +F+   + L NI +     S  +   E
Sbjct: 19  TYSVYADNVYFKDPVNEFTGVKRYQEMIQFMSTWFKNVDLELHNIYQ-----SDNIIHTE 73

Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
             +  +                            LPWKP ISI G +    ND+  I  H
Sbjct: 74  WTLHWI--------------------------TPLPWKPPISIPGRSELTFNDQNLIISH 107

Query: 230 AESWNVSALEAVGQIF 245
            + WN S  + + Q F
Sbjct: 108 IDYWNCSRWDVLKQHF 123


>gi|428770087|ref|YP_007161877.1| hypothetical protein Cyan10605_1730 [Cyanobacterium aponinum PCC
           10605]
 gi|428684366|gb|AFZ53833.1| Protein of unknown function DUF2358 [Cyanobacterium aponinum PCC
           10605]
          Length = 125

 Score = 44.7 bits (104), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 62/156 (39%), Gaps = 35/156 (22%)

Query: 90  DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
           +++ I++ DY+         T  IYAE+  F+DP   F G + Y   +  L  +F   ++
Sbjct: 2   NLIDIIKEDYQK---FPEDQTYAIYAENVHFKDPVYDFYGLKKYQEMIAFLRKWFSNLNL 58

Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
            L  I +       +  +   R     NS                         LPWKP 
Sbjct: 59  ELHEINQ-------IENQINTRWTMSWNSP------------------------LPWKPF 87

Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           IS+ G +  +L D+L I  H + W+ S  + + Q F
Sbjct: 88  ISVSGRSELKLKDDL-IVGHYDYWDCSFFDMIKQHF 122


>gi|158337451|ref|YP_001518626.1| hypothetical protein AM1_4330 [Acaryochloris marina MBIC11017]
 gi|158307692|gb|ABW29309.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 131

 Score = 43.9 bits (102), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 64/161 (39%), Gaps = 42/161 (26%)

Query: 91  IVTILRSDY----ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
           I+  L++DY    EN  F       ++YAED  F+DP  +FRG   Y   +  +  +F  
Sbjct: 3   ILERLQADYDRFPENQSF-------DLYAEDVYFKDPLNQFRGVAKYQEMIGFIQKWFIN 55

Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
             + +  IE+           ++  +                    R +L  +     PW
Sbjct: 56  TRMEVHGIEQ-----------QDNEINT------------------RWTLYWQA--PFPW 84

Query: 207 KPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
           KP ++I G +  +LND   +  H + W+ S      Q+F P
Sbjct: 85  KPQMAITGRSELKLNDAGLVCSHIDYWDCSRFSVFKQLFFP 125


>gi|119494619|ref|ZP_01624743.1| hypothetical protein L8106_16284 [Lyngbya sp. PCC 8106]
 gi|119452054|gb|EAW33270.1| hypothetical protein L8106_16284 [Lyngbya sp. PCC 8106]
          Length = 127

 Score = 43.5 bits (101), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 37/154 (24%), Positives = 61/154 (39%), Gaps = 34/154 (22%)

Query: 92  VTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGL 151
           + I++ DY+         T  +Y  D  F+DP  +F G E Y + +  +  +F    + L
Sbjct: 4   IDIIKQDYQR---FPEDQTYSLYTPDVYFQDPLNQFNGVERYKQMIGFIRQWFLDIQLDL 60

Query: 152 QNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS 211
             I +   ++                               R +L   T   LPWKP +S
Sbjct: 61  HEISQAENVIKT-----------------------------RWTLSWTT--PLPWKPRVS 89

Query: 212 IDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           I G +  +LN E  I+ H + W+ + L+ V QI 
Sbjct: 90  IPGWSELKLNSEGLISSHIDYWDCTRLDVVKQIL 123


>gi|307154421|ref|YP_003889805.1| hypothetical protein Cyan7822_4622 [Cyanothece sp. PCC 7822]
 gi|306984649|gb|ADN16530.1| Protein of unknown function DUF2358 [Cyanothece sp. PCC 7822]
          Length = 150

 Score = 42.0 bits (97), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 37/166 (22%), Positives = 67/166 (40%), Gaps = 28/166 (16%)

Query: 84  LVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
           L + I+ ++  L++D    +      + +IY +D  F+DP   F+G      N R++   
Sbjct: 6   LKNKINQVIDTLKADLPTLF--EKDISYDIYTQDIYFKDPVNTFKG----KLNYRII--- 56

Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
             Y ++        +F   +     E +               + P ++R     R  L 
Sbjct: 57  --YWTLRFHGQ---LFFREIFFDLHEVKE--------------IEPDIIRADWTVRGTLL 97

Query: 204 LPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGD 249
           +PWK  I   G + Y+LN E  I  H ++W+    E + Q F  G+
Sbjct: 98  VPWKAYIFFKGFSTYKLNSEGLIYEHIDTWDRQPGEILQQFFRSGE 143


>gi|300869129|ref|ZP_07113728.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300332898|emb|CBN58926.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 144

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 26/139 (18%)

Query: 112 EIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGR 171
           +IY +D  F+DP  +F+G      N R++     + +         +F   +        
Sbjct: 30  DIYTKDIYFQDPVNKFKG----KINYRIIFWTLRFHAQ--------LFFTEIHFD----- 72

Query: 172 MRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAE 231
              LH          L  + +R +L      ++PWK  I  +G + Y+LN E  I  H +
Sbjct: 73  ---LHEVGQTASDTILAKWTVRGTL------RVPWKAQIFFNGYSTYKLNTEGLIYEHID 123

Query: 232 SWNVSALEAVGQIFTPGDR 250
           +W+    E + Q    GD 
Sbjct: 124 TWDRKPGEVLKQFIRKGDE 142


>gi|119489662|ref|ZP_01622421.1| hypothetical protein L8106_13095 [Lyngbya sp. PCC 8106]
 gi|119454399|gb|EAW35548.1| hypothetical protein L8106_13095 [Lyngbya sp. PCC 8106]
          Length = 157

 Score = 38.9 bits (89), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/54 (31%), Positives = 27/54 (50%)

Query: 191 LLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQI 244
           L+      R  L+LPWKP I  +GS+ Y+L     +  H ++W+ S    + Q 
Sbjct: 92  LITADWTVRGTLRLPWKPQIFFNGSSAYKLTKHALVYHHEDTWDRSPATILKQF 145


>gi|428210029|ref|YP_007094382.1| hypothetical protein Chro_5140 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428011950|gb|AFY90513.1| Protein of unknown function DUF2358 [Chroococcidiopsis thermalis
           PCC 7203]
          Length = 139

 Score = 38.9 bits (89), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 17/51 (33%), Positives = 28/51 (54%)

Query: 195 SLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
           +   R  L++PWK  +  +G + Y+LND   I  H ++W+ S  E + Q F
Sbjct: 86  TWTVRGVLRVPWKAKVFFNGYSNYKLNDNGLIYEHIDTWDRSPKEILQQFF 136


>gi|428223898|ref|YP_007107995.1| hypothetical protein GEI7407_0442 [Geitlerinema sp. PCC 7407]
 gi|427983799|gb|AFY64943.1| Protein of unknown function DUF2358 [Geitlerinema sp. PCC 7407]
          Length = 160

 Score = 38.5 bits (88), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 17/60 (28%), Positives = 31/60 (51%)

Query: 189 PYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
           P ++ ++   R  L+LPW+P +  +G++ Y L  +  I  H + W+    E + Q F  G
Sbjct: 98  PDVVFVTWTVRGTLRLPWRPRLFFNGNSTYRLGPDGLIYDHRDVWDRKPSELLKQFFVKG 157


>gi|397641287|gb|EJK74572.1| hypothetical protein THAOC_03742, partial [Thalassiosira oceanica]
          Length = 195

 Score = 38.5 bits (88), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 23/39 (58%)

Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLR 138
           EN Y  TG   S  +A DC F DPT+ F+G + +  N++
Sbjct: 157 ENNYLWTGNIDSSSFAADCTFTDPTLSFQGVDKFVSNVQ 195


>gi|303287839|ref|XP_003063208.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455040|gb|EEH52344.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 103

 Score = 38.5 bits (88), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 5/51 (9%)

Query: 202 LKLPWKPLISIDGSTVYELNDE-----LKITRHAESWNVSALEAVGQIFTP 247
           L LPW+P I + G T +   D        I  + E W +SAL+AV Q+  P
Sbjct: 50  LNLPWRPRIDLTGQTTFGFIDAGVERGCLIVGYREEWGISALDAVAQLVAP 100


>gi|303280892|ref|XP_003059738.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226458393|gb|EEH55690.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 101

 Score = 38.5 bits (88), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 5/51 (9%)

Query: 202 LKLPWKPLISIDGSTVYELNDE-----LKITRHAESWNVSALEAVGQIFTP 247
           L LPW+P I + G T     D        I  + E W VSAL+AV Q+  P
Sbjct: 48  LNLPWRPRIDLTGQTTLGFMDAGVERGCLIVSYREEWGVSALDAVAQLVAP 98


>gi|242032997|ref|XP_002463893.1| hypothetical protein SORBIDRAFT_01g008410 [Sorghum bicolor]
 gi|241917747|gb|EER90891.1| hypothetical protein SORBIDRAFT_01g008410 [Sorghum bicolor]
          Length = 3513

 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 34/70 (48%), Gaps = 3/70 (4%)

Query: 29   CSGTVNDGN-KNTNNKTATPQILKFAVSGVTELLRLFSSF--DKGRLDKVSYKQKYEILV 85
            C+G  +DGN   T        + +F   G+  LLR+  +     G++  VS  Q Y ++V
Sbjct: 3272 CTGVSHDGNILTTGGDDGVVAVWRFVKDGIRRLLRMEKALCAHTGKITCVSVSQPYSLIV 3331

Query: 86   SGIDDIVTIL 95
            SG DD   IL
Sbjct: 3332 SGSDDCSVIL 3341


>gi|427735544|ref|YP_007055088.1| hypothetical protein Riv7116_2004 [Rivularia sp. PCC 7116]
 gi|427370585|gb|AFY54541.1| hypothetical protein Riv7116_2004 [Rivularia sp. PCC 7116]
          Length = 144

 Score = 38.1 bits (87), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 26/48 (54%)

Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
            R  L+LPWK  I  +G + Y+LN +  I  H +SW+    E + Q F
Sbjct: 91  VRGTLRLPWKAKIFFNGHSNYKLNQDGLIYEHVDSWDRKPSEILRQFF 138


>gi|414872729|tpg|DAA51286.1| TPA: hypothetical protein ZEAMMB73_996940 [Zea mays]
          Length = 2552

 Score = 37.0 bits (84), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 34/70 (48%), Gaps = 3/70 (4%)

Query: 29   CSGTVNDGN-KNTNNKTATPQILKFAVSGVTELLRLFSSF--DKGRLDKVSYKQKYEILV 85
            C+G  +DGN   T        + +F   G+  LLR+  +     G++  VS  Q Y ++V
Sbjct: 2309 CTGVSHDGNILITGGDDGVVAVWRFVKDGIRRLLRMEKALCAHTGKITCVSVSQPYSLIV 2368

Query: 86   SGIDDIVTIL 95
            SG DD   IL
Sbjct: 2369 SGSDDCSVIL 2378


>gi|397575083|gb|EJK49524.1| hypothetical protein THAOC_31588 [Thalassiosira oceanica]
          Length = 282

 Score = 37.0 bits (84), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 1/54 (1%)

Query: 88  IDDIVTILRSD-YENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLL 140
           I+D+   L  + Y   +FVTG+     +A+D  F+DP ++  G E Y+R +  L
Sbjct: 104 IEDLAAALDFELYNREWFVTGLINPIYFADDFQFQDPDVKLTGVEEYARGVNKL 157


>gi|428306747|ref|YP_007143572.1| hypothetical protein Cri9333_3229 [Crinalium epipsammum PCC 9333]
 gi|428248282|gb|AFZ14062.1| Protein of unknown function DUF2358 [Crinalium epipsammum PCC 9333]
          Length = 142

 Score = 37.0 bits (84), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 16/51 (31%), Positives = 26/51 (50%)

Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
            R  L++PWK  I  +G + Y+LN +  I  H ++W+    E + Q    G
Sbjct: 89  VRGVLRVPWKAHIFFNGYSTYKLNQDCLIYEHIDTWDRKPSEILQQFIRKG 139


>gi|113478067|ref|YP_724128.1| hypothetical protein Tery_4691 [Trichodesmium erythraeum IMS101]
 gi|110169115|gb|ABG53655.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 135

 Score = 36.6 bits (83), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 26/48 (54%)

Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
            R  L++PWK  I  +G + Y+LN +  I  H ++W+    E + Q F
Sbjct: 85  VRGILRVPWKAKIFFNGYSTYKLNQDGLIYEHIDTWDRKPSEILKQFF 132


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.140    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,912,975,934
Number of Sequences: 23463169
Number of extensions: 157546167
Number of successful extensions: 313425
Number of sequences better than 100.0: 187
Number of HSP's better than 100.0 without gapping: 149
Number of HSP's successfully gapped in prelim test: 38
Number of HSP's that attempted gapping in prelim test: 313036
Number of HSP's gapped (non-prelim): 336
length of query: 251
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 112
effective length of database: 9,097,814,876
effective search space: 1018955266112
effective search space used: 1018955266112
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)