BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033347
         (121 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224099213|ref|XP_002311406.1| predicted protein [Populus trichocarpa]
 gi|222851226|gb|EEE88773.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  205 bits (521), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 99/120 (82%), Positives = 111/120 (92%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWGVA GRKTMVATAMLL VLSGPVKAL YLLTHG++GF+MGSLWR+G +WGLSIFLCT
Sbjct: 149 LRWGVAGGRKTMVATAMLLFVLSGPVKALTYLLTHGLVGFTMGSLWRMGANWGLSIFLCT 208

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           IAR+ GA+GY+LTSSFLIRENILALITINIHASLTFIF+AAG+N VPSMN IY +FG LV
Sbjct: 209 IARATGAVGYVLTSSFLIRENILALITINIHASLTFIFAAAGINTVPSMNFIYSLFGILV 268


>gi|255546947|ref|XP_002514531.1| conserved hypothetical protein [Ricinus communis]
 gi|223546135|gb|EEF47637.1| conserved hypothetical protein [Ricinus communis]
          Length = 306

 Score =  202 bits (513), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 96/120 (80%), Positives = 108/120 (90%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWGVAAGRKTMVATAMLL VLSGP+KAL YLLTHG++GF+MGSLWRL  DWGLSIFLCT
Sbjct: 148 MRWGVAAGRKTMVATAMLLFVLSGPLKALTYLLTHGIVGFTMGSLWRLRADWGLSIFLCT 207

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           IAR+ GAMGY+  +SFLIRENILALITINIHASLT IF+A G+N +PSMN+IY +FG LV
Sbjct: 208 IARATGAMGYVFVTSFLIRENILALITINIHASLTLIFAAIGINTIPSMNLIYTLFGILV 267


>gi|449499494|ref|XP_004160832.1| PREDICTED: uncharacterized LOC101204847 [Cucumis sativus]
          Length = 469

 Score =  194 bits (493), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 94/120 (78%), Positives = 107/120 (89%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWG++AGRKTMVAT +LLLVLSGPVKAL YLL HG++GF+MGSLWRLG +W  SIFLCT
Sbjct: 312 MRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT 371

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA+GY+L SSFLIRENILALITINIHASLT IF+A GVN++PSMN IY IFGTLV
Sbjct: 372 IVRAFGAVGYVLVSSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV 431


>gi|449446255|ref|XP_004140887.1| PREDICTED: uncharacterized protein LOC101204847 [Cucumis sativus]
          Length = 301

 Score =  193 bits (490), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 94/120 (78%), Positives = 107/120 (89%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWG++AGRKTMVAT +LLLVLSGPVKAL YLL HG++GF+MGSLWRLG +W  SIFLCT
Sbjct: 144 MRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT 203

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA+GY+L SSFLIRENILALITINIHASLT IF+A GVN++PSMN IY IFGTLV
Sbjct: 204 IVRAFGAVGYVLVSSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV 263


>gi|356499992|ref|XP_003518819.1| PREDICTED: uncharacterized protein LOC100783446 [Glycine max]
          Length = 287

 Score =  186 bits (471), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 89/120 (74%), Positives = 104/120 (86%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWGV AGRKT+VAT +LLLVLSGPVKAL YLL HG++GF+MG+LWR G  W +SIFLCT
Sbjct: 130 MRWGVDAGRKTLVATTILLLVLSGPVKALTYLLKHGIVGFTMGTLWRTGASWNMSIFLCT 189

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA+G++L SSFLIRENILALITINIHASLTF+ +AAGV  +PSMNVIY +FG LV
Sbjct: 190 IVRALGAVGFVLISSFLIRENILALITINIHASLTFLLTAAGVTSIPSMNVIYTLFGILV 249


>gi|225423785|ref|XP_002279949.1| PREDICTED: uncharacterized protein LOC100256033 isoform 1 [Vitis
           vinifera]
          Length = 286

 Score =  185 bits (470), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 88/120 (73%), Positives = 105/120 (87%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWGVAAGRKTMVAT MLL VLSGPVKA  YLL HG++G +MGSLWRLG +WGLSI LCT
Sbjct: 129 MRWGVAAGRKTMVATTMLLFVLSGPVKASTYLLLHGLVGLTMGSLWRLGANWGLSICLCT 188

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
             R+ GAMGY+L SSFLIRENILALIT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 189 FVRAMGAMGYVLISSFLIRENILALITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 248


>gi|297737924|emb|CBI27125.3| unnamed protein product [Vitis vinifera]
          Length = 192

 Score =  184 bits (468), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 88/120 (73%), Positives = 105/120 (87%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWGVAAGRKTMVAT MLL VLSGPVKA  YLL HG++G +MGSLWRLG +WGLSI LCT
Sbjct: 35  MRWGVAAGRKTMVATTMLLFVLSGPVKASTYLLLHGLVGLTMGSLWRLGANWGLSICLCT 94

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
             R+ GAMGY+L SSFLIRENILALIT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 95  FVRAMGAMGYVLISSFLIRENILALITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 154


>gi|357496929|ref|XP_003618753.1| Membrane protein [Medicago truncatula]
 gi|355493768|gb|AES74971.1| Membrane protein [Medicago truncatula]
          Length = 247

 Score =  182 bits (463), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 87/120 (72%), Positives = 102/120 (85%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWG+ AGRKTMVAT +LL VLSGPVKAL YLL HG++G++MG+LWRL   W LS+FLC 
Sbjct: 90  MRWGIDAGRKTMVATTILLFVLSGPVKALTYLLKHGIVGYTMGTLWRLETSWSLSVFLCA 149

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I RS GA+G++L SSFLIRENILALITINIHASLTF+ +AAGVN +PSMN IY +FG LV
Sbjct: 150 IVRSIGAVGFVLISSFLIRENILALITINIHASLTFVLTAAGVNSIPSMNFIYTLFGILV 209


>gi|297851064|ref|XP_002893413.1| hypothetical protein ARALYDRAFT_890129 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339255|gb|EFH69672.1| hypothetical protein ARALYDRAFT_890129 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  177 bits (449), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 81/120 (67%), Positives = 106/120 (88%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW +A GRKTMVAT MLL +LSGPVKAL Y LTHG++G ++GSLWR+G  W LSIF+CT
Sbjct: 136 IRWNIAGGRKTMVATVMLLFILSGPVKALTYFLTHGLVGLALGSLWRMGASWRLSIFVCT 195

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           + R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IFGT++
Sbjct: 196 MVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVL 255


>gi|22329794|ref|NP_173943.2| uncharacterized protein [Arabidopsis thaliana]
 gi|18252157|gb|AAL61911.1| unknown protein [Arabidopsis thaliana]
 gi|21386935|gb|AAM47871.1| unknown protein [Arabidopsis thaliana]
 gi|332192537|gb|AEE30658.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 289

 Score =  176 bits (445), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 81/120 (67%), Positives = 105/120 (87%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW +A GRKTMVAT MLL +LSGPVKAL Y LTHG++G ++GSLW +G  W LSIFLCT
Sbjct: 132 IRWNIAGGRKTMVATVMLLFILSGPVKALTYFLTHGLVGLALGSLWSMGASWRLSIFLCT 191

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           + R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IFGT++
Sbjct: 192 MVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVL 251


>gi|357139303|ref|XP_003571222.1| PREDICTED: uncharacterized protein LOC100830022 [Brachypodium
           distachyon]
          Length = 295

 Score =  174 bits (441), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 82/120 (68%), Positives = 100/120 (83%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ AGRKTMVAT +LL  LSGPVKA  YLL HGV+G +MG++WRL  +W +SI LC+
Sbjct: 119 LRWGLEAGRKTMVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 178

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYVIFGTL+
Sbjct: 179 ITRAVGACGYVLVSSFLIRENILELITVNIHASLTYILAAAGVNAIPSMDAIYVIFGTLL 238


>gi|242091734|ref|XP_002436357.1| hypothetical protein SORBIDRAFT_10g000990 [Sorghum bicolor]
 gi|241914580|gb|EER87724.1| hypothetical protein SORBIDRAFT_10g000990 [Sorghum bicolor]
          Length = 286

 Score =  168 bits (426), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 81/120 (67%), Positives = 100/120 (83%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ AGRKT+VAT +LL  LSGPVKA  YLL HGV+G  MG++WRL  +W +SI LC+
Sbjct: 129 LRWGLKAGRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSILLCS 188

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 189 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMDAIYVLFGTLL 248


>gi|115445849|ref|NP_001046704.1| Os02g0326000 [Oryza sativa Japonica Group]
 gi|46390264|dbj|BAD15693.1| unknown protein [Oryza sativa Japonica Group]
 gi|113536235|dbj|BAF08618.1| Os02g0326000 [Oryza sativa Japonica Group]
 gi|215736884|dbj|BAG95813.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765527|dbj|BAG87224.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222622737|gb|EEE56869.1| hypothetical protein OsJ_06502 [Oryza sativa Japonica Group]
          Length = 281

 Score =  167 bits (424), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 80/120 (66%), Positives = 99/120 (82%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ AGRKT VAT +LL  LSGPVKA  YLL HGV+G +MG++WRL  +W +SI LC+
Sbjct: 124 LRWGLEAGRKTAVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 183

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 184 ITRALGACGYVLLSSFLIRENILGLITVNIHASLTYILAAAGVNTIPSMDAIYVLFGTLL 243


>gi|218190626|gb|EEC73053.1| hypothetical protein OsI_07007 [Oryza sativa Indica Group]
          Length = 281

 Score =  167 bits (424), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 80/120 (66%), Positives = 99/120 (82%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ AGRKT VAT +LL  LSGPVKA  YLL HGV+G +MG++WRL  +W +SI LC+
Sbjct: 124 LRWGLEAGRKTAVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 183

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 184 ITRALGACGYVLLSSFLIRENILGLITVNIHASLTYILAAAGVNTIPSMDAIYVLFGTLL 243


>gi|226508684|ref|NP_001140399.1| uncharacterized protein LOC100272453 [Zea mays]
 gi|194699326|gb|ACF83747.1| unknown [Zea mays]
 gi|413953513|gb|AFW86162.1| hypothetical protein ZEAMMB73_298407 [Zea mays]
          Length = 250

 Score =  166 bits (420), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 80/120 (66%), Positives = 98/120 (81%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ A RKT+VAT +LL  LSGPVKA  YLL HGV+G  MG++WRL  +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM  IY++FGTLV
Sbjct: 181 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIYLLFGTLV 240


>gi|326494316|dbj|BAJ90427.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326522989|dbj|BAJ88540.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 310

 Score =  160 bits (406), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 82/120 (68%), Positives = 99/120 (82%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ AGRKTMVAT +LL  LSGPVKAL YLL HGV+G SMG++WRL  +W  SI +C 
Sbjct: 151 LRWGLEAGRKTMVATVLLLFTLSGPVKALTYLLMHGVVGLSMGTMWRLETNWIASIIICA 210

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           I R+ GA GY+L +SFLIRENI  LIT+NIHASLT+I +AAGVN++PSM+ IYVIFGTLV
Sbjct: 211 IIRAVGACGYVLVTSFLIRENIFQLITVNIHASLTYILAAAGVNMIPSMDAIYVIFGTLV 270


>gi|168067903|ref|XP_001785841.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662495|gb|EDQ49342.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 192

 Score =  156 bits (394), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 72/120 (60%), Positives = 100/120 (83%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWG AAGRKTMVATAMLLL+LSGP+KA +YLL HG +G +MG+ W+   +WGLSI  CT
Sbjct: 35  MRWGAAAGRKTMVATAMLLLILSGPLKAASYLLMHGFVGLAMGAFWKWNFNWGLSIVGCT 94

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           + R  GA+G++L +S+L+R+NILALIT+N HASL+++ +A G+N+VP+M +IYV+F +L+
Sbjct: 95  VVRCLGALGFVLLTSWLLRDNILALITMNAHASLSYMLAAVGINLVPTMPMIYVVFASLL 154


>gi|224033465|gb|ACN35808.1| unknown [Zea mays]
 gi|413953514|gb|AFW86163.1| membrane protein [Zea mays]
          Length = 278

 Score =  155 bits (392), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 75/113 (66%), Positives = 91/113 (80%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ A RKT+VAT +LL  LSGPVKA  YLL HGV+G  MG++WRL  +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
           I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM  IY
Sbjct: 181 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIY 233


>gi|195626624|gb|ACG35142.1| membrane protein [Zea mays]
          Length = 192

 Score =  155 bits (392), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 75/113 (66%), Positives = 91/113 (80%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ A RKT+VAT +LL  LSGPVKA  YLL HGV+G  MG++WRL  +W +SI LC+
Sbjct: 35  LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 94

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
           I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM  IY
Sbjct: 95  IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIY 147


>gi|302802149|ref|XP_002982830.1| hypothetical protein SELMODRAFT_47311 [Selaginella moellendorffii]
 gi|300149420|gb|EFJ16075.1| hypothetical protein SELMODRAFT_47311 [Selaginella moellendorffii]
          Length = 206

 Score =  148 bits (374), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 70/120 (58%), Positives = 97/120 (80%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWG AAGRKTMV TA+LLLV++GP+KAL+Y+L HG LG +MG+LWR  ++WG+SI  CT
Sbjct: 49  MRWGAAAGRKTMVTTALLLLVIAGPLKALSYVLMHGFLGVAMGALWRWRINWGVSILACT 108

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           + R+ GA+G++  SS+LI+ENIL LITIN HASL+++ +A G+NI  +M  IY +F +L+
Sbjct: 109 LVRAVGAIGFVFLSSWLIKENILNLITINTHASLSYMVAAMGINISLTMPTIYAVFTSLL 168


>gi|302800209|ref|XP_002981862.1| hypothetical protein SELMODRAFT_57372 [Selaginella moellendorffii]
 gi|300150304|gb|EFJ16955.1| hypothetical protein SELMODRAFT_57372 [Selaginella moellendorffii]
          Length = 206

 Score =  147 bits (372), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 70/120 (58%), Positives = 97/120 (80%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWG AAGRKTMV TA+LLLV++GP+KAL+Y+L HG LG +MG+LWR  ++WG+SI  CT
Sbjct: 49  MRWGAAAGRKTMVTTALLLLVIAGPLKALSYVLMHGFLGVAMGALWRWRINWGVSILACT 108

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           + R+ GA+G++  SS+LI+ENIL LITIN HASL+++ +A G+NI  +M  IY +F +L+
Sbjct: 109 LVRALGAIGFVFLSSWLIKENILNLITINTHASLSYMVAAMGINISLTMPTIYAVFTSLL 168


>gi|255646056|gb|ACU23515.1| unknown [Glycine max]
          Length = 223

 Score =  126 bits (316), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 57/81 (70%), Positives = 70/81 (86%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRWGV AGRKT+VAT +LLLVLSGPVKAL YLL HG++GF+MG+LWR G  W +SIFLCT
Sbjct: 130 MRWGVDAGRKTLVATTILLLVLSGPVKALTYLLKHGIVGFTMGTLWRTGASWNMSIFLCT 189

Query: 61  IARSAGAMGYILTSSFLIREN 81
           I R+ GA+G++L SSFLIR++
Sbjct: 190 IVRALGAVGFVLISSFLIRKH 210


>gi|413953512|gb|AFW86161.1| hypothetical protein ZEAMMB73_298407 [Zea mays]
          Length = 254

 Score =  108 bits (269), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 57/113 (50%), Positives = 71/113 (62%), Gaps = 24/113 (21%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG+ A RKT+VAT +LL  LSGPVKA  YLL HGV+G  MG++WRL  +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
           I                        IT+NIHASLT+I +AAGVN +PSM  IY
Sbjct: 181 I------------------------ITVNIHASLTYILTAAGVNTIPSMGTIY 209


>gi|12321175|gb|AAG50677.1|AC079829_10 unknown protein [Arabidopsis thaliana]
          Length = 275

 Score =  102 bits (255), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 64/124 (51%), Positives = 84/124 (67%), Gaps = 19/124 (15%)

Query: 13  VATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLS----IFL--------CT 60
           VAT MLL +LSGPVKAL Y ++   L F       L  D  L+    IF+        CT
Sbjct: 117 VATVMLLFILSGPVKALTYFVS---LRFPRLPSRVLSRDHCLTFGDFIFVAYSWSCGACT 173

Query: 61  ----IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIF 116
                 R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IF
Sbjct: 174 GIIVEVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIF 233

Query: 117 GTLV 120
           GT++
Sbjct: 234 GTVL 237


>gi|384245220|gb|EIE18715.1| hypothetical protein COCSUDRAFT_20460, partial [Coccomyxa
           subellipsoidea C-169]
          Length = 206

 Score = 82.8 bits (203), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 73/123 (59%), Gaps = 3/123 (2%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKA---LAYLLTHGVLGFSMGSLWRLGVDWGLSIF 57
           MRWG AAGRKTM AT  LLL  + P+     L +LL  G+L  ++G+ W LG+ W LS+ 
Sbjct: 40  MRWGPAAGRKTMTATCFLLLGENFPLSTEDILTFLLNQGLLAATLGASWSLGMHWALSVA 99

Query: 58  LCTIARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
              + R  G M Y+L SS+++ EN+ +++  NI++ L  + +  G +  P   V+ V+  
Sbjct: 100 AGALVRVGGMMAYVLVSSWIMNENLFSILMTNIYSLLDQMNAFIGASGAPPPTVVIVVLA 159

Query: 118 TLV 120
           +L+
Sbjct: 160 SLL 162


>gi|302840505|ref|XP_002951808.1| hypothetical protein VOLCADRAFT_81593 [Volvox carteri f.
           nagariensis]
 gi|300263056|gb|EFJ47259.1| hypothetical protein VOLCADRAFT_81593 [Volvox carteri f.
           nagariensis]
          Length = 330

 Score = 67.8 bits (164), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 40/108 (37%), Positives = 64/108 (59%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  AG +TM AT  LL+VL GP++A++YL  HG+L  ++GSLW   V +   +    
Sbjct: 167 LRSGGGAGWRTMTATCFLLVVLLGPLRAMSYLFLHGLLAATLGSLWSSRVGFWPGVVAGA 226

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           + R  G + Y++ SS  + EN+ AL+  N++  L    +A G++  PS
Sbjct: 227 LVRMCGQLSYLVMSSVTMNENMFALLLSNVYNMLDQASAALGLSGAPS 274


>gi|412985991|emb|CCO17191.1| predicted protein [Bathycoccus prasinos]
          Length = 243

 Score = 67.4 bits (163), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 38/114 (33%), Positives = 62/114 (54%)

Query: 2   RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
           RWG  A  +T++   +LL +LSGP++A  Y   HG  G ++G  W+    W  S+ +  +
Sbjct: 78  RWGDVAAWRTLLVAFLLLTLLSGPLRASNYACLHGFTGLALGMCWKRKFSWVQSVPISAL 137

Query: 62  ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           AR+ G    +  SS +IREN+L L+   ++A +  I +  G   VP+M  I  +
Sbjct: 138 ARTIGVFSSLAISSLVIRENVLRLMVQQVYALIDQICATLGATFVPNMTAILCV 191


>gi|427712923|ref|YP_007061547.1| hypothetical protein Syn6312_1862 [Synechococcus sp. PCC 6312]
 gi|427377052|gb|AFY61004.1| putative membrane protein (DUF2232) [Synechococcus sp. PCC 6312]
          Length = 251

 Score = 63.5 bits (153), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 59/112 (52%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A     + +A+LL VL GP ++L YLL HG+LG  +G LW+    W +S+F   
Sbjct: 90  LRWHRRAAWMGAIVSALLLSVLMGPPRSLQYLLPHGILGVVLGGLWKQKAPWHISLFWGA 149

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           +  S G    IL  S L+ EN+   +   + + L +     G+ ++PS  ++
Sbjct: 150 VLGSIGFFFQILVVSALLGENLWIYVNQQVTSFLDWALVQLGILLIPSFGLV 201


>gi|255085474|ref|XP_002505168.1| predicted protein [Micromonas sp. RCC299]
 gi|226520437|gb|ACO66426.1| predicted protein [Micromonas sp. RCC299]
          Length = 178

 Score = 62.8 bits (151), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 62/112 (55%)

Query: 2   RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
           R G  A  +  V T+MLL ++SGP++A+ Y   HG + +S+G  W     W +SI L   
Sbjct: 41  RNGPKAASRVAVVTSMLLFLISGPLRAVNYYCLHGAMAYSLGWAWTKRFSWWVSIPLSAT 100

Query: 62  ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
            RS G    ++ +S ++REN++ L+   +   L  I +  G++ +P++  ++
Sbjct: 101 VRSIGIFCSLMFTSLVLRENVMQLMVTQMMGLLDQIAANIGMSWMPTLGWVW 152


>gi|388506298|gb|AFK41215.1| unknown [Lotus japonicus]
          Length = 185

 Score = 59.7 bits (143), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/35 (74%), Positives = 30/35 (85%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH 35
           MRWGV AG+KT+V+T +LL VLSGPVKAL YLL H
Sbjct: 150 MRWGVDAGKKTLVSTTILLFVLSGPVKALTYLLKH 184


>gi|443477605|ref|ZP_21067439.1| Protein of unknown function DUF2232, membrane [Pseudanabaena biceps
           PCC 7429]
 gi|443017225|gb|ELS31713.1| Protein of unknown function DUF2232, membrane [Pseudanabaena biceps
           PCC 7429]
          Length = 208

 Score = 58.5 bits (140), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 57/112 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A    MV T++LL +L GP +++ Y++ HG++G ++G LW+    W  S+ + T
Sbjct: 51  LRWNSRAAWMAMVVTSLLLAILMGPTRSIQYIVPHGLVGVTLGYLWKRDFPWSGSLSIAT 110

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           I  S G     +  S L+ EN+     + I A L ++    G    P   +I
Sbjct: 111 IIGSVGTAFQFVFLSVLLGENVWTYSIVQISAFLNWLMQLFGSLEQPDFAMI 162


>gi|307109189|gb|EFN57427.1| hypothetical protein CHLNCDRAFT_21406, partial [Chlorella
           variabilis]
          Length = 206

 Score = 58.5 bits (140), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 34/97 (35%), Positives = 55/97 (56%)

Query: 24  GPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENIL 83
           GPV+A  YLL +GVL  S+   W L + W +SI L  +AR  G + YI  SS++  EN+L
Sbjct: 67  GPVRATTYLLVYGVLSLSLAVCWALRLPWVVSIPLGALARIGGYLAYIALSSWVTNENLL 126

Query: 84  ALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
            L+  N++A L  + +  G    P +  + ++  +L+
Sbjct: 127 MLMLNNVYALLDQLSALLGTTGSPPLLAVAIVLCSLL 163


>gi|428219126|ref|YP_007103591.1| hypothetical protein Pse7367_2910 [Pseudanabaena sp. PCC 7367]
 gi|427990908|gb|AFY71163.1| Protein of unknown function DUF2232, membrane [Pseudanabaena sp.
           PCC 7367]
          Length = 209

 Score = 58.2 bits (139), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 48/82 (58%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A   TM+ T +LL VL GP +++ +L+ HG +G  +G+LWR    W  SIF  T
Sbjct: 52  LRWDRRAAIMTMIVTFLLLTVLMGPTRSIQFLIPHGFIGLFLGALWRRRSPWLASIFTGT 111

Query: 61  IARSAGAMGYILTSSFLIRENI 82
           +  + G++  +   S L+ EN+
Sbjct: 112 VLGTLGSIFQLYLVSLLLGENV 133


>gi|427725977|ref|YP_007073254.1| hypothetical protein Lepto7376_4311 [Leptolyngbya sp. PCC 7376]
 gi|427357697|gb|AFY40420.1| Protein of unknown function DUF2232, membrane [Leptolyngbya sp. PCC
           7376]
          Length = 248

 Score = 56.6 bits (135), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 61/115 (53%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL +L GP +++ Y++ +G++G  +G +WR  V W  SI L T
Sbjct: 88  LRWGSRAAWMATLVSGLLLSILMGPTRSIIYVIPYGIMGVQLGWMWRKQVGWEWSIALGT 147

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +  + G    +   SFL+ E++   +   I     +IF   G+ + P++  I ++
Sbjct: 148 VLGTIGFFFRVWLLSFLLGEDLWVYVISQISELADWIFLHLGLLVQPNLMFIQLL 202


>gi|218439630|ref|YP_002377959.1| hypothetical protein PCC7424_2679 [Cyanothece sp. PCC 7424]
 gi|218172358|gb|ACK71091.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
          Length = 269

 Score = 55.5 bits (132), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 56/109 (51%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A   T + + +LL VL GP +++ +L+ +GV+G  +G+ W+ G  W  S+ L T
Sbjct: 110 LRWGKRASWMTAIVSGLLLSVLMGPTRSIVFLIPYGVMGVQLGACWKRGAPWQWSMLLGT 169

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           I    G    I   S L+ E++         A L + F   G+ I P++
Sbjct: 170 IIGVFGLFFRIWLFSILLGEDLWLYAITQATAFLDWAFMKLGLLIEPTV 218


>gi|428771033|ref|YP_007162823.1| hypothetical protein Cyan10605_2706 [Cyanobacterium aponinum PCC
           10605]
 gi|428685312|gb|AFZ54779.1| Protein of unknown function DUF2232, membrane [Cyanobacterium
           aponinum PCC 10605]
          Length = 248

 Score = 55.1 bits (131), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 57/109 (52%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  A   T +   +LL +L GP +++ Y++ +G++G  +G+LWR GV+W +SIF+ +
Sbjct: 89  LRRGKRASIITTIVCGLLLAILMGPPRSIVYIVPYGLMGIQLGALWRRGVNWYVSIFVAS 148

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +    G        S L+ E++   +   I     +IF   G+   P M
Sbjct: 149 LIGCFGFFFRFWLFSILLGEDLWVYVISQITNLADWIFLKLGILARPDM 197


>gi|428222487|ref|YP_007106657.1| hypothetical protein Syn7502_02551 [Synechococcus sp. PCC 7502]
 gi|427995827|gb|AFY74522.1| putative membrane protein (DUF2232) [Synechococcus sp. PCC 7502]
          Length = 207

 Score = 54.7 bits (130), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 57/112 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      TM  T +LL +L GP +++ +L+ HG++   +G LW+    W +++   T
Sbjct: 50  LRWGQRCAWMTMTVTVLLLTILMGPTRSIQFLIPHGLMAVMLGFLWKRRSPWVVTLTFGT 109

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           +A +AG        S L+ EN+ A  TI +   + + F   G    P + +I
Sbjct: 110 LAGAAGVGFQFWLLSALLGENLWAYSTIQVTELIRWGFQLVGSLEQPELWLI 161


>gi|86605519|ref|YP_474282.1| membrane protein [Synechococcus sp. JA-3-3Ab]
 gi|86554061|gb|ABC99019.1| putative membrane proteinprotein [Synechococcus sp. JA-3-3Ab]
          Length = 227

 Score = 54.7 bits (130), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 50/90 (55%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRW    G   +  +A+LL VL GP +++ Y++ +G+LG+    LWR  + W +S+    
Sbjct: 66  MRWDPRTGAMALAVSALLLTVLMGPTRSILYVIPYGLLGYWCARLWRRRLSWYVSVASGA 125

Query: 61  IARSAGAMGYILTSSFLIRENILALITINI 90
              + G +  +L SS L+ EN+   +TI +
Sbjct: 126 ALSAFGLVFQLLLSSLLVGENLWTYVTIQL 155


>gi|86609748|ref|YP_478510.1| hypothetical protein CYB_2307 [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86558290|gb|ABD03247.1| putative membrane protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 227

 Score = 53.5 bits (127), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 51/90 (56%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MRW    G   +V + +LL VL GP +++ Y++ +G+LG+    LW+  + W +S+    
Sbjct: 66  MRWDPRTGAMALVVSGLLLTVLIGPTRSILYVIPYGLLGYWCARLWQQRLSWYVSVVSGA 125

Query: 61  IARSAGAMGYILTSSFLIRENILALITINI 90
           +  + G +  ++ SS L+ EN+   +TI +
Sbjct: 126 VLSALGLIFQLVLSSLLVGENLWIYVTIQL 155


>gi|428300982|ref|YP_007139288.1| hypothetical protein Cal6303_4411 [Calothrix sp. PCC 6303]
 gi|428237526|gb|AFZ03316.1| Protein of unknown function DUF2232, membrane [Calothrix sp. PCC
           6303]
          Length = 235

 Score = 53.1 bits (126), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 61/115 (53%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWGV A   + V + +LL VL GP+++L +++  G +G ++GS W   V W +S+ L T
Sbjct: 76  LRWGVRAAWMSAVVSGLLLAVLMGPIRSLLFVMPFGFMGVTLGSTWYRRVPWLVSMSLGT 135

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           I  + G    +   S L  E++   +   +   + ++F   GV   PS+  I V+
Sbjct: 136 ILGTVGVFFRVWLLSILAGEDLWIYVINQMTDLIEWVFLKLGVLANPSLLWIQVV 190


>gi|307153482|ref|YP_003888866.1| hypothetical protein Cyan7822_3654 [Cyanothece sp. PCC 7822]
 gi|306983710|gb|ADN15591.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. PCC
           7822]
          Length = 265

 Score = 52.8 bits (125), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 59/126 (46%), Gaps = 6/126 (4%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL VL GP +++ + + +GV+G  +G+ WR G  W LS+ L T
Sbjct: 106 LRWGQRASWMAAIVSGLLLSVLMGPTRSIVFFIPYGVMGVQLGACWRRGASWYLSMLLGT 165

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS------MNVIYV 114
           I    G    I   S L+ E++       +   L + F   G+   P       + VI +
Sbjct: 166 IIGVFGLFFRIWLFSILLGEDLWLYAITQVTGFLDWAFMKLGLLAQPDILLVQLLAVIAI 225

Query: 115 IFGTLV 120
           IF   +
Sbjct: 226 IFNNFL 231


>gi|359473028|ref|XP_003631234.1| PREDICTED: uncharacterized protein LOC100256033 isoform 2 [Vitis
           vinifera]
          Length = 140

 Score = 52.0 bits (123), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 32/35 (91%)

Query: 86  ITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           IT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 68  ITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 102


>gi|428771964|ref|YP_007163752.1| hypothetical protein Cyast_0120 [Cyanobacterium stanieri PCC 7202]
 gi|428686243|gb|AFZ46103.1| Protein of unknown function DUF2232, membrane [Cyanobacterium
           stanieri PCC 7202]
          Length = 247

 Score = 51.2 bits (121), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 55/108 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           MR G  A   T + + +LL +L GP +++ Y++ +G++G  +G +WR G +W LS+   +
Sbjct: 89  MRRGRRASIMTTIVSGLLLSILMGPPRSIIYIIPYGLMGIQLGGMWRNGANWHLSLLTGS 148

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           I  S G        S L+ E++   +   I     +IF   G+   PS
Sbjct: 149 IIGSFGFFFRFWLFSILLGEDLWIYVINQITGLADWIFLQLGILSQPS 196


>gi|172036337|ref|YP_001802838.1| hypothetical protein cce_1422 [Cyanothece sp. ATCC 51142]
 gi|354553126|ref|ZP_08972433.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. ATCC
           51472]
 gi|171697791|gb|ACB50772.1| conserved hypothetical protein [Cyanothece sp. ATCC 51142]
 gi|353554956|gb|EHC24345.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. ATCC
           51472]
          Length = 240

 Score = 50.8 bits (120), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/108 (27%), Positives = 53/108 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  +     + + +LL VL GP +++ +L+ +GV+G  +G+ WR GV W  S+    
Sbjct: 81  LRWGSRSAWMGALVSGLLLTVLMGPTRSIVFLIPYGVMGVQLGAFWRRGVPWEFSMITGA 140

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           +  + G        S L+ E++   I   +     +IF   G+   PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQMTGLAEWIFLRLGILAQPS 188


>gi|218245754|ref|YP_002371125.1| hypothetical protein PCC8801_0891 [Cyanothece sp. PCC 8801]
 gi|257058799|ref|YP_003136687.1| hypothetical protein Cyan8802_0917 [Cyanothece sp. PCC 8802]
 gi|218166232|gb|ACK64969.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
 gi|256588965|gb|ACU99851.1| putative membrane protein [Cyanothece sp. PCC 8802]
          Length = 260

 Score = 50.4 bits (119), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/115 (23%), Positives = 57/115 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL VL GP +++ +L+ +G++G  +G  W+ G +WG S+    
Sbjct: 101 LRWGHRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGIQLGYCWQRGANWGFSLVTGA 160

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +  + G        S L+ E++   +   +     ++F   G+   P+  +I ++
Sbjct: 161 LIGTFGFFFRFWLFSILLGEDLWQYVITQVTELAEWVFLKLGILSQPTFELIQLL 215


>gi|22298945|ref|NP_682192.1| hypothetical protein tlr1402 [Thermosynechococcus elongatus BP-1]
 gi|22295126|dbj|BAC08954.1| tlr1402 [Thermosynechococcus elongatus BP-1]
          Length = 236

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 55/115 (47%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A     + +A+LL VL GP ++L +LL HG++G   G  W     WG SI    
Sbjct: 76  LRWNRRAAWMAAIVSALLLSVLMGPPRSLQFLLPHGIMGVIFGGCWAKKESWGRSILSGA 135

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +  +AG    I   S L+ EN+       +   + ++     + + P++ +I V+
Sbjct: 136 VIGTAGFFFQISLVSILLGENLWIYFNQQVTNFIDWVLVNLNLLLEPTVTLIQVV 190


>gi|428207947|ref|YP_007092300.1| hypothetical protein Chro_2964 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428009868|gb|AFY88431.1| Protein of unknown function DUF2232, membrane [Chroococcidiopsis
           thermalis PCC 7203]
          Length = 232

 Score = 50.1 bits (118), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 57/115 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   AG      + +LL VL GP+ +L Y++ +GV+G  +G+ WR    W +S+ L T
Sbjct: 73  LRWSARAGWMAAGVSGLLLAVLMGPIPSLLYVMPYGVMGVLLGAAWRRRSPWIVSVALGT 132

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +  + G        S     ++   + + +     +IF   G+ + PSM VI V+
Sbjct: 133 LLGAIGFFFRFWLMSIWSGRDLWVYLIVEVTELAEWIFDKLGLLLQPSMLVIQVL 187


>gi|428778778|ref|YP_007170564.1| hypothetical protein Dacsa_0419 [Dactylococcopsis salina PCC 8305]
 gi|428693057|gb|AFZ49207.1| putative membrane protein (DUF2232) [Dactylococcopsis salina PCC
           8305]
          Length = 258

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/118 (27%), Positives = 60/118 (50%), Gaps = 6/118 (5%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  A   +++   +LL +L GP +++ Y++ +GV+G  +G+ WR G  WG S+   T
Sbjct: 99  LRRGRRASGMSVITVGLLLSILMGPTRSILYVIPYGVMGLQLGAFWRRGFGWGQSV---T 155

Query: 61  IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           I    G +G+       S  + E++   +T  +   L + F   G+   PS+ ++  I
Sbjct: 156 IGAIIGTLGFFFRFWLLSIFLGEDLWVYVTSQMTQFLEWGFLQLGLLAQPSLWLVQAI 213


>gi|126654855|ref|ZP_01726389.1| hypothetical protein CY0110_10477 [Cyanothece sp. CCY0110]
 gi|126623590|gb|EAZ94294.1| hypothetical protein CY0110_10477 [Cyanothece sp. CCY0110]
          Length = 240

 Score = 49.7 bits (117), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 53/108 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  +     + + +LL VL GP +++ +L+ +GV+G  +G+ WR G+ W  S+    
Sbjct: 81  LRWGNRSAWMGALVSGLLLTVLMGPTRSIVFLIPYGVMGVQLGAFWRRGLPWEFSMITGA 140

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           +  + G        S L+ E++   I   +     +IF   G+   PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQMTGLAEWIFLKLGILTQPS 188


>gi|145346093|ref|XP_001417529.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577756|gb|ABO95822.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 180

 Score = 48.5 bits (114), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/108 (33%), Positives = 62/108 (57%)

Query: 2   RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
           R G AA  +T+V  ++LLL++SGP++AL Y   HGVL + + SLW  G  W +++    +
Sbjct: 43  RSGRAAAMRTLVVASLLLLMISGPLRALNYFFLHGVLAYVLSSLWSSGSSWWVTVPASAL 102

Query: 62  ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
            R+ G +  +  SS L REN++ L+   +++ L    +  G    P++
Sbjct: 103 TRTFGILASLSISSLLYRENVMKLLVTQMYSLLDQFAANVGATFAPTI 150


>gi|284929773|ref|YP_003422295.1| hypothetical protein UCYN_12480 [cyanobacterium UCYN-A]
 gi|284810217|gb|ADB95914.1| Predicted membrane protein (DUF2232) [cyanobacterium UCYN-A]
          Length = 236

 Score = 47.8 bits (112), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 53/109 (48%), Gaps = 2/109 (1%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  +     + + +LL VL GP++++ ++  +G+L   +G+ WR  + W  SIF+  
Sbjct: 78  LRWGSRSAWMGWLVSGLLLTVLMGPIQSILFITNYGLLSIQLGAFWRKNISWEFSIFIGA 137

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASL-TFIFSAAGVNIVPS 108
           I             S L  E++    +IN   SL  ++F   G  I PS
Sbjct: 138 IISIFSFFFKFWLFSILTGEDLWQY-SINQMTSLAEWLFLKFGTLIQPS 185


>gi|443323263|ref|ZP_21052271.1| putative membrane protein (DUF2232) [Gloeocapsa sp. PCC 73106]
 gi|442787001|gb|ELR96726.1| putative membrane protein (DUF2232) [Gloeocapsa sp. PCC 73106]
          Length = 244

 Score = 47.4 bits (111), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 56/115 (48%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  A         +LL VL GPV+++ +L+ +G++G  +G  WR    W  S+ +  
Sbjct: 85  LRRGDRAAIMASTVATLLLSVLMGPVRSVIFLIPYGLMGIQLGVFWRRSASWNFSLCVGG 144

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           I  + G    I   S  + E++ + + + I     +IF   G+   PS+ +I ++
Sbjct: 145 IIGTFGVFFRIFLFSIFLGEDLWSYVMVQITELANWIFLKLGILAEPSLGMIELL 199


>gi|334117938|ref|ZP_08492028.1| Protein of unknown function DUF2232, membrane [Microcoleus
           vaginatus FGP-2]
 gi|333459923|gb|EGK88533.1| Protein of unknown function DUF2232, membrane [Microcoleus
           vaginatus FGP-2]
          Length = 251

 Score = 47.0 bits (110), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 58/112 (51%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL VL GP +++ Y++  G++G ++G LWR    W LSI +  
Sbjct: 91  LRWGNRAAWMAALVSGLLLSVLMGPTRSILYVMPFGIMGVALGGLWRRKASWSLSISVGA 150

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           +  + G        S L+ +++   +T  +   + ++F   G   VPS+N I
Sbjct: 151 LIGTVGFFFRFWLLSALVGQDLWVYVTAQVTEMVEWVFVKLGWLNVPSLNTI 202


>gi|376001833|ref|ZP_09779687.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
 gi|375329744|emb|CCE15440.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
           8005]
          Length = 226

 Score = 47.0 bits (110), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 6/118 (5%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A     V + +LL VL GP +++ Y++ +G +   +G LW+ G  W +++    
Sbjct: 66  LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122

Query: 61  IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           IA   GA+G+       S L+  ++    T  I   L  IF   G+   PS+  I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLSRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIV 180


>gi|359457479|ref|ZP_09246042.1| hypothetical protein ACCM5_02044 [Acaryochloris sp. CCMEE 5410]
          Length = 215

 Score = 47.0 bits (110), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 56/112 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  +   +   T +L+ VL GP +++ +L+ +GV+G  +G LW+  V W +S+    
Sbjct: 54  LRWGNRSAWMSAWVTTLLITVLMGPPRSIQFLMPYGVVGVMLGGLWKRQVSWAISMGWSI 113

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           +  + G    +   S L+  N+   I   I   L ++    G+ + P + VI
Sbjct: 114 LIMAIGFFFQLNLLSLLVGTNLWIYINRQITGFLDWVVIKLGLLLQPDVIVI 165


>gi|428317916|ref|YP_007115798.1| Protein of unknown function DUF2232, membrane [Oscillatoria
           nigro-viridis PCC 7112]
 gi|428241596|gb|AFZ07382.1| Protein of unknown function DUF2232, membrane [Oscillatoria
           nigro-viridis PCC 7112]
          Length = 251

 Score = 46.6 bits (109), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 57/112 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL VL GP +++ Y++  G++G ++G LWR    W LSI +  
Sbjct: 91  LRWGNRAAWMAALVSGLLLSVLMGPTRSILYVMPFGIMGVALGGLWRRKASWSLSISVGA 150

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
              + G        S L+ +++   +T  +   + ++F   G   VPS+N I
Sbjct: 151 FIGTVGFFFRFWLLSALVGQDLWVYVTAQVTEMVEWVFVKLGWLNVPSLNTI 202


>gi|282895674|ref|ZP_06303799.1| conserved hypothetical protein [Raphidiopsis brookii D9]
 gi|281199368|gb|EFA74233.1| conserved hypothetical protein [Raphidiopsis brookii D9]
          Length = 178

 Score = 46.6 bits (109), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 54/108 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL VL+GPV++L +++    LG  +G+ W   V W +SI L T
Sbjct: 18  LRWGNRASWMAAITSGLLLSVLTGPVRSLLFIMPFAFLGVLLGASWHRRVPWFVSITLGT 77

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           I  + G    +   S L  E++   +T  +   L ++F    +   PS
Sbjct: 78  ILCTLGVFFRLWLLSILSGEDLWIYLTNQVTEILQWLFLNLRILTPPS 125


>gi|158338243|ref|YP_001519420.1| hypothetical protein AM1_5139 [Acaryochloris marina MBIC11017]
 gi|158308484|gb|ABW30101.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 215

 Score = 46.6 bits (109), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 55/111 (49%)

Query: 2   RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
           RWG  +   +   T +L+ VL GP +++ +L+ +GV+G  +G LW+  V W +S+    +
Sbjct: 55  RWGNRSAWMSAWVTTLLITVLMGPPRSIQFLMPYGVVGVILGGLWKRQVSWAISMGWSIL 114

Query: 62  ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
             + G    +   S L+  N+   I   I   L ++    G+ + P + VI
Sbjct: 115 VMAIGFFFQLNLLSLLVGTNLWIYINRQITGFLDWVVIKLGLLLQPDVIVI 165


>gi|190192140|dbj|BAG48275.1| hypothetical protein MAE_15185 [Microcystis aeruginosa NIES-843]
          Length = 251

 Score = 46.6 bits (109), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCASMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGFFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200


>gi|16330851|ref|NP_441579.1| hypothetical protein sll1372 [Synechocystis sp. PCC 6803]
 gi|383322593|ref|YP_005383446.1| hypothetical protein SYNGTI_1684 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325762|ref|YP_005386615.1| hypothetical protein SYNPCCP_1683 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491646|ref|YP_005409322.1| hypothetical protein SYNPCCN_1683 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384436913|ref|YP_005651637.1| hypothetical protein SYNGTS_1684 [Synechocystis sp. PCC 6803]
 gi|451815009|ref|YP_007451461.1| hypothetical protein MYO_117020 [Synechocystis sp. PCC 6803]
 gi|1653344|dbj|BAA18259.1| sll1372 [Synechocystis sp. PCC 6803]
 gi|339273945|dbj|BAK50432.1| hypothetical protein SYNGTS_1684 [Synechocystis sp. PCC 6803]
 gi|359271912|dbj|BAL29431.1| hypothetical protein SYNGTI_1684 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275082|dbj|BAL32600.1| hypothetical protein SYNPCCN_1683 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278252|dbj|BAL35769.1| hypothetical protein SYNPCCP_1683 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|451780978|gb|AGF51947.1| hypothetical protein MYO_117020 [Synechocystis sp. PCC 6803]
          Length = 275

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 33/57 (57%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIF 57
           +RW   A  K M+A  +LL VL GP +++ +++ +G++   +G  W     WG SIF
Sbjct: 115 LRWRSPAAIKGMIAAGLLLSVLMGPTRSVVFVIPYGLMSLQLGFCWVRNASWGFSIF 171


>gi|425467150|ref|ZP_18846434.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9809]
 gi|389830158|emb|CCI28064.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9809]
          Length = 251

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCASMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200


>gi|423062304|ref|ZP_17051094.1| hypothetical protein SPLC1_S031930 [Arthrospira platensis C1]
 gi|406716212|gb|EKD11363.1| hypothetical protein SPLC1_S031930 [Arthrospira platensis C1]
          Length = 226

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 6/118 (5%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A     V + +LL VL GP +++ Y++ +G +   +G LW+ G  W +++    
Sbjct: 66  LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122

Query: 61  IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           IA   GA+G+       S L+  ++    T  I   L  IF   G+   PS+  I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIV 180


>gi|209522908|ref|ZP_03271465.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|209496495|gb|EDZ96793.1| conserved hypothetical protein [Arthrospira maxima CS-328]
          Length = 226

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 6/118 (5%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A     V + +LL VL GP +++ Y++ +G +   +G LW+ G  W +++    
Sbjct: 66  LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122

Query: 61  IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           IA   GA+G+       S L+  ++    T  I   L  IF   G+   PS+  I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIV 180


>gi|434389139|ref|YP_007099750.1| putative membrane protein (DUF2232) [Chamaesiphon minutus PCC 6605]
 gi|428020129|gb|AFY96223.1| putative membrane protein (DUF2232) [Chamaesiphon minutus PCC 6605]
          Length = 199

 Score = 46.2 bits (108), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 60/115 (52%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  AG  + + + +LL VL GPV+++ +++ +GV+G  +G+ WR    W  SI L  
Sbjct: 41  LRWGNRAGWMSALVSGLLLSVLMGPVRSILFVMPYGVMGVQLGACWRRQASWLTSISLGA 100

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +    G       +S L+ E++   +   I     ++F   G+   PS+ VI ++
Sbjct: 101 LVDCLGVFFRFGLTSALLGEDLWMYLMARIRDLSEWLFVQLGILAEPSLLVIQIL 155


>gi|443325608|ref|ZP_21054294.1| putative membrane protein (DUF2232) [Xenococcus sp. PCC 7305]
 gi|442794767|gb|ELS04168.1| putative membrane protein (DUF2232) [Xenococcus sp. PCC 7305]
          Length = 251

 Score = 46.2 bits (108), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 58/115 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  AG  +   + +LL VL GPV+++ +++ +G++G  +G+ W   V W LS+ L  
Sbjct: 93  LRRGNRAGWMSAWVSMLLLSVLMGPVRSIVFIIPYGLIGIQLGACWVRRVHWALSVLLGA 152

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +  S G        S L+ E++   +   I     ++F   G+   P + +I ++
Sbjct: 153 LIGSLGLFFRFWLFSILLGEDLWVYVMTQITQLADWLFLKIGILAQPDIILIQIL 207


>gi|407961790|dbj|BAM55030.1| hypothetical protein BEST7613_6099 [Synechocystis sp. PCC 6803]
          Length = 245

 Score = 45.8 bits (107), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 33/57 (57%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIF 57
           +RW   A  K M+A  +LL VL GP +++ +++ +G++   +G  W     WG SIF
Sbjct: 85  LRWRSPAAIKGMIAAGLLLSVLMGPTRSVVFVIPYGLMSLQLGFCWVRNASWGFSIF 141


>gi|443648943|ref|ZP_21130125.1| hypothetical protein C789_665 [Microcystis aeruginosa DIANCHI905]
 gi|159026388|emb|CAO90486.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443335056|gb|ELS49539.1| hypothetical protein C789_665 [Microcystis aeruginosa DIANCHI905]
          Length = 251

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGIQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200


>gi|428775069|ref|YP_007166856.1| hypothetical protein PCC7418_0408 [Halothece sp. PCC 7418]
 gi|428689348|gb|AFZ42642.1| Protein of unknown function DUF2232, membrane [Halothece sp. PCC
           7418]
          Length = 256

 Score = 45.4 bits (106), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/115 (26%), Positives = 57/115 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G+ A   + +   +LL +L GP +++ Y++ +G++G  +G+ W     WGLS+ +  
Sbjct: 97  LRRGMRASWMSALIVGLLLSILMGPTRSILYVIPYGLMGVQLGAFWCRNTHWGLSVTVGA 156

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           I  + G        S  + E++   +T  I   L + F   G+   PS+ +I  I
Sbjct: 157 IIGTFGFFFRFWLLSIFLGEDLWVYVTSQIAQFLEWGFVQLGLLTQPSLWLIQAI 211


>gi|425436381|ref|ZP_18816817.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9432]
 gi|425449736|ref|ZP_18829571.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           7941]
 gi|440752045|ref|ZP_20931248.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
 gi|389678911|emb|CCH92273.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9432]
 gi|389769737|emb|CCI05505.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           7941]
 gi|440176538|gb|ELP55811.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
          Length = 251

 Score = 45.4 bits (106), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200


>gi|425461780|ref|ZP_18841254.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9808]
 gi|389825302|emb|CCI25026.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9808]
          Length = 251

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200


>gi|282900315|ref|ZP_06308266.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281194820|gb|EFA69766.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 226

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 53/108 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL VL+GPV++L +++    LG  +G  W   V W +SI L T
Sbjct: 66  LRWGNRASWMAAITSGLLLSVLTGPVRSLLFIMPFAFLGVLLGVSWHRRVPWVVSISLGT 125

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           I  + G    +   S L  E++   +T  +   + ++F    +   PS
Sbjct: 126 ILCTLGVFFRLWLLSILSGEDLWVYLTNQVTEIIEWLFLNLQILTTPS 173


>gi|409990906|ref|ZP_11274222.1| hypothetical protein APPUASWS_07925 [Arthrospira platensis str.
           Paraca]
 gi|291567238|dbj|BAI89510.1| hypothetical protein [Arthrospira platensis NIES-39]
 gi|409938233|gb|EKN79581.1| hypothetical protein APPUASWS_07925 [Arthrospira platensis str.
           Paraca]
          Length = 226

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 6/117 (5%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A     V + +LL VL GP +++ Y++ +G +G  +G LW+ G  W +++    
Sbjct: 66  LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMGVVLGGLWKRGAMWPVTV---A 122

Query: 61  IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           I    GA+G+       S L+  ++    T  I   L  IF   GV   PS+  I +
Sbjct: 123 IGSLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGVLAPPSLLFIQI 179


>gi|428204395|ref|YP_007082984.1| hypothetical protein Ple7327_4314 [Pleurocapsa sp. PCC 7327]
 gi|427981827|gb|AFY79427.1| putative membrane protein [Pleurocapsa sp. PCC 7327]
          Length = 261

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 60/115 (52%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A   T + + +LL VL GP +++ YL+ +G++G  +G+ WR G  W  SIF   
Sbjct: 102 LRWSNRASWMTTLVSVLLLTVLMGPPRSIIYLIPYGLMGVQLGACWRRGAHWWFSIFTGA 161

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +  ++G        S L+ E++   +   +   + ++F   G+   PS+ +I +I
Sbjct: 162 LIGTSGFFFRFWLFSILLGEDLWLYVITQVTEIVDWVFLRLGLLAQPSVLLIQII 216


>gi|425444816|ref|ZP_18824857.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9443]
 gi|425454338|ref|ZP_18834084.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9807]
 gi|389735353|emb|CCI01127.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9443]
 gi|389805030|emb|CCI15498.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9807]
          Length = 251

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200


>gi|427731990|ref|YP_007078227.1| hypothetical protein Nos7524_4904 [Nostoc sp. PCC 7524]
 gi|427367909|gb|AFY50630.1| putative membrane protein (DUF2232) [Nostoc sp. PCC 7524]
          Length = 243

 Score = 45.1 bits (105), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 56/114 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GPV++L +++    +G  +G+ W   V W +SI L T
Sbjct: 85  LRWGKRASWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWYRRVPWLVSITLGT 144

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           +  + G    +   S L  E++   +   +     +IF   G+   PS+ VI +
Sbjct: 145 VLGTLGVFFRLWLLSILSGEDLWIYVINQVTELTEWIFLRLGLLATPSVFVIQI 198


>gi|422303788|ref|ZP_16391139.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9806]
 gi|389791182|emb|CCI12970.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9806]
          Length = 251

 Score = 45.1 bits (105), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200


>gi|425441756|ref|ZP_18822023.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9717]
 gi|389717455|emb|CCH98462.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9717]
          Length = 251

 Score = 44.7 bits (104), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200


>gi|390440736|ref|ZP_10228944.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
 gi|389835965|emb|CCI33070.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
          Length = 251

 Score = 44.7 bits (104), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200


>gi|425471084|ref|ZP_18849944.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9701]
 gi|389883066|emb|CCI36507.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
           9701]
          Length = 251

 Score = 44.3 bits (103), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 30/109 (27%), Positives = 54/109 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG      T + +++LL VL GP +++ +L+ HG++G  +G  WR G  W  SI    
Sbjct: 92  LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
           +  + G        S L+ EN+   +       + +IF+  G+   P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200


>gi|254423145|ref|ZP_05036863.1| hypothetical protein S7335_3299 [Synechococcus sp. PCC 7335]
 gi|196190634|gb|EDX85598.1| hypothetical protein S7335_3299 [Synechococcus sp. PCC 7335]
          Length = 278

 Score = 44.3 bits (103), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 33/56 (58%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
           +RW   A       + +LL +L GP +++ +++ +G+LG  +G LWR    WG+SI
Sbjct: 112 LRWNARAAWMGATVSGLLLTILLGPTRSVLFVMPYGILGVQLGYLWRRKAGWGISI 167


>gi|56751839|ref|YP_172540.1| hypothetical protein syc1830_d [Synechococcus elongatus PCC 6301]
 gi|81301078|ref|YP_401286.1| hypothetical protein Synpcc7942_2269 [Synechococcus elongatus PCC
           7942]
 gi|56686798|dbj|BAD80020.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81169959|gb|ABB58299.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 245

 Score = 43.9 bits (102), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 46/96 (47%)

Query: 2   RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
           RWG+ +     +  ++LL VL GP + + +LL  G LG  +G  W+  + WG SI L   
Sbjct: 85  RWGLRSAWLGAIVASLLLFVLLGPTRGILFLLPPGCLGVLLGCCWQRRLSWGWSILLGGG 144

Query: 62  ARSAGAMGYILTSSFLIRENILALITINIHASLTFI 97
             + G    +   S L+ E+I   +T      L ++
Sbjct: 145 LETIGFFFRLWVLSVLLGEDIWTYVTAQATGFLEWL 180


>gi|308804379|ref|XP_003079502.1| unnamed protein product [Ostreococcus tauri]
 gi|116057957|emb|CAL54160.1| unnamed protein product [Ostreococcus tauri]
          Length = 230

 Score = 43.9 bits (102), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 29/113 (25%), Positives = 51/113 (45%), Gaps = 1/113 (0%)

Query: 2   RWGVAAGRKTMVATAMLLLVLSGPVKALA-YLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           RWG AA R+T+V   +     +     LA   L    L  +  +LW  G  W +++    
Sbjct: 69  RWGSAAARRTLVVATLARPKTTMSSNELAPRRLAPCTLAHADRALWSSGASWWVTVPASA 128

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
           + R+ G +  +  SS L REN++ L+   ++A L    +  G    P +  ++
Sbjct: 129 LTRTFGIVCSLSVSSLLYRENVMKLLVTQMYALLDQFAANVGATFAPDITWVW 181


>gi|428311088|ref|YP_007122065.1| hypothetical protein Mic7113_2881 [Microcoleus sp. PCC 7113]
 gi|428252700|gb|AFZ18659.1| putative membrane protein (DUF2232) [Microcoleus sp. PCC 7113]
          Length = 300

 Score = 43.9 bits (102), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 35/56 (62%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
           +RWG  A     + + +LL VL GP +++ +++ +G++G  +G++WR   +W  SI
Sbjct: 140 LRWGNRASWMAAIVSGLLLSVLMGPTRSILFVIPYGLMGVQLGAMWRRQSNWLFSI 195


>gi|427705814|ref|YP_007048191.1| hypothetical protein Nos7107_0357 [Nostoc sp. PCC 7107]
 gi|427358319|gb|AFY41041.1| Protein of unknown function DUF2232, membrane [Nostoc sp. PCC 7107]
          Length = 254

 Score = 43.9 bits (102), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 32/114 (28%), Positives = 56/114 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GPV++L +++ +  +G  +G+ W   V W +SI L  
Sbjct: 94  LRWGQRAAWMAAVTSGLLLSVLMGPVRSLLFVMPYAFMGVLLGATWHRRVPWIVSITLAA 153

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           +  + G    +   S L  E++   +   +     +IF   G+   PS+ +I V
Sbjct: 154 LLGTLGFFFRMWLLSILSGEDLWIYLINQVTEITEWIFLKLGLLNTPSVFLIQV 207


>gi|443316964|ref|ZP_21046389.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 6406]
 gi|442783427|gb|ELR93342.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 6406]
          Length = 273

 Score = 43.5 bits (101), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 1/89 (1%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A   + +   +LL V+ GP ++L +L+ +G+LG  +G LWR    W  +I L  
Sbjct: 107 LRWGSRAAWMSALVATLLLAVMMGPPRSLLFLMPYGLLGVQLGWLWRRRASWYTTIGLGA 166

Query: 61  IARSAGAMGYILTSSFLIRENILALITIN 89
           +  + G    +   S ++ E+ L + TIN
Sbjct: 167 LLSALGVFVRLWLLSIMLGED-LWIYTIN 194


>gi|67922653|ref|ZP_00516158.1| similar to membrane protein [Crocosphaera watsonii WH 8501]
 gi|67855501|gb|EAM50755.1| similar to membrane protein [Crocosphaera watsonii WH 8501]
          Length = 240

 Score = 43.1 bits (100), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 51/108 (47%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  A     + + +LL VL GP +++ +L+ +G++G  +G+ WR  + W  S+    
Sbjct: 81  LRRGKRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGVQLGAFWRRNLPWEFSMITGA 140

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           +  + G        S L+ E++   I   I     +IF   G+   PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQITGLAEWIFLRLGILAQPS 188


>gi|416392028|ref|ZP_11685833.1| hypothetical protein CWATWH0003_2646 [Crocosphaera watsonii WH
           0003]
 gi|357263688|gb|EHJ12658.1| hypothetical protein CWATWH0003_2646 [Crocosphaera watsonii WH
           0003]
          Length = 240

 Score = 43.1 bits (100), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 51/108 (47%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  A     + + +LL VL GP +++ +L+ +G++G  +G+ WR  + W  S+    
Sbjct: 81  LRRGKRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGVQLGAFWRRNLPWEFSMITGA 140

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
           +  + G        S L+ E++   I   I     +IF   G+   PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQITGLAEWIFLRLGILAQPS 188


>gi|170078396|ref|YP_001735034.1| hypothetical protein SYNPCC7002_A1790 [Synechococcus sp. PCC 7002]
 gi|169886065|gb|ACA99778.1| conserved hypothetical membrane protein [Synechococcus sp. PCC
           7002]
          Length = 243

 Score = 42.7 bits (99), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 59/112 (52%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A   + + +++LL +L GP +++ Y + +G++G  +G +WR  V W  S+ L T
Sbjct: 83  LRWGSRAAWMSTLVSSLLLSILMGPTRSIIYTIPYGLIGVQLGWMWRRQVSWYWSVTLGT 142

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           I  + G    +   S L+ E++   +   I     ++F   G+   PS+ VI
Sbjct: 143 ILGTLGFFFRVWLLSLLLGEDLWVYVISQISEMADWLFLKLGLLTQPSLLVI 194


>gi|354567780|ref|ZP_08986948.1| Protein of unknown function DUF2232, membrane [Fischerella sp.
           JSC-11]
 gi|353542238|gb|EHC11702.1| Protein of unknown function DUF2232, membrane [Fischerella sp.
           JSC-11]
          Length = 223

 Score = 42.7 bits (99), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 55/112 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GP+++L +++    +G  +G+ W   V W +SI L  
Sbjct: 64  LRWGKRAAWMGAVTSGLLLSVLMGPIRSLLFVMPFAFMGVLLGAAWYRRVPWIVSISLGA 123

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           +  + G    +   S L  E++   +   +   + +IF   G+   PS ++I
Sbjct: 124 VLGTLGVFFRLWLLSLLSGEDLWVYVINQVTEIVEWIFLRLGILASPSTSLI 175


>gi|427717327|ref|YP_007065321.1| hypothetical protein Cal7507_2044 [Calothrix sp. PCC 7507]
 gi|427349763|gb|AFY32487.1| Protein of unknown function DUF2232, membrane [Calothrix sp. PCC
           7507]
          Length = 245

 Score = 42.4 bits (98), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 31/114 (27%), Positives = 54/114 (47%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     +   +LL VL GP+++L +++ +  +G  +G+ W   V W +SI L T
Sbjct: 85  LRWGKRAAWMAALTCGLLLAVLMGPIRSLLFVMPYAFMGVLLGATWNRRVPWIVSITLGT 144

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           +  + G    I   S L  E++   +   +     +IF    V   PS+  I +
Sbjct: 145 LLATLGIFFRIWLLSVLSGEDLWVYVINQVTEITEWIFLKLQVLANPSVFFIQI 198


>gi|72383606|ref|YP_292961.1| hypothetical protein PMN2A_1770 [Prochlorococcus marinus str.
           NATL2A]
 gi|72003456|gb|AAZ59258.1| uncharacterized conserved membrane protein [Prochlorococcus marinus
           str. NATL2A]
          Length = 228

 Score = 42.0 bits (97), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 6/90 (6%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R GV  G + +    MLL VL GP++    L  +G+L   +G  W  G +W LS    +
Sbjct: 67  IRRGVKTGIEGVTICVMLLTVLMGPLRGPLVLFPYGLLSLWLGYCWEKGWNWWLS---WS 123

Query: 61  IARSAGAMGY---ILTSSFLIRENILALIT 87
           +  S G MG+   +   S L+ EN+  ++T
Sbjct: 124 VGVSIGTMGFLVRVFVLSLLVGENLWVILT 153


>gi|440681213|ref|YP_007156008.1| Protein of unknown function DUF2232, membrane [Anabaena cylindrica
           PCC 7122]
 gi|428678332|gb|AFZ57098.1| Protein of unknown function DUF2232, membrane [Anabaena cylindrica
           PCC 7122]
          Length = 244

 Score = 41.6 bits (96), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 55/114 (48%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GPV+++ +++   ++G  +G+ W   V W +SI L  
Sbjct: 84  LRWGKRAAWMAAVTSGLLLAVLMGPVRSILFVMPFALMGVLLGATWHRRVPWIVSITLGM 143

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           +  + G    +   S L  E++   +   +   + +IF    +   PS+  I V
Sbjct: 144 LLGTLGVFFRLWLLSVLSGEDLWVYVINQVTEIIEWIFLKLEILATPSVFFIQV 197


>gi|428211971|ref|YP_007085115.1| hypothetical protein Oscil6304_1490 [Oscillatoria acuminata PCC
           6304]
 gi|428000352|gb|AFY81195.1| putative membrane protein (DUF2232) [Oscillatoria acuminata PCC
           6304]
          Length = 265

 Score = 41.2 bits (95), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 63/120 (52%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GP +++ +++  G+LG  +G LW+    W ++IFL +
Sbjct: 106 LRWGNRAAWMGAVVSGLLLSVLMGPTRSILFVVPFGLLGVLLGCLWKRRASWEVAIFLGS 165

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
           +  + G    +     L+ E++   +TI I     ++F   G+ + PS+ VI  +   LV
Sbjct: 166 LLGAFGFFFRLWLLGILVGEDLWVYLTIQITEIADWLFLRLGLLVQPSLYVIQALAAILV 225


>gi|113475248|ref|YP_721309.1| hypothetical protein Tery_1556 [Trichodesmium erythraeum IMS101]
 gi|110166296|gb|ABG50836.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 219

 Score = 41.2 bits (95), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 26/115 (22%), Positives = 58/115 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL +L GP +++ +++   ++G  +G+LWR G  W +SI   +
Sbjct: 60  LRWGSRASWMGAIVSGLLLSILMGPTRSILFVIPFALMGVMLGALWRRGKSWFVSITYGS 119

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +  S G        S L+ +++   +       + ++F   G+   P++ ++ +I
Sbjct: 120 LLGSIGFFFRFWLLSILLGQDLWIYLITQATELVEWVFVKLGLLNQPNLLLVQLI 174


>gi|414075864|ref|YP_006995182.1| hypothetical protein ANA_C10569 [Anabaena sp. 90]
 gi|413969280|gb|AFW93369.1| hypothetical protein ANA_C10569 [Anabaena sp. 90]
          Length = 254

 Score = 41.2 bits (95), Expect = 0.084,   Method: Compositional matrix adjust.
 Identities = 29/112 (25%), Positives = 56/112 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GP ++L +++  G++G  +G  W   V W +S+ L T
Sbjct: 95  LRWGKRAAWMAAVTSGLLLSVLMGPFRSLLFVMPFGLMGVLLGVTWYRRVPWIVSMTLGT 154

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           +  + G    +   S L  E++   +   +   + +IF    + + P++ +I
Sbjct: 155 LLGTMGVFFRLWFLSVLSGEDLWVYVINQVTEIIEWIFLKLELLMTPNLLLI 206


>gi|119490055|ref|ZP_01622679.1| hypothetical protein L8106_20785 [Lyngbya sp. PCC 8106]
 gi|119454207|gb|EAW35359.1| hypothetical protein L8106_20785 [Lyngbya sp. PCC 8106]
          Length = 232

 Score = 41.2 bits (95), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 32/111 (28%), Positives = 55/111 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG        + + +LL VL GP +++ +L+ +G +G +MG LW+ G +W +SI +  
Sbjct: 72  LRWGKREAWMGALTSLLLLTVLMGPTRSIIFLIPYGWMGVAMGGLWKRGANWFVSIGVGA 131

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNV 111
           I  S G        S L+  ++    T  I     +IF   G+   PS+ +
Sbjct: 132 IIGSIGFFFRYWLLSILLGRDLWVYATTQITELAEWIFIKLGILAQPSLTL 182


>gi|298492012|ref|YP_003722189.1| hypothetical protein Aazo_3420 ['Nostoc azollae' 0708]
 gi|298233930|gb|ADI65066.1| Protein of unknown function DUF2232, membrane ['Nostoc azollae'
           0708]
          Length = 242

 Score = 40.8 bits (94), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 31/112 (27%), Positives = 55/112 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GPV+++ +++  G++G  +G+ W   V W +SI L  
Sbjct: 82  LRWGKRAAWMGAVTSGLLLGVLMGPVRSMLFVMPFGLMGVLLGATWYRRVPWIVSITLGM 141

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           +  + G    +   S L  E++   +   +     +IF    +   PS+ VI
Sbjct: 142 LLGTLGVFFRLWLLSVLSGEDLWIYVINQVTEITEWIFLKLQILATPSVFVI 193


>gi|220905783|ref|YP_002481094.1| hypothetical protein Cyan7425_0341 [Cyanothece sp. PCC 7425]
 gi|219862394|gb|ACL42733.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
          Length = 244

 Score = 40.4 bits (93), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 57/115 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW +     T +  A+LL VL GP ++L +L+ HG+LG  +G  W+    W +SI    
Sbjct: 85  LRWNLRTATMTALVAALLLSVLMGPPRSLQFLVPHGLLGILLGHFWQRRASWWVSIAWGA 144

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
           +    G    I   S L+ EN+   +   +   L ++F++ G  + P + V+  +
Sbjct: 145 LLGGIGLFFQIGLLSLLLGENLWLYLNQQVTNLLDWLFTSLGWLVQPDLAVVQAV 199


>gi|428304516|ref|YP_007141341.1| hypothetical protein Cri9333_0916 [Crinalium epipsammum PCC 9333]
 gi|428246051|gb|AFZ11831.1| Protein of unknown function DUF2232, membrane [Crinalium epipsammum
           PCC 9333]
          Length = 260

 Score = 40.4 bits (93), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 37/56 (66%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
           +RWG  A   + + + +LL VL GP +++ + + +G++G  +G+LWR G +W LSI
Sbjct: 100 LRWGHRASWMSALVSGLLLSVLMGPTRSILFFIPYGLMGVQLGALWRRGSNWLLSI 155


>gi|411118678|ref|ZP_11391058.1| putative membrane protein (DUF2232) [Oscillatoriales cyanobacterium
           JSC-12]
 gi|410710541|gb|EKQ68048.1| putative membrane protein (DUF2232) [Oscillatoriales cyanobacterium
           JSC-12]
          Length = 275

 Score = 40.4 bits (93), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 55/112 (49%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     + + +LL VL GP ++L YL+  G +G  +G LWR   +W  SI + +
Sbjct: 115 LRWGYRAAWMGALVSGLLLSVLMGPPRSLLYLMPFGFMGVLLGMLWRRQANWAGSIAIAS 174

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
           I  S G    +   S L+ +++    T  +   + +     G+   PS+ +I
Sbjct: 175 ILGSFGFFFRVWLVSLLLGDDLWLYTTTQVTNFIDWFCVRFGILFQPSLTLI 226


>gi|124025204|ref|YP_001014320.1| hypothetical protein NATL1_04931 [Prochlorococcus marinus str.
           NATL1A]
 gi|123960272|gb|ABM75055.1| Predicted membrane protein [Prochlorococcus marinus str. NATL1A]
          Length = 228

 Score = 40.0 bits (92), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 6/90 (6%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R GV  G + +    MLL  L GP++    L  +G+L   +G  W  G +W LS    +
Sbjct: 67  IRRGVKTGIEGVTICVMLLTALMGPLRGPLVLFPYGLLSLWLGYCWEKGWNWWLS---WS 123

Query: 61  IARSAGAMGY---ILTSSFLIRENILALIT 87
           +  S G MG+   +   S L+ EN+  ++T
Sbjct: 124 VGVSIGTMGFLVRVFVLSLLVGENLWVILT 153


>gi|148241761|ref|YP_001226918.1| hypothetical protein SynRCC307_0662 [Synechococcus sp. RCC307]
 gi|147850071|emb|CAK27565.1| Uncharacterized conserved membrane protein [Synechococcus sp.
           RCC307]
          Length = 207

 Score = 40.0 bits (92), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 48/100 (48%)

Query: 15  TAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTS 74
           T +LL VL GPV+    L  +G+L   +G  WR    W  S+ +  +  +AG +  +   
Sbjct: 60  TVLLLTVLMGPVRGPLMLFPYGLLSLWLGWCWRRQCSWWFSMAVGVLIGAAGFLVRVALV 119

Query: 75  SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           S L+ EN+  +IT      L  +     + + P+++ + V
Sbjct: 120 SLLLGENLWVVITAAASGLLNGLADRLQLPVAPTLDQVQV 159


>gi|434405583|ref|YP_007148468.1| putative membrane protein (DUF2232) [Cylindrospermum stagnale PCC
           7417]
 gi|428259838|gb|AFZ25788.1| putative membrane protein (DUF2232) [Cylindrospermum stagnale PCC
           7417]
          Length = 260

 Score = 39.3 bits (90), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 33/114 (28%), Positives = 57/114 (50%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RWG  A     V + +LL VL GPV++L +++    +G  +G+ W   V W +SI L T
Sbjct: 100 LRWGRRAAWMAAVTSGLLLAVLMGPVRSLLFVMPFAFMGVLLGATWYRRVPWIVSITLGT 159

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           +  + G    +   S L  E++   +   +   + +IF   G+   PS+ +I V
Sbjct: 160 LLGTLGVFFRLWLLSVLSGEDLWIYVINQVTDIIEWIFLKLGLLTNPSVFLIQV 213


>gi|119512452|ref|ZP_01631534.1| hypothetical protein N9414_13495 [Nodularia spumigena CCY9414]
 gi|119462918|gb|EAW43873.1| hypothetical protein N9414_13495 [Nodularia spumigena CCY9414]
          Length = 241

 Score = 39.3 bits (90), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 32/56 (57%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
           +RWG  A     V + +LL VL GPV++L +++    +G  +G+ W   V W +SI
Sbjct: 81  LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWHRRVPWIVSI 136


>gi|254413468|ref|ZP_05027238.1| hypothetical protein MC7420_6047 [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196179575|gb|EDX74569.1| hypothetical protein MC7420_6047 [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 277

 Score = 38.9 bits (89), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 23/93 (24%), Positives = 48/93 (51%)

Query: 23  SGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENI 82
            GP +++ +L+ +G++G  +G+LW+   +W  SI + T+  + G        S L+ E++
Sbjct: 141 QGPTRSILFLIPYGLMGVQLGALWKRQANWLFSIGIGTLLGTLGFFFRFWLLSILLSEDL 200

Query: 83  LALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
              +   I   + + F   G+   PS+ +I V+
Sbjct: 201 WVYVMTQITELVEWGFVKLGLLAQPSLMLIQVV 233


>gi|427734711|ref|YP_007054255.1| hypothetical protein Riv7116_1135 [Rivularia sp. PCC 7116]
 gi|427369752|gb|AFY53708.1| putative membrane protein (DUF2232) [Rivularia sp. PCC 7116]
          Length = 240

 Score = 38.1 bits (87), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 30/114 (26%), Positives = 59/114 (51%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +RW   A   + + + +LLLVL GP +++ +++ +  +G  +GS W+ GV W +SI L T
Sbjct: 81  LRWNSRAAWMSALTSGLLLLVLMGPTRSMLFVIPYAFMGVLLGSTWQRGVPWIVSISLAT 140

Query: 61  IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
           +  + G    +   S L  +++   +   +   L ++F   G   VP+  +I +
Sbjct: 141 LLGAIGVFFRLWLLSLLSGQDLWIYLINQVTNFLEWLFVNLGFLTVPNPFLIQI 194


>gi|148240217|ref|YP_001225604.1| hypothetical protein SynWH7803_1881 [Synechococcus sp. WH 7803]
 gi|147848756|emb|CAK24307.1| Conserved hyptothetical membrane protein [Synechococcus sp. WH
           7803]
          Length = 214

 Score = 38.1 bits (87), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 46/87 (52%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G  AG + +  + +LL+ L GPV+    L  +G+L   +G  W+    W +S  L  
Sbjct: 53  VRRGGRAGLEGLAVSILLLIALMGPVRGPLMLFPYGLLSLWLGWCWQHRQSWWISWGLGV 112

Query: 61  IARSAGAMGYILTSSFLIRENILALIT 87
           +  +AG +  ++  S L+ EN+  +IT
Sbjct: 113 VIGAAGFLVRVVALSLLVGENLWVVIT 139


>gi|186683647|ref|YP_001866843.1| hypothetical protein Npun_F3493 [Nostoc punctiforme PCC 73102]
 gi|186466099|gb|ACC81900.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 246

 Score = 37.7 bits (86), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLW-RLGVDWGLSI 56
           +RWG  A     + + +LL VL GP ++L +++ +G +G  +G+ W R  V W +SI
Sbjct: 85  LRWGKRAAWMAALTSGLLLTVLMGPARSLLFVMPYGFMGVLLGATWYRRRVPWIVSI 141


>gi|17227792|ref|NP_484340.1| hypothetical protein alr0296 [Nostoc sp. PCC 7120]
 gi|17135274|dbj|BAB77820.1| alr0296 [Nostoc sp. PCC 7120]
          Length = 257

 Score = 37.4 bits (85), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
           +RWG  A     V + +LL VL GPV++L +++    +G  +G+ W     W +SI
Sbjct: 99  LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVVLGATWYRRAPWIVSI 154


>gi|88807570|ref|ZP_01123082.1| hypothetical protein WH7805_13503 [Synechococcus sp. WH 7805]
 gi|88788784|gb|EAR19939.1| hypothetical protein WH7805_13503 [Synechococcus sp. WH 7805]
          Length = 214

 Score = 37.0 bits (84), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 44/86 (51%)

Query: 2   RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
           R G  AG + +  + +LL+ L GPV+    L  +G+L   +G  W+    W +S     +
Sbjct: 54  RRGSRAGLEGLAVSILLLMALMGPVRGPLMLFPYGLLSLWLGWCWQHRQSWWISWGFGLV 113

Query: 62  ARSAGAMGYILTSSFLIRENILALIT 87
             +AG +  ++  S L+ EN+  +IT
Sbjct: 114 IGAAGFLVRVVALSLLVGENLWVVIT 139


>gi|75909273|ref|YP_323569.1| hypothetical protein Ava_3065 [Anabaena variabilis ATCC 29413]
 gi|75702998|gb|ABA22674.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 257

 Score = 37.0 bits (84), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
           +RWG  A     V + +LL VL GPV++L +++    +G  +G+ W     W +SI
Sbjct: 99  LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWYRRAPWIVSI 154


>gi|452821902|gb|EME28927.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
          Length = 342

 Score = 36.6 bits (83), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 8/91 (8%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH----GVLGFSMGSLWRLGVDWGLSI 56
           +RWG    R  +       ++  GP+  L Y+LT     GVL  +M   W    +W   I
Sbjct: 170 LRWGPRYSRLALCLLFCFFVLRPGPLYCLQYMLTQGLAVGVLTHTMWCQW----NWLFCI 225

Query: 57  FLCTIARSAGAMGYILTSSFLIRENILALIT 87
              TIA   G M  I  +SF+  +N   LIT
Sbjct: 226 LSSTIAYLFGIMLEISLTSFVFGKNTWTLIT 256


>gi|452821901|gb|EME28926.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
          Length = 323

 Score = 36.6 bits (83), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 8/91 (8%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH----GVLGFSMGSLWRLGVDWGLSI 56
           +RWG    R  +       ++  GP+  L Y+LT     GVL  +M   W    +W   I
Sbjct: 151 LRWGPRYSRLALCLLFCFFVLRPGPLYCLQYMLTQGLAVGVLTHTMWCQW----NWLFCI 206

Query: 57  FLCTIARSAGAMGYILTSSFLIRENILALIT 87
              TIA   G M  I  +SF+  +N   LIT
Sbjct: 207 LSSTIAYLFGIMLEISLTSFVFGKNTWTLIT 237


>gi|427418274|ref|ZP_18908457.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 7375]
 gi|425760987|gb|EKV01840.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 7375]
          Length = 267

 Score = 36.2 bits (82), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 17/56 (30%), Positives = 32/56 (57%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
           +R    +   T + + +LL +L GP ++L Y++ +G+LG  +G  WR    W +S+
Sbjct: 101 LRQNCRSAWMTTLVSGLLLAILLGPPRSLLYVIPYGLLGVQLGYHWRHQTSWLISL 156


>gi|349688699|ref|ZP_08899841.1| NCS1 nucleoside transporter [Gluconacetobacter oboediens 174Bp2]
          Length = 490

 Score = 35.4 bits (80), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 5/97 (5%)

Query: 14  ATAMLLLVLSGPVKALAYLLTHG--VLGFSMGSLWRLGVDW-GLS-IFLCTIARSAGAMG 69
           A A+++   SGP+        +G  V     G+ W L  ++ G S + +CTIA +A A G
Sbjct: 246 AVALVVSYFSGPMLNFGDFARYGRGVSDIRRGNFWGLPFNFVGFSALTICTIALTAPAFG 305

Query: 70  YILTSSFLIRENILALITINIHASLTFIFSAAGVNIV 106
           +++        +I +L  + I   LTFI + AG+NIV
Sbjct: 306 HVMIDPVETVAHIDSLTAV-IFGMLTFIIATAGINIV 341


>gi|330992162|ref|ZP_08316110.1| putative allantoin permease [Gluconacetobacter sp. SXCC-1]
 gi|329760361|gb|EGG76857.1| putative allantoin permease [Gluconacetobacter sp. SXCC-1]
          Length = 491

 Score = 35.4 bits (80), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 5/97 (5%)

Query: 14  ATAMLLLVLSGPVKALAYLLTHG--VLGFSMGSLWRLGVDW-GLS-IFLCTIARSAGAMG 69
           A A+++   SGP+        +G        G+ W L  ++ G S + +CTIA +A A G
Sbjct: 246 AVALVVSYFSGPMLNFGDFARYGRSAADIRRGNFWGLPFNFVGFSALTVCTIALTAPAFG 305

Query: 70  YILTSSFLIRENILALITINIHASLTFIFSAAGVNIV 106
           +++T       +I +L  + +   LTFI + AG+NIV
Sbjct: 306 HVMTDPVETVAHIDSLSAV-VFGMLTFIIATAGINIV 341


>gi|427704401|ref|YP_007047623.1| hypothetical protein Cyagr_3203 [Cyanobium gracile PCC 6307]
 gi|427347569|gb|AFY30282.1| putative membrane protein (DUF2232) [Cyanobium gracile PCC 6307]
          Length = 218

 Score = 35.0 bits (79), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 25/87 (28%), Positives = 45/87 (51%)

Query: 1   MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
           +R G     + +V T +LL+ L GP++    L  +G+L   +G  WR    W L++ +  
Sbjct: 57  LRHGWRCALEGVVVTTLLLVALMGPIRGPLVLFPYGLLSLWLGWGWRRRSSWWLTLGVGG 116

Query: 61  IARSAGAMGYILTSSFLIRENILALIT 87
           +  +AG +  +   S L+ EN+  +IT
Sbjct: 117 LIGAAGFLVRVAVLSVLVGENLWVVIT 143


>gi|254431659|ref|ZP_05045362.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
 gi|197626112|gb|EDY38671.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
          Length = 201

 Score = 35.0 bits (79), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 35/64 (54%)

Query: 24  GPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENIL 83
           GP++    L  +G+L   +G  WR  ++W LS  L ++  +AG +  +   S L+ EN+ 
Sbjct: 63  GPIRGPLLLFPYGLLALWLGWCWRRRLNWWLSWSLGSVIGAAGFLVRVAVLSVLVGENLW 122

Query: 84  ALIT 87
            +IT
Sbjct: 123 VVIT 126


>gi|434397203|ref|YP_007131207.1| Protein of unknown function DUF2232, membrane [Stanieria
           cyanosphaera PCC 7437]
 gi|428268300|gb|AFZ34241.1| Protein of unknown function DUF2232, membrane [Stanieria
           cyanosphaera PCC 7437]
          Length = 256

 Score = 34.7 bits (78), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 22/93 (23%), Positives = 44/93 (47%)

Query: 23  SGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENI 82
            GP +++ +L+ +G++G  +G  W     W LSI + ++  + G        S L+ E++
Sbjct: 118 MGPTRSIVFLMPYGLMGVQLGVCWSRRASWVLSILIGSLIGTFGLFFRFWLFSVLLGEDL 177

Query: 83  LALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
              +   I     +IF   G+   PS  +I ++
Sbjct: 178 WIYVITQITELTDWIFLKLGLLAQPSFFLIQIL 210


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.330    0.140    0.423 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,667,931,719
Number of Sequences: 23463169
Number of extensions: 58879642
Number of successful extensions: 222236
Number of sequences better than 100.0: 192
Number of HSP's better than 100.0 without gapping: 157
Number of HSP's successfully gapped in prelim test: 35
Number of HSP's that attempted gapping in prelim test: 222055
Number of HSP's gapped (non-prelim): 197
length of query: 121
length of database: 8,064,228,071
effective HSP length: 88
effective length of query: 33
effective length of database: 5,999,469,199
effective search space: 197982483567
effective search space used: 197982483567
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 69 (31.2 bits)