BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033347
(121 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224099213|ref|XP_002311406.1| predicted protein [Populus trichocarpa]
gi|222851226|gb|EEE88773.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 205 bits (521), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 99/120 (82%), Positives = 111/120 (92%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWGVA GRKTMVATAMLL VLSGPVKAL YLLTHG++GF+MGSLWR+G +WGLSIFLCT
Sbjct: 149 LRWGVAGGRKTMVATAMLLFVLSGPVKALTYLLTHGLVGFTMGSLWRMGANWGLSIFLCT 208
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
IAR+ GA+GY+LTSSFLIRENILALITINIHASLTFIF+AAG+N VPSMN IY +FG LV
Sbjct: 209 IARATGAVGYVLTSSFLIRENILALITINIHASLTFIFAAAGINTVPSMNFIYSLFGILV 268
>gi|255546947|ref|XP_002514531.1| conserved hypothetical protein [Ricinus communis]
gi|223546135|gb|EEF47637.1| conserved hypothetical protein [Ricinus communis]
Length = 306
Score = 202 bits (513), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 96/120 (80%), Positives = 108/120 (90%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGVAAGRKTMVATAMLL VLSGP+KAL YLLTHG++GF+MGSLWRL DWGLSIFLCT
Sbjct: 148 MRWGVAAGRKTMVATAMLLFVLSGPLKALTYLLTHGIVGFTMGSLWRLRADWGLSIFLCT 207
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
IAR+ GAMGY+ +SFLIRENILALITINIHASLT IF+A G+N +PSMN+IY +FG LV
Sbjct: 208 IARATGAMGYVFVTSFLIRENILALITINIHASLTLIFAAIGINTIPSMNLIYTLFGILV 267
>gi|449499494|ref|XP_004160832.1| PREDICTED: uncharacterized LOC101204847 [Cucumis sativus]
Length = 469
Score = 194 bits (493), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 94/120 (78%), Positives = 107/120 (89%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG++AGRKTMVAT +LLLVLSGPVKAL YLL HG++GF+MGSLWRLG +W SIFLCT
Sbjct: 312 MRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT 371
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA+GY+L SSFLIRENILALITINIHASLT IF+A GVN++PSMN IY IFGTLV
Sbjct: 372 IVRAFGAVGYVLVSSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV 431
>gi|449446255|ref|XP_004140887.1| PREDICTED: uncharacterized protein LOC101204847 [Cucumis sativus]
Length = 301
Score = 193 bits (490), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 94/120 (78%), Positives = 107/120 (89%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG++AGRKTMVAT +LLLVLSGPVKAL YLL HG++GF+MGSLWRLG +W SIFLCT
Sbjct: 144 MRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT 203
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA+GY+L SSFLIRENILALITINIHASLT IF+A GVN++PSMN IY IFGTLV
Sbjct: 204 IVRAFGAVGYVLVSSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV 263
>gi|356499992|ref|XP_003518819.1| PREDICTED: uncharacterized protein LOC100783446 [Glycine max]
Length = 287
Score = 186 bits (471), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 89/120 (74%), Positives = 104/120 (86%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGV AGRKT+VAT +LLLVLSGPVKAL YLL HG++GF+MG+LWR G W +SIFLCT
Sbjct: 130 MRWGVDAGRKTLVATTILLLVLSGPVKALTYLLKHGIVGFTMGTLWRTGASWNMSIFLCT 189
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA+G++L SSFLIRENILALITINIHASLTF+ +AAGV +PSMNVIY +FG LV
Sbjct: 190 IVRALGAVGFVLISSFLIRENILALITINIHASLTFLLTAAGVTSIPSMNVIYTLFGILV 249
>gi|225423785|ref|XP_002279949.1| PREDICTED: uncharacterized protein LOC100256033 isoform 1 [Vitis
vinifera]
Length = 286
Score = 185 bits (470), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 105/120 (87%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGVAAGRKTMVAT MLL VLSGPVKA YLL HG++G +MGSLWRLG +WGLSI LCT
Sbjct: 129 MRWGVAAGRKTMVATTMLLFVLSGPVKASTYLLLHGLVGLTMGSLWRLGANWGLSICLCT 188
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
R+ GAMGY+L SSFLIRENILALIT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 189 FVRAMGAMGYVLISSFLIRENILALITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 248
>gi|297737924|emb|CBI27125.3| unnamed protein product [Vitis vinifera]
Length = 192
Score = 184 bits (468), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 88/120 (73%), Positives = 105/120 (87%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGVAAGRKTMVAT MLL VLSGPVKA YLL HG++G +MGSLWRLG +WGLSI LCT
Sbjct: 35 MRWGVAAGRKTMVATTMLLFVLSGPVKASTYLLLHGLVGLTMGSLWRLGANWGLSICLCT 94
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
R+ GAMGY+L SSFLIRENILALIT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 95 FVRAMGAMGYVLISSFLIRENILALITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 154
>gi|357496929|ref|XP_003618753.1| Membrane protein [Medicago truncatula]
gi|355493768|gb|AES74971.1| Membrane protein [Medicago truncatula]
Length = 247
Score = 182 bits (463), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 87/120 (72%), Positives = 102/120 (85%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG+ AGRKTMVAT +LL VLSGPVKAL YLL HG++G++MG+LWRL W LS+FLC
Sbjct: 90 MRWGIDAGRKTMVATTILLFVLSGPVKALTYLLKHGIVGYTMGTLWRLETSWSLSVFLCA 149
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I RS GA+G++L SSFLIRENILALITINIHASLTF+ +AAGVN +PSMN IY +FG LV
Sbjct: 150 IVRSIGAVGFVLISSFLIRENILALITINIHASLTFVLTAAGVNSIPSMNFIYTLFGILV 209
>gi|297851064|ref|XP_002893413.1| hypothetical protein ARALYDRAFT_890129 [Arabidopsis lyrata subsp.
lyrata]
gi|297339255|gb|EFH69672.1| hypothetical protein ARALYDRAFT_890129 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 177 bits (449), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 81/120 (67%), Positives = 106/120 (88%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW +A GRKTMVAT MLL +LSGPVKAL Y LTHG++G ++GSLWR+G W LSIF+CT
Sbjct: 136 IRWNIAGGRKTMVATVMLLFILSGPVKALTYFLTHGLVGLALGSLWRMGASWRLSIFVCT 195
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IFGT++
Sbjct: 196 MVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVL 255
>gi|22329794|ref|NP_173943.2| uncharacterized protein [Arabidopsis thaliana]
gi|18252157|gb|AAL61911.1| unknown protein [Arabidopsis thaliana]
gi|21386935|gb|AAM47871.1| unknown protein [Arabidopsis thaliana]
gi|332192537|gb|AEE30658.1| uncharacterized protein [Arabidopsis thaliana]
Length = 289
Score = 176 bits (445), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 81/120 (67%), Positives = 105/120 (87%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW +A GRKTMVAT MLL +LSGPVKAL Y LTHG++G ++GSLW +G W LSIFLCT
Sbjct: 132 IRWNIAGGRKTMVATVMLLFILSGPVKALTYFLTHGLVGLALGSLWSMGASWRLSIFLCT 191
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IFGT++
Sbjct: 192 MVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVL 251
>gi|357139303|ref|XP_003571222.1| PREDICTED: uncharacterized protein LOC100830022 [Brachypodium
distachyon]
Length = 295
Score = 174 bits (441), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 82/120 (68%), Positives = 100/120 (83%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKTMVAT +LL LSGPVKA YLL HGV+G +MG++WRL +W +SI LC+
Sbjct: 119 LRWGLEAGRKTMVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 178
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYVIFGTL+
Sbjct: 179 ITRAVGACGYVLVSSFLIRENILELITVNIHASLTYILAAAGVNAIPSMDAIYVIFGTLL 238
>gi|242091734|ref|XP_002436357.1| hypothetical protein SORBIDRAFT_10g000990 [Sorghum bicolor]
gi|241914580|gb|EER87724.1| hypothetical protein SORBIDRAFT_10g000990 [Sorghum bicolor]
Length = 286
Score = 168 bits (426), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 81/120 (67%), Positives = 100/120 (83%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 129 LRWGLKAGRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSILLCS 188
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 189 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMDAIYVLFGTLL 248
>gi|115445849|ref|NP_001046704.1| Os02g0326000 [Oryza sativa Japonica Group]
gi|46390264|dbj|BAD15693.1| unknown protein [Oryza sativa Japonica Group]
gi|113536235|dbj|BAF08618.1| Os02g0326000 [Oryza sativa Japonica Group]
gi|215736884|dbj|BAG95813.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765527|dbj|BAG87224.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222622737|gb|EEE56869.1| hypothetical protein OsJ_06502 [Oryza sativa Japonica Group]
Length = 281
Score = 167 bits (424), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 80/120 (66%), Positives = 99/120 (82%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKT VAT +LL LSGPVKA YLL HGV+G +MG++WRL +W +SI LC+
Sbjct: 124 LRWGLEAGRKTAVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 183
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 184 ITRALGACGYVLLSSFLIRENILGLITVNIHASLTYILAAAGVNTIPSMDAIYVLFGTLL 243
>gi|218190626|gb|EEC73053.1| hypothetical protein OsI_07007 [Oryza sativa Indica Group]
Length = 281
Score = 167 bits (424), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 80/120 (66%), Positives = 99/120 (82%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKT VAT +LL LSGPVKA YLL HGV+G +MG++WRL +W +SI LC+
Sbjct: 124 LRWGLEAGRKTAVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 183
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 184 ITRALGACGYVLLSSFLIRENILGLITVNIHASLTYILAAAGVNTIPSMDAIYVLFGTLL 243
>gi|226508684|ref|NP_001140399.1| uncharacterized protein LOC100272453 [Zea mays]
gi|194699326|gb|ACF83747.1| unknown [Zea mays]
gi|413953513|gb|AFW86162.1| hypothetical protein ZEAMMB73_298407 [Zea mays]
Length = 250
Score = 166 bits (420), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 80/120 (66%), Positives = 98/120 (81%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM IY++FGTLV
Sbjct: 181 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIYLLFGTLV 240
>gi|326494316|dbj|BAJ90427.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326522989|dbj|BAJ88540.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 310
Score = 160 bits (406), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 82/120 (68%), Positives = 99/120 (82%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKTMVAT +LL LSGPVKAL YLL HGV+G SMG++WRL +W SI +C
Sbjct: 151 LRWGLEAGRKTMVATVLLLFTLSGPVKALTYLLMHGVVGLSMGTMWRLETNWIASIIICA 210
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L +SFLIRENI LIT+NIHASLT+I +AAGVN++PSM+ IYVIFGTLV
Sbjct: 211 IIRAVGACGYVLVTSFLIRENIFQLITVNIHASLTYILAAAGVNMIPSMDAIYVIFGTLV 270
>gi|168067903|ref|XP_001785841.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662495|gb|EDQ49342.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 192
Score = 156 bits (394), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 72/120 (60%), Positives = 100/120 (83%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG AAGRKTMVATAMLLL+LSGP+KA +YLL HG +G +MG+ W+ +WGLSI CT
Sbjct: 35 MRWGAAAGRKTMVATAMLLLILSGPLKAASYLLMHGFVGLAMGAFWKWNFNWGLSIVGCT 94
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R GA+G++L +S+L+R+NILALIT+N HASL+++ +A G+N+VP+M +IYV+F +L+
Sbjct: 95 VVRCLGALGFVLLTSWLLRDNILALITMNAHASLSYMLAAVGINLVPTMPMIYVVFASLL 154
>gi|224033465|gb|ACN35808.1| unknown [Zea mays]
gi|413953514|gb|AFW86163.1| membrane protein [Zea mays]
Length = 278
Score = 155 bits (392), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 75/113 (66%), Positives = 91/113 (80%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM IY
Sbjct: 181 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIY 233
>gi|195626624|gb|ACG35142.1| membrane protein [Zea mays]
Length = 192
Score = 155 bits (392), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 75/113 (66%), Positives = 91/113 (80%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 35 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 94
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM IY
Sbjct: 95 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIY 147
>gi|302802149|ref|XP_002982830.1| hypothetical protein SELMODRAFT_47311 [Selaginella moellendorffii]
gi|300149420|gb|EFJ16075.1| hypothetical protein SELMODRAFT_47311 [Selaginella moellendorffii]
Length = 206
Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 70/120 (58%), Positives = 97/120 (80%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG AAGRKTMV TA+LLLV++GP+KAL+Y+L HG LG +MG+LWR ++WG+SI CT
Sbjct: 49 MRWGAAAGRKTMVTTALLLLVIAGPLKALSYVLMHGFLGVAMGALWRWRINWGVSILACT 108
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ GA+G++ SS+LI+ENIL LITIN HASL+++ +A G+NI +M IY +F +L+
Sbjct: 109 LVRAVGAIGFVFLSSWLIKENILNLITINTHASLSYMVAAMGINISLTMPTIYAVFTSLL 168
>gi|302800209|ref|XP_002981862.1| hypothetical protein SELMODRAFT_57372 [Selaginella moellendorffii]
gi|300150304|gb|EFJ16955.1| hypothetical protein SELMODRAFT_57372 [Selaginella moellendorffii]
Length = 206
Score = 147 bits (372), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 70/120 (58%), Positives = 97/120 (80%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG AAGRKTMV TA+LLLV++GP+KAL+Y+L HG LG +MG+LWR ++WG+SI CT
Sbjct: 49 MRWGAAAGRKTMVTTALLLLVIAGPLKALSYVLMHGFLGVAMGALWRWRINWGVSILACT 108
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ GA+G++ SS+LI+ENIL LITIN HASL+++ +A G+NI +M IY +F +L+
Sbjct: 109 LVRALGAIGFVFLSSWLIKENILNLITINTHASLSYMVAAMGINISLTMPTIYAVFTSLL 168
>gi|255646056|gb|ACU23515.1| unknown [Glycine max]
Length = 223
Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 57/81 (70%), Positives = 70/81 (86%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGV AGRKT+VAT +LLLVLSGPVKAL YLL HG++GF+MG+LWR G W +SIFLCT
Sbjct: 130 MRWGVDAGRKTLVATTILLLVLSGPVKALTYLLKHGIVGFTMGTLWRTGASWNMSIFLCT 189
Query: 61 IARSAGAMGYILTSSFLIREN 81
I R+ GA+G++L SSFLIR++
Sbjct: 190 IVRALGAVGFVLISSFLIRKH 210
>gi|413953512|gb|AFW86161.1| hypothetical protein ZEAMMB73_298407 [Zea mays]
Length = 254
Score = 108 bits (269), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 57/113 (50%), Positives = 71/113 (62%), Gaps = 24/113 (21%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
I IT+NIHASLT+I +AAGVN +PSM IY
Sbjct: 181 I------------------------ITVNIHASLTYILTAAGVNTIPSMGTIY 209
>gi|12321175|gb|AAG50677.1|AC079829_10 unknown protein [Arabidopsis thaliana]
Length = 275
Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/124 (51%), Positives = 84/124 (67%), Gaps = 19/124 (15%)
Query: 13 VATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLS----IFL--------CT 60
VAT MLL +LSGPVKAL Y ++ L F L D L+ IF+ CT
Sbjct: 117 VATVMLLFILSGPVKALTYFVS---LRFPRLPSRVLSRDHCLTFGDFIFVAYSWSCGACT 173
Query: 61 ----IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIF 116
R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IF
Sbjct: 174 GIIVEVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIF 233
Query: 117 GTLV 120
GT++
Sbjct: 234 GTVL 237
>gi|384245220|gb|EIE18715.1| hypothetical protein COCSUDRAFT_20460, partial [Coccomyxa
subellipsoidea C-169]
Length = 206
Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 73/123 (59%), Gaps = 3/123 (2%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKA---LAYLLTHGVLGFSMGSLWRLGVDWGLSIF 57
MRWG AAGRKTM AT LLL + P+ L +LL G+L ++G+ W LG+ W LS+
Sbjct: 40 MRWGPAAGRKTMTATCFLLLGENFPLSTEDILTFLLNQGLLAATLGASWSLGMHWALSVA 99
Query: 58 LCTIARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
+ R G M Y+L SS+++ EN+ +++ NI++ L + + G + P V+ V+
Sbjct: 100 AGALVRVGGMMAYVLVSSWIMNENLFSILMTNIYSLLDQMNAFIGASGAPPPTVVIVVLA 159
Query: 118 TLV 120
+L+
Sbjct: 160 SLL 162
>gi|302840505|ref|XP_002951808.1| hypothetical protein VOLCADRAFT_81593 [Volvox carteri f.
nagariensis]
gi|300263056|gb|EFJ47259.1| hypothetical protein VOLCADRAFT_81593 [Volvox carteri f.
nagariensis]
Length = 330
Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 64/108 (59%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG +TM AT LL+VL GP++A++YL HG+L ++GSLW V + +
Sbjct: 167 LRSGGGAGWRTMTATCFLLVVLLGPLRAMSYLFLHGLLAATLGSLWSSRVGFWPGVVAGA 226
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
+ R G + Y++ SS + EN+ AL+ N++ L +A G++ PS
Sbjct: 227 LVRMCGQLSYLVMSSVTMNENMFALLLSNVYNMLDQASAALGLSGAPS 274
>gi|412985991|emb|CCO17191.1| predicted protein [Bathycoccus prasinos]
Length = 243
Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 62/114 (54%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
RWG A +T++ +LL +LSGP++A Y HG G ++G W+ W S+ + +
Sbjct: 78 RWGDVAAWRTLLVAFLLLTLLSGPLRASNYACLHGFTGLALGMCWKRKFSWVQSVPISAL 137
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
AR+ G + SS +IREN+L L+ ++A + I + G VP+M I +
Sbjct: 138 ARTIGVFSSLAISSLVIRENVLRLMVQQVYALIDQICATLGATFVPNMTAILCV 191
>gi|427712923|ref|YP_007061547.1| hypothetical protein Syn6312_1862 [Synechococcus sp. PCC 6312]
gi|427377052|gb|AFY61004.1| putative membrane protein (DUF2232) [Synechococcus sp. PCC 6312]
Length = 251
Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 59/112 (52%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A + +A+LL VL GP ++L YLL HG+LG +G LW+ W +S+F
Sbjct: 90 LRWHRRAAWMGAIVSALLLSVLMGPPRSLQYLLPHGILGVVLGGLWKQKAPWHISLFWGA 149
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ S G IL S L+ EN+ + + + L + G+ ++PS ++
Sbjct: 150 VLGSIGFFFQILVVSALLGENLWIYVNQQVTSFLDWALVQLGILLIPSFGLV 201
>gi|255085474|ref|XP_002505168.1| predicted protein [Micromonas sp. RCC299]
gi|226520437|gb|ACO66426.1| predicted protein [Micromonas sp. RCC299]
Length = 178
Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 62/112 (55%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
R G A + V T+MLL ++SGP++A+ Y HG + +S+G W W +SI L
Sbjct: 41 RNGPKAASRVAVVTSMLLFLISGPLRAVNYYCLHGAMAYSLGWAWTKRFSWWVSIPLSAT 100
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
RS G ++ +S ++REN++ L+ + L I + G++ +P++ ++
Sbjct: 101 VRSIGIFCSLMFTSLVLRENVMQLMVTQMMGLLDQIAANIGMSWMPTLGWVW 152
>gi|388506298|gb|AFK41215.1| unknown [Lotus japonicus]
Length = 185
Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/35 (74%), Positives = 30/35 (85%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH 35
MRWGV AG+KT+V+T +LL VLSGPVKAL YLL H
Sbjct: 150 MRWGVDAGKKTLVSTTILLFVLSGPVKALTYLLKH 184
>gi|443477605|ref|ZP_21067439.1| Protein of unknown function DUF2232, membrane [Pseudanabaena biceps
PCC 7429]
gi|443017225|gb|ELS31713.1| Protein of unknown function DUF2232, membrane [Pseudanabaena biceps
PCC 7429]
Length = 208
Score = 58.5 bits (140), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 57/112 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A MV T++LL +L GP +++ Y++ HG++G ++G LW+ W S+ + T
Sbjct: 51 LRWNSRAAWMAMVVTSLLLAILMGPTRSIQYIVPHGLVGVTLGYLWKRDFPWSGSLSIAT 110
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
I S G + S L+ EN+ + I A L ++ G P +I
Sbjct: 111 IIGSVGTAFQFVFLSVLLGENVWTYSIVQISAFLNWLMQLFGSLEQPDFAMI 162
>gi|307109189|gb|EFN57427.1| hypothetical protein CHLNCDRAFT_21406, partial [Chlorella
variabilis]
Length = 206
Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 55/97 (56%)
Query: 24 GPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENIL 83
GPV+A YLL +GVL S+ W L + W +SI L +AR G + YI SS++ EN+L
Sbjct: 67 GPVRATTYLLVYGVLSLSLAVCWALRLPWVVSIPLGALARIGGYLAYIALSSWVTNENLL 126
Query: 84 ALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
L+ N++A L + + G P + + ++ +L+
Sbjct: 127 MLMLNNVYALLDQLSALLGTTGSPPLLAVAIVLCSLL 163
>gi|428219126|ref|YP_007103591.1| hypothetical protein Pse7367_2910 [Pseudanabaena sp. PCC 7367]
gi|427990908|gb|AFY71163.1| Protein of unknown function DUF2232, membrane [Pseudanabaena sp.
PCC 7367]
Length = 209
Score = 58.2 bits (139), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 48/82 (58%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A TM+ T +LL VL GP +++ +L+ HG +G +G+LWR W SIF T
Sbjct: 52 LRWDRRAAIMTMIVTFLLLTVLMGPTRSIQFLIPHGFIGLFLGALWRRRSPWLASIFTGT 111
Query: 61 IARSAGAMGYILTSSFLIRENI 82
+ + G++ + S L+ EN+
Sbjct: 112 VLGTLGSIFQLYLVSLLLGENV 133
>gi|427725977|ref|YP_007073254.1| hypothetical protein Lepto7376_4311 [Leptolyngbya sp. PCC 7376]
gi|427357697|gb|AFY40420.1| Protein of unknown function DUF2232, membrane [Leptolyngbya sp. PCC
7376]
Length = 248
Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 61/115 (53%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL +L GP +++ Y++ +G++G +G +WR V W SI L T
Sbjct: 88 LRWGSRAAWMATLVSGLLLSILMGPTRSIIYVIPYGIMGVQLGWMWRKQVGWEWSIALGT 147
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ + G + SFL+ E++ + I +IF G+ + P++ I ++
Sbjct: 148 VLGTIGFFFRVWLLSFLLGEDLWVYVISQISELADWIFLHLGLLVQPNLMFIQLL 202
>gi|218439630|ref|YP_002377959.1| hypothetical protein PCC7424_2679 [Cyanothece sp. PCC 7424]
gi|218172358|gb|ACK71091.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 269
Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 56/109 (51%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A T + + +LL VL GP +++ +L+ +GV+G +G+ W+ G W S+ L T
Sbjct: 110 LRWGKRASWMTAIVSGLLLSVLMGPTRSIVFLIPYGVMGVQLGACWKRGAPWQWSMLLGT 169
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
I G I S L+ E++ A L + F G+ I P++
Sbjct: 170 IIGVFGLFFRIWLFSILLGEDLWLYAITQATAFLDWAFMKLGLLIEPTV 218
>gi|428771033|ref|YP_007162823.1| hypothetical protein Cyan10605_2706 [Cyanobacterium aponinum PCC
10605]
gi|428685312|gb|AFZ54779.1| Protein of unknown function DUF2232, membrane [Cyanobacterium
aponinum PCC 10605]
Length = 248
Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 57/109 (52%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A T + +LL +L GP +++ Y++ +G++G +G+LWR GV+W +SIF+ +
Sbjct: 89 LRRGKRASIITTIVCGLLLAILMGPPRSIVYIVPYGLMGIQLGALWRRGVNWYVSIFVAS 148
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ G S L+ E++ + I +IF G+ P M
Sbjct: 149 LIGCFGFFFRFWLFSILLGEDLWVYVISQITNLADWIFLKLGILARPDM 197
>gi|428222487|ref|YP_007106657.1| hypothetical protein Syn7502_02551 [Synechococcus sp. PCC 7502]
gi|427995827|gb|AFY74522.1| putative membrane protein (DUF2232) [Synechococcus sp. PCC 7502]
Length = 207
Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 57/112 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG TM T +LL +L GP +++ +L+ HG++ +G LW+ W +++ T
Sbjct: 50 LRWGQRCAWMTMTVTVLLLTILMGPTRSIQFLIPHGLMAVMLGFLWKRRSPWVVTLTFGT 109
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+A +AG S L+ EN+ A TI + + + F G P + +I
Sbjct: 110 LAGAAGVGFQFWLLSALLGENLWAYSTIQVTELIRWGFQLVGSLEQPELWLI 161
>gi|86605519|ref|YP_474282.1| membrane protein [Synechococcus sp. JA-3-3Ab]
gi|86554061|gb|ABC99019.1| putative membrane proteinprotein [Synechococcus sp. JA-3-3Ab]
Length = 227
Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 50/90 (55%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRW G + +A+LL VL GP +++ Y++ +G+LG+ LWR + W +S+
Sbjct: 66 MRWDPRTGAMALAVSALLLTVLMGPTRSILYVIPYGLLGYWCARLWRRRLSWYVSVASGA 125
Query: 61 IARSAGAMGYILTSSFLIRENILALITINI 90
+ G + +L SS L+ EN+ +TI +
Sbjct: 126 ALSAFGLVFQLLLSSLLVGENLWTYVTIQL 155
>gi|86609748|ref|YP_478510.1| hypothetical protein CYB_2307 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558290|gb|ABD03247.1| putative membrane protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 227
Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 51/90 (56%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRW G +V + +LL VL GP +++ Y++ +G+LG+ LW+ + W +S+
Sbjct: 66 MRWDPRTGAMALVVSGLLLTVLIGPTRSILYVIPYGLLGYWCARLWQQRLSWYVSVVSGA 125
Query: 61 IARSAGAMGYILTSSFLIRENILALITINI 90
+ + G + ++ SS L+ EN+ +TI +
Sbjct: 126 VLSALGLIFQLVLSSLLVGENLWIYVTIQL 155
>gi|428300982|ref|YP_007139288.1| hypothetical protein Cal6303_4411 [Calothrix sp. PCC 6303]
gi|428237526|gb|AFZ03316.1| Protein of unknown function DUF2232, membrane [Calothrix sp. PCC
6303]
Length = 235
Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 61/115 (53%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWGV A + V + +LL VL GP+++L +++ G +G ++GS W V W +S+ L T
Sbjct: 76 LRWGVRAAWMSAVVSGLLLAVLMGPIRSLLFVMPFGFMGVTLGSTWYRRVPWLVSMSLGT 135
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
I + G + S L E++ + + + ++F GV PS+ I V+
Sbjct: 136 ILGTVGVFFRVWLLSILAGEDLWIYVINQMTDLIEWVFLKLGVLANPSLLWIQVV 190
>gi|307153482|ref|YP_003888866.1| hypothetical protein Cyan7822_3654 [Cyanothece sp. PCC 7822]
gi|306983710|gb|ADN15591.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. PCC
7822]
Length = 265
Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 59/126 (46%), Gaps = 6/126 (4%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ + + +GV+G +G+ WR G W LS+ L T
Sbjct: 106 LRWGQRASWMAAIVSGLLLSVLMGPTRSIVFFIPYGVMGVQLGACWRRGASWYLSMLLGT 165
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS------MNVIYV 114
I G I S L+ E++ + L + F G+ P + VI +
Sbjct: 166 IIGVFGLFFRIWLFSILLGEDLWLYAITQVTGFLDWAFMKLGLLAQPDILLVQLLAVIAI 225
Query: 115 IFGTLV 120
IF +
Sbjct: 226 IFNNFL 231
>gi|359473028|ref|XP_003631234.1| PREDICTED: uncharacterized protein LOC100256033 isoform 2 [Vitis
vinifera]
Length = 140
Score = 52.0 bits (123), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 32/35 (91%)
Query: 86 ITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
IT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 68 ITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 102
>gi|428771964|ref|YP_007163752.1| hypothetical protein Cyast_0120 [Cyanobacterium stanieri PCC 7202]
gi|428686243|gb|AFZ46103.1| Protein of unknown function DUF2232, membrane [Cyanobacterium
stanieri PCC 7202]
Length = 247
Score = 51.2 bits (121), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 55/108 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MR G A T + + +LL +L GP +++ Y++ +G++G +G +WR G +W LS+ +
Sbjct: 89 MRRGRRASIMTTIVSGLLLSILMGPPRSIIYIIPYGLMGIQLGGMWRNGANWHLSLLTGS 148
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
I S G S L+ E++ + I +IF G+ PS
Sbjct: 149 IIGSFGFFFRFWLFSILLGEDLWIYVINQITGLADWIFLQLGILSQPS 196
>gi|172036337|ref|YP_001802838.1| hypothetical protein cce_1422 [Cyanothece sp. ATCC 51142]
gi|354553126|ref|ZP_08972433.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. ATCC
51472]
gi|171697791|gb|ACB50772.1| conserved hypothetical protein [Cyanothece sp. ATCC 51142]
gi|353554956|gb|EHC24345.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. ATCC
51472]
Length = 240
Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/108 (27%), Positives = 53/108 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + + +LL VL GP +++ +L+ +GV+G +G+ WR GV W S+
Sbjct: 81 LRWGSRSAWMGALVSGLLLTVLMGPTRSIVFLIPYGVMGVQLGAFWRRGVPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
+ + G S L+ E++ I + +IF G+ PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQMTGLAEWIFLRLGILAQPS 188
>gi|218245754|ref|YP_002371125.1| hypothetical protein PCC8801_0891 [Cyanothece sp. PCC 8801]
gi|257058799|ref|YP_003136687.1| hypothetical protein Cyan8802_0917 [Cyanothece sp. PCC 8802]
gi|218166232|gb|ACK64969.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256588965|gb|ACU99851.1| putative membrane protein [Cyanothece sp. PCC 8802]
Length = 260
Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/115 (23%), Positives = 57/115 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ +L+ +G++G +G W+ G +WG S+
Sbjct: 101 LRWGHRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGIQLGYCWQRGANWGFSLVTGA 160
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ + G S L+ E++ + + ++F G+ P+ +I ++
Sbjct: 161 LIGTFGFFFRFWLFSILLGEDLWQYVITQVTELAEWVFLKLGILSQPTFELIQLL 215
>gi|22298945|ref|NP_682192.1| hypothetical protein tlr1402 [Thermosynechococcus elongatus BP-1]
gi|22295126|dbj|BAC08954.1| tlr1402 [Thermosynechococcus elongatus BP-1]
Length = 236
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 55/115 (47%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A + +A+LL VL GP ++L +LL HG++G G W WG SI
Sbjct: 76 LRWNRRAAWMAAIVSALLLSVLMGPPRSLQFLLPHGIMGVIFGGCWAKKESWGRSILSGA 135
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ +AG I S L+ EN+ + + ++ + + P++ +I V+
Sbjct: 136 VIGTAGFFFQISLVSILLGENLWIYFNQQVTNFIDWVLVNLNLLLEPTVTLIQVV 190
>gi|428207947|ref|YP_007092300.1| hypothetical protein Chro_2964 [Chroococcidiopsis thermalis PCC
7203]
gi|428009868|gb|AFY88431.1| Protein of unknown function DUF2232, membrane [Chroococcidiopsis
thermalis PCC 7203]
Length = 232
Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 57/115 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW AG + +LL VL GP+ +L Y++ +GV+G +G+ WR W +S+ L T
Sbjct: 73 LRWSARAGWMAAGVSGLLLAVLMGPIPSLLYVMPYGVMGVLLGAAWRRRSPWIVSVALGT 132
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ + G S ++ + + + +IF G+ + PSM VI V+
Sbjct: 133 LLGAIGFFFRFWLMSIWSGRDLWVYLIVEVTELAEWIFDKLGLLLQPSMLVIQVL 187
>gi|428778778|ref|YP_007170564.1| hypothetical protein Dacsa_0419 [Dactylococcopsis salina PCC 8305]
gi|428693057|gb|AFZ49207.1| putative membrane protein (DUF2232) [Dactylococcopsis salina PCC
8305]
Length = 258
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 60/118 (50%), Gaps = 6/118 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A +++ +LL +L GP +++ Y++ +GV+G +G+ WR G WG S+ T
Sbjct: 99 LRRGRRASGMSVITVGLLLSILMGPTRSILYVIPYGVMGLQLGAFWRRGFGWGQSV---T 155
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
I G +G+ S + E++ +T + L + F G+ PS+ ++ I
Sbjct: 156 IGAIIGTLGFFFRFWLLSIFLGEDLWVYVTSQMTQFLEWGFLQLGLLAQPSLWLVQAI 213
>gi|126654855|ref|ZP_01726389.1| hypothetical protein CY0110_10477 [Cyanothece sp. CCY0110]
gi|126623590|gb|EAZ94294.1| hypothetical protein CY0110_10477 [Cyanothece sp. CCY0110]
Length = 240
Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 53/108 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + + +LL VL GP +++ +L+ +GV+G +G+ WR G+ W S+
Sbjct: 81 LRWGNRSAWMGALVSGLLLTVLMGPTRSIVFLIPYGVMGVQLGAFWRRGLPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
+ + G S L+ E++ I + +IF G+ PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQMTGLAEWIFLKLGILTQPS 188
>gi|145346093|ref|XP_001417529.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577756|gb|ABO95822.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 180
Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 62/108 (57%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
R G AA +T+V ++LLL++SGP++AL Y HGVL + + SLW G W +++ +
Sbjct: 43 RSGRAAAMRTLVVASLLLLMISGPLRALNYFFLHGVLAYVLSSLWSSGSSWWVTVPASAL 102
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
R+ G + + SS L REN++ L+ +++ L + G P++
Sbjct: 103 TRTFGILASLSISSLLYRENVMKLLVTQMYSLLDQFAANVGATFAPTI 150
>gi|284929773|ref|YP_003422295.1| hypothetical protein UCYN_12480 [cyanobacterium UCYN-A]
gi|284810217|gb|ADB95914.1| Predicted membrane protein (DUF2232) [cyanobacterium UCYN-A]
Length = 236
Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 53/109 (48%), Gaps = 2/109 (1%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + + +LL VL GP++++ ++ +G+L +G+ WR + W SIF+
Sbjct: 78 LRWGSRSAWMGWLVSGLLLTVLMGPIQSILFITNYGLLSIQLGAFWRKNISWEFSIFIGA 137
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASL-TFIFSAAGVNIVPS 108
I S L E++ +IN SL ++F G I PS
Sbjct: 138 IISIFSFFFKFWLFSILTGEDLWQY-SINQMTSLAEWLFLKFGTLIQPS 185
>gi|443323263|ref|ZP_21052271.1| putative membrane protein (DUF2232) [Gloeocapsa sp. PCC 73106]
gi|442787001|gb|ELR96726.1| putative membrane protein (DUF2232) [Gloeocapsa sp. PCC 73106]
Length = 244
Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 56/115 (48%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A +LL VL GPV+++ +L+ +G++G +G WR W S+ +
Sbjct: 85 LRRGDRAAIMASTVATLLLSVLMGPVRSVIFLIPYGLMGIQLGVFWRRSASWNFSLCVGG 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
I + G I S + E++ + + + I +IF G+ PS+ +I ++
Sbjct: 145 IIGTFGVFFRIFLFSIFLGEDLWSYVMVQITELANWIFLKLGILAEPSLGMIELL 199
>gi|334117938|ref|ZP_08492028.1| Protein of unknown function DUF2232, membrane [Microcoleus
vaginatus FGP-2]
gi|333459923|gb|EGK88533.1| Protein of unknown function DUF2232, membrane [Microcoleus
vaginatus FGP-2]
Length = 251
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 58/112 (51%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ Y++ G++G ++G LWR W LSI +
Sbjct: 91 LRWGNRAAWMAALVSGLLLSVLMGPTRSILYVMPFGIMGVALGGLWRRKASWSLSISVGA 150
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ + G S L+ +++ +T + + ++F G VPS+N I
Sbjct: 151 LIGTVGFFFRFWLLSALVGQDLWVYVTAQVTEMVEWVFVKLGWLNVPSLNTI 202
>gi|376001833|ref|ZP_09779687.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
gi|375329744|emb|CCE15440.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
Length = 226
Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G + +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
IA GA+G+ S L+ ++ T I L IF G+ PS+ I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLSRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIV 180
>gi|359457479|ref|ZP_09246042.1| hypothetical protein ACCM5_02044 [Acaryochloris sp. CCMEE 5410]
Length = 215
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 56/112 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + T +L+ VL GP +++ +L+ +GV+G +G LW+ V W +S+
Sbjct: 54 LRWGNRSAWMSAWVTTLLITVLMGPPRSIQFLMPYGVVGVMLGGLWKRQVSWAISMGWSI 113
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ + G + S L+ N+ I I L ++ G+ + P + VI
Sbjct: 114 LIMAIGFFFQLNLLSLLVGTNLWIYINRQITGFLDWVVIKLGLLLQPDVIVI 165
>gi|428317916|ref|YP_007115798.1| Protein of unknown function DUF2232, membrane [Oscillatoria
nigro-viridis PCC 7112]
gi|428241596|gb|AFZ07382.1| Protein of unknown function DUF2232, membrane [Oscillatoria
nigro-viridis PCC 7112]
Length = 251
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 57/112 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ Y++ G++G ++G LWR W LSI +
Sbjct: 91 LRWGNRAAWMAALVSGLLLSVLMGPTRSILYVMPFGIMGVALGGLWRRKASWSLSISVGA 150
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ G S L+ +++ +T + + ++F G VPS+N I
Sbjct: 151 FIGTVGFFFRFWLLSALVGQDLWVYVTAQVTEMVEWVFVKLGWLNVPSLNTI 202
>gi|282895674|ref|ZP_06303799.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281199368|gb|EFA74233.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 178
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 54/108 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL+GPV++L +++ LG +G+ W V W +SI L T
Sbjct: 18 LRWGNRASWMAAITSGLLLSVLTGPVRSLLFIMPFAFLGVLLGASWHRRVPWFVSITLGT 77
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
I + G + S L E++ +T + L ++F + PS
Sbjct: 78 ILCTLGVFFRLWLLSILSGEDLWIYLTNQVTEILQWLFLNLRILTPPS 125
>gi|158338243|ref|YP_001519420.1| hypothetical protein AM1_5139 [Acaryochloris marina MBIC11017]
gi|158308484|gb|ABW30101.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 215
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 55/111 (49%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
RWG + + T +L+ VL GP +++ +L+ +GV+G +G LW+ V W +S+ +
Sbjct: 55 RWGNRSAWMSAWVTTLLITVLMGPPRSIQFLMPYGVVGVILGGLWKRQVSWAISMGWSIL 114
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ G + S L+ N+ I I L ++ G+ + P + VI
Sbjct: 115 VMAIGFFFQLNLLSLLVGTNLWIYINRQITGFLDWVVIKLGLLLQPDVIVI 165
>gi|190192140|dbj|BAG48275.1| hypothetical protein MAE_15185 [Microcystis aeruginosa NIES-843]
Length = 251
Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCASMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGFFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200
>gi|16330851|ref|NP_441579.1| hypothetical protein sll1372 [Synechocystis sp. PCC 6803]
gi|383322593|ref|YP_005383446.1| hypothetical protein SYNGTI_1684 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325762|ref|YP_005386615.1| hypothetical protein SYNPCCP_1683 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491646|ref|YP_005409322.1| hypothetical protein SYNPCCN_1683 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436913|ref|YP_005651637.1| hypothetical protein SYNGTS_1684 [Synechocystis sp. PCC 6803]
gi|451815009|ref|YP_007451461.1| hypothetical protein MYO_117020 [Synechocystis sp. PCC 6803]
gi|1653344|dbj|BAA18259.1| sll1372 [Synechocystis sp. PCC 6803]
gi|339273945|dbj|BAK50432.1| hypothetical protein SYNGTS_1684 [Synechocystis sp. PCC 6803]
gi|359271912|dbj|BAL29431.1| hypothetical protein SYNGTI_1684 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275082|dbj|BAL32600.1| hypothetical protein SYNPCCN_1683 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278252|dbj|BAL35769.1| hypothetical protein SYNPCCP_1683 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451780978|gb|AGF51947.1| hypothetical protein MYO_117020 [Synechocystis sp. PCC 6803]
Length = 275
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 33/57 (57%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIF 57
+RW A K M+A +LL VL GP +++ +++ +G++ +G W WG SIF
Sbjct: 115 LRWRSPAAIKGMIAAGLLLSVLMGPTRSVVFVIPYGLMSLQLGFCWVRNASWGFSIF 171
>gi|425467150|ref|ZP_18846434.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389830158|emb|CCI28064.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 251
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCASMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200
>gi|423062304|ref|ZP_17051094.1| hypothetical protein SPLC1_S031930 [Arthrospira platensis C1]
gi|406716212|gb|EKD11363.1| hypothetical protein SPLC1_S031930 [Arthrospira platensis C1]
Length = 226
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G + +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
IA GA+G+ S L+ ++ T I L IF G+ PS+ I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIV 180
>gi|209522908|ref|ZP_03271465.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|209496495|gb|EDZ96793.1| conserved hypothetical protein [Arthrospira maxima CS-328]
Length = 226
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 57/118 (48%), Gaps = 6/118 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G + +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
IA GA+G+ S L+ ++ T I L IF G+ PS+ I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIV 180
>gi|434389139|ref|YP_007099750.1| putative membrane protein (DUF2232) [Chamaesiphon minutus PCC 6605]
gi|428020129|gb|AFY96223.1| putative membrane protein (DUF2232) [Chamaesiphon minutus PCC 6605]
Length = 199
Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 60/115 (52%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG AG + + + +LL VL GPV+++ +++ +GV+G +G+ WR W SI L
Sbjct: 41 LRWGNRAGWMSALVSGLLLSVLMGPVRSILFVMPYGVMGVQLGACWRRQASWLTSISLGA 100
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ G +S L+ E++ + I ++F G+ PS+ VI ++
Sbjct: 101 LVDCLGVFFRFGLTSALLGEDLWMYLMARIRDLSEWLFVQLGILAEPSLLVIQIL 155
>gi|443325608|ref|ZP_21054294.1| putative membrane protein (DUF2232) [Xenococcus sp. PCC 7305]
gi|442794767|gb|ELS04168.1| putative membrane protein (DUF2232) [Xenococcus sp. PCC 7305]
Length = 251
Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 58/115 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG + + +LL VL GPV+++ +++ +G++G +G+ W V W LS+ L
Sbjct: 93 LRRGNRAGWMSAWVSMLLLSVLMGPVRSIVFIIPYGLIGIQLGACWVRRVHWALSVLLGA 152
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ S G S L+ E++ + I ++F G+ P + +I ++
Sbjct: 153 LIGSLGLFFRFWLFSILLGEDLWVYVMTQITQLADWLFLKIGILAQPDIILIQIL 207
>gi|407961790|dbj|BAM55030.1| hypothetical protein BEST7613_6099 [Synechocystis sp. PCC 6803]
Length = 245
Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 33/57 (57%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIF 57
+RW A K M+A +LL VL GP +++ +++ +G++ +G W WG SIF
Sbjct: 85 LRWRSPAAIKGMIAAGLLLSVLMGPTRSVVFVIPYGLMSLQLGFCWVRNASWGFSIF 141
>gi|443648943|ref|ZP_21130125.1| hypothetical protein C789_665 [Microcystis aeruginosa DIANCHI905]
gi|159026388|emb|CAO90486.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443335056|gb|ELS49539.1| hypothetical protein C789_665 [Microcystis aeruginosa DIANCHI905]
Length = 251
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGIQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200
>gi|428775069|ref|YP_007166856.1| hypothetical protein PCC7418_0408 [Halothece sp. PCC 7418]
gi|428689348|gb|AFZ42642.1| Protein of unknown function DUF2232, membrane [Halothece sp. PCC
7418]
Length = 256
Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/115 (26%), Positives = 57/115 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G+ A + + +LL +L GP +++ Y++ +G++G +G+ W WGLS+ +
Sbjct: 97 LRRGMRASWMSALIVGLLLSILMGPTRSILYVIPYGLMGVQLGAFWCRNTHWGLSVTVGA 156
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
I + G S + E++ +T I L + F G+ PS+ +I I
Sbjct: 157 IIGTFGFFFRFWLLSIFLGEDLWVYVTSQIAQFLEWGFVQLGLLTQPSLWLIQAI 211
>gi|425436381|ref|ZP_18816817.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9432]
gi|425449736|ref|ZP_18829571.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
7941]
gi|440752045|ref|ZP_20931248.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
gi|389678911|emb|CCH92273.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9432]
gi|389769737|emb|CCI05505.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
7941]
gi|440176538|gb|ELP55811.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
Length = 251
Score = 45.4 bits (106), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200
>gi|425461780|ref|ZP_18841254.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9808]
gi|389825302|emb|CCI25026.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9808]
Length = 251
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200
>gi|282900315|ref|ZP_06308266.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281194820|gb|EFA69766.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 226
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 53/108 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL+GPV++L +++ LG +G W V W +SI L T
Sbjct: 66 LRWGNRASWMAAITSGLLLSVLTGPVRSLLFIMPFAFLGVLLGVSWHRRVPWVVSISLGT 125
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
I + G + S L E++ +T + + ++F + PS
Sbjct: 126 ILCTLGVFFRLWLLSILSGEDLWVYLTNQVTEIIEWLFLNLQILTTPS 173
>gi|409990906|ref|ZP_11274222.1| hypothetical protein APPUASWS_07925 [Arthrospira platensis str.
Paraca]
gi|291567238|dbj|BAI89510.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938233|gb|EKN79581.1| hypothetical protein APPUASWS_07925 [Arthrospira platensis str.
Paraca]
Length = 226
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 56/117 (47%), Gaps = 6/117 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G +G +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMGVVLGGLWKRGAMWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
I GA+G+ S L+ ++ T I L IF GV PS+ I +
Sbjct: 123 IGSLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGVLAPPSLLFIQI 179
>gi|428204395|ref|YP_007082984.1| hypothetical protein Ple7327_4314 [Pleurocapsa sp. PCC 7327]
gi|427981827|gb|AFY79427.1| putative membrane protein [Pleurocapsa sp. PCC 7327]
Length = 261
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 60/115 (52%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A T + + +LL VL GP +++ YL+ +G++G +G+ WR G W SIF
Sbjct: 102 LRWSNRASWMTTLVSVLLLTVLMGPPRSIIYLIPYGLMGVQLGACWRRGAHWWFSIFTGA 161
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ ++G S L+ E++ + + + ++F G+ PS+ +I +I
Sbjct: 162 LIGTSGFFFRFWLFSILLGEDLWLYVITQVTEIVDWVFLRLGLLAQPSVLLIQII 216
>gi|425444816|ref|ZP_18824857.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9443]
gi|425454338|ref|ZP_18834084.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9807]
gi|389735353|emb|CCI01127.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9443]
gi|389805030|emb|CCI15498.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9807]
Length = 251
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTL 200
>gi|427731990|ref|YP_007078227.1| hypothetical protein Nos7524_4904 [Nostoc sp. PCC 7524]
gi|427367909|gb|AFY50630.1| putative membrane protein (DUF2232) [Nostoc sp. PCC 7524]
Length = 243
Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 56/114 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ +G +G+ W V W +SI L T
Sbjct: 85 LRWGKRASWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWYRRVPWLVSITLGT 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
+ + G + S L E++ + + +IF G+ PS+ VI +
Sbjct: 145 VLGTLGVFFRLWLLSILSGEDLWIYVINQVTELTEWIFLRLGLLATPSVFVIQI 198
>gi|422303788|ref|ZP_16391139.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9806]
gi|389791182|emb|CCI12970.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9806]
Length = 251
Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200
>gi|425441756|ref|ZP_18822023.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389717455|emb|CCH98462.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
Length = 251
Score = 44.7 bits (104), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200
>gi|390440736|ref|ZP_10228944.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
gi|389835965|emb|CCI33070.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
Length = 251
Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200
>gi|425471084|ref|ZP_18849944.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9701]
gi|389883066|emb|CCI36507.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9701]
Length = 251
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 54/109 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSM 109
+ + G S L+ EN+ + + +IF+ G+ P++
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTL 200
>gi|254423145|ref|ZP_05036863.1| hypothetical protein S7335_3299 [Synechococcus sp. PCC 7335]
gi|196190634|gb|EDX85598.1| hypothetical protein S7335_3299 [Synechococcus sp. PCC 7335]
Length = 278
Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 33/56 (58%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+RW A + +LL +L GP +++ +++ +G+LG +G LWR WG+SI
Sbjct: 112 LRWNARAAWMGATVSGLLLTILLGPTRSVLFVMPYGILGVQLGYLWRRKAGWGISI 167
>gi|56751839|ref|YP_172540.1| hypothetical protein syc1830_d [Synechococcus elongatus PCC 6301]
gi|81301078|ref|YP_401286.1| hypothetical protein Synpcc7942_2269 [Synechococcus elongatus PCC
7942]
gi|56686798|dbj|BAD80020.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169959|gb|ABB58299.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 245
Score = 43.9 bits (102), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 46/96 (47%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
RWG+ + + ++LL VL GP + + +LL G LG +G W+ + WG SI L
Sbjct: 85 RWGLRSAWLGAIVASLLLFVLLGPTRGILFLLPPGCLGVLLGCCWQRRLSWGWSILLGGG 144
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFI 97
+ G + S L+ E+I +T L ++
Sbjct: 145 LETIGFFFRLWVLSVLLGEDIWTYVTAQATGFLEWL 180
>gi|308804379|ref|XP_003079502.1| unnamed protein product [Ostreococcus tauri]
gi|116057957|emb|CAL54160.1| unnamed protein product [Ostreococcus tauri]
Length = 230
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 29/113 (25%), Positives = 51/113 (45%), Gaps = 1/113 (0%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALA-YLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
RWG AA R+T+V + + LA L L + +LW G W +++
Sbjct: 69 RWGSAAARRTLVVATLARPKTTMSSNELAPRRLAPCTLAHADRALWSSGASWWVTVPASA 128
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIY 113
+ R+ G + + SS L REN++ L+ ++A L + G P + ++
Sbjct: 129 LTRTFGIVCSLSVSSLLYRENVMKLLVTQMYALLDQFAANVGATFAPDITWVW 181
>gi|428311088|ref|YP_007122065.1| hypothetical protein Mic7113_2881 [Microcoleus sp. PCC 7113]
gi|428252700|gb|AFZ18659.1| putative membrane protein (DUF2232) [Microcoleus sp. PCC 7113]
Length = 300
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 35/56 (62%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+RWG A + + +LL VL GP +++ +++ +G++G +G++WR +W SI
Sbjct: 140 LRWGNRASWMAAIVSGLLLSVLMGPTRSILFVIPYGLMGVQLGAMWRRQSNWLFSI 195
>gi|427705814|ref|YP_007048191.1| hypothetical protein Nos7107_0357 [Nostoc sp. PCC 7107]
gi|427358319|gb|AFY41041.1| Protein of unknown function DUF2232, membrane [Nostoc sp. PCC 7107]
Length = 254
Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 56/114 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ + +G +G+ W V W +SI L
Sbjct: 94 LRWGQRAAWMAAVTSGLLLSVLMGPVRSLLFVMPYAFMGVLLGATWHRRVPWIVSITLAA 153
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
+ + G + S L E++ + + +IF G+ PS+ +I V
Sbjct: 154 LLGTLGFFFRMWLLSILSGEDLWIYLINQVTEITEWIFLKLGLLNTPSVFLIQV 207
>gi|443316964|ref|ZP_21046389.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 6406]
gi|442783427|gb|ELR93342.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 6406]
Length = 273
Score = 43.5 bits (101), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 1/89 (1%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL V+ GP ++L +L+ +G+LG +G LWR W +I L
Sbjct: 107 LRWGSRAAWMSALVATLLLAVMMGPPRSLLFLMPYGLLGVQLGWLWRRRASWYTTIGLGA 166
Query: 61 IARSAGAMGYILTSSFLIRENILALITIN 89
+ + G + S ++ E+ L + TIN
Sbjct: 167 LLSALGVFVRLWLLSIMLGED-LWIYTIN 194
>gi|67922653|ref|ZP_00516158.1| similar to membrane protein [Crocosphaera watsonii WH 8501]
gi|67855501|gb|EAM50755.1| similar to membrane protein [Crocosphaera watsonii WH 8501]
Length = 240
Score = 43.1 bits (100), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 51/108 (47%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A + + +LL VL GP +++ +L+ +G++G +G+ WR + W S+
Sbjct: 81 LRRGKRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGVQLGAFWRRNLPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
+ + G S L+ E++ I I +IF G+ PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQITGLAEWIFLRLGILAQPS 188
>gi|416392028|ref|ZP_11685833.1| hypothetical protein CWATWH0003_2646 [Crocosphaera watsonii WH
0003]
gi|357263688|gb|EHJ12658.1| hypothetical protein CWATWH0003_2646 [Crocosphaera watsonii WH
0003]
Length = 240
Score = 43.1 bits (100), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 28/108 (25%), Positives = 51/108 (47%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A + + +LL VL GP +++ +L+ +G++G +G+ WR + W S+
Sbjct: 81 LRRGKRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGVQLGAFWRRNLPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPS 108
+ + G S L+ E++ I I +IF G+ PS
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQITGLAEWIFLRLGILAQPS 188
>gi|170078396|ref|YP_001735034.1| hypothetical protein SYNPCC7002_A1790 [Synechococcus sp. PCC 7002]
gi|169886065|gb|ACA99778.1| conserved hypothetical membrane protein [Synechococcus sp. PCC
7002]
Length = 243
Score = 42.7 bits (99), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 59/112 (52%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +++LL +L GP +++ Y + +G++G +G +WR V W S+ L T
Sbjct: 83 LRWGSRAAWMSTLVSSLLLSILMGPTRSIIYTIPYGLIGVQLGWMWRRQVSWYWSVTLGT 142
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
I + G + S L+ E++ + I ++F G+ PS+ VI
Sbjct: 143 ILGTLGFFFRVWLLSLLLGEDLWVYVISQISEMADWLFLKLGLLTQPSLLVI 194
>gi|354567780|ref|ZP_08986948.1| Protein of unknown function DUF2232, membrane [Fischerella sp.
JSC-11]
gi|353542238|gb|EHC11702.1| Protein of unknown function DUF2232, membrane [Fischerella sp.
JSC-11]
Length = 223
Score = 42.7 bits (99), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 55/112 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GP+++L +++ +G +G+ W V W +SI L
Sbjct: 64 LRWGKRAAWMGAVTSGLLLSVLMGPIRSLLFVMPFAFMGVLLGAAWYRRVPWIVSISLGA 123
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ + G + S L E++ + + + +IF G+ PS ++I
Sbjct: 124 VLGTLGVFFRLWLLSLLSGEDLWVYVINQVTEIVEWIFLRLGILASPSTSLI 175
>gi|427717327|ref|YP_007065321.1| hypothetical protein Cal7507_2044 [Calothrix sp. PCC 7507]
gi|427349763|gb|AFY32487.1| Protein of unknown function DUF2232, membrane [Calothrix sp. PCC
7507]
Length = 245
Score = 42.4 bits (98), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 31/114 (27%), Positives = 54/114 (47%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + +LL VL GP+++L +++ + +G +G+ W V W +SI L T
Sbjct: 85 LRWGKRAAWMAALTCGLLLAVLMGPIRSLLFVMPYAFMGVLLGATWNRRVPWIVSITLGT 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
+ + G I S L E++ + + +IF V PS+ I +
Sbjct: 145 LLATLGIFFRIWLLSVLSGEDLWVYVINQVTEITEWIFLKLQVLANPSVFFIQI 198
>gi|72383606|ref|YP_292961.1| hypothetical protein PMN2A_1770 [Prochlorococcus marinus str.
NATL2A]
gi|72003456|gb|AAZ59258.1| uncharacterized conserved membrane protein [Prochlorococcus marinus
str. NATL2A]
Length = 228
Score = 42.0 bits (97), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 29/90 (32%), Positives = 46/90 (51%), Gaps = 6/90 (6%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R GV G + + MLL VL GP++ L +G+L +G W G +W LS +
Sbjct: 67 IRRGVKTGIEGVTICVMLLTVLMGPLRGPLVLFPYGLLSLWLGYCWEKGWNWWLS---WS 123
Query: 61 IARSAGAMGY---ILTSSFLIRENILALIT 87
+ S G MG+ + S L+ EN+ ++T
Sbjct: 124 VGVSIGTMGFLVRVFVLSLLVGENLWVILT 153
>gi|440681213|ref|YP_007156008.1| Protein of unknown function DUF2232, membrane [Anabaena cylindrica
PCC 7122]
gi|428678332|gb|AFZ57098.1| Protein of unknown function DUF2232, membrane [Anabaena cylindrica
PCC 7122]
Length = 244
Score = 41.6 bits (96), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 55/114 (48%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV+++ +++ ++G +G+ W V W +SI L
Sbjct: 84 LRWGKRAAWMAAVTSGLLLAVLMGPVRSILFVMPFALMGVLLGATWHRRVPWIVSITLGM 143
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
+ + G + S L E++ + + + +IF + PS+ I V
Sbjct: 144 LLGTLGVFFRLWLLSVLSGEDLWVYVINQVTEIIEWIFLKLEILATPSVFFIQV 197
>gi|428211971|ref|YP_007085115.1| hypothetical protein Oscil6304_1490 [Oscillatoria acuminata PCC
6304]
gi|428000352|gb|AFY81195.1| putative membrane protein (DUF2232) [Oscillatoria acuminata PCC
6304]
Length = 265
Score = 41.2 bits (95), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 63/120 (52%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GP +++ +++ G+LG +G LW+ W ++IFL +
Sbjct: 106 LRWGNRAAWMGAVVSGLLLSVLMGPTRSILFVVPFGLLGVLLGCLWKRRASWEVAIFLGS 165
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + L+ E++ +TI I ++F G+ + PS+ VI + LV
Sbjct: 166 LLGAFGFFFRLWLLGILVGEDLWVYLTIQITEIADWLFLRLGLLVQPSLYVIQALAAILV 225
>gi|113475248|ref|YP_721309.1| hypothetical protein Tery_1556 [Trichodesmium erythraeum IMS101]
gi|110166296|gb|ABG50836.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 219
Score = 41.2 bits (95), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 26/115 (22%), Positives = 58/115 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL +L GP +++ +++ ++G +G+LWR G W +SI +
Sbjct: 60 LRWGSRASWMGAIVSGLLLSILMGPTRSILFVIPFALMGVMLGALWRRGKSWFVSITYGS 119
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ S G S L+ +++ + + ++F G+ P++ ++ +I
Sbjct: 120 LLGSIGFFFRFWLLSILLGQDLWIYLITQATELVEWVFVKLGLLNQPNLLLVQLI 174
>gi|414075864|ref|YP_006995182.1| hypothetical protein ANA_C10569 [Anabaena sp. 90]
gi|413969280|gb|AFW93369.1| hypothetical protein ANA_C10569 [Anabaena sp. 90]
Length = 254
Score = 41.2 bits (95), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 29/112 (25%), Positives = 56/112 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GP ++L +++ G++G +G W V W +S+ L T
Sbjct: 95 LRWGKRAAWMAAVTSGLLLSVLMGPFRSLLFVMPFGLMGVLLGVTWYRRVPWIVSMTLGT 154
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ + G + S L E++ + + + +IF + + P++ +I
Sbjct: 155 LLGTMGVFFRLWFLSVLSGEDLWVYVINQVTEIIEWIFLKLELLMTPNLLLI 206
>gi|119490055|ref|ZP_01622679.1| hypothetical protein L8106_20785 [Lyngbya sp. PCC 8106]
gi|119454207|gb|EAW35359.1| hypothetical protein L8106_20785 [Lyngbya sp. PCC 8106]
Length = 232
Score = 41.2 bits (95), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 55/111 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + +LL VL GP +++ +L+ +G +G +MG LW+ G +W +SI +
Sbjct: 72 LRWGKREAWMGALTSLLLLTVLMGPTRSIIFLIPYGWMGVAMGGLWKRGANWFVSIGVGA 131
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNV 111
I S G S L+ ++ T I +IF G+ PS+ +
Sbjct: 132 IIGSIGFFFRYWLLSILLGRDLWVYATTQITELAEWIFIKLGILAQPSLTL 182
>gi|298492012|ref|YP_003722189.1| hypothetical protein Aazo_3420 ['Nostoc azollae' 0708]
gi|298233930|gb|ADI65066.1| Protein of unknown function DUF2232, membrane ['Nostoc azollae'
0708]
Length = 242
Score = 40.8 bits (94), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 55/112 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV+++ +++ G++G +G+ W V W +SI L
Sbjct: 82 LRWGKRAAWMGAVTSGLLLGVLMGPVRSMLFVMPFGLMGVLLGATWYRRVPWIVSITLGM 141
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
+ + G + S L E++ + + +IF + PS+ VI
Sbjct: 142 LLGTLGVFFRLWLLSVLSGEDLWIYVINQVTEITEWIFLKLQILATPSVFVI 193
>gi|220905783|ref|YP_002481094.1| hypothetical protein Cyan7425_0341 [Cyanothece sp. PCC 7425]
gi|219862394|gb|ACL42733.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length = 244
Score = 40.4 bits (93), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 57/115 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW + T + A+LL VL GP ++L +L+ HG+LG +G W+ W +SI
Sbjct: 85 LRWNLRTATMTALVAALLLSVLMGPPRSLQFLVPHGLLGILLGHFWQRRASWWVSIAWGA 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ G I S L+ EN+ + + L ++F++ G + P + V+ +
Sbjct: 145 LLGGIGLFFQIGLLSLLLGENLWLYLNQQVTNLLDWLFTSLGWLVQPDLAVVQAV 199
>gi|428304516|ref|YP_007141341.1| hypothetical protein Cri9333_0916 [Crinalium epipsammum PCC 9333]
gi|428246051|gb|AFZ11831.1| Protein of unknown function DUF2232, membrane [Crinalium epipsammum
PCC 9333]
Length = 260
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 21/56 (37%), Positives = 37/56 (66%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+RWG A + + + +LL VL GP +++ + + +G++G +G+LWR G +W LSI
Sbjct: 100 LRWGHRASWMSALVSGLLLSVLMGPTRSILFFIPYGLMGVQLGALWRRGSNWLLSI 155
>gi|411118678|ref|ZP_11391058.1| putative membrane protein (DUF2232) [Oscillatoriales cyanobacterium
JSC-12]
gi|410710541|gb|EKQ68048.1| putative membrane protein (DUF2232) [Oscillatoriales cyanobacterium
JSC-12]
Length = 275
Score = 40.4 bits (93), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 55/112 (49%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP ++L YL+ G +G +G LWR +W SI + +
Sbjct: 115 LRWGYRAAWMGALVSGLLLSVLMGPPRSLLYLMPFGFMGVLLGMLWRRQANWAGSIAIAS 174
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVI 112
I S G + S L+ +++ T + + + G+ PS+ +I
Sbjct: 175 ILGSFGFFFRVWLVSLLLGDDLWLYTTTQVTNFIDWFCVRFGILFQPSLTLI 226
>gi|124025204|ref|YP_001014320.1| hypothetical protein NATL1_04931 [Prochlorococcus marinus str.
NATL1A]
gi|123960272|gb|ABM75055.1| Predicted membrane protein [Prochlorococcus marinus str. NATL1A]
Length = 228
Score = 40.0 bits (92), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 45/90 (50%), Gaps = 6/90 (6%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R GV G + + MLL L GP++ L +G+L +G W G +W LS +
Sbjct: 67 IRRGVKTGIEGVTICVMLLTALMGPLRGPLVLFPYGLLSLWLGYCWEKGWNWWLS---WS 123
Query: 61 IARSAGAMGY---ILTSSFLIRENILALIT 87
+ S G MG+ + S L+ EN+ ++T
Sbjct: 124 VGVSIGTMGFLVRVFVLSLLVGENLWVILT 153
>gi|148241761|ref|YP_001226918.1| hypothetical protein SynRCC307_0662 [Synechococcus sp. RCC307]
gi|147850071|emb|CAK27565.1| Uncharacterized conserved membrane protein [Synechococcus sp.
RCC307]
Length = 207
Score = 40.0 bits (92), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 27/100 (27%), Positives = 48/100 (48%)
Query: 15 TAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTS 74
T +LL VL GPV+ L +G+L +G WR W S+ + + +AG + +
Sbjct: 60 TVLLLTVLMGPVRGPLMLFPYGLLSLWLGWCWRRQCSWWFSMAVGVLIGAAGFLVRVALV 119
Query: 75 SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
S L+ EN+ +IT L + + + P+++ + V
Sbjct: 120 SLLLGENLWVVITAAASGLLNGLADRLQLPVAPTLDQVQV 159
>gi|434405583|ref|YP_007148468.1| putative membrane protein (DUF2232) [Cylindrospermum stagnale PCC
7417]
gi|428259838|gb|AFZ25788.1| putative membrane protein (DUF2232) [Cylindrospermum stagnale PCC
7417]
Length = 260
Score = 39.3 bits (90), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 57/114 (50%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ +G +G+ W V W +SI L T
Sbjct: 100 LRWGRRAAWMAAVTSGLLLAVLMGPVRSLLFVMPFAFMGVLLGATWYRRVPWIVSITLGT 159
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
+ + G + S L E++ + + + +IF G+ PS+ +I V
Sbjct: 160 LLGTLGVFFRLWLLSVLSGEDLWIYVINQVTDIIEWIFLKLGLLTNPSVFLIQV 213
>gi|119512452|ref|ZP_01631534.1| hypothetical protein N9414_13495 [Nodularia spumigena CCY9414]
gi|119462918|gb|EAW43873.1| hypothetical protein N9414_13495 [Nodularia spumigena CCY9414]
Length = 241
Score = 39.3 bits (90), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 20/56 (35%), Positives = 32/56 (57%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+RWG A V + +LL VL GPV++L +++ +G +G+ W V W +SI
Sbjct: 81 LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWHRRVPWIVSI 136
>gi|254413468|ref|ZP_05027238.1| hypothetical protein MC7420_6047 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196179575|gb|EDX74569.1| hypothetical protein MC7420_6047 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 277
Score = 38.9 bits (89), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/93 (24%), Positives = 48/93 (51%)
Query: 23 SGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENI 82
GP +++ +L+ +G++G +G+LW+ +W SI + T+ + G S L+ E++
Sbjct: 141 QGPTRSILFLIPYGLMGVQLGALWKRQANWLFSIGIGTLLGTLGFFFRFWLLSILLSEDL 200
Query: 83 LALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ I + + F G+ PS+ +I V+
Sbjct: 201 WVYVMTQITELVEWGFVKLGLLAQPSLMLIQVV 233
>gi|427734711|ref|YP_007054255.1| hypothetical protein Riv7116_1135 [Rivularia sp. PCC 7116]
gi|427369752|gb|AFY53708.1| putative membrane protein (DUF2232) [Rivularia sp. PCC 7116]
Length = 240
Score = 38.1 bits (87), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 59/114 (51%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A + + + +LLLVL GP +++ +++ + +G +GS W+ GV W +SI L T
Sbjct: 81 LRWNSRAAWMSALTSGLLLLVLMGPTRSMLFVIPYAFMGVLLGSTWQRGVPWIVSISLAT 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
+ + G + S L +++ + + L ++F G VP+ +I +
Sbjct: 141 LLGAIGVFFRLWLLSLLSGQDLWIYLINQVTNFLEWLFVNLGFLTVPNPFLIQI 194
>gi|148240217|ref|YP_001225604.1| hypothetical protein SynWH7803_1881 [Synechococcus sp. WH 7803]
gi|147848756|emb|CAK24307.1| Conserved hyptothetical membrane protein [Synechococcus sp. WH
7803]
Length = 214
Score = 38.1 bits (87), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 46/87 (52%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG + + + +LL+ L GPV+ L +G+L +G W+ W +S L
Sbjct: 53 VRRGGRAGLEGLAVSILLLIALMGPVRGPLMLFPYGLLSLWLGWCWQHRQSWWISWGLGV 112
Query: 61 IARSAGAMGYILTSSFLIRENILALIT 87
+ +AG + ++ S L+ EN+ +IT
Sbjct: 113 VIGAAGFLVRVVALSLLVGENLWVVIT 139
>gi|186683647|ref|YP_001866843.1| hypothetical protein Npun_F3493 [Nostoc punctiforme PCC 73102]
gi|186466099|gb|ACC81900.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 246
Score = 37.7 bits (86), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 20/57 (35%), Positives = 34/57 (59%), Gaps = 1/57 (1%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLW-RLGVDWGLSI 56
+RWG A + + +LL VL GP ++L +++ +G +G +G+ W R V W +SI
Sbjct: 85 LRWGKRAAWMAALTSGLLLTVLMGPARSLLFVMPYGFMGVLLGATWYRRRVPWIVSI 141
>gi|17227792|ref|NP_484340.1| hypothetical protein alr0296 [Nostoc sp. PCC 7120]
gi|17135274|dbj|BAB77820.1| alr0296 [Nostoc sp. PCC 7120]
Length = 257
Score = 37.4 bits (85), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+RWG A V + +LL VL GPV++L +++ +G +G+ W W +SI
Sbjct: 99 LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVVLGATWYRRAPWIVSI 154
>gi|88807570|ref|ZP_01123082.1| hypothetical protein WH7805_13503 [Synechococcus sp. WH 7805]
gi|88788784|gb|EAR19939.1| hypothetical protein WH7805_13503 [Synechococcus sp. WH 7805]
Length = 214
Score = 37.0 bits (84), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 44/86 (51%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
R G AG + + + +LL+ L GPV+ L +G+L +G W+ W +S +
Sbjct: 54 RRGSRAGLEGLAVSILLLMALMGPVRGPLMLFPYGLLSLWLGWCWQHRQSWWISWGFGLV 113
Query: 62 ARSAGAMGYILTSSFLIRENILALIT 87
+AG + ++ S L+ EN+ +IT
Sbjct: 114 IGAAGFLVRVVALSLLVGENLWVVIT 139
>gi|75909273|ref|YP_323569.1| hypothetical protein Ava_3065 [Anabaena variabilis ATCC 29413]
gi|75702998|gb|ABA22674.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 257
Score = 37.0 bits (84), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 19/56 (33%), Positives = 31/56 (55%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+RWG A V + +LL VL GPV++L +++ +G +G+ W W +SI
Sbjct: 99 LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWYRRAPWIVSI 154
>gi|452821902|gb|EME28927.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 342
Score = 36.6 bits (83), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 8/91 (8%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH----GVLGFSMGSLWRLGVDWGLSI 56
+RWG R + ++ GP+ L Y+LT GVL +M W +W I
Sbjct: 170 LRWGPRYSRLALCLLFCFFVLRPGPLYCLQYMLTQGLAVGVLTHTMWCQW----NWLFCI 225
Query: 57 FLCTIARSAGAMGYILTSSFLIRENILALIT 87
TIA G M I +SF+ +N LIT
Sbjct: 226 LSSTIAYLFGIMLEISLTSFVFGKNTWTLIT 256
>gi|452821901|gb|EME28926.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 323
Score = 36.6 bits (83), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 8/91 (8%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH----GVLGFSMGSLWRLGVDWGLSI 56
+RWG R + ++ GP+ L Y+LT GVL +M W +W I
Sbjct: 151 LRWGPRYSRLALCLLFCFFVLRPGPLYCLQYMLTQGLAVGVLTHTMWCQW----NWLFCI 206
Query: 57 FLCTIARSAGAMGYILTSSFLIRENILALIT 87
TIA G M I +SF+ +N LIT
Sbjct: 207 LSSTIAYLFGIMLEISLTSFVFGKNTWTLIT 237
>gi|427418274|ref|ZP_18908457.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 7375]
gi|425760987|gb|EKV01840.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 7375]
Length = 267
Score = 36.2 bits (82), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 17/56 (30%), Positives = 32/56 (57%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+R + T + + +LL +L GP ++L Y++ +G+LG +G WR W +S+
Sbjct: 101 LRQNCRSAWMTTLVSGLLLAILLGPPRSLLYVIPYGLLGVQLGYHWRHQTSWLISL 156
>gi|349688699|ref|ZP_08899841.1| NCS1 nucleoside transporter [Gluconacetobacter oboediens 174Bp2]
Length = 490
Score = 35.4 bits (80), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 14 ATAMLLLVLSGPVKALAYLLTHG--VLGFSMGSLWRLGVDW-GLS-IFLCTIARSAGAMG 69
A A+++ SGP+ +G V G+ W L ++ G S + +CTIA +A A G
Sbjct: 246 AVALVVSYFSGPMLNFGDFARYGRGVSDIRRGNFWGLPFNFVGFSALTICTIALTAPAFG 305
Query: 70 YILTSSFLIRENILALITINIHASLTFIFSAAGVNIV 106
+++ +I +L + I LTFI + AG+NIV
Sbjct: 306 HVMIDPVETVAHIDSLTAV-IFGMLTFIIATAGINIV 341
>gi|330992162|ref|ZP_08316110.1| putative allantoin permease [Gluconacetobacter sp. SXCC-1]
gi|329760361|gb|EGG76857.1| putative allantoin permease [Gluconacetobacter sp. SXCC-1]
Length = 491
Score = 35.4 bits (80), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 14 ATAMLLLVLSGPVKALAYLLTHG--VLGFSMGSLWRLGVDW-GLS-IFLCTIARSAGAMG 69
A A+++ SGP+ +G G+ W L ++ G S + +CTIA +A A G
Sbjct: 246 AVALVVSYFSGPMLNFGDFARYGRSAADIRRGNFWGLPFNFVGFSALTVCTIALTAPAFG 305
Query: 70 YILTSSFLIRENILALITINIHASLTFIFSAAGVNIV 106
+++T +I +L + + LTFI + AG+NIV
Sbjct: 306 HVMTDPVETVAHIDSLSAV-VFGMLTFIIATAGINIV 341
>gi|427704401|ref|YP_007047623.1| hypothetical protein Cyagr_3203 [Cyanobium gracile PCC 6307]
gi|427347569|gb|AFY30282.1| putative membrane protein (DUF2232) [Cyanobium gracile PCC 6307]
Length = 218
Score = 35.0 bits (79), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 45/87 (51%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G + +V T +LL+ L GP++ L +G+L +G WR W L++ +
Sbjct: 57 LRHGWRCALEGVVVTTLLLVALMGPIRGPLVLFPYGLLSLWLGWGWRRRSSWWLTLGVGG 116
Query: 61 IARSAGAMGYILTSSFLIRENILALIT 87
+ +AG + + S L+ EN+ +IT
Sbjct: 117 LIGAAGFLVRVAVLSVLVGENLWVVIT 143
>gi|254431659|ref|ZP_05045362.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197626112|gb|EDY38671.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 201
Score = 35.0 bits (79), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 20/64 (31%), Positives = 35/64 (54%)
Query: 24 GPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENIL 83
GP++ L +G+L +G WR ++W LS L ++ +AG + + S L+ EN+
Sbjct: 63 GPIRGPLLLFPYGLLALWLGWCWRRRLNWWLSWSLGSVIGAAGFLVRVAVLSVLVGENLW 122
Query: 84 ALIT 87
+IT
Sbjct: 123 VVIT 126
>gi|434397203|ref|YP_007131207.1| Protein of unknown function DUF2232, membrane [Stanieria
cyanosphaera PCC 7437]
gi|428268300|gb|AFZ34241.1| Protein of unknown function DUF2232, membrane [Stanieria
cyanosphaera PCC 7437]
Length = 256
Score = 34.7 bits (78), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 22/93 (23%), Positives = 44/93 (47%)
Query: 23 SGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENI 82
GP +++ +L+ +G++G +G W W LSI + ++ + G S L+ E++
Sbjct: 118 MGPTRSIVFLMPYGLMGVQLGVCWSRRASWVLSILIGSLIGTFGLFFRFWLFSVLLGEDL 177
Query: 83 LALITINIHASLTFIFSAAGVNIVPSMNVIYVI 115
+ I +IF G+ PS +I ++
Sbjct: 178 WIYVITQITELTDWIFLKLGLLAQPSFFLIQIL 210
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.330 0.140 0.423
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,667,931,719
Number of Sequences: 23463169
Number of extensions: 58879642
Number of successful extensions: 222236
Number of sequences better than 100.0: 192
Number of HSP's better than 100.0 without gapping: 157
Number of HSP's successfully gapped in prelim test: 35
Number of HSP's that attempted gapping in prelim test: 222055
Number of HSP's gapped (non-prelim): 197
length of query: 121
length of database: 8,064,228,071
effective HSP length: 88
effective length of query: 33
effective length of database: 5,999,469,199
effective search space: 197982483567
effective search space used: 197982483567
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 69 (31.2 bits)