BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 031495
(158 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224099213|ref|XP_002311406.1| predicted protein [Populus trichocarpa]
gi|222851226|gb|EEE88773.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 272 bits (696), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 134/158 (84%), Positives = 147/158 (93%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWGVA GRKTMVATAMLL VLSGPVKAL YLLTHG++GF+MGSLWR+G +WGLSIFLCT
Sbjct: 149 LRWGVAGGRKTMVATAMLLFVLSGPVKALTYLLTHGLVGFTMGSLWRMGANWGLSIFLCT 208
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
IAR+ GA+GY+LTSSFLIRENILALITINIHASLTFIF+AAG+N VPSMN IY +FG LV
Sbjct: 209 IARATGAVGYVLTSSFLIRENILALITINIHASLTFIFAAAGINTVPSMNFIYSLFGILV 268
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+LNSGFFVFLLHLLYSVFLTRLGMK SLRLPRWLE AL
Sbjct: 269 VLNSGFFVFLLHLLYSVFLTRLGMKDSLRLPRWLEKAL 306
>gi|255546947|ref|XP_002514531.1| conserved hypothetical protein [Ricinus communis]
gi|223546135|gb|EEF47637.1| conserved hypothetical protein [Ricinus communis]
Length = 306
Score = 266 bits (679), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 130/158 (82%), Positives = 142/158 (89%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGVAAGRKTMVATAMLL VLSGP+KAL YLLTHG++GF+MGSLWRL DWGLSIFLCT
Sbjct: 148 MRWGVAAGRKTMVATAMLLFVLSGPLKALTYLLTHGIVGFTMGSLWRLRADWGLSIFLCT 207
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
IAR+ GAMGY+ +SFLIRENILALITINIHASLT IF+A G+N +PSMN+IY +FG LV
Sbjct: 208 IARATGAMGYVFVTSFLIRENILALITINIHASLTLIFAAIGINTIPSMNLIYTLFGILV 267
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLNSG FVFLLHLLYSVF TRLGMK SLRLPRWLE AL
Sbjct: 268 LLNSGSFVFLLHLLYSVFFTRLGMKDSLRLPRWLEKAL 305
>gi|449499494|ref|XP_004160832.1| PREDICTED: uncharacterized LOC101204847 [Cucumis sativus]
Length = 469
Score = 250 bits (639), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 122/158 (77%), Positives = 139/158 (87%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG++AGRKTMVAT +LLLVLSGPVKAL YLL HG++GF+MGSLWRLG +W SIFLCT
Sbjct: 312 MRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT 371
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA+GY+L SSFLIRENILALITINIHASLT IF+A GVN++PSMN IY IFGTLV
Sbjct: 372 IVRAFGAVGYVLVSSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV 431
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LN G F+FLLHLLYS+FLTRLG+K SL LPRWLE A+
Sbjct: 432 FLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 469
>gi|449446255|ref|XP_004140887.1| PREDICTED: uncharacterized protein LOC101204847 [Cucumis sativus]
Length = 301
Score = 249 bits (636), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 122/158 (77%), Positives = 139/158 (87%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG++AGRKTMVAT +LLLVLSGPVKAL YLL HG++GF+MGSLWRLG +W SIFLCT
Sbjct: 144 MRWGISAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT 203
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA+GY+L SSFLIRENILALITINIHASLT IF+A GVN++PSMN IY IFGTLV
Sbjct: 204 IVRAFGAVGYVLVSSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV 263
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LN G F+FLLHLLYS+FLTRLG+K SL LPRWLE A+
Sbjct: 264 FLNCGCFMFLLHLLYSIFLTRLGLKTSLTLPRWLEKAM 301
>gi|356499992|ref|XP_003518819.1| PREDICTED: uncharacterized protein LOC100783446 [Glycine max]
Length = 287
Score = 249 bits (635), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 120/158 (75%), Positives = 140/158 (88%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGV AGRKT+VAT +LLLVLSGPVKAL YLL HG++GF+MG+LWR G W +SIFLCT
Sbjct: 130 MRWGVDAGRKTLVATTILLLVLSGPVKALTYLLKHGIVGFTMGTLWRTGASWNMSIFLCT 189
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA+G++L SSFLIRENILALITINIHASLTF+ +AAGV +PSMNVIY +FG LV
Sbjct: 190 IVRALGAVGFVLISSFLIRENILALITINIHASLTFLLTAAGVTSIPSMNVIYTLFGILV 249
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
L+NSG F+FLLHLLYSVFLTR+GMK+SLRLPRWLE A+
Sbjct: 250 LINSGCFMFLLHLLYSVFLTRMGMKSSLRLPRWLERAI 287
>gi|225423785|ref|XP_002279949.1| PREDICTED: uncharacterized protein LOC100256033 isoform 1 [Vitis
vinifera]
Length = 286
Score = 248 bits (632), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 119/158 (75%), Positives = 141/158 (89%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGVAAGRKTMVAT MLL VLSGPVKA YLL HG++G +MGSLWRLG +WGLSI LCT
Sbjct: 129 MRWGVAAGRKTMVATTMLLFVLSGPVKASTYLLLHGLVGLTMGSLWRLGANWGLSICLCT 188
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
R+ GAMGY+L SSFLIRENILALIT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 189 FVRAMGAMGYVLISSFLIRENILALITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 248
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLNSGFFVFLLH+LY+VFL+RLGMKASL +PRWLE A+
Sbjct: 249 LLNSGFFVFLLHILYAVFLSRLGMKASLTMPRWLEKAI 286
>gi|297737924|emb|CBI27125.3| unnamed protein product [Vitis vinifera]
Length = 192
Score = 247 bits (631), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 119/158 (75%), Positives = 141/158 (89%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGVAAGRKTMVAT MLL VLSGPVKA YLL HG++G +MGSLWRLG +WGLSI LCT
Sbjct: 35 MRWGVAAGRKTMVATTMLLFVLSGPVKASTYLLLHGLVGLTMGSLWRLGANWGLSICLCT 94
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
R+ GAMGY+L SSFLIRENILALIT+N+HAS+T++F+A GVN +PSM++IY +FGTL+
Sbjct: 95 FVRAMGAMGYVLISSFLIRENILALITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLL 154
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLNSGFFVFLLH+LY+VFL+RLGMKASL +PRWLE A+
Sbjct: 155 LLNSGFFVFLLHILYAVFLSRLGMKASLTMPRWLEKAI 192
>gi|357496929|ref|XP_003618753.1| Membrane protein [Medicago truncatula]
gi|355493768|gb|AES74971.1| Membrane protein [Medicago truncatula]
Length = 247
Score = 241 bits (614), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 115/158 (72%), Positives = 137/158 (86%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG+ AGRKTMVAT +LL VLSGPVKAL YLL HG++G++MG+LWRL W LS+FLC
Sbjct: 90 MRWGIDAGRKTMVATTILLFVLSGPVKALTYLLKHGIVGYTMGTLWRLETSWSLSVFLCA 149
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I RS GA+G++L SSFLIRENILALITINIHASLTF+ +AAGVN +PSMN IY +FG LV
Sbjct: 150 IVRSIGAVGFVLISSFLIRENILALITINIHASLTFVLTAAGVNSIPSMNFIYTLFGILV 209
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLN G F+FLLH+LYSVFLTR+GMK+SL+LPRWL+ A+
Sbjct: 210 LLNGGCFMFLLHMLYSVFLTRMGMKSSLKLPRWLKKAI 247
>gi|297851064|ref|XP_002893413.1| hypothetical protein ARALYDRAFT_890129 [Arabidopsis lyrata subsp.
lyrata]
gi|297339255|gb|EFH69672.1| hypothetical protein ARALYDRAFT_890129 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 238 bits (607), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 112/158 (70%), Positives = 141/158 (89%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW +A GRKTMVAT MLL +LSGPVKAL Y LTHG++G ++GSLWR+G W LSIF+CT
Sbjct: 136 IRWNIAGGRKTMVATVMLLFILSGPVKALTYFLTHGLVGLALGSLWRMGASWRLSIFVCT 195
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IFGT++
Sbjct: 196 MVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVL 255
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLNSGFFV LLHLLYS+FLTRLGMK+SLRLP WL+ A+
Sbjct: 256 LLNSGFFVLLLHLLYSIFLTRLGMKSSLRLPAWLDKAI 293
>gi|22329794|ref|NP_173943.2| uncharacterized protein [Arabidopsis thaliana]
gi|18252157|gb|AAL61911.1| unknown protein [Arabidopsis thaliana]
gi|21386935|gb|AAM47871.1| unknown protein [Arabidopsis thaliana]
gi|332192537|gb|AEE30658.1| uncharacterized protein [Arabidopsis thaliana]
Length = 289
Score = 237 bits (604), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 112/158 (70%), Positives = 140/158 (88%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW +A GRKTMVAT MLL +LSGPVKAL Y LTHG++G ++GSLW +G W LSIFLCT
Sbjct: 132 IRWNIAGGRKTMVATVMLLFILSGPVKALTYFLTHGLVGLALGSLWSMGASWRLSIFLCT 191
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+IFGT++
Sbjct: 192 MVRALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYMIFGTVL 251
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLNSGFFV LLHLLYS+FLTRLGMK+SLRLP WL+ A+
Sbjct: 252 LLNSGFFVLLLHLLYSIFLTRLGMKSSLRLPAWLDKAI 289
>gi|357139303|ref|XP_003571222.1| PREDICTED: uncharacterized protein LOC100830022 [Brachypodium
distachyon]
Length = 295
Score = 232 bits (592), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 107/155 (69%), Positives = 133/155 (85%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKTMVAT +LL LSGPVKA YLL HGV+G +MG++WRL +W +SI LC+
Sbjct: 119 LRWGLEAGRKTMVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 178
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYVIFGTL+
Sbjct: 179 ITRAVGACGYVLVSSFLIRENILELITVNIHASLTYILAAAGVNAIPSMDAIYVIFGTLL 238
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
LLN GFFVF+LH++Y++FLT+LG+K SLRLPRWL+
Sbjct: 239 LLNCGFFVFILHIIYTIFLTKLGIKPSLRLPRWLD 273
>gi|115445849|ref|NP_001046704.1| Os02g0326000 [Oryza sativa Japonica Group]
gi|46390264|dbj|BAD15693.1| unknown protein [Oryza sativa Japonica Group]
gi|113536235|dbj|BAF08618.1| Os02g0326000 [Oryza sativa Japonica Group]
gi|215736884|dbj|BAG95813.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765527|dbj|BAG87224.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222622737|gb|EEE56869.1| hypothetical protein OsJ_06502 [Oryza sativa Japonica Group]
Length = 281
Score = 225 bits (573), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 105/158 (66%), Positives = 133/158 (84%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKT VAT +LL LSGPVKA YLL HGV+G +MG++WRL +W +SI LC+
Sbjct: 124 LRWGLEAGRKTAVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 183
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 184 ITRALGACGYVLLSSFLIRENILGLITVNIHASLTYILAAAGVNTIPSMDAIYVLFGTLL 243
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLN GFF+FLLH++Y++FLT+LG+K SLR PRWL+ A+
Sbjct: 244 LLNCGFFIFLLHIMYTIFLTKLGIKPSLRPPRWLDKAI 281
>gi|218190626|gb|EEC73053.1| hypothetical protein OsI_07007 [Oryza sativa Indica Group]
Length = 281
Score = 224 bits (572), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 105/158 (66%), Positives = 133/158 (84%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKT VAT +LL LSGPVKA YLL HGV+G +MG++WRL +W +SI LC+
Sbjct: 124 LRWGLEAGRKTAVATVLLLFTLSGPVKASTYLLMHGVVGLAMGTMWRLETNWIVSIILCS 183
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENIL LIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 184 ITRALGACGYVLLSSFLIRENILGLITVNIHASLTYILAAAGVNTIPSMDAIYVLFGTLL 243
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LLN GFF+FLLH++Y++FLT+LG+K SLR PRWL+ A+
Sbjct: 244 LLNCGFFIFLLHIMYTIFLTKLGIKPSLRPPRWLDKAI 281
>gi|242091734|ref|XP_002436357.1| hypothetical protein SORBIDRAFT_10g000990 [Sorghum bicolor]
gi|241914580|gb|EER87724.1| hypothetical protein SORBIDRAFT_10g000990 [Sorghum bicolor]
Length = 286
Score = 221 bits (564), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 105/155 (67%), Positives = 130/155 (83%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 129 LRWGLKAGRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSILLCS 188
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM+ IYV+FGTL+
Sbjct: 189 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMDAIYVLFGTLL 248
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
L N FFVFLLH++Y+VFLT+LG+K SLR PRWL+
Sbjct: 249 LFNCAFFVFLLHIMYTVFLTKLGIKPSLRPPRWLD 283
>gi|326494316|dbj|BAJ90427.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326522989|dbj|BAJ88540.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 310
Score = 218 bits (556), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 109/157 (69%), Positives = 133/157 (84%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ AGRKTMVAT +LL LSGPVKAL YLL HGV+G SMG++WRL +W SI +C
Sbjct: 151 LRWGLEAGRKTMVATVLLLFTLSGPVKALTYLLMHGVVGLSMGTMWRLETNWIASIIICA 210
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L +SFLIRENI LIT+NIHASLT+I +AAGVN++PSM+ IYVIFGTLV
Sbjct: 211 IIRAVGACGYVLVTSFLIRENIFQLITVNIHASLTYILAAAGVNMIPSMDAIYVIFGTLV 270
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMA 157
LLNSGFFVF+LH++Y++FLT+LG+K SLRLPRWL A
Sbjct: 271 LLNSGFFVFILHIIYTIFLTKLGIKPSLRLPRWLGKA 307
>gi|195626624|gb|ACG35142.1| membrane protein [Zea mays]
Length = 192
Score = 204 bits (519), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 104/155 (67%), Positives = 129/155 (83%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 35 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 94
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM IY++FGTL+
Sbjct: 95 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIYLLFGTLL 154
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
LLN GFFVFLLH++Y+VFL +LG+K SLR PRWL+
Sbjct: 155 LLNCGFFVFLLHVMYTVFLAKLGIKPSLRSPRWLD 189
>gi|224033465|gb|ACN35808.1| unknown [Zea mays]
gi|413953514|gb|AFW86163.1| membrane protein [Zea mays]
Length = 278
Score = 204 bits (518), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 104/155 (67%), Positives = 129/155 (83%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM IY++FGTL+
Sbjct: 181 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIYLLFGTLL 240
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
LLN GFFVFLLH++Y+VFL +LG+K SLR PRWL+
Sbjct: 241 LLNCGFFVFLLHVMYTVFLAKLGIKPSLRSPRWLD 275
>gi|168067903|ref|XP_001785841.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662495|gb|EDQ49342.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 192
Score = 199 bits (505), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 92/158 (58%), Positives = 131/158 (82%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG AAGRKTMVATAMLLL+LSGP+KA +YLL HG +G +MG+ W+ +WGLSI CT
Sbjct: 35 MRWGAAAGRKTMVATAMLLLILSGPLKAASYLLMHGFVGLAMGAFWKWNFNWGLSIVGCT 94
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R GA+G++L +S+L+R+NILALIT+N HASL+++ +A G+N+VP+M +IYV+F +L+
Sbjct: 95 VVRCLGALGFVLLTSWLLRDNILALITMNAHASLSYMLAAVGINLVPTMPMIYVVFASLL 154
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
++N G FVFLLH+LY++FL RLG++ S+ +P W+E A+
Sbjct: 155 VVNCGSFVFLLHVLYAIFLRRLGIRGSITVPGWVERAI 192
>gi|302802149|ref|XP_002982830.1| hypothetical protein SELMODRAFT_47311 [Selaginella moellendorffii]
gi|300149420|gb|EFJ16075.1| hypothetical protein SELMODRAFT_47311 [Selaginella moellendorffii]
Length = 206
Score = 184 bits (467), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 87/158 (55%), Positives = 125/158 (79%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG AAGRKTMV TA+LLLV++GP+KAL+Y+L HG LG +MG+LWR ++WG+SI CT
Sbjct: 49 MRWGAAAGRKTMVTTALLLLVIAGPLKALSYVLMHGFLGVAMGALWRWRINWGVSILACT 108
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ GA+G++ SS+LI+ENIL LITIN HASL+++ +A G+NI +M IY +F +L+
Sbjct: 109 LVRAVGAIGFVFLSSWLIKENILNLITINTHASLSYMVAAMGINISLTMPTIYAVFTSLL 168
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
++N FVFLLH+LY++ L LG+K+SL +P W+ ++
Sbjct: 169 IVNCASFVFLLHVLYAIILQGLGIKSSLAVPGWMSRSM 206
>gi|302800209|ref|XP_002981862.1| hypothetical protein SELMODRAFT_57372 [Selaginella moellendorffii]
gi|300150304|gb|EFJ16955.1| hypothetical protein SELMODRAFT_57372 [Selaginella moellendorffii]
Length = 206
Score = 183 bits (465), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 87/158 (55%), Positives = 125/158 (79%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWG AAGRKTMV TA+LLLV++GP+KAL+Y+L HG LG +MG+LWR ++WG+SI CT
Sbjct: 49 MRWGAAAGRKTMVTTALLLLVIAGPLKALSYVLMHGFLGVAMGALWRWRINWGVSILACT 108
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ GA+G++ SS+LI+ENIL LITIN HASL+++ +A G+NI +M IY +F +L+
Sbjct: 109 LVRALGAIGFVFLSSWLIKENILNLITINTHASLSYMVAAMGINISLTMPTIYAVFTSLL 168
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
++N FVFLLH+LY++ L LG+K+SL +P W+ ++
Sbjct: 169 IVNCASFVFLLHVLYAIILQGLGIKSSLAVPGWMSRSM 206
>gi|226508684|ref|NP_001140399.1| uncharacterized protein LOC100272453 [Zea mays]
gi|194699326|gb|ACF83747.1| unknown [Zea mays]
gi|413953513|gb|AFW86162.1| hypothetical protein ZEAMMB73_298407 [Zea mays]
Length = 250
Score = 166 bits (421), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 80/123 (65%), Positives = 100/123 (81%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I R+ GA GY+L SSFLIRENILALIT+NIHASLT+I +AAGVN +PSM IY++FGTLV
Sbjct: 181 IVRALGACGYVLVSSFLIRENILALITVNIHASLTYILTAAGVNTIPSMGTIYLLFGTLV 240
Query: 121 LLN 123
+ +
Sbjct: 241 IPD 243
>gi|12321175|gb|AAG50677.1|AC079829_10 unknown protein [Arabidopsis thaliana]
Length = 275
Score = 165 bits (417), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 94/164 (57%), Positives = 117/164 (71%), Gaps = 23/164 (14%)
Query: 13 VATAMLLLVLSGPVKALAYL------------------LTHGVLGFSMGSLWRLGVDWGL 54
VAT MLL +LSGPVKAL Y LT G F S W G G+
Sbjct: 117 VATVMLLFILSGPVKALTYFVSLRFPRLPSRVLSRDHCLTFGDFIFVAYS-WSCGACTGI 175
Query: 55 SIFLCTIARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYV 114
+ + R+ G +GY+LTSSFLIRENILA+ITINIHASL+++F+A G+NI+PSM++IY+
Sbjct: 176 IVEV----RALGLIGYVLTSSFLIRENILAVITINIHASLSYVFTAMGLNIMPSMSLIYM 231
Query: 115 IFGTLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
IFGT++LLNSGFFV LLHLLYS+FLTRLGMK+SLRLP WL+ A+
Sbjct: 232 IFGTVLLLNSGFFVLLLHLLYSIFLTRLGMKSSLRLPAWLDKAI 275
>gi|413953512|gb|AFW86161.1| hypothetical protein ZEAMMB73_298407 [Zea mays]
Length = 254
Score = 157 bits (396), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/155 (55%), Positives = 109/155 (70%), Gaps = 24/155 (15%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG+ A RKT+VAT +LL LSGPVKA YLL HGV+G MG++WRL +W +SI LC+
Sbjct: 121 LRWGLEASRKTVVATVLLLFTLSGPVKASTYLLMHGVVGLIMGTVWRLETNWIVSIILCS 180
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I IT+NIHASLT+I +AAGVN +PSM IY++FGTL+
Sbjct: 181 I------------------------ITVNIHASLTYILTAAGVNTIPSMGTIYLLFGTLL 216
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
LLN GFFVFLLH++Y+VFL +LG+K SLR PRWL+
Sbjct: 217 LLNCGFFVFLLHVMYTVFLAKLGIKPSLRSPRWLD 251
>gi|255646056|gb|ACU23515.1| unknown [Glycine max]
Length = 223
Score = 125 bits (315), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 57/81 (70%), Positives = 70/81 (86%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRWGV AGRKT+VAT +LLLVLSGPVKAL YLL HG++GF+MG+LWR G W +SIFLCT
Sbjct: 130 MRWGVDAGRKTLVATTILLLVLSGPVKALTYLLKHGIVGFTMGTLWRTGASWNMSIFLCT 189
Query: 61 IARSAGAMGYILTSSFLIREN 81
I R+ GA+G++L SSFLIR++
Sbjct: 190 IVRALGAVGFVLISSFLIRKH 210
>gi|359473028|ref|XP_003631234.1| PREDICTED: uncharacterized protein LOC100256033 isoform 2 [Vitis
vinifera]
Length = 140
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 52/73 (71%), Positives = 68/73 (93%)
Query: 86 ITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLHLLYSVFLTRLGMK 145
IT+N+HAS+T++F+A GVN +PSM++IY +FGTL+LLNSGFFVFLLH+LY+VFL+RLGMK
Sbjct: 68 ITLNVHASITYMFTAIGVNTIPSMDLIYALFGTLLLLNSGFFVFLLHILYAVFLSRLGMK 127
Query: 146 ASLRLPRWLEMAL 158
ASL +PRWLE A+
Sbjct: 128 ASLTMPRWLEKAI 140
>gi|384245220|gb|EIE18715.1| hypothetical protein COCSUDRAFT_20460, partial [Coccomyxa
subellipsoidea C-169]
Length = 206
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 98/161 (60%), Gaps = 4/161 (2%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKA---LAYLLTHGVLGFSMGSLWRLGVDWGLSIF 57
MRWG AAGRKTM AT LLL + P+ L +LL G+L ++G+ W LG+ W LS+
Sbjct: 40 MRWGPAAGRKTMTATCFLLLGENFPLSTEDILTFLLNQGLLAATLGASWSLGMHWALSVA 99
Query: 58 LCTIARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
+ R G M Y+L SS+++ EN+ +++ NI++ L + + G + P V+ V+
Sbjct: 100 AGALVRVGGMMAYVLVSSWIMNENLFSILMTNIYSLLDQMNAFIGASGAPPPTVVIVVLA 159
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+L+++N+ ++FL+H+LY+V L +G S LP +E A+
Sbjct: 160 SLLIVNAVLYIFLMHVLYTVLLRSMGYTMS-SLPASVERAV 199
>gi|302840505|ref|XP_002951808.1| hypothetical protein VOLCADRAFT_81593 [Volvox carteri f.
nagariensis]
gi|300263056|gb|EFJ47259.1| hypothetical protein VOLCADRAFT_81593 [Volvox carteri f.
nagariensis]
Length = 330
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 84/145 (57%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG +TM AT LL+VL GP++A++YL HG+L ++GSLW V + +
Sbjct: 167 LRSGGGAGWRTMTATCFLLVVLLGPLRAMSYLFLHGLLAATLGSLWSSRVGFWPGVVAGA 226
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R G + Y++ SS + EN+ AL+ N++ L +A G++ PS + +L+
Sbjct: 227 LVRMCGQLSYLVMSSVTMNENMFALLLSNVYNMLDQASAALGLSGAPSPLAVNCTISSLL 286
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMK 145
L+N + FL+H++Y V L +G +
Sbjct: 287 LINGLTYCFLVHVVYRVLLGTMGYQ 311
>gi|412985991|emb|CCO17191.1| predicted protein [Bathycoccus prasinos]
Length = 243
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 77/138 (55%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
RWG A +T++ +LL +LSGP++A Y HG G ++G W+ W S+ + +
Sbjct: 78 RWGDVAAWRTLLVAFLLLTLLSGPLRASNYACLHGFTGLALGMCWKRKFSWVQSVPISAL 137
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVL 121
AR+ G + SS +IREN+L L+ ++A + I + G VP+M I + V+
Sbjct: 138 ARTIGVFSSLAISSLVIRENVLRLMVQQVYALIDQICATLGATFVPNMTAILCVAFVFVI 197
Query: 122 LNSGFFVFLLHLLYSVFL 139
LNS + +LHL+YS+ L
Sbjct: 198 LNSLSYTLILHLVYSIAL 215
>gi|427725977|ref|YP_007073254.1| hypothetical protein Lepto7376_4311 [Leptolyngbya sp. PCC 7376]
gi|427357697|gb|AFY40420.1| Protein of unknown function DUF2232, membrane [Leptolyngbya sp. PCC
7376]
Length = 248
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 86/156 (55%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL +L GP +++ Y++ +G++G +G +WR V W SI L T
Sbjct: 88 LRWGSRAAWMATLVSGLLLSILMGPTRSIIYVIPYGIMGVQLGWMWRKQVGWEWSIALGT 147
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + SFL+ E++ + I +IF G+ + P++ I ++ ++
Sbjct: 148 VLGTIGFFFRVWLLSFLLGEDLWVYVISQISELADWIFLHLGLLVQPNLMFIQLLAIAML 207
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
LLNS ++F++HL+ V L RLG R P W+++
Sbjct: 208 LLNSVLYLFVVHLVALVMLDRLGNPIP-RPPYWVQV 242
>gi|427712923|ref|YP_007061547.1| hypothetical protein Syn6312_1862 [Synechococcus sp. PCC 6312]
gi|427377052|gb|AFY61004.1| putative membrane protein (DUF2232) [Synechococcus sp. PCC 6312]
Length = 251
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 77/158 (48%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A + +A+LL VL GP ++L YLL HG+LG +G LW+ W +S+F
Sbjct: 90 LRWHRRAAWMGAIVSALLLSVLMGPPRSLQYLLPHGILGVVLGGLWKQKAPWHISLFWGA 149
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ S G IL S L+ EN+ + + + L + G+ ++PS ++ LV
Sbjct: 150 VLGSIGFFFQILVVSALLGENLWIYVNQQVTSFLDWALVQLGILLIPSFGLVQATAVVLV 209
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
S + ++H++ L RLG P WLE L
Sbjct: 210 FCQSYVYALIVHMVAWRLLARLGNPIP-DPPAWLEAVL 246
>gi|307109189|gb|EFN57427.1| hypothetical protein CHLNCDRAFT_21406, partial [Chlorella
variabilis]
Length = 206
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 85/145 (58%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G KT+ T +LLL+L GPV+A YLL +GVL S+ W L + W +SI L
Sbjct: 44 LRSGPVPAVKTLTVTCLLLLILMGPVRATTYLLVYGVLSLSLAVCWALRLPWVVSIPLGA 103
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+AR G + YI SS++ EN+L L+ N++A L + + G P + + ++ +L+
Sbjct: 104 LARIGGYLAYIALSSWVTNENLLMLMLNNVYALLDQLSALLGTTGSPPLLAVAIVLCSLL 163
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMK 145
+NS +V L H+LY++ L +G++
Sbjct: 164 FVNSIMYVALSHVLYAIMLRGMGLE 188
>gi|255085474|ref|XP_002505168.1| predicted protein [Micromonas sp. RCC299]
gi|226520437|gb|ACO66426.1| predicted protein [Micromonas sp. RCC299]
Length = 178
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 78/138 (56%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
R G A + V T+MLL ++SGP++A+ Y HG + +S+G W W +SI L
Sbjct: 41 RNGPKAASRVAVVTSMLLFLISGPLRAVNYYCLHGAMAYSLGWAWTKRFSWWVSIPLSAT 100
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVL 121
RS G ++ +S ++REN++ L+ + L I + G++ +P++ ++ + VL
Sbjct: 101 VRSIGIFCSLMFTSLVLRENVMQLMVTQMMGLLDQIAANIGMSWMPTLGWVWAMALFFVL 160
Query: 122 LNSGFFVFLLHLLYSVFL 139
LNS +V +LH ++++ L
Sbjct: 161 LNSLSYVVILHAVFTILL 178
>gi|428219126|ref|YP_007103591.1| hypothetical protein Pse7367_2910 [Pseudanabaena sp. PCC 7367]
gi|427990908|gb|AFY71163.1| Protein of unknown function DUF2232, membrane [Pseudanabaena sp.
PCC 7367]
Length = 209
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 82/155 (52%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A TM+ T +LL VL GP +++ +L+ HG +G +G+LWR W SIF T
Sbjct: 52 LRWDRRAAIMTMIVTFLLLTVLMGPTRSIQFLIPHGFIGLFLGALWRRRSPWLASIFTGT 111
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G++ + S L+ EN+ +T + +I G P + ++ ++ +
Sbjct: 112 VLGTLGSIFQLYLVSLLLGENVWIYLTSQAAKFVGWIMGLLGSLDEPQLLIVQIVTAGGI 171
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
+ +S ++ L+HL+ V L RLG S P+W+E
Sbjct: 172 VFSSLMYMILVHLVAWVLLERLGNPIS-PAPQWVE 205
>gi|428300982|ref|YP_007139288.1| hypothetical protein Cal6303_4411 [Calothrix sp. PCC 6303]
gi|428237526|gb|AFZ03316.1| Protein of unknown function DUF2232, membrane [Calothrix sp. PCC
6303]
Length = 235
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 84/156 (53%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWGV A + V + +LL VL GP+++L +++ G +G ++GS W V W +S+ L T
Sbjct: 76 LRWGVRAAWMSAVVSGLLLAVLMGPIRSLLFVMPFGFMGVTLGSTWYRRVPWLVSMSLGT 135
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I + G + S L E++ + + + ++F GV PS+ I V+ +
Sbjct: 136 ILGTVGVFFRVWLLSILAGEDLWIYVINQMTDLIEWVFLKLGVLANPSLLWIQVVAIAFI 195
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+ N+ ++FL+H+ + L RLG R P+W+++
Sbjct: 196 IFNNFIYLFLVHIAAWLLLDRLGNPIP-RPPKWVQV 230
>gi|428771033|ref|YP_007162823.1| hypothetical protein Cyan10605_2706 [Cyanobacterium aponinum PCC
10605]
gi|428685312|gb|AFZ54779.1| Protein of unknown function DUF2232, membrane [Cyanobacterium
aponinum PCC 10605]
Length = 248
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 83/158 (52%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A T + +LL +L GP +++ Y++ +G++G +G+LWR GV+W +SIF+ +
Sbjct: 89 LRRGKRASIITTIVCGLLLAILMGPPRSIVYIVPYGLMGIQLGALWRRGVNWYVSIFVAS 148
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ G S L+ E++ + I +IF G+ P M I ++ +V
Sbjct: 149 LIGCFGFFFRFWLFSILLGEDLWVYVISQITNLADWIFLKLGILARPDMITIQLLAIAVV 208
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
++N+ + +HL+ + L RL R P+WL++ L
Sbjct: 209 MVNNIIYSLTVHLVALLMLDRLKTPIP-RPPKWLQVIL 245
>gi|443477605|ref|ZP_21067439.1| Protein of unknown function DUF2232, membrane [Pseudanabaena biceps
PCC 7429]
gi|443017225|gb|ELS31713.1| Protein of unknown function DUF2232, membrane [Pseudanabaena biceps
PCC 7429]
Length = 208
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 78/155 (50%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A MV T++LL +L GP +++ Y++ HG++G ++G LW+ W S+ + T
Sbjct: 51 LRWNSRAAWMAMVVTSLLLAILMGPTRSIQYIVPHGLVGVTLGYLWKRDFPWSGSLSIAT 110
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I S G + S L+ EN+ + I A L ++ G P +I +
Sbjct: 111 IIGSVGTAFQFVFLSVLLGENVWTYSIVQISAFLNWLMQLFGSLEQPDFAMIQAFAIGSI 170
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
L+++ + L+HL+ + L R+G P+WLE
Sbjct: 171 LISNLAYQLLVHLVAWIVLDRIGNPIP-HPPKWLE 204
>gi|218245754|ref|YP_002371125.1| hypothetical protein PCC8801_0891 [Cyanothece sp. PCC 8801]
gi|257058799|ref|YP_003136687.1| hypothetical protein Cyan8802_0917 [Cyanothece sp. PCC 8802]
gi|218166232|gb|ACK64969.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256588965|gb|ACU99851.1| putative membrane protein [Cyanothece sp. PCC 8802]
Length = 260
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 83/158 (52%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ +L+ +G++G +G W+ G +WG S+
Sbjct: 101 LRWGHRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGIQLGYCWQRGANWGFSLVTGA 160
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ E++ + + ++F G+ P+ +I ++ L+
Sbjct: 161 LIGTFGFFFRFWLFSILLGEDLWQYVITQVTELAEWVFLKLGILSQPTFELIQLLAVVLI 220
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
L+NS ++F +HL+ + L +LG R P+W+++ L
Sbjct: 221 LINSLIYLFAVHLVSLLVLDKLGNPIP-RPPKWVKVIL 257
>gi|428207947|ref|YP_007092300.1| hypothetical protein Chro_2964 [Chroococcidiopsis thermalis PCC
7203]
gi|428009868|gb|AFY88431.1| Protein of unknown function DUF2232, membrane [Chroococcidiopsis
thermalis PCC 7203]
Length = 232
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 86/159 (54%), Gaps = 7/159 (4%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW AG + +LL VL GP+ +L Y++ +GV+G +G+ WR W +S+ L T
Sbjct: 73 LRWSARAGWMAAGVSGLLLAVLMGPIPSLLYVMPYGVMGVLLGAAWRRRSPWIVSVALGT 132
Query: 61 IARSAGAMGYILT---SSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
+ GA+G+ S ++ + + + +IF G+ + PSM VI V+
Sbjct: 133 L---LGAIGFFFRFWLMSIWSGRDLWVYLIVEVTELAEWIFDKLGLLLQPSMLVIQVLAL 189
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
LVL+++ ++F++HL+ + L RLG R PRW+++
Sbjct: 190 ALVLVSNLVYLFVVHLVAWLLLDRLGNPIP-RPPRWVQV 227
>gi|428778778|ref|YP_007170564.1| hypothetical protein Dacsa_0419 [Dactylococcopsis salina PCC 8305]
gi|428693057|gb|AFZ49207.1| putative membrane protein (DUF2232) [Dactylococcopsis salina PCC
8305]
Length = 258
Score = 69.3 bits (168), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 84/161 (52%), Gaps = 7/161 (4%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A +++ +LL +L GP +++ Y++ +GV+G +G+ WR G WG S+ T
Sbjct: 99 LRRGRRASGMSVITVGLLLSILMGPTRSILYVIPYGVMGLQLGAFWRRGFGWGQSV---T 155
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
I G +G+ S + E++ +T + L + F G+ PS+ ++ I
Sbjct: 156 IGAIIGTLGFFFRFWLLSIFLGEDLWVYVTSQMTQFLEWGFLQLGLLAQPSLWLVQAIAL 215
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
++LNS +VF +HL+ + L RLG PRWL++ L
Sbjct: 216 ATIILNSIIYVFAVHLVSLLILNRLGNPIP-SPPRWLQVIL 255
>gi|126654855|ref|ZP_01726389.1| hypothetical protein CY0110_10477 [Cyanothece sp. CCY0110]
gi|126623590|gb|EAZ94294.1| hypothetical protein CY0110_10477 [Cyanothece sp. CCY0110]
Length = 240
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 82/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + + +LL VL GP +++ +L+ +GV+G +G+ WR G+ W S+
Sbjct: 81 LRWGNRSAWMGALVSGLLLTVLMGPTRSIVFLIPYGVMGVQLGAFWRRGLPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ E++ I + +IF G+ PS I ++ LV
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQMTGLAEWIFLKLGILTQPSAIFIQLLAVVLV 200
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L +LG + R P+W+++ L
Sbjct: 201 FVNSLIYLFAVHLVALLVLDKLGNRIP-RPPKWVQVIL 237
>gi|172036337|ref|YP_001802838.1| hypothetical protein cce_1422 [Cyanothece sp. ATCC 51142]
gi|354553126|ref|ZP_08972433.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. ATCC
51472]
gi|171697791|gb|ACB50772.1| conserved hypothetical protein [Cyanothece sp. ATCC 51142]
gi|353554956|gb|EHC24345.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. ATCC
51472]
Length = 240
Score = 68.6 bits (166), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 81/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + + +LL VL GP +++ +L+ +GV+G +G+ WR GV W S+
Sbjct: 81 LRWGSRSAWMGALVSGLLLTVLMGPTRSIVFLIPYGVMGVQLGAFWRRGVPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ E++ I + +IF G+ PS I ++ L+
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQMTGLAEWIFLRLGILAQPSAVFIQLLAVVLI 200
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L +LG R P+W+++ L
Sbjct: 201 FVNSLIYLFAVHLVALLVLDKLGNPIP-RPPKWVQVIL 237
>gi|376001833|ref|ZP_09779687.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
gi|375329744|emb|CCE15440.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
Length = 226
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 82/161 (50%), Gaps = 7/161 (4%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G + +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
IA GA+G+ S L+ ++ T I L IF G+ PS+ I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLSRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIVAV 182
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+VL N+ ++FL+HL+ + L RL R PRW+++ L
Sbjct: 183 LMVLANNIVYLFLVHLVSLLMLDRLKSPIP-RPPRWVQILL 222
>gi|307153482|ref|YP_003888866.1| hypothetical protein Cyan7822_3654 [Cyanothece sp. PCC 7822]
gi|306983710|gb|ADN15591.1| Protein of unknown function DUF2232, membrane [Cyanothece sp. PCC
7822]
Length = 265
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 80/158 (50%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ + + +GV+G +G+ WR G W LS+ L T
Sbjct: 106 LRWGQRASWMAAIVSGLLLSVLMGPTRSIVFFIPYGVMGVQLGACWRRGASWYLSMLLGT 165
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I G I S L+ E++ + L + F G+ P + ++ ++ +
Sbjct: 166 IIGVFGLFFRIWLFSILLGEDLWLYAITQVTGFLDWAFMKLGLLAQPDILLVQLLAVIAI 225
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+ N+ ++F +HL+ + L RLG R P W+++ L
Sbjct: 226 IFNNFLYLFAVHLVALLVLDRLGNPIP-RPPSWVQVIL 262
>gi|209522908|ref|ZP_03271465.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|209496495|gb|EDZ96793.1| conserved hypothetical protein [Arthrospira maxima CS-328]
Length = 226
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 82/161 (50%), Gaps = 7/161 (4%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G + +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
IA GA+G+ S L+ ++ T I L IF G+ PS+ I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIVAV 182
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+VL N+ ++FL+HL+ + L RL R PRW+++ L
Sbjct: 183 LMVLANNIVYLFLVHLVSLLMLDRLKSPIP-RPPRWVQILL 222
>gi|423062304|ref|ZP_17051094.1| hypothetical protein SPLC1_S031930 [Arthrospira platensis C1]
gi|406716212|gb|EKD11363.1| hypothetical protein SPLC1_S031930 [Arthrospira platensis C1]
Length = 226
Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 82/161 (50%), Gaps = 7/161 (4%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G + +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMAVVLGGLWKRGAKWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
IA GA+G+ S L+ ++ T I L IF G+ PS+ I ++
Sbjct: 123 IASLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGILAPPSLLFIQIVAV 182
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+VL N+ ++FL+HL+ + L RL R PRW+++ L
Sbjct: 183 LMVLANNIVYLFLVHLVSLLMLDRLKSPIP-RPPRWVQILL 222
>gi|190192140|dbj|BAG48275.1| hypothetical protein MAE_15185 [Microcystis aeruginosa NIES-843]
Length = 251
Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 81/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCASMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + I +
Sbjct: 152 LLGAFGFFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTLLQVQSIAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG S R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIS-RPPKWVQVIL 248
>gi|434389139|ref|YP_007099750.1| putative membrane protein (DUF2232) [Chamaesiphon minutus PCC 6605]
gi|428020129|gb|AFY96223.1| putative membrane protein (DUF2232) [Chamaesiphon minutus PCC 6605]
Length = 199
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 84/155 (54%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG AG + + + +LL VL GPV+++ +++ +GV+G +G+ WR W SI L
Sbjct: 41 LRWGNRAGWMSALVSGLLLSVLMGPVRSILFVMPYGVMGVQLGACWRRQASWLTSISLGA 100
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ G +S L+ E++ + I ++F G+ PS+ VI ++ +L+
Sbjct: 101 LVDCLGVFFRFGLTSALLGEDLWMYLMARIRDLSEWLFVQLGILAEPSLLVIQILAISLI 160
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
LL++ +VF++HL+ + RLG + P W+E
Sbjct: 161 LLSNFVYVFVVHLVAILMFERLGNPLP-KPPAWVE 194
>gi|284929773|ref|YP_003422295.1| hypothetical protein UCYN_12480 [cyanobacterium UCYN-A]
gi|284810217|gb|ADB95914.1| Predicted membrane protein (DUF2232) [cyanobacterium UCYN-A]
Length = 236
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 82/159 (51%), Gaps = 3/159 (1%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + + +LL VL GP++++ ++ +G+L +G+ WR + W SIF+
Sbjct: 78 LRWGSRSAWMGWLVSGLLLTVLMGPIQSILFITNYGLLSIQLGAFWRKNISWEFSIFIGA 137
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASL-TFIFSAAGVNIVPSMNVIYVIFGTL 119
I S L E++ +IN SL ++F G I PS ++ + L
Sbjct: 138 IISIFSFFFKFWLFSILTGEDLWQY-SINQMTSLAEWLFLKFGTLIQPSFLLVQLFTCLL 196
Query: 120 VLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+ +NS ++F +H++ + L +LG + R P+W+++ L
Sbjct: 197 IFINSIIYLFAVHVVALMVLDKLGNPIT-RPPKWVQIIL 234
>gi|443325608|ref|ZP_21054294.1| putative membrane protein (DUF2232) [Xenococcus sp. PCC 7305]
gi|442794767|gb|ELS04168.1| putative membrane protein (DUF2232) [Xenococcus sp. PCC 7305]
Length = 251
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 82/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG + + +LL VL GPV+++ +++ +G++G +G+ W V W LS+ L
Sbjct: 93 LRRGNRAGWMSAWVSMLLLSVLMGPVRSIVFIIPYGLIGIQLGACWVRRVHWALSVLLGA 152
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ S G S L+ E++ + I ++F G+ P + +I ++ ++
Sbjct: 153 LIGSLGLFFRFWLFSILLGEDLWVYVMTQITQLADWLFLKIGILAQPDIILIQILALVMI 212
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+LN+ ++F +HL+ + RLG R P W+++ L
Sbjct: 213 ILNNLIYLFAVHLVALILFDRLGNPIP-RPPEWVQVLL 249
>gi|218439630|ref|YP_002377959.1| hypothetical protein PCC7424_2679 [Cyanothece sp. PCC 7424]
gi|218172358|gb|ACK71091.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 269
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 80/154 (51%), Gaps = 1/154 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A T + + +LL VL GP +++ +L+ +GV+G +G+ W+ G W S+ L T
Sbjct: 110 LRWGKRASWMTAIVSGLLLSVLMGPTRSIVFLIPYGVMGVQLGACWKRGAPWQWSMLLGT 169
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I G I S L+ E++ A L + F G+ I P++ +I I +
Sbjct: 170 IIGVFGLFFRIWLFSILLGEDLWLYAITQATAFLDWAFMKLGLLIEPTVLMIQSIAIIAI 229
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWL 154
++N+ ++F +HL+ + L RLG R P W+
Sbjct: 230 IINNLLYLFAVHLIALLVLDRLGNPIP-RPPSWI 262
>gi|428771964|ref|YP_007163752.1| hypothetical protein Cyast_0120 [Cyanobacterium stanieri PCC 7202]
gi|428686243|gb|AFZ46103.1| Protein of unknown function DUF2232, membrane [Cyanobacterium
stanieri PCC 7202]
Length = 247
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 81/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MR G A T + + +LL +L GP +++ Y++ +G++G +G +WR G +W LS+ +
Sbjct: 89 MRRGRRASIMTTIVSGLLLSILMGPPRSIIYIIPYGLMGIQLGGMWRNGANWHLSLLTGS 148
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I S G S L+ E++ + I +IF G+ PS VI ++ +V
Sbjct: 149 IIGSFGFFFRFWLFSILLGEDLWIYVINQITGLADWIFLQLGILSQPSTLVIQLLILLVV 208
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+N+ + F +HL+ + L RL R P W+++ L
Sbjct: 209 FVNNLIYCFTVHLVALLMLDRLKNPIP-RPPEWIKVIL 245
>gi|409990906|ref|ZP_11274222.1| hypothetical protein APPUASWS_07925 [Arthrospira platensis str.
Paraca]
gi|291567238|dbj|BAI89510.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938233|gb|EKN79581.1| hypothetical protein APPUASWS_07925 [Arthrospira platensis str.
Paraca]
Length = 226
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 81/161 (50%), Gaps = 7/161 (4%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A V + +LL VL GP +++ Y++ +G +G +G LW+ G W +++
Sbjct: 66 LRWDRRAAIMGAVTSGLLLTVLMGPTRSILYVIPYGWMGVVLGGLWKRGAMWPVTV---A 122
Query: 61 IARSAGAMGYILTS---SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
I GA+G+ S L+ ++ T I L IF GV PS+ I +
Sbjct: 123 IGSLIGAIGFFFRYWLLSILLGRDLWVYATTQIAQLLESIFIRLGVLAPPSLLFIQIFAV 182
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+VL N+ ++F++HL+ + L RL R PRW+++ L
Sbjct: 183 LMVLANNIVYLFVVHLVSLLMLDRLKSPIP-RPPRWVQILL 222
>gi|425467150|ref|ZP_18846434.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389830158|emb|CCI28064.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 251
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 81/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCASMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + I +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTLLQVQSIAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG S R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIS-RPPKWVQVIL 248
>gi|22298945|ref|NP_682192.1| hypothetical protein tlr1402 [Thermosynechococcus elongatus BP-1]
gi|22295126|dbj|BAC08954.1| tlr1402 [Thermosynechococcus elongatus BP-1]
Length = 236
Score = 62.8 bits (151), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 72/158 (45%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A + +A+LL VL GP ++L +LL HG++G G W WG SI
Sbjct: 76 LRWNRRAAWMAAIVSALLLSVLMGPPRSLQFLLPHGIMGVIFGGCWAKKESWGRSILSGA 135
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ +AG I S L+ EN+ + + ++ + + P++ +I V+ LV
Sbjct: 136 VIGTAGFFFQISLVSILLGENLWIYFNQQVTNFIDWVLVNLNLLLEPTVTLIQVVALLLV 195
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
S + ++H+L L RLG P WL L
Sbjct: 196 FFQSTVYALVVHILAWTLLERLGNPIP-DPPPWLRTIL 232
>gi|86605519|ref|YP_474282.1| membrane protein [Synechococcus sp. JA-3-3Ab]
gi|86554061|gb|ABC99019.1| putative membrane proteinprotein [Synechococcus sp. JA-3-3Ab]
Length = 227
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 80/161 (49%), Gaps = 9/161 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRW G + +A+LL VL GP +++ Y++ +G+LG+ LWR + W +S+
Sbjct: 66 MRWDPRTGAMALAVSALLLTVLMGPTRSILYVIPYGLLGYWCARLWRRRLSWYVSVASGA 125
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAA----GVNIVPSMNVIYVIF 116
+ G + +L SS L+ EN+ +TI + ++ + GV V ++ V+
Sbjct: 126 ALSAFGLVFQLLLSSLLVGENLWTYVTIQLTGLTNWLLDVSLARLGVYWVAEPWMVQVVV 185
Query: 117 GTLVLLNSGFFVFLLHLLYSVFLT--RLGMKASLRLPRWLE 155
+ NS +VF +HL+ ++ + R + P+W++
Sbjct: 186 AGFIAFNSLVYVFTVHLVAALVMEHFRCPLPPP---PKWVQ 223
>gi|427705814|ref|YP_007048191.1| hypothetical protein Nos7107_0357 [Nostoc sp. PCC 7107]
gi|427358319|gb|AFY41041.1| Protein of unknown function DUF2232, membrane [Nostoc sp. PCC 7107]
Length = 254
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 81/156 (51%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ + +G +G+ W V W +SI L
Sbjct: 94 LRWGQRAAWMAAVTSGLLLSVLMGPVRSLLFVMPYAFMGVLLGATWHRRVPWIVSITLAA 153
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + +IF G+ PS+ +I V LV
Sbjct: 154 LLGTLGFFFRMWLLSILSGEDLWIYLINQVTEITEWIFLKLGLLNTPSVFLIQVGAFALV 213
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
++N+ ++F++HL + L RLG R PRW+++
Sbjct: 214 VINNFIYLFVVHLAAWLLLDRLGNPIP-RPPRWVQV 248
>gi|359457479|ref|ZP_09246042.1| hypothetical protein ACCM5_02044 [Acaryochloris sp. CCMEE 5410]
Length = 215
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 79/155 (50%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + T +L+ VL GP +++ +L+ +GV+G +G LW+ V W +S+
Sbjct: 54 LRWGNRSAWMSAWVTTLLITVLMGPPRSIQFLMPYGVVGVMLGGLWKRQVSWAISMGWSI 113
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L+ N+ I I L ++ G+ + P + VI + L+
Sbjct: 114 LIMAIGFFFQLNLLSLLVGTNLWIYINRQITGFLDWVVIKLGLLLQPDVIVIQLFAVGLI 173
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
LLN+ ++ L+HL+ + R+G+ P WL+
Sbjct: 174 LLNAFLYILLVHLVAWLVFDRIGVSIP-DPPLWLQ 207
>gi|425444816|ref|ZP_18824857.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9443]
gi|425454338|ref|ZP_18834084.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9807]
gi|389735353|emb|CCI01127.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9443]
gi|389805030|emb|CCI15498.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9807]
Length = 251
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 81/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + I +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTLLQVQSIAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG S R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIS-RPPKWVQVIL 248
>gi|158338243|ref|YP_001519420.1| hypothetical protein AM1_5139 [Acaryochloris marina MBIC11017]
gi|158308484|gb|ABW30101.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 215
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 78/154 (50%), Gaps = 1/154 (0%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
RWG + + T +L+ VL GP +++ +L+ +GV+G +G LW+ V W +S+ +
Sbjct: 55 RWGNRSAWMSAWVTTLLITVLMGPPRSIQFLMPYGVVGVILGGLWKRQVSWAISMGWSIL 114
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVL 121
+ G + S L+ N+ I I L ++ G+ + P + VI + L+L
Sbjct: 115 VMAIGFFFQLNLLSLLVGTNLWIYINRQITGFLDWVVIKLGLLLQPDVIVIQLFAVGLIL 174
Query: 122 LNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
LN+ ++ L+HL+ + R+G+ P WL+
Sbjct: 175 LNAFLYILLVHLVAWLVFDRIGVSIP-DPPLWLQ 207
>gi|428204395|ref|YP_007082984.1| hypothetical protein Ple7327_4314 [Pleurocapsa sp. PCC 7327]
gi|427981827|gb|AFY79427.1| putative membrane protein [Pleurocapsa sp. PCC 7327]
Length = 261
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 83/158 (52%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A T + + +LL VL GP +++ YL+ +G++G +G+ WR G W SIF
Sbjct: 102 LRWSNRASWMTTLVSVLLLTVLMGPPRSIIYLIPYGLMGVQLGACWRRGAHWWFSIFTGA 161
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ ++G S L+ E++ + + + ++F G+ PS+ +I +I ++
Sbjct: 162 LIGTSGFFFRFWLFSILLGEDLWLYVITQVTEIVDWVFLRLGLLAQPSVLLIQIIAVLMI 221
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LN ++F +HL+ + L RLG R P W+++ L
Sbjct: 222 FLNDLIYLFAVHLVALLVLDRLGNPIP-RPPEWVQVLL 258
>gi|425471084|ref|ZP_18849944.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9701]
gi|389883066|emb|CCI36507.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9701]
Length = 251
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 81/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + I +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTLLQVQSIAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG S R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIS-RPPKWVQVIL 248
>gi|428222487|ref|YP_007106657.1| hypothetical protein Syn7502_02551 [Synechococcus sp. PCC 7502]
gi|427995827|gb|AFY74522.1| putative membrane protein (DUF2232) [Synechococcus sp. PCC 7502]
Length = 207
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 77/155 (49%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG TM T +LL +L GP +++ +L+ HG++ +G LW+ W +++ T
Sbjct: 50 LRWGQRCAWMTMTVTVLLLTILMGPTRSIQFLIPHGLMAVMLGFLWKRRSPWVVTLTFGT 109
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+A +AG S L+ EN+ A TI + + + F G P + +I + +
Sbjct: 110 LAGAAGVGFQFWLLSALLGENLWAYSTIQVTELIRWGFQLVGSLEQPELWLIQTMAIAGI 169
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
++ + ++ L+H+ + RLG S P WL+
Sbjct: 170 VIANFAYMILVHVAAWLLCDRLGNPIS-PPPEWLQ 203
>gi|427731990|ref|YP_007078227.1| hypothetical protein Nos7524_4904 [Nostoc sp. PCC 7524]
gi|427367909|gb|AFY50630.1| putative membrane protein (DUF2232) [Nostoc sp. PCC 7524]
Length = 243
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 80/156 (51%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ +G +G+ W V W +SI L T
Sbjct: 85 LRWGKRASWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWYRRVPWLVSITLGT 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + +IF G+ PS+ VI + L+
Sbjct: 145 VLGTLGVFFRLWLLSILSGEDLWIYVINQVTELTEWIFLRLGLLATPSVFVIQIGAIALI 204
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
++N+ ++F++HL + L RLG R P W+++
Sbjct: 205 VINNFIYLFIVHLAAWLLLDRLGNPIP-RPPNWVQV 239
>gi|443648943|ref|ZP_21130125.1| hypothetical protein C789_665 [Microcystis aeruginosa DIANCHI905]
gi|159026388|emb|CAO90486.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443335056|gb|ELS49539.1| hypothetical protein C789_665 [Microcystis aeruginosa DIANCHI905]
Length = 251
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 80/158 (50%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGIQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + I +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTLLQVQSIAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIP-RPPKWVQVIL 248
>gi|425441756|ref|ZP_18822023.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389717455|emb|CCH98462.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
Length = 251
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 81/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + + +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTLLQVQSVAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG S R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIS-RPPKWVQVIL 248
>gi|425461780|ref|ZP_18841254.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9808]
gi|389825302|emb|CCI25026.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9808]
Length = 251
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 80/158 (50%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + I +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTLLQVQSIAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIP-RPPKWVQVIL 248
>gi|425436381|ref|ZP_18816817.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9432]
gi|425449736|ref|ZP_18829571.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
7941]
gi|440752045|ref|ZP_20931248.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
gi|389678911|emb|CCH92273.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9432]
gi|389769737|emb|CCI05505.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
7941]
gi|440176538|gb|ELP55811.1| putative membrane protein [Microcystis aeruginosa TAIHU98]
Length = 251
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 80/158 (50%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGVQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + I +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTQLGILAKPTLLQVQSIAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+NS ++F +HL+ + L RLG R P+W+++ L
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIP-RPPKWVQVIL 248
>gi|86609748|ref|YP_478510.1| hypothetical protein CYB_2307 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558290|gb|ABD03247.1| putative membrane protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 227
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 40/162 (24%), Positives = 80/162 (49%), Gaps = 5/162 (3%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
MRW G +V + +LL VL GP +++ Y++ +G+LG+ LW+ + W +S+
Sbjct: 66 MRWDPRTGAMALVVSGLLLTVLIGPTRSILYVIPYGLLGYWCARLWQQRLSWYVSVVSGA 125
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAA----GVNIVPSMNVIYVIF 116
+ + G + ++ SS L+ EN+ +TI + ++ + GV V ++ V+
Sbjct: 126 VLSALGLIFQLVLSSLLVGENLWIYVTIQLTGLTNWLLDVSLSRWGVYWVAEPWMVQVVV 185
Query: 117 GTLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+ NS +VF +HL+ ++ + P+W++ L
Sbjct: 186 VGFIAFNSLIYVFTVHLVAALVMEHFRCPLP-PPPKWVQFLL 226
>gi|422303788|ref|ZP_16391139.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9806]
gi|389791182|emb|CCI12970.1| Genome sequencing data, contig C262 [Microcystis aeruginosa PCC
9806]
Length = 251
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 79/155 (50%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + + +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTLLQVQSVAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
+NS ++F +HL+ + L RLG S R P+W++
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIS-RPPKWVQ 245
>gi|390440736|ref|ZP_10228944.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
gi|389835965|emb|CCI33070.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
Length = 251
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 79/155 (50%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG T + +++LL VL GP +++ +L+ HG++G +G WR G W SI
Sbjct: 92 LRWGARCAWMTALVSSLLLSVLMGPTRSIVFLIPHGLMGLQLGLCWRRGARWEFSILTGG 151
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ EN+ + + +IF+ G+ P++ + + +
Sbjct: 152 LLGAFGLFFRFWLFSILLGENLWLYVITQATNFIDWIFTWLGILAKPTLLQVQSVAVVGI 211
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
+NS ++F +HL+ + L RLG S R P+W++
Sbjct: 212 FVNSLLYLFAVHLVALLVLDRLGNPIS-RPPKWVQ 245
>gi|170078396|ref|YP_001735034.1| hypothetical protein SYNPCC7002_A1790 [Synechococcus sp. PCC 7002]
gi|169886065|gb|ACA99778.1| conserved hypothetical membrane protein [Synechococcus sp. PCC
7002]
Length = 243
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 86/156 (55%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +++LL +L GP +++ Y + +G++G +G +WR V W S+ L T
Sbjct: 83 LRWGSRAAWMSTLVSSLLLSILMGPTRSIIYTIPYGLIGVQLGWMWRRQVSWYWSVTLGT 142
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I + G + S L+ E++ + I ++F G+ PS+ VI ++ ++
Sbjct: 143 ILGTLGFFFRVWLLSLLLGEDLWVYVISQISEMADWLFLKLGLLTQPSLLVIQLLALVML 202
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+NS ++F++HL+ + L RLG + R P W+++
Sbjct: 203 TINSVLYLFVVHLVALLMLDRLG-NSIPRPPHWVQV 237
>gi|407961790|dbj|BAM55030.1| hypothetical protein BEST7613_6099 [Synechocystis sp. PCC 6803]
Length = 245
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 78/158 (49%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A K M+A +LL VL GP +++ +++ +G++ +G W WG SIF
Sbjct: 85 LRWRSPAAIKGMIAAGLLLSVLMGPTRSVVFVIPYGLMSLQLGFCWVRNASWGFSIFTGA 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G SFL+ EN+ + I L ++F + P + ++ + +V
Sbjct: 145 LIGTFGFFFRFWLFSFLLGENLWQYVMGQITNLLDWLFLQFNILAQPDVLLVQFLAVGMV 204
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LNS ++ +HL + L RLG R P+WL++ L
Sbjct: 205 FLNSLIYMLAVHLTALLILDRLGNPIP-RPPQWLKVIL 241
>gi|388506298|gb|AFK41215.1| unknown [Lotus japonicus]
Length = 185
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/35 (74%), Positives = 30/35 (85%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH 35
MRWGV AG+KT+V+T +LL VLSGPVKAL YLL H
Sbjct: 150 MRWGVDAGKKTLVSTTILLFVLSGPVKALTYLLKH 184
>gi|427717327|ref|YP_007065321.1| hypothetical protein Cal7507_2044 [Calothrix sp. PCC 7507]
gi|427349763|gb|AFY32487.1| Protein of unknown function DUF2232, membrane [Calothrix sp. PCC
7507]
Length = 245
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 78/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + +LL VL GP+++L +++ + +G +G+ W V W +SI L T
Sbjct: 85 LRWGKRAAWMAALTCGLLLAVLMGPIRSLLFVMPYAFMGVLLGATWNRRVPWIVSITLGT 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G I S L E++ + + +IF V PS+ I + ++
Sbjct: 145 LLATLGIFFRIWLLSVLSGEDLWVYVINQVTEITEWIFLKLQVLANPSVFFIQIGAIAVI 204
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
LLN+ ++F++HL + L RLG R P W+++
Sbjct: 205 LLNNFIYLFVVHLAAWLLLDRLGNPIP-RPPNWIQV 239
>gi|16330851|ref|NP_441579.1| hypothetical protein sll1372 [Synechocystis sp. PCC 6803]
gi|383322593|ref|YP_005383446.1| hypothetical protein SYNGTI_1684 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325762|ref|YP_005386615.1| hypothetical protein SYNPCCP_1683 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491646|ref|YP_005409322.1| hypothetical protein SYNPCCN_1683 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436913|ref|YP_005651637.1| hypothetical protein SYNGTS_1684 [Synechocystis sp. PCC 6803]
gi|451815009|ref|YP_007451461.1| hypothetical protein MYO_117020 [Synechocystis sp. PCC 6803]
gi|1653344|dbj|BAA18259.1| sll1372 [Synechocystis sp. PCC 6803]
gi|339273945|dbj|BAK50432.1| hypothetical protein SYNGTS_1684 [Synechocystis sp. PCC 6803]
gi|359271912|dbj|BAL29431.1| hypothetical protein SYNGTI_1684 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359275082|dbj|BAL32600.1| hypothetical protein SYNPCCN_1683 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359278252|dbj|BAL35769.1| hypothetical protein SYNPCCP_1683 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451780978|gb|AGF51947.1| hypothetical protein MYO_117020 [Synechocystis sp. PCC 6803]
Length = 275
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 78/158 (49%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A K M+A +LL VL GP +++ +++ +G++ +G W WG SIF
Sbjct: 115 LRWRSPAAIKGMIAAGLLLSVLMGPTRSVVFVIPYGLMSLQLGFCWVRNASWGFSIFTGA 174
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G SFL+ EN+ + I L ++F + P + ++ + +V
Sbjct: 175 LIGTFGFFFRFWLFSFLLGENLWQYVMGQITNLLDWLFLQFNILAQPDVLLVQFLAVGMV 234
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
LNS ++ +HL + L RLG R P+WL++ L
Sbjct: 235 FLNSLIYMLAVHLTALLILDRLGNPIP-RPPQWLKVIL 271
>gi|428775069|ref|YP_007166856.1| hypothetical protein PCC7418_0408 [Halothece sp. PCC 7418]
gi|428689348|gb|AFZ42642.1| Protein of unknown function DUF2232, membrane [Halothece sp. PCC
7418]
Length = 256
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 80/158 (50%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G+ A + + +LL +L GP +++ Y++ +G++G +G+ W WGLS+ +
Sbjct: 97 LRRGMRASWMSALIVGLLLSILMGPTRSILYVIPYGLMGVQLGAFWCRNTHWGLSVTVGA 156
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I + G S + E++ +T I L + F G+ PS+ +I I +
Sbjct: 157 IIGTFGFFFRFWLLSIFLGEDLWVYVTSQIAQFLEWGFVQLGLLTQPSLWLIQAIALFTI 216
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+LNS +VF +HL+ + L RLG P W+++ L
Sbjct: 217 ILNSILYVFAVHLVALLILNRLGNPIP-SPPGWVQVIL 253
>gi|354567780|ref|ZP_08986948.1| Protein of unknown function DUF2232, membrane [Fischerella sp.
JSC-11]
gi|353542238|gb|EHC11702.1| Protein of unknown function DUF2232, membrane [Fischerella sp.
JSC-11]
Length = 223
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 79/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GP+++L +++ +G +G+ W V W +SI L
Sbjct: 64 LRWGKRAAWMGAVTSGLLLSVLMGPIRSLLFVMPFAFMGVLLGAAWYRRVPWIVSISLGA 123
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + + +IF G+ PS ++I L+
Sbjct: 124 VLGTLGVFFRLWLLSLLSGEDLWVYVINQVTEIVEWIFLRLGILASPSTSLINSGAIALI 183
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+ N+ ++F++H+ + L RLG R PRW+++
Sbjct: 184 IFNNFLYLFIVHIAAWLLLDRLGNPIP-RPPRWVQV 218
>gi|308804379|ref|XP_003079502.1| unnamed protein product [Ostreococcus tauri]
gi|116057957|emb|CAL54160.1| unnamed protein product [Ostreococcus tauri]
Length = 230
Score = 58.5 bits (140), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 67/139 (48%), Gaps = 1/139 (0%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALA-YLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
RWG AA R+T+V + + LA L L + +LW G W +++
Sbjct: 69 RWGSAAARRTLVVATLARPKTTMSSNELAPRRLAPCTLAHADRALWSSGASWWVTVPASA 128
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ R+ G + + SS L REN++ L+ ++A L + G P + ++ L+
Sbjct: 129 LTRTFGIVCSLSVSSLLYRENVMKLLVTQMYALLDQFAANVGATFAPDITWVWATAMVLI 188
Query: 121 LLNSGFFVFLLHLLYSVFL 139
L+NS + +LH++Y++ L
Sbjct: 189 LVNSLSYTLILHMVYTIVL 207
>gi|440681213|ref|YP_007156008.1| Protein of unknown function DUF2232, membrane [Anabaena cylindrica
PCC 7122]
gi|428678332|gb|AFZ57098.1| Protein of unknown function DUF2232, membrane [Anabaena cylindrica
PCC 7122]
Length = 244
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 78/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV+++ +++ ++G +G+ W V W +SI L
Sbjct: 84 LRWGKRAAWMAAVTSGLLLAVLMGPVRSILFVMPFALMGVLLGATWHRRVPWIVSITLGM 143
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + + +IF + PS+ I V LV
Sbjct: 144 LLGTLGVFFRLWLLSVLSGEDLWVYVINQVTEIIEWIFLKLEILATPSVFFIQVGAVLLV 203
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
N+ ++F++HL + L RLG R PRW+++
Sbjct: 204 AFNNLIYLFVVHLAAWLLLDRLGNPIP-RPPRWVQV 238
>gi|145346093|ref|XP_001417529.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577756|gb|ABO95822.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 180
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 79/138 (57%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
R G AA +T+V ++LLL++SGP++AL Y HGVL + + SLW G W +++ +
Sbjct: 43 RSGRAAAMRTLVVASLLLLMISGPLRALNYFFLHGVLAYVLSSLWSSGSSWWVTVPASAL 102
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVL 121
R+ G + + SS L REN++ L+ +++ L + G P++ ++ +L
Sbjct: 103 TRTFGILASLSISSLLYRENVMKLLVTQMYSLLDQFAANVGATFAPTIGWVWATALFFIL 162
Query: 122 LNSGFFVFLLHLLYSVFL 139
+NS + +LH++Y++ L
Sbjct: 163 VNSFSYTLILHMVYTIVL 180
>gi|298492012|ref|YP_003722189.1| hypothetical protein Aazo_3420 ['Nostoc azollae' 0708]
gi|298233930|gb|ADI65066.1| Protein of unknown function DUF2232, membrane ['Nostoc azollae'
0708]
Length = 242
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 79/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV+++ +++ G++G +G+ W V W +SI L
Sbjct: 82 LRWGKRAAWMGAVTSGLLLGVLMGPVRSMLFVMPFGLMGVLLGATWYRRVPWIVSITLGM 141
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + +IF + PS+ VI + L+
Sbjct: 142 LLGTLGVFFRLWLLSVLSGEDLWIYVINQVTEITEWIFLKLQILATPSVFVIQLGAVVLI 201
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
N+ ++F++HL + L RLG R PRW+++
Sbjct: 202 AFNNLIYLFVVHLAAWLLLDRLGNPIP-RPPRWVQV 236
>gi|443323263|ref|ZP_21052271.1| putative membrane protein (DUF2232) [Gloeocapsa sp. PCC 73106]
gi|442787001|gb|ELR96726.1| putative membrane protein (DUF2232) [Gloeocapsa sp. PCC 73106]
Length = 244
Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 80/157 (50%), Gaps = 2/157 (1%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
R AA + VAT +LL VL GPV+++ +L+ +G++G +G WR W S+ + I
Sbjct: 87 RGDRAAIMASTVAT-LLLSVLMGPVRSVIFLIPYGLMGIQLGVFWRRSASWNFSLCVGGI 145
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVL 121
+ G I S + E++ + + + I +IF G+ PS+ +I ++ ++
Sbjct: 146 IGTFGVFFRIFLFSIFLGEDLWSYVMVQITELANWIFLKLGILAEPSLGMIELLALVMIY 205
Query: 122 LNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
L++ ++ +HL + L +LG R P W++ L
Sbjct: 206 LSNLLYMLTVHLAALLILDKLGNPIP-RPPNWIKTIL 241
>gi|282895674|ref|ZP_06303799.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281199368|gb|EFA74233.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 178
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 78/155 (50%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL+GPV++L +++ LG +G+ W V W +SI L T
Sbjct: 18 LRWGNRASWMAAITSGLLLSVLTGPVRSLLFIMPFAFLGVLLGASWHRRVPWFVSITLGT 77
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I + G + S L E++ +T + L ++F + PS + + L+
Sbjct: 78 ILCTLGVFFRLWLLSILSGEDLWIYLTNQVTEILQWLFLNLRILTPPSTLGVKLAALLLI 137
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
++N+ ++F++HL + L RLG P W++
Sbjct: 138 VINNLIYMFIVHLAAWLLLERLGNSIP-NPPHWIQ 171
>gi|119490055|ref|ZP_01622679.1| hypothetical protein L8106_20785 [Lyngbya sp. PCC 8106]
gi|119454207|gb|EAW35359.1| hypothetical protein L8106_20785 [Lyngbya sp. PCC 8106]
Length = 232
Score = 56.2 bits (134), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 83/158 (52%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + +LL VL GP +++ +L+ +G +G +MG LW+ G +W +SI +
Sbjct: 72 LRWGKREAWMGALTSLLLLTVLMGPTRSIIFLIPYGWMGVAMGGLWKRGANWFVSIGVGA 131
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I S G S L+ ++ T I +IF G+ PS+ ++ VI +V
Sbjct: 132 IIGSIGFFFRYWLLSILLGRDLWVYATTQITELAEWIFIKLGILAQPSLTLVQVIAVVMV 191
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
++NS ++F++HL+ + L RL R P W+++ L
Sbjct: 192 VVNSAIYLFVVHLVALLILDRLKSPIP-RPPHWVQVLL 228
>gi|427734711|ref|YP_007054255.1| hypothetical protein Riv7116_1135 [Rivularia sp. PCC 7116]
gi|427369752|gb|AFY53708.1| putative membrane protein (DUF2232) [Rivularia sp. PCC 7116]
Length = 240
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 85/156 (54%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A + + + +LLLVL GP +++ +++ + +G +GS W+ GV W +SI L T
Sbjct: 81 LRWNSRAAWMSALTSGLLLLVLMGPTRSMLFVIPYAFMGVLLGSTWQRGVPWIVSISLAT 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L +++ + + L ++F G VP+ +I + L+
Sbjct: 141 LLGAIGVFFRLWLLSLLSGQDLWIYLINQVTNFLEWLFVNLGFLTVPNPFLIQIGAICLI 200
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
LLN+ ++F++H+ + L RLG + R P+W+++
Sbjct: 201 LLNNFIYLFVVHIAAWLLLDRLG-NSIPRPPQWVQV 235
>gi|220905783|ref|YP_002481094.1| hypothetical protein Cyan7425_0341 [Cyanothece sp. PCC 7425]
gi|219862394|gb|ACL42733.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length = 244
Score = 55.5 bits (132), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 77/155 (49%), Gaps = 1/155 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW + T + A+LL VL GP ++L +L+ HG+LG +G W+ W +SI
Sbjct: 85 LRWNLRTATMTALVAALLLSVLMGPPRSLQFLVPHGLLGILLGHFWQRRASWWVSIAWGA 144
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ G I S L+ EN+ + + L ++F++ G + P + V+ + +V
Sbjct: 145 LLGGIGLFFQIGLLSLLLGENLWLYLNQQVTNLLDWLFTSLGWLVQPDLAVVQAVAVLIV 204
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
N+ + F++HL + L R+G P+WL+
Sbjct: 205 FFNATLYTFVVHLAAWLLLERVGNPIP-PPPQWLQ 238
>gi|186683647|ref|YP_001866843.1| hypothetical protein Npun_F3493 [Nostoc punctiforme PCC 73102]
gi|186466099|gb|ACC81900.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 246
Score = 55.5 bits (132), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 81/157 (51%), Gaps = 2/157 (1%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLW-RLGVDWGLSIFLC 59
+RWG A + + +LL VL GP ++L +++ +G +G +G+ W R V W +SI L
Sbjct: 85 LRWGKRAAWMAALTSGLLLTVLMGPARSLLFVMPYGFMGVLLGATWYRRRVPWIVSITLG 144
Query: 60 TIARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTL 119
T+ + G + S L E++ + + + ++F + PS+ +I V L
Sbjct: 145 TLLGTLGVFFRLWLLSVLSGEDLWIYVITQVTEFIEWVFLKLSLLASPSVFLIQVGAIAL 204
Query: 120 VLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+LLN+ ++F++HL RLG R PRW+++
Sbjct: 205 ILLNNFIYLFVVHLAAWFLFDRLGNPIP-RPPRWVQV 240
>gi|17227792|ref|NP_484340.1| hypothetical protein alr0296 [Nostoc sp. PCC 7120]
gi|17135274|dbj|BAB77820.1| alr0296 [Nostoc sp. PCC 7120]
Length = 257
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 81/156 (51%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ +G +G+ W W +SI L T
Sbjct: 99 LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVVLGATWYRRAPWIVSITLGT 158
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + + +IF G+ PS+ +I + L+
Sbjct: 159 LLGTLGVFFRLWLLSVLSGEDLWIYLITQVTELIEWIFLKLGLLNSPSVFLIQLGAVALI 218
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
++N+ ++F++HL + L RLG R PRW+++
Sbjct: 219 IINNFIYLFVVHLAAWLLLDRLGNPIP-RPPRWVQV 253
>gi|434397203|ref|YP_007131207.1| Protein of unknown function DUF2232, membrane [Stanieria
cyanosphaera PCC 7437]
gi|428268300|gb|AFZ34241.1| Protein of unknown function DUF2232, membrane [Stanieria
cyanosphaera PCC 7437]
Length = 256
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 71/136 (52%), Gaps = 1/136 (0%)
Query: 23 SGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENI 82
GP +++ +L+ +G++G +G W W LSI + ++ + G S L+ E++
Sbjct: 118 MGPTRSIVFLMPYGLMGVQLGVCWSRRASWVLSILIGSLIGTFGLFFRFWLFSVLLGEDL 177
Query: 83 LALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLHLLYSVFLTRL 142
+ I +IF G+ PS +I ++ ++L+N+ ++F +HL+ + L RL
Sbjct: 178 WIYVITQITELTDWIFLKLGLLAQPSFFLIQILAIAMILINNLVYLFAVHLVALILLDRL 237
Query: 143 GMKASLRLPRWLEMAL 158
G S R P+W+++ L
Sbjct: 238 GNPIS-RPPQWVQVLL 252
>gi|282900315|ref|ZP_06308266.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281194820|gb|EFA69766.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 226
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 78/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL+GPV++L +++ LG +G W V W +SI L T
Sbjct: 66 LRWGNRASWMAAITSGLLLSVLTGPVRSLLFIMPFAFLGVLLGVSWHRRVPWVVSISLGT 125
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I + G + S L E++ +T + + ++F + PS + + L+
Sbjct: 126 ILCTLGVFFRLWLLSILSGEDLWVYLTNQVTEIIEWLFLNLQILTTPSALGVKLAALLLI 185
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
++N+ ++F++HL + L RLG P W+++
Sbjct: 186 MINNLIYMFIVHLAAWLLLDRLGNLIP-NPPDWIQV 220
>gi|75909273|ref|YP_323569.1| hypothetical protein Ava_3065 [Anabaena variabilis ATCC 29413]
gi|75702998|gb|ABA22674.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 257
Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 82/158 (51%), Gaps = 5/158 (3%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ +G +G+ W W +SI L T
Sbjct: 99 LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWYRRAPWIVSITLGT 158
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG--T 118
+ + G + S L E++ + + + +IF G+ PS V+ + FG
Sbjct: 159 LLGTLGVFFRLWLLSVLSGEDLWIYLITQVTELIEWIFLKLGLLNSPS--VLLIQFGAVA 216
Query: 119 LVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
L+++N+ ++F++HL + L RLG R PRW+++
Sbjct: 217 LIIINNFIYLFVVHLAAWLLLDRLGNPIP-RPPRWVQV 253
>gi|254423145|ref|ZP_05036863.1| hypothetical protein S7335_3299 [Synechococcus sp. PCC 7335]
gi|196190634|gb|EDX85598.1| hypothetical protein S7335_3299 [Synechococcus sp. PCC 7335]
Length = 278
Score = 54.3 bits (129), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/164 (26%), Positives = 80/164 (48%), Gaps = 9/164 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RW A + +LL +L GP +++ +++ +G+LG +G LWR WG+SI
Sbjct: 112 LRWNARAAWMGATVSGLLLTILLGPTRSVLFVMPYGILGVQLGYLWRRKAGWGISILTGG 171
Query: 61 IARSAGAMGYILTSSFLIREN----ILALITINIHASLTFIFSAAGVNI----VPSMNVI 112
+ + G I S L E+ ++A IT I S+ + + + + P M +I
Sbjct: 172 LLVTFGTFFRIWLLSILAGEDLWGYLIAQITQFIDWSVRLLTNWGLIGLGRFGQPDMEII 231
Query: 113 YVIFGTLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+ ++ + S ++F +HL + L RLG+ S P W+++
Sbjct: 232 AIAAVSMTVFASVVYLFTVHLAAWLILERLGIVMS-EPPNWVQV 274
>gi|56751839|ref|YP_172540.1| hypothetical protein syc1830_d [Synechococcus elongatus PCC 6301]
gi|81301078|ref|YP_401286.1| hypothetical protein Synpcc7942_2269 [Synechococcus elongatus PCC
7942]
gi|56686798|dbj|BAD80020.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169959|gb|ABB58299.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 245
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 72/157 (45%), Gaps = 1/157 (0%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
RWG+ + + ++LL VL GP + + +LL G LG +G W+ + WG SI L
Sbjct: 85 RWGLRSAWLGAIVASLLLFVLLGPTRGILFLLPPGCLGVLLGCCWQRRLSWGWSILLGGG 144
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVL 121
+ G + S L+ E+I +T L ++ G + ++ + + ++
Sbjct: 145 LETIGFFFRLWVLSVLLGEDIWTYVTAQATGFLEWLTGVLGWLVQIPLSTVQAVAFAMIT 204
Query: 122 LNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
L+S ++ ++H + V RLG P WL+ L
Sbjct: 205 LSSLLYILVVHGVAWVLFRRLGTPIPAP-PHWLQTVL 240
>gi|254413468|ref|ZP_05027238.1| hypothetical protein MC7420_6047 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196179575|gb|EDX74569.1| hypothetical protein MC7420_6047 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 277
Score = 53.1 bits (126), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 33/136 (24%), Positives = 72/136 (52%), Gaps = 1/136 (0%)
Query: 23 SGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENI 82
GP +++ +L+ +G++G +G+LW+ +W SI + T+ + G S L+ E++
Sbjct: 141 QGPTRSILFLIPYGLMGVQLGALWKRQANWLFSIGIGTLLGTLGFFFRFWLLSILLSEDL 200
Query: 83 LALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLHLLYSVFLTRL 142
+ I + + F G+ PS+ +I V+ +++N+ ++F++HL+ + RL
Sbjct: 201 WVYVMTQITELVEWGFVKLGLLAQPSLMLIQVVAIVAIVINNAIYLFVVHLVALLIFDRL 260
Query: 143 GMKASLRLPRWLEMAL 158
R P+W+++ L
Sbjct: 261 KNPIP-RPPQWVQVLL 275
>gi|119512452|ref|ZP_01631534.1| hypothetical protein N9414_13495 [Nodularia spumigena CCY9414]
gi|119462918|gb|EAW43873.1| hypothetical protein N9414_13495 [Nodularia spumigena CCY9414]
Length = 241
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 79/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ +G +G+ W V W +SI L T
Sbjct: 81 LRWGKRAAWMAAVTSGLLLSVLMGPVRSLLFVMPFAFMGVLLGATWHRRVPWIVSITLGT 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + + + F + PS+ +I V L+
Sbjct: 141 LLGTLGVFFRLWLLSVLSGEDLWVYVINQVTELIEWAFLRLELLASPSLFLIQVGAVALI 200
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+LN+ ++F++HL + L RLG R P W+++
Sbjct: 201 VLNNFIYLFMVHLAAWLLLDRLGNPIP-RPPEWVQV 235
>gi|67922653|ref|ZP_00516158.1| similar to membrane protein [Crocosphaera watsonii WH 8501]
gi|67855501|gb|EAM50755.1| similar to membrane protein [Crocosphaera watsonii WH 8501]
Length = 240
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 77/158 (48%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A + + +LL VL GP +++ +L+ +G++G +G+ WR + W S+
Sbjct: 81 LRRGKRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGVQLGAFWRRNLPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ E++ I I +IF G+ PS I + ++
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQITGLAEWIFLRLGILAQPSSVFIQSLAIFII 200
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
NS ++F +HL+ + L +LG R P+W+++ L
Sbjct: 201 FFNSIIYLFAVHLVSLLVLDKLGNPIP-RPPKWVQVIL 237
>gi|411118678|ref|ZP_11391058.1| putative membrane protein (DUF2232) [Oscillatoriales cyanobacterium
JSC-12]
gi|410710541|gb|EKQ68048.1| putative membrane protein (DUF2232) [Oscillatoriales cyanobacterium
JSC-12]
Length = 275
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 78/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP ++L YL+ G +G +G LWR +W SI + +
Sbjct: 115 LRWGYRAAWMGALVSGLLLSVLMGPPRSLLYLMPFGFMGVLLGMLWRRQANWAGSIAIAS 174
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I S G + S L+ +++ T + + + G+ PS+ +I + V
Sbjct: 175 ILGSFGFFFRVWLVSLLLGDDLWLYTTTQVTNFIDWFCVRFGILFQPSLTLIQALVAGGV 234
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
++N+ ++F++HL+ RL R P+W+++
Sbjct: 235 VINNLVYLFVVHLVAWFLFDRLRNPIP-RPPKWVQV 269
>gi|416392028|ref|ZP_11685833.1| hypothetical protein CWATWH0003_2646 [Crocosphaera watsonii WH
0003]
gi|357263688|gb|EHJ12658.1| hypothetical protein CWATWH0003_2646 [Crocosphaera watsonii WH
0003]
Length = 240
Score = 52.8 bits (125), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 77/158 (48%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G A + + +LL VL GP +++ +L+ +G++G +G+ WR + W S+
Sbjct: 81 LRRGKRAAWMGALVSGLLLTVLMGPTRSIVFLIPYGLMGVQLGAFWRRNLPWEFSMITGA 140
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ E++ I I +IF G+ PS I + ++
Sbjct: 141 LLGTFGFFFRFWLFSILLGEDLWQYIINQITGLAEWIFLRLGILAQPSSVFIQSLAIFII 200
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
NS ++F +HL+ + L +LG R P+W+++ L
Sbjct: 201 FFNSIIYLFAVHLVSLLVLDKLGNPIP-RPPKWVQVIL 237
>gi|434405583|ref|YP_007148468.1| putative membrane protein (DUF2232) [Cylindrospermum stagnale PCC
7417]
gi|428259838|gb|AFZ25788.1| putative membrane protein (DUF2232) [Cylindrospermum stagnale PCC
7417]
Length = 260
Score = 52.4 bits (124), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 81/156 (51%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GPV++L +++ +G +G+ W V W +SI L T
Sbjct: 100 LRWGRRAAWMAAVTSGLLLAVLMGPVRSLLFVMPFAFMGVLLGATWYRRVPWIVSITLGT 159
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + + +IF G+ PS+ +I V L+
Sbjct: 160 LLGTLGVFFRLWLLSVLSGEDLWIYVINQVTDIIEWIFLKLGLLTNPSVFLIQVGAVALI 219
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+N+ ++ ++HL + L RLG + R P+W+++
Sbjct: 220 AINNFIYLIVVHLAAWLLLDRLG-NSIPRPPQWVQV 254
>gi|428211971|ref|YP_007085115.1| hypothetical protein Oscil6304_1490 [Oscillatoria acuminata PCC
6304]
gi|428000352|gb|AFY81195.1| putative membrane protein (DUF2232) [Oscillatoria acuminata PCC
6304]
Length = 265
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 77/143 (53%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GP +++ +++ G+LG +G LW+ W ++IFL +
Sbjct: 106 LRWGNRAAWMGAVVSGLLLSVLMGPTRSILFVVPFGLLGVLLGCLWKRRASWEVAIFLGS 165
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + L+ E++ +TI I ++F G+ + PS+ VI + LV
Sbjct: 166 LLGAFGFFFRLWLLGILVGEDLWVYLTIQITEIADWLFLRLGLLVQPSLYVIQALAAILV 225
Query: 121 LLNSGFFVFLLHLLYSVFLTRLG 143
+ + ++F +HL S+ L ++G
Sbjct: 226 WVQNLIYLFAVHLAASLLLEKIG 248
>gi|334117938|ref|ZP_08492028.1| Protein of unknown function DUF2232, membrane [Microcoleus
vaginatus FGP-2]
gi|333459923|gb|EGK88533.1| Protein of unknown function DUF2232, membrane [Microcoleus
vaginatus FGP-2]
Length = 251
Score = 50.4 bits (119), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 84/156 (53%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ Y++ G++G ++G LWR W LSI +
Sbjct: 91 LRWGNRAAWMAALVSGLLLSVLMGPTRSILYVMPFGIMGVALGGLWRRKASWSLSISVGA 150
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ +++ +T + + ++F G VPS+N I I ++
Sbjct: 151 LIGTVGFFFRFWLLSALVGQDLWVYVTAQVTEMVEWVFVKLGWLNVPSLNTIGAIAAGMI 210
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
L+N+ ++F++HL+ + L RLG R P W+++
Sbjct: 211 LVNNIVYLFVVHLVALLLLDRLGNPIP-RPPSWVQV 245
>gi|428317916|ref|YP_007115798.1| Protein of unknown function DUF2232, membrane [Oscillatoria
nigro-viridis PCC 7112]
gi|428241596|gb|AFZ07382.1| Protein of unknown function DUF2232, membrane [Oscillatoria
nigro-viridis PCC 7112]
Length = 251
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 64/124 (51%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL VL GP +++ Y++ G++G ++G LWR W LSI +
Sbjct: 91 LRWGNRAAWMAALVSGLLLSVLMGPTRSILYVMPFGIMGVALGGLWRRKASWSLSISVGA 150
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ G S L+ +++ +T + + ++F G VPS+N I I ++
Sbjct: 151 FIGTVGFFFRFWLLSALVGQDLWVYVTAQVTEMVEWVFVKLGWLNVPSLNTIGAIAAGMI 210
Query: 121 LLNS 124
L+N+
Sbjct: 211 LVNN 214
>gi|414075864|ref|YP_006995182.1| hypothetical protein ANA_C10569 [Anabaena sp. 90]
gi|413969280|gb|AFW93369.1| hypothetical protein ANA_C10569 [Anabaena sp. 90]
Length = 254
Score = 50.1 bits (118), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 80/156 (51%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A V + +LL VL GP ++L +++ G++G +G W V W +S+ L T
Sbjct: 95 LRWGKRAAWMAAVTSGLLLSVLMGPFRSLLFVMPFGLMGVLLGVTWYRRVPWIVSMTLGT 154
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + S L E++ + + + +IF + + P++ +I + +
Sbjct: 155 LLGTMGVFFRLWFLSVLSGEDLWVYVINQVTEIIEWIFLKLELLMTPNLLLIQLGAIAFI 214
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+LN+ ++F++HL + L RLG R P WL++
Sbjct: 215 VLNNFIYLFIVHLTALLLLDRLGNPIP-RPPHWLQV 249
>gi|332706232|ref|ZP_08426301.1| putative membrane protein, DUF2232 [Moorea producens 3L]
gi|332355069|gb|EGJ34540.1| putative membrane protein, DUF2232 [Moorea producens 3L]
Length = 284
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/135 (23%), Positives = 72/135 (53%), Gaps = 1/135 (0%)
Query: 24 GPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENIL 83
G +++ +L+ +G++G +G+LW+ W LSI + T+ + G S L+ E++
Sbjct: 148 GVTRSIIFLVPYGLMGVQLGALWKRRASWLLSISMGTLIGTIGFFFRFWLFSILLGEDLW 207
Query: 84 ALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLHLLYSVFLTRLG 143
+ + L + F G+ PS+ ++ + ++++N+ ++F++H++ + L RLG
Sbjct: 208 VYVMTQVTGLLDWWFLKLGLLFQPSLFLVQTLAVVMIIINNAIYLFVVHIVALLMLDRLG 267
Query: 144 MKASLRLPRWLEMAL 158
R P W+++ L
Sbjct: 268 NPIP-RPPNWVKVLL 281
>gi|72383606|ref|YP_292961.1| hypothetical protein PMN2A_1770 [Prochlorococcus marinus str.
NATL2A]
gi|72003456|gb|AAZ59258.1| uncharacterized conserved membrane protein [Prochlorococcus marinus
str. NATL2A]
Length = 228
Score = 49.3 bits (116), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 69/154 (44%), Gaps = 8/154 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R GV G + + MLL VL GP++ L +G+L +G W G +W LS +
Sbjct: 67 IRRGVKTGIEGVTICVMLLTVLMGPLRGPLVLFPYGLLSLWLGYCWEKGWNWWLS---WS 123
Query: 61 IARSAGAMGY---ILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
+ S G MG+ + S L+ EN+ ++T L ++ P M + ++
Sbjct: 124 VGVSIGTMGFLVRVFVLSLLVGENLWVILTRAGAGLLEKGIDIFNLSFTPDMRQVQIVAL 183
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLP 151
L++ +V LH L RL K+S+ P
Sbjct: 184 CLIITQEIIYVLCLHALAYWIFPRL--KSSIPEP 215
>gi|124025204|ref|YP_001014320.1| hypothetical protein NATL1_04931 [Prochlorococcus marinus str.
NATL1A]
gi|123960272|gb|ABM75055.1| Predicted membrane protein [Prochlorococcus marinus str. NATL1A]
Length = 228
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 68/154 (44%), Gaps = 8/154 (5%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R GV G + + MLL L GP++ L +G+L +G W G +W LS +
Sbjct: 67 IRRGVKTGIEGVTICVMLLTALMGPLRGPLVLFPYGLLSLWLGYCWEKGWNWWLS---WS 123
Query: 61 IARSAGAMGY---ILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFG 117
+ S G MG+ + S L+ EN+ ++T L ++ P M + ++
Sbjct: 124 VGVSIGTMGFLVRVFVLSLLVGENLWVILTRAGAGLLEKGIDIFNLSFTPDMRQVQIVAL 183
Query: 118 TLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLP 151
L++ +V LH L RL K+S+ P
Sbjct: 184 CLIITQEIIYVLCLHALAYWIFPRL--KSSIPEP 215
>gi|113475248|ref|YP_721309.1| hypothetical protein Tery_1556 [Trichodesmium erythraeum IMS101]
gi|110166296|gb|ABG50836.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 219
Score = 47.4 bits (111), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/158 (23%), Positives = 82/158 (51%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL +L GP +++ +++ ++G +G+LWR G W +SI +
Sbjct: 60 LRWGSRASWMGAIVSGLLLSILMGPTRSILFVIPFALMGVMLGALWRRGKSWFVSITYGS 119
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ S G S L+ +++ + + ++F G+ P++ ++ +I L+
Sbjct: 120 LLGSIGFFFRFWLLSILLGQDLWIYLITQATELVEWVFVKLGLLNQPNLLLVQLIALILI 179
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+N+ ++ ++HL+ + L R+G R P+W++ L
Sbjct: 180 FINNVIYLSVVHLVALLLLDRIGNSIP-RPPKWIKKLL 216
>gi|443312339|ref|ZP_21041957.1| putative membrane protein (DUF2232) [Synechocystis sp. PCC 7509]
gi|442777577|gb|ELR87852.1| putative membrane protein (DUF2232) [Synechocystis sp. PCC 7509]
Length = 226
Score = 47.0 bits (110), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 79/156 (50%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG + + +LL VL GP +++ +++ +G++G +G+ W+ W +SI L T
Sbjct: 66 LRWGNRTAWMGALVSGLLLSVLMGPTRSIVFVVPYGLMGVLLGATWKRRSPWVVSIGLAT 125
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G + L E++ + L ++F AG+ PS+ ++ I ++
Sbjct: 126 LLGAFGVFFRLWLLLVLSGEDVWVYSITQVTNLLEWLFLKAGLLAQPSIVLVEAIAIAII 185
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+N+ ++F +HL L RLG R PRW+++
Sbjct: 186 FINNMIYLFAVHLASWFLLDRLGNPIP-RPPRWVQV 220
>gi|443316964|ref|ZP_21046389.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 6406]
gi|442783427|gb|ELR93342.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 6406]
Length = 273
Score = 46.2 bits (108), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/164 (26%), Positives = 80/164 (48%), Gaps = 11/164 (6%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + +LL V+ GP ++L +L+ +G+LG +G LWR W +I L
Sbjct: 107 LRWGSRAAWMSALVATLLLAVMMGPPRSLLFLMPYGLLGVQLGWLWRRRASWYTTIGLGA 166
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASL---------TFIFSAAGVNIVPSMNV 111
+ + G + S ++ E+ L + TIN L + + GV +
Sbjct: 167 LLSALGVFVRLWLLSIMLGED-LWIYTINQVTQLIRWVLGILVNWGWLGWGVLSQVDGTL 225
Query: 112 IYVIFGTLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLE 155
+ ++ L+L++S ++F +HL + L RLG +A P W++
Sbjct: 226 VQILAMGLILVSSVVYLFTVHLAAWLLLERLG-EAMSPPPLWVQ 268
>gi|148241761|ref|YP_001226918.1| hypothetical protein SynRCC307_0662 [Synechococcus sp. RCC307]
gi|147850071|emb|CAK27565.1| Uncharacterized conserved membrane protein [Synechococcus sp.
RCC307]
Length = 207
Score = 43.1 bits (100), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 34/128 (26%), Positives = 60/128 (46%)
Query: 15 TAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTS 74
T +LL VL GPV+ L +G+L +G WR W S+ + + +AG + +
Sbjct: 60 TVLLLTVLMGPVRGPLMLFPYGLLSLWLGWCWRRQCSWWFSMAVGVLIGAAGFLVRVALV 119
Query: 75 SFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLHLL 134
S L+ EN+ +IT L + + + P+++ + V LV+ + ++ LH +
Sbjct: 120 SLLLGENLWVVITAAASGLLNGLADRLQLPVAPTLDQVQVAALLLVVAQNIVYLLALHAV 179
Query: 135 YSVFLTRL 142
L RL
Sbjct: 180 AWWVLPRL 187
>gi|428311088|ref|YP_007122065.1| hypothetical protein Mic7113_2881 [Microcoleus sp. PCC 7113]
gi|428252700|gb|AFZ18659.1| putative membrane protein (DUF2232) [Microcoleus sp. PCC 7113]
Length = 300
Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 35/56 (62%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSI 56
+RWG A + + +LL VL GP +++ +++ +G++G +G++WR +W SI
Sbjct: 140 LRWGNRASWMAAIVSGLLLSVLMGPTRSILFVIPYGLMGVQLGAMWRRQSNWLFSI 195
>gi|116073491|ref|ZP_01470753.1| hypothetical protein RS9916_33612 [Synechococcus sp. RS9916]
gi|116068796|gb|EAU74548.1| hypothetical protein RS9916_33612 [Synechococcus sp. RS9916]
Length = 221
Score = 43.1 bits (100), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 76/159 (47%), Gaps = 13/159 (8%)
Query: 4 GVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIAR 63
G AG + ++ +LL+ L GPV+ L +G L +G W G+ W +S L T+
Sbjct: 52 GGKAGVEGVLLVVLLLVALMGPVRGPLILFPYGFLSLWLGWCWLRGMSWWISWGLGTLIG 111
Query: 64 SAGAMGYILTSSFLIRENILALIT------IN-----IHASLTFIFSAAGVNIVPSMNVI 112
+AG + ++ S L+ EN+ +IT +N I+ +L F+ ++I P + +
Sbjct: 112 AAGFLVRVVALSLLVGENLWVVITRAGDNMLNGLIGVINNALAFVQVPLRLSIAPDLEQV 171
Query: 113 YVIFGTLVLLNSGFFVFLLHLLYSVFLTRLGMKASLRLP 151
++ LVL+ +V LH L RL KA + P
Sbjct: 172 QLMALALVLVQQLIYVLALHALSYWIFPRL--KAPIPSP 208
>gi|148240217|ref|YP_001225604.1| hypothetical protein SynWH7803_1881 [Synechococcus sp. WH 7803]
gi|147848756|emb|CAK24307.1| Conserved hyptothetical membrane protein [Synechococcus sp. WH
7803]
Length = 214
Score = 43.1 bits (100), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 40/151 (26%), Positives = 71/151 (47%), Gaps = 2/151 (1%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG + + + +LL+ L GPV+ L +G+L +G W+ W +S L
Sbjct: 53 VRRGGRAGLEGLAVSILLLIALMGPVRGPLMLFPYGLLSLWLGWCWQHRQSWWISWGLGV 112
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ +AG + ++ S L+ EN+ +IT L + + + P + ++ V+ LV
Sbjct: 113 VIGAAGFLVRVVALSLLVGENLWVVITRAGAGLLDRLLELLQLPLAPDLLLVQVMALVLV 172
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLP 151
++ +V LH L RL KA + P
Sbjct: 173 VIQQLIYVLALHALAYWIFPRL--KAPVPEP 201
>gi|428304516|ref|YP_007141341.1| hypothetical protein Cri9333_0916 [Crinalium epipsammum PCC 9333]
gi|428246051|gb|AFZ11831.1| Protein of unknown function DUF2232, membrane [Crinalium epipsammum
PCC 9333]
Length = 260
Score = 42.4 bits (98), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 84/156 (53%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + + +LL VL GP +++ + + +G++G +G+LWR G +W LSI +
Sbjct: 100 LRWGHRASWMSALVSGLLLSVLMGPTRSILFFIPYGLMGVQLGALWRRGSNWLLSIGIGG 159
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L+ E++ + I ++F G P++ ++ I +V
Sbjct: 160 LIGTFGFFFRFWLLSILLGEDLWIYLMTQITEVAEWVFVKLGFLAQPNLLLVQAIALAMV 219
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+LN+ ++F++HL+ + L RLG R P W+++
Sbjct: 220 ILNNMIYLFVVHLVALLLLDRLGNPIP-RPPHWVQV 254
>gi|87303040|ref|ZP_01085844.1| hypothetical protein WH5701_07696 [Synechococcus sp. WH 5701]
gi|87282536|gb|EAQ74495.1| hypothetical protein WH5701_07696 [Synechococcus sp. WH 5701]
Length = 219
Score = 41.2 bits (95), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/152 (25%), Positives = 68/152 (44%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G G + V ++LL+ L GP++ L +G+L +G W W L+ +
Sbjct: 58 VRQGWRPGWEGAVVASLLLVALMGPIRGPLVLFPYGLLALWLGWGWAERRSWWLTWSVGL 117
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ SAG + + S L+ EN+ LIT L + S + + P + + ++ LV
Sbjct: 118 LIGSAGFLVRVAVLSLLLGENLWVLITAAAAGLLERLSSLLNLGVGPDLLQVQLMALVLV 177
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPR 152
L+ + +V LH + RL S P+
Sbjct: 178 LVQNLIYVLALHAVAYWIFPRLQAPISEPPPQ 209
>gi|88807570|ref|ZP_01123082.1| hypothetical protein WH7805_13503 [Synechococcus sp. WH 7805]
gi|88788784|gb|EAR19939.1| hypothetical protein WH7805_13503 [Synechococcus sp. WH 7805]
Length = 214
Score = 40.8 bits (94), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 36/142 (25%), Positives = 66/142 (46%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG + + + +LL+ L GPV+ L +G+L +G W+ W +S
Sbjct: 53 VRRGSRAGLEGLAVSILLLMALMGPVRGPLMLFPYGLLSLWLGWCWQHRQSWWISWGFGL 112
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ +AG + ++ S L+ EN+ +IT L + + + P + ++ V+ LV
Sbjct: 113 VIGAAGFLVRVVALSLLVGENLWVVITRAGAGLLDRLLELLQLPLAPDLLLVQVMALVLV 172
Query: 121 LLNSGFFVFLLHLLYSVFLTRL 142
++ +V LH L RL
Sbjct: 173 VIQQLIYVLALHALAYWIFPRL 194
>gi|427418274|ref|ZP_18908457.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 7375]
gi|425760987|gb|EKV01840.1| putative membrane protein (DUF2232) [Leptolyngbya sp. PCC 7375]
Length = 267
Score = 40.4 bits (93), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 74/159 (46%), Gaps = 15/159 (9%)
Query: 11 TMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGY 70
T + + +LL +L GP ++L Y++ +G+LG +G WR W +S+ + + G
Sbjct: 111 TTLVSGLLLAILLGPPRSLLYVIPYGLLGVQLGYHWRHQTSWLISLPIGAFIVTLGIFFR 170
Query: 71 ILTSSFLIRENILALITINIHASLTFI-----------FSAAGVNIVPSMNVIYVIFGTL 119
I S++ E++ + + +I F A G + P + IF +
Sbjct: 171 IWLLSWMAGEDLWGYLVAQVIQLTDWITNRLLDFGLLGFGAIG-QLSPGIIQTVAIF--M 227
Query: 120 VLLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
V +S ++F +HL + L R+G+ A P+W++ L
Sbjct: 228 VFFSSIVYLFTVHLTAWLLLERMGI-AIPSPPQWVQQIL 265
>gi|194477188|ref|YP_002049367.1| hypothetical protein PCC_0742 [Paulinella chromatophora]
gi|171192195|gb|ACB43157.1| hypothetical protein PCC_0742 [Paulinella chromatophora]
Length = 223
Score = 40.4 bits (93), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/146 (23%), Positives = 68/146 (46%)
Query: 2 RWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTI 61
R G + + T +LL L GP++ L +G+L +G W ++W ++ + I
Sbjct: 63 RQGQRCAIEGLFVTVLLLTTLMGPIRGSIMLFPYGILAIWLGLGWYHRLNWWITWGVGII 122
Query: 62 ARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVL 121
S G + + S L+ EN+ +IT + L +I + + PS + + + T +
Sbjct: 123 IGSLGFVIRLTMLSILVGENLWMVITHSSTDFLEYIAVSLKLGFSPSFSEVQSLTITFIF 182
Query: 122 LNSGFFVFLLHLLYSVFLTRLGMKAS 147
L + +VF +H++ +RL + S
Sbjct: 183 LQNLIYVFTVHVIAYWTFSRLDVNIS 208
>gi|159902976|ref|YP_001550320.1| hypothetical protein P9211_04351 [Prochlorococcus marinus str. MIT
9211]
gi|159888152|gb|ABX08366.1| Predicted membrane protein [Prochlorococcus marinus str. MIT 9211]
Length = 225
Score = 38.5 bits (88), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 63/132 (47%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG + + MLL+VL GP++ L +G L +G W + W LS +
Sbjct: 64 VRRGHQAGLEGVSLLIMLLIVLMGPIRGPLVLFPYGFLSLWLGWSWCRKLSWWLSWSVGA 123
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ G + +L S L+ EN+ +IT A L I + ++P M +I + +LV
Sbjct: 124 FLGTIGFLIRVLFLSILVGENLWVVITRAASALLERIIDIFNLPLIPDMALIQLAALSLV 183
Query: 121 LLNSGFFVFLLH 132
++ FV LH
Sbjct: 184 VIQELIFVLTLH 195
>gi|434391074|ref|YP_007126021.1| Protein of unknown function DUF2232, membrane [Gloeocapsa sp. PCC
7428]
gi|428262915|gb|AFZ28861.1| Protein of unknown function DUF2232, membrane [Gloeocapsa sp. PCC
7428]
Length = 237
Score = 38.5 bits (88), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 70/156 (44%), Gaps = 1/156 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + +LL VL GP + + Y++ ++G +G+ W W +SI L
Sbjct: 75 LRWGARAAWMATGVSGLLLSVLLGPTRGILYIMPFALMGVLLGATWHRRAPWIVSIILGA 134
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ + G S L E++ + + F G+ PS+ +I LV
Sbjct: 135 LLGTIGFFFRFWVLSVLTGEDLWVYTITQVTRFAEWAFLRLGLLAQPSVLLIEAAAIALV 194
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEM 156
+ + ++F++HL L RLG + R P W+++
Sbjct: 195 FVQNIVYLFVVHLASWFLLDRLG-NSIPRPPHWVQV 229
>gi|198419335|ref|XP_002119961.1| PREDICTED: similar to G protein-coupled receptor 128 [Ciona
intestinalis]
Length = 1432
Score = 38.5 bits (88), Expect = 1.0, Method: Composition-based stats.
Identities = 31/109 (28%), Positives = 55/109 (50%), Gaps = 15/109 (13%)
Query: 48 LGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVP 107
LGV+ L +FL A ++ S +++N+L +T+ + LT+IF G I+P
Sbjct: 1278 LGVNLMLYVFLTYKVTCAKRPNFMAGKSKDLQKNLLFSLTLFVTLGLTWIF---GFVIIP 1334
Query: 108 -------SMNVIYVIFGTLVLLNSGFFVFLLHLLYSVFLTRLGMKASLR 149
+ +V++ +F +L GFF+FLL+++ F TR + R
Sbjct: 1335 GNGDASFAFSVLFTVFNSL----QGFFLFLLYVVRQKF-TRSAISEQAR 1378
>gi|124022342|ref|YP_001016649.1| hypothetical protein P9303_06321 [Prochlorococcus marinus str. MIT
9303]
gi|123962628|gb|ABM77384.1| Predicted membrane protein [Prochlorococcus marinus str. MIT 9303]
Length = 226
Score = 38.1 bits (87), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 67/142 (47%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G AG + + +LL+ L GPV+ L +G+L F +G W G+ W +S
Sbjct: 65 VRRGSKAGLEGVALAVLLLIALMGPVRGPLVLFPYGLLAFWLGWSWHRGLSWWVSWGCGV 124
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
+ +AG + ++ S L+ EN+ +IT L + + + P +N + ++ LV
Sbjct: 125 VIGTAGFLVRVVVLSLLVGENLWVVITRAGSGLLDRLVDLLNLPLAPDLNHVQLMAFALV 184
Query: 121 LLNSGFFVFLLHLLYSVFLTRL 142
+ +V +LH L RL
Sbjct: 185 VFQELVYVLVLHALAFWIFPRL 206
>gi|428223992|ref|YP_007108089.1| hypothetical protein GEI7407_0538 [Geitlerinema sp. PCC 7407]
gi|427983893|gb|AFY65037.1| Protein of unknown function DUF2232, membrane [Geitlerinema sp. PCC
7407]
Length = 264
Score = 37.7 bits (86), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 85/158 (53%), Gaps = 1/158 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+RWG A + + + +LL VL GP +++ +++ +G+LG +G LW+ W +I L +
Sbjct: 103 LRWGGRAAWMSALVSGLLLSVLMGPTRSVLFVMPYGLLGVLLGFLWQRAATWTTAIALGS 162
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I + G + S L+ +++ +T I + L +I G+ + PS++++ + +
Sbjct: 163 ILGTFGIFFRLWLLSVLLGDDLWIYLTTQITSLLDWILLKLGLLLQPSLSLVQSVAVVAI 222
Query: 121 LLNSGFFVFLLHLLYSVFLTRLGMKASLRLPRWLEMAL 158
+LNS ++ ++H+ L RLG P+W+++ L
Sbjct: 223 VLNSLLYLLVVHIAAWWLLERLGNPIPAP-PKWVQILL 259
>gi|317970523|ref|ZP_07971913.1| hypothetical protein SCB02_13396 [Synechococcus sp. CB0205]
Length = 212
Score = 37.4 bits (85), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 28/109 (25%), Positives = 49/109 (44%)
Query: 24 GPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENIL 83
GP++ L +G L +G WR G+ W LS + + + G + + S L+ EN+
Sbjct: 74 GPIRGPLVLFPYGFLALWLGWCWRRGLSWYLSWGVGVLIGAGGFLVRVAVLSILLGENLW 133
Query: 84 ALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLH 132
LIT L + + P + + ++ LVL+ + +V LH
Sbjct: 134 VLITSAASQMLERLLGLVQIGGGPDLFQVQLMALALVLVQNLIYVLALH 182
>gi|123965735|ref|YP_001010816.1| hypothetical protein P9515_05001 [Prochlorococcus marinus str. MIT
9515]
gi|123200101|gb|ABM71709.1| Predicted membrane protein [Prochlorococcus marinus str. MIT 9515]
Length = 216
Score = 37.0 bits (84), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 68/139 (48%), Gaps = 1/139 (0%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G + + ++ +LLL+L GP++ +L +G+L F +G W +W LS +
Sbjct: 56 LRRGTSTALEGLIIQFLLLLILMGPIRGTLFLFPYGILAFWLGWSWFKQKNWWLSWSVGF 115
Query: 61 IARSAGAMGYILTSSFLIRENILALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLV 120
I + G + S L+ +N+ +IT + L + PS+ +I ++ L+
Sbjct: 116 ILGTLGFFIRVFALSTLVGDNLWIIITRASYGLLDKFIELFNLPFSPSIRIIQLVALLLI 175
Query: 121 LLNSGFFVFLLHLL-YSVF 138
+ +V +H+L YS+F
Sbjct: 176 IFQEMVYVLTIHILSYSLF 194
>gi|33860995|ref|NP_892556.1| hypothetical protein PMM0438 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33639727|emb|CAE18897.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 212
Score = 35.8 bits (81), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 28/117 (23%), Positives = 56/117 (47%), Gaps = 1/117 (0%)
Query: 23 SGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENI 82
GP++ +L +G+L F +G W + +W LS I + G + S L+ +N+
Sbjct: 74 MGPIRGTLFLFPYGILAFWLGWSWFMKKNWWLSWCGGFILGTLGFFIRVFALSTLVGDNL 133
Query: 83 LALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLHLL-YSVF 138
+IT + + + + PS+ +I ++ L++ +V +H+L YS+F
Sbjct: 134 WIIITRASYGLIDKFIELFNLPLSPSIRIIQLVAILLIIFQEMVYVLTIHILAYSLF 190
>gi|330992162|ref|ZP_08316110.1| putative allantoin permease [Gluconacetobacter sp. SXCC-1]
gi|329760361|gb|EGG76857.1| putative allantoin permease [Gluconacetobacter sp. SXCC-1]
Length = 491
Score = 35.8 bits (81), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 14 ATAMLLLVLSGPVKALAYLLTHG--VLGFSMGSLWRLGVDW-GLS-IFLCTIARSAGAMG 69
A A+++ SGP+ +G G+ W L ++ G S + +CTIA +A A G
Sbjct: 246 AVALVVSYFSGPMLNFGDFARYGRSAADIRRGNFWGLPFNFVGFSALTVCTIALTAPAFG 305
Query: 70 YILTSSFLIRENILALITINIHASLTFIFSAAGVNIV 106
+++T +I +L + + LTFI + AG+NIV
Sbjct: 306 HVMTDPVETVAHIDSLSAV-VFGMLTFIIATAGINIV 341
>gi|349688699|ref|ZP_08899841.1| NCS1 nucleoside transporter [Gluconacetobacter oboediens 174Bp2]
Length = 490
Score = 35.4 bits (80), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 50/97 (51%), Gaps = 5/97 (5%)
Query: 14 ATAMLLLVLSGPVKALAYLLTHG--VLGFSMGSLWRLGVDW-GLS-IFLCTIARSAGAMG 69
A A+++ SGP+ +G V G+ W L ++ G S + +CTIA +A A G
Sbjct: 246 AVALVVSYFSGPMLNFGDFARYGRGVSDIRRGNFWGLPFNFVGFSALTICTIALTAPAFG 305
Query: 70 YILTSSFLIRENILALITINIHASLTFIFSAAGVNIV 106
+++ +I +L + I LTFI + AG+NIV
Sbjct: 306 HVMIDPVETVAHIDSLTAV-IFGMLTFIIATAGINIV 341
>gi|452821901|gb|EME28926.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 323
Score = 35.4 bits (80), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 8/91 (8%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH----GVLGFSMGSLWRLGVDWGLSI 56
+RWG R + ++ GP+ L Y+LT GVL +M W +W I
Sbjct: 151 LRWGPRYSRLALCLLFCFFVLRPGPLYCLQYMLTQGLAVGVLTHTMWCQW----NWLFCI 206
Query: 57 FLCTIARSAGAMGYILTSSFLIRENILALIT 87
TIA G M I +SF+ +N LIT
Sbjct: 207 LSSTIAYLFGIMLEISLTSFVFGKNTWTLIT 237
>gi|254431659|ref|ZP_05045362.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197626112|gb|EDY38671.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 201
Score = 35.4 bits (80), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 50/109 (45%)
Query: 24 GPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCTIARSAGAMGYILTSSFLIRENIL 83
GP++ L +G+L +G WR ++W LS L ++ +AG + + S L+ EN+
Sbjct: 63 GPIRGPLLLFPYGLLALWLGWCWRRRLNWWLSWSLGSVIGAAGFLVRVAVLSVLVGENLW 122
Query: 84 ALITINIHASLTFIFSAAGVNIVPSMNVIYVIFGTLVLLNSGFFVFLLH 132
+IT L + S G+ + + V LV + + V LH
Sbjct: 123 VVITTAAAGLLERLASLVGLGAGIELAQVQVAAVLLVWVQNVIVVLALH 171
>gi|452821902|gb|EME28927.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 342
Score = 35.4 bits (80), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 29/91 (31%), Positives = 40/91 (43%), Gaps = 8/91 (8%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTH----GVLGFSMGSLWRLGVDWGLSI 56
+RWG R + ++ GP+ L Y+LT GVL +M W +W I
Sbjct: 170 LRWGPRYSRLALCLLFCFFVLRPGPLYCLQYMLTQGLAVGVLTHTMWCQW----NWLFCI 225
Query: 57 FLCTIARSAGAMGYILTSSFLIRENILALIT 87
TIA G M I +SF+ +N LIT
Sbjct: 226 LSSTIAYLFGIMLEISLTSFVFGKNTWTLIT 256
>gi|427704401|ref|YP_007047623.1| hypothetical protein Cyagr_3203 [Cyanobium gracile PCC 6307]
gi|427347569|gb|AFY30282.1| putative membrane protein (DUF2232) [Cyanobium gracile PCC 6307]
Length = 218
Score = 35.0 bits (79), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 25/87 (28%), Positives = 45/87 (51%)
Query: 1 MRWGVAAGRKTMVATAMLLLVLSGPVKALAYLLTHGVLGFSMGSLWRLGVDWGLSIFLCT 60
+R G + +V T +LL+ L GP++ L +G+L +G WR W L++ +
Sbjct: 57 LRHGWRCALEGVVVTTLLLVALMGPIRGPLVLFPYGLLSLWLGWGWRRRSSWWLTLGVGG 116
Query: 61 IARSAGAMGYILTSSFLIRENILALIT 87
+ +AG + + S L+ EN+ +IT
Sbjct: 117 LIGAAGFLVRVAVLSVLVGENLWVVIT 143
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.332 0.143 0.435
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,171,236,058
Number of Sequences: 23463169
Number of extensions: 80010289
Number of successful extensions: 338103
Number of sequences better than 100.0: 294
Number of HSP's better than 100.0 without gapping: 155
Number of HSP's successfully gapped in prelim test: 139
Number of HSP's that attempted gapping in prelim test: 337912
Number of HSP's gapped (non-prelim): 299
length of query: 158
length of database: 8,064,228,071
effective HSP length: 120
effective length of query: 38
effective length of database: 9,543,615,087
effective search space: 362657373306
effective search space used: 362657373306
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 71 (32.0 bits)