BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 025542
(251 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225451265|ref|XP_002272524.1| PREDICTED: uncharacterized protein LOC100264176 [Vitis vinifera]
Length = 242
Score = 260 bits (664), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 138/233 (59%), Positives = 164/233 (70%), Gaps = 35/233 (15%)
Query: 16 RRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKV 75
R+ + +HG++CCS T +NK TPQ+L+ AV GVTELLRL S F + RLD +
Sbjct: 39 RQHLQKRVHGIRCCSTT-------PDNKEKTPQLLRIAVGGVTELLRLVS-FGQNRLDSM 90
Query: 76 SYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSR 135
SYKQ E+LVS IDD++ IL+SDYENAYFVTGIFTS IY EDCIFEDPTI+FRG +LYSR
Sbjct: 91 SYKQGDELLVSCIDDVLLILKSDYENAYFVTGIFTSAIYDEDCIFEDPTIKFRGKDLYSR 150
Query: 136 NLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRIS 195
NL+LLVPFF++PSI LQ IEK G ++ T F+L S
Sbjct: 151 NLKLLVPFFDHPSIALQKIEK-----------------GSNSETK-----FVLA-----S 183
Query: 196 LICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
RTYLKLPW+PLISI GSTVY+LNDE KI RHAESWN+SAL+AVGQIFTP
Sbjct: 184 WKLRTYLKLPWRPLISIAGSTVYDLNDEFKIVRHAESWNISALQAVGQIFTPS 236
>gi|298204898|emb|CBI34205.3| unnamed protein product [Vitis vinifera]
Length = 216
Score = 260 bits (664), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 144/248 (58%), Positives = 169/248 (68%), Gaps = 38/248 (15%)
Query: 1 MAGIHWFGQTHQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTEL 60
MAGI W T S + +HG++CCS T +NK TPQ+L+ AV GVTEL
Sbjct: 1 MAGI-WCSSTVSST--LHLRKRVHGIRCCSTT-------PDNKEKTPQLLRIAVGGVTEL 50
Query: 61 LRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIF 120
LRL S F + RLD +SYKQ E+LVS IDD++ IL+SDYENAYFVTGIFTS IY EDCIF
Sbjct: 51 LRLVS-FGQNRLDSMSYKQGDELLVSCIDDVLLILKSDYENAYFVTGIFTSAIYDEDCIF 109
Query: 121 EDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTA 180
EDPTI+FRG +LYSRNL+LLVPFF++PSI LQ IEK G ++ T
Sbjct: 110 EDPTIKFRGKDLYSRNLKLLVPFFDHPSIALQKIEK-----------------GSNSETK 152
Query: 181 LGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEA 240
F+L S RTYLKLPW+PLISI GSTVY+LNDE KI RHAESWN+SAL+A
Sbjct: 153 -----FVLA-----SWKLRTYLKLPWRPLISIAGSTVYDLNDEFKIVRHAESWNISALQA 202
Query: 241 VGQIFTPG 248
VGQIFTP
Sbjct: 203 VGQIFTPS 210
>gi|449442907|ref|XP_004139222.1| PREDICTED: uncharacterized protein LOC101220191 [Cucumis sativus]
gi|449482948|ref|XP_004156451.1| PREDICTED: uncharacterized LOC101220191 [Cucumis sativus]
Length = 217
Score = 227 bits (579), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 120/227 (52%), Positives = 149/227 (65%), Gaps = 29/227 (12%)
Query: 22 SIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKY 81
++ ++C S + N N + K I+K VSG+TELLRLFSS R+D++ Q
Sbjct: 18 TVSVIRCFSSSPE--NSNGSRKKEASAIVKITVSGITELLRLFSSPISKRVDEIRDNQGE 75
Query: 82 EILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
E +V+G+D++V IL+SDYENAYFVTGIFTS IY +DC+FEDPTIRFRG ELYSRNL+LLV
Sbjct: 76 EFVVTGVDEVVNILKSDYENAYFVTGIFTSAIYTDDCLFEDPTIRFRGKELYSRNLKLLV 135
Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
PFF+ PSI LQ I K S G L + L RTY
Sbjct: 136 PFFDCPSIQLQTINK---------------------SNKSGVEFVLAAWKL------RTY 168
Query: 202 LKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
LKLPW+PLISIDG+T+YEL++E KI RHAESW+VSALEA+ QIF P
Sbjct: 169 LKLPWRPLISIDGNTLYELDEEFKIVRHAESWSVSALEAITQIFIPS 215
>gi|224125592|ref|XP_002319624.1| predicted protein [Populus trichocarpa]
gi|222858000|gb|EEE95547.1| predicted protein [Populus trichocarpa]
Length = 199
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/226 (53%), Positives = 147/226 (65%), Gaps = 50/226 (22%)
Query: 22 SIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKY 81
+IHG++CCSG D K + +T TPQILK AVSGVTELLR+FS K RL+KV+ K +
Sbjct: 17 TIHGIRCCSG-ATDNEKKSQTRTKTPQILKLAVSGVTELLRVFSFSGKERLEKVNNKDRD 75
Query: 82 EILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
EI VSGIDD++ IL+SDYENAYFVT GT+LYSRNL+LLV
Sbjct: 76 EISVSGIDDVIMILKSDYENAYFVT----------------------GTKLYSRNLKLLV 113
Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
PFF+ PSIGLQ+IEK G+++ T Y + RTY
Sbjct: 114 PFFDCPSIGLQDIEK-----------------GVNSET----------YFVLARWKLRTY 146
Query: 202 LKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
LK PW+PLISIDGSTVYEL+++LKI RHAESWNVSALEA+GQIFTP
Sbjct: 147 LKFPWRPLISIDGSTVYELDNKLKIVRHAESWNVSALEAIGQIFTP 192
>gi|351720803|ref|NP_001235141.1| uncharacterized protein LOC100306700 [Glycine max]
gi|255629317|gb|ACU15003.1| unknown [Glycine max]
Length = 255
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 121/231 (52%), Positives = 150/231 (64%), Gaps = 38/231 (16%)
Query: 24 HGVKCCSGTVNDGNK-------NTNNKTATPQILKFAVSGVTELLRLFS-SFDKGRLDKV 75
HGV+ CS T N N+ N ++ TPQILK AVSGVTELLRLFS S + L+K
Sbjct: 24 HGVQNCSNT-NTLNRRKIYACLNREKESETPQILKIAVSGVTELLRLFSPSSHQTSLEK- 81
Query: 76 SYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSR 135
+ + E S +DD++ I++SDY+NAYFVTG F+S IYAE+CIFEDPTI+FRG ELY+R
Sbjct: 82 -QRDQIEFPASSVDDVLRIIKSDYDNAYFVTGNFSSSIYAENCIFEDPTIKFRGRELYAR 140
Query: 136 NLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRIS 195
NL+LLVPFF+ SI LQ I+K V + L + L
Sbjct: 141 NLKLLVPFFDSASIILQKIDKDV---------------------DSDTNFVLASWKL--- 176
Query: 196 LICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFT 246
RT LKLPW+PLISIDGSTVYELN++ KI RH ESWN+SA+EAV QIF+
Sbjct: 177 ---RTNLKLPWRPLISIDGSTVYELNEDYKIVRHVESWNISAVEAVLQIFS 224
>gi|297828994|ref|XP_002882379.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297328219|gb|EFH58638.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 216
Score = 207 bits (526), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 124/248 (50%), Positives = 155/248 (62%), Gaps = 42/248 (16%)
Query: 1 MAGIHWFGQTHQSVNRRSILGSI-HGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTE 59
MAG+ GQ S++R S L + H +C SG+ NKT P +LK+AV GVTE
Sbjct: 1 MAGVA--GQI--SISRSSTLRRVNHRPRCFSGS-------PENKT--PAVLKWAVGGVTE 47
Query: 60 LLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDC 118
LLRLFS + + + +YE+ ++D++ ILRSDY NAYFVTGI TS IY+++C
Sbjct: 48 LLRLFSGAPSSSSIPTNKDRSRYELSAGNVEDVMEILRSDYRNAYFVTGILTSSIYSDEC 107
Query: 119 IFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNS 178
IFEDPTI F+GTELY RNL+LLVPF E SI LQN++K E R
Sbjct: 108 IFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMDK-----------SESSERN---- 152
Query: 179 TALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSAL 238
Y+L + RTYLKLPW+PLISI+GSTVY+L+ + KI RH ESWNVSAL
Sbjct: 153 -----------YIL-ATWKLRTYLKLPWRPLISINGSTVYDLDRDFKIVRHVESWNVSAL 200
Query: 239 EAVGQIFT 246
EAVGQIFT
Sbjct: 201 EAVGQIFT 208
>gi|357441471|ref|XP_003591013.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
gi|355480061|gb|AES61264.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
Length = 238
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/226 (50%), Positives = 139/226 (61%), Gaps = 37/226 (16%)
Query: 29 CSGTVNDGNKNT-------NNKTATPQILKFAVSGVTELLRLFSSFDKGRL--DKVSYKQ 79
CS + ++NT K TPQ+LK AVSGVTELLRLFS + + D + KQ
Sbjct: 34 CSNSYTPSSRNTCAIYNMVREKNETPQVLKIAVSGVTELLRLFSPPQQTSVLSDDIE-KQ 92
Query: 80 KYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRL 139
+ VS ++D++ I++SDY+N YFVTG FTS IY E+CIFEDPTI+F G +LY+RNL+L
Sbjct: 93 NNDSTVSSVEDVLIIIKSDYDNDYFVTGNFTSSIYTENCIFEDPTIKFSGRDLYARNLKL 152
Query: 140 LVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICR 199
LVPFF+ SI L IEK V E LR S R
Sbjct: 153 LVPFFDCASIKLLKIEKEV----------ESDTN-----------------FLRASWKLR 185
Query: 200 TYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
T LKLPW+PLI+IDGST YELN++ KI RH ESWNVSALEAV QIF
Sbjct: 186 TNLKLPWRPLIAIDGSTSYELNEDFKIVRHVESWNVSALEAVLQIF 231
>gi|21537187|gb|AAM61528.1| unknown [Arabidopsis thaliana]
Length = 216
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 107/202 (52%), Positives = 131/202 (64%), Gaps = 28/202 (13%)
Query: 46 TPQILKFAVSGVTELLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYF 104
TP +LK+AVSGVTE LRL S + + + K E+ +DD++ ILRSDY N YF
Sbjct: 34 TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRSKNEVTAGDVDDVMGILRSDYRNFYF 93
Query: 105 VTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLM 164
VTG+ TS IY++DCIFEDPTI F+GTELY RNL+LLVPF E SI LQN+EK
Sbjct: 94 VTGVLTSAIYSDDCIFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMEK-------- 145
Query: 165 VGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL 224
E R Y+L + RTYLKLPW+PLISI+G+TVY+L+ +
Sbjct: 146 ---SESSQRN---------------YIL-ATWKLRTYLKLPWRPLISINGNTVYDLDKDF 186
Query: 225 KITRHAESWNVSALEAVGQIFT 246
KI RH ESWNVSALEA+GQIFT
Sbjct: 187 KIVRHVESWNVSALEAIGQIFT 208
>gi|18397041|ref|NP_566242.1| uncharacterized protein [Arabidopsis thaliana]
gi|13878017|gb|AAK44086.1|AF370271_1 unknown protein [Arabidopsis thaliana]
gi|17104639|gb|AAL34208.1| unknown protein [Arabidopsis thaliana]
gi|332640630|gb|AEE74151.1| uncharacterized protein [Arabidopsis thaliana]
Length = 216
Score = 197 bits (501), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 107/202 (52%), Positives = 131/202 (64%), Gaps = 28/202 (13%)
Query: 46 TPQILKFAVSGVTELLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYF 104
TP +LK+AVSGVTE LRL S + + + K E+ +DD++ ILRSDY N YF
Sbjct: 34 TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRSKNEVTAGDVDDVMGILRSDYRNFYF 93
Query: 105 VTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLM 164
VTG+ TS IY++DCIFEDPTI F+GTELY RNL+LLVPF E SI LQN+EK
Sbjct: 94 VTGVLTSAIYSDDCIFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMEK-------- 145
Query: 165 VGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL 224
E R Y+L + RTYLKLPW+PLISI+G+TVY+L+ +
Sbjct: 146 ---SESSQRN---------------YIL-ATWKLRTYLKLPWRPLISINGNTVYDLDKDF 186
Query: 225 KITRHAESWNVSALEAVGQIFT 246
KI RH ESWNVSALEA+GQIFT
Sbjct: 187 KIVRHVESWNVSALEAIGQIFT 208
>gi|218199281|gb|EEC81708.1| hypothetical protein OsI_25317 [Oryza sativa Indica Group]
Length = 246
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 95/212 (44%), Positives = 129/212 (60%), Gaps = 33/212 (15%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQK-----YEI-LVSGIDDIVTILRSDY 99
P +++ AVS VTELLR+ + + +D K E+ +DD+V +L +DY
Sbjct: 58 APPLVRAAVSAVTELLRVLTPKNPRVVDDDLQKGDEGGGGAELDSPRSVDDVVAVLEADY 117
Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVF 159
+ AYF+TG FT +IY EDC+FEDPTI+FRG YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 118 QRAYFLTGNFTPDIYTEDCLFEDPTIKFRGRSRYSQNLDLLVPFFDSPSLELENIEK--- 174
Query: 160 LVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYE 219
GL T + + RTYL+LPW+PLI+I G+T Y+
Sbjct: 175 --------------GLRVETK----------FIMATWTLRTYLRLPWRPLIAIRGNTTYD 210
Query: 220 LNDELKITRHAESWNVSALEAVGQIFTPGDRN 251
L++E K+TRHAESW+VSALEA+GQIF P +
Sbjct: 211 LDEEYKVTRHAESWDVSALEAIGQIFVPAPKQ 242
>gi|357111214|ref|XP_003557409.1| PREDICTED: uncharacterized protein LOC100837510 [Brachypodium
distachyon]
Length = 231
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 93/223 (41%), Positives = 132/223 (59%), Gaps = 33/223 (14%)
Query: 23 IHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYE 82
+ V+C S + G++ P +++ AVS VTELLR S K R + + E
Sbjct: 33 LPSVRCSSSSPGSGDQG-----GAPPVVRAAVSAVTELLRALSPSKKPRQMGEAVDTELE 87
Query: 83 ILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVP 142
L ++D+V +L +DY+ AYF+TG FT IYAEDC++EDPTI+FRG Y++NL LLVP
Sbjct: 88 -LPRSVEDVVAVLEADYQRAYFLTGNFTLGIYAEDCLYEDPTIKFRGRSKYAQNLDLLVP 146
Query: 143 FFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
F + PS+ L++IEK GL T + + RTYL
Sbjct: 147 FLDSPSLELESIEK-----------------GLRAETKS----------IMATWTLRTYL 179
Query: 203 KLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
+LPW+PLI+I G+T Y+L++E K+ RHAESW+VSALEA+ Q+F
Sbjct: 180 RLPWRPLIAIRGNTTYDLDEEYKVVRHAESWDVSALEAIAQLF 222
>gi|357441473|ref|XP_003591014.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
gi|355480062|gb|AES61265.1| hypothetical protein MTR_1g081280 [Medicago truncatula]
Length = 252
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 103/213 (48%), Positives = 137/213 (64%), Gaps = 22/213 (10%)
Query: 29 CSGTVNDGNKNT-------NNKTATPQILKFAVSGVTELLRLFSSFDKGRL--DKVSYKQ 79
CS + ++NT K TPQ+LK AVSGVTELLRLFS + + D + KQ
Sbjct: 34 CSNSYTPSSRNTCAIYNMVREKNETPQVLKIAVSGVTELLRLFSPPQQTSVLSDDIE-KQ 92
Query: 80 KYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRL 139
+ VS ++D++ I++SDY+N YFVTG FTS IY E+CIFEDPTI+F G +LY+RNL+L
Sbjct: 93 NNDSTVSSVEDVLIIIKSDYDNDYFVTGNFTSSIYTENCIFEDPTIKFSGRDLYARNLKL 152
Query: 140 LVPFFEYPSIGLQNIEKFV-----FLVSLMVGRREGRMRGLHNS-TALGGHIFLLPYLLR 193
LVPFF+ SI L IEK V FL R ++R L +S + L +I++ +
Sbjct: 153 LVPFFDCASIKLLKIEKEVESDTNFL------RASWKLRCLPSSLSMLICNIWIYFFKHF 206
Query: 194 ISLICRTYLKLPWKPLISIDGSTVYELNDELKI 226
I CRT LKLPW+PLI+IDGST YELN++ K+
Sbjct: 207 IIPQCRTNLKLPWRPLIAIDGSTSYELNEDFKV 239
>gi|116780121|gb|ABK21558.1| unknown [Picea sitchensis]
Length = 239
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 95/208 (45%), Positives = 123/208 (59%), Gaps = 27/208 (12%)
Query: 40 TNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDY 99
K +P LK AV VTELLR+F++ + EI V+ IDD+V +L SDY
Sbjct: 51 AKKKKESPLFLKLAVGLVTELLRIFTTSQERSSFVSGSTTPNEISVADIDDVVAVLESDY 110
Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVF 159
+ +YF+TG FT IYAEDC F DPTI+F+G +LY RNL+LLVPFF PS+ L IE+
Sbjct: 111 KRSYFLTGEFTRSIYAEDCWFIDPTIKFQGRDLYQRNLQLLVPFFVNPSLLLCGIEQ--- 167
Query: 160 LVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYE 219
G T ++ RTYLKLPW+PLIS++GST+Y+
Sbjct: 168 --------------GFKGDTKF----------IKAKWYLRTYLKLPWRPLISLEGSTIYD 203
Query: 220 LNDELKITRHAESWNVSALEAVGQIFTP 247
L+D+LKI H ESWN+SA EA+ QIF P
Sbjct: 204 LDDDLKIAYHVESWNISAFEAITQIFLP 231
>gi|414883940|tpg|DAA59954.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
Length = 243
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/204 (43%), Positives = 120/204 (58%), Gaps = 31/204 (15%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEI----LVSGIDDIVTILRSDYEN 101
P +++ AVS VTELLR S R +Q E ++D++ +LR DY
Sbjct: 56 APPVVRAAVSAVTELLRALSPNKALRGAATQQQQGEEPDPDPTCGSVEDVLAVLRDDYRR 115
Query: 102 AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLV 161
AYF+TG FT IY EDC+FEDPTI+FRG YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 116 AYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSPSLELENIEK----- 170
Query: 162 SLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELN 221
G T ++ + RTYL+LPW+PL++I GST Y+L+
Sbjct: 171 ------------GSRAET----------KFVKATWKLRTYLRLPWRPLVAIGGSTTYDLD 208
Query: 222 DELKITRHAESWNVSALEAVGQIF 245
+ K+TRH+ESW+VSAL+A+ QIF
Sbjct: 209 GDFKVTRHSESWDVSALQAIAQIF 232
>gi|242043308|ref|XP_002459525.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
gi|241922902|gb|EER96046.1| hypothetical protein SORBIDRAFT_02g006110 [Sorghum bicolor]
Length = 245
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 89/205 (43%), Positives = 124/205 (60%), Gaps = 33/205 (16%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVS-----GIDDIVTILRSDYE 100
P +++ AVS VTELLR S +K D + +Q E S ++D++ +L+ DY
Sbjct: 59 APPVVRAAVSAVTELLRALSP-NKNLRDAAAQQQGEEPDSSPPRCGSVEDVLAVLQDDYR 117
Query: 101 NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFL 160
AYF+TG FT IY DC+FEDPTI+FRG YS+NL LLVPFF+ PS+ L++IEK
Sbjct: 118 RAYFLTGDFTPGIYTADCLFEDPTIKFRGLSRYSQNLYLLVPFFDSPSLELESIEK---- 173
Query: 161 VSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYEL 220
GL T ++ + RTYL+LPW+PLI+I G+T Y+L
Sbjct: 174 -------------GLRVETK----------FVKATWKLRTYLRLPWRPLIAIRGNTTYDL 210
Query: 221 NDELKITRHAESWNVSALEAVGQIF 245
N++ K+ RH+ESW+VSA+EA+ QIF
Sbjct: 211 NEDFKVIRHSESWDVSAVEAIAQIF 235
>gi|186509771|ref|NP_001118575.1| uncharacterized protein [Arabidopsis thaliana]
gi|227202522|dbj|BAH56734.1| AT3G04890 [Arabidopsis thaliana]
gi|332640631|gb|AEE74152.1| uncharacterized protein [Arabidopsis thaliana]
Length = 221
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/181 (49%), Positives = 112/181 (61%), Gaps = 28/181 (15%)
Query: 46 TPQILKFAVSGVTELLRLFS-SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYF 104
TP +LK+AVSGVTE LRL S + + + K E+ +DD++ ILRSDY N YF
Sbjct: 34 TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRSKNEVTAGDVDDVMGILRSDYRNFYF 93
Query: 105 VTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLM 164
VTG+ TS IY++DCIFEDPTI F+GTELY RNL+LLVPF E SI LQN+EK
Sbjct: 94 VTGVLTSAIYSDDCIFEDPTISFQGTELYERNLKLLVPFLEDASIELQNMEK-------- 145
Query: 165 VGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL 224
E R Y+L + RTYLKLPW+PLISI+G+TVY+L+ +
Sbjct: 146 ---SESSQRN---------------YILATWKL-RTYLKLPWRPLISINGNTVYDLDKDF 186
Query: 225 K 225
K
Sbjct: 187 K 187
>gi|326508892|dbj|BAJ86839.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 194
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 115/202 (56%), Gaps = 35/202 (17%)
Query: 25 GVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL 84
GV+C SG+ G+ +P +++ AVS VTELLR FS K V
Sbjct: 28 GVRCGSGSPGSGDD------GSPPVVRAAVSAVTELLRAFSP--KKPSPAVEAVDAESES 79
Query: 85 VSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF 144
+DD+V +L +DY AYF+TG FT IYAEDC+FEDPTI+FRG YS+NL LLVPFF
Sbjct: 80 PRSVDDVVAVLEADYRRAYFLTGNFTLGIYAEDCLFEDPTIKFRGRSRYSQNLDLLVPFF 139
Query: 145 EYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKL 204
+ PS+ L+NI+K GL T F++ S RTYL+L
Sbjct: 140 DSPSLQLENIDK-----------------GLRVDTK-----FII-----ASWTLRTYLRL 172
Query: 205 PWKPLISIDGSTVYELNDELKI 226
PW+PLI+I G+T Y+L++E KI
Sbjct: 173 PWRPLIAIRGNTTYDLDEEYKI 194
>gi|168004834|ref|XP_001755116.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693709|gb|EDQ80060.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 140
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 72/160 (45%), Positives = 97/160 (60%), Gaps = 25/160 (15%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
++ +LR DYE AYF+TG FT +YA+DC+F DPTI+FRG + Y RNL+LLVPFFE P +
Sbjct: 1 VLEVLREDYERAYFLTGDFTPTLYADDCLFADPTIQFRGRDRYQRNLKLLVPFFEEPKLI 60
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
L +I++ E H S ++ RTYL+LPWKPLI
Sbjct: 61 LFDIQE------------EDSKSRQHKSINANWNL-------------RTYLRLPWKPLI 95
Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
+DGST+Y L+ KI +H ESW++S +AV Q+F P DR
Sbjct: 96 DVDGSTLYVLDANSKIVKHIESWSISGWQAVQQLFVPSDR 135
>gi|222636639|gb|EEE66771.1| hypothetical protein OsJ_23496 [Oryza sativa Japonica Group]
Length = 249
Score = 137 bits (346), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 86/215 (40%), Positives = 115/215 (53%), Gaps = 36/215 (16%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYK-QKYEILVSG--------IDDIVTILR 96
P +++ AVS VTELLR+ + + V QK + G +DD++ +L
Sbjct: 58 APPLVRAAVSAVTELLRVLTPKNPRHAKVVDDDLQKGDEGGGGAELDSPRSVDDVIAVLE 117
Query: 97 SDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
+DY+ AYF+TG FT +IY EDC+FEDPTI+FRG YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 118 ADYQRAYFLTGNFTPDIYTEDCLFEDPTIKFRGRSRYSQNLDLLVPFFDSPSLELENIEK 177
Query: 157 FVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGST 216
GL T + + LR L L+ S
Sbjct: 178 -----------------GLRVET----KFIMATWTLRSRYSQNLDL------LVPFFDSP 210
Query: 217 VYELNDELKITRHAESWNVSALEAVGQIFTPGDRN 251
EL + K+TRHAESW+VSALEA+GQIF P +
Sbjct: 211 SLELENIEKVTRHAESWDVSALEAIGQIFVPAPKQ 245
>gi|147812119|emb|CAN65783.1| hypothetical protein VITISV_010563 [Vitis vinifera]
Length = 170
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 91/143 (63%), Gaps = 30/143 (20%)
Query: 16 RRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKV 75
R+ + +HG++CCS T +NK TPQ+L+ AV GVTELLRL S F + RLD +
Sbjct: 39 RQHLQKRVHGIRCCSTT-------PDNKEKTPQLLRIAVGGVTELLRLVS-FGQNRLDSM 90
Query: 76 SYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSR 135
SYKQ E+LVS IDD++ IL+SDYENAYFVT G +LYSR
Sbjct: 91 SYKQGDELLVSCIDDVLLILKSDYENAYFVT----------------------GKDLYSR 128
Query: 136 NLRLLVPFFEYPSIGLQNIEKFV 158
NL+LLVPFF++PSI LQ IEK +
Sbjct: 129 NLKLLVPFFDHPSIALQKIEKMM 151
>gi|302836648|ref|XP_002949884.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
nagariensis]
gi|300264793|gb|EFJ48987.1| hypothetical protein VOLCADRAFT_109744 [Volvox carteri f.
nagariensis]
Length = 237
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 79/218 (36%), Positives = 117/218 (53%), Gaps = 33/218 (15%)
Query: 37 NKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQ-KYEILVSGIDDIVTIL 95
N T ++ + AV+ +TE LRLF + R + V+ + + L G D+ ++
Sbjct: 44 NPKTRSRQKDNPLTVGAVTVMTETLRLFG-VGRERYEAVAAEPPRNRPLRRG--DVAGLM 100
Query: 96 R---SDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
R +D++ AY VTGI IY DC F DPT+ FRG +L+ RNL LLVPF + P++ L+
Sbjct: 101 RRITTDFKQAYLVTGILDDSIYDPDCFFADPTVAFRGVDLWKRNLALLVPFLDQPAVQLK 160
Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
+++ GR E G + +R RT+L+LPW+PLI I
Sbjct: 161 RVQRL--------GRDEQ-----------GAEV------VRAEWRLRTFLRLPWRPLIDI 195
Query: 213 DGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPGD 249
DG+T Y LN+E +I RH ESW VS +A+ Q+F P +
Sbjct: 196 DGATEYTLNEESNRIVRHVESWGVSGTQALLQMFRPSN 233
>gi|255542358|ref|XP_002512242.1| hypothetical protein RCOM_1428170 [Ricinus communis]
gi|223548203|gb|EEF49694.1| hypothetical protein RCOM_1428170 [Ricinus communis]
Length = 153
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 86/139 (61%), Gaps = 39/139 (28%)
Query: 122 DPTIRFR------------GTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
+PT+ FR GTELYSRNL+LLVPFFE PSIGL+N+EK
Sbjct: 8 NPTLSFRRVNGIRCCLGAPGTELYSRNLKLLVPFFESPSIGLENMEK------------- 54
Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
G++ + FLL + RTYLK PW+P ISIDGST+YELND+ K+ RH
Sbjct: 55 ----GVNTESN-----FLL-----ATWKLRTYLKFPWRPFISIDGSTIYELNDKFKVVRH 100
Query: 230 AESWNVSALEAVGQIFTPG 248
AESWNVSALEA+GQIFTP
Sbjct: 101 AESWNVSALEAIGQIFTPS 119
>gi|159481098|ref|XP_001698619.1| SOUL3-like protein [Chlamydomonas reinhardtii]
gi|158282359|gb|EDP08112.1| SOUL3-like protein [Chlamydomonas reinhardtii]
Length = 232
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 71/198 (35%), Positives = 104/198 (52%), Gaps = 31/198 (15%)
Query: 54 VSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILR---SDYENAYFVTGIFT 110
V+ +TE LR+ + ++ + + + G D+ ++R +D++ AYFVTG+
Sbjct: 56 VTVLTEALRVLGVGKERYVEVAAAPSRAAPVRRG--DVAGLMRRLTADFKQAYFVTGVLD 113
Query: 111 SEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREG 170
IY DC F DPT+ FRGT+L+ RNL LL PF E P++ L + + GR E
Sbjct: 114 DSIYEPDCYFADPTVAFRGTDLWKRNLALLTPFLEAPNVQLYGMRQL--------GRDED 165
Query: 171 RMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRH 229
G + +R T LKLPW+PLI +DG+T Y LN+E +I RH
Sbjct: 166 -----------GAEV------VRAEWRLTTILKLPWRPLIDLDGATEYTLNEESNRIVRH 208
Query: 230 AESWNVSALEAVGQIFTP 247
E W++S EAV Q+F P
Sbjct: 209 VEFWSISGTEAVLQMFRP 226
>gi|223948397|gb|ACN28282.1| unknown [Zea mays]
gi|414883941|tpg|DAA59955.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
Length = 214
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 73/115 (63%), Gaps = 4/115 (3%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEI----LVSGIDDIVTILRSDYEN 101
P +++ AVS VTELLR S R +Q E ++D++ +LR DY
Sbjct: 56 APPVVRAAVSAVTELLRALSPNKALRGAATQQQQGEEPDPDPTCGSVEDVLAVLRDDYRR 115
Query: 102 AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
AYF+TG FT IY EDC+FEDPTI+FRG YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 116 AYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSPSLELENIEK 170
>gi|223942399|gb|ACN25283.1| unknown [Zea mays]
gi|414883939|tpg|DAA59953.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
Length = 216
Score = 113 bits (282), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 57/119 (47%), Positives = 76/119 (63%), Gaps = 8/119 (6%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDK-GRLDKVSYKQKYE-------ILVSGIDDIVTILRS 97
P +++ AVS VTELLR S RL + + Q+ + ++D++ +LR
Sbjct: 56 APPVVRAAVSAVTELLRALSPNKALSRLCRGAATQQQQGEEPDPDPTCGSVEDVLAVLRD 115
Query: 98 DYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
DY AYF+TG FT IY EDC+FEDPTI+FRG YS+NL LLVPFF+ PS+ L+NIEK
Sbjct: 116 DYRRAYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSPSLELENIEK 174
>gi|307108290|gb|EFN56530.1| hypothetical protein CHLNCDRAFT_144158 [Chlorella variabilis]
Length = 172
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 82/157 (52%), Gaps = 25/157 (15%)
Query: 95 LRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNI 154
LR DY YFVT + T IY C F DPT+ FRG +LY RNL LLVPF P+I L+++
Sbjct: 32 LRRDYARQYFVTALVTDAIYDPGCYFADPTVSFRGRDLYKRNLALLVPFLWEPAIQLRSL 91
Query: 155 EKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDG 214
+ + A GG L L C +++LPW P + I+G
Sbjct: 92 RRL-------------------PAPAPGGSAQL---FAEWRLSC--WVRLPWAPYVDING 127
Query: 215 STVYELN-DELKITRHAESWNVSALEAVGQIFTPGDR 250
+T Y LN D +I RH E W+VSA++A+ + P +R
Sbjct: 128 TTTYTLNADGNQIVRHVEQWDVSAVQALLLLLRPSER 164
>gi|414883944|tpg|DAA59958.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
Length = 148
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 43/69 (62%), Positives = 55/69 (79%)
Query: 88 IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
++D++ +LR DY AYF+TG FT IY EDC+FEDPTI+FRG YS+NL LLVPFF+ P
Sbjct: 36 VEDVLAVLRDDYRRAYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSP 95
Query: 148 SIGLQNIEK 156
S+ L+NIEK
Sbjct: 96 SLELENIEK 104
>gi|239050713|ref|NP_001141609.2| hypothetical protein [Zea mays]
gi|238908815|gb|ACF86714.2| unknown [Zea mays]
gi|414883943|tpg|DAA59957.1| TPA: hypothetical protein ZEAMMB73_015011 [Zea mays]
Length = 146
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 43/69 (62%), Positives = 55/69 (79%)
Query: 88 IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
++D++ +LR DY AYF+TG FT IY EDC+FEDPTI+FRG YS+NL LLVPFF+ P
Sbjct: 36 VEDVLAVLRDDYRRAYFLTGDFTPGIYTEDCLFEDPTIKFRGLSRYSQNLGLLVPFFDSP 95
Query: 148 SIGLQNIEK 156
S+ L+NIEK
Sbjct: 96 SLELENIEK 104
>gi|302799533|ref|XP_002981525.1| hypothetical protein SELMODRAFT_58216 [Selaginella moellendorffii]
gi|300150691|gb|EFJ17340.1| hypothetical protein SELMODRAFT_58216 [Selaginella moellendorffii]
Length = 178
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 77/156 (49%), Gaps = 33/156 (21%)
Query: 94 ILRSDYENAYFVTGIF-TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
++R DYE +YFVTG F T+ IY DC F DP + F+G + + +N+ L F E ++ +
Sbjct: 52 LIRKDYEQSYFVTGTFMTTGIYKSDCEFADPFVSFKGLKRFKQNVSNLGAFMEKSTLKIA 111
Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
+ ++ SL VG R R L LPW+P IS
Sbjct: 112 SWDEKE--DSLKVGWR-----------------------------FRCVLALPWRPAISA 140
Query: 213 DGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
GST Y +DE KI +H ESWN+S + V Q+F P
Sbjct: 141 SGSTEYFFDDESGKICKHVESWNISPADGVRQLFKP 176
>gi|302760261|ref|XP_002963553.1| hypothetical protein SELMODRAFT_68183 [Selaginella moellendorffii]
gi|300168821|gb|EFJ35424.1| hypothetical protein SELMODRAFT_68183 [Selaginella moellendorffii]
Length = 178
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 77/156 (49%), Gaps = 33/156 (21%)
Query: 94 ILRSDYENAYFVTGIF-TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
++R DYE +YFVTG F T+ IY DC F DP + F+G + + +N+ L F + ++ +
Sbjct: 52 LIRKDYEQSYFVTGTFMTTGIYKSDCEFADPFVSFKGLKRFKQNVSNLGAFMKKSTLKIA 111
Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
+ ++ SL VG R R L LPW+P IS
Sbjct: 112 SWDEKE--DSLKVGWR-----------------------------FRCVLALPWRPAISA 140
Query: 213 DGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
GST Y +DE KI +H ESWN+S + V Q+F P
Sbjct: 141 SGSTEYFFDDESGKICKHVESWNISPADGVRQLFKP 176
>gi|224125600|ref|XP_002319626.1| predicted protein [Populus trichocarpa]
gi|222858002|gb|EEE95549.1| predicted protein [Populus trichocarpa]
Length = 150
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 37/49 (75%), Positives = 44/49 (89%)
Query: 108 IFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
+FTS IY EDCIFEDPT++F+GT+LYSRNL LLVPFF+ PS GLQ+IEK
Sbjct: 35 VFTSTIYDEDCIFEDPTVKFQGTKLYSRNLNLLVPFFDCPSTGLQDIEK 83
>gi|116791970|gb|ABK26182.1| unknown [Picea sitchensis]
Length = 247
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/222 (31%), Positives = 96/222 (43%), Gaps = 40/222 (18%)
Query: 30 SGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI- 88
S + + N N+NN +LK A G L S F + S +Q + V+G
Sbjct: 55 SPSPEESNSNSNN-----ILLKLAWYGSEALGIAASVFRPSTAVQPSGEQSSQNEVAGSI 109
Query: 89 --DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
D +V ++ D+E +YFVTG T+ IY EDC F DP FRG + + RN E
Sbjct: 110 PKDAVVDAIKKDFERSYFVTGNLTTIIYEEDCEFADPAGSFRGLKRFKRNCSNFGSLLEK 169
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
++ L + E F RG+ GH IC L PW
Sbjct: 170 SNMKLMDWEDF-------------EDRGI-------GH---------WRFIC--VLAFPW 198
Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
+P++S G T Y + + KI RH E WNV + + QIF P
Sbjct: 199 RPILSATGYTEYYYDAQSGKICRHVEHWNVPKMVLLKQIFKP 240
>gi|12322860|gb|AAG51422.1|AC009465_22 hypothetical protein; 57237-56085 [Arabidopsis thaliana]
Length = 154
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 65/124 (52%), Gaps = 35/124 (28%)
Query: 46 TPQILKFAVSGVTELLRLFS--------SFDKGRLD-----KVSYKQKYEILVSGIDDIV 92
TP +LK+AVSGVTE LRL S + +K RL + K E+ +DD++
Sbjct: 34 TPAVLKWAVSGVTEFLRLISGAPSSTSIATNKDRLVVEIDLNTGCRSKNEVTAGDVDDVM 93
Query: 93 TILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
ILRSDY N YFVTG TELY RNL+LLVPF E SI LQ
Sbjct: 94 GILRSDYRNFYFVTG----------------------TELYERNLKLLVPFLEDASIELQ 131
Query: 153 NIEK 156
N+EK
Sbjct: 132 NMEK 135
>gi|168003427|ref|XP_001754414.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694516|gb|EDQ80864.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 197
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 99/226 (43%), Gaps = 40/226 (17%)
Query: 25 GVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL 84
GV C + K A+P ++K G + ++F R + E+
Sbjct: 8 GVVACEA--KNETKTPPTADASP-LVKMVWYGSEAFGKFVAAF---RPSATVEPEDEEMF 61
Query: 85 VSGI--DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVP 142
V + ++V +++ DYE +YFVTG T IY DC F DP + F+G + +N+ L
Sbjct: 62 VGPVPRSEVVDLIKKDYERSYFVTGNMTMGIYEADCEFADPFVAFKGLRRFKQNVSNLGS 121
Query: 143 FFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
F E S+ + + +++ E R+ R C L
Sbjct: 122 FMEESSLKITDWQEY-----------EDRV------------------YARWRFNC--IL 150
Query: 203 KLPWKPLISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
LPW+P+++ GST Y + + KI +H E+W++S + V Q+F P
Sbjct: 151 GLPWRPILAATGSTEYFFDSNSGKICKHVENWDISPADGVRQLFKP 196
>gi|357504325|ref|XP_003622451.1| hypothetical protein MTR_7g037920 [Medicago truncatula]
gi|355497466|gb|AES78669.1| hypothetical protein MTR_7g037920 [Medicago truncatula]
Length = 243
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 95/223 (42%), Gaps = 40/223 (17%)
Query: 29 CSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI 88
C+GT N+N ++ T LK A +ELL + +S + D+ +L + I
Sbjct: 41 CNGT----NQNQESQPQTNAFLKVAWYS-SELLGIAASAFRSPSDEEEASPPQRLLQT-I 94
Query: 89 D--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
D +V ++ D++ +YFVTG T Y EDC F DP F+G + + RN E
Sbjct: 95 DRASVVDTIKQDFQRSYFVTGDLTLNAYEEDCEFADPAGSFKGLQRFKRNCTNFGSLLEK 154
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
++ L E F +G+ GH R S I L PW
Sbjct: 155 STMNLMKWEDF-------------EDKGI-------GH-------WRFSCI----LSFPW 183
Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPG 248
+P++S G T Y + + K+ RH E WNV + QI P
Sbjct: 184 RPILSATGYTEYYFDTQSGKVYRHVEHWNVPKMALFKQILRPS 226
>gi|224125602|ref|XP_002319627.1| predicted protein [Populus trichocarpa]
gi|222858003|gb|EEE95550.1| predicted protein [Populus trichocarpa]
Length = 122
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/156 (36%), Positives = 71/156 (45%), Gaps = 60/156 (38%)
Query: 1 MAGIHWFGQTHQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTEL 60
MAGI T +R IH ++CCSG ++ N N+ +T TPQILK AVSGV EL
Sbjct: 1 MAGICSLNPTTLMTFKR-----IHRIRCCSGATDNEN-NSQTRTKTPQILKLAVSGVIEL 54
Query: 61 LRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIF 120
LR+FS SG D + +I F+
Sbjct: 55 LRVFS-------------------FSGKDSLYSI---------FIP-------------- 72
Query: 121 EDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
+SRNL+LLVPFF+ PSIGLQNIEK
Sbjct: 73 ------------HSRNLKLLVPFFDCPSIGLQNIEK 96
>gi|388502340|gb|AFK39236.1| unknown [Medicago truncatula]
Length = 230
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 95/221 (42%), Gaps = 40/221 (18%)
Query: 29 CSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI 88
C+GT N+N ++ T LK A +ELL + +S + D+ +L + I
Sbjct: 41 CNGT----NQNQESQPQTNAFLKVAWYS-SELLGIAASAFRSPSDEEEASPPQRLLQT-I 94
Query: 89 D--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
D +V ++ D++ +YFVTG T Y EDC F DP F+G + + RN E
Sbjct: 95 DRASVVDTIKQDFQRSYFVTGDLTLNAYEEDCEFADPAGSFKGLQRFKRNCTNFGSLLEK 154
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
++ L E F +G+ GH R S I L PW
Sbjct: 155 STMNLMKWEDF-------------EDKGI-------GH-------WRFSCI----LSFPW 183
Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFT 246
+P++S G T Y + + K+ RH E WNV + QI +
Sbjct: 184 RPILSATGYTEYYFDTQSGKVYRHVEHWNVPKMALFKQILS 224
>gi|356560693|ref|XP_003548624.1| PREDICTED: uncharacterized protein LOC100801965 [Glycine max]
Length = 238
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/223 (29%), Positives = 94/223 (42%), Gaps = 42/223 (18%)
Query: 29 CSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGI 88
CSGT N N +++ ILK A +ELL + +S + ++ Q+ L+ I
Sbjct: 35 CSGT----NPNKESESQNNAILKLAWYS-SELLGIAASVFRSPSNEEVPPQR---LLQTI 86
Query: 89 D--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
D +V ++ D+E +YFVTG T Y EDC F DP F+G + + RN E
Sbjct: 87 DRAAVVDTIKQDFERSYFVTGDLTLNAYEEDCEFADPAGSFKGLQRFKRNCTNFGSLLEK 146
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
++ L E F +G+ GH R + I L PW
Sbjct: 147 SNMKLMKWEDF-------------EDKGI-------GH-------WRFNCI----LSFPW 175
Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPG 248
+P++S G T Y + K+ RH E WNV QI P
Sbjct: 176 RPILSATGYTEYYFDARSGKVCRHVEHWNVPKKALFKQILRPS 218
>gi|312283147|dbj|BAJ34439.1| unnamed protein product [Thellungiella halophila]
Length = 242
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 91/227 (40%), Gaps = 40/227 (17%)
Query: 29 CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVS 86
C G N N+ +K A P+ +LKFA G L S+F +E+ V
Sbjct: 39 CRGQ-NPANEPQTSKGAEPENVLLKFAWYGSELLGIAASAFRSPASPPPPTVTGFEVPVD 97
Query: 87 GIDDIVTI-----LRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
V + ++ D++ +YFVTG T E+Y E C F DP F+G + RN
Sbjct: 98 CSGRAVRVAVVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLTRFKRNCTNFG 157
Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
E ++ L E F +G+ GH + S +
Sbjct: 158 TLIEKSNMKLMKWENF-------------EDKGV-------GH-------WKFSCV---- 186
Query: 202 LKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
+ PWKP++S G T Y + E KI RH E WNV + Q+ P
Sbjct: 187 MSFPWKPILSATGYTEYYFDTESGKICRHVEHWNVPKIALFKQLLRP 233
>gi|255541694|ref|XP_002511911.1| conserved hypothetical protein [Ricinus communis]
gi|223549091|gb|EEF50580.1| conserved hypothetical protein [Ricinus communis]
Length = 235
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 99/246 (40%), Gaps = 43/246 (17%)
Query: 11 HQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKG 70
HQ++ I I+ K N N +T ++ L +ELL + +SF
Sbjct: 18 HQNI----IFSPIYPRKFNRIRCNGENPSTRKESEPENALLKVAWYSSELLGIAASF--F 71
Query: 71 RLDKVSYKQKYEILVSGIDDI-----VTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTI 125
R V+ + +++ + + I V ++ D++ +YFVTG TS+ Y EDC F DP
Sbjct: 72 RSPSVTSEANFKLSIDDLGSIDRATVVQSIKDDFQRSYFVTGNLTSDAYEEDCEFADPAG 131
Query: 126 RFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHI 185
F+G + RN E ++ L E F +G
Sbjct: 132 SFKGLRRFKRNCTNFGLLLEKSNMKLMKWEDF-------------------EDKGIG--- 169
Query: 186 FLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQI 244
R S I + PWKP++S G T Y + + ++ RH E WNV + + QI
Sbjct: 170 -----YWRFSCI----MSFPWKPILSATGYTEYYFDVQSGRVCRHVEHWNVPKMALLKQI 220
Query: 245 FTPGDR 250
P R
Sbjct: 221 LRPSQR 226
>gi|224067926|ref|XP_002302602.1| predicted protein [Populus trichocarpa]
gi|222844328|gb|EEE81875.1| predicted protein [Populus trichocarpa]
Length = 245
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 89/198 (44%), Gaps = 39/198 (19%)
Query: 58 TELLRLFSSFDKGRLDKVSYKQKYEIL----VSGIDD---IVTILRSDYENAYFVTGIFT 110
+ELL + +SF + + + + +I VSG+ D ++ ++ D++ +YFVTG T
Sbjct: 67 SELLGIAASFFRSPSNSSTEASEKDIKLGKDVSGVIDRAVVMETIKEDFQKSYFVTGSLT 126
Query: 111 SEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREG 170
E Y E+C F DP F+G + + RN E ++ L E F
Sbjct: 127 LEAYEENCEFADPAGSFKGLQRFKRNCTNFGLLIEKSNMKLTKWEDF------------- 173
Query: 171 RMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRH 229
+G+ GH R S + + PWKP++S G T Y +++ ++ RH
Sbjct: 174 EDKGI-------GH-------WRFSCV----MSFPWKPILSATGYTEYYFDEQSGRVCRH 215
Query: 230 AESWNVSALEAVGQIFTP 247
E WNV + + Q+ P
Sbjct: 216 VEHWNVPKMALLKQLLKP 233
>gi|323455378|gb|EGB11246.1| hypothetical protein AURANDRAFT_61587 [Aureococcus anophagefferens]
Length = 492
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 95/207 (45%), Gaps = 33/207 (15%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFV 105
T K +S +T+++ L +G D ++++ L + L D+ N Y
Sbjct: 24 TSSTTKNIISTLTKVVNLQGG--RGEADLAPRARQFDTLSGAA--VKEGLAVDFGNEYLW 79
Query: 106 TGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE--YPSIGLQNIEKF-VFLVS 162
+G T E+Y EDC+F DPT+ F G + NL L + E P G + ++ F + LV
Sbjct: 80 SGKITPELYDEDCVFTDPTLSFAGLRTFEANLANLDFWIESLVPE-GRRRVDLFDLALVP 138
Query: 163 LMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYEL-N 221
R + R L L+LPW P++ ++G+T Y L
Sbjct: 139 GATDRVRAKWRMLG------------------------ALRLPWGPVLDLNGTTTYTLGG 174
Query: 222 DELKITRHAESWNVSALEAVGQIFTPG 248
D +I+ + E+W+++A +A+GQ+ PG
Sbjct: 175 DGGRISSYDETWDMTAGDALGQLLRPG 201
>gi|225454326|ref|XP_002277369.1| PREDICTED: uncharacterized protein LOC100258452 [Vitis vinifera]
Length = 256
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 103/237 (43%), Gaps = 50/237 (21%)
Query: 23 IHGVKCCSGTVNDG-NKNTNNKTAT---PQ--ILKFAVSGVTELLRLFSSFDKGRLDKVS 76
IH +C DG N +N+ TA P+ +LK A G +ELL + +SF + +
Sbjct: 47 IHRFRC------DGENPRSNSSTAQESEPENALLKVAWYG-SELLGIAASFFRSPSSVEA 99
Query: 77 YKQKYEILVSG---ID--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTE 131
++ ++ G +D +V ++ D++ +YFVTG T Y +DC F DP FRG
Sbjct: 100 PERAIDLAGDGSGAVDRAALVETIKEDFQRSYFVTGNLTLSAYEDDCEFADPAGSFRGLR 159
Query: 132 LYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYL 191
+ RN + ++ L E F +G+ GH
Sbjct: 160 RFKRNCTNFGSLIQKSNMKLMKWEDF-------------EDKGI-------GH------- 192
Query: 192 LRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
R S + L PWKP++S G T Y + + K+ RH E WNV + + QI P
Sbjct: 193 WRFSCV----LSFPWKPILSATGYTEYYFDSQSGKVCRHVEHWNVPKMALLKQILRP 245
>gi|297745341|emb|CBI40421.3| unnamed protein product [Vitis vinifera]
Length = 240
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 103/237 (43%), Gaps = 50/237 (21%)
Query: 23 IHGVKCCSGTVNDG-NKNTNNKTAT---PQ--ILKFAVSGVTELLRLFSSFDKGRLDKVS 76
IH +C DG N +N+ TA P+ +LK A G +ELL + +SF + +
Sbjct: 31 IHRFRC------DGENPRSNSSTAQESEPENALLKVAWYG-SELLGIAASFFRSPSSVEA 83
Query: 77 YKQKYEILVSG---ID--DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTE 131
++ ++ G +D +V ++ D++ +YFVTG T Y +DC F DP FRG
Sbjct: 84 PERAIDLAGDGSGAVDRAALVETIKEDFQRSYFVTGNLTLSAYEDDCEFADPAGSFRGLR 143
Query: 132 LYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYL 191
+ RN + ++ L E F +G+ GH
Sbjct: 144 RFKRNCTNFGSLIQKSNMKLMKWEDF-------------EDKGI-------GH------- 176
Query: 192 LRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
R S + L PWKP++S G T Y + + K+ RH E WNV + + QI P
Sbjct: 177 WRFSCV----LSFPWKPILSATGYTEYYFDSQSGKVCRHVEHWNVPKMALLKQILRP 229
>gi|449441428|ref|XP_004138484.1| PREDICTED: uncharacterized protein LOC101218208 [Cucumis sativus]
Length = 239
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 98/227 (43%), Gaps = 41/227 (18%)
Query: 29 CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL-- 84
C G N + NN+ + P+ +LK A G +ELL + +S+ + LD + + E+
Sbjct: 33 CQGN-NPTTDSPNNQESKPENAVLKVAWYG-SELLGIAASYLRPPLDVQTPLRAQELTTD 90
Query: 85 VSGIDD---IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
VSG IV ++ D+ +YFVTG T + Y E C F DP F+G + RN
Sbjct: 91 VSGSIPRPLIVETIKEDFRRSYFVTGNLTLQAYEEQCEFADPAGSFKGLRRFKRNCTNFG 150
Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
+ ++ L E F +G+ GH + S I
Sbjct: 151 SLVDKSNMKLTKWEGF-------------EDKGI-------GH-------WKFSCI---- 179
Query: 202 LKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
L PW+P++S G T Y + K+ RH E WNV + + QI P
Sbjct: 180 LSFPWRPILSATGYTEYYFDARSGKVCRHVEHWNVPKMALLKQILRP 226
>gi|449495217|ref|XP_004159768.1| PREDICTED: uncharacterized LOC101218208 [Cucumis sativus]
Length = 248
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 98/227 (43%), Gaps = 41/227 (18%)
Query: 29 CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEIL-- 84
C G N + NN+ + P+ +LK A G +ELL + +S+ + LD + + E+
Sbjct: 42 CQGN-NPTTDSPNNQESKPENAVLKVAWYG-SELLGIAASYLRPPLDVQTPLRAQELTTD 99
Query: 85 VSGIDD---IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
VSG IV ++ D+ +YFVTG T + Y E C F DP F+G + RN
Sbjct: 100 VSGSIPRPLIVETIKEDFRRSYFVTGNLTLQAYEEQCEFADPAGSFKGLRRFKRNCTNFG 159
Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
+ ++ L E F +G+ GH + S I
Sbjct: 160 SLVDKSNMKLTKWEGF-------------EDKGI-------GH-------WKFSCI---- 188
Query: 202 LKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
L PW+P++S G T Y + K+ RH E WNV + + QI P
Sbjct: 189 LSFPWRPILSATGYTEYYFDARSGKVCRHVEHWNVPKMALLKQILRP 235
>gi|302783284|ref|XP_002973415.1| hypothetical protein SELMODRAFT_413750 [Selaginella moellendorffii]
gi|300159168|gb|EFJ25789.1| hypothetical protein SELMODRAFT_413750 [Selaginella moellendorffii]
Length = 170
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 80/166 (48%), Gaps = 33/166 (19%)
Query: 87 GIDDIVTILRSDYEN-AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
+D+I I+R D E+ Y+VTG T IY+EDC F DPT +G + Y ++ L F+
Sbjct: 36 AVDEIEQIIRQDIEDRQYYVTGDLTRSIYSEDCRFIDPTTNVQGVDSYVAAVKSL---FD 92
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
PS + + + + S + R++G LKLP
Sbjct: 93 -PSDSKHELLE-IHVTSPTTIEAKWRLQG--------------------------SLKLP 124
Query: 206 WKPLISI-DGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
WKP I+ +G+T Y LND+ + H E+W++S L A ++ TP R
Sbjct: 125 WKPHIAAYEGTTTYTLNDDGFVASHKETWDISLLTAFIELLTPSFR 170
>gi|242074082|ref|XP_002446977.1| hypothetical protein SORBIDRAFT_06g026170 [Sorghum bicolor]
gi|241938160|gb|EES11305.1| hypothetical protein SORBIDRAFT_06g026170 [Sorghum bicolor]
Length = 262
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 68/158 (43%), Gaps = 32/158 (20%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
+V +++D+ +YFVTG T Y EDC F DP FRG + RN E ++
Sbjct: 128 VVEAVKADFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLLEKSNMK 187
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
L E + + + GH R S + + PW+P++
Sbjct: 188 LTKWED------------------MEDKSI--GH-------WRFSCV----MSFPWRPIL 216
Query: 211 SIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
S G T Y + E K+ RH E+WNV + + QIF P
Sbjct: 217 SATGYTEYYFDAESGKVCRHVENWNVPKMALLRQIFRP 254
>gi|298713681|emb|CBJ48872.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 192
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 66/146 (45%), Gaps = 32/146 (21%)
Query: 103 YFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVS 162
Y+VTG T +Y DC+F DPT R E Y+ ++LL N + V L+S
Sbjct: 67 YYVTGDLTPSLYRPDCLFTDPTTRVESVERYTAAVKLL--------FDPDNAQ--VDLLS 116
Query: 163 LMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS-IDGSTVYELN 221
L V HIF + YLK PW P + G+T+Y L+
Sbjct: 117 LAVKD--------------DNHIF-------AEWQLQGYLKFPWHPYVKPYTGTTLYTLD 155
Query: 222 DELKITRHAESWNVSALEAVGQIFTP 247
+ I+RH E+W++SALEA TP
Sbjct: 156 SDGLISRHEETWSISALEAARATITP 181
>gi|298708412|emb|CBJ48475.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 252
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 32/163 (19%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
D V L++D++ YFVTG ++Y +C+F DP + F G + + NL L + S+
Sbjct: 117 DAVARLKTDFDRFYFVTGQMDLDLYEPECVFADPFVAFEGRDRFKNNLDNLGSLMQDVSL 176
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ + E+ ++ R L LPWKP
Sbjct: 177 DVTSWEE-------------------------------AEASIKTKWRFRCVLGLPWKPT 205
Query: 210 ISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTPGDRN 251
++ G T + N E +I RH ESW + ++A+ Q+ PG +
Sbjct: 206 LAAAGGTEFFFNQESGRIERHVESWEIEPIDALKQLIRPGRKK 248
>gi|194701212|gb|ACF84690.1| unknown [Zea mays]
gi|195612814|gb|ACG28237.1| hypothetical protein [Zea mays]
gi|414585788|tpg|DAA36359.1| TPA: hypothetical protein ZEAMMB73_852215 [Zea mays]
gi|414585789|tpg|DAA36360.1| TPA: hypothetical protein ZEAMMB73_852215 [Zea mays]
Length = 258
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 67/158 (42%), Gaps = 32/158 (20%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
+ +++D+ +YFVTG T Y EDC F DP FRG + RN E ++
Sbjct: 124 VAEAIKADFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLLEKSNMK 183
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
L E L + + GH R S + + PW+P++
Sbjct: 184 LTKWED------------------LEDKSI--GH-------WRFSCV----MSFPWRPIL 212
Query: 211 SIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
S G T Y + E K+ RH E+WNV + + QIF P
Sbjct: 213 SATGYTEYYFDAESGKVCRHVENWNVPKMALLRQIFRP 250
>gi|30690220|ref|NP_182134.2| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
gi|48310194|gb|AAT41772.1| At2g46100 [Arabidopsis thaliana]
gi|50198900|gb|AAT70469.1| At2g46100 [Arabidopsis thaliana]
gi|330255549|gb|AEC10643.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
Length = 240
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 97/245 (39%), Gaps = 37/245 (15%)
Query: 8 GQTHQSVNRRSILGSIHGVKCCSGTVNDGNKNTNNKTATPQILKFAVSGVTELLRLFSS- 66
G +S+ R + GV C D + + +LK A G +ELL + +S
Sbjct: 19 GDGGRSIFRTNRRFEATGVSCRGQNPTDEPQTSKGPEPDNVLLKIAWYG-SELLGIAASV 77
Query: 67 FDKGRLDKVSYKQKYEILVSGID---DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDP 123
F + + + SG +V ++ D++ +YFVTG T E+Y E C F DP
Sbjct: 78 FRSPETSPIVTGFEVPVDCSGRAVRVAVVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADP 137
Query: 124 TIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGG 183
F+G + RN E ++ L E F +G+ G
Sbjct: 138 AGSFKGLARFKRNCTNFGSLIEKSNMKLMKWENF-------------EDKGI-------G 177
Query: 184 HIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVG 242
H + S + + PWKP++S G T Y + E KI RH E WNV +
Sbjct: 178 H-------WKFSCV----MSFPWKPILSATGYTEYYFDTESGKICRHVEHWNVPKIALFK 226
Query: 243 QIFTP 247
Q+ P
Sbjct: 227 QLLRP 231
>gi|302789448|ref|XP_002976492.1| hypothetical protein SELMODRAFT_416514 [Selaginella moellendorffii]
gi|300155530|gb|EFJ22161.1| hypothetical protein SELMODRAFT_416514 [Selaginella moellendorffii]
Length = 170
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 51/166 (30%), Positives = 79/166 (47%), Gaps = 33/166 (19%)
Query: 87 GIDDIVTILRSDYEN-AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
+D+I I+R D E+ Y+VTG T IY++DC F DPT +G + Y ++ L F+
Sbjct: 36 AVDEIEQIIRQDIEDRQYYVTGDLTRSIYSDDCRFIDPTTNVQGVDSYVAAVKSL---FD 92
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
PS + + + + S + R++G LKLP
Sbjct: 93 -PSDSKHELLE-IRVTSPTTIEAKWRLQG--------------------------SLKLP 124
Query: 206 WKPLISI-DGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
WKP I+ +G+T Y LND+ + H E+W++S L A + TP R
Sbjct: 125 WKPHIAAYEGTTTYTLNDDGFVASHKETWDISLLTAFIEFLTPSFR 170
>gi|297824667|ref|XP_002880216.1| hypothetical protein ARALYDRAFT_483757 [Arabidopsis lyrata subsp.
lyrata]
gi|297326055|gb|EFH56475.1| hypothetical protein ARALYDRAFT_483757 [Arabidopsis lyrata subsp.
lyrata]
Length = 241
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 94/226 (41%), Gaps = 40/226 (17%)
Query: 29 CSGTVNDGNKNTNNKTATPQ--ILKFAVSGVTELLRLFSS-FDKGRLDKVSYKQKYEILV 85
C G N N+ +K P+ +LK A G +ELL + +S F + + +
Sbjct: 39 CRGQ-NPTNEPQTSKAPEPENVVLKIAWYG-SELLGIAASVFRSPETSPIVTGFEVPVDC 96
Query: 86 SGID---DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVP 142
SG +V ++ D++ +YFVTG T E+Y + C F DP F+G + RN
Sbjct: 97 SGRAVRVAVVDSIKQDFKRSYFVTGNLTPEVYEDKCEFADPAGSFKGLARFKRNCTNFGS 156
Query: 143 FFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
E ++ L E F +G+ GH + S + +
Sbjct: 157 LIEKSNMKLMKWENF-------------EDKGV-------GH-------WKFSCV----M 185
Query: 203 KLPWKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
PWKP++S G T Y + E KI RH E WNV + Q+ P
Sbjct: 186 SFPWKPILSATGYTEYFFDPESGKICRHVEHWNVPKIALFKQLLRP 231
>gi|168007562|ref|XP_001756477.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692516|gb|EDQ78873.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 324
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 85/182 (46%), Gaps = 42/182 (23%)
Query: 66 SFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTI 125
+F +G+ +S +Q EI+ + I + Y+VTG T IY ++C F DPT
Sbjct: 178 AFTEGKQTNLSVEQVKEIIENDI----------RKGQYYVTGNLTPSIYRDNCKFTDPTT 227
Query: 126 RFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHI 185
G Y +RLL F+ P + Q L+S+ V R + + LGG
Sbjct: 228 VVTGVNKYVAAVRLL---FD-PDLSQQE------LLSIDVTGP----RTIEVNWRLGG-- 271
Query: 186 FLLPYLLRISLICRTYLKLPWKPLIS-IDGSTVYELNDELKITRHAESWNVSALEAVGQI 244
YLKLPWKP I+ +GST Y L D+ I H E+WN+S L A+ ++
Sbjct: 272 ---------------YLKLPWKPHITPYEGSTRYTLGDDGLIESHDETWNISLLTALLEL 316
Query: 245 FT 246
FT
Sbjct: 317 FT 318
>gi|357168254|ref|XP_003581559.1| PREDICTED: uncharacterized protein LOC100843329 [Brachypodium
distachyon]
Length = 249
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 65/162 (40%), Gaps = 32/162 (19%)
Query: 87 GIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
G + ++ D+ +YFVTG T Y EDC F DP FRG + RN E
Sbjct: 111 GPAQVAQAIKDDFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLGRFKRNCTNFGSLLEK 170
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
+ L E L + + GH R S + + PW
Sbjct: 171 SDMKLTKWED------------------LQDKSV--GH-------WRFSCV----MSFPW 199
Query: 207 KPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
+P++S G T Y + E K+ RH E WNV + + QIF P
Sbjct: 200 RPVLSATGYTEYFFDAESGKVCRHVEHWNVPKMALLRQIFRP 241
>gi|384247090|gb|EIE20578.1| hypothetical protein COCSUDRAFT_10876, partial [Coccomyxa
subellipsoidea C-169]
Length = 100
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/117 (35%), Positives = 59/117 (50%), Gaps = 24/117 (20%)
Query: 113 IYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRM 172
IY+E+C F DPT+ F G + + RNL+LLVPF E P I L ++ +VG RE +
Sbjct: 7 IYSEECFFGDPTVSFTGLQKWRRNLQLLVPFLEDPQIELFSLG--------VVGSREKQA 58
Query: 173 RGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
L+ TYL+LPW+P I + GST Y L++ + R+
Sbjct: 59 ----------------AVRLKADWRLSTYLRLPWRPFIDVLGSTEYTLDEASQQARN 99
>gi|326500042|dbj|BAJ90856.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326510695|dbj|BAJ87564.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 239
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 66/163 (40%), Gaps = 32/163 (19%)
Query: 86 SGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
SG + ++ D+ +YFVTG T Y +DC F DP FRG + RN E
Sbjct: 100 SGRFQVAEAIKDDFARSYFVTGNLTLGAYEDDCEFADPAGSFRGLGRFKRNCTNFGSLLE 159
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
++ L E L +G R S + + P
Sbjct: 160 KSNMKLTKWED---LEEKSIGH------------------------WRFSCV----MSFP 188
Query: 206 WKPLISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
W+P++S G T Y + E K+ RH E+WNV + + QIF P
Sbjct: 189 WRPILSATGYTEYYFDGESGKVCRHVENWNVPKMALLRQIFRP 231
>gi|308804017|ref|XP_003079321.1| unnamed protein product [Ostreococcus tauri]
gi|116057776|emb|CAL53979.1| unnamed protein product [Ostreococcus tauri]
Length = 232
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/161 (26%), Positives = 66/161 (40%), Gaps = 31/161 (19%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+ + +R DY+ YFV+G Y + C F DP FRG E + RN
Sbjct: 23 EAIEAIREDYDENYFVSGAGKMRAYDDQCEFADPFASFRGVERFKRN------------- 69
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ N+ + V L + E GL + S + L LPW+PL
Sbjct: 70 -VSNLGGLMRDVDLKITSFEETKDGLQTE-------------WKFSCV----LDLPWRPL 111
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
++ G T + LND+ + RH E W+V + + + P +
Sbjct: 112 LAASGGTTHTLNDDHLVVRHYERWDVEPGKVLKALLRPASK 152
>gi|125591376|gb|EAZ31726.1| hypothetical protein OsJ_15875 [Oryza sativa Japonica Group]
Length = 243
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 89/221 (40%), Gaps = 38/221 (17%)
Query: 33 VNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRL-----DKVSYKQKYEILVSG 87
+ G N + ++ +L+ A G +ELL + +SF + D ++ G
Sbjct: 44 ASPGAGNETSSSSENAVLRAAWYG-SELLGIAASFFRPSQPPTEGDSAGAVEEAASEPQG 102
Query: 88 IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
+ ++ D+ +YFVTG T + Y EDC F DP F G + + RN E
Sbjct: 103 RAQVAEAVKDDFARSYFVTGNLTLKAYEEDCEFADPAGSFNGLQRFKRNCTNFGSLLEKS 162
Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
++ L E L + + GH R S + + PW+
Sbjct: 163 NMKLTKWED------------------LEDKSI--GH-------WRFSCV----MSFPWR 191
Query: 208 PLISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
P++S G T Y + K+ RH E WNV + + QIF P
Sbjct: 192 PILSATGYTEYYFDAGSGKVCRHVEHWNVPKMALLRQIFRP 232
>gi|125549434|gb|EAY95256.1| hypothetical protein OsI_17076 [Oryza sativa Indica Group]
Length = 243
Score = 67.4 bits (163), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 89/221 (40%), Gaps = 38/221 (17%)
Query: 33 VNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRL-----DKVSYKQKYEILVSG 87
+ G N + ++ +L+ A G +ELL + +SF + D ++ G
Sbjct: 44 ASPGAGNETSSSSENAVLRAAWYG-SELLGIAASFFRPSQPPTEGDGAGAVEEAASEPQG 102
Query: 88 IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
+ ++ D+ +YFVTG T + Y EDC F DP F G + + RN E
Sbjct: 103 RAQVAEAVKDDFARSYFVTGNLTLKAYEEDCEFADPAGSFNGLQRFKRNCTNFGSLLEKA 162
Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
++ L E L + + GH R S + + PW+
Sbjct: 163 NMKLTKWED------------------LEDKSI--GH-------WRFSCV----MSFPWR 191
Query: 208 PLISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
P++S G T Y + K+ RH E WNV + + QIF P
Sbjct: 192 PILSATGYTEYYFDAGSGKVCRHVEHWNVPKMALLRQIFRP 232
>gi|302844526|ref|XP_002953803.1| hypothetical protein VOLCADRAFT_94648 [Volvox carteri f.
nagariensis]
gi|300260911|gb|EFJ45127.1| hypothetical protein VOLCADRAFT_94648 [Volvox carteri f.
nagariensis]
Length = 344
Score = 66.6 bits (161), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 72/160 (45%), Gaps = 32/160 (20%)
Query: 89 DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPS 148
+DI+ +R DY+ YFV G+ E YA DC+F DP + F GT+ + +N+ L
Sbjct: 120 EDILAAIRRDYDVQYFVRGLADMEAYAPDCVFADPFVSFSGTQRFKQNVSNLGGLMS--D 177
Query: 149 IGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
I L ++ ++V S + R R S + L LPW+P
Sbjct: 178 IRL-DVYEWVEGTSTLETR------------------------WRFSAL----LDLPWRP 208
Query: 209 LISIDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
++ G T + ++ D + RH E W+V + V Q+ P
Sbjct: 209 RLAAAGGTTHIIDLDRGLVVRHEERWDVEPAKVVAQLLQP 248
>gi|303276212|ref|XP_003057400.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461752|gb|EEH59045.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 186
Score = 66.6 bits (161), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 70/157 (44%), Gaps = 32/157 (20%)
Query: 92 VTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGL 151
+ +LR DY+++YFV+G+ YA DC F DP + F+G + + +N +
Sbjct: 60 IALLREDYDSSYFVSGVGELAAYAPDCEFSDPFVAFKGVDRFKQN--------------V 105
Query: 152 QNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS 211
N+ + V L + E L S R S + L LPWKP ++
Sbjct: 106 GNLGGMMRDVDLKITGWEESESDLVTS-------------WRFSCV----LDLPWKPKLA 148
Query: 212 IDGSTVYELN-DELKITRHAESWNVSALEAVGQIFTP 247
G T + + D K+ +H E W+V + V Q+ +P
Sbjct: 149 AAGGTTHVFDPDTGKVVKHIERWDVEPKKVVKQLLSP 185
>gi|223998977|ref|XP_002289161.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974369|gb|EED92698.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 324
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/173 (27%), Positives = 81/173 (46%), Gaps = 27/173 (15%)
Query: 86 SGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
S + + + ++ D + +YF++G IY EDC+F DP + F G + V ++E
Sbjct: 154 SSLQETLARVKLDNDRSYFLSGEVDRLIYDEDCVFADPFVAFNG--------KNFVVWYE 205
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIF-----LLPYLLR------- 193
+ ++ + F L+ EGR R + N LG I +L Y +
Sbjct: 206 CMYM-IRTLSSFNLLLV-----TEGRDRFIDNLANLGSFITNYDAKMLKYDVEKDGQEIV 259
Query: 194 ISLICRTYLKLPWKPLISIDGSTVYELNDELK-ITRHAESWNVSALEAVGQIF 245
++ + L LPW P+++ YE++ + IT H ESW++ A E V QIF
Sbjct: 260 TKVMVKLQLNLPWAPVLAWPWGVTYEVDPQTYLITSHVESWDIEAWEGVKQIF 312
>gi|302850531|ref|XP_002956792.1| hypothetical protein VOLCADRAFT_97844 [Volvox carteri f.
nagariensis]
gi|300257852|gb|EFJ42095.1| hypothetical protein VOLCADRAFT_97844 [Volvox carteri f.
nagariensis]
Length = 179
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 73/157 (46%), Gaps = 35/157 (22%)
Query: 94 ILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV-PFFEYPSIGL 151
ILR D E YFVTG T EI+A+DC F DPT G Y L +L P +Y + L
Sbjct: 46 ILRHDLAEGQYFVTGKLTREIFADDCRFVDPTNDVTGLSKYLTALGVLFDP--QYSKVEL 103
Query: 152 QNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS 211
+I +VG R+ + LGG YL PW+PL+
Sbjct: 104 LDIR--------VVGPRQ-----VEADWRLGG-----------------YLVFPWRPLVQ 133
Query: 212 -IDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
G TVY LN+ ++ ++W++SA A+ + FTP
Sbjct: 134 PFTGHTVYTLNESGLVSEQRQTWSISAATALLESFTP 170
>gi|159486318|ref|XP_001701188.1| SOUL3-like protein [Chlamydomonas reinhardtii]
gi|158271888|gb|EDO97698.1| SOUL3-like protein [Chlamydomonas reinhardtii]
Length = 263
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 70/163 (42%), Gaps = 37/163 (22%)
Query: 89 DDIVTILRSDYE-NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE-- 145
+++ +LR D E YFVTG T EI+A+DC F DPT G Y L++L F+
Sbjct: 125 EEVAAVLRRDLEVGQYFVTGDLTREIFADDCRFIDPTNDVTGLSKYLTALKVL---FDPA 181
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
Y + L +I R + LGG YL P
Sbjct: 182 YSKVQLLSISA-------------SGPRTVEADWRLGG-----------------YLVFP 211
Query: 206 WKPLI-SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
W P + G TVY L+DE I ++W SA EA+ + FTP
Sbjct: 212 WNPRVEPFKGHTVYTLSDEGLIVEQRQTWEKSAAEALRESFTP 254
>gi|116792207|gb|ABK26275.1| unknown [Picea sitchensis]
Length = 222
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 87/212 (41%), Gaps = 38/212 (17%)
Query: 41 NNKTATPQILKFAVSGVTELLRLFSSFDKGRL---DKVSYKQKYEILVSGIDDIVTILRS 97
N A P KF V + + L S F G V+ + K L I++I +I+ +
Sbjct: 39 NCTGAIPGAGKFGRRTVLQKMALLSVFIAGNNACGGGVASQNKKTNL--SIEEIKSIIEN 96
Query: 98 DYENA-YFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEK 156
D Y+VTG T EIY +C F DPT G E Y ++ L P+ Q
Sbjct: 97 DISKGQYYVTGDMTKEIYENNCRFRDPTTDLTGLEKYISAVKFLF----NPNTSKQE--- 149
Query: 157 FVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP-LISIDGS 215
L+S+ V + P+ + YLK PW P ++ +GS
Sbjct: 150 ---LLSIAV---------------------VDPHTIEAKWRLEGYLKFPWNPHILPYEGS 185
Query: 216 TVYELNDELKITRHAESWNVSALEAVGQIFTP 247
T Y L+D I H E+W++S A+ + P
Sbjct: 186 TRYVLDDRGLIMSHEETWDISVWTALQEFLLP 217
>gi|434388028|ref|YP_007098639.1| hypothetical protein Cha6605_4167 [Chamaesiphon minutus PCC 6605]
gi|428019018|gb|AFY95112.1| hypothetical protein Cha6605_4167 [Chamaesiphon minutus PCC 6605]
Length = 127
Score = 64.7 bits (156), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 68/156 (43%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+T LR DY F + EIYAED F+DP +FRG Y + + + +FE+P +
Sbjct: 2 DIITTLRQDYAKFPFDQ---SYEIYAEDVYFKDPLTQFRGLPRYRKTIEFIQKWFEHPHL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ I++ V ++ R +LL + LPW P
Sbjct: 59 EMYEIDR----VDRLITTR---------------------WLLSWN------TPLPWHPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
I I G + LND I H + W+ S L+ + Q F
Sbjct: 88 IEIPGKSELTLNDADLIISHIDYWDRSPLDVLKQHF 123
>gi|412989127|emb|CCO15718.1| predicted protein [Bathycoccus prasinos]
Length = 288
Score = 63.9 bits (154), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 40/173 (23%), Positives = 80/173 (46%), Gaps = 30/173 (17%)
Query: 78 KQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNL 137
KQ+ ++ + + I ++ + DYE YF+ G + Y+ C+F DP + F G + + +N+
Sbjct: 68 KQEENVMTTRAETIDSLAK-DYEKNYFIGGESEMKAYSSSCVFADPFVSFTGLDRFKQNV 126
Query: 138 RLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLI 197
L L+++E V ++ + N +GG IF + +
Sbjct: 127 GNL-------GTSLRDVECKV-------------LKTVDN--GVGGVIFYWKFSAVVD-- 162
Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
LPW+P ++ G+T + L+D K+ +H E+W+V + ++ P +
Sbjct: 163 -----ALPWRPKLAASGNTTHVLDDANKVVKHIEAWDVDPWVVLKKLLVPASK 210
>gi|302840933|ref|XP_002952012.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
nagariensis]
gi|300262598|gb|EFJ46803.1| hypothetical protein VOLCADRAFT_92622 [Volvox carteri f.
nagariensis]
Length = 200
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 74/158 (46%), Gaps = 37/158 (23%)
Query: 86 SGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
S + I I R E Y+VTG T +++A DC+F+DPT+ G E Y R L+ L F+
Sbjct: 52 SALKAIDAIRRDFVERQYYVTGNLTPQLFAPDCVFKDPTVEVVGVEPYVRALQAL---FD 108
Query: 146 YPSIG---LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYL 202
PS L +I ++ R EG ++ L G + + PY
Sbjct: 109 -PSTSRADLISIRATAPSTVVLRWRLEGSLK-------LAG-LKIKPY------------ 147
Query: 203 KLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEA 240
G+TVY L+D+ K+ RH E+W++S ++A
Sbjct: 148 ----------TGTTVYTLSDDGKVIRHEETWDISTVDA 175
>gi|354564738|ref|ZP_08983914.1| Protein of unknown function DUF2358 [Fischerella sp. JSC-11]
gi|353549864|gb|EHC19303.1| Protein of unknown function DUF2358 [Fischerella sp. JSC-11]
Length = 130
Score = 63.5 bits (153), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 65/156 (41%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIV IL+ DY+ T IYAE+ F+D +FRG +LY ++ + FF P +
Sbjct: 2 DIVQILKEDYQR---FPANQTYSIYAENVYFQDQVFKFRGIQLYKLMIKFIETFFLNPKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I+K EG + + LPWKP
Sbjct: 59 DLHDIQK------------EGDTIKTKWTLSWNS-------------------PLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
ISI G + LN + I H + WN S L+ + Q F
Sbjct: 88 ISIPGWSELRLNCDGLIISHIDYWNCSRLDVIKQHF 123
>gi|186684325|ref|YP_001867521.1| hypothetical protein Npun_R4203 [Nostoc punctiforme PCC 73102]
gi|186466777|gb|ACC82578.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 123
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 64/156 (41%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T IYA D F+DP +FRG + Y + + + +F P +
Sbjct: 2 DIIEILKKDYQRFPINQ---TYSIYAPDVYFQDPLNKFRGVKRYQKMINFIQTWFLDPKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L NI++ LG I + LPWKP
Sbjct: 59 DLHNIQR------------------------LGDTI-------KTEWTLSWNTPLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
ISI G + LN + I H + WN S L+ V Q F
Sbjct: 88 ISIPGWSELGLNSDGLIVSHVDYWNCSPLDVVKQHF 123
>gi|303281246|ref|XP_003059915.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458570|gb|EEH55867.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 266
Score = 63.2 bits (152), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 66/163 (40%), Gaps = 34/163 (20%)
Query: 88 IDDIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
IDDI L D E YFVTG T E++A+DC F DPT G Y L LL
Sbjct: 129 IDDIAKQLEVDLREGQYFVTGKLTRELFADDCRFVDPTNDVVGLSRYLTALDLL------ 182
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
N V L + V P + + YL+ PW
Sbjct: 183 ----FDNDTSSVELFDVRVTS---------------------PTTIEADWSLQGYLRFPW 217
Query: 207 KPLI-SIDGSTVYELNDELK-ITRHAESWNVSALEAVGQIFTP 247
KP + DG T Y LN E I ++W++S +A+ + FTP
Sbjct: 218 KPRVEPYDGHTTYVLNPENGLIQTQTQTWSISGAKAIAESFTP 260
>gi|255082958|ref|XP_002504465.1| predicted protein [Micromonas sp. RCC299]
gi|226519733|gb|ACO65723.1| predicted protein [Micromonas sp. RCC299]
Length = 237
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 73/180 (40%), Gaps = 35/180 (19%)
Query: 70 GRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRG 129
G + +V ++ ++ + D + R + YFVTG T EI+A+DC F DPT G
Sbjct: 85 GEVKRVGGDKRTDLTAEKVRD--QLERDLRDGQYFVTGNLTREIFADDCRFTDPTNDVVG 142
Query: 130 TELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLP 189
Y L LL V L + V P
Sbjct: 143 LSRYLTALGLL----------FDPATSSVDLYDIKV---------------------TSP 171
Query: 190 YLLRISLICRTYLKLPWKPLISI-DGSTVYELNDELKITRHA-ESWNVSALEAVGQIFTP 247
+ + YL+ PWKP + G TVY L+ E K+ R ++WN+S +A+ + FTP
Sbjct: 172 NTVEADWQLQGYLRFPWKPRVDPYSGHTVYSLDPETKLVRSQFQTWNISGAKAIAESFTP 231
>gi|427723122|ref|YP_007070399.1| hypothetical protein Lepto7376_1208 [Leptolyngbya sp. PCC 7376]
gi|427354842|gb|AFY37565.1| Protein of unknown function DUF2358 [Leptolyngbya sp. PCC 7376]
Length = 126
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 34/159 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T EIYA+D F+DP FRG Y + + L +F+ +
Sbjct: 2 DIIEILQRDYQQ---FPKEQTFEIYADDVFFKDPLNEFRGVTKYKKMIGFLGRWFQNIQL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I++ N+T +++ C LPW+P
Sbjct: 59 ELHEIQR-------------------DNATICTD--------WTLNMTC----PLPWQPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
+SI G ++ E+N++ I H + WN + L+ +GQ+F G
Sbjct: 88 LSISGYSLLEVNEQDLIISHIDYWNDAPLKVLGQVFKFG 126
>gi|298713054|emb|CBJ48829.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 240
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 89/208 (42%), Gaps = 29/208 (13%)
Query: 46 TPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFV 105
T +K V G+T+L F+ D+ + +S + +S + + +R +Y Y
Sbjct: 55 TSGSVKAFVGGLTDLFVRFAGGDEDS-EALSADTSAKASLSEAE-LEAGIRQEYAKNYLW 112
Query: 106 TGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMV 165
TG ++Y EDC F DPT+ F G Y RN+ L + + ++N ++ L
Sbjct: 113 TGDINEDLYEEDCNFTDPTLSFSGLSTYKRNVGSLQGVLD---LFVRNSRSVLYSCEL-- 167
Query: 166 GRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDEL- 224
R+E R ++ L+LPW P I + G T + +
Sbjct: 168 -RQEDSC-----------------VQTRWRMVGD--LRLPWGPTIDVVGRTTFSYDSSRG 207
Query: 225 -KITRHAESWNVSALEAVGQIFTPGDRN 251
+I + E W + A EA+ QIF PG R+
Sbjct: 208 NRIFSYDEVWEMEAAEALMQIFRPGTRD 235
>gi|159487172|ref|XP_001701609.1| SOUL heme-binding protein [Chlamydomonas reinhardtii]
gi|158271550|gb|EDO97367.1| SOUL heme-binding protein [Chlamydomonas reinhardtii]
Length = 323
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 71/163 (43%), Gaps = 34/163 (20%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+I+ +R+DY+ YFV G+ Y C+F DP + F GT+ + +N
Sbjct: 102 EILASIRADYDEQYFVRGVSEMSAYDPQCVFADPFVSFSGTQRFKQN------------- 148
Query: 150 GLQNIEKFVFLVSLMV-GRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
+ N+ + + L V +EG N+ L R S + L LPW+P
Sbjct: 149 -VSNLGGLMSDIKLDVYDWQEG-----DNT---------LTTRWRFSCL----LDLPWRP 189
Query: 209 LISIDGSTVYELNDELK-ITRHAESWNVSALEAVGQIFTPGDR 250
++ G T + ++ + RH E W V + V Q+FTP +
Sbjct: 190 RLAAAGGTTHVIDPARGLVVRHEERWEVEPAKVVAQLFTPAAK 232
>gi|434402210|ref|YP_007145095.1| hypothetical protein Cylst_0032 [Cylindrospermum stagnale PCC 7417]
gi|428256465|gb|AFZ22415.1| hypothetical protein Cylst_0032 [Cylindrospermum stagnale PCC 7417]
Length = 125
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 71/158 (44%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ + V ++ IYA D F+DP +FRG + Y + + + +F P +
Sbjct: 2 DIIQILKDDYQR-FPVNQTYS--IYAPDVYFQDPLNQFRGVKRYQKMINFIQTWFINPKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L NI++ +G + + LG + LPWKP
Sbjct: 59 DLHNIQQ--------LGDK------IKTEWTLGWNT-----------------PLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
ISI G + LN + I H + W+ S L+ + Q F P
Sbjct: 88 ISISGWSELGLNSDGFIVSHIDYWHCSRLDVLKQHFFP 125
>gi|255076935|ref|XP_002502130.1| predicted protein [Micromonas sp. RCC299]
gi|226517395|gb|ACO63388.1| predicted protein [Micromonas sp. RCC299]
Length = 176
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 68/160 (42%), Gaps = 32/160 (20%)
Query: 89 DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPS 148
+ +V +L+ DY+ +YFV+G+ Y DC F DP + F+G + + +N
Sbjct: 47 EQVVALLKEDYDQSYFVSGVGELAAYDPDCEFADPFVSFKGVDRFKQN------------ 94
Query: 149 IGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
+ N+ + + L + + R L S R S + L LPW+P
Sbjct: 95 --VGNLGGMMRDIDLKITGWDERADELETS-------------WRFSCV----LDLPWRP 135
Query: 209 LISIDGSTVYELNDEL-KITRHAESWNVSALEAVGQIFTP 247
++ G T + + K+ RH E W+V V Q+ P
Sbjct: 136 KLAAAGGTTHVFDPATGKVIRHVERWDVRPGVVVKQLLLP 175
>gi|428208414|ref|YP_007092767.1| hypothetical protein Chro_3440 [Chroococcidiopsis thermalis PCC
7203]
gi|428010335|gb|AFY88898.1| Protein of unknown function DUF2358 [Chroococcidiopsis thermalis
PCC 7203]
Length = 126
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 63/156 (40%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ ILR DY+ T IYA D F+DP RFRG E Y + + + +F +
Sbjct: 2 DIIQILRQDYQR---FPDNQTYSIYASDVFFQDPLNRFRGVERYKQMINFINTWFIAVKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I R+EG ++ LPWKP
Sbjct: 59 DLHDI------------RQEGDT-------------------IKTEWTLSWNTPLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
I+I G + LN + I H + WN S L+ V Q F
Sbjct: 88 IAIPGWSELHLNPQGLIDSHIDYWNCSRLDVVKQHF 123
>gi|427708881|ref|YP_007051258.1| hypothetical protein Nos7107_3539 [Nostoc sp. PCC 7107]
gi|427361386|gb|AFY44108.1| Protein of unknown function DUF2358 [Nostoc sp. PCC 7107]
Length = 127
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 67/162 (41%), Gaps = 42/162 (25%)
Query: 90 DIVTILRSDY----ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
DI+ L++DY EN T IYAE+ F+D +FRG +LY ++ + FF
Sbjct: 2 DIIETLKNDYQRFPENQ-------TYSIYAENVYFQDMVFKFRGVKLYQWMIKFIKTFFL 54
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
P + L NI++ LG I + LP
Sbjct: 55 NPKMDLHNIQR------------------------LGDTI-------KTEWTLSWNSPLP 83
Query: 206 WKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
WKP I+I G + LN + I H + WN S ++ + Q F P
Sbjct: 84 WKPRIAISGWSELLLNADDLIISHIDYWNCSQVDVLKQHFFP 125
>gi|307108501|gb|EFN56741.1| hypothetical protein CHLNCDRAFT_144171 [Chlorella variabilis]
Length = 202
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 78/179 (43%), Gaps = 36/179 (20%)
Query: 70 GRLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRG 129
GR+++V + + +S +DI + R+ E YFVTG T EI+A+DC F+DPT G
Sbjct: 45 GRVERVGGDKL--VGLSPEEDI--LARNLREGQYFVTGDLTPEIFADDCRFKDPTNETAG 100
Query: 130 TELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLP 189
Y + L LL F S +++ L G P
Sbjct: 101 LSRYMKALTLL------------------FDAS-------------YSAVQLVGISVTSP 129
Query: 190 YLLRISLICRTYLKLPWKPLIS-IDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
+ YL+ PW P + G TVY LN++ I ++W+++ A+ + FTP
Sbjct: 130 TTIEADWKMGGYLRFPWHPRVEPFLGHTVYHLNEQGLIALQDQTWSITGTTALVESFTP 188
>gi|440679743|ref|YP_007154538.1| Protein of unknown function DUF2358 [Anabaena cylindrica PCC 7122]
gi|428676862|gb|AFZ55628.1| Protein of unknown function DUF2358 [Anabaena cylindrica PCC 7122]
Length = 129
Score = 60.5 bits (145), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 67/154 (43%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL++DY+ T IYA D F+DP +FRG ELY ++ + FF +
Sbjct: 2 DIIEILKADYQRFPINQ---TYSIYAPDVYFQDPVFKFRGLELYKWMIKFIQTFFLNLKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L NI+ R+E ++ +L + LPWKP
Sbjct: 59 DLHNIQ-----------RQEDTIKS------------------EWTLSWNS--PLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
ISI G + LN + I H + W+ S L+ + Q
Sbjct: 88 ISISGWSELRLNADSLIISHIDYWHGSRLDVLKQ 121
>gi|300863520|ref|ZP_07108472.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338476|emb|CBN53614.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 129
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 68/158 (43%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T IYAE F+DP RF+G + Y + + + FF+ +
Sbjct: 2 DIIEILKEDYKK---FPAAQTYSIYAEKVYFQDPMNRFKGIDRYQQMIGFMSTFFKDIKL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L NI + VG R + L + +PW+P
Sbjct: 59 DLHNISQ--------VGDR------IETRWTLSWTV-----------------PVPWQPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
I++ G + +LN E I+ H + WN S L+ + Q+ P
Sbjct: 88 IAVPGWSELKLNTEGLISSHIDYWNCSRLDVLKQLLFP 125
>gi|307109547|gb|EFN57785.1| hypothetical protein CHLNCDRAFT_143130 [Chlorella variabilis]
Length = 257
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 44/177 (24%), Positives = 71/177 (40%), Gaps = 33/177 (18%)
Query: 71 RLDKVSYKQKYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGT 130
+LD S ++ + + ++R D+ YFVTG Y DC+F DP F GT
Sbjct: 29 KLDGPSLAEQQAGAALKREQALALIREDFIQNYFVTGQGALAAYDPDCLFADPFASFNGT 88
Query: 131 ELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPY 190
+ RN+ L L +I+ + L G E R + ++T G
Sbjct: 89 ARFKRNVSNLGGL-------LTDID--LTLTDWQEGEDELRTKWRFSATLSG-------- 131
Query: 191 LLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
LPWKPL++ G T + ++ +H ESW++ + Q+ P
Sbjct: 132 -------------LPWKPLLAAAGGTTFATG---RVCKHIESWDIEPARVLRQLLKP 172
>gi|428303699|ref|YP_007140524.1| hypothetical protein Cri9333_0009 [Crinalium epipsammum PCC 9333]
gi|428245234|gb|AFZ11014.1| Protein of unknown function DUF2358 [Crinalium epipsammum PCC 9333]
Length = 127
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 67/154 (43%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ + V ++ IYAE+ F+DP +FRG E Y + +R + +F P +
Sbjct: 2 DIIEILKQDYQ-KFPVDQTYS--IYAENVYFQDPLNKFRGVERYKQMIRFINQWFINPQL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
LH LG I R +L T LPWKP
Sbjct: 59 ------------------------DLHEINQLGDTI-----KTRWTLSWTT--PLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + LN E I H + W+ S + + Q
Sbjct: 88 IAIPGWSELHLNAEKLIDSHIDYWDCSRFDVITQ 121
>gi|255089807|ref|XP_002506825.1| predicted protein [Micromonas sp. RCC299]
gi|226522098|gb|ACO68083.1| predicted protein [Micromonas sp. RCC299]
Length = 226
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 25/160 (15%)
Query: 94 ILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQN 153
I R E Y TG E+Y C F DPT+ F G E + RNL L P
Sbjct: 83 IAREFTEAKYLWTGDINPEMYDLYCTFTDPTLSFAGLETFQRNLANLQP----------- 131
Query: 154 IEKFVFLVSLMVGRREGRMRGLHNST-ALGGHIFLLPYLLRISLICRTYLKLPWKPLISI 212
++S +V R+ + L++ A G +R L++PW+P I +
Sbjct: 132 ------VLSRLV--RDSDVEQLYSCELAGDGKGGGGGGAVRARWRMTGNLRVPWRPRIDL 183
Query: 213 DGSTVYELND-----ELKITRHAESWNVSALEAVGQIFTP 247
+G T + D IT + E W +SA EAV Q+ TP
Sbjct: 184 EGQTTFTFKDYGGDRGCLITAYQEEWGLSAGEAVMQLVTP 223
>gi|397643843|gb|EJK76114.1| hypothetical protein THAOC_02143 [Thalassiosira oceanica]
Length = 658
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 67/157 (42%), Gaps = 28/157 (17%)
Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVF 159
EN Y TG S +A DC F DPT+ F+G + + N++ LVP ++ G +
Sbjct: 511 ENNYLWTGNIDSSSFAADCTFTDPTLSFQGVDKFVSNVQNLVPVVDFLLEGKTDENSESV 570
Query: 160 LVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVY- 218
L+ + + E + N + + +LPW+P I + G T +
Sbjct: 571 LLDIRLNEEESYVETRWN-------------------MVGSLKRLPWRPKIDVIGRTKFW 611
Query: 219 ---ELNDELKITR-----HAESWNVSALEAVGQIFTP 247
EL D+ + R + E W + A A+ Q+ TP
Sbjct: 612 YETELRDDDSVARVQVNFYDEQWEIPAGLALMQLITP 648
>gi|428218507|ref|YP_007102972.1| hypothetical protein Pse7367_2281 [Pseudanabaena sp. PCC 7367]
gi|427990289|gb|AFY70544.1| Protein of unknown function DUF2358 [Pseudanabaena sp. PCC 7367]
Length = 154
Score = 59.3 bits (142), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 71/164 (43%), Gaps = 24/164 (14%)
Query: 80 KYEILVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRL 139
K EI ID+++ I+ +DY A F T IYAED F+DP FRG + Y + +
Sbjct: 7 KAEIQPKSIDEMIEIVIADY--AKFPEAQ-TYAIYAEDVYFKDPVYEFRGLKQYQKMIGF 63
Query: 140 LVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICR 199
+ +F ++ L IE+ + + G+ ++
Sbjct: 64 ITKWFANLNLALHTIEE-------VETSQSNPTEGVTT--------------IKTEWTMS 102
Query: 200 TYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
+PWKP IS+DG + +N + +I H + W+ + + V Q
Sbjct: 103 WNSPMPWKPRISVDGWSELGINHQGQIISHVDYWHCTKWDVVKQ 146
>gi|224005118|ref|XP_002296210.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|209586242|gb|ACI64927.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 279
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/227 (25%), Positives = 104/227 (45%), Gaps = 19/227 (8%)
Query: 31 GTVNDGNKNTNNKT--ATPQILKFAVSGVTELLRLFS--SFDKGRLDKVSYKQKY-EILV 85
T ++ +TNN + ++ +K VS +T + FS S D+ S +Q L+
Sbjct: 54 STQDNAENDTNNSSYESSASFVKGLVSSLTSVTNYFSLSSQDEPGPSSPSMQQTAIATLL 113
Query: 86 S---GIDDIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
S +++ +R+DY EN Y TG ++++DC F DPT+ F G + Y N+ LV
Sbjct: 114 SPPTSPSELLERIRADYAENNYLWTGKLDVSLFSKDCRFTDPTLSFEGIDNYVTNVGNLV 173
Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
P E+ +G + + L+ + EG + N + G + +P+ RI +I RT
Sbjct: 174 PVVEF-LLGKEQSSQSK-LLDISCNEEEGYVETRWN---MIGDLNAIPWSPRIDVIGRTK 228
Query: 202 LKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
W + + ++ L++ + E W + A A+ Q TP
Sbjct: 229 F---WYEATTNEEDKQHQA--ALQVYFYDEKWEIPAGLALLQFITPA 270
>gi|255074113|ref|XP_002500731.1| predicted protein [Micromonas sp. RCC299]
gi|226515994|gb|ACO61989.1| predicted protein [Micromonas sp. RCC299]
Length = 297
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 65/159 (40%), Gaps = 24/159 (15%)
Query: 94 ILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQN 153
I R E Y TG E+Y C F DPT+ F G E + RNL L P
Sbjct: 155 IAREFTEAKYLWTGDINPEMYDLYCTFTDPTLSFAGLETFQRNLANLQP----------- 203
Query: 154 IEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISID 213
++S +V R+ + A G +R L++PW+P I ++
Sbjct: 204 ------VLSRLV--RDSDVELYSCELAGDGKGGGGGGAVRARWRMTGNLRVPWRPRIDLE 255
Query: 214 GSTVYELND-----ELKITRHAESWNVSALEAVGQIFTP 247
G T + D IT + E W +SA EAV Q+ TP
Sbjct: 256 GQTTFTFKDYGGDRGCLITAYQEEWGLSAGEAVMQLVTP 294
>gi|282896015|ref|ZP_06304046.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281199125|gb|EFA73995.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 130
Score = 58.5 bits (140), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 65/154 (42%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+I+ L++DY+ T IYA D F+DP +FRG ELY ++ + FF +
Sbjct: 3 NIIETLKADYQRFPLDQ---TYSIYALDVYFQDPVFKFRGLELYKWMIKFIHIFFTNLRM 59
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +IE+ + M + + LPWKP
Sbjct: 60 DLHHIEQ------------DQNMIKTEWTLSWNA-------------------SLPWKPR 88
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
ISI G T LN++ I+ H + W+ S L+ + Q
Sbjct: 89 ISISGWTQLSLNNQGLISSHIDYWHCSRLDVLKQ 122
>gi|443315376|ref|ZP_21044870.1| hypothetical protein Lep6406DRAFT_00038910 [Leptolyngbya sp. PCC
6406]
gi|442785015|gb|ELR94861.1| hypothetical protein Lep6406DRAFT_00038910 [Leptolyngbya sp. PCC
6406]
Length = 137
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 64/153 (41%), Gaps = 33/153 (21%)
Query: 95 LRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNI 154
LR DY + ++YA D FEDP RF+G + Y + + + +F P++ L +
Sbjct: 16 LRQDYAQ---FPAHQSYDLYAADVQFEDPLNRFQGVQKYQKMIGFIDRWFIDPTLDLHEL 72
Query: 155 EKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDG 214
H+ + H L S + LPWKP I+I G
Sbjct: 73 -------------------ICHDPDQIQTHWTL-------SWVA----PLPWKPGIAISG 102
Query: 215 STVYELNDELKITRHAESWNVSALEAVGQIFTP 247
T Y LN E +I H + W+ S L+ + Q+ P
Sbjct: 103 WTDYRLNAEGQICAHIDHWHCSRLDVLKQVLAP 135
>gi|428211396|ref|YP_007084540.1| hypothetical protein Oscil6304_0887 [Oscillatoria acuminata PCC
6304]
gi|427999777|gb|AFY80620.1| hypothetical protein Oscil6304_0887 [Oscillatoria acuminata PCC
6304]
Length = 132
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 69/158 (43%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +Y+E F+DP FRG + Y + + + +F+ +
Sbjct: 2 DIIEILKQDYQQ---FPENQTYSLYSETVYFKDPMTEFRGCDRYRQMIGFMSTWFKQIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I R + N+ R +L T LPW+P
Sbjct: 59 DLHDI------------------RRIDNTIET-----------RWTLNWTT--PLPWQPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
I+I G + +LN E KI H + W++S L+ + Q F P
Sbjct: 88 IAIPGRSELQLNPEGKIISHIDYWDISRLDVLKQHFKP 125
>gi|427733624|ref|YP_007053168.1| hypothetical protein Riv7116_0005 [Rivularia sp. PCC 7116]
gi|427368665|gb|AFY52621.1| hypothetical protein Riv7116_0005 [Rivularia sp. PCC 7116]
Length = 129
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 62/158 (39%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIV IL+ DY+ T IYAED F+DP F G E Y ++ + +F +
Sbjct: 2 DIVEILKEDYQR---FPENQTYSIYAEDVYFKDPMNEFNGVERYKLMIKFIQTWFVQTQM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I RREG ++ LPWKP
Sbjct: 59 DLHDI------------RREGDT-------------------IKTEWTLNWNTPLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
ISI G + +N + I H + W S + + Q F+P
Sbjct: 88 ISIPGWSELGVNSDNLIVSHVDYWKCSRFDVLKQHFSP 125
>gi|282899336|ref|ZP_06307305.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281195793|gb|EFA70721.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 130
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 34/156 (21%)
Query: 88 IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
+ +I+ L++DY+ T IYA D F+DP +FRG ELY ++ + FF
Sbjct: 1 MTNIIETLKADYQRFPLEQ---TYSIYALDVYFQDPVFKFRGLELYKWMIKFIHIFFTNL 57
Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
+ L +IE+ + M ++ LPWK
Sbjct: 58 RMDLHHIEQ------------DKNM-------------------IKTEWTLSWSASLPWK 86
Query: 208 PLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
P ISI G T LN++ I+ H + W+ S L+ + Q
Sbjct: 87 PRISISGWTQLGLNNQGLISSHIDYWHCSRLDVLKQ 122
>gi|434392432|ref|YP_007127379.1| Protein of unknown function DUF2358 [Gloeocapsa sp. PCC 7428]
gi|428264273|gb|AFZ30219.1| Protein of unknown function DUF2358 [Gloeocapsa sp. PCC 7428]
Length = 129
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 62/158 (39%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIV L+ DY+ T IYA+D F+DP FRG E Y + + +F P +
Sbjct: 2 DIVETLKEDYQR---FPANQTYSIYAQDVYFKDPLNEFRGVERYKAMIGFIETWFIAPKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I RREG ++ +PWKP
Sbjct: 59 DLHDI------------RREGDT-------------------IKTEWTLSWNTPVPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
I+I G + +N + IT H + W S L+ + Q P
Sbjct: 88 IAIPGWSELRVNQQDIITSHVDYWKCSRLDVIKQHLFP 125
>gi|397569802|gb|EJK46976.1| hypothetical protein THAOC_34333 [Thalassiosira oceanica]
Length = 233
Score = 57.8 bits (138), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 41/159 (25%), Positives = 68/159 (42%), Gaps = 30/159 (18%)
Query: 88 IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
+D+ + ++ D + +YF++G IY E C F DP + F G + + NL L F
Sbjct: 95 LDETLQRIQLDNDRSYFLSGEIDEMIYDEQCTFADPFVSFDGRQRFVDNLANLGSFITDY 154
Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
S N ++ + G I + ++ + L LPWK
Sbjct: 155 SAKTIN----------------------YDVSENGKQI-------QTKVMVKLELNLPWK 185
Query: 208 PLISIDGSTVYELNDE-LKITRHAESWNVSALEAVGQIF 245
P+++ Y +++ IT H ESW+VS E + QIF
Sbjct: 186 PVLAWPWGVKYVIDEATFLITDHIESWDVSPSEGIAQIF 224
>gi|145343266|ref|XP_001416306.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576531|gb|ABO94599.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 136
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 70/165 (42%), Gaps = 38/165 (23%)
Query: 92 VTILRSDYEN-AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
+ LR D YFVTG T I+A DC F DPT RG E Y R +R L F E S
Sbjct: 1 MATLREDVATRQYFVTGELTEAIFARDCRFVDPTTDVRGLERYLRAVRAL--FDERRSS- 57
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
V L+ G +R + +T G + YLKLPW P +
Sbjct: 58 ----------VRLI-----GDLRLVDATTIEGDYR------------AEGYLKLPWNPRV 90
Query: 211 S-IDGSTVYELN------DELKITRHAESWNVSALEAVGQIFTPG 248
+G ++L D I ++W++S EA+ + FTPG
Sbjct: 91 PPYEGHITWKLRPNDGSADAGLIVEQRQTWSISGGEALRETFTPG 135
>gi|219109759|ref|XP_002176633.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411168|gb|EEC51096.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 231
Score = 57.4 bits (137), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 86/212 (40%), Gaps = 38/212 (17%)
Query: 36 GNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTIL 95
+ + N + K + GV ++F S + D +Y E S +++ L
Sbjct: 49 AQEESKNTKEVDLLTKASWYGVEVFGKVFGS----KRDSSTYAYSLEQPPSSVEETKARL 104
Query: 96 RSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF-EYPSIGLQNI 154
++D YF++G IY + C+F DP + F+G + + NL L F +Y + L
Sbjct: 105 QADNNREYFLSGEVDKCIYDDQCVFADPFVSFKGRDRFVENLVNLGSFVTKYSAKPLSYY 164
Query: 155 EKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISID- 213
E+ +V+ + + E L LPWKP+++
Sbjct: 165 EEDEAVVTKFMVKLE--------------------------------LNLPWKPVLAWPW 192
Query: 214 GSTVYELNDELKITRHAESWNVSALEAVGQIF 245
G D IT H ESW+++A E V QIF
Sbjct: 193 GVRCVIDRDTNLITLHEESWDIAAWEGVKQIF 224
>gi|427727295|ref|YP_007073532.1| hypothetical protein Nos7524_0006 [Nostoc sp. PCC 7524]
gi|427363214|gb|AFY45935.1| hypothetical protein Nos7524_0006 [Nostoc sp. PCC 7524]
Length = 125
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 63/158 (39%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL++DY+ T IYAED F+DP FRG E Y + ++ + +F
Sbjct: 2 DIIAILKADYQR---FPDHQTYNIYAEDVYFQDPLNSFRGVERYKKMIKFIQTWF----- 53
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
RM LH+ LG I + LPWKP
Sbjct: 54 ------------------LNCRM-DLHHIQQLGDTI-------KTEWTLSWNTPLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
I I G + LN + I H + W+ S L+ + Q P
Sbjct: 88 IEISGWSELGLNSDGLIVSHIDYWHCSKLDVLKQHIIP 125
>gi|119509169|ref|ZP_01628320.1| hypothetical protein N9414_05190 [Nodularia spumigena CCY9414]
gi|119466335|gb|EAW47221.1| hypothetical protein N9414_05190 [Nodularia spumigena CCY9414]
Length = 125
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 68/158 (43%), Gaps = 35/158 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T IYAE+ F+DP +FRG + Y + ++ + +F P +
Sbjct: 2 DIIEILKQDYQRFPLNQ---TYSIYAENVYFQDPLTKFRGVKRYQQMIKFIQTWFINPQM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ +I++ LG I + +PWKP
Sbjct: 59 DVHDIQR------------------------LGDTI-------KTEWTLSWNTPIPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ-IFT 246
ISI G + ++N I H + W+ S L+ + Q +FT
Sbjct: 88 ISISGWSELKINSADLIISHIDYWHCSPLDVIKQHLFT 125
>gi|414077348|ref|YP_006996666.1| hypothetical protein ANA_C12109 [Anabaena sp. 90]
gi|413970764|gb|AFW94853.1| hypothetical protein ANA_C12109 [Anabaena sp. 90]
Length = 130
Score = 57.0 bits (136), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 64/156 (41%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIV L++DY A F T IYA+D F+D +FRG ELY ++ + FF +
Sbjct: 2 DIVETLKADY--ARFPVNQ-TYSIYADDVYFQDAVFKFRGIELYKWMIKFIQTFFSNLKL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I+ ++E ++ LPWKP
Sbjct: 59 DLHTIQ----------SQQEN---------------------IKTEWTMSWNSPLPWKPH 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
IS+ G + LN + I H + W+ S L+ + Q F
Sbjct: 88 ISVSGWSELRLNADGLIVSHIDYWHCSRLDVIKQHF 123
>gi|412989019|emb|CCO15610.1| predicted protein [Bathycoccus prasinos]
Length = 187
Score = 57.0 bits (136), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 75/175 (42%), Gaps = 44/175 (25%)
Query: 86 SGIDDIVTILRSD-----------YENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYS 134
+ I++I ILR D + AYFVTG T EI+ +DC F DPT + Y
Sbjct: 43 ASIEEIKEILRRDLQPVEKNADGTFRGAYFVTGNVTPEIFEDDCRFIDPTNTTKSLSKYV 102
Query: 135 RNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRI 194
L++L E S+ L++IE + E + G+++
Sbjct: 103 NALKILFD-PENSSVDLRSIEV----------KDENTIVGIYD----------------- 134
Query: 195 SLICRTYLKLPWKPLIS-IDGSTVYELNDELKITRHAESWN-VSALEAVGQIFTP 247
C YLKLPW P + G+ ++ + I + W+ VSA A+ + FTP
Sbjct: 135 ---CEGYLKLPWHPRVKPYTGTVTWKTSKNGLIESQTQEWSGVSAAGAIMESFTP 186
>gi|428165198|gb|EKX34199.1| hypothetical protein GUITHDRAFT_90543 [Guillardia theta CCMP2712]
Length = 275
Score = 56.6 bits (135), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 75/165 (45%), Gaps = 31/165 (18%)
Query: 85 VSGIDDIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
+S + + ++R+D+ E Y+VTG + IY+ DC F+ P
Sbjct: 128 MSSAAEALEVIRNDFLERRYYVTGRLSKAIYSNDCFFDAPD------------------- 168
Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
P + ++ + K+V S + + + L T + L + L +L+
Sbjct: 169 ---PDMPVRGLRKYVDATSHLFEPKSSSIDLLSIET-VDDSAVLARWRLEATLM------ 218
Query: 204 LPWKPLI-SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
LPW+P + + G T+YELN+E ++RH E W+++A +A P
Sbjct: 219 LPWRPSVKAYTGVTLYELNEEGLVSRHTELWSITAADAFISTLFP 263
>gi|384245965|gb|EIE19457.1| hypothetical protein COCSUDRAFT_31253 [Coccomyxa subellipsoidea
C-169]
Length = 386
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 63/163 (38%), Gaps = 33/163 (20%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
D V +R DY+ YFV+G Y DC+F DP F G E + +N+ L E I
Sbjct: 102 DAVAAIRKDYDCNYFVSGRGEMSAYESDCLFADPFAGFNGVERFKKNVSNLGSLME--DI 159
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L M+ +E L R S + L LPW+P
Sbjct: 160 KLD-----------MLDWKETEE--------------ALETKWRFSAV----LSLPWRPR 190
Query: 210 ISIDGST--VYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
++ G T V+ ++ +H ESW+V V + P R
Sbjct: 191 LAASGGTTHVFSQVRGCQVIKHIESWDVEPAAVVRSLLKPSTR 233
>gi|298491076|ref|YP_003721253.1| hypothetical protein Aazo_2085 ['Nostoc azollae' 0708]
gi|298232994|gb|ADI64130.1| Protein of unknown function DUF2358 ['Nostoc azollae' 0708]
Length = 129
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 65/154 (42%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIV IL++DY+ + V ++ IYA + F+D +FRG ELY ++ + FF +
Sbjct: 2 DIVEILKADYQR-FPVNQTYS--IYAPEVYFQDAVFKFRGLELYKWMIKFIQTFFLNLKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L NIE+ +E ++ LPWKP
Sbjct: 59 DLHNIEQ-----------QEDTIKS--------------------DWTLSWNSPLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
ISI G + LN E I H + W+ S L + Q
Sbjct: 88 ISISGWSELRLNSEGLIISHIDYWHSSRLNLLKQ 121
>gi|411120874|ref|ZP_11393246.1| hypothetical protein OsccyDRAFT_4880 [Oscillatoriales
cyanobacterium JSC-12]
gi|410709543|gb|EKQ67058.1| hypothetical protein OsccyDRAFT_4880 [Oscillatoriales
cyanobacterium JSC-12]
Length = 128
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 65/157 (41%), Gaps = 34/157 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ +L++DY A F T EIYAED F+DP FRG + Y R +V F + I
Sbjct: 2 DILEVLQADY--ARFPQDQ-TYEIYAEDVYFKDPMTEFRGLDRY----RSMVKFIQTWFI 54
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+H G I R LPWKP
Sbjct: 55 NCH--------------------MDVHALQQTGNQI-------RSDWTLTWNTPLPWKPK 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFT 246
I+I G + LND+ IT H + W+ S L+ + Q F+
Sbjct: 88 ITISGWSELTLNDKGFITSHIDYWHCSRLDVLKQHFS 124
>gi|428203524|ref|YP_007082113.1| hypothetical protein Ple7327_3338 [Pleurocapsa sp. PCC 7327]
gi|427980956|gb|AFY78556.1| hypothetical protein Ple7327_3338 [Pleurocapsa sp. PCC 7327]
Length = 126
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 67/158 (42%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T IYAE+ F+DP FRG Y + + +F+ +
Sbjct: 2 DIIEILKDDYQR---FPAHQTYSIYAENVYFKDPLNEFRGIARYKEMIGFMSNWFQDIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I RREG +H L + P +PWKP
Sbjct: 59 ELHDI------------RREG--DTIHTEWTLN---WTTP--------------IPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
I+I G + +L+++ I H + W+ S L+ + Q P
Sbjct: 88 ITIPGRSELKLDEQEMIVSHVDYWHCSRLDVIKQHLFP 125
>gi|145346535|ref|XP_001417742.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577970|gb|ABO96035.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 99
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 55/130 (42%), Gaps = 31/130 (23%)
Query: 114 YAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGRMR 173
YA DC F DP + FRG + + N+ N+ + V L + E
Sbjct: 1 YAGDCEFADPFVSFRGLDRFRANV--------------ANLGGMMREVDLQIKSFEETED 46
Query: 174 GLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESW 233
G+ R S + L LPW+P+++ G T + LNDE ++ RH E W
Sbjct: 47 GVQTE-------------WRFSCV----LDLPWRPMLAASGGTTHVLNDENRVVRHYERW 89
Query: 234 NVSALEAVGQ 243
+V + +GQ
Sbjct: 90 DVDPKKVLGQ 99
>gi|219109561|ref|XP_002176535.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411070|gb|EEC50998.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 332
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/170 (22%), Positives = 75/170 (44%), Gaps = 33/170 (19%)
Query: 89 DDIVTILRSDYE----NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF 144
+D++T+L++D++ Y++TG + +Y +DC+F+ P
Sbjct: 176 EDLMTVLQADWQTDTNKGYYITGRLNTAVYRDDCLFDGPD-------------------- 215
Query: 145 EYPSIGLQNIEKFVFLVSLMVGRREGRMRGL------HNSTALGGHIFLLPYLLRISLIC 198
P + ++ + K++ S + R R L + + + LP +
Sbjct: 216 --PDMPVRGLRKYLNAASQLFDPRTSRATLLSLEEEKEDGVLVTNDVDKLPTRIVAKWEM 273
Query: 199 RTYLKLPWKP-LISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
R L+LPWKP L + G+T Y + + I +H E+W++S EA + P
Sbjct: 274 RGILRLPWKPSLPTWTGTTTYVRDADGLIYQHLETWDMSVGEAFLRTLAP 323
>gi|428176663|gb|EKX45546.1| hypothetical protein GUITHDRAFT_71148 [Guillardia theta CCMP2712]
Length = 122
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 31/136 (22%)
Query: 101 NAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFL 160
+ Y+VTG T IY +DC F+DPT RG +++++
Sbjct: 13 SQYYVTGNLTRSIYRDDCRFKDPTTDVRG------------------------LKRYILA 48
Query: 161 VSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS-IDGSTVYE 219
V + + L++ T + + L + L+ S LKLPWKP I DG+T Y
Sbjct: 49 VQNLFADPKDSKVTLYSVTPIEDNKILAKWSLQGS------LKLPWKPYIEPYDGTTTYW 102
Query: 220 LNDELKITRHAESWNV 235
+++E I H E+W V
Sbjct: 103 IDNEGLIEFHDETWLV 118
>gi|443324513|ref|ZP_21053261.1| hypothetical protein Xen7305DRAFT_00049690 [Xenococcus sp. PCC
7305]
gi|442795887|gb|ELS05226.1| hypothetical protein Xen7305DRAFT_00049690 [Xenococcus sp. PCC
7305]
Length = 134
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
D+V +++ DY+ T IYAED FEDP +F+G Y + + L FF +I
Sbjct: 9 DLVAVIKQDYQQ---FPKDQTYSIYAEDVYFEDPLNKFQGIGRYQKMIGFLGRFFG--NI 63
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L+ LHN T +++ + LPWKPL
Sbjct: 64 DLE----------------------LHNITQDKN-------IIKTEWTLKMTSPLPWKPL 94
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
+ I G + E+N I H + WN++ ++ + Q F
Sbjct: 95 LVIPGWSELEINQNNLIVAHRDYWNIAPIKVLLQNF 130
>gi|147832280|emb|CAN73278.1| hypothetical protein VITISV_040607 [Vitis vinifera]
Length = 215
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 81/211 (38%), Gaps = 42/211 (19%)
Query: 38 KNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRLDKVSYKQKYEILVSGIDDIVTILRS 97
KN N +T ++L S +EL FS R+ K+ + + G D ++ +
Sbjct: 35 KNPNPRTRCSRLLGTGRS-CSELPLRFSGHR--RVSKLPKELLTSLETVGAVDRAALVET 91
Query: 98 DYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKF 157
E+ F Y +DC F DP FRG + RN + ++ L E F
Sbjct: 92 IKED-------FQRSAYEDDCEFADPAGSFRGLRRFKRNCTNFGSLIQKSNMKLMKWEDF 144
Query: 158 VFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTV 217
+G+ GH R S + L PWKP++S G T
Sbjct: 145 -------------EDKGI-------GH-------WRFSCV----LSFPWKPILSATGYTE 173
Query: 218 YELNDEL-KITRHAESWNVSALEAVGQIFTP 247
Y + + K+ RH E WNV + + QI P
Sbjct: 174 YYFDSQSGKVCRHVEHWNVPKMALLKQILRP 204
>gi|428319231|ref|YP_007117113.1| Protein of unknown function DUF2358 [Oscillatoria nigro-viridis PCC
7112]
gi|428242911|gb|AFZ08697.1| Protein of unknown function DUF2358 [Oscillatoria nigro-viridis PCC
7112]
Length = 130
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/162 (27%), Positives = 65/162 (40%), Gaps = 42/162 (25%)
Query: 90 DIVTILRSDY----ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE 145
DI+ IL+ DY EN T IYA+D FEDP RF G + Y + + + FF+
Sbjct: 2 DIIEILKDDYQRFPENQ-------TYNIYAQDVYFEDPVNRFTGVDRYRKMIGFMGTFFQ 54
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
I L LH + G I R LP
Sbjct: 55 --DINLD----------------------LHGISQSGDTI-------DTRWTLRWIAPLP 83
Query: 206 WKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
WKP ++I G + ++N + I H + W+ S L+ + Q+ P
Sbjct: 84 WKPKMAIAGRSELKVNSDGLIVSHIDYWDCSRLDVLKQLVFP 125
>gi|113473951|ref|YP_720012.1| hypothetical protein Tery_0014 [Trichodesmium erythraeum IMS101]
gi|110164999|gb|ABG49539.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 130
Score = 54.7 bits (130), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 68/156 (43%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI++ L+ DY+ T IYAE+ FEDP +FRG E Y + + + +F +
Sbjct: 2 DIISRLKQDYQK---FPADQTYSIYAENVYFEDPLNKFRGIERYQKMIGFMNNWFNDIKL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I + G++ + L + +PWKP
Sbjct: 59 DLHDISQ-------------------------SGNVIKTRWTLSWT------APVPWKPP 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
++I G + +LN + I H + W++S L+ + Q+F
Sbjct: 88 MAIPGWSELKLNADGLINSHIDYWDISRLDVLKQLF 123
>gi|86609849|ref|YP_478611.1| hypothetical protein CYB_2413 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86558391|gb|ABD03348.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 142
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 34/160 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
D+V ILR+DY A F T IY F DP FRG + Y R + + + + +
Sbjct: 12 DLVAILRADY--ARFPRDQ-TYAIYDPQVYFRDPLNEFRGLDRYRRMIEGIGRWLQDIRL 68
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
LQ+I + G I R I L LPW+P
Sbjct: 69 DLQDIRQ------------------------TGNQI-------RTEWILSGALALPWRPR 97
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGD 249
+ I G T ++N+ I H + W S + + Q+F PGD
Sbjct: 98 LCIPGWTEMQINETGLIGSHVDYWRCSRRQVLLQMFRPGD 137
>gi|409990703|ref|ZP_11274042.1| hypothetical protein APPUASWS_07005 [Arthrospira platensis str.
Paraca]
gi|291570101|dbj|BAI92373.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938430|gb|EKN79755.1| hypothetical protein APPUASWS_07005 [Arthrospira platensis str.
Paraca]
Length = 134
Score = 54.3 bits (129), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/156 (25%), Positives = 63/156 (40%), Gaps = 33/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
D++ ++ DY+N T IYAE+ F+DP F G + Y + + + +F+ +
Sbjct: 5 DLIETIKQDYQN---FPDNQTYSIYAENVYFKDPVNEFSGCDRYQKMINFMATWFQDIQL 61
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I + G + L ++ LPW P
Sbjct: 62 DLHDISE-------------------------SGDTIETQWTLSWTVSV-----LPWAPR 91
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
+SI G + +N E I RH + WN S L V Q+F
Sbjct: 92 LSIPGYSHLHINPEGLIDRHIDYWNCSRLAVVKQLF 127
>gi|218248348|ref|YP_002373719.1| hypothetical protein PCC8801_3601 [Cyanothece sp. PCC 8801]
gi|218168826|gb|ACK67563.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
Length = 132
Score = 53.9 bits (128), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 66/156 (42%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+I+ I++ DY+N F T T IYAED F+DP FRG + Y ++ + +F+ +
Sbjct: 2 NILEIIKQDYQN--FPTNP-TYSIYAEDVYFKDPMTEFRGIKRYKSMIQFMATWFKEIKL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I + +H L ++ P LPWKP
Sbjct: 59 DLHKIYQ--------------SKDTIHAEWTLH---WITP--------------LPWKPP 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
I+I G + +N + I H + WN S + + Q F
Sbjct: 88 IAIAGRSELVINSQNLIISHIDYWNCSRWDVLKQHF 123
>gi|332711323|ref|ZP_08431255.1| uncharacterized conserved protein, DUF2358 [Moorea producens 3L]
gi|332349872|gb|EGJ29480.1| uncharacterized conserved protein, DUF2358 [Moorea producens 3L]
Length = 146
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 63/155 (40%), Gaps = 34/155 (21%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
I I++ DY+ T IYA D F+DP +FRG E Y + + + +F P +
Sbjct: 21 IAQIIQQDYQR---FPADQTYSIYASDVYFKDPLNQFRGLERYKQMIAFINRWFLEPHLD 77
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
LQNI + G + L + LPWKP +
Sbjct: 78 LQNISQ-------------------------SGDTIKTRWTLSWT------TPLPWKPRM 106
Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
+I G + +LN E I H + W+ S L+ + Q+
Sbjct: 107 TIPGWSELKLNAEGLIVSHIDYWDCSRLDVLKQLL 141
>gi|170077270|ref|YP_001733908.1| hypothetical protein SYNPCC7002_A0647 [Synechococcus sp. PCC 7002]
gi|169884939|gb|ACA98652.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 127
Score = 53.9 bits (128), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 68/159 (42%), Gaps = 34/159 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
D++ ILR DY+ T IY++D F+DP +FRG + Y + + L +F +
Sbjct: 2 DLLDILRQDYQK---FPQNQTFAIYSDDVFFQDPLNQFRGRDRYQKMIGFLGRWFRDIHL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L ++++ + R E + S+ C LPW+P
Sbjct: 59 ELHDLQQ-----TQQTIRSEWTL----------------------SMTC----PLPWQPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
+ I G ++ E+N + I H + W L+ + Q+F P
Sbjct: 88 LRISGHSLLEINADNLIVSHIDYWQKPPLKILLQVFQPS 126
>gi|334117385|ref|ZP_08491476.1| Protein of unknown function DUF2358 [Microcoleus vaginatus FGP-2]
gi|333460494|gb|EGK89102.1| Protein of unknown function DUF2358 [Microcoleus vaginatus FGP-2]
Length = 130
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 66/158 (41%), Gaps = 34/158 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T IYA+D FEDP RF G + Y + + + FF+ ++
Sbjct: 2 DIIEILKDDYQR---FPDNQTYSIYAKDVYFEDPVNRFTGVDRYRKMIGFMGTFFQDVNL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I + G + L S I LPWKP
Sbjct: 59 DLHGISQ-------------------------SGDTIETRWTL--SWIA----PLPWKPK 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
++I G + ++N + I H + W+ S L+ + Q+ P
Sbjct: 88 MAIAGRSELKVNSDGLIVSHIDYWDCSRLDVLKQLVFP 125
>gi|427716868|ref|YP_007064862.1| hypothetical protein Cal7507_1569 [Calothrix sp. PCC 7507]
gi|427349304|gb|AFY32028.1| Protein of unknown function DUF2358 [Calothrix sp. PCC 7507]
Length = 133
Score = 53.5 bits (127), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 62/159 (38%), Gaps = 34/159 (21%)
Query: 89 DDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPS 148
++IV IL+ DY+ T IYA D F+DP +FRG E Y + + + +F P
Sbjct: 3 NNIVEILKEDYQRFPINQ---TYSIYATDVYFQDPLNKFRGVERYKQTINFIQTWFLNPK 59
Query: 149 IGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKP 208
+ L +I+ LG I + LPWKP
Sbjct: 60 MDLHDIQ------------------------LLGDTI-------KTEWTLSWNTPLPWKP 88
Query: 209 LISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
ISI G + LN I H + W+ S + + Q P
Sbjct: 89 RISISGWSELRLNALGLIVSHIDYWHCSRFDVLKQHLFP 127
>gi|303290268|ref|XP_003064421.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454019|gb|EEH51326.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 299
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 63/159 (39%), Gaps = 17/159 (10%)
Query: 94 ILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQN 153
I R E Y TG E+Y C F DPT+ F G + + NL+ L P + N
Sbjct: 150 IAREFTEAKYLWTGDINPEMYDLFCTFTDPTLSFAGLQTFETNLKNLQPVLRR-LVRNSN 208
Query: 154 IEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISID 213
+E + + + + GG +R S L LPW+P I +
Sbjct: 209 VELYSCEIV-----------DDGSGGSGGGSGGGGGGSVRASWRMTGDLNLPWRPRIDLT 257
Query: 214 GSTVYELNDE-----LKITRHAESWNVSALEAVGQIFTP 247
G T + D I + E W +SAL+AV Q+ P
Sbjct: 258 GQTTFGFKDAGVERGCLIVSYLEEWGISALDAVAQLVAP 296
>gi|443309241|ref|ZP_21038980.1| hypothetical protein Syn7509DRAFT_00046060 [Synechocystis sp. PCC
7509]
gi|442780710|gb|ELR90864.1| hypothetical protein Syn7509DRAFT_00046060 [Synechocystis sp. PCC
7509]
Length = 128
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 64/156 (41%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIV IL DY + V + +YA+D F+DP +FRG Y + + FF P +
Sbjct: 2 DIVEILVDDYRR-FPVDQNYN--LYAQDVFFQDPLNKFRGINKYKAMIGFINTFFIEPKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I + +G I ++ + LPWKP
Sbjct: 59 DLLAINQ------------------------VGDTI-------KMEWVLSWNTPLPWKPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
I+I G + ++N + I H + WN S L+ + Q F
Sbjct: 88 IAIPGQSELKINADGLIASHVDYWNCSRLDVIRQHF 123
>gi|75910467|ref|YP_324763.1| hypothetical protein Ava_4269 [Anabaena variabilis ATCC 29413]
gi|75704192|gb|ABA23868.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 128
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 66/155 (42%), Gaps = 34/155 (21%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
I+ IL+ DY+ + V ++ IYAED F+DP +FRG Y + + + +F +
Sbjct: 3 IIEILKQDYQR-FPVNQTYS--IYAEDVYFQDPLNKFRGITRYKQMINFMQTWFLNIKMD 59
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
L +I+ E +++ +L T LPWKP I
Sbjct: 60 LHDIQHL-----------EDKIKT------------------EWTLSWNT--PLPWKPRI 88
Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
SI G + LN E I H + W S L+ + Q F
Sbjct: 89 SISGWSELGLNCEGLIVSHIDYWQCSPLDVIKQHF 123
>gi|308800990|ref|XP_003075276.1| unnamed protein product [Ostreococcus tauri]
gi|116061830|emb|CAL52548.1| unnamed protein product [Ostreococcus tauri]
Length = 228
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 68/162 (41%), Gaps = 33/162 (20%)
Query: 93 TILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQ 152
T+ R AYFVTG T +AEDC F DPT +G Y L L F+ ++
Sbjct: 91 TLARDVGVEAYFVTGRLTEAAFAEDCRFVDPTTDVKGLRRY---LDALGALFDPQRSSVE 147
Query: 153 NIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS- 211
++G E R TA GG Y L YL+LPW+P +
Sbjct: 148 -----------LLGELEAR------ETADGGGEIRGTYAL------EGYLRLPWRPKVPR 184
Query: 212 IDGSTVYEL--NDELK----ITRHAESWNVSALEAVGQIFTP 247
+G + + ND I ++W+VS +A+ + FTP
Sbjct: 185 YEGRITWTVAPNDGGANAGLIVEQRQTWSVSGFDALRETFTP 226
>gi|37522054|ref|NP_925431.1| hypothetical protein glr2485 [Gloeobacter violaceus PCC 7421]
gi|35213053|dbj|BAC90426.1| glr2485 [Gloeobacter violaceus PCC 7421]
Length = 141
Score = 52.0 bits (123), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL++DY+ T IYA D F+DP FRG E Y R + + +F+ +
Sbjct: 6 DILEILKADYQR---FPEDQTYAIYANDVYFKDPLNSFRGIERYRRMIGWMHRWFQPIRL 62
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +IE R++ R + + LPWKP
Sbjct: 63 ELHSIE-----------RQQTR--------------------IVTTWTLSWQAPLPWKPH 91
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I IDG + +L+ + +I H + W S + Q
Sbjct: 92 IVIDGWSELDLDTQGQIAAHTDHWRCSPNDVAWQ 125
>gi|254413402|ref|ZP_05027173.1| hypothetical protein MC7420_889 [Coleofasciculus chthonoplastes PCC
7420]
gi|196180022|gb|EDX75015.1| hypothetical protein MC7420_889 [Coleofasciculus chthonoplastes PCC
7420]
Length = 127
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 67/156 (42%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ + V ++ IYA F+DP FRG Y + + + +F P +
Sbjct: 5 DIIEILKQDYQR-FPVDQTYS--IYANTVFFKDPLNEFRGLNRYKQMIGFIQTWFINPQL 61
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L H+ + G I R +L T LPWKP
Sbjct: 62 DL------------------------HDISQSGDTI-----KTRWTLSWTT--PLPWKPR 90
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
ISI G + LN + I+ H + W++ L+ + Q+F
Sbjct: 91 ISIPGWSELRLNADGLISSHIDYWDIPRLDVLKQLF 126
>gi|56750302|ref|YP_171003.1| hypothetical protein syc0293_c [Synechococcus elongatus PCC 6301]
gi|56685261|dbj|BAD78483.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 137
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 65/160 (40%), Gaps = 34/160 (21%)
Query: 84 LVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
+ +DD++ LR DY A F + E+Y D F+DP RF+G + Y + + + +
Sbjct: 1 MAIALDDLLETLRQDY--ARFPKDQ-SFEVYDPDAFFQDPLTRFQGRDRYQKMIAFIDRW 57
Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
F P + L +I G+ + L + P+
Sbjct: 58 FLDPELTLHDI--------------HATETGIESRWTL---TWTSPW------------- 87
Query: 204 LPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
PW+P I G T++EL E I+ H + W+ S + Q
Sbjct: 88 -PWRPRSQISGRTLFELTAEGTISSHRDYWDCSPWAVLRQ 126
>gi|428311429|ref|YP_007122406.1| hypothetical protein Mic7113_3262 [Microcoleus sp. PCC 7113]
gi|428253041|gb|AFZ19000.1| hypothetical protein Mic7113_3262 [Microcoleus sp. PCC 7113]
Length = 129
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 67/162 (41%), Gaps = 34/162 (20%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T IYA D F+DP +F+G E Y + + + +F P +
Sbjct: 2 DILQILKEDYQR---FPADQTYSIYAPDVFFKDPLNQFQGIERYKQMIGFINTWFGAPKL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I R E ++ R +L T LPW+P
Sbjct: 59 DLHEIH-----------RSEDTIKT------------------RWTLSWTT--PLPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDRN 251
ISI G + +LN + I H + W+ S + + Q ++N
Sbjct: 88 ISIPGWSELKLNVDELIISHIDYWDCSRWDVLKQHLPFQEKN 129
>gi|218439252|ref|YP_002377581.1| hypothetical protein PCC7424_2291 [Cyanothece sp. PCC 7424]
gi|218171980|gb|ACK70713.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 125
Score = 51.2 bits (121), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 64/154 (41%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ + V T EIYA + F+DP FRG + Y + + +F+ +
Sbjct: 2 DIIEILKDDYQR-FPVNQ--TYEIYAPNVYFKDPVNEFRGIKRYQEMITFMGTWFKEIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I + ++ LH +T +PW+P
Sbjct: 59 DLHDIHQKENIIYT--------QWTLHWTTP-----------------------VPWQPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
ISI G + L+D I H + WN S + V Q
Sbjct: 88 ISIPGRSELTLDDNHLIVSHVDYWNCSRWDVVKQ 121
>gi|257060326|ref|YP_003138214.1| hypothetical protein Cyan8802_2513 [Cyanothece sp. PCC 8802]
gi|256590492|gb|ACV01379.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 132
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 66/156 (42%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+I+ I++ DY+N + V ++ IYAED F+DP FRG + Y + + +F+ +
Sbjct: 2 NILEIIQQDYQN-FPVNPTYS--IYAEDVYFKDPMTEFRGIKRYKSMIEFMATWFKAIKL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I + +H L ++ P LPWKP
Sbjct: 59 DLHKIYQ--------------SQDTIHAEWTLH---WITP--------------LPWKPP 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
I+I G + +N + I H + WN S + + Q F
Sbjct: 88 IAIAGRSELVINAQNLIISHIDYWNCSRWDVLKQHF 123
>gi|81300067|ref|YP_400275.1| hypothetical protein Synpcc7942_1258 [Synechococcus elongatus PCC
7942]
gi|81168948|gb|ABB57288.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 137
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/160 (24%), Positives = 65/160 (40%), Gaps = 34/160 (21%)
Query: 84 LVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
+ +DD++ LR DY A F + E+Y D F+DP RF+G + Y + + + +
Sbjct: 1 MAIALDDLLETLRQDY--ARFPKDQ-SFEVYDPDVFFQDPLTRFQGRDRYQKMIAFIDRW 57
Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
F P + L +I G+ + L + P+
Sbjct: 58 FLDPELTLHDI--------------HATETGIESRWTL---TWTSPW------------- 87
Query: 204 LPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
PW+P I G T++EL E I+ H + W+ S + Q
Sbjct: 88 -PWRPRSQISGRTLFELTAEGTISSHRDYWDCSPWAVLRQ 126
>gi|219123757|ref|XP_002182185.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406146|gb|EEC46086.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 457
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 76/169 (44%), Gaps = 33/169 (19%)
Query: 88 IDDIVTILRSDYEN------AYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLV 141
ID + L +D N +Y VTG I+ +DC+F+DPT R Y + L +L
Sbjct: 114 IDALAQKLTNDLANGATGRGSYIVTGDLNPRIFRDDCVFQDPTNRVASLRQYRQALTIL- 172
Query: 142 PFFEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTY 201
F+ ++ +E +VS +G +R LR R Y
Sbjct: 173 --FDPERSVVEVLEPVRRIVS---DTDDGSIR------------------LRGRFRQRGY 209
Query: 202 LK-LPWKPLISIDGSTV-YELN-DELKITRHAESWNVSALEAVGQIFTP 247
L+ LPW+P ++ ST+ Y ++ D +I +++W SA A+ + FTP
Sbjct: 210 LQFLPWQPYVTAYESTITYRIDPDTGRIAEQSQTWTKSANRALRESFTP 258
>gi|209527250|ref|ZP_03275761.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376003800|ref|ZP_09781603.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423066117|ref|ZP_17054907.1| hypothetical protein SPLC1_S410140 [Arthrospira platensis C1]
gi|209492317|gb|EDZ92661.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375327831|emb|CCE17356.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406712159|gb|EKD07348.1| hypothetical protein SPLC1_S410140 [Arthrospira platensis C1]
Length = 134
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 62/155 (40%), Gaps = 33/155 (21%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
++ ++ DY+N T IYA++ F+DP F G + Y + + + +F+ +
Sbjct: 6 LIETIKQDYQN---FPDNQTYSIYADNVYFKDPVNEFSGCDRYQKMINFMATWFQDIQLD 62
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
L +I + G + L ++ LPW P +
Sbjct: 63 LHDISE-------------------------SGDTIETQWTLSWTVSV-----LPWAPRL 92
Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
SI G + +N E I RH + WN S L V Q+F
Sbjct: 93 SIPGYSHLHINPEGLIDRHIDYWNCSRLAVVKQLF 127
>gi|440753389|ref|ZP_20932592.1| hypothetical protein O53_1767 [Microcystis aeruginosa TAIHU98]
gi|440177882|gb|ELP57155.1| hypothetical protein O53_1767 [Microcystis aeruginosa TAIHU98]
Length = 128
Score = 50.8 bits (120), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 64/154 (41%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I GS+ L+D I H + W+ S + + Q
Sbjct: 88 IAISGSSQLTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|427711855|ref|YP_007060479.1| hypothetical protein Syn6312_0717 [Synechococcus sp. PCC 6312]
gi|427375984|gb|AFY59936.1| hypothetical protein Syn6312_0717 [Synechococcus sp. PCC 6312]
Length = 125
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 60/156 (38%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+++ +++ DY + EIYA D F+DP FRG + Y + + + +F +
Sbjct: 2 NLIEVVKQDYSR---FPQAQSYEIYASDVFFKDPVYEFRGIDKYQKMIGFITYWFSNLKL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I H A+ + LPW+P
Sbjct: 59 ELHDI---------------------HQKDAI----------IHTQWTMSWNAPLPWQPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
IS+ G + E+N E I H + W S L+ V Q F
Sbjct: 88 ISVTGRSELEVNAEGLIISHIDYWQCSRLDVVKQHF 123
>gi|428224997|ref|YP_007109094.1| hypothetical protein GEI7407_1551 [Geitlerinema sp. PCC 7407]
gi|427984898|gb|AFY66042.1| Protein of unknown function DUF2358 [Geitlerinema sp. PCC 7407]
Length = 134
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 65/154 (42%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIVT+LR DY++ E+Y F+DP FRG + + + + L+ +F +
Sbjct: 8 DIVTLLREDYQH---FPRDQRYELYDPGVYFKDPLSEFRGIKRFRQMVGLMERWFLNARM 64
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I++ L++ R +L T LPWKP
Sbjct: 65 DLHDIQQRDRLITT-----------------------------RWTLSWNT--PLPWKPR 93
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + LNDE I H + W+ S + V Q
Sbjct: 94 IAISGRSELTLNDEGLIVSHIDWWDCSIWDVVQQ 127
>gi|116309685|emb|CAH66733.1| H0404F02.9 [Oryza sativa Indica Group]
Length = 247
Score = 50.8 bits (120), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 72/191 (37%), Gaps = 37/191 (19%)
Query: 33 VNDGNKNTNNKTATPQILKFAVSGVTELLRLFSSFDKGRL-----DKVSYKQKYEILVSG 87
+ G N + ++ +L+ A G +ELL + +SF + D ++ G
Sbjct: 44 ASPGAGNETSSSSENAVLRAAWYG-SELLGIAASFFRPSQPPTEGDGAGAVEEAASEPQG 102
Query: 88 IDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYP 147
+ ++ D+ +YFVTG T + Y EDC F DP F G + + RN E
Sbjct: 103 RAQVAEAVKDDFARSYFVTGNLTLKAYEEDCEFADPAGSFNGLQRFKRNCTNFGSLLEKA 162
Query: 148 SIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWK 207
++ L E L +G R S + + PW+
Sbjct: 163 NMKLTKWED---LEDKSIGH------------------------WRFSCV----MSFPWR 191
Query: 208 PLISIDGSTVY 218
P++S G T Y
Sbjct: 192 PILSATGYTEY 202
>gi|427419623|ref|ZP_18909806.1| hypothetical protein Lepto7375DRAFT_5471 [Leptolyngbya sp. PCC
7375]
gi|425762336|gb|EKV03189.1| hypothetical protein Lepto7375DRAFT_5471 [Leptolyngbya sp. PCC
7375]
Length = 126
Score = 50.4 bits (119), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 55/136 (40%), Gaps = 30/136 (22%)
Query: 112 EIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGR 171
+IYA+D F+DP F+G E Y R + + +F +NIE
Sbjct: 21 QIYAKDVYFKDPMSEFQGIERYQRMIGFIAQWF-------RNIE---------------- 57
Query: 172 MRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAE 231
LH + + + L + PW+P ISI G + ++N E I H +
Sbjct: 58 -LELHRVEQPQANTVITRWTLHFTA------PTPWQPRISIPGWSELQINTEGLICAHID 110
Query: 232 SWNVSALEAVGQIFTP 247
WN S + Q+F P
Sbjct: 111 YWNCSRWDVFKQLFAP 126
>gi|425469266|ref|ZP_18848216.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389882034|emb|CCI37462.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 128
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA+D F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADDVYFQDPLNQFRGIKRYREMIGFMGQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSQLTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|219130148|ref|XP_002185234.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403413|gb|EEC43366.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 286
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 73/172 (42%), Gaps = 27/172 (15%)
Query: 90 DIVTILRSDY-ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFE--- 145
D++ +R+DY + Y TG + C F DPT+ F GT + RN+R L P +
Sbjct: 118 DLLERIRADYVDKNYLWTGDIDLACFDPHCRFTDPTLSFEGTATFVRNIRNLRPIVDALV 177
Query: 146 YPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLP 205
P G E L + +R+ +++S G Y+ + +LP
Sbjct: 178 VPPDGHCRSEL------LEIEQRDA--VDVNDSDGQG-------YIETRWNMVGVLNQLP 222
Query: 206 WKPLISIDGSTVY--------ELNDELKITRHAESWNVSALEAVGQIFTPGD 249
W+P I + G T + + L + R+ E W + A A+ Q+ TP D
Sbjct: 223 WRPKIDVIGRTQFWYRPCRDNRDDAALHVYRYDEEWEIPAGRALLQLVTPAD 274
>gi|428181968|gb|EKX50830.1| hypothetical protein GUITHDRAFT_103420 [Guillardia theta CCMP2712]
Length = 139
Score = 50.1 bits (118), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 56/125 (44%), Gaps = 28/125 (22%)
Query: 87 GIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
G +D++ L+ +YE YF++G +Y DC+F DP FRG E + RNL
Sbjct: 38 GNEDLLQRLQLEYEKNYFLSGDIDVSLYHPDCLFADPFASFRGRERFRRNLS-------- 89
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
++GL + L+SL + P + R ++ + L LPW
Sbjct: 90 -NLGLFVLSYRCRLLSLAQDPVD-------------------PNVFRSCVMVKLQLNLPW 129
Query: 207 KPLIS 211
+P+++
Sbjct: 130 RPVLA 134
>gi|434398676|ref|YP_007132680.1| Protein of unknown function DUF2358 [Stanieria cyanosphaera PCC
7437]
gi|428269773|gb|AFZ35714.1| Protein of unknown function DUF2358 [Stanieria cyanosphaera PCC
7437]
Length = 131
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 60/154 (38%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+I+ I++ DY+N T IYA+D F+DP FRG + Y + L FF ++
Sbjct: 2 EIIEIIKQDYQN---FPKNQTYSIYADDVYFKDPLNEFRGIQRYQTMIGFLGNFFREINL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L +I L L+ LPW+P
Sbjct: 59 DLHDIS-------------------------------LQENCLKTEWTLHLTSPLPWQPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
+SI G + +LN I H + W++S E + Q
Sbjct: 88 LSIPGWSELKLNQNGLIIAHIDYWHISPWEVLKQ 121
>gi|220906651|ref|YP_002481962.1| hypothetical protein Cyan7425_1222 [Cyanothece sp. PCC 7425]
gi|219863262|gb|ACL43601.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length = 125
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 62/156 (39%), Gaps = 34/156 (21%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DIV+ILR DY+ T EIYA D F DP +FRG + Y
Sbjct: 2 DIVSILRQDYQR---FPQDQTYEIYAADVYFRDPLTQFRGIKRY---------------- 42
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
QN+ +F+ + + R LH I L I+ LPW P
Sbjct: 43 --QNMIQFI--QTWFINTR----MELHEIQQQEQQITTRWTLSWIA-------PLPWHPQ 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
+SI G + LN I H + W+ S E + Q+F
Sbjct: 88 LSISGRSELTLNQAGLIVSHLDYWDCSRWEVLQQLF 123
>gi|3702337|gb|AAC62894.1| unknown protein [Arabidopsis thaliana]
Length = 286
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 34/67 (50%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
+V ++ D++ +YFVTG T E+Y E C F DP F+G + RN E ++
Sbjct: 105 VVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLARFKRNCTNFGSLIEKSNMK 164
Query: 151 LQNIEKF 157
L E F
Sbjct: 165 LMKWENF 171
>gi|428775383|ref|YP_007167170.1| hypothetical protein PCC7418_0737 [Halothece sp. PCC 7418]
gi|428689662|gb|AFZ42956.1| Protein of unknown function DUF2358 [Halothece sp. PCC 7418]
Length = 125
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/135 (25%), Positives = 56/135 (41%), Gaps = 31/135 (22%)
Query: 112 EIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGR 171
+IYAED F+DP FRG + Y + + +F+ ++ L +I R+E
Sbjct: 21 DIYAEDVYFKDPLTEFRGLQRYQSLIDFISTWFQDINLELHDI------------RQEDN 68
Query: 172 MRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAE 231
++ LPW+P ISI G + ++N + KI H +
Sbjct: 69 T-------------------IQTEWTLNWTTPLPWQPRISIPGWSELKVNADHKIISHID 109
Query: 232 SWNVSALEAVGQIFT 246
W+ S E + Q F+
Sbjct: 110 YWHCSRWEVIKQHFS 124
>gi|17230193|ref|NP_486741.1| hypothetical protein all2701 [Nostoc sp. PCC 7120]
gi|17131794|dbj|BAB74400.1| all2701 [Nostoc sp. PCC 7120]
Length = 128
Score = 49.3 bits (116), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 65/155 (41%), Gaps = 34/155 (21%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
I+ IL+ DY+ + V ++ IYAED F+DP +FRG Y + + + +F +
Sbjct: 3 IIEILKQDYQR-FPVNQTYS--IYAEDVYFQDPLNKFRGITRYKQMINFMQTWFLNIKMD 59
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
L +I+ E +++ +L T +PWKP I
Sbjct: 60 LHDIQHL-----------EDKIKT------------------EWTLSWNT--PVPWKPRI 88
Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
SI G + LN + I H + W S + + Q F
Sbjct: 89 SISGWSELGLNSKGLIVSHIDYWQCSPFDVIKQHF 123
>gi|425463533|ref|ZP_18842863.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389831546|emb|CCI25628.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 128
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQKFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSHWDVLRQ 121
>gi|186508518|ref|NP_001118532.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
gi|330255550|gb|AEC10644.1| Nuclear transport factor 2 (NTF2) family protein [Arabidopsis
thaliana]
Length = 229
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 34/67 (50%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
+V ++ D++ +YFVTG T E+Y E C F DP F+G + RN E ++
Sbjct: 105 VVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLARFKRNCTNFGSLIEKSNMK 164
Query: 151 LQNIEKF 157
L E F
Sbjct: 165 LMKWENF 171
>gi|67923125|ref|ZP_00516615.1| hypothetical protein CwatDRAFT_3436 [Crocosphaera watsonii WH 8501]
gi|416395382|ref|ZP_11686323.1| hypothetical protein CWATWH0003_3119 [Crocosphaera watsonii WH
0003]
gi|67855023|gb|EAM50292.1| hypothetical protein CwatDRAFT_3436 [Crocosphaera watsonii WH 8501]
gi|357263116|gb|EHJ12162.1| hypothetical protein CWATWH0003_3119 [Crocosphaera watsonii WH
0003]
Length = 128
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 55/139 (39%), Gaps = 31/139 (22%)
Query: 110 TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
T IYA++ F+DP F G + Y ++ + +F+ + L NI + ++
Sbjct: 19 TYSIYADNVYFKDPMTEFTGVKRYQEMIQFMSTWFKNIDLELHNIYQSDNII-------- 70
Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
LH +T L PWKP ISI G + N++ I H
Sbjct: 71 HTQWTLHWTTPL-----------------------PWKPPISIPGRSEMTFNEQNLIISH 107
Query: 230 AESWNVSALEAVGQIFTPG 248
+ WN S + + Q F P
Sbjct: 108 IDYWNCSRWDVLKQHFVPN 126
>gi|422301888|ref|ZP_16389253.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389789025|emb|CCI14979.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 128
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQKFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|390440334|ref|ZP_10228673.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389836261|emb|CCI32799.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 128
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 66/154 (42%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA+D F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADDVYFQDPLNQFRGIKRYREMIGFMGQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ +G +T H ++ P LPW+P
Sbjct: 59 DVHAIEQ----------------QGNIINTRWTLH-WITP--------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|126660704|ref|ZP_01731803.1| hypothetical protein CY0110_09390 [Cyanothece sp. CCY0110]
gi|126618002|gb|EAZ88772.1| hypothetical protein CY0110_09390 [Cyanothece sp. CCY0110]
Length = 128
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 55/136 (40%), Gaps = 31/136 (22%)
Query: 110 TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
T +YA++ F+DP F G + Y ++ + +F+ + L NI + S + E
Sbjct: 19 TYSVYADNVYFKDPVNEFTGVKRYQEMIQFMSTWFKNVDLELHNIYQ-----SNNIIHTE 73
Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
LH +T L PWKP ISI G + ND+ I H
Sbjct: 74 W---TLHWTTPL-----------------------PWKPPISIPGRSELTFNDQNLIISH 107
Query: 230 AESWNVSALEAVGQIF 245
+ WN S + V Q F
Sbjct: 108 VDYWNCSRWDVVKQHF 123
>gi|425440423|ref|ZP_18820725.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389719126|emb|CCH96985.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 128
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 65/154 (42%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPIDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ +++ LH +T LPW+P
Sbjct: 59 DIHAIEQQENIINTRW--------TLHWTTP-----------------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSQLTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|425436057|ref|ZP_18816498.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389679290|emb|CCH91903.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 128
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|425451700|ref|ZP_18831520.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|389766861|emb|CCI07601.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 128
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|219362933|ref|NP_001136788.1| uncharacterized protein LOC100216931 [Zea mays]
gi|194697096|gb|ACF82632.1| unknown [Zea mays]
gi|414585787|tpg|DAA36358.1| TPA: hypothetical protein ZEAMMB73_852215 [Zea mays]
Length = 206
Score = 48.5 bits (114), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 31/65 (47%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
+ +++D+ +YFVTG T Y EDC F DP FRG + RN E ++
Sbjct: 124 VAEAIKADFARSYFVTGNLTLRAYEEDCEFADPAGSFRGLRRFKRNCTNFGSLLEKSNMK 183
Query: 151 LQNIE 155
L E
Sbjct: 184 LTKWE 188
>gi|425447601|ref|ZP_18827586.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389731776|emb|CCI04190.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 128
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|425459254|ref|ZP_18838740.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389823076|emb|CCI29008.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 128
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVMRQ 121
>gi|411116636|ref|ZP_11389123.1| hypothetical protein OsccyDRAFT_0516 [Oscillatoriales
cyanobacterium JSC-12]
gi|410712739|gb|EKQ70240.1| hypothetical protein OsccyDRAFT_0516 [Oscillatoriales
cyanobacterium JSC-12]
Length = 150
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/172 (22%), Positives = 69/172 (40%), Gaps = 40/172 (23%)
Query: 85 VSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFF 144
+ I + + +L+ + F T + + IY++D F+DP F+G
Sbjct: 12 TAKIQEAIAVLKQELPTL-FQTDL-SYHIYSQDIFFQDPVNTFKG--------------- 54
Query: 145 EYPSIGLQNIEKFVFLVSLMVGRREGRMR------GLHNSTALGGHIFLLPYLLRISLIC 198
KF + + R GR+ LHN + +R
Sbjct: 55 -----------KFNYRIIFWTLRFHGRLFFTELFFDLHNVQQASENT------IRADWTV 97
Query: 199 RTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGDR 250
R L+LPWKP + +G ++Y LN + I +H ++W+ E + Q + G++
Sbjct: 98 RGTLRLPWKPRLLFNGYSIYTLNSDALIFKHIDTWDRPLGEILSQFWRRGEQ 149
>gi|428780825|ref|YP_007172611.1| hypothetical protein Dacsa_2676 [Dactylococcopsis salina PCC 8305]
gi|428695104|gb|AFZ51254.1| hypothetical protein Dacsa_2676 [Dactylococcopsis salina PCC 8305]
Length = 125
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 65/156 (41%), Gaps = 34/156 (21%)
Query: 91 IVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIG 150
I+ + +DY+ + IYAED F+DP F+G + Y + + +F+ ++
Sbjct: 3 IIETIEADYQK---FPANQSYNIYAEDVYFKDPLTEFQGIKRYQSMINFISTWFKDINLE 59
Query: 151 LQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLI 210
L +I++ +H L + P LPW+P I
Sbjct: 60 LHSIKQI--------------ENTIHTEWTLN---WTTP--------------LPWQPRI 88
Query: 211 SIDGSTVYELNDELKITRHAESWNVSALEAVGQIFT 246
SI G + ++N + KI H + W+ S E + Q F+
Sbjct: 89 SIPGWSELKINTDQKIISHIDYWHCSRWEVLKQHFS 124
>gi|425453289|ref|ZP_18833047.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389801445|emb|CCI19385.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 128
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPIDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|443649469|ref|ZP_21130255.1| hypothetical protein C789_795 [Microcystis aeruginosa DIANCHI905]
gi|159027936|emb|CAO87099.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334951|gb|ELS49439.1| hypothetical protein C789_795 [Microcystis aeruginosa DIANCHI905]
Length = 128
Score = 48.1 bits (113), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 63/154 (40%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADNVYFKDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ G LH +T LPW+P
Sbjct: 59 DVHAIEQ--------QGNIINTRWTLHWTTP-----------------------LPWRPP 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|166367539|ref|YP_001659812.1| hypothetical protein MAE_47980 [Microcystis aeruginosa NIES-843]
gi|166089912|dbj|BAG04620.1| hypothetical protein MAE_47980 [Microcystis aeruginosa NIES-843]
Length = 128
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 38/154 (24%), Positives = 66/154 (42%), Gaps = 34/154 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
DI+ IL+ DY+ T +IYA++ F+DP +FRG + Y + + +F+ +
Sbjct: 2 DIIAILQQDYQRFPLDQ---TYDIYADNVYFQDPLNQFRGIKRYREMIGFMSQWFQAIKM 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
+ IE+ +G +T H ++ P LPW+P
Sbjct: 59 DVHAIEQ----------------QGNIINTRWTLH-WITP--------------LPWRPR 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQ 243
I+I G + L+D I H + W+ S + + Q
Sbjct: 88 IAISGRSELTLDDNNLIISHIDYWDCSRWDVLRQ 121
>gi|172039389|ref|YP_001805890.1| hypothetical protein cce_4476 [Cyanothece sp. ATCC 51142]
gi|354552346|ref|ZP_08971654.1| Protein of unknown function DUF2358 [Cyanothece sp. ATCC 51472]
gi|171700843|gb|ACB53824.1| unknown [Cyanothece sp. ATCC 51142]
gi|353555668|gb|EHC25056.1| Protein of unknown function DUF2358 [Cyanothece sp. ATCC 51472]
Length = 128
Score = 46.2 bits (108), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/136 (24%), Positives = 53/136 (38%), Gaps = 31/136 (22%)
Query: 110 TSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRRE 169
T +YA++ F+DP F G + Y ++ + +F+ + L NI + S + E
Sbjct: 19 TYSVYADNVYFKDPVNEFTGVKRYQEMIQFMSTWFKNVDLELHNIYQ-----SDNIIHTE 73
Query: 170 GRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRH 229
+ + LPWKP ISI G + ND+ I H
Sbjct: 74 WTLHWI--------------------------TPLPWKPPISIPGRSELTFNDQNLIISH 107
Query: 230 AESWNVSALEAVGQIF 245
+ WN S + + Q F
Sbjct: 108 IDYWNCSRWDVLKQHF 123
>gi|428770087|ref|YP_007161877.1| hypothetical protein Cyan10605_1730 [Cyanobacterium aponinum PCC
10605]
gi|428684366|gb|AFZ53833.1| Protein of unknown function DUF2358 [Cyanobacterium aponinum PCC
10605]
Length = 125
Score = 44.7 bits (104), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 62/156 (39%), Gaps = 35/156 (22%)
Query: 90 DIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSI 149
+++ I++ DY+ T IYAE+ F+DP F G + Y + L +F ++
Sbjct: 2 NLIDIIKEDYQK---FPEDQTYAIYAENVHFKDPVYDFYGLKKYQEMIAFLRKWFSNLNL 58
Query: 150 GLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPL 209
L I + + + R NS LPWKP
Sbjct: 59 ELHEINQ-------IENQINTRWTMSWNSP------------------------LPWKPF 87
Query: 210 ISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
IS+ G + +L D+L I H + W+ S + + Q F
Sbjct: 88 ISVSGRSELKLKDDL-IVGHYDYWDCSFFDMIKQHF 122
>gi|158337451|ref|YP_001518626.1| hypothetical protein AM1_4330 [Acaryochloris marina MBIC11017]
gi|158307692|gb|ABW29309.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 131
Score = 43.9 bits (102), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 38/161 (23%), Positives = 64/161 (39%), Gaps = 42/161 (26%)
Query: 91 IVTILRSDY----ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEY 146
I+ L++DY EN F ++YAED F+DP +FRG Y + + +F
Sbjct: 3 ILERLQADYDRFPENQSF-------DLYAEDVYFKDPLNQFRGVAKYQEMIGFIQKWFIN 55
Query: 147 PSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPW 206
+ + IE+ ++ + R +L + PW
Sbjct: 56 TRMEVHGIEQ-----------QDNEINT------------------RWTLYWQA--PFPW 84
Query: 207 KPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTP 247
KP ++I G + +LND + H + W+ S Q+F P
Sbjct: 85 KPQMAITGRSELKLNDAGLVCSHIDYWDCSRFSVFKQLFFP 125
>gi|119494619|ref|ZP_01624743.1| hypothetical protein L8106_16284 [Lyngbya sp. PCC 8106]
gi|119452054|gb|EAW33270.1| hypothetical protein L8106_16284 [Lyngbya sp. PCC 8106]
Length = 127
Score = 43.5 bits (101), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 37/154 (24%), Positives = 61/154 (39%), Gaps = 34/154 (22%)
Query: 92 VTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGL 151
+ I++ DY+ T +Y D F+DP +F G E Y + + + +F + L
Sbjct: 4 IDIIKQDYQR---FPEDQTYSLYTPDVYFQDPLNQFNGVERYKQMIGFIRQWFLDIQLDL 60
Query: 152 QNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLIS 211
I + ++ R +L T LPWKP +S
Sbjct: 61 HEISQAENVIKT-----------------------------RWTLSWTT--PLPWKPRVS 89
Query: 212 IDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
I G + +LN E I+ H + W+ + L+ V QI
Sbjct: 90 IPGWSELKLNSEGLISSHIDYWDCTRLDVVKQIL 123
>gi|307154421|ref|YP_003889805.1| hypothetical protein Cyan7822_4622 [Cyanothece sp. PCC 7822]
gi|306984649|gb|ADN16530.1| Protein of unknown function DUF2358 [Cyanothece sp. PCC 7822]
Length = 150
Score = 42.0 bits (97), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 37/166 (22%), Positives = 67/166 (40%), Gaps = 28/166 (16%)
Query: 84 LVSGIDDIVTILRSDYENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLLVPF 143
L + I+ ++ L++D + + +IY +D F+DP F+G N R++
Sbjct: 6 LKNKINQVIDTLKADLPTLF--EKDISYDIYTQDIYFKDPVNTFKG----KLNYRII--- 56
Query: 144 FEYPSIGLQNIEKFVFLVSLMVGRREGRMRGLHNSTALGGHIFLLPYLLRISLICRTYLK 203
Y ++ +F + E + + P ++R R L
Sbjct: 57 --YWTLRFHGQ---LFFREIFFDLHEVKE--------------IEPDIIRADWTVRGTLL 97
Query: 204 LPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPGD 249
+PWK I G + Y+LN E I H ++W+ E + Q F G+
Sbjct: 98 VPWKAYIFFKGFSTYKLNSEGLIYEHIDTWDRQPGEILQQFFRSGE 143
>gi|300869129|ref|ZP_07113728.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300332898|emb|CBN58926.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 144
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 26/139 (18%)
Query: 112 EIYAEDCIFEDPTIRFRGTELYSRNLRLLVPFFEYPSIGLQNIEKFVFLVSLMVGRREGR 171
+IY +D F+DP +F+G N R++ + + +F +
Sbjct: 30 DIYTKDIYFQDPVNKFKG----KINYRIIFWTLRFHAQ--------LFFTEIHFD----- 72
Query: 172 MRGLHNSTALGGHIFLLPYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAE 231
LH L + +R +L ++PWK I +G + Y+LN E I H +
Sbjct: 73 ---LHEVGQTASDTILAKWTVRGTL------RVPWKAQIFFNGYSTYKLNTEGLIYEHID 123
Query: 232 SWNVSALEAVGQIFTPGDR 250
+W+ E + Q GD
Sbjct: 124 TWDRKPGEVLKQFIRKGDE 142
>gi|119489662|ref|ZP_01622421.1| hypothetical protein L8106_13095 [Lyngbya sp. PCC 8106]
gi|119454399|gb|EAW35548.1| hypothetical protein L8106_13095 [Lyngbya sp. PCC 8106]
Length = 157
Score = 38.9 bits (89), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 17/54 (31%), Positives = 27/54 (50%)
Query: 191 LLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQI 244
L+ R L+LPWKP I +GS+ Y+L + H ++W+ S + Q
Sbjct: 92 LITADWTVRGTLRLPWKPQIFFNGSSAYKLTKHALVYHHEDTWDRSPATILKQF 145
>gi|428210029|ref|YP_007094382.1| hypothetical protein Chro_5140 [Chroococcidiopsis thermalis PCC
7203]
gi|428011950|gb|AFY90513.1| Protein of unknown function DUF2358 [Chroococcidiopsis thermalis
PCC 7203]
Length = 139
Score = 38.9 bits (89), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 28/51 (54%)
Query: 195 SLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
+ R L++PWK + +G + Y+LND I H ++W+ S E + Q F
Sbjct: 86 TWTVRGVLRVPWKAKVFFNGYSNYKLNDNGLIYEHIDTWDRSPKEILQQFF 136
>gi|428223898|ref|YP_007107995.1| hypothetical protein GEI7407_0442 [Geitlerinema sp. PCC 7407]
gi|427983799|gb|AFY64943.1| Protein of unknown function DUF2358 [Geitlerinema sp. PCC 7407]
Length = 160
Score = 38.5 bits (88), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 17/60 (28%), Positives = 31/60 (51%)
Query: 189 PYLLRISLICRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
P ++ ++ R L+LPW+P + +G++ Y L + I H + W+ E + Q F G
Sbjct: 98 PDVVFVTWTVRGTLRLPWRPRLFFNGNSTYRLGPDGLIYDHRDVWDRKPSELLKQFFVKG 157
>gi|397641287|gb|EJK74572.1| hypothetical protein THAOC_03742, partial [Thalassiosira oceanica]
Length = 195
Score = 38.5 bits (88), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 23/39 (58%)
Query: 100 ENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLR 138
EN Y TG S +A DC F DPT+ F+G + + N++
Sbjct: 157 ENNYLWTGNIDSSSFAADCTFTDPTLSFQGVDKFVSNVQ 195
>gi|303287839|ref|XP_003063208.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455040|gb|EEH52344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 103
Score = 38.5 bits (88), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 26/51 (50%), Gaps = 5/51 (9%)
Query: 202 LKLPWKPLISIDGSTVYELNDE-----LKITRHAESWNVSALEAVGQIFTP 247
L LPW+P I + G T + D I + E W +SAL+AV Q+ P
Sbjct: 50 LNLPWRPRIDLTGQTTFGFIDAGVERGCLIVGYREEWGISALDAVAQLVAP 100
>gi|303280892|ref|XP_003059738.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226458393|gb|EEH55690.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 101
Score = 38.5 bits (88), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/51 (39%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 202 LKLPWKPLISIDGSTVYELNDE-----LKITRHAESWNVSALEAVGQIFTP 247
L LPW+P I + G T D I + E W VSAL+AV Q+ P
Sbjct: 48 LNLPWRPRIDLTGQTTLGFMDAGVERGCLIVSYREEWGVSALDAVAQLVAP 98
>gi|242032997|ref|XP_002463893.1| hypothetical protein SORBIDRAFT_01g008410 [Sorghum bicolor]
gi|241917747|gb|EER90891.1| hypothetical protein SORBIDRAFT_01g008410 [Sorghum bicolor]
Length = 3513
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 34/70 (48%), Gaps = 3/70 (4%)
Query: 29 CSGTVNDGN-KNTNNKTATPQILKFAVSGVTELLRLFSSF--DKGRLDKVSYKQKYEILV 85
C+G +DGN T + +F G+ LLR+ + G++ VS Q Y ++V
Sbjct: 3272 CTGVSHDGNILTTGGDDGVVAVWRFVKDGIRRLLRMEKALCAHTGKITCVSVSQPYSLIV 3331
Query: 86 SGIDDIVTIL 95
SG DD IL
Sbjct: 3332 SGSDDCSVIL 3341
>gi|427735544|ref|YP_007055088.1| hypothetical protein Riv7116_2004 [Rivularia sp. PCC 7116]
gi|427370585|gb|AFY54541.1| hypothetical protein Riv7116_2004 [Rivularia sp. PCC 7116]
Length = 144
Score = 38.1 bits (87), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 26/48 (54%)
Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
R L+LPWK I +G + Y+LN + I H +SW+ E + Q F
Sbjct: 91 VRGTLRLPWKAKIFFNGHSNYKLNQDGLIYEHVDSWDRKPSEILRQFF 138
>gi|414872729|tpg|DAA51286.1| TPA: hypothetical protein ZEAMMB73_996940 [Zea mays]
Length = 2552
Score = 37.0 bits (84), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 34/70 (48%), Gaps = 3/70 (4%)
Query: 29 CSGTVNDGN-KNTNNKTATPQILKFAVSGVTELLRLFSSF--DKGRLDKVSYKQKYEILV 85
C+G +DGN T + +F G+ LLR+ + G++ VS Q Y ++V
Sbjct: 2309 CTGVSHDGNILITGGDDGVVAVWRFVKDGIRRLLRMEKALCAHTGKITCVSVSQPYSLIV 2368
Query: 86 SGIDDIVTIL 95
SG DD IL
Sbjct: 2369 SGSDDCSVIL 2378
>gi|397575083|gb|EJK49524.1| hypothetical protein THAOC_31588 [Thalassiosira oceanica]
Length = 282
Score = 37.0 bits (84), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 30/54 (55%), Gaps = 1/54 (1%)
Query: 88 IDDIVTILRSD-YENAYFVTGIFTSEIYAEDCIFEDPTIRFRGTELYSRNLRLL 140
I+D+ L + Y +FVTG+ +A+D F+DP ++ G E Y+R + L
Sbjct: 104 IEDLAAALDFELYNREWFVTGLINPIYFADDFQFQDPDVKLTGVEEYARGVNKL 157
>gi|428306747|ref|YP_007143572.1| hypothetical protein Cri9333_3229 [Crinalium epipsammum PCC 9333]
gi|428248282|gb|AFZ14062.1| Protein of unknown function DUF2358 [Crinalium epipsammum PCC 9333]
Length = 142
Score = 37.0 bits (84), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 26/51 (50%)
Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIFTPG 248
R L++PWK I +G + Y+LN + I H ++W+ E + Q G
Sbjct: 89 VRGVLRVPWKAHIFFNGYSTYKLNQDCLIYEHIDTWDRKPSEILQQFIRKG 139
>gi|113478067|ref|YP_724128.1| hypothetical protein Tery_4691 [Trichodesmium erythraeum IMS101]
gi|110169115|gb|ABG53655.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 135
Score = 36.6 bits (83), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 16/48 (33%), Positives = 26/48 (54%)
Query: 198 CRTYLKLPWKPLISIDGSTVYELNDELKITRHAESWNVSALEAVGQIF 245
R L++PWK I +G + Y+LN + I H ++W+ E + Q F
Sbjct: 85 VRGILRVPWKAKIFFNGYSTYKLNQDGLIYEHIDTWDRKPSEILKQFF 132
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.140 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,912,975,934
Number of Sequences: 23463169
Number of extensions: 157546167
Number of successful extensions: 313425
Number of sequences better than 100.0: 187
Number of HSP's better than 100.0 without gapping: 149
Number of HSP's successfully gapped in prelim test: 38
Number of HSP's that attempted gapping in prelim test: 313036
Number of HSP's gapped (non-prelim): 336
length of query: 251
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 112
effective length of database: 9,097,814,876
effective search space: 1018955266112
effective search space used: 1018955266112
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 75 (33.5 bits)