BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019829
(335 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224123278|ref|XP_002319039.1| predicted protein [Populus trichocarpa]
gi|222857415|gb|EEE94962.1| predicted protein [Populus trichocarpa]
Length = 330
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 270/335 (80%), Positives = 303/335 (90%), Gaps = 5/335 (1%)
Query: 1 MATLIYNNLCLLSSRPRVKVLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVK 60
M T++Y L+S PR+++ S +IV SS V V +++VL+QVD+ELSKGDERAAL+LVK
Sbjct: 1 MGTVVYYT-GLVS--PRLRLCS-KKIVGSS-VGVQRQQVLEQVDKELSKGDERAALSLVK 55
Query: 61 DLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVG 120
DLQGKPGGLRCFGA RQIPQRLYTLDELKLNGIETRSLLSPVD TLG+IER LQLA + G
Sbjct: 56 DLQGKPGGLRCFGAGRQIPQRLYTLDELKLNGIETRSLLSPVDATLGAIERTLQLAGVAG 115
Query: 121 GVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQH 180
G++ WN FGF+PQQIL SL LLFLWT+DSVS+NGG+GSLVLDTIGHTFSQKYHNRVIQH
Sbjct: 116 GLATWNAFGFSPQQILLFSLGLLFLWTLDSVSFNGGLGSLVLDTIGHTFSQKYHNRVIQH 175
Query: 181 EAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLN 240
EAGHFLIAY+VGILP+GYTLTS EAL+K+GS NVQAGTAFVDF+FLEEVN+GKVSATTLN
Sbjct: 176 EAGHFLIAYMVGILPRGYTLTSLEALQKDGSFNVQAGTAFVDFDFLEEVNTGKVSATTLN 235
Query: 241 RFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLR 300
RFSCIALAGVATEYLL+GYAEGGLADINKLD L+KGLGFTQKKADSQVRWS+LNT+L+LR
Sbjct: 236 RFSCIALAGVATEYLLFGYAEGGLADINKLDMLIKGLGFTQKKADSQVRWSVLNTILMLR 295
Query: 301 RHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
RH+GAR KLA AMTMGKSVGSCI IIEDNID DI
Sbjct: 296 RHEGAREKLAEAMTMGKSVGSCIGIIEDNIDAADI 330
>gi|118487460|gb|ABK95557.1| unknown [Populus trichocarpa]
Length = 330
Score = 534 bits (1375), Expect = e-149, Method: Compositional matrix adjust.
Identities = 268/335 (80%), Positives = 301/335 (89%), Gaps = 5/335 (1%)
Query: 1 MATLIYNNLCLLSSRPRVKVLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVK 60
M T++Y + PR+++ S +IV SS V V +++VL+QVD+ELSKGDERAAL+LVK
Sbjct: 1 MGTVVYYTGLV---PPRLRLCS-KKIVGSS-VGVQRQQVLEQVDKELSKGDERAALSLVK 55
Query: 61 DLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVG 120
DLQGKPGGLRCFGA RQIPQRLYTLDELKLNGIETRSLLSPVD TLG+IER LQLA + G
Sbjct: 56 DLQGKPGGLRCFGAGRQIPQRLYTLDELKLNGIETRSLLSPVDATLGAIERTLQLAGVAG 115
Query: 121 GVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQH 180
G++ WN FGF+PQQIL SL LLFLWT+DSVS+NGG+GSLVLDTIGHTFSQKYHNRVIQH
Sbjct: 116 GLATWNAFGFSPQQILLFSLGLLFLWTLDSVSFNGGLGSLVLDTIGHTFSQKYHNRVIQH 175
Query: 181 EAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLN 240
EAGHFLIAY+VGILP+GYTLTS EAL+K+GS NVQAGTAFVDF+FLEEVN+GKVSATTLN
Sbjct: 176 EAGHFLIAYMVGILPRGYTLTSLEALQKDGSFNVQAGTAFVDFDFLEEVNTGKVSATTLN 235
Query: 241 RFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLR 300
RFSCIALAGVATEYLL+GYAEGGLADINKLD L+KGLGFTQKKADSQVRWS+LNT+L+LR
Sbjct: 236 RFSCIALAGVATEYLLFGYAEGGLADINKLDMLIKGLGFTQKKADSQVRWSVLNTILMLR 295
Query: 301 RHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
RH+GAR KLA AMTMGKSVGSCI IIEDNID DI
Sbjct: 296 RHEGAREKLAEAMTMGKSVGSCIGIIEDNIDAADI 330
>gi|255542488|ref|XP_002512307.1| conserved hypothetical protein [Ricinus communis]
gi|223548268|gb|EEF49759.1| conserved hypothetical protein [Ricinus communis]
Length = 337
Score = 519 bits (1337), Expect = e-145, Method: Compositional matrix adjust.
Identities = 253/300 (84%), Positives = 281/300 (93%)
Query: 36 KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIET 95
+++VL+QVD+EL KGDERAALALV+DLQGKP GL+C+GAARQ+PQRLYTLDELKLNGIE
Sbjct: 38 RQQVLEQVDKELKKGDERAALALVRDLQGKPAGLKCYGAARQVPQRLYTLDELKLNGIEA 97
Query: 96 RSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNG 155
SLLSPVD TLGSIERNLQL ++GG++AWN FGF+PQQI Y SL LLFL T+DSVS+NG
Sbjct: 98 ASLLSPVDATLGSIERNLQLCGLLGGIAAWNTFGFSPQQIFYFSLGLLFLLTLDSVSFNG 157
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
GVGSLVLDTIGHTFSQKYH+RVIQHEAGHFLIAYLVGILPKGYTL+S EAL+KEGSLNVQ
Sbjct: 158 GVGSLVLDTIGHTFSQKYHSRVIQHEAGHFLIAYLVGILPKGYTLSSLEALQKEGSLNVQ 217
Query: 216 AGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK 275
AGTAFVDFEFLEEVN+GK+SA +LNRFSCIALAGVATEYLLYG+AEGGLADINKLD LLK
Sbjct: 218 AGTAFVDFEFLEEVNTGKLSAMSLNRFSCIALAGVATEYLLYGFAEGGLADINKLDMLLK 277
Query: 276 GLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
GLGFTQKKADSQVRWS+LNT+L+LRRH GARAKLA AM+MGKSVGSCI IIEDNI+D DI
Sbjct: 278 GLGFTQKKADSQVRWSVLNTILILRRHDGARAKLAEAMSMGKSVGSCISIIEDNINDADI 337
>gi|225451100|ref|XP_002266105.1| PREDICTED: uncharacterized protein LOC100267587 [Vitis vinifera]
gi|298205011|emb|CBI34318.3| unnamed protein product [Vitis vinifera]
Length = 332
Score = 516 bits (1328), Expect = e-144, Method: Compositional matrix adjust.
Identities = 272/335 (81%), Positives = 305/335 (91%), Gaps = 3/335 (0%)
Query: 1 MATLIYNNLCLLSSRPRVKVLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVK 60
M T++Y + S PR++V +L RIVSS VS+ +VL+QVD+EL+KGD+R AL+LVK
Sbjct: 1 MGTVVYR--AAVCSHPRLRVPTL-RIVSSLGTNVSRWQVLEQVDKELAKGDDRTALSLVK 57
Query: 61 DLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVG 120
DLQGKP GLRCFGAARQ+PQRLYTLDELKLNGIE SLLSPVD TLGSIERNLQLAA++G
Sbjct: 58 DLQGKPDGLRCFGAARQVPQRLYTLDELKLNGIEASSLLSPVDATLGSIERNLQLAAVLG 117
Query: 121 GVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQH 180
G+SAWNVFGF+PQQ+L+LSL LLFLWT+D VS+NGGVGSLVLDTIGHTFSQKYHNRVIQH
Sbjct: 118 GLSAWNVFGFSPQQLLFLSLGLLFLWTLDLVSFNGGVGSLVLDTIGHTFSQKYHNRVIQH 177
Query: 181 EAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLN 240
EAGHFLIAYL+GILPKGYTLTS EAL+KEGSLN+QAGTAFVDFEFLEEVNSGKVSAT LN
Sbjct: 178 EAGHFLIAYLLGILPKGYTLTSLEALQKEGSLNIQAGTAFVDFEFLEEVNSGKVSATMLN 237
Query: 241 RFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLR 300
RFSCIALAGV TEYLLYGYAEGGLADINKLD LLKGLGFTQKKADSQVRWS+LNTVL+LR
Sbjct: 238 RFSCIALAGVVTEYLLYGYAEGGLADINKLDLLLKGLGFTQKKADSQVRWSVLNTVLILR 297
Query: 301 RHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
R+K AR KLA AM+MGKSVG+CI +IE+NID+TDI
Sbjct: 298 RNKQARGKLAEAMSMGKSVGACIGVIEENIDNTDI 332
>gi|124360157|gb|ABN08173.1| Peptidase, cysteine peptidase active site [Medicago truncatula]
Length = 340
Score = 512 bits (1319), Expect = e-143, Method: Compositional matrix adjust.
Identities = 244/311 (78%), Positives = 282/311 (90%)
Query: 25 RIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
RI S++V S++++L+ VD+EL+KGD+RAALALVKDLQGKP GL+CFGAARQ+PQRLYT
Sbjct: 30 RIQCSTKVSSSRQQLLESVDKELTKGDDRAALALVKDLQGKPNGLQCFGAARQVPQRLYT 89
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
LDELKLNGIE SLLSPVD TLGSIERNL +AAIVGG +AWNVFG + QQI Y+SL LLF
Sbjct: 90 LDELKLNGIEAMSLLSPVDTTLGSIERNLLIAAIVGGFAAWNVFGISQQQIFYISLGLLF 149
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
LWT+D VS+ GG+GSLV+DTIGH FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTL+S +
Sbjct: 150 LWTLDLVSFGGGLGSLVVDTIGHKFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLSSLD 209
Query: 205 ALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGL 264
+ KEGSLN+QAGTAFVDFEFLEEVNSGKVSATTLN+FSCIALAGV TEYL+YG++EGGL
Sbjct: 210 GMMKEGSLNIQAGTAFVDFEFLEEVNSGKVSATTLNKFSCIALAGVCTEYLIYGFSEGGL 269
Query: 265 ADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
DI KLD+LL GLGFTQKKADSQVRWS+LNTVLLLRRH+ AR+KLA AM+MG SVGSCID
Sbjct: 270 DDIRKLDSLLNGLGFTQKKADSQVRWSVLNTVLLLRRHEAARSKLAEAMSMGSSVGSCID 329
Query: 325 IIEDNIDDTDI 335
IIE++I+ +D+
Sbjct: 330 IIENSINVSDL 340
>gi|350537327|ref|NP_001233778.1| stress regulated protein [Solanum lycopersicum]
gi|58198163|gb|AAW65813.1| stress regulated protein [Solanum lycopersicum]
Length = 326
Score = 509 bits (1310), Expect = e-142, Method: Compositional matrix adjust.
Identities = 245/316 (77%), Positives = 281/316 (88%)
Query: 20 VLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIP 79
+L S + SS+ VS VL+QVD+EL+KGDERAAL+LVKDLQGKPGGLRCFGAARQ+P
Sbjct: 11 LLCCSELRSSNSRIVSGLSVLEQVDKELTKGDERAALSLVKDLQGKPGGLRCFGAARQVP 70
Query: 80 QRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLS 139
QRLY+LDELKLNGIE SLLSPVD TLG+IERNLQ+AAI+ G +AW +PQQI ++S
Sbjct: 71 QRLYSLDELKLNGIEAISLLSPVDATLGAIERNLQIAAILSGSAAWYALDLSPQQIFFVS 130
Query: 140 LALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYT 199
L +LFLWT+D VS+NGGVG+LVLDTIGHTFSQKYHNRVIQHEAGHFLIAYL+GILPKGYT
Sbjct: 131 LGVLFLWTLDLVSFNGGVGTLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLLGILPKGYT 190
Query: 200 LTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGY 259
LTS +ALKKEGSLN+QAGTAFVD EF+EEVN GKV+AT LNRFSCIALAGVATEYLL+GY
Sbjct: 191 LTSLDALKKEGSLNIQAGTAFVDLEFIEEVNRGKVTATMLNRFSCIALAGVATEYLLFGY 250
Query: 260 AEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSV 319
AEGGL+DIN+LDALLK LGFTQKKADSQVRW++LNT+L+LRRH+ AR+KLA AMT GKSV
Sbjct: 251 AEGGLSDINQLDALLKSLGFTQKKADSQVRWAVLNTILILRRHEKARSKLAEAMTQGKSV 310
Query: 320 GSCIDIIEDNIDDTDI 335
G CIDIIE +I D D+
Sbjct: 311 GVCIDIIEKSISDDDL 326
>gi|147815273|emb|CAN70022.1| hypothetical protein VITISV_030170 [Vitis vinifera]
Length = 320
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 267/319 (83%), Positives = 297/319 (93%), Gaps = 1/319 (0%)
Query: 17 RVKVLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAAR 76
R++V +L RIVSS VS+ +VL+QVD+EL+KGD+R AL+LVKDLQGKP GLRCFGAAR
Sbjct: 3 RLRVPTL-RIVSSLGTNVSRWQVLEQVDKELAKGDDRTALSLVKDLQGKPDGLRCFGAAR 61
Query: 77 QIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQIL 136
Q+PQRLYTLDELKLNGIE SLLSPVD TLGSIERNLQLAA++GG+SAWNVFGF+PQQ+L
Sbjct: 62 QVPQRLYTLDELKLNGIEASSLLSPVDATLGSIERNLQLAAVLGGLSAWNVFGFSPQQLL 121
Query: 137 YLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPK 196
+LSL LLFLWT+D VS+NGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYL+GILPK
Sbjct: 122 FLSLGLLFLWTLDLVSFNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLLGILPK 181
Query: 197 GYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
GYTLTS EAL+KEGSLN+QAGTAFVDFEFLEEVNSGKVSAT LNRFSCIALAGV TEYLL
Sbjct: 182 GYTLTSLEALQKEGSLNIQAGTAFVDFEFLEEVNSGKVSATMLNRFSCIALAGVVTEYLL 241
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YGYAEGGLADINKLD LLKGLGFTQKKADSQVRWS+LNTVL+LRR+K AR KLA AM+MG
Sbjct: 242 YGYAEGGLADINKLDLLLKGLGFTQKKADSQVRWSVLNTVLILRRNKQARGKLAEAMSMG 301
Query: 317 KSVGSCIDIIEDNIDDTDI 335
KSVG+CI +IE+NID+TDI
Sbjct: 302 KSVGACIGVIEENIDNTDI 320
>gi|357512109|ref|XP_003626343.1| hypothetical protein MTR_7g114070 [Medicago truncatula]
gi|355501358|gb|AES82561.1| hypothetical protein MTR_7g114070 [Medicago truncatula]
Length = 347
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 244/318 (76%), Positives = 282/318 (88%), Gaps = 7/318 (2%)
Query: 25 RIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
RI S++V S++++L+ VD+EL+KGD+RAALALVKDLQGKP GL+CFGAARQ+PQRLYT
Sbjct: 30 RIQCSTKVSSSRQQLLESVDKELTKGDDRAALALVKDLQGKPNGLQCFGAARQVPQRLYT 89
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
LDELKLNGIE SLLSPVD TLGSIERNL +AAIVGG +AWNVFG + QQI Y+SL LLF
Sbjct: 90 LDELKLNGIEAMSLLSPVDTTLGSIERNLLIAAIVGGFAAWNVFGISQQQIFYISLGLLF 149
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
LWT+D VS+ GG+GSLV+DTIGH FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTL+S +
Sbjct: 150 LWTLDLVSFGGGLGSLVVDTIGHKFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLSSLD 209
Query: 205 ALKKEGSLNVQAGTAFVDFEFLEE-------VNSGKVSATTLNRFSCIALAGVATEYLLY 257
+ KEGSLN+QAGTAFVDFEFLEE VNSGKVSATTLN+FSCIALAGV TEYL+Y
Sbjct: 210 GMMKEGSLNIQAGTAFVDFEFLEEVGCFICSVNSGKVSATTLNKFSCIALAGVCTEYLIY 269
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
G++EGGL DI KLD+LL GLGFTQKKADSQVRWS+LNTVLLLRRH+ AR+KLA AM+MG
Sbjct: 270 GFSEGGLDDIRKLDSLLNGLGFTQKKADSQVRWSVLNTVLLLRRHEAARSKLAEAMSMGS 329
Query: 318 SVGSCIDIIEDNIDDTDI 335
SVGSCIDIIE++I+ +D+
Sbjct: 330 SVGSCIDIIENSINVSDL 347
>gi|58198153|gb|AAW65807.1| stress regulated protein isoform 3 [Solanum virginianum]
gi|58198161|gb|AAW65812.1| stress regulated protein isoform 3 [Solanum virginianum]
Length = 326
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 242/316 (76%), Positives = 282/316 (89%)
Query: 20 VLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIP 79
+L S++ SS+ +S VL+QVD+EL KGDERAAL+LVKDL+GKPGGLRCFGAARQ+P
Sbjct: 11 LLCCSQLRSSNSRILSGLSVLEQVDKELMKGDERAALSLVKDLRGKPGGLRCFGAARQVP 70
Query: 80 QRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLS 139
QRLY+LDEL+LNGIET SLLSPVD TLG+IERNLQ+ AI+ G +AW +PQQIL++S
Sbjct: 71 QRLYSLDELRLNGIETVSLLSPVDATLGAIERNLQIVAILSGSAAWYALDLSPQQILFVS 130
Query: 140 LALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYT 199
L +LFLWT+D VS+NGGVG+LVLDTIGHTFSQKYHNRV QHEAGHFLIAYL+GILPKGYT
Sbjct: 131 LGVLFLWTLDLVSFNGGVGALVLDTIGHTFSQKYHNRVTQHEAGHFLIAYLLGILPKGYT 190
Query: 200 LTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGY 259
LTS +ALKK+GSLN+QAGTAFVDFEF+EEVNSGKV+AT LNRFSCIALAGVATEYLL+GY
Sbjct: 191 LTSLDALKKQGSLNIQAGTAFVDFEFIEEVNSGKVTATMLNRFSCIALAGVATEYLLFGY 250
Query: 260 AEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSV 319
AEGGL+DIN+LDALLK LGFTQKKADSQVRW++LNT+L+LRRH+ AR KLA AMT GKSV
Sbjct: 251 AEGGLSDINQLDALLKSLGFTQKKADSQVRWAVLNTILILRRHEKARVKLAEAMTRGKSV 310
Query: 320 GSCIDIIEDNIDDTDI 335
G CIDIIE +I + D+
Sbjct: 311 GVCIDIIEKSISEDDL 326
>gi|297808789|ref|XP_002872278.1| hypothetical protein ARALYDRAFT_910860 [Arabidopsis lyrata subsp.
lyrata]
gi|297318115|gb|EFH48537.1| hypothetical protein ARALYDRAFT_910860 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 239/311 (76%), Positives = 277/311 (89%), Gaps = 1/311 (0%)
Query: 26 IVSSSEVEVSKRR-VLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
IV SSE +S RR VL+QVD +LS GDERAAL+LV+DLQGKP GLRCFGAARQ+PQRLY+
Sbjct: 31 IVCSSETALSTRRQVLEQVDSKLSSGDERAALSLVRDLQGKPDGLRCFGAARQVPQRLYS 90
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
L+ELKLNGI SLLSP D TLGSIERNLQ+A + GG+ AW F + QQ+L+LSL +F
Sbjct: 91 LEELKLNGINAASLLSPTDATLGSIERNLQIAGVSGGIVAWKAFDLSSQQLLFLSLGFMF 150
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
LWT+D VS+NGG+ SLVLDTIGHTFSQ+YHNRV+QHEAGHFL+AYLVGILP+GYTL+S E
Sbjct: 151 LWTLDLVSFNGGIASLVLDTIGHTFSQRYHNRVVQHEAGHFLVAYLVGILPRGYTLSSLE 210
Query: 205 ALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGL 264
AL+KEGSLN+QAG+AFVD+EFLEEVNSGKVSAT LNRFSCIALAGVATEYLLYGYAEGGL
Sbjct: 211 ALQKEGSLNIQAGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLLYGYAEGGL 270
Query: 265 ADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
DI+KLD L+K LGFTQKKADSQVRWS+LNT+LLLRRH+ AR+KLA AM+ G+SVGSCI+
Sbjct: 271 DDISKLDGLVKSLGFTQKKADSQVRWSVLNTILLLRRHEIARSKLAQAMSKGESVGSCIE 330
Query: 325 IIEDNIDDTDI 335
IIED+ID +DI
Sbjct: 331 IIEDSIDPSDI 341
>gi|18421099|ref|NP_568492.1| putative stress regulated protein [Arabidopsis thaliana]
gi|115311399|gb|ABI93880.1| At5g27290 [Arabidopsis thaliana]
gi|332006284|gb|AED93667.1| putative stress regulated protein [Arabidopsis thaliana]
Length = 341
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 238/311 (76%), Positives = 276/311 (88%), Gaps = 1/311 (0%)
Query: 26 IVSSSEVEVS-KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
IV SSE +S +R+ L+QVD +LS GDERAAL+LVKDLQGKP GLRCFGAARQ+PQRLYT
Sbjct: 31 IVCSSETGLSIRRQALEQVDSKLSSGDERAALSLVKDLQGKPDGLRCFGAARQVPQRLYT 90
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
L+ELKLNGI SLLSP D TLGSIERNLQ+AA+ GG+ AW F + QQ+ +L+L +F
Sbjct: 91 LEELKLNGINAASLLSPTDTTLGSIERNLQIAAVSGGIVAWKAFDLSSQQLFFLTLGFMF 150
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
LWT+D VS+NGG+GSLVLDT GHTFSQ+YHNRV+QHEAGHFL+AYLVGILP+GYTL+S E
Sbjct: 151 LWTLDLVSFNGGIGSLVLDTTGHTFSQRYHNRVVQHEAGHFLVAYLVGILPRGYTLSSLE 210
Query: 205 ALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGL 264
AL+KEGSLN+QAG+AFVD+EFLEEVNSGKVSAT LNRFSCIALAGVATEYLLYGYAEGGL
Sbjct: 211 ALQKEGSLNIQAGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLLYGYAEGGL 270
Query: 265 ADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
DI+KLD L+K LGFTQKKADSQVRWS+LNT+LLLRRH+ AR+KLA AM+ G+SVGSCI
Sbjct: 271 DDISKLDGLVKSLGFTQKKADSQVRWSVLNTILLLRRHEIARSKLAQAMSKGESVGSCIQ 330
Query: 325 IIEDNIDDTDI 335
IIED+ID +DI
Sbjct: 331 IIEDSIDPSDI 341
>gi|449485767|ref|XP_004157269.1| PREDICTED: uncharacterized protein LOC101223311 [Cucumis sativus]
Length = 327
Score = 499 bits (1284), Expect = e-139, Method: Compositional matrix adjust.
Identities = 243/325 (74%), Positives = 285/325 (87%), Gaps = 2/325 (0%)
Query: 11 LLSSRPRVKVLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLR 70
+L RP + + + S+EV + +R+VL+QVD+EL+KGD+RAAL L+K+ QGK G+R
Sbjct: 5 VLRYRPCFRAVGIWN--CSTEVGLRRRQVLEQVDKELAKGDDRAALFLLKESQGKLDGVR 62
Query: 71 CFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGF 130
CFGAARQIPQRLYTL+ELKLNGIET SLLSP+D TLGSIER +QLAA + VSAWN+F F
Sbjct: 63 CFGAARQIPQRLYTLEELKLNGIETSSLLSPLDSTLGSIERYIQLAAGLLAVSAWNLFEF 122
Query: 131 NPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYL 190
PQQI Y+SL LFLWT+DSV+ NGGVGSLVLDTIGHTFS+KYHNRVIQHEAGHFLIAYL
Sbjct: 123 TPQQIFYISLGFLFLWTLDSVALNGGVGSLVLDTIGHTFSKKYHNRVIQHEAGHFLIAYL 182
Query: 191 VGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGV 250
+G+LPKGYT +SFEA +KEGSLN+QAGTAFVDFEFLEEVN+GKVSAT LNRFSCIALAGV
Sbjct: 183 LGVLPKGYTTSSFEAFQKEGSLNLQAGTAFVDFEFLEEVNAGKVSATMLNRFSCIALAGV 242
Query: 251 ATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLA 310
ATEYLLYG AEGGLADINKLD LLKGLGFTQKKADSQVRW++LNT+L+LRRH+ ARAKLA
Sbjct: 243 ATEYLLYGCAEGGLADINKLDVLLKGLGFTQKKADSQVRWAVLNTILILRRHESARAKLA 302
Query: 311 VAMTMGKSVGSCIDIIEDNIDDTDI 335
AM+ GKSVG+CID++E++I D+
Sbjct: 303 DAMSSGKSVGNCIDVVENSIHLPDL 327
>gi|449435722|ref|XP_004135643.1| PREDICTED: uncharacterized protein LOC101213973 [Cucumis sativus]
Length = 327
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/307 (78%), Positives = 278/307 (90%)
Query: 29 SSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDEL 88
S+EV + +R+VL+QVD+EL+KGD+RAAL L+K+ QGK G+RCFGAARQIPQRLYTL+EL
Sbjct: 21 STEVGLRRRQVLEQVDKELAKGDDRAALFLLKESQGKLDGVRCFGAARQIPQRLYTLEEL 80
Query: 89 KLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTV 148
KLNGIET SLLSP+D TLGSIER +QLAA + VSAWN+F F PQQI Y+SL LFLWT+
Sbjct: 81 KLNGIETSSLLSPLDSTLGSIERYIQLAAGLLAVSAWNLFEFTPQQIFYISLGFLFLWTL 140
Query: 149 DSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKK 208
DSV+ NGGVGSLVLDTIGHTFS+KYHNRVIQHEAGHFLIAYL+G+LPKGYT +SFEA +K
Sbjct: 141 DSVALNGGVGSLVLDTIGHTFSKKYHNRVIQHEAGHFLIAYLLGVLPKGYTTSSFEAFQK 200
Query: 209 EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADIN 268
EGSLN+QAGTAFVDFEFLEEVN+GKVSAT LNRFSCIALAGVATEYLLYG AEGGLADIN
Sbjct: 201 EGSLNLQAGTAFVDFEFLEEVNAGKVSATMLNRFSCIALAGVATEYLLYGCAEGGLADIN 260
Query: 269 KLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
KLD LLKGLGFTQKKADSQVRW++LNT+L+LRRH+ ARAKLA AM+ GKSVG+CID++E+
Sbjct: 261 KLDVLLKGLGFTQKKADSQVRWAVLNTILILRRHESARAKLADAMSSGKSVGNCIDVVEN 320
Query: 329 NIDDTDI 335
+I D+
Sbjct: 321 SIHLPDL 327
>gi|21593356|gb|AAM65305.1| unknown [Arabidopsis thaliana]
Length = 341
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 237/311 (76%), Positives = 275/311 (88%), Gaps = 1/311 (0%)
Query: 26 IVSSSEVEVS-KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
IV SSE +S +R+ L+QVD +LS GDERAAL+LVKDLQGKP GLRCFGAARQ+PQRLYT
Sbjct: 31 IVCSSETGLSIRRQALEQVDSKLSSGDERAALSLVKDLQGKPDGLRCFGAARQVPQRLYT 90
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
L+ELKLNGI SLLSP D TLGSIERNLQ+AA+ GG+ AW F + QQ+ +L+L +F
Sbjct: 91 LEELKLNGINAASLLSPTDTTLGSIERNLQIAAVSGGIVAWKAFDLSSQQLFFLTLGFMF 150
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
LWT+D VS+NGG+GSLVLDT GHTFSQ+YHNRV+QHEAGHFL+AYLV ILP+GYTL+S E
Sbjct: 151 LWTLDLVSFNGGIGSLVLDTTGHTFSQRYHNRVVQHEAGHFLVAYLVEILPRGYTLSSLE 210
Query: 205 ALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGL 264
AL+KEGSLN+QAG+AFVD+EFLEEVNSGKVSAT LNRFSCIALAGVATEYLLYGYAEGGL
Sbjct: 211 ALQKEGSLNIQAGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLLYGYAEGGL 270
Query: 265 ADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
DI+KLD L+K LGFTQKKADSQVRWS+LNT+LLLRRH+ AR+KLA AM+ G+SVGSCI
Sbjct: 271 DDISKLDGLVKSLGFTQKKADSQVRWSVLNTILLLRRHEIARSKLAQAMSKGESVGSCIQ 330
Query: 325 IIEDNIDDTDI 335
IIED+ID +DI
Sbjct: 331 IIEDSIDPSDI 341
>gi|2191197|gb|AAB61082.1| contains similarity to Synechococcus PCC7942 chromosomal region
used as basis of neutral siteII recombinational cloning
vector (PID:g1174192) [Arabidopsis thaliana]
Length = 386
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 230/356 (64%), Positives = 268/356 (75%), Gaps = 46/356 (12%)
Query: 26 IVSSSEVEVS-KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
IV SSE +S +R+ L+QVD +LS GDERAAL+LVKDLQGKP GLRCFGAARQ+PQRLYT
Sbjct: 31 IVCSSETGLSIRRQALEQVDSKLSSGDERAALSLVKDLQGKPDGLRCFGAARQVPQRLYT 90
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
L+ELKLNGI SLLSP D TLGSIERNLQ+AA+ GG+ AW F + QQ+ +L+L +F
Sbjct: 91 LEELKLNGINAASLLSPTDTTLGSIERNLQIAAVSGGIVAWKAFDLSSQQLFFLTLGFMF 150
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVI----------------QHEAGHFLIA 188
LWT+D VS+NGG+GSLVLDT GHTFSQ+YHNRV+ QHEAGHFL+A
Sbjct: 151 LWTLDLVSFNGGIGSLVLDTTGHTFSQRYHNRVVQKHYIIFHWTYCELRSQHEAGHFLVA 210
Query: 189 YLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKV---SATTLNRFSCI 245
YLVGILP+GYTL+S EAL+KEGSLN+QAG+AFVD+EFLEE N LNRFSCI
Sbjct: 211 YLVGILPRGYTLSSLEALQKEGSLNIQAGSAFVDYEFLEEPNKKLCLLFQNQMLNRFSCI 270
Query: 246 ALAGVATEYLLYGYAEGGLADINK--------------------------LDALLKGLGF 279
ALAGVATEYLLYGYAEGGL DI+K LD L+K LGF
Sbjct: 271 ALAGVATEYLLYGYAEGGLDDISKVSFLLPLKNSSDYVNMLYGFVVLMEQLDGLVKSLGF 330
Query: 280 TQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
TQKKADSQVRWS+LNT+LLLRRH+ AR+KLA AM+ G+SVGSCI IIED+ID +DI
Sbjct: 331 TQKKADSQVRWSVLNTILLLRRHEIARSKLAQAMSKGESVGSCIQIIEDSIDPSDI 386
>gi|357132185|ref|XP_003567712.1| PREDICTED: uncharacterized protein LOC100844265 [Brachypodium
distachyon]
Length = 340
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 213/308 (69%), Positives = 261/308 (84%)
Query: 28 SSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDE 87
+SS+ + VL QVD+EL KG++ AAL+LV+ G+ GGLRCFGAARQ+PQRLY LDE
Sbjct: 33 TSSKAASRRSAVLGQVDDELQKGNDEAALSLVRSSLGEGGGLRCFGAARQVPQRLYKLDE 92
Query: 88 LKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWT 147
LKLNGI+T S LSPVD TLGSIERN Q+AA++GG+S VF + Q+L+L + LLF+W+
Sbjct: 93 LKLNGIDTSSFLSPVDRTLGSIERNFQIAAVLGGLSVSAVFELSQFQVLFLFVGLLFVWS 152
Query: 148 VDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALK 207
VD + +NGGV +LVLDT+GH SQKYHNRVIQHEAGHFLIAYL+G+LPK YT+TS + L
Sbjct: 153 VDLIYFNGGVRNLVLDTVGHNLSQKYHNRVIQHEAGHFLIAYLLGVLPKEYTITSLDTLM 212
Query: 208 KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADI 267
K+GSLNVQAGTAFVDFEF+EE+N+GK+SA LN+FSCIALAGVATEYLLYGYAEGGLAD+
Sbjct: 213 KQGSLNVQAGTAFVDFEFVEEINTGKLSAMMLNKFSCIALAGVATEYLLYGYAEGGLADV 272
Query: 268 NKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
NKLD L K LGFTQ KADSQVRW++LNTVL+LRRH+ AR+KLA AM+ G+SVGSCI++IE
Sbjct: 273 NKLDGLFKSLGFTQNKADSQVRWAVLNTVLILRRHEDARSKLAEAMSSGRSVGSCIEVIE 332
Query: 328 DNIDDTDI 335
+N++ DI
Sbjct: 333 ENVNTEDI 340
>gi|326525683|dbj|BAJ88888.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 343
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 211/309 (68%), Positives = 260/309 (84%), Gaps = 1/309 (0%)
Query: 28 SSSEVEVSK-RRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLD 86
+SSE ++ R VL+QVD+EL KG++ AAL+LV+ QG+ GGLRCFGAARQ+PQRLY LD
Sbjct: 35 TSSEPAAARLRAVLEQVDDELRKGNDEAALSLVRGSQGEGGGLRCFGAARQVPQRLYKLD 94
Query: 87 ELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLW 146
ELKLNGI+T S LSPVD+TLGSIERNLQ+AA++GG+S F + Q L + + LLF+W
Sbjct: 95 ELKLNGIDTSSFLSPVDQTLGSIERNLQIAALLGGLSVSVAFELSQLQALLIFVGLLFVW 154
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
+VD + YNGG +LVLDTI H+ S+KYHNRVI+HEAGHFLIAYL+G+LPK YT+T + L
Sbjct: 155 SVDLIYYNGGARNLVLDTIAHSLSEKYHNRVIEHEAGHFLIAYLLGVLPKEYTITCLDTL 214
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
K+GSLNVQAGTAFVDFEF+EE+N+GK+SA LN+FSCIALAGVATEYLLYGYAEGGL D
Sbjct: 215 TKQGSLNVQAGTAFVDFEFVEEINTGKLSAKMLNKFSCIALAGVATEYLLYGYAEGGLDD 274
Query: 267 INKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDII 326
+NKLD L K LGFTQ KADSQVRW++LN VL+LRRH+ AR+KLA AM+ G+SVGSCI++I
Sbjct: 275 VNKLDGLFKSLGFTQNKADSQVRWAVLNIVLILRRHEKARSKLAEAMSSGRSVGSCIEVI 334
Query: 327 EDNIDDTDI 335
E+NI+ DI
Sbjct: 335 EENINPEDI 343
>gi|115436670|ref|NP_001043093.1| Os01g0382700 [Oryza sativa Japonica Group]
gi|55297157|dbj|BAD68814.1| ATP-dependent Zn proteases-like protein [Oryza sativa Japonica
Group]
gi|113532624|dbj|BAF05007.1| Os01g0382700 [Oryza sativa Japonica Group]
gi|222618488|gb|EEE54620.1| hypothetical protein OsJ_01867 [Oryza sativa Japonica Group]
Length = 346
Score = 436 bits (1121), Expect = e-120, Method: Compositional matrix adjust.
Identities = 225/320 (70%), Positives = 266/320 (83%), Gaps = 1/320 (0%)
Query: 17 RVKVLSLSRIV-SSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAA 75
R+++L +R +SSE R VL+QVDEEL KG++ AAL+LV+ QG GGLR FGAA
Sbjct: 27 RLRLLPPARAANTSSEPAARLRAVLEQVDEELRKGNDEAALSLVRGSQGADGGLRFFGAA 86
Query: 76 RQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQI 135
RQ+PQRLYTLDELKLNGI+T + LSPVD TLGSIERNLQ+AA++GG+S F + Q+
Sbjct: 87 RQVPQRLYTLDELKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGLSVSAAFELSKLQV 146
Query: 136 LYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILP 195
L+L L LL LW+VD V + GGV +L+LDTIGH SQKY NRVIQHEAGHFLIAYL+G+LP
Sbjct: 147 LFLFLGLLSLWSVDLVYFGGGVRNLILDTIGHNLSQKYRNRVIQHEAGHFLIAYLLGVLP 206
Query: 196 KGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYL 255
KGYT+TS + K+GSLNVQAGTAFVDFEFL+EVNSGK+SAT LN+FSCIALAGVATEYL
Sbjct: 207 KGYTITSLDTFIKKGSLNVQAGTAFVDFEFLQEVNSGKLSATMLNKFSCIALAGVATEYL 266
Query: 256 LYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTM 315
LYGYAEGGLADI +LD LLKGLGFTQKKADSQVRW++LNTV LRRHK AR++LA AM+
Sbjct: 267 LYGYAEGGLADIGQLDGLLKGLGFTQKKADSQVRWAVLNTVPALRRHKKARSQLAEAMSS 326
Query: 316 GKSVGSCIDIIEDNIDDTDI 335
GKSVGSCI +IE+NI+ DI
Sbjct: 327 GKSVGSCIGVIEENINSEDI 346
>gi|242057859|ref|XP_002458075.1| hypothetical protein SORBIDRAFT_03g026530 [Sorghum bicolor]
gi|241930050|gb|EES03195.1| hypothetical protein SORBIDRAFT_03g026530 [Sorghum bicolor]
Length = 343
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/308 (70%), Positives = 257/308 (83%)
Query: 28 SSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDE 87
S+S+ R VL+QVD+ LSKG++ AAL+LV+ QG GGLR FGAARQ+PQRLYTLDE
Sbjct: 36 SASQSPARLRAVLEQVDQALSKGNDEAALSLVRSSQGVDGGLRAFGAARQVPQRLYTLDE 95
Query: 88 LKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWT 147
LKLNGI+T + LSPVD TLGSIERN+Q+AAI+GG+S + QIL+L L LL LW+
Sbjct: 96 LKLNGIDTSAFLSPVDLTLGSIERNIQIAAILGGLSVSAALELSQLQILFLILGLLSLWS 155
Query: 148 VDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALK 207
VD V + GGV +LVLDTIGH SQKYHNRV+QHEAGHFLIAYL+G+LPKGYT+TS + L
Sbjct: 156 VDFVYFGGGVRNLVLDTIGHYLSQKYHNRVVQHEAGHFLIAYLLGVLPKGYTITSLDTLI 215
Query: 208 KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADI 267
+GSLNVQAGTAFVD+EFL E+N+GK+SAT +N+FSCIALAGVATEYLLYG AEGGLADI
Sbjct: 216 NQGSLNVQAGTAFVDYEFLGEINTGKLSATMVNKFSCIALAGVATEYLLYGLAEGGLADI 275
Query: 268 NKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
NKLD LLK LGFTQKKADSQVRW++LNTVL+LRRH+ AR++LA AM+ GKSVGSCI +IE
Sbjct: 276 NKLDGLLKSLGFTQKKADSQVRWAVLNTVLILRRHEKARSQLAEAMSTGKSVGSCIQVIE 335
Query: 328 DNIDDTDI 335
+ I DI
Sbjct: 336 ECISTDDI 343
>gi|194695982|gb|ACF82075.1| unknown [Zea mays]
Length = 326
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/308 (69%), Positives = 254/308 (82%)
Query: 28 SSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDE 87
SSS+ VL+QVD+ LS+G++ AAL+LV+ QG GGLR FGAARQ+PQRLY LDE
Sbjct: 19 SSSQSPARLTAVLEQVDQLLSQGNDMAALSLVRSTQGGDGGLRAFGAARQVPQRLYKLDE 78
Query: 88 LKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWT 147
LKLNGI+T + LSPVD TLGSIERNLQ+AA++GG S + QIL+L L LL LW+
Sbjct: 79 LKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGFSVSAALELSQLQILFLILGLLSLWS 138
Query: 148 VDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALK 207
VD V + GGV +LVLDTIGH SQKYHNRVIQHEAGHFLIAYL+G+LPKGYT+TS + L
Sbjct: 139 VDFVYFGGGVRNLVLDTIGHYLSQKYHNRVIQHEAGHFLIAYLLGVLPKGYTITSLDTLM 198
Query: 208 KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADI 267
+GSLNVQAG AFVD+EFLEE+N+GK+SAT +N+FSCIALAGVA EYLLYG AEGGLADI
Sbjct: 199 NQGSLNVQAGAAFVDYEFLEEINTGKLSATMVNKFSCIALAGVAAEYLLYGRAEGGLADI 258
Query: 268 NKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
NKLD LLKGLGFTQKKADSQVRW++LNTVL+LRRH+ AR++LA AM+ GKSVGSCI +IE
Sbjct: 259 NKLDGLLKGLGFTQKKADSQVRWAVLNTVLILRRHEKARSQLAEAMSTGKSVGSCIQVIE 318
Query: 328 DNIDDTDI 335
+ I DI
Sbjct: 319 ECISTDDI 326
>gi|414877462|tpg|DAA54593.1| TPA: stress regulated protein [Zea mays]
Length = 343
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 215/309 (69%), Positives = 254/309 (82%)
Query: 27 VSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLD 86
SSS+ VL+QVD+ LS+G++ AAL+LV+ QG GGLR FGAARQ+PQRLY LD
Sbjct: 35 TSSSQSPARLTAVLEQVDQLLSQGNDMAALSLVRSTQGGDGGLRAFGAARQVPQRLYKLD 94
Query: 87 ELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLW 146
ELKLNGI+T + LSPVD TLGSIERNLQ+AA++GG S + QIL+L L LL LW
Sbjct: 95 ELKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGFSVSAALELSQLQILFLILGLLSLW 154
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
+VD V + GGV +LVLDTIGH SQKYHNRVIQHEAGHFLIAYL+G+LPKGYT+TS + L
Sbjct: 155 SVDFVYFGGGVRNLVLDTIGHYLSQKYHNRVIQHEAGHFLIAYLLGVLPKGYTITSLDTL 214
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
+GSLNVQAG AFVD+EFLEE+N+GK+SAT +N+FSCIALAGVA EYLLYG AEGGLAD
Sbjct: 215 MNQGSLNVQAGAAFVDYEFLEEINTGKLSATMVNKFSCIALAGVAAEYLLYGRAEGGLAD 274
Query: 267 INKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDII 326
INKLD LLKGLGFTQKKADSQVRW++LNTVL+LRRH+ AR++LA AM+ GKSVGSCI +I
Sbjct: 275 INKLDGLLKGLGFTQKKADSQVRWAVLNTVLILRRHEKARSQLAEAMSTGKSVGSCIQVI 334
Query: 327 EDNIDDTDI 335
E+ I DI
Sbjct: 335 EECISTDDI 343
>gi|226498788|ref|NP_001151260.1| stress regulated protein [Zea mays]
gi|195645378|gb|ACG42157.1| stress regulated protein [Zea mays]
Length = 343
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 214/309 (69%), Positives = 254/309 (82%)
Query: 27 VSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLD 86
SSS+ VL+QVD+ LS+G++ AAL+LV+ QG GGLR FGAARQ+PQRLY LD
Sbjct: 35 TSSSQSPARLTAVLEQVDQLLSQGNDMAALSLVRSTQGGDGGLRAFGAARQVPQRLYKLD 94
Query: 87 ELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLW 146
ELKLNGI+T + LSPVD TLGSIERNLQ+AA++GG+S + QIL+L L LL LW
Sbjct: 95 ELKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGLSVSAALELSQLQILFLILGLLSLW 154
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
+VD V + GGV +LVLDTIGH SQKYHNRVIQHEAGHFLIAYL+G+LPKGYT+TS + L
Sbjct: 155 SVDFVYFGGGVRNLVLDTIGHYLSQKYHNRVIQHEAGHFLIAYLLGVLPKGYTITSLDTL 214
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
+GSLNVQAG AFVD+EFLEE+N+GK+SAT +N+FSCIALAGVA EYLLYG A GGLAD
Sbjct: 215 MNQGSLNVQAGAAFVDYEFLEEINTGKLSATMVNKFSCIALAGVAAEYLLYGRAGGGLAD 274
Query: 267 INKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDII 326
INKLD LLKGLGFTQKKADSQVRW++LNTVL+LRRH+ AR++LA AM+ GKSVGSCI +I
Sbjct: 275 INKLDGLLKGLGFTQKKADSQVRWAVLNTVLILRRHEKARSQLAEAMSTGKSVGSCIQVI 334
Query: 327 EDNIDDTDI 335
E+ I DI
Sbjct: 335 EECISTDDI 343
>gi|218188255|gb|EEC70682.1| hypothetical protein OsI_02018 [Oryza sativa Indica Group]
Length = 332
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 220/330 (66%), Positives = 261/330 (79%), Gaps = 13/330 (3%)
Query: 10 CLLSSRP---RVKVLSLSRIV-SSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGK 65
C P R++VL +R +SSE R VL+QVDEEL KG++ AAL+LV+ QG
Sbjct: 12 CFTEPHPAAVRLRVLPPARAANTSSEPAARLRAVLEQVDEELRKGNDEAALSLVRGSQGA 71
Query: 66 PGGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAW 125
GGLR FGAARQ+PQRLYTLDELKLNGI+T + LSPVD TLGSIERNLQ+AA++GG+S
Sbjct: 72 DGGLRFFGAARQVPQRLYTLDELKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGLSVS 131
Query: 126 NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHF 185
F + Q+L+L L LL LW+VD V + GGV +L+LDTIGH SQKY NRVIQHEAGHF
Sbjct: 132 AAFELSQLQVLFLFLGLLSLWSVDLVYFGGGVRNLILDTIGHNLSQKYRNRVIQHEAGHF 191
Query: 186 LIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCI 245
LIAYL+G+LPKGY++TS + K+GSLNVQAGTAFVDFEFL+E+ LN+FSCI
Sbjct: 192 LIAYLLGVLPKGYSITSLDTFIKKGSLNVQAGTAFVDFEFLQEM---------LNKFSCI 242
Query: 246 ALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGA 305
ALAGVATEYLLYGYAEGGLADI +LD LLKGLGFTQKKADSQVRW++LNTVL LRRHK A
Sbjct: 243 ALAGVATEYLLYGYAEGGLADIGQLDGLLKGLGFTQKKADSQVRWAVLNTVLALRRHKKA 302
Query: 306 RAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
R++LA AM+ GKSVGSCI +IE+NI+ DI
Sbjct: 303 RSQLAEAMSSGKSVGSCIGVIEENINSEDI 332
>gi|414877463|tpg|DAA54594.1| TPA: hypothetical protein ZEAMMB73_027015 [Zea mays]
Length = 337
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 210/309 (67%), Positives = 249/309 (80%), Gaps = 6/309 (1%)
Query: 27 VSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLD 86
SSS+ VL+QVD+ LS+G++ AAL+LV+ QG GGLR FGAARQ+PQRLY LD
Sbjct: 35 TSSSQSPARLTAVLEQVDQLLSQGNDMAALSLVRSTQGGDGGLRAFGAARQVPQRLYKLD 94
Query: 87 ELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLW 146
ELKLNGI+T + LSPVD TLGSIERNLQ+AA++GG S + QIL+L L LL LW
Sbjct: 95 ELKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGFSVSAALELSQLQILFLILGLLSLW 154
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
+VD V + GGV +LVLDTIGH SQKYHNRVIQHEAGHFLIAYL+G+LPKGYT+TS + L
Sbjct: 155 SVDFVYFGGGVRNLVLDTIGHYLSQKYHNRVIQHEAGHFLIAYLLGVLPKGYTITSLDTL 214
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
+GSLNVQAG AFVD+EFLEE+N+GK+SAT +N+FSCIALAGVA EYLLYG AEGGLAD
Sbjct: 215 MNQGSLNVQAGAAFVDYEFLEEINTGKLSATMVNKFSCIALAGVAAEYLLYGRAEGGLAD 274
Query: 267 INKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDII 326
IN KGLGFTQKKADSQVRW++LNTVL+LRRH+ AR++LA AM+ GKSVGSCI +I
Sbjct: 275 IN------KGLGFTQKKADSQVRWAVLNTVLILRRHEKARSQLAEAMSTGKSVGSCIQVI 328
Query: 327 EDNIDDTDI 335
E+ I DI
Sbjct: 329 EECISTDDI 337
>gi|148908233|gb|ABR17231.1| unknown [Picea sitchensis]
Length = 360
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 200/326 (61%), Positives = 253/326 (77%), Gaps = 8/326 (2%)
Query: 10 CLLSSRPRVKVLSLSRI----VSSSEVEVSKRR-VLKQVDEELSKGDERAALALVKDLQG 64
CL + P + SRI V + V+ S R+ L+++D EL +G+ER A +++ LQG
Sbjct: 32 CLHACTPPHQC---SRIIVLCVLNGGVDTSGRKEALRKLDLELERGNEREAFSIITSLQG 88
Query: 65 KPGGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSA 124
KPGGLR FGAARQ+PQ++Y+L++L+LN I+T LLSPVD TLG + RNLQ+AA+VGGV+
Sbjct: 89 KPGGLRGFGAARQVPQKIYSLEDLRLNKIDTTCLLSPVDTTLGGVRRNLQIAAVVGGVAT 148
Query: 125 WNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGH 184
WN G N ++L + LA L L T+D V GGV +L++DTIG S+KY +RV QHEAGH
Sbjct: 149 WNTLGLNQSELLAILLAFLTLGTLDQVVNGGGVEALIVDTIGRFLSKKYRDRVAQHEAGH 208
Query: 185 FLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSC 244
FLIAYL+GILPKGYTL+S +ALKKE +LNVQAGTAFVDF+FLEEV SGK+S+ TL+ +SC
Sbjct: 209 FLIAYLLGILPKGYTLSSLDALKKERTLNVQAGTAFVDFQFLEEVKSGKLSSGTLSNYSC 268
Query: 245 IALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKG 304
IALAGVA EYLL+G AEGGLADI +LD LLK LGFTQKKADSQ+RW++LNT+ LLRRH+
Sbjct: 269 IALAGVAAEYLLFGLAEGGLADIQQLDNLLKSLGFTQKKADSQIRWAVLNTITLLRRHEQ 328
Query: 305 ARAKLAVAMTMGKSVGSCIDIIEDNI 330
AR+KLA AM KSVG CID IE+ +
Sbjct: 329 ARSKLAEAMDFSKSVGDCIDTIENEL 354
>gi|79328807|ref|NP_001031951.1| putative stress regulated protein [Arabidopsis thaliana]
gi|332006285|gb|AED93668.1| putative stress regulated protein [Arabidopsis thaliana]
Length = 262
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 177/232 (76%), Positives = 205/232 (88%), Gaps = 1/232 (0%)
Query: 26 IVSSSEVEVS-KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
IV SSE +S +R+ L+QVD +LS GDERAAL+LVKDLQGKP GLRCFGAARQ+PQRLYT
Sbjct: 31 IVCSSETGLSIRRQALEQVDSKLSSGDERAALSLVKDLQGKPDGLRCFGAARQVPQRLYT 90
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
L+ELKLNGI SLLSP D TLGSIERNLQ+AA+ GG+ AW F + QQ+ +L+L +F
Sbjct: 91 LEELKLNGINAASLLSPTDTTLGSIERNLQIAAVSGGIVAWKAFDLSSQQLFFLTLGFMF 150
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
LWT+D VS+NGG+GSLVLDT GHTFSQ+YHNRV+QHEAGHFL+AYLVGILP+GYTL+S E
Sbjct: 151 LWTLDLVSFNGGIGSLVLDTTGHTFSQRYHNRVVQHEAGHFLVAYLVGILPRGYTLSSLE 210
Query: 205 ALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
AL+KEGSLN+QAG+AFVD+EFLEEVNSGKVSAT LNRFSCIALAGVATEYLL
Sbjct: 211 ALQKEGSLNIQAGSAFVDYEFLEEVNSGKVSATMLNRFSCIALAGVATEYLL 262
>gi|168054583|ref|XP_001779710.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668908|gb|EDQ55506.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 287
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 172/294 (58%), Positives = 217/294 (73%), Gaps = 8/294 (2%)
Query: 43 VDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPV 102
+DE L +R A+AL +L FG A +P RLY+L+EL+LN I+ LLSP
Sbjct: 1 IDEALRDNRDRDAVALASNLA-------AFGKATSVPYRLYSLEELRLNKIDATQLLSPT 53
Query: 103 DETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVL 162
D+TLG+++RNLQ AAI G W G + Q+L ++ LFL TVD ++ GGV +L+L
Sbjct: 54 DQTLGNVKRNLQYAAIAGAAVTWWQLGLDQYQLLAAAVLFLFLGTVDQIANGGGVEALLL 113
Query: 163 DTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVD 222
D+IG S KY +RV QHE+GHFLI+YLVGILPK YTL+SF+A +K G+LNVQAGT FVD
Sbjct: 114 DSIGRILSPKYQDRVAQHESGHFLISYLVGILPKSYTLSSFDAFRKYGALNVQAGTTFVD 173
Query: 223 FEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQK 282
++F EEV +GK+S+TTLN+FSC+ALAGVATEYL Y AEGGLADI +LDA+LKGL FTQK
Sbjct: 174 YDFQEEVTTGKLSSTTLNKFSCVALAGVATEYLRYEIAEGGLADILQLDAVLKGLTFTQK 233
Query: 283 KADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNID-DTDI 335
KADSQVRW++LNTV +LRRH +AKLA AM GKSVG CI +IE+ + TDI
Sbjct: 234 KADSQVRWAVLNTVSILRRHFDLQAKLAAAMLAGKSVGECILLIENELSGSTDI 287
>gi|255630972|gb|ACU15850.1| unknown [Glycine max]
Length = 232
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 157/201 (78%), Positives = 185/201 (92%)
Query: 25 RIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYT 84
RI SS V VS+++VL+++D+EL+KGD+RAALALVKDLQGKP GLRCFGAARQ+PQRLYT
Sbjct: 31 RIRCSSNVGVSRQQVLEKLDKELAKGDDRAALALVKDLQGKPDGLRCFGAARQVPQRLYT 90
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
LDEL+LNGIET SLLSPVD TLGSIERNLQ+AAIVGG++AWN F +PQQI Y+SL LLF
Sbjct: 91 LDELRLNGIETLSLLSPVDTTLGSIERNLQIAAIVGGLAAWNAFAISPQQIFYISLGLLF 150
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
LWT+D+VS+ GG+G LV+DTIGH+FSQKYHNRVIQHEAGHFLIAYLVGILP+GYT++S +
Sbjct: 151 LWTLDAVSFGGGIGGLVVDTIGHSFSQKYHNRVIQHEAGHFLIAYLVGILPRGYTISSLD 210
Query: 205 ALKKEGSLNVQAGTAFVDFEF 225
AL+K GSLN+QAGTAFVDFEF
Sbjct: 211 ALQKVGSLNIQAGTAFVDFEF 231
>gi|414877464|tpg|DAA54595.1| TPA: hypothetical protein ZEAMMB73_027015 [Zea mays]
Length = 278
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 167/244 (68%), Positives = 198/244 (81%)
Query: 27 VSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLD 86
SSS+ VL+QVD+ LS+G++ AAL+LV+ QG GGLR FGAARQ+PQRLY LD
Sbjct: 35 TSSSQSPARLTAVLEQVDQLLSQGNDMAALSLVRSTQGGDGGLRAFGAARQVPQRLYKLD 94
Query: 87 ELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLW 146
ELKLNGI+T + LSPVD TLGSIERNLQ+AA++GG S + QIL+L L LL LW
Sbjct: 95 ELKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGFSVSAALELSQLQILFLILGLLSLW 154
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
+VD V + GGV +LVLDTIGH SQKYHNRVIQHEAGHFLIAYL+G+LPKGYT+TS + L
Sbjct: 155 SVDFVYFGGGVRNLVLDTIGHYLSQKYHNRVIQHEAGHFLIAYLLGVLPKGYTITSLDTL 214
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
+GSLNVQAG AFVD+EFLEE+N+GK+SAT +N+FSCIALAGVA EYLLYG AEGGLAD
Sbjct: 215 MNQGSLNVQAGAAFVDYEFLEEINTGKLSATMVNKFSCIALAGVAAEYLLYGRAEGGLAD 274
Query: 267 INKL 270
INK+
Sbjct: 275 INKV 278
>gi|302755402|ref|XP_002961125.1| hypothetical protein SELMODRAFT_75099 [Selaginella moellendorffii]
gi|300172064|gb|EFJ38664.1| hypothetical protein SELMODRAFT_75099 [Selaginella moellendorffii]
Length = 323
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 177/312 (56%), Positives = 225/312 (72%), Gaps = 8/312 (2%)
Query: 19 KVLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQI 78
+ + LS + +S EVS L V EL GD R K L LR FGAARQ+
Sbjct: 14 RCVVLSALGRASGSEVSDAAAL--VQRELRNGDHR------KALALAAARLRAFGAARQV 65
Query: 79 PQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYL 138
PQR+YTL+EL+LNGI+T LSPVD TLG++ RNLQLAA+ GG++AW +F + Q+L
Sbjct: 66 PQRIYTLEELRLNGIDTARFLSPVDFTLGTVRRNLQLAALAGGIAAWQLFHLDQFQLLIG 125
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ LF+ T+D + YNGGV +L+LDT+G S KY NRV QHEAGHFL+AYL+GILP Y
Sbjct: 126 VVVALFVGTLDQIVYNGGVEALLLDTLGRVVSGKYKNRVAQHEAGHFLVAYLMGILPADY 185
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+S +A +K GSLNVQAGT+FVDFEF EEV+ ++ TLN+++C+ALAGVATE+L +G
Sbjct: 186 TLSSLDAFRKNGSLNVQAGTSFVDFEFQEEVSFKFPASKTLNKYACVALAGVATEFLKFG 245
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGGL+DI +LD LLK L FTQ KADSQVRW++LNTV +LRRH +KLA AM GKS
Sbjct: 246 LAEGGLSDIQQLDELLKRLNFTQLKADSQVRWAVLNTVSILRRHLALHSKLAEAMDTGKS 305
Query: 319 VGSCIDIIEDNI 330
VG CI++IE +
Sbjct: 306 VGQCIELIETEL 317
>gi|302766930|ref|XP_002966885.1| hypothetical protein SELMODRAFT_168681 [Selaginella moellendorffii]
gi|300164876|gb|EFJ31484.1| hypothetical protein SELMODRAFT_168681 [Selaginella moellendorffii]
Length = 323
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 177/312 (56%), Positives = 225/312 (72%), Gaps = 8/312 (2%)
Query: 19 KVLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQI 78
+ + LS + +S EVS L V EL GD R K L LR FGAARQ+
Sbjct: 14 RCVVLSALGRASGSEVSDAAAL--VQRELRNGDHR------KALALAAARLRAFGAARQV 65
Query: 79 PQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYL 138
PQR+YTL+EL+LNGI+T LSPVD TLG++ RNLQLAA+ GG++AW +F + Q+L
Sbjct: 66 PQRIYTLEELRLNGIDTARFLSPVDFTLGTVRRNLQLAALAGGIAAWQLFHLDQFQLLIG 125
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ LF+ T+D + YNGGV +L+LDT+G S KY NRV QHEAGHFL+AYL+GILP Y
Sbjct: 126 VVVALFVGTLDQIVYNGGVEALLLDTLGRVVSGKYKNRVAQHEAGHFLVAYLMGILPADY 185
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+S +A +K GSLNVQAGT+FVDFEF EEV+ ++ TLN+++C+ALAGVATE+L +G
Sbjct: 186 TLSSLDAFRKNGSLNVQAGTSFVDFEFQEEVSFKFPASKTLNKYACVALAGVATEFLKFG 245
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGGL+DI +LD LLK L FTQ KADSQVRW++LNTV +LRRH +KLA AM GKS
Sbjct: 246 LAEGGLSDIQQLDELLKRLNFTQLKADSQVRWAVLNTVSILRRHLALHSKLAEAMDAGKS 305
Query: 319 VGSCIDIIEDNI 330
VG CI++IE +
Sbjct: 306 VGQCIELIETEL 317
>gi|388509674|gb|AFK42903.1| unknown [Medicago truncatula]
Length = 261
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 155/222 (69%), Positives = 182/222 (81%)
Query: 21 LSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQ 80
L+ RI S++V S++++L+ VD+EL+KGD+RAALALVKDLQGKP GL+CFGAARQ+PQ
Sbjct: 26 LARVRIQCSTKVSSSRQQLLESVDKELTKGDDRAALALVKDLQGKPNGLQCFGAARQVPQ 85
Query: 81 RLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSL 140
RLYTLDELKLNGIE LLSPVD TLGSIERNL +AAIVGG +AWNVFG + QQI Y+SL
Sbjct: 86 RLYTLDELKLNGIEAMPLLSPVDTTLGSIERNLLIAAIVGGFAAWNVFGISQQQIFYISL 145
Query: 141 ALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTL 200
LLFLWT+D VS+ GG+GSLV+DTIGH FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTL
Sbjct: 146 GLLFLWTLDLVSFGGGLGSLVVDTIGHKFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTL 205
Query: 201 TSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRF 242
+S + + K G + TAFVDFEFLEEV S ++ F
Sbjct: 206 SSLDGMMKGGFSQYSSSTAFVDFEFLEEVGCFICSVYLMHYF 247
>gi|223973715|gb|ACN31045.1| unknown [Zea mays]
gi|414877465|tpg|DAA54596.1| TPA: hypothetical protein ZEAMMB73_027015 [Zea mays]
Length = 270
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 141/185 (76%), Positives = 163/185 (88%)
Query: 151 VSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEG 210
V + GGV +LVLDTIGH SQKYHNRVIQHEAGHFLIAYL+G+LPKGYT+TS + L +G
Sbjct: 86 VYFGGGVRNLVLDTIGHYLSQKYHNRVIQHEAGHFLIAYLLGVLPKGYTITSLDTLMNQG 145
Query: 211 SLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKL 270
SLNVQAG AFVD+EFLEE+N+GK+SAT +N+FSCIALAGVA EYLLYG AEGGLADINKL
Sbjct: 146 SLNVQAGAAFVDYEFLEEINTGKLSATMVNKFSCIALAGVAAEYLLYGRAEGGLADINKL 205
Query: 271 DALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
D LLKGLGFTQKKADSQVRW++LNTVL+LRRH+ AR++LA AM+ GKSVGSCI +IE+ I
Sbjct: 206 DGLLKGLGFTQKKADSQVRWAVLNTVLILRRHEKARSQLAEAMSTGKSVGSCIQVIEECI 265
Query: 331 DDTDI 335
DI
Sbjct: 266 STDDI 270
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 37/52 (71%)
Query: 27 VSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQI 78
SSS+ VL+QVD+ LS+G++ AAL+LV+ QG GGLR FGAARQ+
Sbjct: 35 TSSSQSPARLTAVLEQVDQLLSQGNDMAALSLVRSTQGGDGGLRAFGAARQV 86
>gi|116792958|gb|ABK26568.1| unknown [Picea sitchensis]
Length = 311
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/300 (49%), Positives = 196/300 (65%), Gaps = 50/300 (16%)
Query: 32 VEVSKRR-VLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKL 90
V+ S R+ L+++D EL +G+ER A +++ LQGKPGGLR FGAARQ+PQ++Y+L++L+L
Sbjct: 55 VDTSGRKEALRKLDLELERGNEREAFSIITSLQGKPGGLRGFGAARQVPQKIYSLEDLRL 114
Query: 91 NGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDS 150
N I+T LLSPVD TLG + RNLQ+AA+VGGV+ WN G N ++L + LA L L T+D
Sbjct: 115 NKIDTTCLLSPVDTTLGGVRRNLQIAAVVGGVATWNTLGLNQSELLAILLAFLTLGTLDQ 174
Query: 151 VSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEG 210
V GGV +L++DTIG S+KY +RV Q ++G
Sbjct: 175 VVNGGGVEALIVDTIGRFLSKKYRDRVAQVKSG--------------------------- 207
Query: 211 SLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKL 270
K+S+ TL+ +SCIALAGVA EYLL+G AEGGLADI +L
Sbjct: 208 ----------------------KLSSGTLSNYSCIALAGVAAEYLLFGLAEGGLADIQQL 245
Query: 271 DALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
D LLK LGFTQKKADSQ+RW++LNT+ LLRRH+ AR+KLA AM KSVG CID IE+ +
Sbjct: 246 DNLLKSLGFTQKKADSQIRWAVLNTITLLRRHEQARSKLAEAMDFSKSVGDCIDTIENEL 305
>gi|384248177|gb|EIE21662.1| hypothetical protein COCSUDRAFT_33740 [Coccomyxa subellipsoidea
C-169]
Length = 279
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 135/261 (51%), Positives = 182/261 (69%)
Query: 67 GGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWN 126
G L +G RQIP+R+Y+L+EL+LN IE LSP D+TLG++ LQ ++G +A+
Sbjct: 5 GTLHAYGRGRQIPKRIYSLEELRLNKIEAEKFLSPKDDTLGTVRTILQGGGLLGIAAAYF 64
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ Q+L L FL TVD V+ GG+ +L++DT T + Y RV HE+GHFL
Sbjct: 65 GLHWELSQLLGTLAGLGFLLTVDQVANGGGLEALLIDTAARTINPSYGERVATHESGHFL 124
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
IAYLVG+LP+ YTL+S +A K+ G+LNVQAGT F D + EV SG +S+T+L+R+ C+A
Sbjct: 125 IAYLVGLLPRTYTLSSLDAYKRYGALNVQAGTLFCDSAYQREVASGTLSSTSLDRYCCVA 184
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
LAG+ATEY+ YG AEGGL D+ +LD LLK L FTQKKAD Q+RW++LN LLRR+ +
Sbjct: 185 LAGIATEYIKYGQAEGGLNDVQQLDGLLKALQFTQKKADGQIRWAVLNVTALLRRYSRVQ 244
Query: 307 AKLAVAMTMGKSVGSCIDIIE 327
++LA AM GKSVG CI +IE
Sbjct: 245 SQLAQAMAAGKSVGDCIALIE 265
>gi|303277981|ref|XP_003058284.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460941|gb|EEH58235.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 312
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 133/270 (49%), Positives = 183/270 (67%), Gaps = 8/270 (2%)
Query: 67 GGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQ--LAAIVGGVSA 124
G LR FGAAR +P+R Y L ELKLNGIE LLSP + T+ ++ + LA +VGG
Sbjct: 45 GALRGFGAARLVPRRDYALSELKLNGIEAERLLSPTESTISNLRDAVSKALALVVGG--- 101
Query: 125 WNVFGFNPQ--QILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEA 182
W +F +P + + VD + + GGV +LVLDT+ T S +Y R+ HEA
Sbjct: 102 W-IFAAHPSGSTLAAAASVAAGALVVDQIGFGGGVEALVLDTLAQTTSTEYKARLRIHEA 160
Query: 183 GHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRF 242
HFL+AYL+GILPKGYTL+S +A +K G+LNVQAG AF D F EV GK+ + +L RF
Sbjct: 161 AHFLVAYLMGILPKGYTLSSLDAYEKYGALNVQAGCAFCDGAFQREVARGKIGSGSLGRF 220
Query: 243 SCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRH 302
SC+ALAG++ EY+ +G++EGG++D+ +LD +L+ L FTQKK+DS+VRW++LNT+ LLRRH
Sbjct: 221 SCVALAGISMEYIAFGFSEGGVSDVRQLDGMLRALAFTQKKSDSEVRWAVLNTITLLRRH 280
Query: 303 KGARAKLAVAMTMGKSVGSCIDIIEDNIDD 332
+ KL+ M G SVG C+ +IE N+ D
Sbjct: 281 EACVRKLSEKMAAGASVGECVRLIEANLGD 310
>gi|218188256|gb|EEC70683.1| hypothetical protein OsI_02020 [Oryza sativa Indica Group]
Length = 243
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 140/214 (65%), Positives = 171/214 (79%), Gaps = 1/214 (0%)
Query: 17 RVKVLSLSRIV-SSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAA 75
R+++L +R +SSE R VL+QVDEEL KG++ AAL+LV+ QG GGLR FGAA
Sbjct: 27 RLRLLPPARAANTSSEPAARLRAVLEQVDEELRKGNDEAALSLVRGSQGADGGLRFFGAA 86
Query: 76 RQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQI 135
RQ+PQRLYTLDELKLNGI+T + LSPVD TLGSIERNLQ+AA++GG+S F + Q+
Sbjct: 87 RQVPQRLYTLDELKLNGIDTSAFLSPVDLTLGSIERNLQIAAVLGGLSVSAAFELSKLQV 146
Query: 136 LYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILP 195
L+L L LL LW+VD V + GGV +L+LDTIGH SQKY NRVIQHEAGHFLIAYL+G+LP
Sbjct: 147 LFLFLGLLSLWSVDLVYFGGGVRNLILDTIGHNLSQKYRNRVIQHEAGHFLIAYLLGVLP 206
Query: 196 KGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
KGYT+TS + K+GSLNVQAGTAFVDFEFL+E+
Sbjct: 207 KGYTITSLDTFIKKGSLNVQAGTAFVDFEFLQEL 240
>gi|255071723|ref|XP_002499536.1| predicted protein [Micromonas sp. RCC299]
gi|226514798|gb|ACO60794.1| predicted protein, partial [Micromonas sp. RCC299]
Length = 263
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 132/265 (49%), Positives = 181/265 (68%), Gaps = 4/265 (1%)
Query: 69 LRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVF 128
LR +GAAR +P+R Y L ELKLNGIE LLSP + T+ + L A+V V AW V
Sbjct: 1 LRGYGAARLVPRRDYALSELKLNGIEAEKLLSPTESTISGLRDVLGRVALVS-VGAW-VL 58
Query: 129 GFNPQ--QILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+P Q+ + +D V++ GG+ +L LDT+ S+ Y R+ HEA HFL
Sbjct: 59 SSHPSGAQVTGVLATAAAAIAIDQVTFAGGMEALALDTLAQATSKTYVARLRLHEAAHFL 118
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+AYL+GILPKGYTL+S +A K+ G+LNVQAG AF D F EV GK+++ +L RFSC+A
Sbjct: 119 VAYLMGILPKGYTLSSLDAYKEYGALNVQAGCAFCDGAFQAEVAKGKITSGSLGRFSCVA 178
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
LAG+ EY+ YG+AEGG+AD+ +LD +L+ L F+QKK+DS+VRW++LNT+ +LRRH+G
Sbjct: 179 LAGIGMEYVAYGFAEGGVADVRQLDGMLRALAFSQKKSDSEVRWAVLNTISILRRHEGTV 238
Query: 307 AKLAVAMTMGKSVGSCIDIIEDNID 331
KL+ M G SVG CI +IED++D
Sbjct: 239 RKLSERMAAGASVGECIRLIEDSLD 263
>gi|145348876|ref|XP_001418869.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579099|gb|ABO97162.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 292
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 188/286 (65%), Gaps = 5/286 (1%)
Query: 52 ERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIER 111
ER A+ LV L+ + G ++ FGAA +P+R Y L EL+LN IE LL+P + T+ I
Sbjct: 9 EREAVELVMCLRQR-GVVKAFGAAHNVPKRDYALAELRLNNIEAEKLLAPTESTIKGIRD 67
Query: 112 NLQLAAIVGGVSAWNVFGFNPQ--QILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTF 169
N ++G ++ +P Q ++ F T D +++ GGV +L LDT+ +
Sbjct: 68 NF--TRLLGVAYVAGLYFLHPTFAQGAGVAAFAAFCATYDQIAFGGGVSALALDTVAQST 125
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
S++Y R+ +HEA HFL AYL+GILPKGYTL+S +A K G+ N+QAG AF D EF EV
Sbjct: 126 SKEYVTRLRRHEAAHFLTAYLIGILPKGYTLSSMDAFKTYGAFNIQAGCAFCDGEFQREV 185
Query: 230 NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVR 289
GK+++T+L RF+C+A+AG+ EY+L+G+AEGGL+D+ +LD LL+ L FTQKK+DS+VR
Sbjct: 186 QKGKITSTSLGRFACVAMAGICMEYILFGFAEGGLSDVQQLDGLLRALAFTQKKSDSEVR 245
Query: 290 WSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
W++LNT LLRRH G KLA M G SVG C+ +IE ++ +
Sbjct: 246 WAVLNTTSLLRRHLGLTEKLADYMARGASVGECVALIEKEVETAEF 291
>gi|412986828|emb|CCO15254.1| predicted protein [Bathycoccus prasinos]
Length = 460
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 127/275 (46%), Positives = 178/275 (64%), Gaps = 8/275 (2%)
Query: 69 LRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAA----IVGG-VS 123
+R F +Q+P+R Y L ELKLN I+ LL+P + T+ I++ L +A IVG V+
Sbjct: 186 IRAFARGQQVPKRSYALAELKLNDIDPSKLLAPTENTITEIKKKLGVAVGALVIVGAIVT 245
Query: 124 AWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAG 183
+ G P Q + + L FL D+V+ N G +LVLD + + S++Y R+ +HEAG
Sbjct: 246 SGEDVGAAPFQAIKTFVFLSFLLAYDAVALNAGAQNLVLDFVANLTSEEYKTRLRRHEAG 305
Query: 184 HFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFS 243
HFL+AYL G+LPKGYTL+S +A K+ G LNVQAGT F D +F EV GK+++T++ RF+
Sbjct: 306 HFLVAYLTGVLPKGYTLSSLDAFKRFGRLNVQAGTLFCDGQFQNEVKRGKITSTSVGRFA 365
Query: 244 CIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHK 303
C+ALAGV EY YG +EGG ADI +LD L L F+QKK+D +VRW+ LNT+ ++RRH+
Sbjct: 366 CVALAGVCAEYAKYGNSEGGAADIQQLDQLFNALQFSQKKSDDEVRWATLNTMAIVRRHE 425
Query: 304 GARAKLAVAMTMGKSVGSCIDIIE---DNIDDTDI 335
G +LA M G+S I IIE N+ + DI
Sbjct: 426 GLVDELARMMGTGESTAKLISIIEARLANVAEKDI 460
>gi|159484370|ref|XP_001700231.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272547|gb|EDO98346.1| predicted protein [Chlamydomonas reinhardtii]
Length = 269
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 130/263 (49%), Positives = 176/263 (66%), Gaps = 1/263 (0%)
Query: 69 LRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVF 128
LR +G+AR +P+R YTL+EL+LN IE LLSP D L ++ +LAA G V+A
Sbjct: 1 LRGYGSARAVPKRNYTLEELRLNRIEPEKLLSPKDTQLNAVRDAARLAAGAGLVAAAVGL 60
Query: 129 GFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTF-SQKYHNRVIQHEAGHFLI 187
++ Q+L + + TVD V GG +L++DT+G Q Y RV HEAGH L+
Sbjct: 61 QWDAGQVLAAAFGGVAALTVDQVVNGGGGEALLVDTLGRLLRPQTYGARVAAHEAGHLLV 120
Query: 188 AYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIAL 247
AYLVG+LP+ YTL+S +A + +LN+QAGT F D F EV G++ A++L+R++C+AL
Sbjct: 121 AYLVGLLPRAYTLSSLDAFLRYRALNIQAGTRFCDSAFAAEVAGGRLKASSLDRYTCVAL 180
Query: 248 AGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARA 307
AGV TEYL YG AEGGL D+ +LD + + LGFTQKKAD++VRW++LNT LLRRH A
Sbjct: 181 AGVVTEYLQYGVAEGGLGDVRQLDDMFRALGFTQKKADAEVRWAVLNTAELLRRHSRLHA 240
Query: 308 KLAVAMTMGKSVGSCIDIIEDNI 330
+LA AM G SVG CI +IE +
Sbjct: 241 ELAAAMGRGASVGQCIALIEGRL 263
>gi|307105101|gb|EFN53352.1| hypothetical protein CHLNCDRAFT_25833, partial [Chlorella
variabilis]
Length = 260
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 122/254 (48%), Positives = 168/254 (66%)
Query: 77 QIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQIL 136
Q P+R YTL EL+LN I+ L+P D TL + +Q + G +A+ + QI+
Sbjct: 1 QTPKRTYTLAELRLNKIQPEEFLAPTDSTLSGVRNVVQGGFLAGLTAAYFTQLLDLTQIV 60
Query: 137 YLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPK 196
+ +A FL TVD V+ GG +L++D+ G + Y RV HEAGHFL+AYL+G+LP+
Sbjct: 61 QVVVATGFLLTVDQVANGGGFEALLVDSAGRVVNGTYGRRVALHEAGHFLVAYLLGLLPR 120
Query: 197 GYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
GYTL+S + K+ LNVQAG F D F EV +G++S+++L+ ++C+ALAGVATE+L
Sbjct: 121 GYTLSSLDLFLKKRQLNVQAGCQFCDSAFQAEVATGRLSSSSLDTYACVALAGVATEWLR 180
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
+G AEGGL D+ +LD LL+ L FTQ KADSQVRW++LN V LLRRH+ LA AM G
Sbjct: 181 FGRAEGGLEDVRQLDRLLQALRFTQAKADSQVRWAVLNVVTLLRRHERVHDALAAAMQRG 240
Query: 317 KSVGSCIDIIEDNI 330
SVG CI +IE +
Sbjct: 241 GSVGECIGVIEGEL 254
>gi|298205012|emb|CBI34319.3| unnamed protein product [Vitis vinifera]
Length = 340
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 181/313 (57%), Gaps = 57/313 (18%)
Query: 36 KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIET 95
+RR LK VD ELSKG+ + A++LVK L GKPGGLR FGAA+Q+ RL D+LKL+
Sbjct: 67 RRRSLKMVDRELSKGNYKTAVSLVKQLNGKPGGLRGFGAAKQVSHRL---DDLKLH---- 119
Query: 96 RSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNG 155
SL S VD L +E+ Q A + D VS
Sbjct: 120 ESLQSLVDSILDLMEKCRQFAML------------------------------DEVSVKA 149
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
S+ D + + H QHEAGHFL+ YL+G+LP+GY + S EAL+++
Sbjct: 150 LEKSMP-DEGNESHCDEDHFLYTQHEAGHFLVGYLLGVLPRGYEIPSKEALRQD---RFA 205
Query: 216 AG-TAFVDFEFLEEVNSGKV---------------SATTLNRFSCIALAGVATEYLLYGY 259
AG FV FEFL +V + ++ S+ TLNRFSC+ +AG+ EYL++G
Sbjct: 206 AGRVEFVGFEFLRQVRTTEIVEKKFSKGKSKKGKISSKTLNRFSCVIVAGLIAEYLVFGC 265
Query: 260 AEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSV 319
+EG +D+ +LD +LK LGF++ +A SQ++W++LNTVL+L RH AR +LA AM +GKSV
Sbjct: 266 SEGLHSDVEQLDEVLKWLGFSEGEAYSQMKWAVLNTVLILSRHHEARLRLAKAMALGKSV 325
Query: 320 GSCIDIIEDNIDD 332
G CID IE+ I++
Sbjct: 326 GYCIDTIENVINE 338
>gi|255542490|ref|XP_002512308.1| conserved hypothetical protein [Ricinus communis]
gi|223548269|gb|EEF49760.1| conserved hypothetical protein [Ricinus communis]
Length = 287
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 120/307 (39%), Positives = 170/307 (55%), Gaps = 47/307 (15%)
Query: 27 VSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLD 86
+ ++ + +R+ L++VD ELSKG+ + AL+LV LQGKP GLR FGAA+Q+P+ + LD
Sbjct: 22 LKEADYAIRRRKSLQRVDRELSKGNFKDALSLVNQLQGKPFGLRGFGAAKQVPKNRFALD 81
Query: 87 ELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLW 146
+ +G + D + SIE NP Q L
Sbjct: 82 RFEFDGTCLSPFRALFDSVMNSIE--------------------NPNQFPLL-------- 113
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
D S + + ++YHN +QHEAGHFL+ YL+G LPK Y S E L
Sbjct: 114 --DGESRS-------------STCEEYHNLCLQHEAGHFLVGYLLGSLPKRYRTPSIEEL 158
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
+ +G+ F+ FEFL E N GK+S+ TLN FSCI L G+ E+L +G++EG +D
Sbjct: 159 R-DGNF-AGGNVKFLGFEFLREGNRGKISSKTLNNFSCITLGGLVVEHLAFGHSEGHYSD 216
Query: 267 INK--LDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
+ K LD+ LK L ++ +A+ QVRW+ +NT+ +L RH AR KL AM G+SVG CID
Sbjct: 217 VVKEQLDSTLKWLELSEDEANFQVRWAAVNTIAILNRHYKARLKLVEAMARGQSVGCCID 276
Query: 325 IIEDNID 331
IE+ ID
Sbjct: 277 AIENGID 283
>gi|223972721|gb|ACN30548.1| unknown [Zea mays]
gi|414877466|tpg|DAA54597.1| TPA: hypothetical protein ZEAMMB73_027015 [Zea mays]
Length = 129
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 98/127 (77%), Positives = 114/127 (89%)
Query: 209 EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADIN 268
+GSLNVQAG AFVD+EFLEE+N+GK+SAT +N+FSCIALAGVA EYLLYG AEGGLADIN
Sbjct: 3 QGSLNVQAGAAFVDYEFLEEINTGKLSATMVNKFSCIALAGVAAEYLLYGRAEGGLADIN 62
Query: 269 KLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
KLD LLKGLGFTQKKADSQVRW++LNTVL+LRRH+ AR++LA AM+ GKSVGSCI +IE+
Sbjct: 63 KLDGLLKGLGFTQKKADSQVRWAVLNTVLILRRHEKARSQLAEAMSTGKSVGSCIQVIEE 122
Query: 329 NIDDTDI 335
I DI
Sbjct: 123 CISTDDI 129
>gi|449436030|ref|XP_004135797.1| PREDICTED: uncharacterized protein LOC101213254 [Cucumis sativus]
Length = 319
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 179/325 (55%), Gaps = 54/325 (16%)
Query: 26 IVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTL 85
+VSS++ +RR LK VD LSK ++A++LVK LQGKP GLR FGAA+QI ++ L
Sbjct: 26 VVSSAK----RRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKKRLEL 81
Query: 86 DELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFL 145
DE ++N ++ SL VD L S+++ LQ++ + ++IL +
Sbjct: 82 DESEVNRMDILSLQPLVDSILDSVQQCLQISLL--------------EEILSVE------ 121
Query: 146 WTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEA 205
++S G S ++ H QHEAGHFL+ YL+G+LPK Y + S +A
Sbjct: 122 -KLESSMAEGRHSSRC--------EEQEHFICAQHEAGHFLVGYLMGVLPKAYQVPSIQA 172
Query: 206 LKKEGSLNVQAGTAFVDFEFLEEV-------------------NSGKVSATTLNRFSCIA 246
L++ + + +FV FEFL E+ N G +S+ TLN+FSC+
Sbjct: 173 LRQ--NRFAEGKVSFVGFEFLGEIDSAKILGENADIRSFNNRANKGTISSKTLNQFSCVT 230
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
L G+ E L+ G ++G LADI KL ++L LG + +AD +RW+ NT ++ RH R
Sbjct: 231 LGGLVAELLVAGNSDGHLADILKLWSVLTWLGLPKSEADLHLRWAATNTTFIMSRHCETR 290
Query: 307 AKLAVAMTMGKSVGSCIDIIEDNID 331
++LA AM + K +G CID IE+ ++
Sbjct: 291 SRLAEAMALAKPIGLCIDAIENCLE 315
>gi|302833798|ref|XP_002948462.1| hypothetical protein VOLCADRAFT_103923 [Volvox carteri f.
nagariensis]
gi|300266149|gb|EFJ50337.1| hypothetical protein VOLCADRAFT_103923 [Volvox carteri f.
nagariensis]
Length = 346
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 143/211 (67%)
Query: 67 GGLRCFGAARQIPQRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWN 126
G LR FGA RQ+P+RLY+++EL+LN +ET LLSP D +L ++ Q+AA +G +A
Sbjct: 135 GLLRGFGAVRQVPKRLYSIEELRLNRVETEKLLSPKDTSLNAVRTQAQVAAALGLAAAAW 194
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
++ +L ++F+ D V+ GG +L++D++G Y RV HEAGH L
Sbjct: 195 ANHWDGGGVLAALFGVVFVLMADQVANGGGGEALLVDSLGRLLRPSYAARVAYHEAGHLL 254
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+AYLVG++P+ YTL+S +A + +LN+QAGT F D EF EV +G++ +++L+R++C+A
Sbjct: 255 VAYLVGLMPRAYTLSSMDAFLRYRALNIQAGTRFCDGEFAAEVATGRLKSSSLDRYTCVA 314
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGL 277
LAGV TEYL +G AEGGL D+ +LDA+ + L
Sbjct: 315 LAGVVTEYLRFGVAEGGLGDVQQLDAMFRAL 345
>gi|449485763|ref|XP_004157268.1| PREDICTED: uncharacterized LOC101213254 [Cucumis sativus]
Length = 319
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 115/325 (35%), Positives = 174/325 (53%), Gaps = 54/325 (16%)
Query: 26 IVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTL 85
+VSS++ +RR LK VD LSK ++A++LVK LQGKP GLR FGAA+QI ++ L
Sbjct: 26 VVSSAK----RRRALKLVDRALSKRQYKSAVSLVKQLQGKPYGLRGFGAAKQIIKKRLEL 81
Query: 86 DELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFL 145
DE ++N ++ SL VD L S+++ LQ++ +
Sbjct: 82 DESEVNRMDILSLQPLVDSILDSVQQCLQISLLE-------------------------- 115
Query: 146 WTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEA 205
D +S S+ ++ H QHEAGHFL+ YL+G+LPK Y + S +A
Sbjct: 116 ---DILSVEKLESSMAEGRHSSRCEEQEHFICAQHEAGHFLVGYLMGVLPKAYQVPSIQA 172
Query: 206 LKKEGSLNVQAGTAFVDFEFLEEV-------------------NSGKVSATTLNRFSCIA 246
L++ + + +FV FEFL E+ N G +S+ TLN+FSC+
Sbjct: 173 LRQ--NRFAEGKVSFVGFEFLGEIDSAKILGENADIRSFNNRANKGTISSKTLNQFSCVT 230
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
L G+ E L+ G ++G LADI KL ++L LG + +AD +RW+ NT ++ RH R
Sbjct: 231 LGGLVAELLVAGNSDGHLADILKLWSVLTWLGLPKSEADLHLRWAATNTAFIMSRHCETR 290
Query: 307 AKLAVAMTMGKSVGSCIDIIEDNID 331
++LA AM + K +G CID IE+ ++
Sbjct: 291 SRLAEAMALAKPIGLCIDAIENCLE 315
>gi|58198154|gb|AAW65808.1| stress regulated protein isoform 2 [Solanum virginianum]
gi|58198159|gb|AAW65811.1| stress regulated protein isoform 2 [Solanum virginianum]
Length = 248
Score = 184 bits (466), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 92/144 (63%), Positives = 118/144 (81%), Gaps = 1/144 (0%)
Query: 20 VLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIP 79
+L S++ SS+ +S VL+QVD+EL KGDERAAL+LVKDL+GKPGGLRCFGAARQ+P
Sbjct: 11 LLCCSQLRSSNSRILSGLSVLEQVDKELMKGDERAALSLVKDLRGKPGGLRCFGAARQVP 70
Query: 80 QRLYTLDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLS 139
QRLY+LDEL+LNGIET SLLSPVD TLG+IERNLQ+ AI+ G +AW +PQQIL++S
Sbjct: 71 QRLYSLDELRLNGIETVSLLSPVDATLGAIERNLQIVAILSGSAAWYALDLSPQQILFVS 130
Query: 140 LALLFLWTVDSVSYNGGVGSLVLD 163
L +LFLWT+D V+ +G V + +L+
Sbjct: 131 LGVLFLWTLDLVN-SGKVTATMLN 153
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 86/110 (78%), Positives = 99/110 (90%)
Query: 226 LEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKAD 285
L+ VNSGKV+AT LNRFSCIALAGVATEYLL+GYAEGGL+DIN+LDALLK LGFTQKKAD
Sbjct: 139 LDLVNSGKVTATMLNRFSCIALAGVATEYLLFGYAEGGLSDINQLDALLKSLGFTQKKAD 198
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
SQVRW++LNT+L+LRRH+ AR KLA AMT GKSVG CIDIIE +I + D+
Sbjct: 199 SQVRWAVLNTILILRRHEKARVKLAEAMTRGKSVGVCIDIIEKSISEDDL 248
>gi|58198155|gb|AAW65809.1| stress regulated protein isoform 1 [Solanum virginianum]
gi|58198157|gb|AAW65810.1| stress regulated protein isoform 1 [Solanum virginianum]
Length = 175
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/113 (76%), Positives = 99/113 (87%)
Query: 223 FEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQK 282
F +VNSGKV+AT LNRFSCIALAGVATEYLL+GYAEGGL+DIN+LDALLK LGFTQK
Sbjct: 63 FGAARQVNSGKVTATMLNRFSCIALAGVATEYLLFGYAEGGLSDINQLDALLKSLGFTQK 122
Query: 283 KADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
KADSQVRW++LNT+L+LRRH+ AR KLA AMT GKSVG CIDIIE +I + D+
Sbjct: 123 KADSQVRWAVLNTILILRRHEKARVKLAEAMTRGKSVGVCIDIIEKSISEDDL 175
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 39/59 (66%), Positives = 49/59 (83%)
Query: 20 VLSLSRIVSSSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQI 78
+L S++ SS+ +S VL+QVD+EL KGDERAAL+LVKDL+GKPGGLRCFGAARQ+
Sbjct: 11 LLCCSQLRSSNSRILSGLSVLEQVDKELMKGDERAALSLVKDLRGKPGGLRCFGAARQV 69
>gi|297848052|ref|XP_002891907.1| hypothetical protein ARALYDRAFT_314862 [Arabidopsis lyrata subsp.
lyrata]
gi|297337749|gb|EFH68166.1| hypothetical protein ARALYDRAFT_314862 [Arabidopsis lyrata subsp.
lyrata]
Length = 294
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/319 (32%), Positives = 163/319 (51%), Gaps = 67/319 (21%)
Query: 36 KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIET 95
+RR L++V +ELS+G+ AL+LVK L+ K G L FG+A+ +P+ KL+
Sbjct: 24 RRRSLERVSKELSRGNYETALSLVKQLKSKHGCLSAFGSAKLLPK--------KLDMSSK 75
Query: 96 RSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNG 155
L S +D S+ R+++ + DSV +
Sbjct: 76 SDLWSLID----SVSRSIESVYV----------------------------EEDSVRISK 103
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
+ T ++ V+QHE+GHFL+ YL+G+LP+ Y + + EA+++ S +V
Sbjct: 104 -------EEETKTSPEEDWFAVVQHESGHFLVGYLLGVLPRYYEIPTLEAVRQNVS-SVT 155
Query: 216 AGTAFVDFEFLEEVNS-------------------GKVSATTLNRFSCIALAGVATEYLL 256
FV FEFL++V + G +S+ TLN FSC+ L G+ E+LL
Sbjct: 156 GRVEFVGFEFLKQVGAANQLMKDDRDSRMNLSDTQGNISSKTLNNFSCVILGGMVAEHLL 215
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
+GY+EG +D+ KL +L+ LGFT+ + ++ +RW++ NTV LL H AR LA A+
Sbjct: 216 FGYSEGFYSDVVKLIDVLRWLGFTETEKEAHIRWAVSNTVSLLHSHSEARVSLAEAIAKA 275
Query: 317 KSVGSCIDIIEDNIDDTDI 335
K +G+CI+ IE I I
Sbjct: 276 KPIGACIEAIESAISRHQI 294
>gi|359487956|ref|XP_002264252.2| PREDICTED: uncharacterized protein LOC100257204 [Vitis vinifera]
Length = 235
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 81/163 (49%), Positives = 114/163 (69%), Gaps = 13/163 (7%)
Query: 174 HNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAG-TAFVDFEFLEEVNSG 232
H QHEAGHFL+ YL+G+LP+GY + S EAL+++ AG FV FEFL +
Sbjct: 70 HFLYTQHEAGHFLVGYLLGVLPRGYEIPSKEALRQD---RFAAGRVEFVGFEFLRQ---- 122
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSL 292
TLNRFSC+ +AG+ EYL++G +EG +D+ +LD +LK LGF++ +A SQ++W++
Sbjct: 123 -----TLNRFSCVIVAGLIAEYLVFGCSEGLHSDVEQLDEVLKWLGFSEGEAYSQMKWAV 177
Query: 293 LNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
LNTVL+L RH AR +LA AM +GKSVG CID IE+ I+++ I
Sbjct: 178 LNTVLILSRHHEARLRLAKAMALGKSVGYCIDTIENVINESQI 220
>gi|3776580|gb|AAC64897.1| Contains similarity to TM021B04.11 gi|2191197 from A. thaliana BAC
gb|AF007271 [Arabidopsis thaliana]
Length = 289
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 162/315 (51%), Gaps = 63/315 (20%)
Query: 36 KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIET 95
+R+ L++VD+EL +G+ AL+LVK L+ K G L FG+A+ +P++L + L
Sbjct: 23 RRKSLERVDKELLRGNYETALSLVKQLKSKHGCLSAFGSAKLLPKKLDMSSKTDL----- 77
Query: 96 RSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNG 155
RSL+ V ++ S+ +Q ++ S +P++ +
Sbjct: 78 RSLIDSVSRSIESV--YVQEDSV--RTSKEMEIKTSPEEDWF------------------ 115
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
S+V GH FL+ YL+G+LP+ Y + + EA+++ S NV
Sbjct: 116 ---SVVQHESGH-----------------FLVGYLLGVLPRHYEIPTLEAVRQNVS-NVT 154
Query: 216 AGTAFVDFEFLEEV---------------NSGKVSATTLNRFSCIALAGVATEYLLYGYA 260
FV FEFL++V N G +S+ TLN FSC+ L G+ TE++L+GY+
Sbjct: 155 GRVEFVGFEFLKQVGAANQLMKDDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYS 214
Query: 261 EGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVG 320
EG +DI KL+ +L+ LGFT+ + ++ ++W++ NTV LL HK AR LA M K +
Sbjct: 215 EGLYSDIVKLNDVLRWLGFTESEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPIS 274
Query: 321 SCIDIIEDNIDDTDI 335
+CI+ IE I I
Sbjct: 275 TCIEAIESAISTHQI 289
>gi|334183317|ref|NP_001185227.1| uncharacterized protein [Arabidopsis thaliana]
gi|332195013|gb|AEE33134.1| uncharacterized protein [Arabidopsis thaliana]
Length = 217
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 72/168 (42%), Positives = 109/168 (64%), Gaps = 10/168 (5%)
Query: 177 VIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE-------- 228
V+QHE+GHFL+ YL+G+LP+ Y + + EA+++ S NV FV FEFL++
Sbjct: 51 VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVS-NVTGRVEFVGFEFLKQLMKDDVDG 109
Query: 229 -VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ 287
+N G +S+ TLN FSC+ L G+ TE++L+GY+EG +DI KL+ +L+ LGFT+ + ++
Sbjct: 110 QMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLGFTESEKEAH 169
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
++W++ NTV LL HK AR LA M K + +CI+ IE I I
Sbjct: 170 IKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 217
>gi|186490962|ref|NP_001117491.1| uncharacterized protein [Arabidopsis thaliana]
gi|332195012|gb|AEE33133.1| uncharacterized protein [Arabidopsis thaliana]
Length = 219
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 73/174 (41%), Positives = 109/174 (62%), Gaps = 16/174 (9%)
Query: 177 VIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV------- 229
V+QHE+GHFL+ YL+G+LP+ Y + + EA+++ S NV FV FEFL++V
Sbjct: 47 VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVS-NVTGRVEFVGFEFLKQVGAANQLM 105
Query: 230 --------NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQ 281
N G +S+ TLN FSC+ L G+ TE++L+GY+EG +DI KL+ +L+ LGFT+
Sbjct: 106 KDDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLGFTE 165
Query: 282 KKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
+ ++ ++W++ NTV LL HK AR LA M K + +CI+ IE I I
Sbjct: 166 SEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 219
>gi|145336759|ref|NP_175867.2| uncharacterized protein [Arabidopsis thaliana]
gi|62320174|dbj|BAD94389.1| hypothetical protein [Arabidopsis thaliana]
gi|332195011|gb|AEE33132.1| uncharacterized protein [Arabidopsis thaliana]
Length = 223
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 73/174 (41%), Positives = 109/174 (62%), Gaps = 16/174 (9%)
Query: 177 VIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV------- 229
V+QHE+GHFL+ YL+G+LP+ Y + + EA+++ S NV FV FEFL++V
Sbjct: 51 VVQHESGHFLVGYLLGVLPRHYEIPTLEAVRQNVS-NVTGRVEFVGFEFLKQVGAANQLM 109
Query: 230 --------NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQ 281
N G +S+ TLN FSC+ L G+ TE++L+GY+EG +DI KL+ +L+ LGFT+
Sbjct: 110 KDDVDGQMNQGNISSKTLNNFSCVILGGMVTEHILFGYSEGLYSDIVKLNDVLRWLGFTE 169
Query: 282 KKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
+ ++ ++W++ NTV LL HK AR LA M K + +CI+ IE I I
Sbjct: 170 SEKEAHIKWAVSNTVSLLHSHKEARVSLAETMAKAKPISTCIEAIESAISTHQI 223
>gi|147815276|emb|CAN70025.1| hypothetical protein VITISV_030173 [Vitis vinifera]
Length = 209
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 76/167 (45%), Positives = 110/167 (65%), Gaps = 13/167 (7%)
Query: 174 HNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAG-TAFVDFEFLEEVNSG 232
H QHEAGHFL+ YL+G+LP+GY + S EAL+++ AG FV FEFL +V +
Sbjct: 48 HFLYTQHEAGHFLVGYLLGVLPRGYEIPSKEALRQD---RFAAGRVEFVGFEFLRQVRTT 104
Query: 233 KV------SATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADS 286
++ TLNRFSC+ +AG+ EYL++G +EG +D+ + L G ++ +A S
Sbjct: 105 EIVEKKFSKGKTLNRFSCVIVAGLIAEYLVFGCSEGLHSDVEQC---LSGWASSEGEAYS 161
Query: 287 QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDT 333
Q++W++LNTVL+L RH AR +LA AM +GKSVG CID IE+ I++
Sbjct: 162 QMKWAVLNTVLILSRHHEARLRLAKAMALGKSVGYCIDTIENVINEV 208
>gi|308805821|ref|XP_003080222.1| ATP-dependent Zn proteases-like protein (ISS) [Ostreococcus tauri]
gi|116058682|emb|CAL54389.1| ATP-dependent Zn proteases-like protein (ISS) [Ostreococcus tauri]
Length = 137
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 57/105 (54%), Positives = 80/105 (76%)
Query: 227 EEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADS 286
EEV GK+S+T+L +F+C+A+AG+ EY+L+G+AEGGL+D+ +LD LL+ L F+QKK+DS
Sbjct: 28 EEVRKGKISSTSLGKFACVAMAGICMEYILFGFAEGGLSDVQQLDGLLRALAFSQKKSDS 87
Query: 287 QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNID 331
+VRW++LNT LLRRH G KLA M G SVG C+ +IE +
Sbjct: 88 EVRWAVLNTTALLRRHVGLTEKLAEIMARGASVGECVALIETEMQ 132
>gi|354565740|ref|ZP_08984914.1| hypothetical protein FJSC11DRAFT_1120 [Fischerella sp. JSC-11]
gi|353548613|gb|EHC18058.1| hypothetical protein FJSC11DRAFT_1120 [Fischerella sp. JSC-11]
Length = 225
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/205 (35%), Positives = 115/205 (56%), Gaps = 3/205 (1%)
Query: 126 NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHF 185
++F +P + +L + T DS G G++ LD + FS K+ R+I HEAGHF
Sbjct: 24 SMFNLSPTVPALATFTILGIATFDSFGLQGKGGTIFLDWVA-GFSPKHRERIIHHEAGHF 82
Query: 186 LIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCI 245
L+AYL+GI GYTL+++EA K++ Q G +F D E ++ GK++A L+R+ +
Sbjct: 83 LVAYLLGIPVIGYTLSAWEAWKQKQP--GQGGVSFDDAELASQLEWGKLTAQMLDRYCTV 140
Query: 246 ALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGA 305
+AG+ E L+Y AEGG D +KL +LK LGF+ + + R+ L LL+ + A
Sbjct: 141 WMAGITAETLVYENAEGGADDKSKLAGVLKSLGFSASACEQKQRFCALQAKTLLQENWSA 200
Query: 306 RAKLAVAMTMGKSVGSCIDIIEDNI 330
L AM +V C+ II++ +
Sbjct: 201 YQALVDAMRQKATVAECVKIIDEKL 225
>gi|428316275|ref|YP_007114157.1| hypothetical protein Osc7112_1194 [Oscillatoria nigro-viridis PCC
7112]
gi|428239955|gb|AFZ05741.1| hypothetical protein Osc7112_1194 [Oscillatoria nigro-viridis PCC
7112]
Length = 235
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/196 (36%), Positives = 108/196 (55%), Gaps = 3/196 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ LL L T+DS + G G+++LD +G T S++ +R++ HEAGHFL+AYL+GI Y
Sbjct: 37 TFCLLGLATLDSFQWQGQGGTILLDWLGGT-SKEKRDRILHHEAGHFLVAYLLGIPVTSY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
L ++EA K+ Q G F D + ++ +G +SA L+++ + +AG+ E +YG
Sbjct: 96 ALNAWEAFKQ--GQTAQGGVRFEDEQLASQLQNGTLSAQLLDQYCTVWMAGIVAEKFVYG 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D K+ A+L L + + W LL LL HK A L AM +
Sbjct: 154 SAEGGAEDRTKISAILTQLRRPSSEIQLKQNWGLLRAKNLLESHKSAYEALVAAMENRAT 213
Query: 319 VGSCIDIIEDNIDDTD 334
V C DII+ N+ + D
Sbjct: 214 VAECCDIIQQNLQEKD 229
>gi|428297617|ref|YP_007135923.1| hypothetical protein Cal6303_0886 [Calothrix sp. PCC 6303]
gi|428234161|gb|AFY99950.1| hypothetical protein Cal6303_0886 [Calothrix sp. PCC 6303]
Length = 230
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 72/189 (38%), Positives = 111/189 (58%), Gaps = 3/189 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ A+L L T+DS ++ G ++ LD I FS + R+I HEAGHFL+A+ + I GY
Sbjct: 37 TFAILGLATLDSFTFQGQGSNVFLDWIA-GFSPAHRERIIHHEAGHFLVAHQLEIPISGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TLT++EA+K++ Q G +F D E ++ G+++A L+R+ + +AG+A E ++Y
Sbjct: 96 TLTAWEAIKQKQP--GQGGVSFDDAELTSQLQKGQITAQMLDRYCTVWMAGIAAETIVYN 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
+EGG D NKL +LKGLGF++ D + R+ L LL + A L AM
Sbjct: 154 NSEGGADDRNKLAGVLKGLGFSEATFDHKQRFCALQAKTLLEENWQAYQALVEAMRQRVP 213
Query: 319 VGSCIDIIE 327
V CI+II+
Sbjct: 214 VQECIEIID 222
>gi|427729611|ref|YP_007075848.1| hypothetical protein Nos7524_2409 [Nostoc sp. PCC 7524]
gi|427365530|gb|AFY48251.1| hypothetical protein Nos7524_2409 [Nostoc sp. PCC 7524]
Length = 230
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 112/190 (58%), Gaps = 3/190 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +L + T DS S G G++VLD I FS ++ +R+I HEAGHFL+A+L+GI GY
Sbjct: 37 TFTILGIATFDSFSLQGKGGTIVLDWIA-GFSPQHRDRIIHHEAGHFLVAHLLGIPVTGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA K+ Q G +F D E ++ GK+S L R+ + +AG+A E L++
Sbjct: 96 TLSAWEAWKQ--GQPGQGGVSFDDEELASQLQQGKLSTQMLERYCTVWMAGIAAENLVFT 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D +KL A+LK +GF++ + R+ +L LL+ + + L VAM +
Sbjct: 154 QAEGGFDDQSKLAAVLKTVGFSESAYQQKQRFHVLQAKTLLQENWSSYQALVVAMRQRAT 213
Query: 319 VGSCIDIIED 328
+G+C I D
Sbjct: 214 IGNCQQAIVD 223
>gi|428205814|ref|YP_007090167.1| hypothetical protein Chro_0752 [Chroococcidiopsis thermalis PCC
7203]
gi|428007735|gb|AFY86298.1| hypothetical protein Chro_0752 [Chroococcidiopsis thermalis PCC
7203]
Length = 236
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 117/201 (58%), Gaps = 5/201 (2%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F+P + ++L + T+DS S+ G G+LVLD + +FS ++ +R+++HEAGHFL+A+
Sbjct: 28 FSPTIPALATFSILAIATLDSFSFQGKGGNLVLDWLA-SFSPQHRDRIVRHEAGHFLVAH 86
Query: 190 LVGILPKGYTLTSFEALK-KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALA 248
L+ I GYTL+++EALK K+ L G +F D E +++ G ++A L+R+ + +A
Sbjct: 87 LLNIPITGYTLSAWEALKQKQPGL---GGVSFEDRELAAQLDRGMLTAQMLDRYCTVWMA 143
Query: 249 GVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAK 308
G+A E L+Y EGG D KL +L LG+T + + RW++L LL+ + A
Sbjct: 144 GLAAEDLVYSNTEGGADDRQKLGMVLTPLGYTANAVEQKQRWAVLQARTLLQTNWSAYEA 203
Query: 309 LAVAMTMGKSVGSCIDIIEDN 329
L M V C IE+N
Sbjct: 204 LVGMMQQRAGVDECCRAIEEN 224
>gi|17229906|ref|NP_486454.1| hypothetical protein alr2414 [Nostoc sp. PCC 7120]
gi|17131506|dbj|BAB74113.1| alr2414 [Nostoc sp. PCC 7120]
Length = 228
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 107/184 (58%), Gaps = 3/184 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +L + T DS S G G+++LD I FS ++ +R+I HEAGHFL+AYL+G+ GY
Sbjct: 37 TFTILGIATFDSFSLQGKGGTILLDWIA-GFSPQHRDRIIHHEAGHFLVAYLLGVPVTGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA ++ L Q G F D E + +V GK+S L R+ I +AG+A E L++
Sbjct: 96 TLSAWEAWRQ--GLPGQGGVTFDDVELMSQVQQGKISNQVLERYCTICMAGIAAETLVFE 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG+ D +KL + K LGF++ + R+ +L LL+ + + L A+ +
Sbjct: 154 RAEGGIDDKSKLATIFKVLGFSESVCQQKQRFHVLQAKTLLQNNWASYEALVQAIRQKSA 213
Query: 319 VGSC 322
+ C
Sbjct: 214 IADC 217
>gi|434395326|ref|YP_007130273.1| hypothetical protein Glo7428_4680 [Gloeocapsa sp. PCC 7428]
gi|428267167|gb|AFZ33113.1| hypothetical protein Glo7428_4680 [Gloeocapsa sp. PCC 7428]
Length = 232
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 74/201 (36%), Positives = 114/201 (56%), Gaps = 3/201 (1%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P + +LL L T+DS S G G+LVLD +FS ++ +R+++HEAGHFL
Sbjct: 25 MFNLSPTLPAIATFSLLGLATLDSFSLQGKGGTLVLDWFA-SFSPQHRDRIVRHEAGHFL 83
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+A+L+GI GY L+++EALK++ Q G +F D E ++ G +S L+R+ I
Sbjct: 84 VAHLLGIPVTGYALSAWEALKQKQP--GQGGVSFDDTEVASQLAQGTISTQLLDRYCTIW 141
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
+AGVA E L+Y AEGG D L +L LGF+ + + R+ L LL+++ A
Sbjct: 142 MAGVAAETLVYDRAEGGADDRQHLHTVLSSLGFSAASVELKQRFCSLQARNLLQQNWAAY 201
Query: 307 AKLAVAMTMGKSVGSCIDIIE 327
L AM V C ++I+
Sbjct: 202 EALINAMRQRADVAECRNLID 222
>gi|334119440|ref|ZP_08493526.1| hypothetical protein MicvaDRAFT_3632 [Microcoleus vaginatus FGP-2]
gi|333458228|gb|EGK86847.1| hypothetical protein MicvaDRAFT_3632 [Microcoleus vaginatus FGP-2]
Length = 225
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 105/192 (54%), Gaps = 3/192 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ LL L T+DS + G G+++LD +G + QK +R++ HEAGHFL+A+L+GI Y
Sbjct: 37 TFCLLGLATLDSFQWQGQGGTILLDWLGGSSKQK-RDRILHHEAGHFLVAHLLGIPVTSY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
L ++EA K+ Q G F D + ++ SG +SA L+++ + +AG+ E +YG
Sbjct: 96 ALNAWEAFKQ--GQTAQGGVRFEDEQLASQLQSGTLSAQLLDQYCTVWMAGIVAEKFVYG 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
A+GG D K+ A+L L + + W LL L+ HK A L AM +
Sbjct: 154 SAQGGAEDRTKISAILTQLRRPSSEIQLKQNWGLLRAKNLIENHKSAYEALVAAMEERAT 213
Query: 319 VGSCIDIIEDNI 330
V C DII+ N+
Sbjct: 214 VAECCDIIQQNL 225
>gi|428175313|gb|EKX44204.1| hypothetical protein GUITHDRAFT_87555, partial [Guillardia theta
CCMP2712]
Length = 361
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 113/189 (59%), Gaps = 8/189 (4%)
Query: 144 FLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSF 203
FL+ VD + +G ++ KY +VI+HEAGHFL+AYL+G +G+ L+++
Sbjct: 45 FLFVVDRLLTSGAA----FESAYRFLFPKYKEKVIKHEAGHFLLAYLLGCPIQGFFLSAW 100
Query: 204 EALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGG 263
+A + + QAGT F D + ++N+ +V+ T ++R++ + + G+A E + Y AEGG
Sbjct: 101 DATR--AGIRGQAGTVFFDNDLSTQLNANRVTRTAIDRYTIVLMGGIAAEAMNYEQAEGG 158
Query: 264 LADINKLDALLKGL--GFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGS 321
+D + L + L GL + ++ +Q RW++ +LLLR H+ A L AM G+S+G+
Sbjct: 159 ASDESALVSFLVGLLPPWQPQQVLNQARWAVTEAILLLREHRAAYDSLCDAMARGESLGT 218
Query: 322 CIDIIEDNI 330
CI ++E+ +
Sbjct: 219 CISVLENAL 227
>gi|75906447|ref|YP_320743.1| hypothetical protein Ava_0222 [Anabaena variabilis ATCC 29413]
gi|75700172|gb|ABA19848.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 231
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/196 (35%), Positives = 110/196 (56%), Gaps = 3/196 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +L + T DS S G G+++LD I FS ++ +R+I HEAGHFL+AYL+G+ GY
Sbjct: 37 TFTILGIATFDSFSLQGKGGTILLDWIA-GFSPQHRDRIIHHEAGHFLVAYLLGVPVTGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA + +G Q G F D E + +V GK+S L R+ I +AG+A E L++
Sbjct: 96 TLSAWEAWR-QGQPG-QGGVTFDDVELVSQVEQGKISNQALERYCTICMAGIAAETLVFE 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D +KL + K LGF++ + R+ +L LL+ + + L A+ +
Sbjct: 154 RAEGGTDDKSKLATIFKVLGFSESVCQQKQRFHVLQAKTLLQNNWASYQALVQAIRQKST 213
Query: 319 VGSCIDIIEDNIDDTD 334
+ C I + + D
Sbjct: 214 IADCQTAIANAPVEAD 229
>gi|387219543|gb|AFJ69480.1| hypothetical protein NGATSA_3025500, partial [Nannochloropsis
gaditana CCMP526]
Length = 522
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 122/233 (52%), Gaps = 16/233 (6%)
Query: 113 LQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGG-----------VGSLV 161
L ++A+ G A N++ ++ + L LA+ + VD+ + G +G
Sbjct: 278 LPMSALTPG-QAQNLWAYSGVALCVLELAICKFYGVDAGAVIPGTLLGFFVDRFLLGGRG 336
Query: 162 LDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFV 221
D + +Y R+I+HEAGHFL+A+L+G + L G+ +AGT F+
Sbjct: 337 FDALARVVLPRYKERIIRHEAGHFLLAHLLGCPVQDCVLRPV----FNGATFGEAGTIFL 392
Query: 222 DFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQ 281
D E++ KVSA++L+R+S I + G+A E L + A+GG +D L L+ LGF
Sbjct: 393 DQTLFEQLEKAKVSASSLDRYSTIVMGGIAAEALTFQNAQGGSSDEQALILLMSLLGFPL 452
Query: 282 KKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTD 334
+ +Q RW+ L VL+++ + A L A+ GKSVG C+ IED ++ TD
Sbjct: 453 DRISNQARWAALRAVLVIKDYPEAYEALVAALQAGKSVGQCVLAIEDALEATD 505
>gi|414079657|ref|YP_007001081.1| hypothetical protein ANA_C20593 [Anabaena sp. 90]
gi|413972936|gb|AFW97024.1| hypothetical protein ANA_C20593 [Anabaena sp. 90]
Length = 237
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 113/196 (57%), Gaps = 3/196 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
++ L + T+D+ S+ G G++VLD + FS +Y R++ HEAGHFL+AYL+GI Y
Sbjct: 37 TVTFLGIATLDNFSFQGKGGTIVLDLMAR-FSPEYKERILHHEAGHFLVAYLLGIPVTSY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA KK Q G D E ++ +G+++A L+R+ I +AG+A E L++
Sbjct: 96 TLSAWEAWKK--GQPGQGGITLEDSELTAQIETGQITAQMLDRYCTIWMAGIAAETLVFK 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D KL+ L+ +GF++ + R+ LL + L++ + + L AM S
Sbjct: 154 SAEGGNDDKIKLNEFLQVIGFSETGYQQKQRFHLLQSKNLIQENWDSYQALVTAMRKRAS 213
Query: 319 VGSCIDIIEDNIDDTD 334
+ C++II + D
Sbjct: 214 IVECLEIITSHKGTPD 229
>gi|427716407|ref|YP_007064401.1| hypothetical protein Cal7507_1091 [Calothrix sp. PCC 7507]
gi|427348843|gb|AFY31567.1| hypothetical protein Cal7507_1091 [Calothrix sp. PCC 7507]
Length = 226
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/201 (36%), Positives = 106/201 (52%), Gaps = 3/201 (1%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P + +L + T DS G G++ LD I FSQ++ +R+I HEAGHFL
Sbjct: 25 LFNLSPTIPALATFTILGIATFDSFRLQGKGGTIFLDWIA-GFSQEHRDRIIHHEAGHFL 83
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
IAYL I GYTL+++EA K+ Q G F D + ++ GK+SA L+R+ I
Sbjct: 84 IAYLFEIPISGYTLSAWEAWKQ--GQPGQGGVTFNDDQLASQLQVGKISAQMLDRYCTIW 141
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
+AG+A E L++ EGG D NKL +L LGF + R+ L LL+ + +
Sbjct: 142 MAGIAAETLVFNNTEGGADDQNKLAGVLTNLGFAASVRQQKQRFYALQAKTLLQENWPSY 201
Query: 307 AKLAVAMTMGKSVGSCIDIIE 327
L AM G SV C I+
Sbjct: 202 EALVKAMRQGASVADCQSAIK 222
>gi|254423778|ref|ZP_05037496.1| hypothetical protein S7335_3934 [Synechococcus sp. PCC 7335]
gi|196191267|gb|EDX86231.1| hypothetical protein S7335_3934 [Synechococcus sp. PCC 7335]
Length = 243
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 105/184 (57%), Gaps = 7/184 (3%)
Query: 148 VDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALK 207
+D G G++++D I S +Y R++ HEAGHFL+A+L+ I + YTL+++EA K
Sbjct: 46 IDRFGAQGRGGNILIDLISRQ-SSEYRQRILHHEAGHFLVAHLLDIPVQSYTLSAWEATK 104
Query: 208 K--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLA 265
G V TA ++ ++ +G +SA +NR+ + +AG+A E +YG AEGG
Sbjct: 105 AGVPGLGGVVLDTAAIE----NDLATGTISAQQVNRYCILWMAGIAAETQIYGSAEGGED 160
Query: 266 DINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDI 325
D KL L K T A++Q+RW+LL LL + K A L AM+ G SV SC+ +
Sbjct: 161 DQTKLRLLWKQTQRTASAAETQIRWALLQAQTLLEKQKQAYDALVEAMSKGSSVESCLQV 220
Query: 326 IEDN 329
IE N
Sbjct: 221 IETN 224
>gi|113477845|ref|YP_723906.1| hypothetical protein Tery_4446 [Trichodesmium erythraeum IMS101]
gi|110168893|gb|ABG53433.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 229
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 121/211 (57%), Gaps = 9/211 (4%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P I + ++L L T+D++ + G +++D + T S+K +R+I HEAGHFL
Sbjct: 25 IFNISPFYIAIATFSVLVLATIDTLGWQGQGSMILVDLVAGTSSEK-RDRIICHEAGHFL 83
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+AYL+ I GY L ++EA ++ S Q G F D + ++ SG +S+ ++R+ +
Sbjct: 84 VAYLLEIPISGYALNAWEAFRQGQS--SQGGVRFDDQKLAAQLYSGVISSQLVDRYCTVW 141
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVR--WSLLNTVLLLRRHKG 304
+AG+A E L+YG AEGG D K+ A+L+ L ++ +S+++ W+ L LL H+
Sbjct: 142 MAGIAAENLVYGNAEGGAEDRTKITAILRQL---KRPGESKLKQSWASLQARNLLENHQS 198
Query: 305 ARAKLAVAMTMGKSVGSCIDIIEDNID-DTD 334
A L AMT SV C I++++ D D
Sbjct: 199 AYKALVKAMTERSSVSDCYQTIKEHLSVDQD 229
>gi|427738135|ref|YP_007057679.1| hypothetical protein Riv7116_4718 [Rivularia sp. PCC 7116]
gi|427373176|gb|AFY57132.1| hypothetical protein Riv7116_4718 [Rivularia sp. PCC 7116]
Length = 230
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 105/189 (55%), Gaps = 5/189 (2%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ A+L + T DS S G G+L LD I FS +Y R++ HEAGHFL+A+++G+ GY
Sbjct: 37 TFAILGIATFDSWSLEGRGGTLFLDLIS-GFSSEYRERILHHEAGHFLVAHVLGVPVTGY 95
Query: 199 TLTSFEALK-KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLY 257
TL+++EA K K+ L AG F D E L + G ++ L R+ I +AG+A E L+Y
Sbjct: 96 TLSAWEAFKQKQKGL---AGVTFGDVELLSQFEKGAITTRILERYYTIWMAGIAAEKLVY 152
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
A GG D NKL +LK LG + + + R+S+L LL + A L M
Sbjct: 153 DDASGGADDQNKLKGVLKSLGCSDSFCNQKQRFSILQAKTLLENNWSAYQTLVDTMRKRA 212
Query: 318 SVGSCIDII 326
+V C DI+
Sbjct: 213 TVKECKDIL 221
>gi|186681070|ref|YP_001864266.1| hypothetical protein Npun_F0560 [Nostoc punctiforme PCC 73102]
gi|186463522|gb|ACC79323.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 225
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 108/184 (58%), Gaps = 3/184 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ A+L + T+DS S G G++ LD I FS ++ +R++ HEAGHFL+AYL+GI GY
Sbjct: 37 TFAILGIATLDSFSLQGKGGTIFLDWIA-GFSSEHRDRIVHHEAGHFLVAYLLGIPVTGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA K+ Q G +F D E ++ GK+SA L+R+ + +AG+A E L++
Sbjct: 96 TLSAWEAWKQ--GQPGQGGVSFDDGELASQLEVGKISAQMLDRYCTVWMAGIAAETLVFD 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D +KL +L LGF++ ++R+ L LL+ + + L AM S
Sbjct: 154 NAEGGSDDKSKLIGVLTVLGFSESVYQQKLRFHALQAKTLLQENWSSYEALVNAMRQRAS 213
Query: 319 VGSC 322
V C
Sbjct: 214 VEDC 217
>gi|434403589|ref|YP_007146474.1| hypothetical protein Cylst_1510 [Cylindrospermum stagnale PCC 7417]
gi|428257844|gb|AFZ23794.1| hypothetical protein Cylst_1510 [Cylindrospermum stagnale PCC 7417]
Length = 242
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/180 (37%), Positives = 103/180 (57%), Gaps = 3/180 (1%)
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
T+DS S G G++ LD I +FS++Y +R++ HEAGHFLIA+L+GI GYTL+++EA
Sbjct: 45 TLDSFSLQGQGGTIFLDWI-SSFSKEYRDRILHHEAGHFLIAHLLGIPATGYTLSAWEAW 103
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
K+ Q G F D + ++ +GK+ L+R+ + +AG+A E +++ AEGG D
Sbjct: 104 KQ--GQPGQGGVTFEDEQLASQLAAGKIGVQMLDRYCTVWMAGIAAETVVFNSAEGGTDD 161
Query: 267 INKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDII 326
+ KL +L LGFT+ + R+ L LL + + L AM SV C + I
Sbjct: 162 LGKLKGVLTTLGFTESICQQKQRFYALQAKNLLEENWSSYQALVKAMEQRASVEDCQNAI 221
>gi|119513307|ref|ZP_01632346.1| hypothetical protein N9414_11439 [Nodularia spumigena CCY9414]
gi|119462042|gb|EAW43040.1| hypothetical protein N9414_11439 [Nodularia spumigena CCY9414]
Length = 226
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 67/189 (35%), Positives = 108/189 (57%), Gaps = 3/189 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
++ +L + T D+ S G G+++LD I FS ++ +R+I HEAGHFL AYL+ I GY
Sbjct: 37 TVTILGIATFDNFSLQGQGGTIILDWIA-GFSPQHRDRIIHHEAGHFLTAYLLDIPVTGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA K+ Q G +F D E ++ G +S ++R+ I +AG+ E L++
Sbjct: 96 TLSAWEAWKQ--GQPGQGGVSFDDGELATQLEQGTISTQIIDRYCTIWMAGIVAETLVFN 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
+AEGG D NKL +L GLGF++ A + ++ L LL+ + + L +M S
Sbjct: 154 HAEGGADDKNKLAGVLAGLGFSESAALQKQKFHFLQAKTLLQENWSSYEALVKSMQQRVS 213
Query: 319 VGSCIDIIE 327
V C +I+
Sbjct: 214 VSDCQRMID 222
>gi|427420949|ref|ZP_18911132.1| hypothetical protein Lepto7375DRAFT_6912 [Leptolyngbya sp. PCC
7375]
gi|425756826|gb|EKU97680.1| hypothetical protein Lepto7375DRAFT_6912 [Leptolyngbya sp. PCC
7375]
Length = 230
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 116/211 (54%), Gaps = 7/211 (3%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ +P+ + ++ LL L T+D + + G G L++D + S +Y +R+++HEAGHFL
Sbjct: 25 LLAISPEGLATVTATLLGLITIDQLGWQGRGGRLLVDWLSQR-SPEYCDRIVRHEAGHFL 83
Query: 187 IAYLVGILPKGYTLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSC 244
AYL+GI YTLT++E L+ +GS V T +D V +GK+S ++R+
Sbjct: 84 TAYLLGIPIAAYTLTAWETLRSGVDGSGGVIFKTDGID----RAVENGKISQQEIDRYCT 139
Query: 245 IALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKG 304
+ +AG+A E ++YG A+GG D KL+ L + + + + RWS+L L+ +
Sbjct: 140 VWMAGIAAEEIVYGIAQGGRDDRLKLNLLWRQINRPLAEVPLKQRWSMLQAKTLIEKESD 199
Query: 305 ARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
A L AM + V C ++IE ++ ++
Sbjct: 200 AYGALIDAMAERRPVNECCELIEQHLQSAEV 230
>gi|440682851|ref|YP_007157646.1| hypothetical protein Anacy_3332 [Anabaena cylindrica PCC 7122]
gi|428679970|gb|AFZ58736.1| hypothetical protein Anacy_3332 [Anabaena cylindrica PCC 7122]
Length = 226
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 110/191 (57%), Gaps = 7/191 (3%)
Query: 138 LSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKG 197
+++A L + T+D+ S+ G G++VLD + FS +Y R++ HEAGHF++AYL+GI G
Sbjct: 36 VTVAFLGIATLDNFSFQGKGGTVVLDWLAR-FSLEYRERILHHEAGHFIVAYLLGIPVTG 94
Query: 198 YTLTSFEALK--KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYL 255
YTL+++EA K + G Q G D E +++ GK++ + + R+ I +AG+A E L
Sbjct: 95 YTLSAWEAWKIGQPG----QGGVILEDSEIAKQLEKGKITVSMVERYCNIWMAGIAAEML 150
Query: 256 LYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTM 315
++ AEGG D KL+ L LGF + D + R+ LL L++ + L AM
Sbjct: 151 VFKSAEGGGDDKAKLNQFLAALGFEENVFDQKQRFYLLQAKNLIQENWENYQNLVEAMRK 210
Query: 316 GKSVGSCIDII 326
G +V C +I
Sbjct: 211 GVNVEECKKVI 221
>gi|254413684|ref|ZP_05027453.1| hypothetical protein MC7420_3800 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196179281|gb|EDX74276.1| hypothetical protein MC7420_3800 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 228
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 115/207 (55%), Gaps = 11/207 (5%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F P + +L L TVDS+S+ G +++LD + S Y +R++ HEAGHFL
Sbjct: 26 IFNIPPAIPALATFGILGLVTVDSLSFQGKGTTVLLDWLAQRRS-DYRDRILHHEAGHFL 84
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSA----TTLNRF 242
+AYL+GI GYTLT++EALK+ Q G V F+ ++++ ++ TL+RF
Sbjct: 85 VAYLLGIPIAGYTLTAWEALKQG-----QPGLGGVQFD-TQQLSPNPLAIGEMRLTLDRF 138
Query: 243 SCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRH 302
+ +AG+A E L+YG AEGG+ D KL L+ + ++ RW++L +L+ +
Sbjct: 139 CTVWMAGIAAETLVYGEAEGGIDDCQKLKEALRLFERPVGEFTTKQRWAMLQAQTMLQDN 198
Query: 303 KGARAKLAVAMTMGKSVGSCIDIIEDN 329
A L AM SVG C +I+D+
Sbjct: 199 WEAYEALIKAMAERTSVGDCYRLIQDH 225
>gi|434399600|ref|YP_007133604.1| hypothetical protein Sta7437_3127 [Stanieria cyanosphaera PCC 7437]
gi|428270697|gb|AFZ36638.1| hypothetical protein Sta7437_3127 [Stanieria cyanosphaera PCC 7437]
Length = 223
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/208 (35%), Positives = 110/208 (52%), Gaps = 15/208 (7%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ +P +L ++ L TVD++S+N +L+LD + + RVI HEAGHFL
Sbjct: 25 ILNISPTIPAMATLGIMGLATVDTLSWNNRGVTLLLDVFS---TPQQRQRVIHHEAGHFL 81
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
AY +GI GY+LT++EALK+ + G AF E + + + TL+RF +
Sbjct: 82 TAYFLGIPVTGYSLTAWEALKQ--GQPGRGGVAFNTQELTTKPINFEEMRLTLDRFCTVW 139
Query: 247 LAGVATEYLLYGYAEGGLADINKLD-----ALLKGLGFTQKKADSQVRWSLLNTVLLLRR 301
+AG+A E L+YG AEGG D +L A L +G+ QK+ RW+ + ++ R
Sbjct: 140 MAGIAAEKLVYGNAEGGQEDCEQLRLALSLAGLPEIGYAQKQ-----RWAQIQATSIIER 194
Query: 302 HKGARAKLAVAMTMGKSVGSCIDIIEDN 329
H+ A L AM SV C II+DN
Sbjct: 195 HQNAYQALVTAMEQRASVVECCQIIQDN 222
>gi|428306504|ref|YP_007143329.1| hypothetical protein Cri9333_2980 [Crinalium epipsammum PCC 9333]
gi|428248039|gb|AFZ13819.1| hypothetical protein Cri9333_2980 [Crinalium epipsammum PCC 9333]
Length = 224
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 110/205 (53%), Gaps = 8/205 (3%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P +L T+D++S+ G +L+LD FS +Y +R+I+HEAGHFL
Sbjct: 25 LFNISPAIPAIAVFGILSFATLDTLSWRGKGVNLLLDGFAR-FSPEYRDRIIKHEAGHFL 83
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+AYL+GI GYTL+++EA +K Q G +F EF + +A + R+ +
Sbjct: 84 VAYLLGIPITGYTLSAWEAFQK--GQPGQGGVSFAPQEF-----TSPQAAIIIQRYCTVW 136
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
+AG+A E L+YG AEGG D KL +L G+G + + R+ +L +++ H
Sbjct: 137 MAGIAAENLVYGNAEGGGEDRQKLQEVLSGMGRSVNECLQLERFCILQAKTMIQEHLKVY 196
Query: 307 AKLAVAMTMGKSVGSCIDIIEDNID 331
+ AM KSV C+ II + +
Sbjct: 197 ESIVAAMQQRKSVEECLTIINEQLS 221
>gi|428224730|ref|YP_007108827.1| hypothetical protein GEI7407_1281 [Geitlerinema sp. PCC 7407]
gi|427984631|gb|AFY65775.1| hypothetical protein GEI7407_1281 [Geitlerinema sp. PCC 7407]
Length = 231
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 102/185 (55%), Gaps = 5/185 (2%)
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
T+D+ ++ G G+L++D + +FSQ + +RV+ HEAGHFL AY++GI Y L ++EA
Sbjct: 45 TLDAWAWQGQSGTLLIDWLS-SFSQAHRDRVLHHEAGHFLAAYILGIPVTSYALNAWEAF 103
Query: 207 KKEGSLNVQAGTAFVDFEFLE-EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLA 265
++ Q G F D LE E+ G +S+ +++R+ + +AG A E L+YG EGG
Sbjct: 104 RQ--GQPAQGGVQF-DLGPLEGELQQGLISSVSIDRYCTVWMAGAAAEKLVYGNVEGGAD 160
Query: 266 DINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDI 325
D K L + + ++ +VRWS L LL H A L AM SV C I
Sbjct: 161 DRAKFQQLWAQMRRSPEEGSQKVRWSDLRARTLLETHWEAYEALVAAMKQRASVEDCCQI 220
Query: 326 IEDNI 330
I+ ++
Sbjct: 221 IDQHL 225
>gi|443315429|ref|ZP_21044920.1| hypothetical protein Lep6406DRAFT_00038470 [Leptolyngbya sp. PCC
6406]
gi|442784969|gb|ELR94818.1| hypothetical protein Lep6406DRAFT_00038470 [Leptolyngbya sp. PCC
6406]
Length = 240
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 98/176 (55%), Gaps = 3/176 (1%)
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
T+D + G +G+L++ ++ FS + R+ HEA HFL+A+L+ I + Y+L ++EA
Sbjct: 54 TLDQIMGQGTLGNLIIGSLSR-FSPQQRQRIAHHEAAHFLVAHLLDIPIQDYSLNAWEAW 112
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
+K L Q G F + + GK+SA +NR++ + +AG+A E +YG A GG D
Sbjct: 113 RK--GLPGQGGVRFDTTDLEAPLAQGKISAQMVNRYATVWMAGIAAEQWIYGEALGGQDD 170
Query: 267 INKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSC 322
K L + LG + ++ +Q RW+ L +L H+ A L AMT G+SV C
Sbjct: 171 QQKFAILWQQLGRSSQERQTQQRWAALQARTMLEHHQDAYRALVAAMTAGESVADC 226
>gi|452822827|gb|EME29843.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 400
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 91/319 (28%), Positives = 170/319 (53%), Gaps = 36/319 (11%)
Query: 29 SSEVEVSKRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDEL 88
S++V+V+ +V +EL K + A +++ + P +R F + L + +
Sbjct: 101 SNQVQVA------EVYDELKKKNCLKAFSVIDSEEKLPEPVRYFVNCVPVIIDLVSFAKS 154
Query: 89 KLNG----IETRSL----LSPVDETLGSIERNLQLAAIVGGVSAWNVFG---FNPQQILY 137
K G +ET + L P +++ S QLA +V +S +F F+ + I+
Sbjct: 155 KQIGLDKLVETAKIPLEALRPRSQSIFS----YQLAGVVFSLSL--IFFSHLFHIENIIR 208
Query: 138 -LSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPK 196
+ ++LL + D ++ G + + + Y +V++HEAGHFL++YL+G +
Sbjct: 209 PIGISLLVGFLCDQIALQGAI----FEELYKNLFPIYKKKVLKHEAGHFLVSYLLGCPVR 264
Query: 197 GYTLTSFEALKKEGSLNV--QAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEY 254
G L+++E+L SL + QAGT F D + E+++G ++ T++R+S + +AG+A E
Sbjct: 265 GIVLSAWESL----SLGIPGQAGTLFFDRKLESELSTGFLTDATIDRYSIVLMAGIAAEA 320
Query: 255 LLYGYAEGGLADINKLDALLKGLG--FTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVA 312
L YG AEGG +D L +L L +++++ +Q RW++L +L++++ K A L A
Sbjct: 321 LEYGQAEGGQSDEAALLRVLTCLNPPWSKERVFNQARWAVLQAILIIKKQKKAYDALVHA 380
Query: 313 MTMGKSVGSCIDIIEDNID 331
++ K +G CI +IE N +
Sbjct: 381 LSSKKPLGDCIAMIEANYE 399
>gi|443327662|ref|ZP_21056282.1| hypothetical protein Xen7305DRAFT_00019340 [Xenococcus sp. PCC
7305]
gi|442792754|gb|ELS02221.1| hypothetical protein Xen7305DRAFT_00019340 [Xenococcus sp. PCC
7305]
Length = 223
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 97/191 (50%), Gaps = 5/191 (2%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +L L T+D++S+N +L LD + + RVI HEAGHFL AY +GI Y
Sbjct: 37 TFGILGLATIDTLSWNNKGATLFLDLF---VTSEQRQRVIHHEAGHFLTAYFLGIPITEY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
+LT++EA +K G Q G F + + K TL+R + +AG+A E L+Y
Sbjct: 94 SLTAWEAFRK-GHYG-QGGVVFEPNSAISKTVDKKDLPLTLDRLCTVWMAGIAAEKLVYD 151
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D +L L GF + RW L LL RHK A L +AM KS
Sbjct: 152 EAEGGQEDCQQLQIALNMAGFPNTIYSQKERWGQLQATSLLERHKRAFDALVIAMAERKS 211
Query: 319 VGSCIDIIEDN 329
V C II++N
Sbjct: 212 VTECCQIIQEN 222
>gi|411119080|ref|ZP_11391460.1| ATP-dependent Zn protease [Oscillatoriales cyanobacterium JSC-12]
gi|410710943|gb|EKQ68450.1| ATP-dependent Zn protease [Oscillatoriales cyanobacterium JSC-12]
Length = 229
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 101/188 (53%), Gaps = 3/188 (1%)
Query: 141 ALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTL 200
++L L VD+ ++ G +L+LD + FS + +RV++HEAGHFL+A+ + I GYTL
Sbjct: 39 SILGLAAVDTFNWQGKGSTLLLDRLA-GFSADHRDRVVRHEAGHFLVAHQLNIPVTGYTL 97
Query: 201 TSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYA 260
++E+ ++ + Q G F E E+ G +SA L+R+ I +AG A E+L YG
Sbjct: 98 NAWESFRRGQA--GQGGVQFNTQELDAELQQGTLSAHLLDRYCTIWMAGAAAEHLTYGNV 155
Query: 261 EGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVG 320
+GG D KL +L L F+ + RW+ L LL+ H L AM SV
Sbjct: 156 QGGTDDFQKLRLILSRLQFSPAAIQQKERWAALQAKTLLQEHWETYESLIAAMQRRASVE 215
Query: 321 SCIDIIED 328
C+ ++E
Sbjct: 216 ECLQVLEQ 223
>gi|427710166|ref|YP_007052543.1| hypothetical protein Nos7107_4875 [Nostoc sp. PCC 7107]
gi|427362671|gb|AFY45393.1| hypothetical protein Nos7107_4875 [Nostoc sp. PCC 7107]
Length = 228
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/188 (36%), Positives = 106/188 (56%), Gaps = 3/188 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +L + T+DS S G G++++D I FS ++ +R+I+HEAGHFL A L+GI GY
Sbjct: 37 TFTILGIATLDSFSLQGKGGTILVDWIA-GFSSEHRDRIIRHEAGHFLAASLLGIPVTGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA +K Q G AF D E ++ G +SA L+R+ + +AG+A E L++
Sbjct: 96 TLSAWEAWRK--GQPGQGGVAFNDVELAAQLEKGTISAQMLDRYCTMWMAGLAAEALVFD 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D KL +LK +GF++ + R+ +L LL + + L AM S
Sbjct: 154 RAEGGGDDKAKLAGVLKIVGFSESAFQQKQRFHVLQAKTLLEENWSSYEALVQAMQQRAS 213
Query: 319 VGSCIDII 326
V C I
Sbjct: 214 VEECQSAI 221
>gi|332706040|ref|ZP_08426112.1| hypothetical protein LYNGBM3L_13750 [Moorea producens 3L]
gi|332355132|gb|EGJ34600.1| hypothetical protein LYNGBM3L_13750 [Moorea producens 3L]
Length = 222
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/189 (35%), Positives = 105/189 (55%), Gaps = 3/189 (1%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +L L T+D S+ G +L+LD + T + ++ R+I+HEAGHFL+AYL+GI GY
Sbjct: 37 TFGVLSLVTLDGFSFQGKGLTLLLDVLAST-NPEHRGRIIRHEAGHFLVAYLLGIPITGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EALK EG L G +F + + + TL+RF + +AG+A E ++Y
Sbjct: 96 TLSAWEALK-EGQLG-NGGVSFDTEALSAKAYNLREMRLTLDRFCTVWMAGIAAETIVYE 153
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
EGG D KL L+GLGF+ + + RW+ ++ H + L AM S
Sbjct: 154 NVEGGAEDCEKLRDALEGLGFSGSEYSVKARWAERQATSMITEHWESYEALVAAMEKRAS 213
Query: 319 VGSCIDIIE 327
V C ++I+
Sbjct: 214 VAECCEVIQ 222
>gi|425440390|ref|ZP_18820693.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389719182|emb|CCH96950.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 224
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 107/195 (54%), Gaps = 11/195 (5%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIHHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFL---EEVNSGKVSATTLNRFSCIALAGVATEYL 255
+LT++EA ++ Q G V FE ++V + L R S + +AG+A E L
Sbjct: 94 SLTAWEAFRQG-----QEGVGGVQFELADLEQKVKNFTDFPAFLERISTVWMAGIAAETL 148
Query: 256 LYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTM 315
+YG AEGG +D L + LK G ++ + RWSLL LL + + A L +AM
Sbjct: 149 VYGKAEGGESDRFYLQSALKLAGVSENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEK 208
Query: 316 GKSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 209 RASVEECCRVISESL 223
>gi|428203846|ref|YP_007082435.1| hypothetical protein Ple7327_3688 [Pleurocapsa sp. PCC 7327]
gi|427981278|gb|AFY78878.1| hypothetical protein Ple7327_3688 [Pleurocapsa sp. PCC 7327]
Length = 224
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 102/191 (53%), Gaps = 6/191 (3%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+L ++ L T+D++++NG +L LD S + RV+ HEAGHFL+AY +GI GY
Sbjct: 37 TLGIMGLATLDNLTWNGKATTLFLDLFS---SPQQRQRVVHHEAGHFLVAYFLGIPVVGY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++EA K EG + G D LE+ + + L RF + +AG+A E ++YG
Sbjct: 94 TLSAWEAFK-EGQPGL--GGVVFDTNLLEKSADLREAPIILERFCTVWMAGIAAETIVYG 150
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
EGG +D L +L+ +G + + RW+ L LL +H+ + L AM S
Sbjct: 151 NVEGGESDRENLREVLRFVGLPESVYPQKERWAYLQAKSLLEKHQKSYEALVQAMEKRAS 210
Query: 319 VGSCIDIIEDN 329
V C+ I N
Sbjct: 211 VEECLQAIGRN 221
>gi|425465898|ref|ZP_18845201.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9809]
gi|389831799|emb|CCI25161.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9809]
Length = 224
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 106/195 (54%), Gaps = 11/195 (5%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIHHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFL---EEVNSGKVSATTLNRFSCIALAGVATEYL 255
+LT++EA ++ Q G V FE ++V + L R S + +AG+A E L
Sbjct: 94 SLTAWEAFRQG-----QEGVGGVQFELADLEQKVKNFTDFPAFLERISTVWMAGIAAETL 148
Query: 256 LYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTM 315
+YG AEGG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 149 VYGKAEGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEK 208
Query: 316 GKSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 209 RASVEECCRVISESL 223
>gi|425449462|ref|ZP_18829301.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 7941]
gi|440755177|ref|ZP_20934379.1| hypothetical protein O53_3577 [Microcystis aeruginosa TAIHU98]
gi|389763795|emb|CCI09732.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 7941]
gi|440175383|gb|ELP54752.1| hypothetical protein O53_3577 [Microcystis aeruginosa TAIHU98]
Length = 224
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 9/194 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIHHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
+LT++EA ++ EG VQ A D E ++V + L R S + +AG+A E L+
Sbjct: 94 SLTAWEAFRQGQEGVGGVQFDLA--DLE--QKVKNFTDFPAFLERISTVWMAGIAAETLV 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YG AEGG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 150 YGKAEGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEKR 209
Query: 317 KSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 210 TSVEECCRVISESL 223
>gi|428215775|ref|YP_007088919.1| hypothetical protein Oscil6304_5515 [Oscillatoria acuminata PCC
6304]
gi|428004156|gb|AFY84999.1| hypothetical protein Oscil6304_5515 [Oscillatoria acuminata PCC
6304]
Length = 230
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 74/246 (30%), Positives = 120/246 (48%), Gaps = 21/246 (8%)
Query: 85 LDELKLNGIETRSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLF 144
+ + LN + L + LG I L+++ ++G ++ ++ GF
Sbjct: 1 MSQTALNLVAISVFLMTLSVLLGPI---LEISPLIGAIATASLLGF-------------- 43
Query: 145 LWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFE 204
+T+D++ + G +L++D S + +RV++HEAGHFL+AYL+ I GY L ++E
Sbjct: 44 -FTLDTLGWQGKGANLIVDVFA-GMSPDHRSRVLRHEAGHFLVAYLLEIPITGYALNAWE 101
Query: 205 ALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGL 264
A K+ S G F D E L E+ G +SA +R+ + +AG+A E L Y AEGG
Sbjct: 102 AFKQGQSAG--GGVQFDDQELLAELQQGSLSAQLFDRYCIVWMAGIAAETLTYERAEGGA 159
Query: 265 ADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
D KL A+L + A + RW++L L+ + A L AM +V C
Sbjct: 160 EDRTKLKAILAQNRKSATDAQLKERWAILQAKTLIESNLDAYQSLVTAMEQRLAVSECYQ 219
Query: 325 IIEDNI 330
I+ I
Sbjct: 220 ILGSRI 225
>gi|209525216|ref|ZP_03273759.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376002447|ref|ZP_09780280.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067852|ref|ZP_17056642.1| hypothetical protein SPLC1_S590200 [Arthrospira platensis C1]
gi|209494401|gb|EDZ94713.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329187|emb|CCE16033.1| conserved exported hypothetical protein [Arthrospira sp. PCC 8005]
gi|406710595|gb|EKD05802.1| hypothetical protein SPLC1_S590200 [Arthrospira platensis C1]
Length = 227
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 111/192 (57%), Gaps = 3/192 (1%)
Query: 138 LSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKG 197
+ LL L T+D+ S G ++++D + S +R++ HEAGHFL+AYL+G+ +G
Sbjct: 36 MVFVLLGLATLDTFSLQGQGSTILVDWFAGS-SGSRRDRIVHHEAGHFLMAYLLGVPVEG 94
Query: 198 YTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLY 257
Y LT++EA ++ Q G F D E +E+++ G +S+ ++++ + +AG+A E L+Y
Sbjct: 95 YALTAWEAFRQ--GQKAQGGVRFNDVELMEQLDHGVISSEAIDKYCKVWMAGIAAEILVY 152
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
A GGL+D + L LG + + A+ + RW++L LL+++ A LA AM G
Sbjct: 153 DQALGGLSDRQNIRRLWTQLGRSPQDAEIKERWAILQAKTLLKQNWSALEALAAAMAEGA 212
Query: 318 SVGSCIDIIEDN 329
SV C +I D
Sbjct: 213 SVAECYQLIGDR 224
>gi|425469959|ref|ZP_18848851.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389880167|emb|CCI39058.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 224
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 106/195 (54%), Gaps = 11/195 (5%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIYHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFL---EEVNSGKVSATTLNRFSCIALAGVATEYL 255
+LT++EA ++ Q G V FE ++V + L R S + +AG+A E L
Sbjct: 94 SLTAWEAFRQG-----QEGVGGVQFELADLGQKVKNFTDFPAFLERISTVWMAGIAAETL 148
Query: 256 LYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTM 315
+YG AEGG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 149 VYGKAEGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEK 208
Query: 316 GKSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 209 RASVEECCRVISESL 223
>gi|425437140|ref|ZP_18817566.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9432]
gi|389677926|emb|CCH93178.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9432]
Length = 224
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 9/194 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIYHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
+LT++EA ++ EG VQ A D E ++V + L R S + +AG+A E L+
Sbjct: 94 SLTAWEAFRQGQEGVGGVQFDLA--DLE--QKVKNFTDFPAFLERISTVWMAGIAAETLV 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YG AEGG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 150 YGKAEGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEKR 209
Query: 317 KSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 210 TSVEECCRVISESL 223
>gi|425459174|ref|ZP_18838660.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389823155|emb|CCI28861.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 224
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 108/194 (55%), Gaps = 9/194 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIYHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
+LT++EA ++ EG VQ A D E ++V + L R S + +AG+A E L+
Sbjct: 94 SLTAWEAFRQGQEGVGGVQFDLA--DLE--QKVKNFTDFPAFLERISTVWMAGIAAETLV 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YG AEGG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 150 YGKAEGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEKR 209
Query: 317 KSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 210 ASVEECCRVISESL 223
>gi|443313443|ref|ZP_21043054.1| hypothetical protein Syn7509DRAFT_00003550 [Synechocystis sp. PCC
7509]
gi|442776386|gb|ELR86668.1| hypothetical protein Syn7509DRAFT_00003550 [Synechocystis sp. PCC
7509]
Length = 229
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 105/204 (51%), Gaps = 5/204 (2%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P + LL L TVDS G G++V D T S++ +R++ HEAGHFL
Sbjct: 25 LFNLSPTIPAIATFTLLGLATVDSFGLQGKGGAVVSDLFAWT-SRQNRDRILHHEAGHFL 83
Query: 187 IAYLVGILPKGYTLTSFEALK-KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCI 245
+A+ + I GYTL+++EALK K+ L G F D E + G + A L+R+ I
Sbjct: 84 VAHFLDIPVTGYTLSAWEALKQKQPGL---GGVTFADDELASNLQQGTLKAQLLDRYCTI 140
Query: 246 ALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGA 305
+AG+ E L+Y A GG D+NKL +L +GF+ + R S ++ +L + A
Sbjct: 141 WMAGITAENLVYNDAHGGADDLNKLKKVLTPIGFSAAAQAQKQRSSAISAKTILTENWKA 200
Query: 306 RAKLAVAMTMGKSVGSCIDIIEDN 329
L AM SV C II N
Sbjct: 201 YEALVQAMQQRVSVDECRIIIARN 224
>gi|307151928|ref|YP_003887312.1| hypothetical protein Cyan7822_2055 [Cyanothece sp. PCC 7822]
gi|306982156|gb|ADN14037.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
Length = 224
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 106/196 (54%), Gaps = 5/196 (2%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P +L++L L T+D +++NG +++LD + +++ RVI HEAGHFL
Sbjct: 25 LFHISPFIPAVTTLSVLGLTTLDGLAWNGRGITILLDILS---GEQHRQRVIHHEAGHFL 81
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
AY +GI GY L+ +EA K +G + G F LE+ + L+RFS +
Sbjct: 82 AAYFLGIPVTGYALSGWEAFK-QGQPGL-GGVMFDTTALLEKKVNFTEMPLILDRFSTVW 139
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
+AG+A E L+YG ++GG D KL A L+ G +Q + RW+LL L+ +H+ +
Sbjct: 140 MAGIAAETLIYGESKGGEEDRQKLRAALQWAGVSQNSYQQKERWALLQAKNLIEKHRSSY 199
Query: 307 AKLAVAMTMGKSVGSC 322
L AM SV C
Sbjct: 200 EALVTAMKQRASVEEC 215
>gi|425445256|ref|ZP_18825289.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9443]
gi|425453723|ref|ZP_18833476.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9807]
gi|389734799|emb|CCI01601.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9443]
gi|389800374|emb|CCI20248.1| Similar to tr|Q8YUD8|Q8YUD8 [Microcystis aeruginosa PCC 9807]
Length = 224
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 105/195 (53%), Gaps = 11/195 (5%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIHHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFL---EEVNSGKVSATTLNRFSCIALAGVATEYL 255
+LT++EA ++ Q G V FE ++V + L R S + +AG+A E L
Sbjct: 94 SLTAWEAFRQG-----QEGVGGVQFELADLEQKVKNFTDFPAFLERISTVWMAGIAAETL 148
Query: 256 LYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTM 315
+YG A GG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 149 VYGKAAGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEK 208
Query: 316 GKSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 209 RASVEECCRVISESL 223
>gi|298705023|emb|CBJ28498.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 665
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 108/204 (52%), Gaps = 12/204 (5%)
Query: 127 VFGFNP-QQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHF 185
+ G +P Q ++ +L L+F D + NG + +++ Y R+I+HEAGHF
Sbjct: 246 LMGLDPLQTVIPWTLGLIF---ADRLLLNGALS----ESLTRLIVPAYRERIIRHEAGHF 298
Query: 186 LIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCI 245
L+AYL+G + L F + G Q GT F D +++ GK++ ++++RFS +
Sbjct: 299 LVAYLLGCPVQACLLDPF--VLGSGLTGAQGGTVFADPVLSKQMADGKLTKSSIDRFSIV 356
Query: 246 ALAGVATEYLLYGYAEGGLADINKLDALLKGLG--FTQKKADSQVRWSLLNTVLLLRRHK 303
+ G+A E + YG +EGG AD + +L + F +++ SQ W+ VLL++ HK
Sbjct: 357 LMGGIAAEAINYGNSEGGSADEGVMIGILSTIAPPFDREQIKSQALWAATQAVLLIQEHK 416
Query: 304 GARAKLAVAMTMGKSVGSCIDIIE 327
A L A+ G +G C+ I+E
Sbjct: 417 EAYEVLVNALESGAELGECVTIVE 440
>gi|443654220|ref|ZP_21131283.1| hypothetical protein C789_1823 [Microcystis aeruginosa DIANCHI905]
gi|159029145|emb|CAO87505.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333892|gb|ELS48430.1| hypothetical protein C789_1823 [Microcystis aeruginosa DIANCHI905]
Length = 224
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 107/194 (55%), Gaps = 9/194 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIHHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
+LT++EA ++ EG VQ A D E ++V + L R S + +AG+A E L+
Sbjct: 94 SLTAWEAFRQGQEGVGGVQFDLA--DLE--QKVKNFTDFPAFLERISTVWMAGIAAETLV 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YG A GG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 150 YGKAAGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEKR 209
Query: 317 KSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 210 TSVEECCRVISESL 223
>gi|428779844|ref|YP_007171630.1| hypothetical protein Dacsa_1597 [Dactylococcopsis salina PCC 8305]
gi|428694123|gb|AFZ50273.1| hypothetical protein Dacsa_1597 [Dactylococcopsis salina PCC 8305]
Length = 247
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 104/201 (51%), Gaps = 14/201 (6%)
Query: 132 PQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLV 191
P I ++ + LL T D++++N +L LD + + RV+ HEAGHFL AY +
Sbjct: 53 PAGITFIIMGLL---TFDTLTWNNRGVTLFLDLFS---TAEQRERVLHHEAGHFLAAYFL 106
Query: 192 GILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSAT---TLNRFSCIALA 248
GI GY+LT++EA +++ Q G V F+ G+ L+RF + A
Sbjct: 107 GIPITGYSLTAWEAFRQK-----QPGKGGVQFDTTALEKGGQKPNQINLMLDRFCTVWCA 161
Query: 249 GVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAK 308
G+A E L YG AEGG D ++L A+LK LG+ + + WS L L+ R++ A
Sbjct: 162 GMAAEILSYGNAEGGEDDRSQLQAILKELGYPPSQRKQKQEWSTLQAKSLIERNQEAYQT 221
Query: 309 LAVAMTMGKSVGSCIDIIEDN 329
L M G SV SC II++
Sbjct: 222 LIKVMRQGASVESCEQIIQNQ 242
>gi|166368906|ref|YP_001661179.1| hypothetical protein MAE_61650 [Microcystis aeruginosa NIES-843]
gi|166091279|dbj|BAG05987.1| hypothetical protein MAE_61650 [Microcystis aeruginosa NIES-843]
Length = 224
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 108/194 (55%), Gaps = 9/194 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIYHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
+LT++EA ++ EG VQ A D E ++V + L R S + +AG+A E L+
Sbjct: 94 SLTAWEAFRQGQEGVGGVQFDLA--DLE--QKVKNFTDFPAFLERISTVWMAGIAAETLV 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YG A GG +D L + LK G ++ + RWSLL LL + + A L +AM
Sbjct: 150 YGKAAGGESDRFYLQSALKLAGVSENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEKR 209
Query: 317 KSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 210 ASVEECCRVISESL 223
>gi|422302890|ref|ZP_16390248.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389792204|emb|CCI12043.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 224
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 107/194 (55%), Gaps = 9/194 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIYHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
+LT++EA ++ EG VQ A D E ++V + L R S + +AG+A E L+
Sbjct: 94 SLTAWEAFRQGQEGVGGVQFDIA--DLE--QKVKNFTDFPAFLERISTVWMAGIAAETLV 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YG A GG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 150 YGKAAGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEKR 209
Query: 317 KSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 210 ASVEECCRVISESL 223
>gi|390442423|ref|ZP_10230423.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389834286|emb|CCI34549.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 224
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/194 (36%), Positives = 107/194 (55%), Gaps = 9/194 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ +LL L TVD++++N +L+LD + + K+ RVI HEAGHFL AY++GI Y
Sbjct: 37 TFSLLGLVTVDTLTFNNRGVTLLLDALSPS---KHRQRVIYHEAGHFLTAYILGIPIASY 93
Query: 199 TLTSFEALKK--EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLL 256
+LT++EA ++ EG VQ A D E ++V + L R S + +AG+A E L+
Sbjct: 94 SLTAWEAFRQGQEGVGGVQFDLA--DLE--QKVKNFTDFPAFLERISTVWMAGIAAETLV 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
YG A GG +D L + LK G + + RWSLL LL + + A L +AM
Sbjct: 150 YGKAAGGESDRFYLQSALKLAGVPENNYAQKERWSLLQAKTLLEKQQTAYQALVIAMEKR 209
Query: 317 KSVGSCIDIIEDNI 330
SV C +I +++
Sbjct: 210 ASVEECCRVISESL 223
>gi|428309122|ref|YP_007120099.1| hypothetical protein Mic7113_0784 [Microcoleus sp. PCC 7113]
gi|428250734|gb|AFZ16693.1| hypothetical protein Mic7113_0784 [Microcoleus sp. PCC 7113]
Length = 232
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 119/223 (53%), Gaps = 8/223 (3%)
Query: 113 LQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQK 172
+ L+A++G +F +P ++ +L L T+DS ++ G +L+LD + T SQ
Sbjct: 16 MTLSALLG-----PMFNISPVIPAATTITVLSLATLDSFNFQGKGATLLLDWLAGTRSQ- 69
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
+ +R+++HEAGHFL+AY++GI GYTLT++EA K+ G + G F +V
Sbjct: 70 HRDRILRHEAGHFLVAYILGIPITGYTLTAWEAFKQ-GQPGL-GGVMFNTDALSPDVLDV 127
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSL 292
+ TL+RF + +AG+A E L+YG AEGG D KL L LG + + R ++
Sbjct: 128 RQLQWTLDRFCKVWMAGIAAETLVYGSAEGGGEDRQKLRETLTLLGRQGSEFQLKERLAI 187
Query: 293 LNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTDI 335
L+ H + L AM S+ C ++I+ + +++
Sbjct: 188 RQAQTLIEEHWESYEALVAAMEQRASIAECYEVIQQHYQTSEV 230
>gi|300865139|ref|ZP_07109964.1| conserved exported hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336910|emb|CBN55114.1| conserved exported hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 225
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 100/200 (50%), Gaps = 4/200 (2%)
Query: 132 PQQILYL-SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYL 190
PQ + + + LL L D+ + G GS +L + S +R+++HEAGHFL+AYL
Sbjct: 29 PQAVPAIATFCLLGLAAFDTFQWQGQ-GSAILSDVLAGSSDDRRDRILRHEAGHFLVAYL 87
Query: 191 VGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGV 250
+ I GY L ++EA K+ Q G F D E ++ G S L+R+ + +AG+
Sbjct: 88 LEIPVSGYALNAWEAFKQ--GQTAQGGVRFEDQELASQMQKGAFSVQLLDRYCTVWMAGI 145
Query: 251 ATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLA 310
A E L+YG AEGG D NK+ +L L + + +LL L+ HK A L
Sbjct: 146 AAENLVYGNAEGGAEDRNKIRLILSQLRRPASEFKQKENRALLQARNLIENHKLAYEALV 205
Query: 311 VAMTMGKSVGSCIDIIEDNI 330
AM + V C I+ N+
Sbjct: 206 EAMKKREPVAECYRAIQQNL 225
>gi|409993255|ref|ZP_11276403.1| hypothetical protein APPUASWS_19152 [Arthrospira platensis str.
Paraca]
gi|291568940|dbj|BAI91212.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409935879|gb|EKN77395.1| hypothetical protein APPUASWS_19152 [Arthrospira platensis str.
Paraca]
Length = 227
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 108/191 (56%), Gaps = 3/191 (1%)
Query: 138 LSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKG 197
+ LL L T+D+ S G ++++D + S +R++ HEAGHFL+AYL+G+ +G
Sbjct: 36 MVFVLLGLATLDTFSLQGQGSTILVDWLAGA-SGSRRDRIVHHEAGHFLMAYLLGVPVEG 94
Query: 198 YTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLY 257
Y LT++EA ++ Q G F D E +E++N G +S+ ++++ + +AG+A E L+Y
Sbjct: 95 YALTAWEAFRQ--GQKAQGGVRFNDVELMEQLNHGVISSEAIDKYCKVWMAGIAAEILVY 152
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
A GG++D + L L + + A + RW++L LL+++ LA AM G
Sbjct: 153 DQALGGVSDRQNIRRLWTQLRRSPQDAQIKERWAILQAQTLLKQNWCTYEALAAAMAEGA 212
Query: 318 SVGSCIDIIED 328
SV C +I D
Sbjct: 213 SVAECYGLIGD 223
>gi|218441120|ref|YP_002379449.1| hypothetical protein PCC7424_4211 [Cyanothece sp. PCC 7424]
gi|218173848|gb|ACK72581.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 223
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 120/220 (54%), Gaps = 16/220 (7%)
Query: 113 LQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQK 172
+ L+A++G +F +P +L++L L T+D +++N +++LD + ++
Sbjct: 16 MTLSALLG-----PLFHISPFIPAVATLSVLGLTTLDGLAFNSRGITVLLDLLS---GEQ 67
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFE---FLEEV 229
+ R+I HEAGHFL+AYL+GI GYTL+++EA++++ Q+G V F+ E+
Sbjct: 68 HRQRIIHHEAGHFLVAYLLGIPVTGYTLSAWEAVQQK-----QSGLGGVIFDSTALTEKT 122
Query: 230 NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVR 289
+ + RF + +AG+A E L+YG ++GG D ++ + LK G Q + + R
Sbjct: 123 LTPTEMPLMIERFCTVWMAGIAAETLIYGESQGGEEDRFQVRSALKLAGLPQFNYEQKER 182
Query: 290 WSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDN 329
W+LL LL +H+ + L AM SV C I+ N
Sbjct: 183 WALLQAKNLLEKHQSSYEALVKAMEQRVSVEECYQILGTN 222
>gi|119487839|ref|ZP_01621336.1| hypothetical protein L8106_28286 [Lyngbya sp. PCC 8106]
gi|119455415|gb|EAW36553.1| hypothetical protein L8106_28286 [Lyngbya sp. PCC 8106]
Length = 229
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 106/193 (54%), Gaps = 3/193 (1%)
Query: 138 LSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKG 197
L+ +LL L T+D+ + G GS++L S + +R+++HEAGHFL+AYL+ I G
Sbjct: 36 LAFSLLVLGTIDTFALQGQ-GSMILIEGIEGISPEKRDRILRHEAGHFLVAYLMNIPISG 94
Query: 198 YTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLY 257
Y L + EA K+ Q G F D E ++ +G++SA ++++ + +AGVA E L Y
Sbjct: 95 YALNTLEAFKQ--GQTGQGGVRFNDQELWIQLQTGQLSAQRIDQYCTVWMAGVAAEILAY 152
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
A+GG D K+ A+ L +A+ + R + L L+ +++ A A L AM
Sbjct: 153 ENAQGGQEDRGKIRAVWTQLKRPLSEANLKERLATLQAQTLIEQNQSAYAALVEAMKQRI 212
Query: 318 SVGSCIDIIEDNI 330
SV C I+E ++
Sbjct: 213 SVEECYQILEKSL 225
>gi|257060634|ref|YP_003138522.1| hypothetical protein Cyan8802_2837 [Cyanothece sp. PCC 8802]
gi|256590800|gb|ACV01687.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 230
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/214 (34%), Positives = 110/214 (51%), Gaps = 14/214 (6%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P +L +L L +VDS+S+ +LVL G S + R+I HEAGHFL
Sbjct: 25 IFHISPFIPAVATLGILGLVSVDSLSWQSKGLTLVL---GLLSSPQERQRIIHHEAGHFL 81
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTL--NRFSC 244
+AY +GI GYTL+++EA K+ Q G V F+ L ++ KV T L RF
Sbjct: 82 VAYCLGIPVTGYTLSAWEAFKQG-----QIGYGGVKFD-LTLLSDLKVQETPLIFERFFT 135
Query: 245 IALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKG 304
+ +AG+A E ++YG A GG D L ++K G + + RW++L L+ +H+
Sbjct: 136 VWMAGIAAERVIYGNAIGGEQDRQILREMMKLAGILPQNYQQKERWAILQAKTLIEKHQD 195
Query: 305 ARAKLAVAMTMGKSVGSCIDIIEDNI---DDTDI 335
A L M G SV C I+ + D T+I
Sbjct: 196 AYQSLGEMMAQGASVEDCYQGIQKKLSWQDKTEI 229
>gi|219128995|ref|XP_002184685.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403794|gb|EEC43744.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 170
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 94/168 (55%), Gaps = 10/168 (5%)
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL--KKEGSLNVQAGTAFVDFEFLE 227
S ++++HEAGHFL AY++G +G L+++ AL ++ G+ V AGT+F D E
Sbjct: 1 SPGIQTKILRHEAGHFLAAYVLGCPVEGIVLSAWAALQDRRFGARQVSAGTSFFDPELSA 60
Query: 228 EVNSGK-VSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK----GLGFTQK 282
++N+ + V + ++R+S I +AG+A E YG A+GG D L A L G G
Sbjct: 61 QINNQQAVKRSAVDRYSIIVMAGIAAEAEQYGRADGGAGDEMALVAFLSQLNGGRGGGPW 120
Query: 283 KADS---QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
AD+ Q RW L VLLLR ++ A L A+ G S+G CI IE
Sbjct: 121 NADAIRNQARWGALQAVLLLRHYRPAYDALVDALERGGSLGDCIHAIE 168
>gi|218248042|ref|YP_002373413.1| hypothetical protein PCC8801_3286 [Cyanothece sp. PCC 8801]
gi|218168520|gb|ACK67257.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
Length = 230
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/214 (34%), Positives = 110/214 (51%), Gaps = 14/214 (6%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P +L +L L +VDS+S+ +LVL G S + R+I HEAGHFL
Sbjct: 25 IFHISPFIPAVATLGILGLVSVDSLSWQSKGLTLVL---GLLASPQERQRIIYHEAGHFL 81
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTL--NRFSC 244
+AY +GI GYTL+++EA K+ Q G V F+ L ++ KV T L RF
Sbjct: 82 VAYCLGIPVTGYTLSAWEAFKQG-----QIGYGGVKFD-LTLLSDLKVQETPLIFERFFT 135
Query: 245 IALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKG 304
+ +AG+A E ++YG A GG D L ++K G + + RW++L L+ +H+
Sbjct: 136 VWMAGIAAERVIYGNAIGGEQDRQILREMMKLAGILPQNYQQKERWAILQAKTLIEKHQD 195
Query: 305 ARAKLAVAMTMGKSVGSCIDIIEDNI---DDTDI 335
A L M G SV C I+ + D T+I
Sbjct: 196 AYQSLGEMMAQGASVEDCYQGIQKKLSWQDKTEI 229
>gi|282896479|ref|ZP_06304499.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198585|gb|EFA73466.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 222
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 101/189 (53%), Gaps = 7/189 (3%)
Query: 140 LALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYT 199
L L + T+D+ + G G++ LD + FS Y R+I HEAGHFL+A+L+GI GYT
Sbjct: 38 LGFLGIATLDNFRFEGRGGTIFLDWLAR-FSPTYKERIIYHEAGHFLVAHLLGITVTGYT 96
Query: 200 LTSFEALK--KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLY 257
L+++EA K + G Q G D E +++ GK+ + + ++ I +AG+A E L++
Sbjct: 97 LSAWEAWKVGQPG----QGGIILGDDEIAKQLERGKIGVSMVEKYCNIWMAGIAAELLVF 152
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
AEGG D KL+ L LGF + + + R+ LL LL ++ L AM
Sbjct: 153 NSAEGGGDDKAKLNQFLTVLGFQETLFEQKQRFHLLQAKNLLEQNWHTYQHLVQAMRNRL 212
Query: 318 SVGSCIDII 326
V C +I
Sbjct: 213 DVEECKKLI 221
>gi|282901158|ref|ZP_06309089.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281193990|gb|EFA68956.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 222
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 99/189 (52%), Gaps = 7/189 (3%)
Query: 140 LALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYT 199
L L + T+D+ + G G++ LD + FS Y R+I HEAGHFL+A+L GI GYT
Sbjct: 38 LGFLGIATLDNFGFEGRGGTIFLDWLAR-FSPTYQERIIYHEAGHFLVAHLSGISVTGYT 96
Query: 200 LTSFEALK--KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLY 257
L+++EA K + G Q G D E ++ GK+ + + ++ I +AG+A E L++
Sbjct: 97 LSAWEAWKIGQPG----QGGIILGDDEIARQLERGKIGVSMVEKYCNIWMAGIAAELLVF 152
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
AEGG D KL+ L LGF + + + R+ LL LL ++ L AM
Sbjct: 153 NSAEGGGDDKAKLNQFLTVLGFQETLFEQKQRFHLLQAKNLLEQNWQTYQNLVQAMRNRL 212
Query: 318 SVGSCIDII 326
V C +I
Sbjct: 213 DVEECKKLI 221
>gi|220906717|ref|YP_002482028.1| hypothetical protein Cyan7425_1290 [Cyanothece sp. PCC 7425]
gi|219863328|gb|ACL43667.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length = 217
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 98/190 (51%), Gaps = 13/190 (6%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ ALL L T+D + + G +LVLD +G S Y R++ HEAGHFL+AYL+ I GY
Sbjct: 37 TFALLGLTTLDQLGFEGRGTTLVLDWLGQR-SPAYRQRILHHEAGHFLVAYLLDIPVTGY 95
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYG 258
TL+++E+L+ + Q+G V F L+ + L R++ + +AG+A E L+YG
Sbjct: 96 TLSAWESLR-----SGQSGQGGVSFAPLQP----PFTPAELERYAIVWMAGIAAEVLVYG 146
Query: 259 YAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
AEGG D L L LG + Q R + L++ H+ L AM S
Sbjct: 147 AAEGGNDDRQTLQKLWAALG---QNGQLQERQAFRQAENLIQTHRTTYQALVAAMEQRAS 203
Query: 319 VGSCIDIIED 328
+ C ++E
Sbjct: 204 LPECQQLLES 213
>gi|56751429|ref|YP_172130.1| hypothetical protein syc1420_c [Synechococcus elongatus PCC 6301]
gi|56686388|dbj|BAD79610.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 212
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 105/203 (51%), Gaps = 9/203 (4%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P L +LL L+++D+V++ G +L+LD I S +Y R++ HEAGH+L
Sbjct: 4 MLGSSPLLPAGLGFSLLVLFSLDAVTWQGRGATLLLDGI-QQRSPEYRQRILHHEAGHYL 62
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVN---SGKVSATTLNRFS 243
+A +G+ GYTL+++EAL++ Q G V F+ G++S +L ++
Sbjct: 63 VATALGLPVTGYTLSAWEALRQG-----QPGRGGVQFQAAALEAEAAQGQLSQRSLEQWC 117
Query: 244 CIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHK 303
+ +AG A E L+YG EGG D + L + L +AD + RW LL LL + +
Sbjct: 118 QVLMAGAAAEQLVYGNVEGGADDRAQWKQLWRQLDRNPAEADLRSRWGLLRAKTLLEQQR 177
Query: 304 GARAKLAVAMTMGKSVGSCIDII 326
A L AM S+ C I
Sbjct: 178 PAYDALVAAMAAEASIEDCNQAI 200
>gi|67920453|ref|ZP_00513973.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
gi|67857937|gb|EAM53176.1| conserved hypothetical protein [Crocosphaera watsonii WH 8501]
Length = 224
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 100/195 (51%), Gaps = 9/195 (4%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ A+L L TVDS S+ G +L LD + +K R+I HEAGHFL AY +G+ GY
Sbjct: 37 TFAVLGLATVDSFSWGGKGLTLFLDLFTSSEERK---RIIHHEAGHFLAAYCLGVPITGY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTL--NRFSCIALAGVATEYLL 256
+LT++E ++ +AG V F+F + KVS L R + +AG+A E ++
Sbjct: 94 SLTAWETFRQ----GEKAGIGGVQFDFSLLSDQEKVSRNPLIVERTFTVLMAGIAAEKVI 149
Query: 257 YGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMG 316
Y EGG D L LLK LG + + W+LL LL RH+ A L M
Sbjct: 150 YNNVEGGEEDKQNLRELLKILGLRAELYQQKENWALLQAKNLLIRHQTAYEGLVKLMERR 209
Query: 317 KSVGSCIDIIEDNID 331
SV C +I++ ++
Sbjct: 210 ASVEECQQLIQEKMN 224
>gi|297598653|ref|NP_001046019.2| Os02g0169000 [Oryza sativa Japonica Group]
gi|255670633|dbj|BAF07933.2| Os02g0169000, partial [Oryza sativa Japonica Group]
Length = 387
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 88/164 (53%), Gaps = 4/164 (2%)
Query: 169 FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE 228
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +E
Sbjct: 222 FWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEKE 279
Query: 229 VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADS 286
+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L + + +
Sbjct: 280 LAEGHLSSTAFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSLCILLDPPLSVAQMAN 339
Query: 287 QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
+ RWS++ + LL+ HK A A+ G S+ + I IE+ I
Sbjct: 340 RARWSVMQSYNLLKWHKKAHRAAVKALESGHSLSTVIRRIEEAI 383
>gi|49388605|dbj|BAD25720.1| unknown protein [Oryza sativa Japonica Group]
gi|215678722|dbj|BAG95159.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218190146|gb|EEC72573.1| hypothetical protein OsI_06014 [Oryza sativa Indica Group]
gi|222622260|gb|EEE56392.1| hypothetical protein OsJ_05542 [Oryza sativa Japonica Group]
Length = 374
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 88/165 (53%), Gaps = 4/165 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +
Sbjct: 208 CFWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEK 265
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKAD 285
E+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L + +
Sbjct: 266 ELAEGHLSSTAFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSLCILLDPPLSVAQMA 325
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
++ RWS++ + LL+ HK A A+ G S+ + I IE+ I
Sbjct: 326 NRARWSVMQSYNLLKWHKKAHRAAVKALESGHSLSTVIRRIEEAI 370
>gi|359494650|ref|XP_002263645.2| PREDICTED: uncharacterized protein LOC100261626 [Vitis vinifera]
gi|297736143|emb|CBI24181.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 112/221 (50%), Gaps = 12/221 (5%)
Query: 115 LAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYH 174
LAA +GGVS G + + L A+L L +D+V G + + ++ Y
Sbjct: 177 LAAFLGGVSLLLSRGIDIRPNL---AAILGLAIIDAVFLGGSCLAQI-----SSYWPPYR 228
Query: 175 NRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKV 234
R++ HEAGH L AYL+G +G L A++ + QAGT F D + +E+ G++
Sbjct: 229 RRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKLEKELAEGRL 286
Query: 235 SATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADSQVRWSL 292
S TT +R+ + AG+A E L+YG AEGG D N ++ L T + +Q RWS+
Sbjct: 287 SGTTFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLRPPLTIGQMSNQARWSV 346
Query: 293 LNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDT 333
L + LL+ HK A A+ G S+ I IE+ + +
Sbjct: 347 LQSYNLLKWHKHAHRAAVKALESGGSLSVVIRRIEEAMSSS 387
>gi|172038426|ref|YP_001804927.1| hypothetical protein cce_3513 [Cyanothece sp. ATCC 51142]
gi|354554228|ref|ZP_08973533.1| hypothetical protein Cy51472DRAFT_2329 [Cyanothece sp. ATCC 51472]
gi|171699880|gb|ACB52861.1| conserved hypothetical protein [Cyanothece sp. ATCC 51142]
gi|353553907|gb|EHC23298.1| hypothetical protein Cy51472DRAFT_2329 [Cyanothece sp. ATCC 51472]
Length = 228
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 108/220 (49%), Gaps = 14/220 (6%)
Query: 113 LQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQK 172
+ L+A++G + +N+ F P + +L L TVDS S+ G +L LD H +
Sbjct: 16 MTLSALLGPI--FNISPFIPAAA---TFTILGLATVDSFSWGGKGLTLFLDLFTHA---E 67
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
R+I HEAGHFL AY +GI GYTLT++E K+ +AG V F+F +
Sbjct: 68 ERQRIIHHEAGHFLAAYCLGIPVTGYTLTAWETFKQ----GEEAGMGGVQFDFSLLSDQE 123
Query: 233 KVSATTL--NRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRW 290
K+S L R + +AG+A E ++Y AEGG D L ++K L + W
Sbjct: 124 KISKNPLIIERTFTVLMAGIAAENIIYEKAEGGKEDKQNLREIMKILDIPVHLYPQKESW 183
Query: 291 SLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
+LL LL RH+ + L M SV C +I+ +
Sbjct: 184 ALLQAKNLLLRHQTSYKALVKIMEKRASVEECQTLIQHTM 223
>gi|428770841|ref|YP_007162631.1| hypothetical protein Cyan10605_2506 [Cyanobacterium aponinum PCC
10605]
gi|428685120|gb|AFZ54587.1| hypothetical protein Cyan10605_2506 [Cyanobacterium aponinum PCC
10605]
Length = 236
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 102/200 (51%), Gaps = 10/200 (5%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I ++ L+L+ TVD++++ +L L+ S++ R+I HEAGHFL AY
Sbjct: 33 FIPATITFILLSLV---TVDTLAWGNQGTNLFLNLF---LSEEQKQRIIHHEAGHFLTAY 86
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLN-RFSCIALA 248
L I GYTLT +E +K N+ +G D FLEE L RF+ + +A
Sbjct: 87 LYEIPIIGYTLTPWENMKIN---NLGSGGVMFDTSFLEEKGQDLRELNLLTERFAVVLMA 143
Query: 249 GVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAK 308
G+A E L+Y +EGG D KL + K LG + + R ++L L+ ++K A
Sbjct: 144 GIAAEKLVYKNSEGGEEDNQKLSEIYKSLGINYSQIKIKQRLAILQAETLIEKYKDAYFA 203
Query: 309 LAVAMTMGKSVGSCIDIIED 328
L AM SV C IIE+
Sbjct: 204 LVEAMGKRLSVTECQAIIEE 223
>gi|81298895|ref|YP_399103.1| hypothetical protein Synpcc7942_0084 [Synechococcus elongatus PCC
7942]
gi|1174195|gb|AAA86649.1| orf5; Method: conceptual translation supplied by author
[Synechococcus elongatus PCC 7942]
gi|81167776|gb|ABB56116.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 233
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 105/203 (51%), Gaps = 9/203 (4%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P L +LL L+++D+V++ G +L+LD I S +Y R++ HEAGH+L
Sbjct: 25 MLGSSPLLPAGLGFSLLVLFSLDAVTWQGRGATLLLDGI-QQRSPEYRQRILHHEAGHYL 83
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVN---SGKVSATTLNRFS 243
+A +G+ GYTL+++EAL++ Q G V F+ G++S +L ++
Sbjct: 84 VATALGLPVTGYTLSAWEALRQG-----QPGRGGVQFQAAALEAEAAQGQLSQRSLEQWC 138
Query: 244 CIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHK 303
+ +AG A E L+YG EGG D + L + L +AD + RW LL LL + +
Sbjct: 139 QVLMAGAAAEQLVYGNVEGGADDRAQWKQLWRQLDRNPAEADLRSRWGLLRAKTLLEQQR 198
Query: 304 GARAKLAVAMTMGKSVGSCIDII 326
A L AM S+ C I
Sbjct: 199 PAYDALVAAMAAEASIEDCNQAI 221
>gi|13605649|gb|AAK32818.1|AF361805_1 At1g56180/F14G9_20 [Arabidopsis thaliana]
Length = 389
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 112/226 (49%), Gaps = 16/226 (7%)
Query: 112 NLQLAAIVGGVSAW--NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTF 169
++ LAA++GGVS P + L LA L DSV G + V +
Sbjct: 175 SIALAALLGGVSYLLSQEIDVRPNLAVILGLAYL-----DSVFLGGPCLAQV-----SCY 224
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
+ R++ HEAGH L+AYL+G +G L A++ + QAGT F D + E+
Sbjct: 225 WPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPVVAMQM--GVQGQAGTQFWDQKMESEI 282
Query: 230 NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADSQ 287
G++S ++ +R+S + AG+A E L+YG AEGG D N ++ L + + +Q
Sbjct: 283 AEGRLSGSSFDRYSMVLFAGIAAEALVYGEAEGGENDENLFRSISVLLEPPLSVAQMSNQ 342
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDT 333
RWS+L + LL+ HK A A+ +G + I IE+ + +
Sbjct: 343 ARWSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIEEAMSSS 388
>gi|212274643|ref|NP_001130100.1| uncharacterized protein LOC100191193 [Zea mays]
gi|194688286|gb|ACF78227.1| unknown [Zea mays]
gi|194701692|gb|ACF84930.1| unknown [Zea mays]
gi|413935728|gb|AFW70279.1| hypothetical protein ZEAMMB73_631346 [Zea mays]
Length = 388
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 86/165 (52%), Gaps = 4/165 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +
Sbjct: 221 CFWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEK 278
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKAD-- 285
E+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L A
Sbjct: 279 ELAEGHLSSTAFDRYCMIMFAGIAAEALVYGEAEGGENDENLFRSLCILLNPPPAVAQMA 338
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
++ RWS++ + LL+ HK A A+ G S+ I IE+ I
Sbjct: 339 NRARWSVMQSYNLLKWHKKAHRAAVKALEDGHSLSVVIRRIEEAI 383
>gi|18405720|ref|NP_564711.1| uncharacterized protein [Arabidopsis thaliana]
gi|12321763|gb|AAG50923.1|AC069159_24 hypothetical protein [Arabidopsis thaliana]
gi|24111289|gb|AAN46768.1| At1g56180/F14G9_20 [Arabidopsis thaliana]
gi|332195234|gb|AEE33355.1| uncharacterized protein [Arabidopsis thaliana]
Length = 389
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 112/226 (49%), Gaps = 16/226 (7%)
Query: 112 NLQLAAIVGGVSAW--NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTF 169
++ LAA++GGVS P + L LA L DSV G + V +
Sbjct: 175 SIALAALLGGVSYLLSQEIDVRPNLAVILGLAYL-----DSVFLGGTCLAQV-----SCY 224
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
+ R++ HEAGH L+AYL+G +G L A++ + QAGT F D + E+
Sbjct: 225 WPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPVVAMQM--GVQGQAGTQFWDQKMESEI 282
Query: 230 NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADSQ 287
G++S ++ +R+S + AG+A E L+YG AEGG D N ++ L + + +Q
Sbjct: 283 AEGRLSGSSFDRYSMVLFAGIAAEALVYGEAEGGENDENLFRSISVLLEPPLSVAQMSNQ 342
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDT 333
RWS+L + LL+ HK A A+ +G + I IE+ + +
Sbjct: 343 ARWSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIEEAMSSS 388
>gi|195655569|gb|ACG47252.1| hypothetical protein [Zea mays]
Length = 388
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 86/165 (52%), Gaps = 4/165 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +
Sbjct: 221 CFWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEK 278
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKAD-- 285
E+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L A
Sbjct: 279 ELAEGHLSSTAFDRYCMIMFAGIAAEALVYGEAEGGENDENLFRSLCILLNPPPAVAQMA 338
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
++ RWS++ + LL+ HK A A+ G S+ I IE+ I
Sbjct: 339 NRARWSVMQSYNLLKWHKKAHRAAVKALEDGHSLSVVIRRIEEAI 383
>gi|427723753|ref|YP_007071030.1| M41 family peptidase [Leptolyngbya sp. PCC 7376]
gi|427355473|gb|AFY38196.1| M41 family peptidase [Leptolyngbya sp. PCC 7376]
Length = 230
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 70/218 (32%), Positives = 109/218 (50%), Gaps = 18/218 (8%)
Query: 113 LQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQK 172
+ + +++GG+ +++ F P I ++ ++ + +VD+ G L LD ++
Sbjct: 16 MTMTSLLGGI--FHISPFVPAGI---TIFIMGIASVDTFQLQGRGAMLFLDLFT---PEE 67
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
RVIQHEAGHFL YL+GI GY+LT +EA+K N Q G V+F+ LE V
Sbjct: 68 ERKRVIQHEAGHFLAGYLLGIPITGYSLTPWEAIK-----NTQGGLGGVNFD-LEAVEES 121
Query: 233 KVSATTLN----RFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQV 288
+ +N R S +AG+A E L+Y +GG D +L +L G + +
Sbjct: 122 LQKSQHINLLVERISTTLMAGIAAEKLVYDQDKGGFEDRRQLKKMLLKAGLPSVVYEQKE 181
Query: 289 RWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDII 326
+W+ L LL R+K L AM KS+ C +II
Sbjct: 182 KWATLQATNLLERNKAGYDNLVKAMAARKSLEECYEII 219
>gi|357137025|ref|XP_003570102.1| PREDICTED: uncharacterized protein LOC100846921 [Brachypodium
distachyon]
Length = 386
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 87/164 (53%), Gaps = 4/164 (2%)
Query: 169 FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE 228
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +E
Sbjct: 221 FWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEKE 278
Query: 229 VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADS 286
+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L + + +
Sbjct: 279 LGEGHLSSTAFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSLCILLNPPLSIAQMAN 338
Query: 287 QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
+ RWS++ + LL+ HK A A+ G S+ I IE+ I
Sbjct: 339 RARWSVMQSYNLLKWHKKAHRAAVKALESGHSLSIVIRRIEEAI 382
>gi|6056367|gb|AAF02831.1|AC009894_2 Hypothetical protein [Arabidopsis thaliana]
Length = 368
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 112/226 (49%), Gaps = 16/226 (7%)
Query: 112 NLQLAAIVGGVSAW--NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTF 169
++ LAA++GGVS P + L LA L DSV G + V +
Sbjct: 154 SIALAALLGGVSYLLSQEIDVRPNLAVILGLAYL-----DSVFLGGTCLAQV-----SCY 203
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
+ R++ HEAGH L+AYL+G +G L A++ + QAGT F D + E+
Sbjct: 204 WPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPVVAMQM--GVQGQAGTQFWDQKMESEI 261
Query: 230 NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADSQ 287
G++S ++ +R+S + AG+A E L+YG AEGG D N ++ L + + +Q
Sbjct: 262 AEGRLSGSSFDRYSMVLFAGIAAEALVYGEAEGGENDENLFRSISVLLEPPLSVAQMSNQ 321
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDT 333
RWS+L + LL+ HK A A+ +G + I IE+ + +
Sbjct: 322 ARWSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIEEAMSSS 367
>gi|297853366|ref|XP_002894564.1| hypothetical protein ARALYDRAFT_474687 [Arabidopsis lyrata subsp.
lyrata]
gi|297340406|gb|EFH70823.1| hypothetical protein ARALYDRAFT_474687 [Arabidopsis lyrata subsp.
lyrata]
Length = 385
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 110/223 (49%), Gaps = 16/223 (7%)
Query: 112 NLQLAAIVGGVSAW--NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTF 169
++ LAA +GGVS P + L LA + DSV G + V +
Sbjct: 175 SIALAAFLGGVSYLLSQEIDVRPNLAVILGLAYM-----DSVFLGGTCLAQV-----SCY 224
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
+ R++ HEAGH L+AYL+G +G L A++ + QAGT F D + E+
Sbjct: 225 WPPHKRRIVVHEAGHLLVAYLMGCPIRGVILDPIVAMQM--GVQGQAGTQFWDQKMESEI 282
Query: 230 NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADSQ 287
G++S ++ +R+S + AG+A E L+YG AEGG D N ++ L + + +Q
Sbjct: 283 AEGRLSGSSFDRYSMVLFAGIAAEALIYGEAEGGENDENLFRSISVLLEPPLSVAQMSNQ 342
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
RWS+L + LL+ HK A A+ +G + I IE+ +
Sbjct: 343 ARWSVLQSYNLLKWHKAAHRAAVEALQVGSPLSIVIRRIEEAM 385
>gi|126659673|ref|ZP_01730802.1| hypothetical protein CY0110_29104 [Cyanothece sp. CCY0110]
gi|126619018|gb|EAZ89758.1| hypothetical protein CY0110_29104 [Cyanothece sp. CCY0110]
Length = 226
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 101/207 (48%), Gaps = 12/207 (5%)
Query: 126 NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHF 185
N+ F P ++ L L TVDS S+ G ++ LD T S+K R+I HEAGH
Sbjct: 27 NISPFIPAGTTFIILGLA---TVDSFSWGGKGLTVFLDLF--TSSEK-RRRIIHHEAGHL 80
Query: 186 LIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTL--NRFS 243
L AY +GI GYTLT++E LK+ QAG V F+F + KVS L R
Sbjct: 81 LAAYCLGIPVTGYTLTAWETLKQ----GEQAGIGGVQFDFSLLSDEEKVSKNPLILERTF 136
Query: 244 CIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHK 303
+ +AG+A E ++Y AEGG D L ++K L + W+LL LL RH+
Sbjct: 137 TVLMAGIAAENMIYSQAEGGQEDKENLREIMKILEIPSHLYPQKESWALLQAKNLLLRHE 196
Query: 304 GARAKLAVAMTMGKSVGSCIDIIEDNI 330
+L M SV C +I+ +
Sbjct: 197 TTYQELVKIMEKRASVEECQKLIQRTL 223
>gi|168019810|ref|XP_001762437.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686515|gb|EDQ72904.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 288
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 137/285 (48%), Gaps = 21/285 (7%)
Query: 56 LALVKDLQG---KPGGLRCFG---AARQIPQRLYTL-DELKLNGIETRSLLSPVDETLGS 108
L LV D G + G LR FG A +R+ T D L +G++ SL SP L
Sbjct: 12 LQLVGDAYGYLQQNGLLRNFGKYKARAAGAKRVVTAEDMLASSGLDASSL-SPKKWGLTG 70
Query: 109 IERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHT 168
QLAA G + ++ N I L++ ++ L +DS+ G + VL +
Sbjct: 71 AS-AFQLAA---GFALFSFLVNNNIDIRPLAVIIVGLSLLDSIYLGGAAQAQVL-----S 121
Query: 169 FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE 228
Y R++ HEAGH L+AYL+G +G L + EA K ++ QAGT F D E
Sbjct: 122 LWPGYKRRMLVHEAGHVLVAYLLGCPVRGVVLDAQEAFKS--GISGQAGTQFWDESLARE 179
Query: 229 VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGL--GFTQKKADS 286
+++ +L+R+ + AG+A E L+YG AEGG +D N ++ GL + + +
Sbjct: 180 SEQNRLTEASLDRYCIVLFAGIAAEGLVYGEAEGGESDENLYKGIISGLRPPWGPGRMSN 239
Query: 287 QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNID 331
RWS+L +L+ H+ + + G + S ++ IE ++
Sbjct: 240 HARWSVLQAFNMLKEHRKVHEAVVQELERGSDLASIVNSIETTME 284
>gi|326509693|dbj|BAJ87062.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 384
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 86/164 (52%), Gaps = 4/164 (2%)
Query: 169 FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE 228
F Y RV+ HEAGH L AYL+G +G L F AL+ + QAGT F D + +E
Sbjct: 219 FWPPYKRRVLVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDAKMEKE 276
Query: 229 VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADS 286
+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L + + +
Sbjct: 277 LGEGHLSSTAFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSLCVLLDPPLSVAQMAN 336
Query: 287 QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
+ RWS++ + LL+ HK A A+ G + I IE+ I
Sbjct: 337 RARWSVMQSYNLLKWHKKAHRAAVKALESGHGLSVVIRRIEEAI 380
>gi|194707252|gb|ACF87710.1| unknown [Zea mays]
Length = 277
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 86/165 (52%), Gaps = 4/165 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +
Sbjct: 110 CFWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEK 167
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKAD-- 285
E+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L A
Sbjct: 168 ELAEGHLSSTAFDRYCMIMFAGIAAEALVYGEAEGGENDENLFRSLCILLNPPPAVAQMA 227
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
++ RWS++ + LL+ HK A A+ G S+ I IE+ I
Sbjct: 228 NRARWSVMQSYNLLKWHKKAHRAAVKALEDGHSLSVVIRRIEEAI 272
>gi|242064154|ref|XP_002453366.1| hypothetical protein SORBIDRAFT_04g004670 [Sorghum bicolor]
gi|241933197|gb|EES06342.1| hypothetical protein SORBIDRAFT_04g004670 [Sorghum bicolor]
Length = 408
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 86/164 (52%), Gaps = 4/164 (2%)
Query: 169 FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE 228
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +E
Sbjct: 239 FWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEKE 296
Query: 229 VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKADS 286
+ G +S+T +R+ I AG A E L+YG AEGG D N +L L + + +
Sbjct: 297 LAEGHLSSTAFDRYCMILFAGTAAEALVYGEAEGGENDENLFRSLCILLNPPLSVAQMAN 356
Query: 287 QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
+ RWS++ + LL+ HK A A+ G S+ I IE+ I
Sbjct: 357 RARWSVMQSYNLLKWHKKAHRAAVKALEDGHSLSIVIRRIEEAI 400
>gi|170079441|ref|YP_001736079.1| M41 family peptidase [Synechococcus sp. PCC 7002]
gi|169887110|gb|ACB00824.1| peptidase family M41 [Synechococcus sp. PCC 7002]
Length = 228
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 68/221 (30%), Positives = 109/221 (49%), Gaps = 18/221 (8%)
Query: 113 LQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQK 172
+ + +++GG+ F +P ++ A++ TVD+ ++ G +L LD +
Sbjct: 16 MTMTSLLGGI-----FQLSPFVPAGITAAVMAFATVDTFNWQGRGMTLFLDLFTPA---E 67
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
R++ HEAGHFL YL+ I GY+LT +EA+K+ Q G V F+ LE V +
Sbjct: 68 QRQRILHHEAGHFLAGYLLSIPITGYSLTPWEAIKQG-----QTGQGGVSFD-LESVEAS 121
Query: 233 KVSATTLN----RFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQV 288
+ +N R S +AG+A E +YG A GG D ++L LL LG + +
Sbjct: 122 LKNPQQMNLLLERLSTTLMAGIAAETTIYGKALGGANDRDQLRQLLAKLGISVASYQQRE 181
Query: 289 RWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDN 329
+W+LL LL RH+ A L AM + C +I+ +
Sbjct: 182 QWALLQAKNLLDRHQDAFQNLVTAMADRAPLEECYRLIKQD 222
>gi|357451625|ref|XP_003596089.1| hypothetical protein MTR_2g066060 [Medicago truncatula]
gi|355485137|gb|AES66340.1| hypothetical protein MTR_2g066060 [Medicago truncatula]
Length = 390
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 108/218 (49%), Gaps = 12/218 (5%)
Query: 115 LAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYH 174
L A GGVS G + + L + L L F DS+ +G L I ++ Y
Sbjct: 179 LFAFFGGVSFLISQGIDLRPNLAVILGLAF---ADSIF----LGGTCLAQIS-SYWPPYR 230
Query: 175 NRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKV 234
R++ HEAGH LIAYL+G +G L A++ + QAGT F D + ++ G++
Sbjct: 231 RRILIHEAGHLLIAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKVASDLAEGRL 288
Query: 235 SATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDA--LLKGLGFTQKKADSQVRWSL 292
+ +R+ + AG+A E L+YG AEGG D N + LL + + +Q RWS+
Sbjct: 289 DGSAFDRYCMVLFAGIAAEALIYGEAEGGENDENLFRSVCLLLEPPLSVAEMSNQARWSV 348
Query: 293 LNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
L + LL+ HK A A+ G S+ + I IE+ +
Sbjct: 349 LQSYNLLKWHKAAHQAAVKALESGGSLSTVIRRIEETL 386
>gi|397570891|gb|EJK47513.1| hypothetical protein THAOC_33760 [Thalassiosira oceanica]
Length = 526
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 87/168 (51%), Gaps = 15/168 (8%)
Query: 176 RVIQHEAGHFLIAYLVGILPKGYTLTSFEALKK-----EGSLNVQAGTAFVDFEFLEEVN 230
R+ +HEAGHFL AYL+G +G L+++ AL + V AGT++ D + E+++
Sbjct: 262 RITKHEAGHFLCAYLLGCPVEGVVLSTWAALSDNRFGGRSTAAVSAGTSYYDIDLSEQIS 321
Query: 231 SGK-VSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADS--- 286
K ++ +++R+S I + G+A E L YG +EGG D L L+ L KA S
Sbjct: 322 GAKPLTRESIDRYSIIVMGGIAAEALEYGRSEGGAGDEEALVRFLRSLNPRSGKAVSQWT 381
Query: 287 ------QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
Q RW VL+L+ +K L A+ G +G C+ IE+
Sbjct: 382 PEVVRNQARWGATQAVLMLKEYKPCYDALVDALERGGDLGQCVSAIEN 429
>gi|388516525|gb|AFK46324.1| unknown [Medicago truncatula]
Length = 392
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 108/218 (49%), Gaps = 12/218 (5%)
Query: 115 LAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYH 174
L A GGVS G + + L + L L F DS+ +G L I ++ Y
Sbjct: 181 LFAFFGGVSFLISQGIDLRPNLAVILGLAF---ADSIF----LGGTCLAQIS-SYWPPYR 232
Query: 175 NRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKV 234
R++ HEAGH LIAYL+G +G L A++ + QAGT F D + ++ G++
Sbjct: 233 RRILIHEAGHLLIAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKVASDLAEGRL 290
Query: 235 SATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDA--LLKGLGFTQKKADSQVRWSL 292
+ +R+ + AG+A E L+YG AEGG D N + LL + + +Q RWS+
Sbjct: 291 DGSAFDRYCMVLFAGIAAEALIYGEAEGGENDENLFRSVCLLLEPPLSVAEMSNQARWSV 350
Query: 293 LNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
L + LL+ HK A A+ G S+ + I IE+ +
Sbjct: 351 LQSYNLLKWHKAAHQAAVKALESGGSLSTVIRRIEETL 388
>gi|428773378|ref|YP_007165166.1| hypothetical protein Cyast_1556 [Cyanobacterium stanieri PCC 7202]
gi|428687657|gb|AFZ47517.1| hypothetical protein Cyast_1556 [Cyanobacterium stanieri PCC 7202]
Length = 226
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 103/206 (50%), Gaps = 9/206 (4%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSY-NGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHF 185
+F +P + LL L TVD++++ N GV L+ I + QK RVI HEAGHF
Sbjct: 25 LFNLSPYIPAGATFILLSLITVDTLNWQNKGVNLLI--NIFASSEQK--ERVIYHEAGHF 80
Query: 186 LIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG-KVSATTLNRFSC 244
L A+L I +GYTLT++E LK+ N G D +FLE+ + L RF
Sbjct: 81 LTAHLFNIPIQGYTLTAWETLKQR---NGGIGGVIFDTQFLEKKGQDIREFNLMLERFGV 137
Query: 245 IALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKG 304
+ + G+A E L Y +EGG D+ K + + T + + R ++L ++ HK
Sbjct: 138 VLMGGIAAEKLQYKNSEGGQEDLIKFGEIYAPITLTSSNLEMRKRLAILQATTMIETHKE 197
Query: 305 ARAKLAVAMTMGKSVGSCIDIIEDNI 330
KL M KSV C +IE+N+
Sbjct: 198 TYLKLVEFMRQRKSVAECQALIEENL 223
>gi|428777520|ref|YP_007169307.1| hypothetical protein PCC7418_2964 [Halothece sp. PCC 7418]
gi|428691799|gb|AFZ45093.1| hypothetical protein PCC7418_2964 [Halothece sp. PCC 7418]
Length = 226
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/194 (31%), Positives = 101/194 (52%), Gaps = 11/194 (5%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+ A++ L T D++++ +L +D + + RV+ HEAGHFL AY +GI +GY
Sbjct: 37 TFAVMGLLTFDTLAWENRGVTLFVDLFS---TAEQRERVLYHEAGHFLTAYFLGIPIQGY 93
Query: 199 TLTSFEALKKEGSLNVQAGTAFVDFEF--LEEVNSGKVSAT-TLNRFSCIALAGVATEYL 255
+LT++EA +++ Q G V F+ LE+V + L+R + AG+A E L
Sbjct: 94 SLTAWEAFRRQ-----QPGQGGVQFDTTALEQVGTQPNQVNLMLDRACTVWCAGIAAEIL 148
Query: 256 LYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTM 315
YG AEGG D +L ++L G+ Q + + W+ L L+ R+ A L M
Sbjct: 149 QYGKAEGGGDDRAQLQSVLNDFGYPQNQRQQKEEWAKLQAKSLIERNIEAYHALVKVMRQ 208
Query: 316 GKSVGSCIDIIEDN 329
SV +C +I+++
Sbjct: 209 RASVETCEQMIQNH 222
>gi|449444266|ref|XP_004139896.1| PREDICTED: uncharacterized protein LOC101213430 [Cucumis sativus]
gi|449493060|ref|XP_004159181.1| PREDICTED: uncharacterized LOC101213430 [Cucumis sativus]
Length = 403
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 84/163 (51%), Gaps = 4/163 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
++ Y R++ HEAGH L AYL+G +G L A++ + QAGT F D +
Sbjct: 238 SYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKMAS 295
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKAD 285
+ G++ T+ +R+ + AG+A E L+YG AEGG D N ++ L + +
Sbjct: 296 NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVAQMS 355
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
+Q RW++L + LL+ HK A AM G S+ I IED
Sbjct: 356 NQARWAVLQSYNLLKWHKHAHQVAVKAMESGSSLSVVIRKIED 398
>gi|195652199|gb|ACG45567.1| hypothetical protein [Zea mays]
Length = 363
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 76/140 (54%), Gaps = 4/140 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
F Y R++ HEAGH L AYL+G +G L F AL+ + QAGT F D + +
Sbjct: 221 CFWPPYKRRILVHEAGHLLTAYLMGCPIRGVILDPFVALRM--GIQGQAGTQFWDEKMEK 278
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKAD-- 285
E+ G +S+T +R+ I AG+A E L+YG AEGG D N +L L A
Sbjct: 279 ELAEGHLSSTAFDRYCMIMFAGIAAEALVYGEAEGGENDENLFRSLCILLNPPPAVAQMA 338
Query: 286 SQVRWSLLNTVLLLRRHKGA 305
++ RWS++ + LL+ HK A
Sbjct: 339 NRARWSVMQSYNLLKWHKKA 358
>gi|356557024|ref|XP_003546818.1| PREDICTED: uncharacterized protein LOC100800880 [Glycine max]
Length = 386
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 84/165 (50%), Gaps = 4/165 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
++ Y R++ HEAGH L AYL+G +G L A++ + QAGT F D +
Sbjct: 220 SYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKVAN 277
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLD--ALLKGLGFTQKKAD 285
++ G++ T +R+ + AG+A E L+YG AEGG D N LL + +
Sbjct: 278 DLAEGRLDGTAFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRGICLLLDPPLSTAEMS 337
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNI 330
+Q RW++L + LL+ H+ A A+ G S+ I IE+ +
Sbjct: 338 NQARWAVLQSYNLLKWHRAAHRAAVKALESGDSLSVVIRSIEETM 382
>gi|224112297|ref|XP_002316146.1| predicted protein [Populus trichocarpa]
gi|222865186|gb|EEF02317.1| predicted protein [Populus trichocarpa]
Length = 353
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 83/155 (53%), Gaps = 4/155 (2%)
Query: 176 RVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVS 235
R++ HEAGH L+AYL+G +G L A++ QAGT F D + E+ GK+S
Sbjct: 197 RILVHEAGHLLVAYLMGCPVRGVILDPMVAMQM--GTQGQAGTQFWDEKLSNELAEGKLS 254
Query: 236 ATTLNRFSCIALAGVATEYLLYGYAEGGLADIN--KLDALLKGLGFTQKKADSQVRWSLL 293
T+ +R+ + AG+A E L+YG AEGG D N + +L + + +Q RWS+L
Sbjct: 255 GTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSTCVLLQPPLSVAQMSNQARWSVL 314
Query: 294 NTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
+ LL+ H+ A A+ G S+ I IE+
Sbjct: 315 QSFNLLKWHRDAHRAAVKALESGGSLSVVIRRIEE 349
>gi|434385007|ref|YP_007095618.1| ATP-dependent Zn protease [Chamaesiphon minutus PCC 6605]
gi|428015997|gb|AFY92091.1| ATP-dependent Zn protease [Chamaesiphon minutus PCC 6605]
Length = 229
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 94/189 (49%), Gaps = 11/189 (5%)
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
TVD+ V +LVLD I T ++ Y R++ HEAGHFL+AYL+GI GYTL+++E+
Sbjct: 51 TVDNFFLKSTVATLVLDAIAGTKAE-YRQRIVHHEAGHFLVAYLLGIPITGYTLSAWESF 109
Query: 207 KKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
++ S Q G F + +SA L + + +AG+A E L+Y ++GG D
Sbjct: 110 RQGQS--AQGGVMFAPPQ-------ANISAQLLQHYCTVWMAGIAAEKLVYDRSQGGGED 160
Query: 267 INKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDII 326
KL +L G Q++ Q + L L++ + A L AM V C I
Sbjct: 161 RQKLRGVLFVAGKQQQEIVKQENLATLQAKNLIQTNWEAYQALVTAMLDRTPVADCCTRI 220
Query: 327 EDNIDDTDI 335
D + D+
Sbjct: 221 -DRLKSPDL 228
>gi|416377179|ref|ZP_11683563.1| hypothetical protein CWATWH0003_0413 [Crocosphaera watsonii WH
0003]
gi|357266285|gb|EHJ14939.1| hypothetical protein CWATWH0003_0413 [Crocosphaera watsonii WH
0003]
Length = 165
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 83/164 (50%), Gaps = 6/164 (3%)
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
S + R+I HEAGHFL AY +G+ GY+LT++E ++ +AG V F+F
Sbjct: 6 SSEERKRIIHHEAGHFLAAYCLGVPITGYSLTAWETFRQ----GEKAGIGGVQFDFSLLS 61
Query: 230 NSGKVSATTL--NRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ 287
+ KVS L R + +AG+A E ++Y EGG D L LLK LG + +
Sbjct: 62 DQEKVSRNPLIVERTFTVLMAGIAAEKVIYNNVEGGEEDKQNLRELLKILGLRAELYQQK 121
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNID 331
W+LL LL RH+ A L M SV C +I++ ++
Sbjct: 122 ENWALLQAKNLLIRHQTAYEGLVKLMERRASVEECQQLIQEKMN 165
>gi|255544586|ref|XP_002513354.1| conserved hypothetical protein [Ricinus communis]
gi|223547262|gb|EEF48757.1| conserved hypothetical protein [Ricinus communis]
Length = 410
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 85/163 (52%), Gaps = 4/163 (2%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
++ + R++ HEAGH L+AYL+G +G L ++ + QAGT F D +
Sbjct: 227 SYWPPFKRRILVHEAGHLLVAYLMGCPIRGVILDPIVMMQM--GIQGQAGTQFWDEKLSN 284
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDAL--LKGLGFTQKKAD 285
E+ G++S TT +R+ + AG+A E L+YG AEGG D N ++ L + +
Sbjct: 285 ELADGRLSGTTFDRYCMVLFAGIAAETLVYGEAEGGENDENLFRSISVLLQPPLSVNQMS 344
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
+Q RWS+L + LL+ + A A+ G S+ I IE+
Sbjct: 345 NQARWSVLQSYNLLKWQRHAHRAAVKALESGSSLSVVIRRIEE 387
>gi|298491787|ref|YP_003721964.1| hypothetical protein Aazo_3097 ['Nostoc azollae' 0708]
gi|298233705|gb|ADI64841.1| conserved hypothetical protein ['Nostoc azollae' 0708]
Length = 200
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/151 (35%), Positives = 88/151 (58%), Gaps = 8/151 (5%)
Query: 147 TVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEAL 206
T+D+ S+ G G +VLD + FS + +R++ E GHFL+A L+GI GYTL+++EA
Sbjct: 45 TLDNFSFQGKGGMIVLDWLAR-FSPENPDRILHDETGHFLVAQLLGIPVTGYTLSAWEAW 103
Query: 207 K--KEGSLNVQAGTAFVDFEFL-EEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGG 263
K + G Q G D E + +++ GK+ + + + I +AG+A E +++ AEGG
Sbjct: 104 KLGQPG----QGGVTLEDSEIIAQQLEKGKIGVSMVELYCNIWMAGIAAENVVFKSAEGG 159
Query: 264 LADINKLDALLKGLGFTQKKADSQVRWSLLN 294
D KL+ L+ LGF +K + + R+ LL
Sbjct: 160 GDDKAKLNQFLQALGFEEKIFEQKQRFYLLQ 190
>gi|452822826|gb|EME29842.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 377
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 104/177 (58%), Gaps = 24/177 (13%)
Query: 138 LSLALLF---LWTVDSVSYNGGVGSLV---LDTI---GHTFSQKYHN-------RVIQHE 181
SL+L+F L+ ++++ G+ LV D I G F + Y N +V++HE
Sbjct: 171 FSLSLIFFSHLFHIENIIRPIGISLLVGFLCDQIALQGAIFEELYKNLFPIYKKKVLKHE 230
Query: 182 AGHFLIAYLVGILPKGYTLTSFEALKKEGSLNV--QAGTAFVDFEFLEEVNSGKVSATTL 239
AGHFL++YL+G +G L+++E+L SL + QAGT F D + E+++G ++ T+
Sbjct: 231 AGHFLVSYLLGCPVRGIVLSAWESL----SLGIPGQAGTLFFDRKLESELSTGFLTDATI 286
Query: 240 NRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLG--FTQKKADSQVRWSLLN 294
+R+S + +AG+A E L YG AEGG +D L +L L +++++ +Q RW++L
Sbjct: 287 DRYSIVLMAGIAAEALEYGQAEGGQSDEAALLRVLTCLNPPWSKERVFNQARWAVLQ 343
>gi|443321997|ref|ZP_21051033.1| ATP-dependent Zn protease [Gloeocapsa sp. PCC 73106]
gi|442788297|gb|ELR97994.1| ATP-dependent Zn protease [Gloeocapsa sp. PCC 73106]
Length = 208
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 99/200 (49%), Gaps = 17/200 (8%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+F +P ++ +L L +D++S+ ++L+ + RVI HEAGHFL
Sbjct: 25 LFNVSPVVPAGITAVVLGLGALDTLSWQNRGSDILLELFA---PEAERERVIHHEAGHFL 81
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+AY +G+ K YT+T++ A KK G V F++ E L + +
Sbjct: 82 VAYHLGVPIKDYTVTAWSAWKKG-----YPGKGGVQFDYSE---------IELTKLVTVL 127
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGAR 306
LAG+A E ++YG AEGG +D ++ L +G + +VR +L + +L+ ++ +
Sbjct: 128 LAGIAAEMIIYGKAEGGESDRQQVSIALSAVGLSTPAQRQKVRNALASAQQILQENQQSY 187
Query: 307 AKLAVAMTMGKSVGSCIDII 326
L AM+ G SV C +I
Sbjct: 188 QSLVSAMSQGLSVAECYGVI 207
>gi|16330018|ref|NP_440746.1| hypothetical protein sll1738 [Synechocystis sp. PCC 6803]
gi|383321761|ref|YP_005382614.1| hypothetical protein SYNGTI_0852 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324930|ref|YP_005385783.1| hypothetical protein SYNPCCP_0851 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490814|ref|YP_005408490.1| hypothetical protein SYNPCCN_0851 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436081|ref|YP_005650805.1| hypothetical protein SYNGTS_0852 [Synechocystis sp. PCC 6803]
gi|451814177|ref|YP_007450629.1| hypothetical protein MYO_18580 [Synechocystis sp. PCC 6803]
gi|1652505|dbj|BAA17426.1| sll1738 [Synechocystis sp. PCC 6803]
gi|339273113|dbj|BAK49600.1| hypothetical protein SYNGTS_0852 [Synechocystis sp. PCC 6803]
gi|359271080|dbj|BAL28599.1| hypothetical protein SYNGTI_0852 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274250|dbj|BAL31768.1| hypothetical protein SYNPCCN_0851 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277420|dbj|BAL34937.1| hypothetical protein SYNPCCP_0851 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957914|dbj|BAM51154.1| hypothetical protein BEST7613_2223 [Bacillus subtilis BEST7613]
gi|451780146|gb|AGF51115.1| hypothetical protein MYO_18580 [Synechocystis sp. PCC 6803]
Length = 231
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 99/198 (50%), Gaps = 7/198 (3%)
Query: 139 SLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGY 198
+L +L + T D +S+ G G+ + T ++K R++ HEAGHFL+A+ + I Y
Sbjct: 37 TLGILGIITADQISWQGK-GTDFFVGLFQTKAEK--ERILCHEAGHFLVAHCLQIPITNY 93
Query: 199 TLTSFEALKK-EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLY 257
+L+ +E L++ G + AG F + + L R++ + +AG+A E ++Y
Sbjct: 94 SLSPWEVLRQGAGGM---AGIQFDTTNLENQCRDWHLRPQALERWATVWMAGIAAEKIIY 150
Query: 258 GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGK 317
G ++GG D +L + G + K + W+ L LL +H+ A +L A+ +
Sbjct: 151 GESQGGNGDRQQLRQAFRRAGLPEIKLQQKESWAFLQAKNLLEQHRQAHGQLQQALAQRR 210
Query: 318 SVGSCIDIIEDNIDDTDI 335
SV C ++ ++ + +
Sbjct: 211 SVEECRHLLAQHLGEPAV 228
>gi|223998690|ref|XP_002289018.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976126|gb|EED94454.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 558
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 90/168 (53%), Gaps = 15/168 (8%)
Query: 176 RVIQHEAGHFLIAYLVGILPKGYTLTSFEALKK-----EGSLNVQAGTAFVDFEFLEEVN 230
R+ +HEAGHFL AYL+G +G L+++ AL+ + V AGT++ D + E+++
Sbjct: 301 RITKHEAGHFLCAYLLGCPVEGVVLSTWAALQDGRFGGRSTRAVSAGTSYYDLDLSEQIS 360
Query: 231 SGK-VSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLG---------FT 280
K ++ +++R+S I + G+A E + +G A+GG D L L+ L ++
Sbjct: 361 GMKPLTRESIDRYSIIVMGGIAAEAVEFGRADGGAGDEEALVRFLRSLNPRGGNAIKSWS 420
Query: 281 QKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
+ +Q R+ VLLL+ +K L A+ G +G+CI IE+
Sbjct: 421 PELIRNQARFGATEAVLLLKEYKPCYDALVDALERGGDLGNCIVAIEN 468
>gi|255078988|ref|XP_002503074.1| predicted protein [Micromonas sp. RCC299]
gi|226518340|gb|ACO64332.1| predicted protein [Micromonas sp. RCC299]
Length = 389
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 102/219 (46%), Gaps = 26/219 (11%)
Query: 115 LAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYH 174
LA IVG + F I +SLA+L + S G+ + +D S +Y
Sbjct: 185 LALIVGATLPGDWGAFGSYLIGGISLAVLAVG-----STAPGLLQVAIDRFAR-ISPEYR 238
Query: 175 NRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDF--EFLEEVNSG 232
NR+ +HEA HFL+ Y++G+ GY+L G A DF LE G
Sbjct: 239 NRIARHEAAHFLVGYMLGVPVAGYSLG--------------LGAAHTDFLEAKLERKVYG 284
Query: 233 K--VSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALL-KGLGFTQKKADSQV- 288
K +S T+ +C+++AGVA E L + G AD+ L +L K ++ QV
Sbjct: 285 KIAISEKTMLPLACVSMAGVAAEALAFEDVRGQEADLRDLQRILNKAEPKLSDQSQQQVT 344
Query: 289 RWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
RW++ +L+RH+ + L AM G SV C+ +IE
Sbjct: 345 RWAVWQAASMLKRHEASFDALTRAMEEGASVADCLKVIE 383
>gi|443478893|ref|ZP_21068583.1| hypothetical protein Pse7429DRAFT_4207 [Pseudanabaena biceps PCC
7429]
gi|443015746|gb|ELS30573.1| hypothetical protein Pse7429DRAFT_4207 [Pseudanabaena biceps PCC
7429]
Length = 235
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 97/216 (44%), Gaps = 18/216 (8%)
Query: 126 NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHF 185
+ G +P +++ LL LW +D + G G+ + S KY R+ HEAGHF
Sbjct: 24 TMVGLSPVVPAAITMTLLGLWGIDIGFWQGKFGAYAQVWL-RARSPKYRERISYHEAGHF 82
Query: 186 LIAYLVGILPKGYTLTSF------EALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTL 239
L A L+G Y + E L + ++ G V+ E V++ S+ L
Sbjct: 83 LAAQLLGFKVVNYAIAGIVGQSLGEVLASKNFTGLEGG---VEIALGETVSN---SSNLL 136
Query: 240 NRFSCIALAGVATEYLLY-GYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLL 298
+R+S + +AG+A E L+ G EGG+ D+ +L + L Q RW+LLN L
Sbjct: 137 DRYSTVYMAGIAAEQLMCEGATEGGMDDMQRLRQSIAHL----PNPMIQERWALLNAKNL 192
Query: 299 LRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTD 334
L H+ LA M G S+ C IE + D
Sbjct: 193 LSEHRSVLVALAAKMLNGASIADCSQTIEQALKTDD 228
>gi|359460352|ref|ZP_09248915.1| hypothetical protein ACCM5_16623 [Acaryochloris sp. CCMEE 5410]
Length = 221
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 87/185 (47%), Gaps = 19/185 (10%)
Query: 149 DSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKK 208
D+ Y G +L LD S +Y RVI HEAGHFL AYL+ + GYTLT++EA ++
Sbjct: 47 DTFGYQGKGMTLFLDGFAR-LSPQYRQRVIHHEAGHFLTAYLLDLPITGYTLTAWEAQQQ 105
Query: 209 EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADIN 268
G VDF S + + R+ + +AG E +Y AEGG D+
Sbjct: 106 GGGQGGICIETPVDF-------SETSALEQVERYCTVWMAGGVAETFIYSEAEGGKDDLR 158
Query: 269 KLDALLKGLGFT----QKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
+L L L +++A ++ R L+R + A L AMT KSV C
Sbjct: 159 QLRRTLNRLHMNVKVHERQAGNRAR-------QLIRSNWDAYEALVQAMTDRKSVADCCQ 211
Query: 325 IIEDN 329
I+E +
Sbjct: 212 ILEQH 216
>gi|449017348|dbj|BAM80750.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 384
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 96/193 (49%), Gaps = 20/193 (10%)
Query: 145 LWTVDSVSYNGGVGSLVLDTIG-----HTFSQKY----HNRVIQHEAGHFLIAYLVGILP 195
L V +V GG+G+ DT+ + QK R+ HEAGH L+++++G
Sbjct: 188 LQNVAAVVSLGGMGAWAFDTLALQGRVAAWLQKRWLTDPKRIAVHEAGHVLLSHVLGYEL 247
Query: 196 KGYTLTSFEALKKEGSLNVQAG-TAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEY 254
+ Y+L + N+ AG T V E L SG VSA+ F+ +++AG+A E
Sbjct: 248 ENYSL-------GQDRTNLLAGKTGVVLTEGLRAPTSGPVSAS--ESFALLSVAGIAAEA 298
Query: 255 LLYGYAEGGLADINKLDALLKGL-GFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAM 313
+ +G AEGG+ D+ L + F + VRW++L + + H+ ++A A+
Sbjct: 299 IFFGDAEGGMEDLATLSRYFRRTEPFLNADTQASVRWAMLAALRICNMHRAELEQIAEAL 358
Query: 314 TMGKSVGSCIDII 326
+GKSV C +II
Sbjct: 359 LLGKSVKECAEII 371
>gi|242077362|ref|XP_002448617.1| hypothetical protein SORBIDRAFT_06g030240 [Sorghum bicolor]
gi|241939800|gb|EES12945.1| hypothetical protein SORBIDRAFT_06g030240 [Sorghum bicolor]
Length = 315
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 93/201 (46%), Gaps = 19/201 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + +V +G+ T Y R+ +HEA HFL+AY
Sbjct: 130 FVPYLIGSISLIVLAVGSVAPGLLQAAIGAF------STVFPDYQERIARHEAAHFLVAY 183
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + + SGK+ L+R + +++AG
Sbjct: 184 LIGLPILGYSLD----IGKE-------HVNLIDEQLQKLIYSGKLDGKELDRLAVVSMAG 232
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLLRRHKGARA 307
+A E L Y G AD+ L + K Q RW++L LL+ +K A
Sbjct: 233 LAAEGLQYDKVVGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVLFAASLLKNNKAAHE 292
Query: 308 KLAVAMTMGKSVGSCIDIIED 328
L AM+ SV CI+ IE+
Sbjct: 293 ALMSAMSQKASVLGCIEAIEN 313
>gi|158333380|ref|YP_001514552.1| hypothetical protein AM1_0152 [Acaryochloris marina MBIC11017]
gi|158303621|gb|ABW25238.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 221
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 60/182 (32%), Positives = 87/182 (47%), Gaps = 19/182 (10%)
Query: 149 DSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKK 208
D+ Y G +L LD S +Y RVI HEAGHFL AYL+ + GYTLT++EA ++
Sbjct: 47 DTFGYQGKGMTLFLDGFAR-LSPQYRQRVIHHEAGHFLTAYLLDLPITGYTLTAWEAQQQ 105
Query: 209 EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADIN 268
G VDF E N+ + + R+ + +AG E +Y AEGG D+
Sbjct: 106 GGGQGGVCIETPVDF---SETNALE----QVERYCTVWMAGGVAETFIYSEAEGGKDDLR 158
Query: 269 KLDALLKGLGFT----QKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCID 324
+L L L +++A ++ R ++R + A L AMT KSV C
Sbjct: 159 QLRRTLNRLHMNVKVHERQAGNRAR-------QMIRSNWDAYEALVQAMTDRKSVAECCQ 211
Query: 325 II 326
I+
Sbjct: 212 IL 213
>gi|159472210|ref|XP_001694244.1| hypothetical protein CHLREDRAFT_205570 [Chlamydomonas reinhardtii]
gi|158276907|gb|EDP02677.1| predicted protein [Chlamydomonas reinhardtii]
Length = 346
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 87/176 (49%), Gaps = 18/176 (10%)
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
G+ S+V+D F Y +RV++HEA HFL+ YLVG+ GY++ L +E
Sbjct: 179 GLLSVVIDRFSQLFPD-YKDRVLKHEAAHFLLGYLVGVPITGYSVE----LGRE------ 227
Query: 216 AGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK 275
T F + + + + +++ ++ + +A+AG+A E Y G AD+ L LL
Sbjct: 228 -HTEFAEAKIQQRIIERQLTDEEIDSLALVAVAGMAAEGREYEEVMGQTADLYDLQRLL- 285
Query: 276 GLGFTQKKADSQ----VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
L K +D+Q RW++ + LLR H L AM G SV C+ +IE
Sbjct: 286 -LRSKTKLSDAQQQNITRWAVWSAAGLLRSHAEEHKALIEAMRRGASVAECVQVIE 340
>gi|58531976|emb|CAE03618.3| OSJNBb0003B01.9 [Oryza sativa Japonica Group]
gi|116310870|emb|CAH67811.1| OSIGBa0138H21-OSIGBa0138E01.2 [Oryza sativa Indica Group]
gi|125549916|gb|EAY95738.1| hypothetical protein OsI_17606 [Oryza sativa Indica Group]
Length = 325
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + ++ +G+ T Y R+ +HEA HFL+AY
Sbjct: 140 FVPYLIGSISLIVLAIGSISPGLLQAAIGAF------STVFPDYQERIARHEAAHFLVAY 193
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + + SG++ ++R + +++AG
Sbjct: 194 LIGLPILGYSLD----IGKE-------HVNLIDDQLQKLIYSGQLDQKEIDRLAVVSMAG 242
Query: 250 VATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKADSQVRWSLLNTVLLLRRHKGARA 307
+A E L Y G AD+ L + T+ + + RW++L + LL+ +K A
Sbjct: 243 LAAEGLEYDKVVGQSADLFTLQRFINRTKPPLTKDQQQNLTRWAVLFSASLLKNNKAAHE 302
Query: 308 KLAVAMTMGKSVGSCIDIIED 328
L AM+ SV CI+ IE+
Sbjct: 303 ALMSAMSQKASVLGCIEAIEN 323
>gi|125591796|gb|EAZ32146.1| hypothetical protein OsJ_16343 [Oryza sativa Japonica Group]
Length = 322
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + ++ +G+ T Y R+ +HEA HFL+AY
Sbjct: 137 FVPYLIGSISLIVLAIGSISPGLLQAAIGAF------STVFPDYQERIARHEAAHFLVAY 190
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + + SG++ ++R + +++AG
Sbjct: 191 LIGLPILGYSLD----IGKE-------HVNLIDDQLQKLIYSGQLDQKEIDRLAVVSMAG 239
Query: 250 VATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKADSQVRWSLLNTVLLLRRHKGARA 307
+A E L Y G AD+ L + T+ + + RW++L + LL+ +K A
Sbjct: 240 LAAEGLEYDKVVGQSADLFTLQRFINRTKPPLTKDQQQNLTRWAVLFSASLLKNNKAAHE 299
Query: 308 KLAVAMTMGKSVGSCIDIIED 328
L AM+ SV CI+ IE+
Sbjct: 300 ALMSAMSQKASVLGCIEAIEN 320
>gi|224131418|ref|XP_002328534.1| predicted protein [Populus trichocarpa]
gi|222838249|gb|EEE76614.1| predicted protein [Populus trichocarpa]
Length = 120
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/50 (70%), Positives = 45/50 (90%)
Query: 282 KKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNID 331
KKADSQ+ S+L+T+L+LRRH+GARAKLA AMTMG SVGSCI +I+D+I+
Sbjct: 34 KKADSQIGCSVLSTILVLRRHEGARAKLAEAMTMGTSVGSCIGMIDDSIN 83
>gi|195624212|gb|ACG33936.1| hypothetical protein [Zea mays]
Length = 315
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 93/201 (46%), Gaps = 19/201 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + +V +G+ T Y R+ +HEA HFL+AY
Sbjct: 130 FVPYLIGSISLIVLAVGSVAPGLLQAAIGAF------STVFPDYQERIARHEAAHFLVAY 183
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + + SG++ L+R + +++AG
Sbjct: 184 LIGLPILGYSLD----IGKE-------HVNLIDEQLQKLIYSGQLDGKELDRLAVVSMAG 232
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLLRRHKGARA 307
+A E L Y G AD+ L + K Q RW++L LL+ +K A
Sbjct: 233 LAAEGLEYDKVVGQSADLFTLQRFMNRTKPQLSKDQQQNLTRWAVLFAASLLKNNKAAHE 292
Query: 308 KLAVAMTMGKSVGSCIDIIED 328
L AM+ SV CI+ IE+
Sbjct: 293 ALMSAMSQKASVLGCIEAIEN 313
>gi|226498890|ref|NP_001140871.1| hypothetical protein [Zea mays]
gi|194701534|gb|ACF84851.1| unknown [Zea mays]
gi|414585199|tpg|DAA35770.1| TPA: hypothetical protein ZEAMMB73_590939 [Zea mays]
Length = 315
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 93/201 (46%), Gaps = 19/201 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + +V +G+ T Y R+ +HEA HFL+AY
Sbjct: 130 FVPYLIGSISLIVLAVGSVAPGLLQAAIGAF------STVFPDYQERIARHEAAHFLVAY 183
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + + SG++ L+R + +++AG
Sbjct: 184 LIGLPILGYSLD----IGKE-------HVNLIDEQLQKLIYSGQLDGKELDRLAVVSMAG 232
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLLRRHKGARA 307
+A E L Y G AD+ L + K Q RW++L LL+ +K A
Sbjct: 233 LAAEGLEYDKVVGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVLFAASLLKNNKAAHE 292
Query: 308 KLAVAMTMGKSVGSCIDIIED 328
L AM+ SV CI+ IE+
Sbjct: 293 ALMSAMSQKASVLGCIEAIEN 313
>gi|115460866|ref|NP_001054033.1| Os04g0639300 [Oryza sativa Japonica Group]
gi|113565604|dbj|BAF15947.1| Os04g0639300, partial [Oryza sativa Japonica Group]
Length = 173
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 81/158 (51%), Gaps = 13/158 (8%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
Y R+ +HEA HFL+AYL+G+ GY+L + KE +D + + + SG
Sbjct: 25 YQERIARHEAAHFLVAYLIGLPILGYSLD----IGKE-------HVNLIDDQLQKLIYSG 73
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKADSQVRW 290
++ ++R + +++AG+A E L Y G AD+ L + T+ + + RW
Sbjct: 74 QLDQKEIDRLAVVSMAGLAAEGLEYDKVVGQSADLFTLQRFINRTKPPLTKDQQQNLTRW 133
Query: 291 SLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
++L + LL+ +K A L AM+ SV CI+ IE+
Sbjct: 134 AVLFSASLLKNNKAAHEALMSAMSQKASVLGCIEAIEN 171
>gi|297805804|ref|XP_002870786.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316622|gb|EFH47045.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 205
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 92/194 (47%), Gaps = 48/194 (24%)
Query: 36 KRRVLKQVDEELSKGDERAALALVKDLQGKPGGLRCFGAARQIPQRLYTLDELKLNGIET 95
+RR L++V +ELS+G+ AL+LVK L+ K G L FG+A+ +P+ KL+
Sbjct: 24 RRRSLERVSKELSRGNYETALSLVKQLKSKHGCLSAFGSAKLLPK--------KLDMSSK 75
Query: 96 RSLLSPVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNG 155
L S +D SIE ++ + +S +P++ +
Sbjct: 76 SDLWSLIDSVSRSIE-SVYVEEDSVRISKEEETKTSPEEDWFA----------------- 117
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
V+QHE+GHFL+ YL+G+LP+ Y + + EA+++ S +V
Sbjct: 118 ---------------------VVQHESGHFLVGYLLGVLPRYYEIPTLEAVRQNVS-SVT 155
Query: 216 AGTAFVDFEFLEEV 229
FV FEFL++V
Sbjct: 156 GRVEFVGFEFLKQV 169
>gi|428216674|ref|YP_007101139.1| hypothetical protein Pse7367_0400 [Pseudanabaena sp. PCC 7367]
gi|427988456|gb|AFY68711.1| hypothetical protein Pse7367_0400 [Pseudanabaena sp. PCC 7367]
Length = 268
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/235 (25%), Positives = 108/235 (45%), Gaps = 34/235 (14%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
V G +P +++A+L ++ +D +++G + VL+ + S + R++ HEAGHFL
Sbjct: 34 VIGISPVLTAAITVAILAIYGIDLGAFSGRGTNFVLNWL-EGRSPERMERILHHEAGHFL 92
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKV-SATTLNRFSCI 245
AYL G+ GY L A +++ +G A EV+ ++ L+R+ +
Sbjct: 93 AAYLSGLKIIGYNLDPLSAPSNMSNMSNMSGMAVAGV----EVDPAQIKQPQMLSRYCTV 148
Query: 246 ALAGVATEYLL--------------------YGYA-------EGGLADINKLDALLKGLG 278
+AG+A E L G A GG+ D+ +L A +K L
Sbjct: 149 WMAGIAAEQYLDASDRQSSELNPNPSSQISSAGNAWTGDLGNTGGMNDLRQLKAAVKELA 208
Query: 279 FTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDT 333
+K + ++RW+LL + L++ + A L M G ++ C I+ N+ D+
Sbjct: 209 -PEKNPEMEMRWALLRSRTLIKEYSAAYEALVEQMRSGAALAECYGAIDANLPDS 262
>gi|326516958|dbj|BAJ96471.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326518308|dbj|BAJ88183.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 320
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 78/158 (49%), Gaps = 13/158 (8%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
Y R+ +HEA HFL+AYL G+ GY+L + KE VD + + + SG
Sbjct: 172 YQERIARHEAAHFLVAYLTGLPILGYSLD----IGKE-------HVNLVDDQLQKLLYSG 220
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRW 290
++ ++R + I++AG+A E L Y G AD+ L L KA Q RW
Sbjct: 221 QLDQKEVDRLAVISMAGLAAEGLQYDKVVGQSADLFTLQRFLNRTKPPLGKAQQQNLTRW 280
Query: 291 SLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
++L LL+ +K A L AM+ +V CI+ IE+
Sbjct: 281 AVLIAASLLKNNKAAHDALVAAMSQKATVLGCIEAIEN 318
>gi|388511020|gb|AFK43576.1| unknown [Lotus japonicus]
Length = 330
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 92/200 (46%), Gaps = 19/200 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + ++ +GS T Y R+ +HEA HFLIAY
Sbjct: 138 FVPYLIGSISLVVLAVGSISPGLLQAAIGSF------STVFPDYQERIARHEAAHFLIAY 191
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + SG++ A ++R + +++AG
Sbjct: 192 LLGVPILGYSLD----IGKE-------HVNLIDQRLEKLIYSGQLDAKEIDRLAVVSMAG 240
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLLRRHKGARA 307
+A E L+Y G AD+ L + K Q RW+++ LL+ +K +
Sbjct: 241 LAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVMFAASLLKNNKESHE 300
Query: 308 KLAVAMTMGKSVGSCIDIIE 327
L +MT SV CI IE
Sbjct: 301 ALMASMTKKASVVECIQTIE 320
>gi|357166250|ref|XP_003580649.1| PREDICTED: uncharacterized protein LOC100846062 [Brachypodium
distachyon]
Length = 328
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 77/158 (48%), Gaps = 13/158 (8%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
Y R+ +HEA HFL+AYL G+ GY+L + KE VD + + + SG
Sbjct: 180 YQERIARHEAAHFLVAYLTGLPILGYSLD----IGKE-------HVNLVDEQLQKLIYSG 228
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRW 290
++ L+R + +++AG+A E L Y G AD+ L L K Q RW
Sbjct: 229 QLDQKELDRLAVVSMAGLAAEGLEYDKVVGQSADLFTLQRFLNRTKPPLSKDQQQNLTRW 288
Query: 291 SLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
++L LL+ +K A L AM+ SV CI+ IE+
Sbjct: 289 AVLIAASLLKNNKAAHDALVSAMSEKASVLGCIEAIEN 326
>gi|452825243|gb|EME32241.1| sarcosine dimethylglycine methyltransferase [Galdieria sulphuraria]
Length = 645
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 91/188 (48%), Gaps = 17/188 (9%)
Query: 142 LLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLT 201
+L +W +D++S NG V + F K RV HEAGHFL AYL+G+ + Y++
Sbjct: 150 VLGIWGLDNISLNGTVTNFC----SGIFQNK--QRVAVHEAGHFLTAYLLGVKIQSYSID 203
Query: 202 SFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAE 261
+ ALK+ +G F + L+ + + S L I LAG+A E L++G AE
Sbjct: 204 VWTALKQG---RASSGVLFNQEDVLKLLKYNR-SYVVL-----IWLAGIAAEVLVFGLAE 254
Query: 262 GGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGS 321
GG DI+++ L+K + D V+ + + +RRH L AM SV
Sbjct: 255 GGSRDIDQIKLLMKPDEV--QDFDQFVKRVIYQAMECIRRHSKTFESLQKAMLRNDSVEK 312
Query: 322 CIDIIEDN 329
C IE N
Sbjct: 313 CFMEIERN 320
>gi|388500492|gb|AFK38312.1| unknown [Lotus japonicus]
Length = 330
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 57/200 (28%), Positives = 92/200 (46%), Gaps = 19/200 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + ++ +GS T Y R+ +HEA HFLI+Y
Sbjct: 138 FVPYLIGSISLVVLAVGSISPGLLQAAIGSF------STVFPDYQERIARHEAAHFLISY 191
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + SG++ A ++R + +++AG
Sbjct: 192 LLGVPILGYSLD----IGKE-------HVNLIDQRLEKLIYSGQLDAKEIDRLAVVSMAG 240
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLLRRHKGARA 307
+A E L+Y G AD+ L + K Q RW+++ LL+ +K +
Sbjct: 241 LAAEGLIYDKVIGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAVMFAASLLKNNKESHE 300
Query: 308 KLAVAMTMGKSVGSCIDIIE 327
L +MT SV CI IE
Sbjct: 301 ALMASMTKKASVVECIQTIE 320
>gi|356512746|ref|XP_003525077.1| PREDICTED: uncharacterized protein LOC100807209 [Glycine max]
Length = 330
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/201 (29%), Positives = 90/201 (44%), Gaps = 19/201 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + ++ +GS T Y R+ +HEA HFLIAY
Sbjct: 145 FVPYLIGSISLVVLAVGSISPGLLQAAIGSF------STLFPDYQERIARHEAAHFLIAY 198
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ Y+L + KE +D + + SG++ A L+R + +++AG
Sbjct: 199 LLGLPIFDYSLD----IGKE-------HVNLIDERLEKLIYSGQLDAKELDRLAVVSMAG 247
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLLRRHKGARA 307
+A E L Y G AD+ L + K Q RW++L LL+ +K
Sbjct: 248 LAAEGLTYDKVVGQSADLFSLQRFINRTKPPLSKDQQQNLTRWAVLFAASLLKNNKVTHE 307
Query: 308 KLAVAMTMGKSVGSCIDIIED 328
L AMT SV CI IE+
Sbjct: 308 ALMAAMTKKASVLECIRAIEN 328
>gi|223995189|ref|XP_002287278.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976394|gb|EED94721.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 176
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 85/171 (49%), Gaps = 18/171 (10%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTS-------FEALKKEGSLNVQAGTAFVDFEF 225
Y R+IQHE+GHFLI +L+G K Y ++ F L E S+ + A + F+
Sbjct: 7 YRKRMIQHESGHFLIGHLLGWPVKSYQASNAVKNAVEFYPLSDE-SIGKERARA-LGFDA 64
Query: 226 LEEVN-SGKVSAT----TLNRFSCIALAGVATEYLLYGYAEGGLADINKL----DALLKG 276
N +G AT T+++ + I++AG E L YG AEGG+AD+ +L A
Sbjct: 65 RRNTNDNGNAQATVEEKTIDKLAIISVAGACAEILAYGNAEGGVADLLQLRRIYGAAASS 124
Query: 277 LGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
++ R++L ++LLR+H GA LA M +V CI IE
Sbjct: 125 KSMNASDEETPTRFALGYAMVLLRQHLGALDALAEIMEKDGTVADCILAIE 175
>gi|302787118|ref|XP_002975329.1| hypothetical protein SELMODRAFT_103319 [Selaginella moellendorffii]
gi|300156903|gb|EFJ23530.1| hypothetical protein SELMODRAFT_103319 [Selaginella moellendorffii]
Length = 271
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 77/157 (49%), Gaps = 13/157 (8%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
Y R+++HEA HFLIAYLVG+ GY+L + KE VD + +++ G
Sbjct: 124 YQERLLRHEAAHFLIAYLVGLPVLGYSLD----IGKE-------HVNLVDEKLQKKIYEG 172
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKADSQVRW 290
+ L+R + +++AG+A E L Y G AD+ L L+ + ++ + RW
Sbjct: 173 FLDGNELDRLAVVSMAGLAAEALKYDKVIGQSADLFSLQRLINRSKPKLSNEQQQNLTRW 232
Query: 291 SLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
++L L++ A L AM SV CI+ IE
Sbjct: 233 AVLYAGSLIKNKSKAFDSLMAAMERNASVSECIEAIE 269
>gi|302762090|ref|XP_002964467.1| hypothetical protein SELMODRAFT_81106 [Selaginella moellendorffii]
gi|300168196|gb|EFJ34800.1| hypothetical protein SELMODRAFT_81106 [Selaginella moellendorffii]
Length = 271
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 77/157 (49%), Gaps = 13/157 (8%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
Y R+++HEA HFLIAYLVG+ GY+L + KE VD + +++ G
Sbjct: 124 YQERLLRHEAAHFLIAYLVGLPVLGYSLD----IGKE-------HVNLVDEKLQKKIYEG 172
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKADSQVRW 290
+ L+R + +++AG+A E L Y G AD+ L L+ + ++ + RW
Sbjct: 173 FLDGNELDRLAVVSMAGLAAEALKYDKVIGQSADLFSLQRLINRSKPKLSNEQQQNLTRW 232
Query: 291 SLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
++L L++ A L AM SV CI+ IE
Sbjct: 233 AVLYAGSLVKNKSKAFDSLMAAMERNASVSECIEAIE 269
>gi|428221638|ref|YP_007105808.1| hypothetical protein Syn7502_01617 [Synechococcus sp. PCC 7502]
gi|427994978|gb|AFY73673.1| hypothetical protein Syn7502_01617 [Synechococcus sp. PCC 7502]
Length = 209
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 90/200 (45%), Gaps = 30/200 (15%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFS--QKYHNRVIQHEAGHFLI 187
F P I LA+ + +D+ G G+ ++ I F ++ R++ HEAGHFL+
Sbjct: 31 FIPTGITVFCLAI---YALDNAYQEGKGGAAIISWIESKFPNHKEKQARILYHEAGHFLV 87
Query: 188 AYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIAL 247
A+L+GI GY L QAG EV++ V TL R+ I +
Sbjct: 88 AHLLGIKVVGYKLKP------------QAGV---------EVDNSTVGLNTLERYCTIWM 126
Query: 248 AGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARA 307
AG+A E LY A GG D+ KL A + T + RW+ + L+R +K A A
Sbjct: 127 AGIAAEEYLYQNANGGDDDLLKLRA---AIAHTPNPGLEE-RWAKVRARNLIRANKEAFA 182
Query: 308 KLAVAMTMGKSVGSCIDIIE 327
L M V +C +I+
Sbjct: 183 ALVGKMQEEAPVEACYQVID 202
>gi|225438565|ref|XP_002276081.1| PREDICTED: uncharacterized protein LOC100264638 [Vitis vinifera]
gi|296082500|emb|CBI21505.3| unnamed protein product [Vitis vinifera]
Length = 323
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 77/162 (47%), Gaps = 13/162 (8%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
+F Y R+ +HEA HFL+AYL+G+ GY+L + KE +D +
Sbjct: 171 SFFPDYQERIARHEAAHFLVAYLLGLPILGYSLD----IGKE-------HVNLIDERLEK 219
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ 287
+ SG++ L+R + +A+AG+A E L Y G AD+ L + K Q
Sbjct: 220 LIYSGQLDTKELDRLAVVAMAGLAAEGLQYDKVVGQSADLFTLQRFINRSKPQLSKDQQQ 279
Query: 288 --VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
RW++L L++ +K L AM+ +V CI+ IE
Sbjct: 280 NLTRWAVLFAGSLIKNNKAIHEALMTAMSKKGTVLECIEAIE 321
>gi|424513258|emb|CCO66842.1| predicted protein [Bathycoccus prasinos]
Length = 316
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 48/171 (28%), Positives = 86/171 (50%), Gaps = 18/171 (10%)
Query: 162 LDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFV 221
+DT+ + +Y R+++HEA HFLIAYL GI Y+L E ++V+ A +
Sbjct: 157 VDTVSR-LNPEYMERIVKHEAAHFLIAYLSGIPVSSYSLGLME-------MHVELLEAKI 208
Query: 222 DFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQ 281
+ + + + +G ++ + + +A++GVA E + G AD L +L K + T+
Sbjct: 209 EKKLVGK--AGVITTEEMEALAVVAMSGVAAEAKYFEKVAGQEAD---LFSLQKAMNKTE 263
Query: 282 KKADSQ-----VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
K +Q RW++ ++ ++ + +L AM GKSV C+ IE
Sbjct: 264 PKLGAQKEQSVTRWAVYKAARIITDNEKSYEQLCQAMKEGKSVAECVKAIE 314
>gi|428169741|gb|EKX38672.1| hypothetical protein GUITHDRAFT_46078, partial [Guillardia theta
CCMP2712]
Length = 172
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 78/174 (44%), Gaps = 18/174 (10%)
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
G+ LVL I SQ+Y R++ HEAGHFL+ Y +G+ Y+ + LN
Sbjct: 14 GLLQLVLIQIRRAASQEYRERLLWHEAGHFLVGYCLGLPIAAYS-------ADDPILN-- 64
Query: 216 AGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALL- 274
F D E +S L+ ++ AGV E + +G A GG AD ++L + L
Sbjct: 65 -AVQFFDSE------EDSISHEFLDVLCAVSTAGVVAEAIRFGDAIGGYADFSQLQSFLY 117
Query: 275 -KGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
++ +VRW + +L+ + + L AM+ S+ CI IE
Sbjct: 118 RSRPRLNDREQQERVRWGCVAAFTILKNREESLKALIEAMSRKSSISQCILAIE 171
>gi|307106758|gb|EFN55003.1| hypothetical protein CHLNCDRAFT_134819 [Chlorella variabilis]
Length = 873
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 149/339 (43%), Gaps = 51/339 (15%)
Query: 5 IYNNLCLLSSRPRVKVLSLSRIVSSSEVEVSK-RRVLKQVDEELSKGDERAALALVKDLQ 63
Y + L+ + + + +++ + S+ ++++ RR + +E+S GD LA V+ L
Sbjct: 568 FYRTIFFLTLKWKERAITVDTLFSNLDIDLDNYRRAPATLKQEVS-GD---VLATVEKL- 622
Query: 64 GKPGGLRCFGAA-RQIPQRL-YTLDELKLNGIETRSLLSPVDETLGSIERNLQ-LAAIVG 120
G L+ +GAA +P+R L EL+L G++ ++ + S+ + L ++VG
Sbjct: 623 ANAGALKKWGAALSDLPERRSVMLGELRLVGVKQPEKIAQL-----SVRNDAAFLFSVVG 677
Query: 121 GVS-AWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIG-------HTFSQK 172
S A V G P + S L+ GG+ SLV+ IG H
Sbjct: 678 TTSVAAVVLGQLPGDWGFFSSYLI-----------GGI-SLVVLAIGSINPGILHVVFPD 725
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
Y RV+ HEA HFL YL+ + Y+L L KE T V+ + + + G
Sbjct: 726 YKQRVVAHEAAHFLAGYLLRVPVANYSLM----LGKE-------HTDLVEAKLQKRLIEG 774
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALL----KGLGFTQKKADSQV 288
+ +++ S IA+AG E + + G AD L ++ L TQ++ +Q
Sbjct: 775 SLEPAQVDQLSVIAMAGATAEAMKFEEVIGQNADFFDLQRIMSRQQPKLNNTQQQ--NQT 832
Query: 289 RWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
RW+ LLR + L AM G V CI IE
Sbjct: 833 RWASYQAASLLRTYSKEYEALQEAMARGAGVVDCIKAIE 871
>gi|86605114|ref|YP_473877.1| hypothetical protein CYA_0396 [Synechococcus sp. JA-3-3Ab]
gi|86553656|gb|ABC98614.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 247
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/191 (27%), Positives = 86/191 (45%), Gaps = 10/191 (5%)
Query: 148 VDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALK 207
V+ V++ G + L L + S Y RVI HEAGH ++A+L+GI +GY L ++A +
Sbjct: 39 VEQVAFGGRLSGL-LGALLDWASPAYRRRVICHEAGHIVVAHLLGIPLRGYVLGPWQAFR 97
Query: 208 KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADI 267
+ G G + L R+S +AG A E L YG G D
Sbjct: 98 H--GIPGYGGVQLDQAPMRTWEAQGGIPWADLERYSLFWMAGAAAESLQYGQVVGDEEDR 155
Query: 268 NKLDALLKGLGFTQK-KADSQVRWSLLNTVL-----LLRRHKGARAKLAVAMTMGKSVGS 321
+L LL G+ Q + +Q+R + +N L LL+ H A A + G +
Sbjct: 156 QQLAKLLAGISAAQPLTSAAQMR-AYINRFLRQSRALLQAHPEAYAVAVRHLEQGSPLQD 214
Query: 322 CIDIIEDNIDD 332
C+ + ++++
Sbjct: 215 CLAALAASLEE 225
>gi|356525531|ref|XP_003531378.1| PREDICTED: uncharacterized protein LOC100306288 [Glycine max]
Length = 328
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 90/201 (44%), Gaps = 19/201 (9%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + ++ +GS T Y R+ +HEA HFLIAY
Sbjct: 143 FVPYLIGSISLVVLAVGSISPGLLQAAIGSF------STLFPDYQERIARHEAAHFLIAY 196
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ Y+L + KE +D + + SG++ A L+R + +++AG
Sbjct: 197 LLGLPIFDYSLD----IGKE-------HVNLIDERLEKLIYSGQLDAKELDRLAVVSMAG 245
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLLRRHKGARA 307
+A E L Y G AD+ L + K Q RW++L +L+ +K
Sbjct: 246 LAAEGLTYDKVVGQSADLFSLQRFINRTKPPLSKDQQQNLTRWAVLFAASILKNNKVTHE 305
Query: 308 KLAVAMTMGKSVGSCIDIIED 328
L AM SV CI +IE+
Sbjct: 306 ALMAAMAKKASVLECIRVIEN 326
>gi|219129470|ref|XP_002184911.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403696|gb|EEC43647.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 207
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 72/157 (45%), Gaps = 20/157 (12%)
Query: 174 HNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGK 233
R+ +HEA HF Y G+ KGY+ V+ G A V+F VN+ +
Sbjct: 69 QERICRHEAAHFCCGYWCGLPIKGYS--------------VENGIAKVEFG----VNTQQ 110
Query: 234 VSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK-GLGFTQKKADSQV-RWS 291
++AT + S AL+G+ E +G A G +D+ L+ + + F A + RW
Sbjct: 111 LTATEVAALSVTALSGLVAEAQTFGKAVGAESDLLTLEMVFRQSADFIGAAAQQDLTRWG 170
Query: 292 LLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
LN LLL+ H ++ A +SV C+ I+E
Sbjct: 171 ALNAALLLKEHNAKYEQVVQAFASQQSVEDCVAILES 207
>gi|224083801|ref|XP_002307128.1| predicted protein [Populus trichocarpa]
gi|222856577|gb|EEE94124.1| predicted protein [Populus trichocarpa]
Length = 321
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 75/158 (47%), Gaps = 15/158 (9%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
Y R+ +HEA HFLI YL+G+ Y+L + KE +D + + + SG
Sbjct: 173 YQERIARHEAAHFLIGYLIGLPILDYSLD----IGKE-------HVNLIDEKLEKLIYSG 221
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ---VR 289
++ A L+R + +A+AG+A E L Y G AD+ L + Q D Q R
Sbjct: 222 QLDAKELDRLAVVAMAGLAAEGLQYDKVVGQSADLFTLQRFINR-SKPQISKDQQQNLTR 280
Query: 290 WSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
W++L L++ +K L AM+ +V CI IE
Sbjct: 281 WAVLFGGSLIKNNKSLHEALMTAMSNKATVLECIQAIE 318
>gi|168032306|ref|XP_001768660.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680159|gb|EDQ66598.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 360
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 101/224 (45%), Gaps = 29/224 (12%)
Query: 115 LAAIVG-----GVSAWNVFG----FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTI 165
LA++VG V+A + G F P + LSL +L + +V G+ +V+D
Sbjct: 153 LASVVGVTSVIAVAAGQLPGDWGFFVPYLVGSLSLVVLAVGSVAP-----GLFQVVIDKF 207
Query: 166 GHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEF 225
T Y R+++HEA HFL AY+VG+ Y+L L KE + T +E
Sbjct: 208 SST-QGDYKERILRHEAAHFLTAYMVGLPIVDYSLD----LGKEHVNLLNEATEKKIYEG 262
Query: 226 LEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKAD 285
L+ + L+R + +++AG+A E L Y G AD+ L LL K +
Sbjct: 263 LD--------SNLLDRLAVVSMAGLAAEGLKYDKVMGQSADLFSLQRLLNRSKQPLKNVE 314
Query: 286 SQ--VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
Q RW++ L++ + A L AM SV CI IE
Sbjct: 315 QQNLTRWAVYYAASLIKNNAKAYEALMEAMAKQLSVSECIKAIE 358
>gi|297821399|ref|XP_002878582.1| hypothetical protein ARALYDRAFT_900627 [Arabidopsis lyrata subsp.
lyrata]
gi|297324421|gb|EFH54841.1| hypothetical protein ARALYDRAFT_900627 [Arabidopsis lyrata subsp.
lyrata]
Length = 333
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 79/162 (48%), Gaps = 13/162 (8%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
TF Y R+ HEA HFL+AYL+G+ GY+L + KE +D +
Sbjct: 180 TFFPDYQERIAAHEAAHFLVAYLIGLPILGYSLD----IGKE-------HVNLIDERLAK 228
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKAD 285
+ SGK+ + L+R + +A+AG+A E L Y G AD+ L + + ++
Sbjct: 229 LIYSGKLDSKELDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQ 288
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
+ RW++L + LL+ +K L AM+ SV CI IE
Sbjct: 289 NLTRWAVLYSASLLKNNKTIHEALMAAMSKNASVLECIQTIE 330
>gi|18399842|ref|NP_565523.1| uncharacterized protein [Arabidopsis thaliana]
gi|14334630|gb|AAK59493.1| unknown protein [Arabidopsis thaliana]
gi|20198006|gb|AAD20413.2| expressed protein [Arabidopsis thaliana]
gi|23296622|gb|AAN13134.1| unknown protein [Arabidopsis thaliana]
gi|330252152|gb|AEC07246.1| uncharacterized protein [Arabidopsis thaliana]
Length = 332
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 79/162 (48%), Gaps = 13/162 (8%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
TF Y R+ HEA HFL+AYL+G+ GY+L + KE +D +
Sbjct: 179 TFFPDYQERIAAHEAAHFLVAYLIGLPILGYSLD----IGKE-------HVNLIDERLAK 227
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKAD 285
+ SGK+ + L+R + +A+AG+A E L Y G AD+ L + + ++
Sbjct: 228 LIYSGKLDSKELDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQ 287
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
+ RW++L + LL+ +K L AM+ SV CI IE
Sbjct: 288 NLTRWAVLYSASLLKNNKTIHEALMAAMSKNASVLECIQTIE 329
>gi|21592852|gb|AAM64802.1| unknown [Arabidopsis thaliana]
Length = 332
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 78/162 (48%), Gaps = 13/162 (8%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
TF Y R+ HEA HFL+AYL+G+ GY+L + KE +D +
Sbjct: 179 TFFPDYQERIAAHEAAHFLVAYLIGLPILGYSLD----IGKE-------HVNLIDERLAK 227
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLK--GLGFTQKKAD 285
+ SGK+ + L+R + +A+AG+A E L Y G AD+ L + + ++
Sbjct: 228 LIYSGKLDSKELDRLAAVAMAGLAAEGLKYDKVIGQSADLFSLQRFINRSQPKISNEQQQ 287
Query: 286 SQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
+ RW+ L + LL+ +K L AM+ SV CI IE
Sbjct: 288 NLTRWAXLYSASLLKNNKTIHEALMAAMSKNASVLECIQTIE 329
>gi|449461419|ref|XP_004148439.1| PREDICTED: uncharacterized protein LOC101209062 [Cucumis sativus]
gi|449515690|ref|XP_004164881.1| PREDICTED: uncharacterized LOC101209062 [Cucumis sativus]
Length = 320
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 79/164 (48%), Gaps = 17/164 (10%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
+F Y R+ HEA HFL+AYL+G+ Y+ + KE +D +
Sbjct: 167 SFFPDYQERIAGHEAAHFLVAYLLGLPILDYSTD----IGKE-------HVNLIDERLEK 215
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALL----KGLGFTQKK 283
+ SG++ L+R + +A+AG+A+E L Y G AD+ L + LG Q++
Sbjct: 216 LIYSGQLDDKELDRLAVVAMAGLASEGLKYDKVVGQSADLFTLQRFINRSKPKLGKDQQQ 275
Query: 284 ADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
+ RW++L + LL+ +K L AM+ SV C + IE
Sbjct: 276 --NLTRWAVLFSGSLLKNNKLIHEALIKAMSEKASVIECFEAIE 317
>gi|308813674|ref|XP_003084143.1| unnamed protein product [Ostreococcus tauri]
gi|116056026|emb|CAL58559.1| unnamed protein product [Ostreococcus tauri]
Length = 340
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 73/159 (45%), Gaps = 14/159 (8%)
Query: 172 KYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNS 231
+Y R+ +HEAGHFL+ Y G+ Y L E +N F++ + ++
Sbjct: 189 EYRRRIARHEAGHFLVGYACGVPVGSYNLGI-----DESHVN------FMESKLERKIFQ 237
Query: 232 G-KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--V 288
G K+S++ + + I++AGVA E + + G AD+ L +L + A Q
Sbjct: 238 GAKLSSSEVLPLAVISMAGVAAEAMEFEEVMGQSADLFDLQRILNRVDPKLNDASQQEIT 297
Query: 289 RWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
RW++ +L ++ A L M G SV C+ IE
Sbjct: 298 RWAVFQAATILSENRAAFDALTEKMQEGASVVDCLQTIE 336
>gi|145355993|ref|XP_001422227.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582467|gb|ABP00544.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 246
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 77/159 (48%), Gaps = 14/159 (8%)
Query: 172 KYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNS 231
+Y R+ +HEAGHFL+ Y +G+ P G S++ + +N F++ + ++
Sbjct: 95 EYRRRIARHEAGHFLVGYALGV-PVG----SYDLGIDQSHVN------FMESKLERKIFQ 143
Query: 232 G-KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--V 288
G K+S + + + ++++GVA E + + G AD+ L +L + A Q
Sbjct: 144 GAKLSESEMLPLAVVSMSGVAAEAMEFDEVMGQSADLYDLQRILNRVDPKLSDASQQELT 203
Query: 289 RWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
RW++ +L+ +K A L M G SV C+ IE
Sbjct: 204 RWAVFQAASILKTNKSAWDTLTEKMLDGASVVECLKTIE 242
>gi|297805800|ref|XP_002870784.1| hypothetical protein ARALYDRAFT_916373 [Arabidopsis lyrata subsp.
lyrata]
gi|297316620|gb|EFH47043.1| hypothetical protein ARALYDRAFT_916373 [Arabidopsis lyrata subsp.
lyrata]
Length = 70
Score = 55.8 bits (133), Expect = 3e-05, Method: Composition-based stats.
Identities = 27/67 (40%), Positives = 41/67 (61%)
Query: 269 KLDALLKGLGFTQKKADSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
+L +L+ LGFT+ + ++ +RW++ NTV LL H AR LA A+ K +G+CI+ IE
Sbjct: 4 QLIDVLRWLGFTETEKEAHIRWAVSNTVSLLHSHSEARVSLAEAIAKAKPIGACIEAIES 63
Query: 329 NIDDTDI 335
I I
Sbjct: 64 AISRHQI 70
>gi|414585198|tpg|DAA35769.1| TPA: hypothetical protein ZEAMMB73_590939 [Zea mays]
Length = 292
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 74/165 (44%), Gaps = 19/165 (11%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + +V +G+ T Y R+ +HEA HFL+AY
Sbjct: 130 FVPYLIGSISLIVLAVGSVAPGLLQAAIGAF------STVFPDYQERIARHEAAHFLVAY 183
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAG 249
L+G+ GY+L + KE +D + + + SG++ L+R + +++AG
Sbjct: 184 LIGLPILGYSLD----IGKE-------HVNLIDEQLQKLIYSGQLDGKELDRLAVVSMAG 232
Query: 250 VATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSL 292
+A E L Y G AD+ L + K Q RW++
Sbjct: 233 LAAEGLEYDKVVGQSADLFTLQRFINRTKPQLSKDQQQNLTRWAV 277
>gi|397633736|gb|EJK71112.1| hypothetical protein THAOC_07480 [Thalassiosira oceanica]
Length = 401
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 68/160 (42%), Gaps = 16/160 (10%)
Query: 170 SQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEV 229
+++ R+ +HEAGHFL +L G+ K Y NV T EF
Sbjct: 253 AEQQRERICRHEAGHFLCGWLCGLPIKSY--------------NVNDDTGVACVEFHTGG 298
Query: 230 NSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKL-DALLKGLGFT-QKKADSQ 287
++S + S +A++G E L + A+GG D+ +L + K F K
Sbjct: 299 AGQELSDEEIASLSVVAMSGSVAEILNFDVAKGGENDLLELQNCFRKSKEFIGAAKQQDL 358
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
RW L + LL+ + KL A KS+ C+ ++E
Sbjct: 359 TRWGALTSYNLLKSNAQKYEKLVDAFKQRKSLSDCVALLE 398
>gi|224010159|ref|XP_002294037.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970054|gb|EED88392.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 207
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 63/148 (42%), Gaps = 16/148 (10%)
Query: 182 AGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNR 241
AGHFL YL G+ K Y + G A V EF + N G+++ +
Sbjct: 71 AGHFLCGYLCGLPVKNYEIVD------------STGVACV--EFHTDGNGGELTEDDVAA 116
Query: 242 FSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKADSQ--VRWSLLNTVLLL 299
S +A++G E + Y A GG D+ +L + A Q RW L + LL
Sbjct: 117 LSVVAMSGSVAEIIAYDKATGGENDLIELQNCFRKSKEFIGAARQQDLTRWGALTSYELL 176
Query: 300 RRHKGARAKLAVAMTMGKSVGSCIDIIE 327
R++ KL A KS+ C+ IIE
Sbjct: 177 RKNSEVYEKLVKAFGEKKSLSECVSIIE 204
>gi|219122870|ref|XP_002181760.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407036|gb|EEC46974.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 542
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 58/104 (55%), Gaps = 12/104 (11%)
Query: 237 TTLNRFSCIALAGVATEYLLY----------GYAEGGLADINKLDALLKGL--GFTQKKA 284
TL++ S I++AGV +E L + G AEGG+AD+N+L + + +++
Sbjct: 347 ATLDQLSAISVAGVCSEILAFVCIRLFLWAIGNAEGGVADLNQLRQIYRSAEPSISERDE 406
Query: 285 DSQVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
++++R++L T+ LRRH GA L+ M ++ C+ IE+
Sbjct: 407 ENRIRFALGYTMSQLRRHLGALDALSKIMERDGTIAECVAAIEN 450
>gi|148242741|ref|YP_001227898.1| hypothetical protein SynRCC307_1642 [Synechococcus sp. RCC307]
gi|147851051|emb|CAK28545.1| Uncharacterized conserved membrane protein [Synechococcus sp.
RCC307]
Length = 228
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 71/141 (50%), Gaps = 12/141 (8%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P I ++ L + +VD+ ++NG G L+ + + Q+ R+ HEAGH L
Sbjct: 38 MVGVSPAWITVAVISALVMVSVDAAAWNGRGGHLLAEAL--PGGQERLRRIALHEAGHRL 95
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLN-VQAGTAFVDFEFLEEVNSGKVSATTLNRFSCI 245
+A G + G TL GSL ++AG L S K+ L R+S +
Sbjct: 96 LASHEG-MELGDTLV--------GSLACLRAGHDACGRTTLPLPASSKLPLEELRRWSRV 146
Query: 246 ALAGVATEYLLYGYAEGGLAD 266
LAG+A E +++G ++GG+ D
Sbjct: 147 LLAGMAAEQVVHGGSDGGMDD 167
>gi|116789848|gb|ABK25412.1| unknown [Picea sitchensis]
Length = 319
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/93 (35%), Positives = 50/93 (53%), Gaps = 7/93 (7%)
Query: 149 DSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKK 208
DSV Y GG G + ++ Y R++ HEAGH L AYL+G +G L +A++
Sbjct: 231 DSV-YLGGAGLAQVSSLW----PPYRRRILVHEAGHILTAYLLGCPVRGVILEPLQAVQI 285
Query: 209 EGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNR 241
+ QAGT F D E+N G++S+ + +R
Sbjct: 286 --GIQGQAGTQFWDETLESELNRGQLSSASFDR 316
>gi|255575563|ref|XP_002528682.1| conserved hypothetical protein [Ricinus communis]
gi|223531905|gb|EEF33721.1| conserved hypothetical protein [Ricinus communis]
Length = 364
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 52/100 (52%), Gaps = 11/100 (11%)
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
T Y R+ +HEA HFLIAYL+G+ Y+L + KE +D + +
Sbjct: 164 TLFPDYQERIARHEAAHFLIAYLLGLPILDYSLD----IGKE-------HVNLIDEKLEK 212
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADI 267
+ SG++ + L+R + +A+AG+A E L Y G AD+
Sbjct: 213 LIYSGQLDSKELDRLAVVAMAGLAAEGLQYDKVVGQSADL 252
>gi|33866335|ref|NP_897894.1| hypothetical protein SYNW1803 [Synechococcus sp. WH 8102]
gi|33639310|emb|CAE08318.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 215
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/180 (30%), Positives = 83/180 (46%), Gaps = 12/180 (6%)
Query: 99 LSPVDE-TLGSIERNLQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGV 157
++P E + G++ L +AAI G + V G +P I+ L + +VD+ S+ G
Sbjct: 1 MAPAGEGSTGTLRAGLVVAAITGLGAFGPVLGLSPAWIVVAVGGGLVMLSVDAASWQGMG 60
Query: 158 GSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAG 217
G ++ + + + R+ HEAGH LIA +P L A + G L +
Sbjct: 61 GHVLAEAL--PGGEGRLRRIAVHEAGHLLIAEQE-QMPVQRVLVGTLACLQAG-LRSRGA 116
Query: 218 TAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGL 277
T F NS ++ L R+S + AG+A E ++YG A GG AD L L GL
Sbjct: 117 TEF------PVPNSVRMPLEDLRRWSRVLQAGIAAETVVYGKARGG-ADDRALLGQLWGL 169
>gi|428181367|gb|EKX50231.1| hypothetical protein GUITHDRAFT_151256 [Guillardia theta CCMP2712]
Length = 411
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 47/189 (24%), Positives = 80/189 (42%), Gaps = 28/189 (14%)
Query: 156 GVGSLVLDTIGHTFS--------QKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALK 207
GVGS+ IG S R I+HEA H L Y+ G+ +GY + K
Sbjct: 232 GVGSVAPSLIGGVISGLQWQFDKTNTMERRIRHEAAHILAGYMCGLPIEGYEVEPMPMCK 291
Query: 208 ----KEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGG 263
+EG+++ D E ++ + +++ + + L+GV E LY A GG
Sbjct: 292 FYDRREGNID--------DVEAWKKARP--FNEEEVDKLAVVCLSGVMGELSLYERAAGG 341
Query: 264 LADINKLDALLKGLGFTQKKADSQV-----RWSLLNTVLLLRRHKGARAKLAVAMTMGKS 318
D+ +L + ++K +++V RW + LL + + +L + G S
Sbjct: 342 QQDLEQLQEVY-FRAESEKLRNNRVREETTRWGAMKARTLLEENNDSFMRLCKQLEKGAS 400
Query: 319 VGSCIDIIE 327
+ CI IE
Sbjct: 401 IEECIASIE 409
>gi|78185262|ref|YP_377697.1| hypothetical protein Syncc9902_1695 [Synechococcus sp. CC9902]
gi|78169556|gb|ABB26653.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 215
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 78/170 (45%), Gaps = 15/170 (8%)
Query: 112 NLQLAAIVGGVSAWNVFG----FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGH 167
L+ +V G++ FG + I+ L L ++VD+ S+ G G ++ + +
Sbjct: 11 TLRAGLVVAGITGLGAFGPAMGVSSAWIVVFVLGGLIAFSVDAASWQGMGGHILAEAL-- 68
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
Q+ R+ HEAGH LIA LP L A + G L T FV E
Sbjct: 69 PGGQERLRRIAVHEAGHLLIAE-TEALPVQRVLVGTLACVQAG-LASNGATEFVVPE--- 123
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGL 277
S +++ L R+S + AG+A E L++G A GG AD L A L G+
Sbjct: 124 ---SVRMTLEDLRRWSRVLQAGIAAEKLVFGKARGG-ADDRALLARLWGM 169
>gi|303283596|ref|XP_003061089.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457440|gb|EEH54739.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 387
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/126 (30%), Positives = 56/126 (44%), Gaps = 25/126 (19%)
Query: 171 QKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDF--EFLEE 228
KY NR+ +HEA HFL+ Y++G+ GY+L G VDF LE
Sbjct: 259 PKYRNRIARHEAAHFLVGYMLGVPVAGYSL--------------GLGKTHVDFLEAKLER 304
Query: 229 VNSGK---VSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQKKAD 285
G +S TL +C+ +AGVA E + + D+ D L L KA+
Sbjct: 305 KIFGPDFAISDKTLLPLACVCMAGVAAEAMAF------PGDVRGQDGDLVDLQRILNKAE 358
Query: 286 SQVRWS 291
++ R S
Sbjct: 359 NKARSS 364
>gi|116072859|ref|ZP_01470124.1| hypothetical protein BL107_10132 [Synechococcus sp. BL107]
gi|116064385|gb|EAU70146.1| hypothetical protein BL107_10132 [Synechococcus sp. BL107]
Length = 215
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 72/159 (45%), Gaps = 14/159 (8%)
Query: 112 NLQLAAIVGGVSAWNVFG----FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGH 167
L+ +V G++ FG + I+ L L ++VD+ S+ G G ++ + +
Sbjct: 11 TLRAGLVVAGITGLGAFGPAMGLSSAWIVVFVLGGLIAFSVDAASWQGMGGHILAEAL-- 68
Query: 168 TFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLE 227
Q+ R+ HEAGH L+A LP L A + G L T FV E
Sbjct: 69 PGGQERLRRIAVHEAGHLLVAE-TEALPVQRVLVGTLACVQAG-LASNGATEFVVPE--- 123
Query: 228 EVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
S +++ L R+S + AG+A E L++G A GG D
Sbjct: 124 ---SVRMTLEDLRRWSRVLQAGIAAEKLVFGKARGGADD 159
>gi|384253932|gb|EIE27406.1| hypothetical protein COCSUDRAFT_52156 [Coccomyxa subellipsoidea
C-169]
Length = 340
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 45/161 (27%), Positives = 74/161 (45%), Gaps = 19/161 (11%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNV-QAGTAFVDFEFLEEVNS 231
Y RV++HEA H L Y+ G+ Y SLN+ QA T F + + +
Sbjct: 193 YRERVLRHEAAHLLAGYIFGVPVTAY------------SLNIGQAHTDFAEAKLQRRLIE 240
Query: 232 GKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALL----KGLGFTQKKADSQ 287
G + ++ + +ALAG A E + Y G AD+ L +L K L +Q++ +
Sbjct: 241 GSLEDNEIDALAVVALAGAAAEAMQYEEVIGQTADLVDLQRILLRSKKKLNSSQQQ--NM 298
Query: 288 VRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIED 328
R ++ LL++++ L AM G S+ C+ IE+
Sbjct: 299 TRLAIFEACSLLKKYEKEYKALQAAMQNGASIAKCVQTIEE 339
>gi|87302666|ref|ZP_01085483.1| hypothetical protein WH5701_12169 [Synechococcus sp. WH 5701]
gi|87283010|gb|EAQ74967.1| hypothetical protein WH5701_12169 [Synechococcus sp. WH 5701]
Length = 221
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 75/172 (43%), Gaps = 16/172 (9%)
Query: 113 LQLAAIVGGVSAWNVFG----FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHT 168
L++ A+V G + V G +P + + L TVDS + G G L+ +T+
Sbjct: 15 LRVGAVVAGGTLLAVLGPALGVSPWLVALTAGGGLTGLTVDSARFGGRGGHLLAETLPGG 74
Query: 169 FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE 228
S+ R+ HEAGH L+A G+ K + S L+ + T F E
Sbjct: 75 LSRL--RRIAIHEAGHALVADQEGLAVKQVLVGSLACLRA--GIGASGSTEF------EP 124
Query: 229 VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGL-GF 279
K+ L R+S + AG+ E LLY A GG AD L L GL GF
Sbjct: 125 PRHAKLPPEDLRRWSRVLQAGMLAERLLYEQARGG-ADDRALLGRLWGLSGF 175
>gi|87123790|ref|ZP_01079640.1| hypothetical protein RS9917_09281 [Synechococcus sp. RS9917]
gi|86168359|gb|EAQ69616.1| hypothetical protein RS9917_09281 [Synechococcus sp. RS9917]
Length = 210
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 70/151 (46%), Gaps = 11/151 (7%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P I A L T+D+ + G G L+ +T+ + R+ HEAGH L
Sbjct: 27 LLGLSPAWIALGLGAGLLGLTLDASQWQGLGGHLLAETL--PGGRARLQRIACHEAGHLL 84
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+A +P L +A + G L T +FE + V ++S L R+S +
Sbjct: 85 VAQ-AEAMPVRRVLVGTKACLQAG-LRSNGAT---EFELPDSV---RLSLEDLRRWSRVL 136
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGL 277
AG+A E L+YG A GG AD L L GL
Sbjct: 137 QAGMAAETLVYGQARGG-ADDRALLGRLWGL 166
>gi|254431154|ref|ZP_05044857.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197625607|gb|EDY38166.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 214
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 68/151 (45%), Gaps = 11/151 (7%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P I + + TVD+ + G G L+ + + ++ R+ HEAGH +
Sbjct: 23 LLGLSPWWIALTAGVSIGGLTVDAARFGGRGGHLLAEAL--PGGERRLRRIAVHEAGHVV 80
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+A + + + S L QAG + L+ S K+ L R+S +
Sbjct: 81 VARAEAMPVRQVLIGSLACL--------QAGLSAGGTTELDPPASAKLPLEDLRRWSRVL 132
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGL 277
AG+A E L+YG ++GG AD L L GL
Sbjct: 133 QAGMAAEELIYGESQGG-ADDRALLGRLWGL 162
>gi|318042021|ref|ZP_07973977.1| hypothetical protein SCB01_09941 [Synechococcus sp. CB0101]
Length = 218
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 49/177 (27%), Positives = 75/177 (42%), Gaps = 10/177 (5%)
Query: 128 FGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLI 187
G +P I + + T+D+ + G G L+ + + + R+ HEAGH L+
Sbjct: 37 LGLSPWLIALAAGGGMVALTLDAARFGGLGGHLLAEALPAGIGRARLRRIAIHEAGHVLV 96
Query: 188 AYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIAL 247
A LP L A +EG L+ T LE K++ L R+S +
Sbjct: 97 AA-DNKLPVKQVLVGSRACLREG-LSANGSTQ------LEPPAHAKLALEDLRRWSRVLQ 148
Query: 248 AGVATEYLLYGYAEGGLADINKLDALLKGL-GFTQKKADSQVRWSLLNTVLLLRRHK 303
AG+ E L+YG + GG AD L L GL G+ A + R + LR+ +
Sbjct: 149 AGMIAEQLVYGGSRGG-ADDRALLGRLWGLSGYDVATAQQEQRRARREVEQWLRQRR 204
>gi|427704496|ref|YP_007047718.1| hypothetical protein Cyagr_3304 [Cyanobium gracile PCC 6307]
gi|427347664|gb|AFY30377.1| hypothetical protein Cyagr_3304 [Cyanobium gracile PCC 6307]
Length = 213
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 82/187 (43%), Gaps = 22/187 (11%)
Query: 113 LQLAAIVGGVSAWNVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQK 172
+ LAA+ G + G +P I L + L TVD+ + G G L+ + + Q
Sbjct: 15 VSLAAVAG-----PLVGLSPWWISLLVVLGLGSLTVDAARFGGRGGHLLAEAL--PGGQG 67
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
R+ HEAGH L A G+ + + S L+ V +G +F + V
Sbjct: 68 RLRRIAVHEAGHALAADSDGLAVRRVLVGSLACLRA----GVDSG-GCTEFALPDSV--- 119
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGF-------TQKKAD 285
++SA L R+S + AG+ E L+YG + GG D L L GF Q++A
Sbjct: 120 RLSAEDLRRWSRVLQAGMVAETLVYGDSVGGDDDRALLGRLWGLSGFDVEMARLEQRRAR 179
Query: 286 SQVRWSL 292
+V +L
Sbjct: 180 REVEQAL 186
>gi|302766932|ref|XP_002966886.1| hypothetical protein SELMODRAFT_408124 [Selaginella moellendorffii]
gi|300164877|gb|EFJ31485.1| hypothetical protein SELMODRAFT_408124 [Selaginella moellendorffii]
Length = 165
Score = 44.7 bits (104), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 10/68 (14%)
Query: 210 GSLNVQAGTAFVDFEFLE---------EVNSGKVSATTLNRFSCIALAGVATEYLLYGYA 260
GS+NVQAGT+FVDF F + + + KV N ++CIALAGV ++L +
Sbjct: 78 GSINVQAGTSFVDFVFQDGRYDLVCVYQSHFSKVRGLWHN-YACIALAGVTIKFLNFRLV 136
Query: 261 EGGLADIN 268
EGG N
Sbjct: 137 EGGYQTSN 144
>gi|37521535|ref|NP_924912.1| hypothetical protein gll1966 [Gloeobacter violaceus PCC 7421]
gi|35212533|dbj|BAC89907.1| gll1966 [Gloeobacter violaceus PCC 7421]
Length = 229
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 36/160 (22%), Positives = 65/160 (40%), Gaps = 10/160 (6%)
Query: 173 YHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG 232
+ RV +HEAGH L+A+ +G+ +GYTL ++E+ ++ +
Sbjct: 69 WRERVARHEAGHLLVAHRLGLPVEGYTLGAWESFRR---------GQGGGGGVILGSPPA 119
Query: 233 KVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINKLDALLKGLGFTQK-KADSQVRWS 291
++ + + LAG E + Y A G + D K+ LL T A + +
Sbjct: 120 RLGIEGIESYCATWLAGALAERMYYSEAVGAVEDQQKVGLLLAAAQRTGAVAARTLRNRA 179
Query: 292 LLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNID 331
+L + LA M G+S+ C+ ++ D
Sbjct: 180 ERRAAAILDAERQTHNALAERMLTGQSLERCLQLLNSTKD 219
>gi|116073414|ref|ZP_01470676.1| hypothetical protein RS9916_33227 [Synechococcus sp. RS9916]
gi|116068719|gb|EAU74471.1| hypothetical protein RS9916_33227 [Synechococcus sp. RS9916]
Length = 229
Score = 43.9 bits (102), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 69/151 (45%), Gaps = 11/151 (7%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P I LL TVD+ ++G G L+ +++ + R+ HEAGH L
Sbjct: 38 LVGISPAWIAVGLGGLLVGLTVDASQWDGLGGHLLAESL--PGGRARLRRIACHEAGHLL 95
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+A LP L A + G V++ A +F V ++ L R+S +
Sbjct: 96 LAR-ENDLPVERVLVGTRACLQAG---VRSNGA-TEFAIPTSV---RMPLEDLRRWSRVL 147
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGL 277
AG+A E L+YG A GG AD L L GL
Sbjct: 148 QAGIAAETLIYGAARGG-ADDRALLGRLWGL 177
>gi|317969208|ref|ZP_07970598.1| hypothetical protein SCB02_06717 [Synechococcus sp. CB0205]
Length = 214
Score = 43.9 bits (102), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 64/151 (42%), Gaps = 9/151 (5%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
V G +P I + L T+D+ + G G L+ + + + R+ HEAGH L
Sbjct: 32 VLGLSPWLIAGAAGGALTALTLDAARFGGLGGHLLAEALPFGLGRSRLRRIAVHEAGHLL 91
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+A LP L A + G L V T LE K+ L R+S +
Sbjct: 92 LAA-ENQLPVKQVLVGSRACLRHG-LRVNGSTE------LEPPAQVKMPLEDLRRWSRVL 143
Query: 247 LAGVATEYLLYGYAEGGLADINKLDALLKGL 277
LAG+ E + YG + GG AD L L GL
Sbjct: 144 LAGMVAEVVEYGGSRGG-ADDRALLGRLWGL 173
>gi|428181632|gb|EKX50495.1| hypothetical protein GUITHDRAFT_135159 [Guillardia theta CCMP2712]
Length = 558
Score = 42.7 bits (99), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 128/300 (42%), Gaps = 33/300 (11%)
Query: 50 GDERAALALVKDLQGKPGGLRCFGAARQI-----PQRLYTL----DELKLNGIETRSLLS 100
GD A + + L+ + G L + +A +I PQR+ ++ D K GI S
Sbjct: 261 GDANATVPALDRLK-QEGKLLWWNSAPRISQDTTPQRMLSVTGMSDPDKQMGISVDSFSK 319
Query: 101 PVDETLGSIERNLQLAAIVGGVSAWNVFGFNPQQIL---YLSLALLFLWTVDSVSYNGGV 157
+ +G + L+ I+ G ++ P + Y++L + TV +G
Sbjct: 320 FRYQGVGVLGITGALSLILAGFPGPWLYLIPPDLLAGYGYITLIINIFVTVFGRQLDGLN 379
Query: 158 GSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVG-----ILP--KGYTLTSFEALKKEG 210
+LD ++ +R ++ EAG F+ AYL G ILP GY++ + K+ G
Sbjct: 380 EKRILDQTSNS-----GDRWVRREAGRFIGAYLCGLPLESILPDRNGYSIVKVFS-KRSG 433
Query: 211 SLNVQAGTAFVDFE-FLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLADINK 269
+ +++ A V + F+ E ++ ++R S + + G EY+ YG A G +
Sbjct: 434 NFDLEKLRASVMGDGFIPE----GLTKQEMDRQSIVQMFGPVAEYIKYGEATFGYRYFRQ 489
Query: 270 LDALLK-GLGFTQKKADS-QVRWSLLNTVLLLRRHKGARAKLAVAMTMGKSVGSCIDIIE 327
LD L ++A Q R+ + + ++++H+ K+ A G I I E
Sbjct: 490 LDLELDLAQSILDRRARQIQARYGITMSFQIIKQHEEGFEKVVDAFKRGCQPAEIIAIFE 549
>gi|88807636|ref|ZP_01123148.1| hypothetical protein WH7805_13833 [Synechococcus sp. WH 7805]
gi|88788850|gb|EAR20005.1| hypothetical protein WH7805_13833 [Synechococcus sp. WH 7805]
Length = 212
Score = 42.4 bits (98), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 49/171 (28%), Positives = 68/171 (39%), Gaps = 22/171 (12%)
Query: 100 SPVDETLGSIERNLQLAAIVGGVSAWNVFG----FNPQQILYLSLALLFLWTVDSVSYNG 155
SPV T + V G +A VFG +P I L L TVD+ G
Sbjct: 4 SPVSSTTAGL--------AVAGCTALAVFGPLVGVSPAWIALLIGGGLLGLTVDASQLEG 55
Query: 156 GVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQ 215
G LV + + + RV +HEAGH+L+A + K + + L E L
Sbjct: 56 MGGHLVAEAL--PGGKARLRRVARHEAGHWLVARDEQLGVKRVLVGTRACL--EAGLRCN 111
Query: 216 AGTAFVDFEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
T F + ++ L R+S + AG+ E LL G A GG D
Sbjct: 112 GATEFT------LPDQARLPLEELRRWSRVLQAGIVAEVLLEGAARGGEDD 156
>gi|352094776|ref|ZP_08955947.1| hypothetical protein Syn8016DRAFT_1291 [Synechococcus sp. WH 8016]
gi|351681116|gb|EHA64248.1| hypothetical protein Syn8016DRAFT_1291 [Synechococcus sp. WH 8016]
Length = 212
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 60/140 (42%), Gaps = 10/140 (7%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P I L TVD+ NG G L+ +++ + RV HEAGH+L
Sbjct: 27 LIGLSPAWIALGLGGALLGLTVDAAQLNGMGGHLLAESL--PGGRNRLRRVAFHEAGHWL 84
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+A + K + + LK N +F + ++S L R+S +
Sbjct: 85 VAQEENLEVKRVLVGTRGCLKAGLRCN-----GVTEFALPDRA---RLSLEDLRRWSRVL 136
Query: 247 LAGVATEYLLYGYAEGGLAD 266
AG+A E LL G +GG D
Sbjct: 137 QAGMAAETLLEGPPQGGEDD 156
>gi|260435816|ref|ZP_05789786.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260413690|gb|EEX06986.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 215
Score = 42.0 bits (97), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 68/158 (43%), Gaps = 14/158 (8%)
Query: 113 LQLAAIVGGVSAWNVFG----FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHT 168
L+ V G++A FG + I+ L +VD+ ++ G G ++ + +
Sbjct: 12 LRAGVAVAGITALGAFGPALGLSAAWIVVAVGGALVTLSVDAATWEGRGGHILAEAL--P 69
Query: 169 FSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEE 228
Q+ R+ HEAGH LIA LP L A + G L T F
Sbjct: 70 GGQERLRRIAVHEAGHVLIAE-AEQLPVQQVLVGTLACVRAG-LRSSGATEFA------V 121
Query: 229 VNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
+S ++ L R+S + AG+A E +++G A GG D
Sbjct: 122 PDSVRMPLEDLRRWSRVLQAGIAAETVVFGKARGGADD 159
>gi|148240151|ref|YP_001225538.1| hypothetical protein SynWH7803_1815 [Synechococcus sp. WH 7803]
gi|147848690|emb|CAK24241.1| Uncharacterized conserved secreted protein [Synechococcus sp. WH
7803]
Length = 212
Score = 41.6 bits (96), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 62/152 (40%), Gaps = 14/152 (9%)
Query: 119 VGGVSAWNVFG----FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYH 174
V G +A VFG +P I L L TVD+ G G LV + + +
Sbjct: 15 VAGCTALAVFGPLVGLSPAWIALLIGGGLLGLTVDASQLEGMGGHLVAEAL--PGGKSRL 72
Query: 175 NRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKV 234
RV +HEAGH+L+A + K + + L E L T F + ++
Sbjct: 73 RRVARHEAGHWLVAREEQMGVKRVLVGTRACL--EAGLRCNGATEFTL------PDQARL 124
Query: 235 SATTLNRFSCIALAGVATEYLLYGYAEGGLAD 266
L R+S + AG+ E L G A GG D
Sbjct: 125 PLEELRRWSRVLQAGMVAEELFEGTARGGEDD 156
>gi|255628109|gb|ACU14399.1| unknown [Glycine max]
Length = 248
Score = 40.8 bits (94), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 55/118 (46%), Gaps = 17/118 (14%)
Query: 130 FNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
F P I +SL +L + ++ +GS + T Y R+ +HEA HFLIAY
Sbjct: 143 FVPYLIGSISLVVLAVGSISPGLLQAAIGSFL------TLFPDYQERIARHEAAHFLIAY 196
Query: 190 LVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIAL 247
L+G+ Y+L + KE +D + + SG++ A L+R + +++
Sbjct: 197 LLGLPIFDYSLD----IGKE-------HVNLIDERLEKLIYSGQLDAKELDRLAVVSM 243
>gi|113953522|ref|YP_731255.1| hypothetical protein sync_2052 [Synechococcus sp. CC9311]
gi|113880873|gb|ABI45831.1| conserved hypothetical protein [Synechococcus sp. CC9311]
Length = 212
Score = 40.8 bits (94), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 59/140 (42%), Gaps = 10/140 (7%)
Query: 127 VFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFL 186
+ G +P I L TVD+ NG G L+ +++ + RV HEAGH+L
Sbjct: 27 LVGLSPAWIALGLGGALLGLTVDAAQLNGMGGHLLAESL--PGGRNRLRRVAFHEAGHWL 84
Query: 187 IAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSGKVSATTLNRFSCIA 246
+A + K + + L QAG + ++S L R+S +
Sbjct: 85 VAQEENLEVKRVLVGTRGCL--------QAGLRCNGVTEFALPDRARLSLEDLRRWSRVL 136
Query: 247 LAGVATEYLLYGYAEGGLAD 266
AG+A E LL G +GG D
Sbjct: 137 QAGMAAETLLDGPPQGGKDD 156
>gi|78212215|ref|YP_380994.1| hypothetical protein Syncc9605_0665 [Synechococcus sp. CC9605]
gi|78196674|gb|ABB34439.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 215
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 71/154 (46%), Gaps = 19/154 (12%)
Query: 128 FGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLI 187
G + I+ L +VD+ ++ G G ++ + + Q+ R+ HEAGH LI
Sbjct: 31 LGLSAAWIVVAVGGALVTLSVDAATWEGMGGHILAEAL--PGGQERLRRIAVHEAGHVLI 88
Query: 188 AYLVGILPKGYTLTSFEALKKEGSLN-VQAG---TAFVDFEFLEEVNSGKVSATTLNRFS 243
A +G L + L G+L V+AG + +F + V ++ L R+S
Sbjct: 89 A-------EGEQLPVQQVLV--GTLACVRAGLRSSGATEFAVPDSV---RMPLEDLRRWS 136
Query: 244 CIALAGVATEYLLYGYAEGGLADINKLDALLKGL 277
+ AG+A E +++G A GG AD L L GL
Sbjct: 137 RVLQAGIAAETVVFGKAHGG-ADDRALLGRLWGL 169
>gi|115380126|ref|ZP_01467161.1| aldo/keto reductase [Stigmatella aurantiaca DW4/3-1]
gi|115362868|gb|EAU62068.1| aldo/keto reductase [Stigmatella aurantiaca DW4/3-1]
Length = 330
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 15/144 (10%)
Query: 62 LQGKPGGLRCFGAARQIPQRLYTL--DELKLNGIETRSLLSPVDETLGSIERNLQLAAIV 119
+QGKP GL R + Q L L D + L + +P++ETLG++ER L+
Sbjct: 93 VQGKPEGLSAPVVERALEQSLERLGTDHVDLFYLHQPDPATPIEETLGAVERLLR----A 148
Query: 120 GGVSAWNVFGFNPQQILYLSLAL----LFLWTVDSVSYNGGVGSLVLDTIGHTFSQKY-- 173
G W V F Q+L L + V V YN V + ++ + F+++Y
Sbjct: 149 GKARHWGVSNFAAWQLLELDTLCDARGMPRPAVSQVLYNLVVRQIEMEYL--PFARRYPV 206
Query: 174 HNRVIQHEAGHFLIA-YLVGILPK 196
H V AG L Y+ G P
Sbjct: 207 HTTVYNPLAGGVLTGRYVRGAAPP 230
>gi|350533375|ref|ZP_08912316.1| putative signal transduction protein [Vibrio rotiferianus DAT722]
Length = 444
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 52/218 (23%), Positives = 91/218 (41%), Gaps = 42/218 (19%)
Query: 126 NVFGFNPQQILYLSLALLFLWTVDSVSYNGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHF 185
N G P+ +++L+L + L SV Y + H+ Q+ +Q+EA
Sbjct: 239 NDTGITPRLVIFLTLFSVLLVITTSVKY--------FNKYRHSEKQRSD---LQNEAQ-- 285
Query: 186 LIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVDFEFLEEVNSG---KVSATTLNRF 242
YL G+L + + FEALK E N A +D +F + +N +V L F
Sbjct: 286 -TDYLTGLLNRRGLESKFEALKAE---NPAGAIAVLDIDFFKRINDSHGHEVGDQVLAHF 341
Query: 243 SCIALAGVATEYLLYGYAEGG------LADINKLDALLKGLGFTQKKADSQVRWSLLNTV 296
S + A ++++Y L GG + +N+ A+L Q+ A + V LN
Sbjct: 342 SKLCSATISSDYSL--ARTGGEEFILLMPHVNEEQAILCCERIRQEVAQTSVVIHELN-- 397
Query: 297 LLLRRHKGARAKLAVAMTMGKSVGSCIDIIEDNIDDTD 334
+ + +++G + S D ++D + D D
Sbjct: 398 ------------IDITVSIGIASFSGFDTMKDALADAD 423
>gi|254437201|ref|ZP_05050695.1| Peptidase family M41 [Octadecabacter antarcticus 307]
gi|198252647|gb|EDY76961.1| Peptidase family M41 [Octadecabacter antarcticus 307]
Length = 716
Score = 38.1 bits (87), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 28/101 (27%), Positives = 40/101 (39%), Gaps = 18/101 (17%)
Query: 163 DTIGHTFSQKYHNRVIQHEAGHFLIAYLVGILPKGYTLTSFEALKKEGSLNVQAGTAFVD 222
D ++ R+ HEAGH L+AY + G+ + SF AL G G
Sbjct: 530 DQFAPRLDEETLKRIAVHEAGHLLVAYTL-----GFEMPSFAALGHSGGRVTWQGAQTYT 584
Query: 223 FEFLEEVNSGKVSATTLNRFSCIALAGVATEYLLYGYAEGG 263
E T ++ + LAG+A E +YG A G
Sbjct: 585 RE-------------TTHKELTVLLAGMAAEIAIYGQASSG 612
>gi|444724628|gb|ELW65228.1| Disintegrin and metalloproteinase domain-containing protein 25
[Tupaia chinensis]
Length = 709
Score = 37.7 bits (86), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 43/96 (44%), Gaps = 17/96 (17%)
Query: 105 TLGSIERNLQLAAIVGGVSAWNVFGFNP-----------QQILYLSLALLFLWTVDSVSY 153
T G++ + +A ++ W + FNP ++ L L ++ TV + +Y
Sbjct: 254 TTGNLIKGNDIATVLRQFQQWKIKSFNPRMKHDVAHLFIKRAFGLDLGHAYIGTVCNYNY 313
Query: 154 NGGVGSLVLDTIGHTFSQKYHNRVIQHEAGHFLIAY 189
N GV SL+ D I H+ ++ HE GH + Y
Sbjct: 314 NCGVDSLMDDNIAK------HSAIVAHEMGHNMGMY 343
>gi|310821690|ref|YP_003954048.1| aldo-keto reductase family [Stigmatella aurantiaca DW4/3-1]
gi|309394762|gb|ADO72221.1| Aldo-keto reductase family [Stigmatella aurantiaca DW4/3-1]
Length = 360
Score = 37.4 bits (85), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 62/144 (43%), Gaps = 15/144 (10%)
Query: 62 LQGKPGGLRCFGAARQIPQRLYTL--DELKLNGIETRSLLSPVDETLGSIERNLQLAAIV 119
+QGKP GL R + Q L L D + L + +P++ETLG++ER L+
Sbjct: 123 VQGKPEGLSAPVVERALEQSLERLGTDHVDLFYLHQPDPATPIEETLGAVERLLR----A 178
Query: 120 GGVSAWNVFGFNPQQILYLSLAL----LFLWTVDSVSYNGGVGSLVLDTIGHTFSQKY-- 173
G W V F Q+L L + V V YN V + ++ + F+++Y
Sbjct: 179 GKARHWGVSNFAAWQLLELDTLCDARGMPRPAVSQVLYNLVVRQIEMEYL--PFARRYPV 236
Query: 174 HNRVIQHEAGHFLIA-YLVGILPK 196
H V AG L Y+ G P
Sbjct: 237 HTTVYNPLAGGVLTGRYVRGAAPP 260
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.386
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,900,659,986
Number of Sequences: 23463169
Number of extensions: 197399669
Number of successful extensions: 509136
Number of sequences better than 100.0: 244
Number of HSP's better than 100.0 without gapping: 188
Number of HSP's successfully gapped in prelim test: 56
Number of HSP's that attempted gapping in prelim test: 508640
Number of HSP's gapped (non-prelim): 255
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)