BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014499
(423 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449462964|ref|XP_004149205.1| PREDICTED: uncharacterized protein At5g03900, chloroplastic-like
[Cucumis sativus]
gi|449500907|ref|XP_004161226.1| PREDICTED: uncharacterized protein At5g03900, chloroplastic-like
[Cucumis sativus]
Length = 516
Score = 588 bits (1517), Expect = e-165, Method: Compositional matrix adjust.
Identities = 301/428 (70%), Positives = 361/428 (84%), Gaps = 11/428 (2%)
Query: 1 MTSISTCFTTTPKSRFFFTPL---RPSINLKPPD-SFPRIQPLPFPRISGKIPGSRVLVP 56
M SIST F + SR +F PL +PSI +KP +FP LP RI+ +R VP
Sbjct: 1 MASISTYFAISQSSRLYFHPLITLKPSICVKPSTITFP---ALP-TRIAPPESRARGFVP 56
Query: 57 VAKASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQK 116
+A D+ + PG +VESDKLP+DVR R M+AV+AC RVTIGDVA +AGLKLNEAQK
Sbjct: 57 TVRAGIDIPSDIRPGNVVESDKLPSDVRKRTMEAVEACGGRVTIGDVASRAGLKLNEAQK 116
Query: 117 ALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVL 176
ALQALAADTDGFLEVSDEGDVLYVFP +YR+KLAAKSF +K EP+I+K+KAAAEY +RV
Sbjct: 117 ALQALAADTDGFLEVSDEGDVLYVFPKDYRSKLAAKSFWIKFEPLIEKSKAAAEYLVRVS 176
Query: 177 FGTALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRR 236
FGTALIASIV+V+T IIA++SS+S++D+RGRR RS+DSGF ++SP+DLFWYWDPYYYRR
Sbjct: 177 FGTALIASIVLVYTTIIALISSRSEEDNRGRRSRSYDSGFTFYLSPTDLFWYWDPYYYRR 236
Query: 237 RRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL 296
RR+QT+D+ KMNFI+S+FSFVFG+GDPNQGIEE+RWKLIG+YI+SNGGVV AEELAPYL
Sbjct: 237 RRLQTEDN--KMNFIESIFSFVFGDGDPNQGIEEERWKLIGQYISSNGGVVAAEELAPYL 294
Query: 297 DI-DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWAD 355
D+ +R DESY+LPVLLRFDGQPEIDEEGNILYRFPS QRTA+SQR GRKEYVGR+WAD
Sbjct: 295 DVSERNTDDESYILPVLLRFDGQPEIDEEGNILYRFPSLQRTASSQRSGRKEYVGRKWAD 354
Query: 356 AIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVA 415
+GG+EKIF+EKKW FSKT+ SER MAIGLGGLNLFGVI+LGAML+++AV P+G +KFV+
Sbjct: 355 WVGGIEKIFKEKKWVFSKTSNSERAMAIGLGGLNLFGVIVLGAMLKDVAVKPSGLIKFVS 414
Query: 416 YIFPLLQL 423
IFPLLQ+
Sbjct: 415 DIFPLLQI 422
>gi|255575701|ref|XP_002528750.1| conserved hypothetical protein [Ricinus communis]
gi|223531844|gb|EEF33662.1| conserved hypothetical protein [Ricinus communis]
Length = 514
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 302/428 (70%), Positives = 353/428 (82%), Gaps = 13/428 (3%)
Query: 1 MTSISTCFTTTPKSRFFFTPLRPSINLKPPDSFPRIQPLPFPRISGKIPGSRVLV-PVAK 59
M SISTCFT +PK+R T +PS LKPPDSF +I R+S K+ RV V +
Sbjct: 1 MASISTCFTISPKTRIILTS-KPSCRLKPPDSFTKI------RLSPKLSDPRVYRGSVIR 53
Query: 60 ASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQ 119
A D+ G+ PG VESDKL ADVR RAM+AVDA RVTIGDVA KAGLKLNEAQKALQ
Sbjct: 54 AGIDLPSGIKPGGAVESDKLRADVRKRAMEAVDAFGGRVTIGDVASKAGLKLNEAQKALQ 113
Query: 120 ALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGT 179
ALAADT+GFLEVSDEGDVLY FP +YR+KLAAKSF++KVEP++DKAKA EY IRV FGT
Sbjct: 114 ALAADTNGFLEVSDEGDVLYAFPKDYRSKLAAKSFKMKVEPLVDKAKATGEYLIRVSFGT 173
Query: 180 ALIASIVIVFTAIIAILSSKSDDDDRGRR-RRSFDSGFNIFISPSDLFWYWDPYYYRRRR 238
ALIASIV+V+T IIA+LSS+S++D+RGRR RS+DSGF + SP+DLFWYWDPYYYRRR+
Sbjct: 174 ALIASIVLVYTTIIALLSSRSEEDNRGRRGGRSYDSGFTFYFSPTDLFWYWDPYYYRRRQ 233
Query: 239 VQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI 298
++ DDDD KMNFI+SVFSFVFG+GDPNQGIEE+RWKLIG+YI+SNGGVV AEELAP+LD+
Sbjct: 234 IKKDDDD-KMNFIESVFSFVFGDGDPNQGIEEERWKLIGQYISSNGGVVAAEELAPFLDL 292
Query: 299 ---DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWAD 355
D+ +DESY+LPVLLRFDGQPEIDEE ILYRFPS QRTA+SQR GRKEY+GRRW D
Sbjct: 293 QTTDKNTNDESYILPVLLRFDGQPEIDEEETILYRFPSLQRTASSQRSGRKEYIGRRWTD 352
Query: 356 AIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVA 415
+GGVEK FRE+KWEFSKT SER M IGLGG+NLFGVI+LGAML+++A P G + FVA
Sbjct: 353 WVGGVEKFFRERKWEFSKTGASERAMVIGLGGINLFGVIVLGAMLKDIAAMPGGLINFVA 412
Query: 416 YIFPLLQL 423
IFPLLQ+
Sbjct: 413 GIFPLLQV 420
>gi|224077276|ref|XP_002305197.1| predicted protein [Populus trichocarpa]
gi|222848161|gb|EEE85708.1| predicted protein [Populus trichocarpa]
Length = 374
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 285/371 (76%), Positives = 329/371 (88%), Gaps = 7/371 (1%)
Query: 59 KASTDVA--VGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQK 116
KA+ DV +G+ PG +VE+DKLP+DVRNRAM+AVDAC RVTIGDVA +AGLKLNEAQK
Sbjct: 5 KATADVTKTMGIRPGSVVETDKLPSDVRNRAMEAVDACGGRVTIGDVASRAGLKLNEAQK 64
Query: 117 ALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVL 176
ALQALA+DTDGFLEVSDEGDVLYVFP +YR+KLAAKS RLK EP+ +K KAAAEY IRV
Sbjct: 65 ALQALASDTDGFLEVSDEGDVLYVFPKDYRSKLAAKSLRLKFEPLFEKGKAAAEYLIRVS 124
Query: 177 FGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPYYYR 235
FGTALIASIVIV+T IIAILSS D++DRGRRR RSFD+GF ++SP+DLFWYWDPYYYR
Sbjct: 125 FGTALIASIVIVYTTIIAILSSSRDENDRGRRRSRSFDTGFAFYLSPTDLFWYWDPYYYR 184
Query: 236 RRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPY 295
RR+++TD D KMNFI+SVFSFVFG+GDPNQGIEE+RWKLIG+YI+SNGGVV AEELAP+
Sbjct: 185 RRQLRTDGGD-KMNFIESVFSFVFGDGDPNQGIEEERWKLIGQYISSNGGVVAAEELAPF 243
Query: 296 LDIDRT--MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRW 353
LD+ T MSDESY+LPVLLRFDG+PEIDEEGNILY+FPS QRTA+S+R GRKEYVG+RW
Sbjct: 244 LDLKTTEDMSDESYILPVLLRFDGKPEIDEEGNILYQFPSLQRTASSKRSGRKEYVGKRW 303
Query: 354 ADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN-GFLK 412
AD +GGV K FREK W+FSKT+ SER MAIGLGGLNLFGVIILG MLQ+MA+T N GF+K
Sbjct: 304 ADWVGGVGKFFREKTWQFSKTSSSERAMAIGLGGLNLFGVIILGTMLQDMAITQNGGFIK 363
Query: 413 FVAYIFPLLQL 423
FV+ IFPLLQ+
Sbjct: 364 FVSSIFPLLQV 374
>gi|225440882|ref|XP_002276711.1| PREDICTED: uncharacterized protein At5g03900, chloroplastic-like
[Vitis vinifera]
Length = 495
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 282/376 (75%), Positives = 333/376 (88%), Gaps = 6/376 (1%)
Query: 52 RVLVPVAKASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKL 111
+V VPV +A DVA G+ PG IVE+DKLP++VR RAMDAVDAC RVTIGDVA K GLKL
Sbjct: 28 QVFVPVVRAGLDVASGIRPGGIVETDKLPSNVRKRAMDAVDACGGRVTIGDVASKGGLKL 87
Query: 112 NEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEY 171
NEAQKALQALAADT+GFLEVSDEGDVLYVFP +YR+KLAAKSFR+K+EP ++KAK+AAEY
Sbjct: 88 NEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFRIKLEPFVEKAKSAAEY 147
Query: 172 SIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRR-RRSFDSGFNIFISPSDLFWYWD 230
+RV FGTALIASIV+V+T IIA+LSS+SD+D+RGRR RS+DSGF +++P+DLFWYWD
Sbjct: 148 LVRVSFGTALIASIVLVYTTIIALLSSRSDEDNRGRRGGRSYDSGFTFYLNPADLFWYWD 207
Query: 231 PYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAE 290
PYYYRRRR+Q +DD MNFI+SVFSFVFG+GDPNQGIE++RWKLIG+YI+SNGGVVTAE
Sbjct: 208 PYYYRRRRIQKEDD--GMNFIESVFSFVFGDGDPNQGIEDERWKLIGQYISSNGGVVTAE 265
Query: 291 ELAPYLDI---DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKE 347
ELAPYLD+ D + DESY+LPVLLRF+GQPE+DEEGNILYRFPS QRTA+SQR GRKE
Sbjct: 266 ELAPYLDLETADNNLVDESYILPVLLRFEGQPEVDEEGNILYRFPSLQRTASSQRSGRKE 325
Query: 348 YVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTP 407
YVG+RW D +GGVEK F+EKKW+FSKT+ SER M IGLGGLNLFGVIILG ML+ +AVTP
Sbjct: 326 YVGKRWTDWVGGVEKFFKEKKWQFSKTSNSERAMVIGLGGLNLFGVIILGTMLKNVAVTP 385
Query: 408 NGFLKFVAYIFPLLQL 423
+GF+ FV+ IFPLLQ+
Sbjct: 386 SGFITFVSDIFPLLQI 401
>gi|357503799|ref|XP_003622188.1| hypothetical protein MTR_7g030010 [Medicago truncatula]
gi|355497203|gb|AES78406.1| hypothetical protein MTR_7g030010 [Medicago truncatula]
Length = 507
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 296/431 (68%), Positives = 343/431 (79%), Gaps = 26/431 (6%)
Query: 1 MTSISTCFTTTPKSRFFFTPLRPSINLKPPDSFPRIQPLPFP---RISGKIPGSRVLVPV 57
M +I TCF TP SR + P P +P FP RI+ + G ++VP
Sbjct: 1 MATIPTCFAITPTSRL--------LTFTAP---PFHKPFIFPQNRRINKR--GWALVVP- 46
Query: 58 AKASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKA 117
+A+ DV G+ PG +VESDKL +DVR R MDAVD C RVT+GDVA +AGLKLNEAQKA
Sbjct: 47 -RAAVDVGRGIRPGGVVESDKLSSDVRKRTMDAVDGCGGRVTVGDVASRAGLKLNEAQKA 105
Query: 118 LQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLF 177
LQALAADTDGFLEVS+EGDVLYVFP NYR+KL AKSFR+K EP I+KAK A EY IRV F
Sbjct: 106 LQALAADTDGFLEVSEEGDVLYVFPKNYRSKLGAKSFRIKAEPFIEKAKGAGEYLIRVSF 165
Query: 178 GTALIASIVIVFTAIIAIL-SSKSDDDDRGRR-RRSFDSGFNIFISPSDLFWYWDPYYYR 235
GTALIASIVIV+TAIIA++ SS+S+DD+RGRR RS+DSGFN + +P DLFWYWDPYY R
Sbjct: 166 GTALIASIVIVYTAIIALVTSSRSEDDNRGRRGGRSYDSGFNFYFNPVDLFWYWDPYYNR 225
Query: 236 RRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPY 295
RRRVQ DD+ K NFI+SVFSFVFG+GDPNQGIEE+RWKLIG+YIASNGGVV AEELAPY
Sbjct: 226 RRRVQVDDN--KTNFIESVFSFVFGDGDPNQGIEEERWKLIGQYIASNGGVVAAEELAPY 283
Query: 296 LDID---RTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRR 352
LDID R DESY+LPVLLRFDGQP +DEEGNILYRFPS QRT ASQ+ RKEYVG+R
Sbjct: 284 LDIDSTERIKDDESYILPVLLRFDGQPVVDEEGNILYRFPSLQRT-ASQKSKRKEYVGKR 342
Query: 353 WADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLK 412
WAD +GGVEK F EK+W+FSKT+ SER M +GLGGLNLFGVI+LG ML+E+AV P+ F+K
Sbjct: 343 WADWVGGVEKFFEEKRWQFSKTSSSERAMVVGLGGLNLFGVIVLGTMLKEVAVRPDSFIK 402
Query: 413 FVAYIFPLLQL 423
FVA IFPLLQ+
Sbjct: 403 FVADIFPLLQI 413
>gi|356537645|ref|XP_003537336.1| PREDICTED: uncharacterized protein At5g03900, chloroplastic-like
[Glycine max]
Length = 505
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 289/429 (67%), Positives = 334/429 (77%), Gaps = 24/429 (5%)
Query: 1 MTSISTCFTTTPKSRFFFTPLRPSINLKPPDSFPRIQPLP-FPRISGKIPGSRVLVPVAK 59
M SISTC T TP R L S K +FP +Q P ++ ++ SRVLV
Sbjct: 1 MASISTCITVTPTCRLARRLL--SFTSKQAIAFPSVQQNPVIGGVTKRVWDSRVLV---- 54
Query: 60 ASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQ 119
GPG VE+DKLP+DVR R MDAVD C R+TIGDVA +AGL LN+AQKALQ
Sbjct: 55 --------AGPGGAVETDKLPSDVRKRTMDAVDECGGRMTIGDVASRAGLNLNQAQKALQ 106
Query: 120 ALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGT 179
ALAADT+GFLEVS+EGDVLYVFP +YR++L AKSFR+K EP +KAKAA EY IRV FGT
Sbjct: 107 ALAADTNGFLEVSEEGDVLYVFPKDYRSRLGAKSFRIKAEPFFEKAKAAGEYFIRVSFGT 166
Query: 180 ALIASIVIVFTAIIAIL-SSKSDDDDRGRR-RRSFDSGFNIFISPSDLFWYWDPYYYRRR 237
ALIASIVIV+T IIA++ SS+S++D+RGRR RS+DSGF + +P DLFWYWDPYYYRR+
Sbjct: 167 ALIASIVIVYTTIIALVTSSRSEEDNRGRRGGRSYDSGFTFYFNPVDLFWYWDPYYYRRQ 226
Query: 238 RVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD 297
R Q DDD KMNFI+SVFSFVFG+GDPNQGIEE+RWKLIG+YIASNGGVV AEELAPYLD
Sbjct: 227 RPQADDD--KMNFIESVFSFVFGDGDPNQGIEEERWKLIGQYIASNGGVVAAEELAPYLD 284
Query: 298 IDRT---MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWA 354
ID T DESY+LPVLLRFDGQPE+DEEGNILYRFPS Q T ASQ+ RKEYVGRRWA
Sbjct: 285 IDSTEGIKDDESYILPVLLRFDGQPEVDEEGNILYRFPSLQ-TTASQKSKRKEYVGRRWA 343
Query: 355 DAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFV 414
D + G+EK F+EKKW+FS T SER M IGLGGLNLFGVIILG ML++ AV P+ F+KFV
Sbjct: 344 DWV-GIEKFFKEKKWQFSITGTSERAMVIGLGGLNLFGVIILGTMLKDTAVAPSSFIKFV 402
Query: 415 AYIFPLLQL 423
A IFPLLQ+
Sbjct: 403 ADIFPLLQI 411
>gi|312283379|dbj|BAJ34555.1| unnamed protein product [Thellungiella halophila]
Length = 523
Score = 526 bits (1355), Expect = e-147, Method: Compositional matrix adjust.
Identities = 268/433 (61%), Positives = 332/433 (76%), Gaps = 14/433 (3%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRPSINLKPP----DSFPRIQPLPFPRISGKIPGSRVLV 55
MT +STC +PK ++ F+ +P I L+ P +FP I FP + + V
Sbjct: 1 MTCVSTCLIVSPKLTQSGFSSKKPVIRLRSPVDRCYAFPGIFTKRFPSSRREFTSHGIAV 60
Query: 56 PVAKASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQ 115
A + V+ + G +VESDKLP DVR RAM+AVD C RRVT+GDVA +AGLK+ EAQ
Sbjct: 61 VRAASIDKVSGAIKLGGLVESDKLPTDVRKRAMEAVDECGRRVTVGDVASRAGLKVTEAQ 120
Query: 116 KALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRV 175
KALQA+AADTDGFLEVSDEGDVLYVFP +YR+KLA KS R+++EP ++KAK A +Y RV
Sbjct: 121 KALQAIAADTDGFLEVSDEGDVLYVFPRDYRSKLATKSLRIQIEPFLEKAKGAVDYLTRV 180
Query: 176 LFGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPYYY 234
FGTALIASIVIV+T II +LSS+S+DD+R RRR R +DSGFN FI+P DLFWYWDP YY
Sbjct: 181 SFGTALIASIVIVYTTIIVLLSSRSEDDNRQRRRGRGYDSGFNFFINPVDLFWYWDPNYY 240
Query: 235 RRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAP 294
RRR + +D+ K MNFI+SVFSFVFG+GDPN+G EE+RW++IG YI S GGVV A+ELAP
Sbjct: 241 SRRRAR-EDEGKGMNFIESVFSFVFGDGDPNEGTEEERWQMIGRYITSRGGVVAADELAP 299
Query: 295 YLDIDRT---MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA-SQRIGRKEYVG 350
YLD+ + SDESY+LPVLLRFDGQPE+DEEGNILYRFPS QRTA+ S R +KEYVG
Sbjct: 300 YLDVPSSKSDTSDESYILPVLLRFDGQPELDEEGNILYRFPSLQRTASGSSR--KKEYVG 357
Query: 351 RRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGF 410
+W D + +EK F+E+KW+FSKT+ SER M +GLG +NLFGVI+L AML+EMAVTP+GF
Sbjct: 358 -KWFDWVADMEKFFKERKWQFSKTSSSERAMVVGLGAVNLFGVIVLNAMLKEMAVTPSGF 416
Query: 411 LKFVAYIFPLLQL 423
L FV I+PLLQ+
Sbjct: 417 LTFVKNIYPLLQV 429
>gi|356495901|ref|XP_003516809.1| PREDICTED: uncharacterized protein At5g03900, chloroplastic-like
[Glycine max]
Length = 500
Score = 521 bits (1343), Expect = e-145, Method: Compositional matrix adjust.
Identities = 291/432 (67%), Positives = 331/432 (76%), Gaps = 35/432 (8%)
Query: 1 MTSISTCFTTTPKSRFFFTPLRPSINLKPPDSFPRIQPLPFPRISGKIP----GSRVLVP 56
M S+STC T P P R SF QP+ P + G +P SRVLV
Sbjct: 1 MASMSTCITVIPTCGL---PRRLL-------SFTPKQPISLPCVIGGVPKRVWDSRVLV- 49
Query: 57 VAKASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQK 116
PG VESDKLP+DVR R MDAVD C RVTIGDVA +AGL LN+AQK
Sbjct: 50 -----------ARPGGAVESDKLPSDVRKRTMDAVDGCGGRVTIGDVASRAGLNLNQAQK 98
Query: 117 ALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVL 176
ALQALAAD DGFLEVS EGDVLYVFP +YR++L AKSFR+K EP +KAKAA EY IRV
Sbjct: 99 ALQALAADADGFLEVSGEGDVLYVFPKDYRSRLGAKSFRIKAEPFFEKAKAAGEYLIRVS 158
Query: 177 FGTALIASIVIVFTAIIAIL-SSKSDDDDRGRR-RRSFDSGFNIFISPSDLFWYWDPYYY 234
FGTALIASIVIV+T IIA++ SS+S++D+RGRR RS+DSGF + +P DLFWYWDPYYY
Sbjct: 159 FGTALIASIVIVYTTIIALVTSSRSEEDNRGRRGGRSYDSGFTFYFNPVDLFWYWDPYYY 218
Query: 235 RRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAP 294
RRRR+Q DDD KMNFI+SVFSFVFG+GDPNQGIEE+RWKLIG+YIASNGGVV AEELAP
Sbjct: 219 RRRRLQADDD--KMNFIESVFSFVFGDGDPNQGIEEERWKLIGQYIASNGGVVAAEELAP 276
Query: 295 YLDIDRT---MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGR 351
YLDID T DESY+LPVLLRFDGQP++DEEGNILYRFPS QRT ASQ+ RKEYVGR
Sbjct: 277 YLDIDSTEGIKDDESYILPVLLRFDGQPDVDEEGNILYRFPSLQRT-ASQKSKRKEYVGR 335
Query: 352 RWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
RWAD + G+EK F+EKKW+FSKT ER M IGLGGLNLFGVIILG ML++MAV P+ F+
Sbjct: 336 RWADWV-GIEKFFKEKKWQFSKTGTPERAMVIGLGGLNLFGVIILGTMLKDMAVAPSSFI 394
Query: 412 KFVAYIFPLLQL 423
KFVA IFPLLQ+
Sbjct: 395 KFVADIFPLLQI 406
>gi|21536751|gb|AAM61083.1| unknown [Arabidopsis thaliana]
Length = 523
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 335/435 (77%), Gaps = 18/435 (4%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRP-SINLKPP---DSFPRIQPLPFPRISGKIPGSRVLV 55
M +STC +P+ ++ + +P I L+ P SFPR+ L +S + +R +
Sbjct: 1 MACVSTCLILSPRLTQVGLSSKKPFLIRLRSPVDRYSFPRM--LTERCLSTRRKFNRHGI 58
Query: 56 PVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNE 113
V KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA + GLK+ E
Sbjct: 59 AVVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRGGLKVTE 118
Query: 114 AQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSI 173
AQ ALQA+AADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 119 AQTALQAIAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPFLEKAKGAVDYLA 178
Query: 174 RVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPY 232
RV FGTALIASIVIV+T+IIA+LSSKS+DD+R RRR RS+DSGFN +I+P DL WYWDP
Sbjct: 179 RVSFGTALIASIVIVYTSIIALLSSKSEDDNRQRRRGRSYDSGFNFYINPVDLLWYWDPN 238
Query: 233 YYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEEL 292
YY RRR + +D+ K MNFI+SVFSFVFG+GDPNQGIEE+RW++IG+YI S GGVV A+EL
Sbjct: 239 YYNRRRAR-EDEGKGMNFIESVFSFVFGDGDPNQGIEEERWQMIGQYITSRGGVVAADEL 297
Query: 293 APYLDI---DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA-SQRIGRKEY 348
APYLD+ M+DESY+LPVLLRFDGQPE+DEEGNILYRFPS QRTA+ S R RKEY
Sbjct: 298 APYLDVPSSKSAMNDESYILPVLLRFDGQPELDEEGNILYRFPSLQRTASGSSR--RKEY 355
Query: 349 VGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN 408
VG +W D + +EK F+EKKW+FSKT+ SER + IGLG +NLFGVI+L +L EM+V P
Sbjct: 356 VG-KWFDWVADMEKFFKEKKWQFSKTSTSERALVIGLGAVNLFGVIVLNTLLNEMSVRPG 414
Query: 409 GFLKFVAYIFPLLQL 423
GFL FV I+PLLQ+
Sbjct: 415 GFLTFVKNIYPLLQI 429
>gi|297740116|emb|CBI30298.3| unnamed protein product [Vitis vinifera]
Length = 432
Score = 520 bits (1340), Expect = e-145, Method: Compositional matrix adjust.
Identities = 260/340 (76%), Positives = 305/340 (89%), Gaps = 6/340 (1%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
MDAVDAC RVTIGDVA K GLKLNEAQKALQALAADT+GFLEVSDEGDVLYVFP +YR+
Sbjct: 1 MDAVDACGGRVTIGDVASKGGLKLNEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRS 60
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
KLAAKSFR+K+EP ++KAK+AAEY +RV FGTALIASIV+V+T IIA+LSS+SD+D+RGR
Sbjct: 61 KLAAKSFRIKLEPFVEKAKSAAEYLVRVSFGTALIASIVLVYTTIIALLSSRSDEDNRGR 120
Query: 208 RR-RSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQ 266
R RS+DSGF +++P+DLFWYWDPYYYRRRR+Q +DD MNFI+SVFSFVFG+GDPNQ
Sbjct: 121 RGGRSYDSGFTFYLNPADLFWYWDPYYYRRRRIQKEDD--GMNFIESVFSFVFGDGDPNQ 178
Query: 267 GIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQPEIDE 323
GIE++RWKLIG+YI+SNGGVVTAEELAPYLD+ D + DESY+LPVLLRF+GQPE+DE
Sbjct: 179 GIEDERWKLIGQYISSNGGVVTAEELAPYLDLETADNNLVDESYILPVLLRFEGQPEVDE 238
Query: 324 EGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAI 383
EGNILYRFPS QRTA+SQR GRKEYVG+RW D +GGVEK F+EKKW+FSKT+ SER M I
Sbjct: 239 EGNILYRFPSLQRTASSQRSGRKEYVGKRWTDWVGGVEKFFKEKKWQFSKTSNSERAMVI 298
Query: 384 GLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
GLGGLNLFGVIILG ML+ +AVTP+GF+ FV+ IFPLLQ+
Sbjct: 299 GLGGLNLFGVIILGTMLKNVAVTPSGFITFVSDIFPLLQI 338
>gi|26453262|dbj|BAC43704.1| unknown protein [Arabidopsis thaliana]
Length = 523
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 268/435 (61%), Positives = 335/435 (77%), Gaps = 18/435 (4%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRP-SINLKPP---DSFPRIQPLPFPRISGKIPGSRVLV 55
M +STC +P+ ++ + +P I L+ P SFPR+ L +S + +R +
Sbjct: 1 MACVSTCLILSPRLTQVGLSSKKPFLIRLRSPVDRYSFPRM--LTERCLSTRRKFNRHGI 58
Query: 56 PVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNE 113
V KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA + GLK+ E
Sbjct: 59 AVVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRGGLKVTE 118
Query: 114 AQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSI 173
AQ ALQA+AADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 119 AQTALQAIAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPFLEKAKGAVDYLA 178
Query: 174 RVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPY 232
RV FGTALIASIVIV+T+IIA+LSSKS+DD+R RRR RS+DSGFN +I+P+DL WYWDP
Sbjct: 179 RVSFGTALIASIVIVYTSIIALLSSKSEDDNRQRRRGRSYDSGFNFYINPADLLWYWDPN 238
Query: 233 YYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEEL 292
YY RRR + +D+ K MNFI+SVFSFVFG+GDPNQGIEE+RW++IG+YI S GGVV A+EL
Sbjct: 239 YYNRRRAR-EDEGKGMNFIESVFSFVFGDGDPNQGIEEERWQMIGQYITSRGGVVAADEL 297
Query: 293 APYLDI---DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA-SQRIGRKEY 348
APYLD+ M+DESY+LPVLLRFDGQPE+DEEGNILY FPS QRTA+ S R RKEY
Sbjct: 298 APYLDVPSSKSAMNDESYILPVLLRFDGQPELDEEGNILYCFPSLQRTASGSSR--RKEY 355
Query: 349 VGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN 408
VG +W D + +E+ F+EKKW+FSKT+ SER + IGLG +NLFGVI+L +L EM+V P
Sbjct: 356 VG-KWFDWVADMERFFKEKKWQFSKTSTSERALVIGLGAVNLFGVIVLNTLLNEMSVRPG 414
Query: 409 GFLKFVAYIFPLLQL 423
GFL FV I+PLLQ+
Sbjct: 415 GFLTFVKNIYPLLQI 429
>gi|18414392|ref|NP_568129.1| Iron-sulfur assembly-like protein [Arabidopsis thaliana]
gi|88909724|sp|Q8GW20.2|Y5390_ARATH RecName: Full=Uncharacterized protein At5g03900, chloroplastic;
Flags: Precursor
gi|332003286|gb|AED90669.1| Iron-sulfur assembly-like protein [Arabidopsis thaliana]
Length = 523
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 334/435 (76%), Gaps = 18/435 (4%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRP-SINLKPP---DSFPRIQPLPFPRISGKIPGSRVLV 55
M +STC +P+ ++ + +P I L+ P SFPR+ L +S + +R +
Sbjct: 1 MACVSTCLILSPRLTQVGLSSKKPFLIRLRSPVDRYSFPRM--LTERCLSTRRKFNRHGI 58
Query: 56 PVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNE 113
V KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA + GLK+ E
Sbjct: 59 AVVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRGGLKVTE 118
Query: 114 AQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSI 173
AQ ALQA+AADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 119 AQTALQAIAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPFLEKAKGAVDYLA 178
Query: 174 RVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPY 232
RV FGTALIASIVIV+T+IIA+LSSKS+DD+R RRR RS+DSGFN +I+P DL WYWDP
Sbjct: 179 RVSFGTALIASIVIVYTSIIALLSSKSEDDNRQRRRGRSYDSGFNFYINPVDLLWYWDPN 238
Query: 233 YYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEEL 292
YY RRR + +D+ K MNFI+SVFSFVFG+GDPNQGIEE+RW++IG+YI S GGVV A+EL
Sbjct: 239 YYNRRRAR-EDEGKGMNFIESVFSFVFGDGDPNQGIEEERWQMIGQYITSRGGVVAADEL 297
Query: 293 APYLDI---DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA-SQRIGRKEY 348
APYLD+ M+DESY+LPVLLRFDGQPE+DEEGNILY FPS QRTA+ S R RKEY
Sbjct: 298 APYLDVPSSKSAMNDESYILPVLLRFDGQPELDEEGNILYCFPSLQRTASGSSR--RKEY 355
Query: 349 VGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN 408
VG +W D + +EK F+EKKW+FSKT+ SER + IGLG +NLFGVI+L +L EM+V P
Sbjct: 356 VG-KWFDWVADMEKFFKEKKWQFSKTSTSERALVIGLGAVNLFGVIVLNTLLNEMSVRPG 414
Query: 409 GFLKFVAYIFPLLQL 423
GFL FV I+PLLQ+
Sbjct: 415 GFLTFVKNIYPLLQI 429
>gi|30680281|ref|NP_850761.1| Iron-sulfur assembly-like protein [Arabidopsis thaliana]
gi|20466203|gb|AAM20419.1| putative protein [Arabidopsis thaliana]
gi|30387513|gb|AAP31922.1| At5g03900 [Arabidopsis thaliana]
gi|332003285|gb|AED90668.1| Iron-sulfur assembly-like protein [Arabidopsis thaliana]
Length = 429
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 335/435 (77%), Gaps = 18/435 (4%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRP-SINLKPP---DSFPRIQPLPFPRISGKIPGSRVLV 55
M +STC +P+ ++ + +P I L+ P SFPR+ L +S + +R +
Sbjct: 1 MACVSTCLILSPRLTQVGLSSKKPFLIRLRSPVDRYSFPRM--LTERCLSTRRKFNRHGI 58
Query: 56 PVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNE 113
V KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA + GLK+ E
Sbjct: 59 AVVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRGGLKVTE 118
Query: 114 AQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSI 173
AQ ALQA+AADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 119 AQTALQAIAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPFLEKAKGAVDYLA 178
Query: 174 RVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPY 232
RV FGTALIASIVIV+T+IIA+LSSKS+DD+R RRR RS+DSGFN +I+P DL WYWDP
Sbjct: 179 RVSFGTALIASIVIVYTSIIALLSSKSEDDNRQRRRGRSYDSGFNFYINPVDLLWYWDPN 238
Query: 233 YYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEEL 292
YY RRR + +D+ K MNFI+SVFSFVFG+GDPNQGIEE+RW++IG+YI S GGVV A+EL
Sbjct: 239 YYNRRRAR-EDEGKGMNFIESVFSFVFGDGDPNQGIEEERWQMIGQYITSRGGVVAADEL 297
Query: 293 APYLDIDRT---MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA-SQRIGRKEY 348
APYLD+ + M+DESY+LPVLLRFDGQPE+DEEGNILY FPS QRTA+ S R RKEY
Sbjct: 298 APYLDVPSSKSAMNDESYILPVLLRFDGQPELDEEGNILYCFPSLQRTASGSSR--RKEY 355
Query: 349 VGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN 408
VG +W D + +EK F+EKKW+FSKT+ SER + IGLG +NLFGVI+L +L EM+V P
Sbjct: 356 VG-KWFDWVADMEKFFKEKKWQFSKTSTSERALVIGLGAVNLFGVIVLNTLLNEMSVRPG 414
Query: 409 GFLKFVAYIFPLLQL 423
GFL FV I+PLLQ+
Sbjct: 415 GFLTFVKNIYPLLQV 429
>gi|147794495|emb|CAN62762.1| hypothetical protein VITISV_021812 [Vitis vinifera]
Length = 615
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 264/399 (66%), Positives = 307/399 (76%), Gaps = 57/399 (14%)
Query: 52 RVLVPVAKASTDVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKL 111
+V VPV +A DVA G+ PG IVE+DKLP+BVR RAMDAVDAC RVTIGDVA K GLKL
Sbjct: 28 QVFVPVVRAGLDVASGIRPGGIVETDKLPSBVRKRAMDAVDACGGRVTIGDVASKGGLKL 87
Query: 112 NEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEY 171
NEAQKALQALAADT+GFLEVSDEGDVLYVFP +YR+KLAAKSFR+K+EP ++KAK+AAEY
Sbjct: 88 NEAQKALQALAADTNGFLEVSDEGDVLYVFPKDYRSKLAAKSFRIKLEPFVEKAKSAAEY 147
Query: 172 SIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDP 231
+RV FGTALIASIV+V+T IIA+LSS+ YWDP
Sbjct: 148 LVRVSFGTALIASIVLVYTTIIALLSSRR---------------------------YWDP 180
Query: 232 YYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEE 291
YYYRRRR+Q +DD MNFI+SVFSFVFG+GDPNQGIE++RWKLIG+YI+SNGGVVTAEE
Sbjct: 181 YYYRRRRIQKEDDG--MNFIESVFSFVFGDGDPNQGIEDERWKLIGQYISSNGGVVTAEE 238
Query: 292 LAPYLDI---DRTMSDESYVLPVLLRFDGQPEIDEE------------------------ 324
LAPYLD+ D + DESY+LPVLLRF+GQPE+DEE
Sbjct: 239 LAPYLDLETADNNLVDESYILPVLLRFEGQPEVDEESITFFDENNSEHNLRSAVLEANLI 298
Query: 325 -GNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAI 383
GNILYRFPS QRTA+SQR GRKEYVG+RW D +GGVEK F+EKKW+FSKT+ SER M I
Sbjct: 299 QGNILYRFPSLQRTASSQRSGRKEYVGKRWTDWVGGVEKFFKEKKWQFSKTSNSERAMVI 358
Query: 384 GLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQ 422
GLGGLNLFGVIILG ML+ +AVTP+GF+ FV+ IFPLLQ
Sbjct: 359 GLGGLNLFGVIILGTMLKNVAVTPSGFITFVSDIFPLLQ 397
>gi|297806357|ref|XP_002871062.1| hypothetical protein ARALYDRAFT_487168 [Arabidopsis lyrata subsp.
lyrata]
gi|297316899|gb|EFH47321.1| hypothetical protein ARALYDRAFT_487168 [Arabidopsis lyrata subsp.
lyrata]
Length = 523
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 335/435 (77%), Gaps = 18/435 (4%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRP-SINLKPP---DSFPRIQPLPFPRISGKIPGSRVLV 55
M +STC +P+ ++ + +P I L+ P SFP I L +S + +R +
Sbjct: 1 MACVSTCLILSPRLTQVGLSSKKPFLIRLRSPVDRYSFPGI--LTERCVSTRRKFNRHGI 58
Query: 56 PVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNE 113
+ KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA +AGLK+ E
Sbjct: 59 ALVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRAGLKVTE 118
Query: 114 AQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSI 173
AQ ALQALAADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 119 AQTALQALAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPYLEKAKGAIDYLA 178
Query: 174 RVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPY 232
RV FGTALIASIVIV+T+IIA+LSS+SDDD+R RRR R +DSGFN +I+P DL WYWDP
Sbjct: 179 RVSFGTALIASIVIVYTSIIALLSSRSDDDNRQRRRGRGYDSGFNFYINPVDLLWYWDPN 238
Query: 233 YYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEEL 292
YY RRR + +D+ K MNFI+SVFSFVFG+GDPNQGIEE+RW++IG+YI S GGVV A+EL
Sbjct: 239 YYNRRRAR-EDEGKGMNFIESVFSFVFGDGDPNQGIEEERWQMIGQYITSRGGVVAADEL 297
Query: 293 APYLDIDRT---MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA-SQRIGRKEY 348
APYLD+ + M+DESY+LPVLLRFDGQPE+D+EGNILYRFPS QRTA+ S R RKEY
Sbjct: 298 APYLDVPSSKSAMNDESYILPVLLRFDGQPELDDEGNILYRFPSLQRTASGSSR--RKEY 355
Query: 349 VGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN 408
VG +W D + +EK F+EKKW+FSKT+ SER + IGLG +NLFGVI+L +L EMAV P
Sbjct: 356 VG-KWFDWVADMEKFFKEKKWQFSKTSTSERALVIGLGAVNLFGVIVLNTLLNEMAVRPG 414
Query: 409 GFLKFVAYIFPLLQL 423
GFL FV I+PLLQ+
Sbjct: 415 GFLTFVKNIYPLLQI 429
>gi|9758019|dbj|BAB08616.1| unnamed protein product [Arabidopsis thaliana]
Length = 495
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 250/434 (57%), Positives = 310/434 (71%), Gaps = 44/434 (10%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRP-SINLKPP---DSFPRIQPLPFPRISGKIPGSRVLV 55
M +STC +P+ ++ + +P I L+ P SFPR+ L +S + +R +
Sbjct: 1 MACVSTCLILSPRLTQVGLSSKKPFLIRLRSPVDRYSFPRM--LTERCLSTRRKFNRHGI 58
Query: 56 PVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNE 113
V KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA + GLK+ E
Sbjct: 59 AVVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRGGLKVTE 118
Query: 114 AQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSI 173
AQ ALQA+AADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 119 AQTALQAIAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPFLEKAKGAVDYLA 178
Query: 174 RVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYY 233
RV FGTALIASIVIV+T+IIA+LSSK YWDP Y
Sbjct: 179 RVSFGTALIASIVIVYTSIIALLSSKR---------------------------YWDPNY 211
Query: 234 YRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELA 293
Y RRR + +D+ K MNFI+SVFSFVFG+GDPNQGIEE+RW++IG+YI S GGVV A+ELA
Sbjct: 212 YNRRRAR-EDEGKGMNFIESVFSFVFGDGDPNQGIEEERWQMIGQYITSRGGVVAADELA 270
Query: 294 PYLDI---DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA-SQRIGRKEYV 349
PYLD+ M+DESY+LPVLLRFDGQPE+DEEGNILY FPS QRTA+ S R RKEYV
Sbjct: 271 PYLDVPSSKSAMNDESYILPVLLRFDGQPELDEEGNILYCFPSLQRTASGSSR--RKEYV 328
Query: 350 GRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNG 409
G +W D + +EK F+EKKW+FSKT+ SER + IGLG +NLFGVI+L +L EM+V P G
Sbjct: 329 G-KWFDWVADMEKFFKEKKWQFSKTSTSERALVIGLGAVNLFGVIVLNTLLNEMSVRPGG 387
Query: 410 FLKFVAYIFPLLQL 423
FL FV I+PLLQ+
Sbjct: 388 FLTFVKNIYPLLQI 401
>gi|413937129|gb|AFW71680.1| hypothetical protein ZEAMMB73_735454 [Zea mays]
Length = 521
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 233/360 (64%), Positives = 289/360 (80%), Gaps = 10/360 (2%)
Query: 68 VGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDG 127
V PG VE+D+LP+DVR+RAM+AVD RVTIGDVA +AGL+L +A++ALQALAADT+G
Sbjct: 75 VRPGGAVETDRLPSDVRDRAMEAVDHFGGRVTIGDVASRAGLQLAQAERALQALAADTEG 134
Query: 128 FLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVI 187
FLEVS++G+VLYVFP +YRAKLA KSFR++VEP++DKAK Y +RV FGTALIASIV+
Sbjct: 135 FLEVSEDGEVLYVFPKDYRAKLAGKSFRMRVEPLVDKAKQVGAYLVRVSFGTALIASIVL 194
Query: 188 VFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKK 247
V+T IIAILSS SD+D RGRRRRS+ S I P+D+FWY D YYRRRRV+ ++
Sbjct: 195 VYTTIIAILSSSSDEDGRGRRRRSYGS---TIIIPTDMFWYLDADYYRRRRVENENG--- 248
Query: 248 MNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMS 303
MNFI+SVFSFVFG+GDPN G+EEKRWK+IG+YI+SNGGVVTAEELAP+LD+ + +
Sbjct: 249 MNFIESVFSFVFGDGDPNDGLEEKRWKMIGQYISSNGGVVTAEELAPFLDVPPPSEESKD 308
Query: 304 DESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKI 363
DES+VLPVLLRF G PEIDE+GNILYRFPS QRTA+S+ +EYVG +W+ GVEK
Sbjct: 309 DESFVLPVLLRFQGHPEIDEQGNILYRFPSLQRTASSKSGRSREYVGTKWSAMFSGVEKY 368
Query: 364 FREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
EK W+FSK N +E+ M GLGGLNLFGVIILG +L++M VTP G + F A ++PLLQ+
Sbjct: 369 LEEKPWKFSKANATEKAMVAGLGGLNLFGVIILGNLLKQMTVTPGGLISFAAQLYPLLQI 428
>gi|242065288|ref|XP_002453933.1| hypothetical protein SORBIDRAFT_04g021710 [Sorghum bicolor]
gi|241933764|gb|EES06909.1| hypothetical protein SORBIDRAFT_04g021710 [Sorghum bicolor]
Length = 522
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 226/358 (63%), Positives = 283/358 (79%), Gaps = 10/358 (2%)
Query: 70 PGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFL 129
PG VE+D+LP+DVR+RAM+AVD RVTIGDVA +AGL+L +A++ALQALAADT+GFL
Sbjct: 78 PGGAVETDRLPSDVRDRAMEAVDHFGGRVTIGDVASRAGLQLAQAERALQALAADTEGFL 137
Query: 130 EVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVF 189
EVS++G+VLYVFP +YRAKLA KSFR++VEP++DKAK Y +RV FGTALIASIV+V+
Sbjct: 138 EVSEDGEVLYVFPKDYRAKLAGKSFRMRVEPLVDKAKQVGAYLVRVSFGTALIASIVLVY 197
Query: 190 TAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMN 249
T IIAILSS SD+D R R S + I P+D+FWY D YYRRRRV+ ++ MN
Sbjct: 198 TTIIAILSSSSDED---SRGRRRRSYGSTVIIPTDMFWYLDADYYRRRRVENENG---MN 251
Query: 250 FIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMSDE 305
FI+SVFSFVFG+GDPN G+EEKRWK+IG+YI+SNGGVVTAEELAP+LD+ + + DE
Sbjct: 252 FIESVFSFVFGDGDPNDGLEEKRWKMIGQYISSNGGVVTAEELAPFLDVPPPSEESKDDE 311
Query: 306 SYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFR 365
S+VLPVLLRF G PEIDE+GNILYRFPS QRTA+S+ +EYVG +W+ G+EK
Sbjct: 312 SFVLPVLLRFQGHPEIDEQGNILYRFPSLQRTASSKSGRSREYVGTKWSAMFSGIEKYLE 371
Query: 366 EKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
EK W+FSK N SE+ + GLGGLNLFGVIILG +L++M VTP G + F A ++PLLQ+
Sbjct: 372 EKPWKFSKANASEKALVAGLGGLNLFGVIILGNLLKQMTVTPGGLISFAAQLYPLLQI 429
>gi|226497026|ref|NP_001145951.1| uncharacterized protein LOC100279475 [Zea mays]
gi|219885087|gb|ACL52918.1| unknown [Zea mays]
gi|413937131|gb|AFW71682.1| hypothetical protein ZEAMMB73_735454 [Zea mays]
Length = 522
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 228/360 (63%), Positives = 284/360 (78%), Gaps = 9/360 (2%)
Query: 68 VGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDG 127
V PG VE+D+LP+DVR+RAM+AVD RVTIGDVA +AGL+L +A++ALQALAADT+G
Sbjct: 75 VRPGGAVETDRLPSDVRDRAMEAVDHFGGRVTIGDVASRAGLQLAQAERALQALAADTEG 134
Query: 128 FLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVI 187
FLEVS++G+VLYVFP +YRAKLA KSFR++VEP++DKAK Y +RV FGTALIASIV+
Sbjct: 135 FLEVSEDGEVLYVFPKDYRAKLAGKSFRMRVEPLVDKAKQVGAYLVRVSFGTALIASIVL 194
Query: 188 VFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKK 247
V+T IIAILSS S D+D R R S + I P+D+FWY D YYRRRRV+ ++
Sbjct: 195 VYTTIIAILSSSSSDED--GRGRRRRSYGSTIIIPTDMFWYLDADYYRRRRVENENG--- 249
Query: 248 MNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMS 303
MNFI+SVFSFVFG+GDPN G+EEKRWK+IG+YI+SNGGVVTAEELAP+LD+ + +
Sbjct: 250 MNFIESVFSFVFGDGDPNDGLEEKRWKMIGQYISSNGGVVTAEELAPFLDVPPPSEESKD 309
Query: 304 DESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKI 363
DES+VLPVLLRF G PEIDE+GNILYRFPS QRTA+S+ +EYVG +W+ GVEK
Sbjct: 310 DESFVLPVLLRFQGHPEIDEQGNILYRFPSLQRTASSKSGRSREYVGTKWSAMFSGVEKY 369
Query: 364 FREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
EK W+FSK N +E+ M GLGGLNLFGVIILG +L++M VTP G + F A ++PLLQ+
Sbjct: 370 LEEKPWKFSKANATEKAMVAGLGGLNLFGVIILGNLLKQMTVTPGGLISFAAQLYPLLQI 429
>gi|7406400|emb|CAB85510.1| putative protein [Arabidopsis thaliana]
Length = 757
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 245/434 (56%), Positives = 311/434 (71%), Gaps = 21/434 (4%)
Query: 1 MTSISTCFTTTPK-SRFFFTPLRP-SINLKPP---DSFPRIQPLPFPRISGKIPGSRVLV 55
M +STC +P+ ++ + +P I L+ P SFPR+ L +S + +R +
Sbjct: 1 MACVSTCLILSPRLTQVGLSSKKPFLIRLRSPVDRYSFPRM--LTERCLSTRRKFNRHGI 58
Query: 56 PVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNE 113
V KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA + GLK+ E
Sbjct: 59 AVVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRGGLKVTE 118
Query: 114 AQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSI 173
AQ ALQA+AADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 119 AQTALQAIAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPFLEKAKGAVDYLA 178
Query: 174 RVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPY 232
RV FGTALIASIVIV+T+IIA+LSSKS+DD+R RRR RS+DSGFN +I+P DL WYWDP
Sbjct: 179 RVSFGTALIASIVIVYTSIIALLSSKSEDDNRQRRRGRSYDSGFNFYINPVDLLWYWDPN 238
Query: 233 YYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEEL 292
YY RRR + +D+ K MNFI+SVFSFVFG+GDPNQGIEE+RW++IG+YI S GGVV A+EL
Sbjct: 239 YYNRRRAR-EDEGKGMNFIESVFSFVFGDGDPNQGIEEERWQMIGQYITSRGGVVAADEL 297
Query: 293 APYLDIDRT---MSDESYVLPVLLRFDGQPEIDEE-GNILYRFPSFQRTAASQRIGRKEY 348
APYLD+ + M+DESY+LPVLLRFDGQPE+DEE +++ ++ +I R
Sbjct: 298 APYLDVPSSKSAMNDESYILPVLLRFDGQPELDEELLDLVDEKNMWENGLTGLQIWRNS- 356
Query: 349 VGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN 408
RR + IF SKT+ SER + IGLG +NLFGVI+L +L EM+V P
Sbjct: 357 -SRRKNGNLVPDNGIF----VLHSKTSTSERALVIGLGAVNLFGVIVLNTLLNEMSVRPG 411
Query: 409 GFLKFVAYIFPLLQ 422
GFL FV I+PLLQ
Sbjct: 412 GFLTFVKNIYPLLQ 425
>gi|357149325|ref|XP_003575073.1| PREDICTED: uncharacterized protein At5g03900, chloroplastic-like
isoform 1 [Brachypodium distachyon]
Length = 519
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 229/358 (63%), Positives = 283/358 (79%), Gaps = 9/358 (2%)
Query: 70 PGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFL 129
PG VE+D+LP+DVR+RAMDAVD RVTIGDVA +AGL++++A++ALQALAADT GFL
Sbjct: 74 PGGAVETDRLPSDVRDRAMDAVDHFGGRVTIGDVASRAGLQVDQAERALQALAADTGGFL 133
Query: 130 EVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVF 189
EVS EG+VLYVFP +YRAKLA KSFR++VEP+++KAK Y +RV FGTAL+ASIV+V+
Sbjct: 134 EVSGEGEVLYVFPKDYRAKLAGKSFRMRVEPLVNKAKEVGAYVVRVSFGTALVASIVLVY 193
Query: 190 TAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMN 249
T IIAI+SS S D+D R RR G IF+ P+DLFWY D RRRRV+ ++ MN
Sbjct: 194 TTIIAIISSSSSDED-NRGRRRRSYGSTIFL-PTDLFWYLDAGSSRRRRVENENG---MN 248
Query: 250 FIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMSDE 305
FI+SVFSFVFG+GDPN G+EE+RWK+IG+YI+SNGGVVTAEELAPYLD+ + + DE
Sbjct: 249 FIESVFSFVFGDGDPNDGLEERRWKMIGQYISSNGGVVTAEELAPYLDVPAPSELSKDDE 308
Query: 306 SYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFR 365
S++LPVLLRF G PE+DE+GNILYRFPS QRTA+S+ G +EYVG +W+ GVEK
Sbjct: 309 SFILPVLLRFQGHPEVDEQGNILYRFPSLQRTASSKGGGSREYVGTKWSAMFSGVEKFME 368
Query: 366 EKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
EK WEFSK N SER M GLGGLNLFGVIILG +L++M VTP G + F A +FPLLQ+
Sbjct: 369 EKPWEFSKANASERAMVAGLGGLNLFGVIILGNLLKQMTVTPGGLISFAAQLFPLLQI 426
>gi|222622991|gb|EEE57123.1| hypothetical protein OsJ_07007 [Oryza sativa Japonica Group]
Length = 428
Score = 423 bits (1087), Expect = e-116, Method: Compositional matrix adjust.
Identities = 223/340 (65%), Positives = 271/340 (79%), Gaps = 9/340 (2%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M+AVD RVTIGDVA +AGL+L +A++ALQALAADT GFLEVS+EG+VLYVFP +YRA
Sbjct: 1 MEAVDHFGGRVTIGDVASRAGLQLAQAERALQALAADTGGFLEVSEEGEVLYVFPKDYRA 60
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
KLA KSFR+KVEP+IDK K Y +RV FGTALIASIV+V+T IIAI+SS SD+D+RGR
Sbjct: 61 KLAGKSFRMKVEPLIDKTKEVGAYLVRVSFGTALIASIVLVYTTIIAIISSSSDEDNRGR 120
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQG 267
RRRS+DS I P+DLFWY D YYRRRR +D MNFI+S+FSFVFG+GDPN G
Sbjct: 121 RRRSYDS---TIIIPTDLFWYLDADYYRRRRRVEKEDG--MNFIESIFSFVFGDGDPNDG 175
Query: 268 IEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMSDESYVLPVLLRFDGQPEIDE 323
+E+KRWK+IG+YI+SNGGVVTAEELAPYLD+ +++ DES++LPVLLRF G PE+DE
Sbjct: 176 LEDKRWKMIGQYISSNGGVVTAEELAPYLDVPPISEQSKDDESFILPVLLRFQGHPEVDE 235
Query: 324 EGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAI 383
+GNILYRFPS QRTA+S+ G +EYVG +W+ VEK EK W+FSK N SER M
Sbjct: 236 QGNILYRFPSLQRTASSKGSGVREYVGNKWSAMFSSVEKYLEEKPWKFSKANASERAMVA 295
Query: 384 GLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
GLGGLNLFGVIILG +L++M V P G + FVA +FPLLQ+
Sbjct: 296 GLGGLNLFGVIILGNLLKQMTVPPGGLISFVAQLFPLLQV 335
>gi|50251399|dbj|BAD28426.1| unknown protein [Oryza sativa Japonica Group]
Length = 528
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 230/374 (61%), Positives = 283/374 (75%), Gaps = 15/374 (4%)
Query: 58 AKASTDVAVGVG-PGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQK 116
A+A T A G+ PG VE+D+LP+ VR+RAM+AVD RVTIGDVA +AGL+L +A++
Sbjct: 69 ARAGTIQAPGLARPGGAVETDRLPSGVRDRAMEAVDHFGGRVTIGDVASRAGLQLAQAER 128
Query: 117 ALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVL 176
ALQALAADT GFLEVS+EG+VLYVFP +YRAKLA KSFR+KVEP+IDK K Y +RV
Sbjct: 129 ALQALAADTGGFLEVSEEGEVLYVFPKDYRAKLAGKSFRMKVEPLIDKTKEVGAYLVRVS 188
Query: 177 FGTALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRR 236
FGTALIASIV+V+T IIAI+SS SD+D+RGRRRRS+DS I P+DLFWY D YYRR
Sbjct: 189 FGTALIASIVLVYTTIIAIISSSSDEDNRGRRRRSYDS---TIIIPTDLFWYLDADYYRR 245
Query: 237 RRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL 296
RR +D MNFI+S+FSFVFG+GDPN G+E+KRWK+IG+YI+SNGGVVTAEELAPYL
Sbjct: 246 RRRVEKEDG--MNFIESIFSFVFGDGDPNDGLEDKRWKMIGQYISSNGGVVTAEELAPYL 303
Query: 297 DIDRTMSDESYVLPVLLRF-------DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYV 349
D+ +S++S + L F D Q ++ GNILYRFPS QRTA+S+ G +EYV
Sbjct: 304 DVP-PISEQSKRMMNHLFFQFYYASRDTQ-KLMNRGNILYRFPSLQRTASSKGSGVREYV 361
Query: 350 GRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNG 409
G +W+ VEK EK W+FSK N SER M GLGGLNLFGVIILG +L++M V P G
Sbjct: 362 GNKWSAMFSSVEKYLEEKPWKFSKANASERAMVAGLGGLNLFGVIILGNLLKQMTVPPGG 421
Query: 410 FLKFVAYIFPLLQL 423
+ FVA +FPLLQ+
Sbjct: 422 LISFVAQLFPLLQV 435
>gi|118488331|gb|ABK95984.1| unknown [Populus trichocarpa]
Length = 288
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 201/295 (68%), Positives = 238/295 (80%), Gaps = 12/295 (4%)
Query: 1 MTSISTCFTTTPKSRFFFTPLRPSINLKPPDSFP-RIQPLPFPRISGKIPGSRVLVPVA- 58
M SIST + +P L+P + LKPPDS R Q L ++S K P + + +
Sbjct: 1 MASISTPLSYSPSP----VRLKPPVRLKPPDSLLLRTQTLH--KLSFKSPNPKTPIGFSV 54
Query: 59 KASTDVA--VGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQK 116
KA+ DV +G+ PG +VE+DKLP+DVRNRAM+AVDAC RVTIGDVA +AGLKLNEAQK
Sbjct: 55 KATADVTKTMGIRPGSVVETDKLPSDVRNRAMEAVDACGGRVTIGDVASRAGLKLNEAQK 114
Query: 117 ALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVL 176
ALQALA+DTDGFLEVSDEGDVLYVFP +YR+KLAAKS RLK EP+ +K KAAAEY IRV
Sbjct: 115 ALQALASDTDGFLEVSDEGDVLYVFPKDYRSKLAAKSLRLKFEPLFEKGKAAAEYLIRVS 174
Query: 177 FGTALIASIVIVFTAIIAILSSKSDDDDRGRRR-RSFDSGFNIFISPSDLFWYWDPYYYR 235
FGTALIASIVIV+T IIAILSS D++DRGRRR RSFD+GF ++SP+DLFWYWDPYYYR
Sbjct: 175 FGTALIASIVIVYTTIIAILSSSRDENDRGRRRSRSFDTGFAFYLSPTDLFWYWDPYYYR 234
Query: 236 RRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAE 290
RR+++TD D KMNFI+SVFSFVFG+GDPNQGIEE+RWKLIG+YI+SNGGVV AE
Sbjct: 235 RRQLRTDGGD-KMNFIESVFSFVFGDGDPNQGIEEERWKLIGQYISSNGGVVAAE 288
>gi|326505676|dbj|BAJ95509.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 437
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 198/307 (64%), Positives = 245/307 (79%), Gaps = 12/307 (3%)
Query: 71 GRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLE 130
G VE+D+LPADVR+RAMDAVD RVTIGDVA +AGL+++ A++ALQALA+DT GFLE
Sbjct: 77 GGTVETDRLPADVRDRAMDAVDHFGGRVTIGDVASRAGLQIDLAERALQALASDTGGFLE 136
Query: 131 VSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFT 190
VS EG+VLYVFP +YRAKLA KSFR++VEP++DKAK A Y +RV FGTAL+ASIV+V+
Sbjct: 137 VSGEGEVLYVFPEDYRAKLAGKSFRMRVEPLVDKAKEAGAYVVRVSFGTALVASIVLVYA 196
Query: 191 AIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNF 250
IIAILSS SD+D+RGRRRRS+ G +F+ P+DLFWY D RRRRV+ DK M+F
Sbjct: 197 TIIAILSSSSDEDNRGRRRRSY--GSTMFL-PTDLFWYLDTGSSRRRRVE---KDKGMSF 250
Query: 251 IKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMS--D 304
I+SVFSFVFG GDPN G+EE+RWK+IG+YI+SNGGVVTAEELAPYLD+ ++T S D
Sbjct: 251 IESVFSFVFGNGDPNDGLEERRWKMIGQYISSNGGVVTAEELAPYLDVPAPAEQTNSKDD 310
Query: 305 ESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIF 364
ES++LPVLLRF G P +D +GNILYRFPS Q T +S+ G +EYVG RW+ + G+EK
Sbjct: 311 ESFILPVLLRFQGHPLVDNQGNILYRFPSLQHTTSSKGGGSREYVGTRWSTMLSGIEKFM 370
Query: 365 REKKWEF 371
EK WEF
Sbjct: 371 EEKPWEF 377
>gi|357149327|ref|XP_003575074.1| PREDICTED: uncharacterized protein At5g03900, chloroplastic-like
isoform 2 [Brachypodium distachyon]
Length = 433
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 199/306 (65%), Positives = 249/306 (81%), Gaps = 10/306 (3%)
Query: 70 PGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFL 129
PG VE+D+LP+DVR+RAMDAVD RVTIGDVA +AGL++++A++ALQALAADT GFL
Sbjct: 74 PGGAVETDRLPSDVRDRAMDAVDHFGGRVTIGDVASRAGLQVDQAERALQALAADTGGFL 133
Query: 130 EVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVF 189
EVS EG+VLYVFP +YRAKLA KSFR++VEP+++KAK Y +RV FGTAL+ASIV+V+
Sbjct: 134 EVSGEGEVLYVFPKDYRAKLAGKSFRMRVEPLVNKAKEVGAYVVRVSFGTALVASIVLVY 193
Query: 190 TAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMN 249
T IIAI+SS SD+D+RGRRRRS+ G IF+ P+DLFWY D RRRRV+ ++ MN
Sbjct: 194 TTIIAIISSSSDEDNRGRRRRSY--GSTIFL-PTDLFWYLDAGSSRRRRVENENG---MN 247
Query: 250 FIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMSDE 305
FI+SVFSFVFG+GDPN G+EE+RWK+IG+YI+SNGGVVTAEELAPYLD+ + + DE
Sbjct: 248 FIESVFSFVFGDGDPNDGLEERRWKMIGQYISSNGGVVTAEELAPYLDVPAPSELSKDDE 307
Query: 306 SYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFR 365
S++LPVLLRF G PE+DE+GNILYRFPS QRTA+S+ G +EYVG +W+ GVEK
Sbjct: 308 SFILPVLLRFQGHPEVDEQGNILYRFPSLQRTASSKGGGSREYVGTKWSAMFSGVEKFME 367
Query: 366 EKKWEF 371
EK WEF
Sbjct: 368 EKPWEF 373
>gi|218190899|gb|EEC73326.1| hypothetical protein OsI_07523 [Oryza sativa Indica Group]
Length = 400
Score = 368 bits (945), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 203/340 (59%), Positives = 246/340 (72%), Gaps = 37/340 (10%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M+AVD RVTIGDV S+EG+VLYVFP +YRA
Sbjct: 1 MEAVDHFGGRVTIGDV----------------------------SEEGEVLYVFPKDYRA 32
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
KLA KSFR+KVEP+IDK K Y +RV FGTALIASIV+V+T IIAI+SS SD+D+RGR
Sbjct: 33 KLAGKSFRMKVEPLIDKTKEVGAYLVRVSFGTALIASIVLVYTTIIAIISSSSDEDNRGR 92
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQG 267
RRRS+DS I P+DLFWY D YYRRRR +D MNFI+S+FSFVFG+GDPN G
Sbjct: 93 RRRSYDS---TIIIPTDLFWYLDADYYRRRRRVEKEDG--MNFIESIFSFVFGDGDPNDG 147
Query: 268 IEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMSDESYVLPVLLRFDGQPEIDE 323
+E+KRWK+IG+YI+SNGGVVTAEELAPYLD+ +++ DES++LPVLLRF G PE+DE
Sbjct: 148 LEDKRWKMIGQYISSNGGVVTAEELAPYLDVPPISEQSKDDESFILPVLLRFQGHPEVDE 207
Query: 324 EGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAI 383
+GNILYRFPS QRTA+S+ G +EYVG +W+ VEK EK W+FSK N SER M
Sbjct: 208 QGNILYRFPSLQRTASSKGSGVREYVGNKWSAMFSSVEKYLEEKPWKFSKANASERAMVA 267
Query: 384 GLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
GLGGLNLFGVIILG +L++M V P G + FVA +FPLLQ+
Sbjct: 268 GLGGLNLFGVIILGNLLKQMTVPPGGLISFVAQLFPLLQV 307
>gi|168055840|ref|XP_001779931.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668645|gb|EDQ55248.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 413
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 191/358 (53%), Positives = 237/358 (66%), Gaps = 44/358 (12%)
Query: 74 VESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSD 133
+E D LP VR+ M AVD RRVT+GDVA +AGLKL +A+ ALQALAAD+ GFLEVSD
Sbjct: 1 IEIDSLPPFVRDSTMKAVDDLGRRVTVGDVASRAGLKLTQAETALQALAADSGGFLEVSD 60
Query: 134 EGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAII 193
EGDVLYV P +YRA L +KS RLK EP+++K KA AEY+IRV FGTAL+ASIVIV++ I+
Sbjct: 61 EGDVLYVLPKDYRANLTSKSLRLKYEPLLEKLKALAEYAIRVSFGTALLASIVIVYSTIL 120
Query: 194 AILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKS 253
ILSS GRR YW+P YYR RR MNF ++
Sbjct: 121 VILSS-------GRR-------------------YWNPNYYRSRR--PSKRGGGMNFFEN 152
Query: 254 VFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTMSDESYVL 309
VFSFVFG+GDPN+G+EE RW+ IG+ I S GGVV+AEELAP+LD+ + T DESYVL
Sbjct: 153 VFSFVFGDGDPNEGLEEVRWRAIGDTITSKGGVVSAEELAPFLDVPPYSEETKGDESYVL 212
Query: 310 PVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVE-KIFREKK 368
PVLLRFDG PE+DE+GNILYRFPS QRTA +++GRR GG + F+E +
Sbjct: 213 PVLLRFDGHPEVDEQGNILYRFPSLQRTAV-------DWLGRRKEAPKGGSDLYYFQENQ 265
Query: 369 WEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTP----NGFLKFVAYIFPLLQ 422
W FSK E+ + IGLG LNL GV+IL +ML++ + +G + F P LQ
Sbjct: 266 WAFSKARKVEQALVIGLGCLNLAGVVILSSMLRDYSFIQSFRGSGLIPFAFKALPFLQ 323
>gi|302791423|ref|XP_002977478.1| hypothetical protein SELMODRAFT_106680 [Selaginella moellendorffii]
gi|300154848|gb|EFJ21482.1| hypothetical protein SELMODRAFT_106680 [Selaginella moellendorffii]
Length = 488
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 190/362 (52%), Positives = 240/362 (66%), Gaps = 46/362 (12%)
Query: 67 GVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTD 126
G+ G +E+D+L VR R M A+D RVT+GDVA AGLK++EAQ ALQALAADT
Sbjct: 72 GLRIGEQIETDRLAPSVRERTMKAIDTLGGRVTVGDVATNAGLKVSEAQSALQALAADTG 131
Query: 127 GFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIV 186
GFLEVSDEGDVLYVF +YR+ L AKSFRLK+EP + K KA AEY IRV FGT LIAS+V
Sbjct: 132 GFLEVSDEGDVLYVFSKDYRSNLLAKSFRLKIEPALSKLKAGAEYLIRVSFGTTLIASLV 191
Query: 187 IVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDK 246
IV+T+I +LSS RR YWDPYYYRRR+ +T+
Sbjct: 192 IVYTSIFVLLSS-------ARR-------------------YWDPYYYRRRKKRTEG--- 222
Query: 247 KMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTM 302
MNF++SVFSFVFG+ DPN+G++E RW+ IGE IA+ GGVVTAEELAPYLD+ +
Sbjct: 223 -MNFLESVFSFVFGDPDPNEGLDEVRWQAIGEEIAAKGGVVTAEELAPYLDVSALDENNK 281
Query: 303 SDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEK 362
DES+VLPVLLRFDGQPE+D GNI+YRFPS QRTAA+Q W G +
Sbjct: 282 DDESFVLPVLLRFDGQPEVDARGNIVYRFPSLQRTAANQ----------AWKSHNGDRLQ 331
Query: 363 IFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPL 420
+E FS+ +++ + + LG NL GV+ LG++L++ A+T G L+FV+ FP+
Sbjct: 332 YLQEAPLPFSRAKQTDQTLVVALGAFNLLGVLTLGSLLKDAAITAQMGGLLQFVSNAFPV 391
Query: 421 LQ 422
LQ
Sbjct: 392 LQ 393
>gi|148908848|gb|ABR17529.1| unknown [Picea sitchensis]
Length = 362
Score = 327 bits (839), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 179/271 (66%), Positives = 216/271 (79%), Gaps = 6/271 (2%)
Query: 156 LKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSG 215
++ EPV+DK KA AEY IRV FGT L+ASIVIV+TAII ILS +S++D+RGRR RS+D G
Sbjct: 1 MRFEPVLDKLKAVAEYLIRVSFGTTLLASIVIVYTAIIVILSGRSEEDNRGRRGRSYDPG 60
Query: 216 FNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKL 275
FNIFISPSDLFWYWDPYYYRRRR +T+ + MNF +SVFSFVFG+GDPNQ IEE+RWKL
Sbjct: 61 FNIFISPSDLFWYWDPYYYRRRRRKTEGE---MNFFESVFSFVFGDGDPNQEIEEERWKL 117
Query: 276 IGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFP 332
IGEYI S+GGVVTAEE+APYLD ID DESYVLPVLLRFDG+PE++ +G+ILYRFP
Sbjct: 118 IGEYITSHGGVVTAEEVAPYLDVPPIDGNKEDESYVLPVLLRFDGRPEVNSKGDILYRFP 177
Query: 333 SFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFG 392
S QRTA++ RKEYVG+RW +G K +EK+W+FSK E+ + I LGGLNL G
Sbjct: 178 SLQRTASTWIGSRKEYVGKRWKTFVGEATKFLQEKQWDFSKAGRKEKSLVIALGGLNLVG 237
Query: 393 VIILGAMLQEMAVTPNGFLKFVAYIFPLLQL 423
VI LG+ML+++A GFL FV IFPLLQ+
Sbjct: 238 VIFLGSMLKDIATIRGGFLSFVTDIFPLLQI 268
>gi|302780763|ref|XP_002972156.1| hypothetical protein SELMODRAFT_96266 [Selaginella moellendorffii]
gi|300160455|gb|EFJ27073.1| hypothetical protein SELMODRAFT_96266 [Selaginella moellendorffii]
Length = 489
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 191/362 (52%), Positives = 240/362 (66%), Gaps = 46/362 (12%)
Query: 67 GVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTD 126
G+ G +E+D+L VR R M A+D RVT+GDVA AGLK++EAQ ALQALAADT
Sbjct: 73 GLRIGEQIETDRLAPSVRERTMKAIDTLGGRVTVGDVATNAGLKVSEAQSALQALAADTG 132
Query: 127 GFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIV 186
GFLEVSDEGDVLYVF +YR+ L AKSFRLK+EP + K KA AEY IRV FGT LIAS+V
Sbjct: 133 GFLEVSDEGDVLYVFSKDYRSNLLAKSFRLKIEPALSKLKAGAEYLIRVSFGTTLIASLV 192
Query: 187 IVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDK 246
IV+T+I +LSS RR YWDPYYYRRR+ +TD
Sbjct: 193 IVYTSIFVLLSS-------ARR-------------------YWDPYYYRRRKRRTDG--- 223
Query: 247 KMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI----DRTM 302
MNF++SVFSFVFG+ DPN+G++E RW+ IGE IA+ GGVVTAEELAPYLD+ +
Sbjct: 224 -MNFLESVFSFVFGDPDPNEGLDEVRWQAIGEEIAAKGGVVTAEELAPYLDVSALDENNK 282
Query: 303 SDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEK 362
DES+VLPVLLRFDGQPE+D GNI+YRFPS QRTAA+Q W G +
Sbjct: 283 DDESFVLPVLLRFDGQPEVDARGNIVYRFPSLQRTAANQ----------AWKSHNGDRLQ 332
Query: 363 IFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPL 420
+E FS+ +++ + + LG NL GV+ LG++L++ A+T G L+FV+ FP+
Sbjct: 333 YLQEAPLPFSRAKQTDQTLVVALGAFNLLGVLTLGSLLKDAAITAQMGGLLQFVSNAFPV 392
Query: 421 LQ 422
LQ
Sbjct: 393 LQ 394
>gi|384253022|gb|EIE26497.1| hypothetical protein COCSUDRAFT_46111 [Coccomyxa subellipsoidea
C-169]
Length = 499
Score = 283 bits (725), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 176/392 (44%), Positives = 247/392 (63%), Gaps = 27/392 (6%)
Query: 45 SGKIPGSRVLVPVAKASTDVAVGVGPGR-IVESDKLPADVRNRAMDAVDACNRRVTIGDV 103
SG + S L A + DVAV P R +ES +P +R R DAV++ RVT+GDV
Sbjct: 5 SGSLLVSTCLRGTAGSEDDVAVPSAPLRGKLESVSIPQALRKRVEDAVESLGGRVTVGDV 64
Query: 104 AGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVID 163
A +AG+ L + ++ L ALAAD+ G L+VS+ GDVLYV P N+++ + +S LK+EP +
Sbjct: 65 AARAGVSLEDTERTLNALAADSQGVLQVSEAGDVLYVLPQNFKSIIQGRSVLLKLEPALA 124
Query: 164 KAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSG---FNIFI 220
+AK+AA Y++RV FGTAL+ASIV+V +IAIL++ S D RR S+ G F+ FI
Sbjct: 125 RAKSAAGYAVRVSFGTALVASIVLVSLTVIAILTAASSSDRDDRRSSSYGYGRSPFSTFI 184
Query: 221 SPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYI 280
+ SDLF+YWDPYY RRR V + +MNF +SVFSFVFG+GDPN +E+RWK++G+ I
Sbjct: 185 NVSDLFFYWDPYYSRRRAVYRQNPG-EMNFFESVFSFVFGDGDPNLVHDEQRWKMVGQLI 243
Query: 281 ASNGGVVTAEELAPYLDID-------RTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPS 333
S GGVVTAE+LAP++D+ ++ESY++P L+RF+G PE+ + G +LY FPS
Sbjct: 244 QSKGGVVTAEQLAPFMDLSPDDLEKIDGYTNESYMVPALVRFNGHPEVSQSGELLYVFPS 303
Query: 334 FQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGV 393
QRTA +QR VG DA E++W F+ + +R AI LG N GV
Sbjct: 304 LQRTARTQRT-----VGPPAKDAA-------LERRWNFTNASEGQRLGAIALGVANCVGV 351
Query: 394 IILGAMLQEMAVT---PNGFLKFVAYIFPLLQ 422
+ LG++L + V+ L F+ +FP LQ
Sbjct: 352 LYLGSLLAQPGVSQMLAQSSLGFMNNLFPFLQ 383
>gi|412988514|emb|CCO17850.1| predicted protein [Bathycoccus prasinos]
Length = 554
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 170/363 (46%), Positives = 235/363 (64%), Gaps = 17/363 (4%)
Query: 69 GPGRIVESDKLPADVRNRAMDAVDAC-NRRVTIGDVAGKAGLKLNEAQKALQALAADTDG 127
GPG +ESD + +VR+ ++A+D NRRVT GDVA +G++L +A +AL ALAADT+
Sbjct: 107 GPGGRIESDAIQRNVRDSVINAIDQSQNRRVTAGDVASSSGIQLFDATQALTALAADTNA 166
Query: 128 FLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVI 187
LEV+++GD++Y FP+ Y+ L AKSF+LK EP +DK K Y +RV FGT L+AS+VI
Sbjct: 167 SLEVTNDGDLVYAFPSGYKQLLQAKSFKLKTEPTVDKVKEFLSYLVRVSFGTTLVASVVI 226
Query: 188 VFTAIIAILSSKSDDDDRGRRRRSFDSGFNIF---ISPSDLFWYWDPYYYRRRRVQTDDD 244
V+T I A+LSS+ DDR RR S G F P D+FWY DPYYYRR D
Sbjct: 227 VYTTIFALLSSQR--DDRNDRRSSRGGGMMFFGPRFYPGDIFWYLDPYYYRRPYRLRAKD 284
Query: 245 DKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRT--M 302
+ MNF ++VFSFVFG+GDPN E +RW LIG+ IA N GVVT E+LAP+LD + + +
Sbjct: 285 E--MNFFEAVFSFVFGDGDPNNDFERERWALIGQTIAKNEGVVTGEQLAPFLDREGSAEL 342
Query: 303 SDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEK 362
SDES+VLPVL RF+G PE+D+ GNI YRFP+ Q TA ++ ++ + R A + G+ K
Sbjct: 343 SDESFVLPVLTRFEGSPEMDDSGNIFYRFPAAQVTALEKK--QQNRLKRDSASSTNGLAK 400
Query: 363 IFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNG--FLKFVAYIFPL 420
E++W FS + S++ M+ LG N GVI L +++ + V ++ V P
Sbjct: 401 ---EERWSFSLADPSQKFMSAALGVANFVGVIWLSSLMTDPQVLYRNAELVQSVGGFLPA 457
Query: 421 LQL 423
LQ+
Sbjct: 458 LQV 460
>gi|308803518|ref|XP_003079072.1| Mitochondrial Fe-S cluster biosynthesis protein ISA2 (contains a
HesB-like domain) (ISS) [Ostreococcus tauri]
gi|116057526|emb|CAL51953.1| Mitochondrial Fe-S cluster biosynthesis protein ISA2 (contains a
HesB-like domain) (ISS) [Ostreococcus tauri]
Length = 468
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 164/348 (47%), Positives = 219/348 (62%), Gaps = 19/348 (5%)
Query: 63 DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALA 122
D + V PG IVESD L D+R R + A++ N R T GDV+ +G + Q+AL ALA
Sbjct: 32 DARLAVAPGSIVESDSLSTDLRERTLKAIEKANYRATAGDVSSISGASAYDTQRALNALA 91
Query: 123 ADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALI 182
ADT LEVS GD+ YVFP + R LA+KSF++K EP ++ K Y RV FGT+L+
Sbjct: 92 ADTRATLEVSSVGDLTYVFPRSSRGILASKSFKMKWEPFLNGTKGVLSYLFRVAFGTSLV 151
Query: 183 ASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTD 242
AS++IV+TAI A+++S D RRSF G +ISP DLFWYWDPYYYRRRR+
Sbjct: 152 ASVMIVYTAIFALMASSKSGDRDRDDRRSF-GGPRFYISPFDLFWYWDPYYYRRRRL--- 207
Query: 243 DDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTM 302
+++MNF ++VFSFVFG+GDPN E+ RW+L+G I N GV+TAE+LAPYLD
Sbjct: 208 --NREMNFFEAVFSFVFGDGDPNIDFEKMRWELVGRAIRKNKGVMTAEQLAPYLDTA-GY 264
Query: 303 SDESYVLPVLLRFDGQPEIDE-EGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVE 361
DES+VLP L RF+G PE++ G+I+YRFP+ + TA G R +DA+
Sbjct: 265 EDESFVLPALTRFEGTPEVNSVTGSIIYRFPNMESTA-----GTTSTRSRGESDALA--- 316
Query: 362 KIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNG 409
+E++W FSK ++R A LG NL GVIIL M+ + + G
Sbjct: 317 ---QEERWVFSKAEAAQRFQAGLLGVANLVGVIILANMINDPQIMYRG 361
>gi|303287751|ref|XP_003063164.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454996|gb|EEH52300.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 553
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 155/334 (46%), Positives = 209/334 (62%), Gaps = 20/334 (5%)
Query: 70 PGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFL 129
P +VES+ + +R +A+++ VT+GDVA AG+KL++A++A+ ALAADT L
Sbjct: 109 PRSVVESESVAKSIREPVENAIESLGLAVTVGDVAAAAGVKLSDAERAMTALAADTGAAL 168
Query: 130 EVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVF 189
EVS EGD+LYVF +R+KLA+KS R++ EP ++ A Y RV FGT L+ASI IV+
Sbjct: 169 EVSSEGDLLYVFDAGFRSKLASKSLRIRAEPALETAGKVGSYLARVTFGTTLVASIAIVY 228
Query: 190 TAIIAILSSKSDDDDRGRRRRSFDSGF---NIFISPSDLFWYWDPYYYRRRRVQTDDDDK 246
TAI A+LS++ D D R R F ++ SP D+FWY+DPYYY RR +
Sbjct: 229 TAIAALLSNRDDRDRRDDRGGGGGGMFFGPRMYFSPFDVFWYFDPYYYERRAYNAAREGA 288
Query: 247 K-MNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDID-RTMSD 304
K MNF ++VFSFVFG+GDPN+ E KRW L G I +GGVV A++LAPYL+ D D
Sbjct: 289 KDMNFFEAVFSFVFGDGDPNKDFEAKRWALAGLAIQKSGGVVAADQLAPYLERDPNDPDD 348
Query: 305 ESYVLPVLLRFDGQPEIDE-EGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKI 363
ES+VLP L+RF+G PE+DE G I+YRF S + T G +Y G A A
Sbjct: 349 ESFVLPALVRFNGAPEVDESSGEIVYRFESMEAT------GGGKYGGFSTAMA------- 395
Query: 364 FREKKWEFSKTNMSERGMAIGLGGLNLFGVIILG 397
E+ + FS ++R MA GLG LN GV++LG
Sbjct: 396 -EEEPYVFSLATQAQRAMAAGLGALNFVGVVVLG 428
>gi|145346304|ref|XP_001417632.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577859|gb|ABO95925.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 409
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 203/316 (64%), Gaps = 18/316 (5%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ A++ R T+GDV+ +G + E QKAL ALAADT LEVS GD+ YVFP + R
Sbjct: 1 LKAIEKAQYRATVGDVSSISGASVFETQKALNALAADTGATLEVSSAGDLTYVFPRSTRG 60
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L++KSF++K+EP++ AK+ A Y RV FGT+L+AS++IV+TAI A++S+KS DD R
Sbjct: 61 ILSSKSFKMKIEPIVSGAKSLASYVFRVAFGTSLVASVMIVYTAIFALMSAKSSDDRSDR 120
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQG 267
R F +G +ISP DLFWYWD + D++MNF ++VFSFVFG+GDPN
Sbjct: 121 RGGGF-AGPRFYISPFDLFWYWD----PYYYRRPRRRDREMNFFEAVFSFVFGDGDPNLD 175
Query: 268 IEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDGQPEIDE-EGN 326
E++RW+L+G I N GVVTAE+LAP+LD DES+VLP L RF+G PE++ G+
Sbjct: 176 FEKRRWELVGRLIQKNKGVVTAEQLAPFLDTA-GYEDESFVLPALTRFEGTPEVNTVTGS 234
Query: 327 ILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLG 386
I+YRFP+ + TA G+ A + G E + +E++W FS S++ A LG
Sbjct: 235 IIYRFPNMESTA-----------GKTTARSRGESEALAQEERWVFSLAEPSQKIQAGLLG 283
Query: 387 GLNLFGVIILGAMLQE 402
NL GV+IL M+ +
Sbjct: 284 VANLVGVVILAQMIND 299
>gi|307104444|gb|EFN52698.1| hypothetical protein CHLNCDRAFT_138698 [Chlorella variabilis]
Length = 516
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 161/376 (42%), Positives = 225/376 (59%), Gaps = 36/376 (9%)
Query: 68 VGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDG 127
G G ++ S +L DVR RA A+ RVTIGDVA AGL+L++A+ A++ALAAD+
Sbjct: 25 AGKGELL-SPRLDPDVRQRAARAISQRGGRVTIGDVASTAGLQLDQAEAAVKALAADSQA 83
Query: 128 FLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVI 187
L VS +GD++Y F + A +A+KS L++EPV +A EY +RV FGTALIAS+++
Sbjct: 84 TLAVSAQGDIVYAFAPGFEAAIASKSLLLRLEPVAAGFQAGLEYLLRVAFGTALIASVML 143
Query: 188 VFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISP------SDLFWYWDPYYYRRRRVQT 241
VF AI ++SS S DD RS G F P +DL WYWDP++YR RR +
Sbjct: 144 VFLAITVLMSSASSRDDNRGGGRSGGGGGGGFFGPRVYFDLTDLLWYWDPFWYRNRRDRM 203
Query: 242 DDDDKK--MNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDID 299
+ + MNF++++FS+VFG+GDPN ++KRW+ IG YIA+ GG V AEELAP+LD+
Sbjct: 204 AREQRPGGMNFLEAIFSWVFGDGDPNADFDKKRWQAIGRYIAARGGTVVAEELAPFLDLQ 263
Query: 300 ---------RTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVG 350
R DESY+LPVL RF+G P +D GNI+Y FP Q+TA + G
Sbjct: 264 PGQLAADRGRITVDESYMLPVLARFNGSPRVDASGNIVYEFPELQQTAGA---GAPPPPK 320
Query: 351 RRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQE----MAVT 406
+R A RE +W+ + + ++ A+ LG +NL GV +L MLQ+ +A+
Sbjct: 321 QRSA----------REARWQLTAASAGQKLGAVALGAVNLIGVGVLTVMLQDPVSKLALA 370
Query: 407 PNGFLKFVAYIFPLLQ 422
NG L V + P LQ
Sbjct: 371 QNGLLGVVG-LMPWLQ 385
>gi|158338473|ref|YP_001519650.1| hypothetical protein AM1_5375 [Acaryochloris marina MBIC11017]
gi|158308714|gb|ABW30331.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 429
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 215/345 (62%), Gaps = 22/345 (6%)
Query: 83 VRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFP 142
+ ++ M AV+ N RVT+GDVA +AGL +++ ++ L ALA++ G L+VS+ GD+ Y+FP
Sbjct: 3 LNSKIMTAVEQLNYRVTVGDVATQAGLDIDQTERGLLALASEVSGNLQVSEAGDIAYLFP 62
Query: 143 NNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDD 202
N+R+ L K +RL+ + + DK A Y IR+ FG L+ SIV++F AI IL + S
Sbjct: 63 KNFRSVLRNKYWRLRFQALWDKIWPAIFYLIRISFGIFLVLSIVLIFVAIAIILIAMSSQ 122
Query: 203 DDRGRRRRSFDSGFNIFISPSDLFWYWDPY--YYRRRRVQTDDDDKKMNFIKSVFSFVFG 260
DDR RRS +I+P DLFW+++P Y+R+ + D +MNF++++FSF+FG
Sbjct: 123 DDRSDNRRSGSFIPRFWITP-DLFWFFNPNPRYHRQHVRRQPKDPDQMNFLEAIFSFLFG 181
Query: 261 EGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI-DRTMSDESYVLPVLLRFDGQP 319
+G+PN +E++RW+ IG I +NGG VT E+L PYL++ D +E YV+PVL FDG+P
Sbjct: 182 DGNPNANLEDRRWREIGTIIRTNGGAVTGEQLTPYLEVSDHQRENEDYVIPVLTHFDGRP 241
Query: 320 EIDEEGNILYRFPSFQRTA-ASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSE 378
E+ +G I+Y FP+ Q TA A Q+ Y+ +E W+FS+ +
Sbjct: 242 EVSPQGEIIYHFPTLQTTAKARQQRASSTYL---------------QEYPWKFSQAGSGQ 286
Query: 379 RGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
+AIGLGGLNL G +L ++L++ +A+ G + FV IF +L
Sbjct: 287 ILLAIGLGGLNLVGAAVLWSLLRDGTIAIQLGGLVGFVYSIFGVL 331
>gi|359458010|ref|ZP_09246573.1| hypothetical protein ACCM5_04751 [Acaryochloris sp. CCMEE 5410]
Length = 422
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 216/342 (63%), Gaps = 26/342 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ N RVT+GDVA +AGL +++ ++ L ALA++ G L+VS+ GD+ Y+FP N+R+
Sbjct: 1 MTAVEQLNYRVTVGDVATQAGLDIDQTERGLLALASEVSGNLQVSEAGDIAYLFPKNFRS 60
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K +RL+++ + DK A Y IR+ FG L+ SIV++F AI IL + + DDR
Sbjct: 61 VLRNKYWRLRLQALWDKIWPAIFYLIRISFGIFLVLSIVLIFVAIAIILIAINSQDDRSD 120
Query: 208 RRRSFDSGF--NIFISPSDLFWYWDPY--YYRRRRVQTDDDDKKMNFIKSVFSFVFGEGD 263
RRS GF +I+P DLFW+++P Y+R+ + D +MNF++++FSF+FG+G+
Sbjct: 121 NRRS--GGFIPRFWITP-DLFWFFNPNPRYHRQHARRQPKDPDQMNFLEAIFSFLFGDGN 177
Query: 264 PNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI-DRTMSDESYVLPVLLRFDGQPEID 322
PN +E++RW+ IG I +NGG VT E+L PYL++ DR +E YV+PVL FDG+PE+
Sbjct: 178 PNANLEDRRWQEIGTVIRTNGGAVTGEQLTPYLEVSDRQHENEDYVIPVLTHFDGRPEVS 237
Query: 323 EEGNILYRFPSFQRTA-ASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
+G I+Y FP+ Q TA A Q+ Y+ +E W+FS+ + +
Sbjct: 238 PQGEIIYHFPTLQTTAKARQQRASSTYL---------------QEYPWKFSQAGSGQILL 282
Query: 382 AIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
AIGLGGLNL G +L ++L++ +A+ G + FV IF +L
Sbjct: 283 AIGLGGLNLVGAAVLWSLLRDGTIAIQLGGLVGFVYSIFGVL 324
>gi|255089394|ref|XP_002506619.1| predicted protein [Micromonas sp. RCC299]
gi|226521891|gb|ACO67877.1| predicted protein [Micromonas sp. RCC299]
Length = 437
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 211/347 (60%), Gaps = 14/347 (4%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
MD++ + + T+GDVA AG+KL++A+ A++A+AADT LEVS +GD+LYVF ++R+
Sbjct: 1 MDSIQSLGGKCTVGDVASAAGVKLSDAENAMKAIAADTGATLEVSAQGDILYVFDRDFRS 60
Query: 148 KLAAKSFRLK-VEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRG 206
L AKS ++K VEP+++ A Y +R+ FGT L+ASIVIV+TAI A+LS++ + D
Sbjct: 61 LLNAKSTKIKTVEPLVEGAGKVGGYLLRISFGTTLLASIVIVYTAIAALLSNRDERDRDE 120
Query: 207 RRRRSFDSGF----NIFISPSDLFWYWDPYYYRRRRVQTD-DDDKKMNFIKSVFSFVFGE 261
RR G ++ SP D+FWYWDPYYY RR + K M+F+++VFSFVFG+
Sbjct: 121 RRGGGMGGGMFFGPRMYFSPFDMFWYWDPYYYERRSYYAAMEGAKDMDFLEAVFSFVFGD 180
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRT---MSDESYVLPVLLRFDGQ 318
GDPN E KRW L+G I N GVVTAE+LAP+LD D DES+VLP L RF+G
Sbjct: 181 GDPNADFERKRWALVGLCIQRNNGVVTAEQLAPFLDRDEDSIGTDDESFVLPALTRFNGA 240
Query: 319 PEID-EEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMS 377
PE+D G I+YRF + TA + + V +G + E++++FS
Sbjct: 241 PEVDPASGEIVYRFEDLESTAGG--VAAIQAVLDEIPRELGVTTSVAEEERYKFSLATGG 298
Query: 378 ERGMAIGLGGLNLFGVIILGAMLQ--EMAVTPNGFLKFVAYIFPLLQ 422
+R MA LG N GV+ LG + ++A+ + V + P LQ
Sbjct: 299 QRTMAAALGVFNFVGVVALGVLSSDPQIAMQKAQLVAAVGSLLPGLQ 345
>gi|427722354|ref|YP_007069631.1| hypothetical protein Lepto7376_0358 [Leptolyngbya sp. PCC 7376]
gi|427354074|gb|AFY36797.1| hypothetical protein Lepto7376_0358 [Leptolyngbya sp. PCC 7376]
Length = 436
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 138/348 (39%), Positives = 216/348 (62%), Gaps = 29/348 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ R T+G+VA +AGL + AQ L +LA++ G L+V++ GD++Y FP+N+R+
Sbjct: 8 MKAVETLQYRATVGEVASQAGLSVQLAQNQLASLASEAGGNLQVAETGDIIYEFPSNFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIA-ILSSKSDDDDRG 206
L++KSF+L+++ +DK A Y IR+ FG AL+ASI I+ AI+A I ++ S DD+
Sbjct: 68 ILSSKSFKLRLKAPLDKVWQAVFYIIRISFGVALVASIAIMTIAILALIFAANSRDDNNN 127
Query: 207 RRRRS--FDSGFNIFISPSDLFWYWDPYYYRR-------RRVQT--DDDDKKMNFIKSVF 255
R R + + I+ P DLF + P YYRR RVQT + D +MNF++SVF
Sbjct: 128 SRSRGGGIPTTWLIWWGP-DLFRMFTPGYYRRGYGRSPQLRVQTAGNSQDSEMNFLESVF 186
Query: 256 SFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRF 315
SF+FG+GDPN +E++RW+ IG+ I +N G V AE++APY D ++++E++++ V+ RF
Sbjct: 187 SFLFGDGDPNPNLEDRRWQSIGQVIQNNDGAVIAEQIAPYFDDLDSIAEENHMMAVMARF 246
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G PE+ EG ++Y FP Q A + RK+ V E +W F+
Sbjct: 247 NGYPEVSPEGELIYYFPELQVKAKER---RKQ-----------SVPIFLEENRWTFTLAP 292
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
++++ +AIGLGG+NL ++LG MLQ+ AV GF+ V+ ++P L
Sbjct: 293 VNQKFLAIGLGGVNLVLALVLGTMLQDPAVVAQIGGFIGLVSALYPAL 340
>gi|428781173|ref|YP_007172959.1| hypothetical protein Dacsa_3071 [Dactylococcopsis salina PCC 8305]
gi|428695452|gb|AFZ51602.1| hypothetical protein Dacsa_3071 [Dactylococcopsis salina PCC 8305]
Length = 437
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 139/346 (40%), Positives = 206/346 (59%), Gaps = 28/346 (8%)
Query: 89 DAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAK 148
++++ N RVT+GDV+ + GLKL E QK L ALAAD G L++S+ GD+ Y+FP N+R+
Sbjct: 9 NSIEKLNYRVTVGDVSAETGLKLQETQKGLVALAADAGGHLQISETGDITYLFPQNFRSI 68
Query: 149 LAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAII-----AILSSKSDDD 203
L K FRL+++ DK Y +R+ FG LIASI+++ I A ++S D
Sbjct: 69 LRNKYFRLRLKEWWDKISGVLFYLLRISFGIVLIASIILIVITISIIIIGAQMNSDEGDA 128
Query: 204 DRGRRRRSFDSGFNIFISPSDLFWYWDPYYY---RRRRVQTDDDDKK--MNFIKSVFSFV 258
D G F + P DLFW++ P YY R+ + +K ++F++++FSF+
Sbjct: 129 DNGGGGGFSFGFFPFWFGP-DLFWFFSPGYYEDEEEERISRRNTRQKSELSFLEAIFSFL 187
Query: 259 FGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-ID-RTMSDESYVLPVLLRFD 316
FG+G+PN +EEKRW+ IG I +NGG V E++APYLD ID ++ DE Y+LPVL RF+
Sbjct: 188 FGDGNPNGKLEEKRWQEIGAVIRNNGGAVVGEQIAPYLDNIDQKSWEDEDYMLPVLTRFN 247
Query: 317 GQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNM 376
G PE+ E+GNI+Y FP Q TA +R VE +EK W F+K
Sbjct: 248 GVPEVTEDGNIIYYFPELQVTATEKR--------------KQSVESYLQEKSWRFTKATS 293
Query: 377 SERGMAIGLGGLNLFGVIILGAMLQ-EMAVTPNGFLKFVAYIFPLL 421
+++ +AIGLGG NL ++LG++L E+A G + FV I+ ++
Sbjct: 294 TQKMIAIGLGGANLVLALMLGSLLSGELAAQMGGLVAFVNGIYGII 339
>gi|428774719|ref|YP_007166506.1| hypothetical protein PCC7418_0036 [Halothece sp. PCC 7418]
gi|428688998|gb|AFZ42292.1| hypothetical protein PCC7418_0036 [Halothece sp. PCC 7418]
Length = 441
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 201/349 (57%), Gaps = 30/349 (8%)
Query: 89 DAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAK 148
++++ + RVT+GDVA + GLKL +AQ+ L ALAAD G L+VS+ GD+ Y+FP N+R
Sbjct: 9 NSIEKLDYRVTVGDVAAETGLKLQDAQQGLVALAADAGGHLQVSETGDITYLFPENFRGI 68
Query: 149 LAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL------SSKSDD 202
L K RL+++ DK Y IR+ FG LIASIV++ I I+ S + D
Sbjct: 69 LRNKYLRLRLKEWWDKIWGVLFYLIRISFGIILIASIVLIAITISLIVIGVQMNSDEGDS 128
Query: 203 DDRGRRRRSFDSGFNIFISPSDLFWYWDPYYY-------RRRRVQTDDDDKKMNFIKSVF 255
D+ F GF F DLFW++ P YY RR R +++F++++F
Sbjct: 129 DEGIGGGGGFSFGFFPFWFGPDLFWFFSPGYYEYDEPIERRERKGNRKQKSELSFLEAIF 188
Query: 256 SFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRT-MSDESYVLPVLL 313
SF+FG+G+PN +EE+RW+ IG I N G V E++APYLD ID T DE Y+L VL
Sbjct: 189 SFLFGDGNPNAKLEERRWQEIGSVIRQNRGAVVGEQIAPYLDEIDTTSWEDEDYMLSVLT 248
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RF+G PE+ E+G I+Y FP Q TA + V E+KW+F+K
Sbjct: 249 RFNGIPEVTEDGQIIYYFPDLQVTAT--------------GNNKAPVPSYLEERKWQFTK 294
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQ-EMAVTPNGFLKFVAYIFPLL 421
+ +++ +AIGLGG+NL ++LG++L E+A G + FV I+ +L
Sbjct: 295 ASSTQKIIAIGLGGVNLILALMLGSLLSGEIAAQMGGLVAFVNGIYGIL 343
>gi|434396844|ref|YP_007130848.1| hypothetical protein Sta7437_0266 [Stanieria cyanosphaera PCC 7437]
gi|428267941|gb|AFZ33882.1| hypothetical protein Sta7437_0266 [Stanieria cyanosphaera PCC 7437]
Length = 431
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/342 (37%), Positives = 202/342 (59%), Gaps = 24/342 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +V+ N RVT+GDVA AGL +N AQ+ L ALA+D G L+V++ G++ Y+FP N+R
Sbjct: 8 VKSVEQLNYRVTVGDVASLAGLDVNLAQQGLLALASDAGGHLQVAESGEIAYLFPRNFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ ++ Y IR+ FG LIASI+++ AI ++ S S D
Sbjct: 68 ILRNKYWKLRLQQWWERVWRVLFYIIRISFGIVLIASILLMLVAIAVVIISISSSRDDND 127
Query: 208 RRRSFDSGFNIFISPS-----DLFWYWDP-YYYRRRRVQTDDDDKKMNFIKSVFSFVFGE 261
G F P DLFW++DP Y YRRR+ + + +MNF+++VFSF+FG+
Sbjct: 128 GGGRDSYGGGYFFLPRFWIGPDLFWFFDPDYNYRRRQRRKSTANYQMNFLEAVFSFIFGD 187
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDGQPEI 321
G+PN +EEKR+ IG+ I ++GG V AE++APYL DR +DE Y+LPVL RF+G PE+
Sbjct: 188 GNPNADLEEKRYAAIGQVIRNHGGAVIAEQIAPYL--DRVDADEDYMLPVLSRFNGYPEV 245
Query: 322 DEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
+G I+Y FP Q TA ++ + +E+ W F++ + + +
Sbjct: 246 SPQGEIIYYFPELQVTATERK--------------PQPISAYLKEQLWRFTQASSGQVML 291
Query: 382 AIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
+IGLG +NL ++LG++L++ +A G + F I+ LL
Sbjct: 292 SIGLGTVNLVLALVLGSLLRDGTIAAQLGGVVAFAGSIYWLL 333
>gi|428201797|ref|YP_007080386.1| hypothetical protein Ple7327_1444 [Pleurocapsa sp. PCC 7327]
gi|427979229|gb|AFY76829.1| hypothetical protein Ple7327_1444 [Pleurocapsa sp. PCC 7327]
Length = 431
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 202/343 (58%), Gaps = 26/343 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA KAGL++N AQ+ L ALA+D G ++VSD G+++Y+FP ++R
Sbjct: 8 MKAVEQSGYRVTVGDVAAKAGLEVNLAQQGLLALASDAGGHMQVSDVGEIVYLFPKDFRV 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ +K Y IR+ FG LIASIV++ AI+ ILSS D+++
Sbjct: 68 ILRNKYWQLQLKEWWEKVWKVVFYLIRISFGIVLIASIVLMTIAILIILSSSRDNENSS- 126
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY-YRRRRVQTDDDD------KKMNFIKSVFSFVFG 260
RS G S D+FW DP Y Y R R + D K+MNF++++FSF+FG
Sbjct: 127 -DRSDGRGMIFLPSYWDIFWILDPGYDYNRDRYRQQSADNASESRKQMNFLEAIFSFLFG 185
Query: 261 EGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD-ESYVLPVLLRFDGQ 318
+G+PN +EE+RW+ IG I ++GG V A+++APYLD I+ S+ E Y+LPVL RF+G
Sbjct: 186 DGNPNFNLEERRWQQIGTVIRNSGGAVVAQQIAPYLDNINAYNSENEDYILPVLARFNGY 245
Query: 319 PEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSE 378
P++ G I+Y FP Q TA Q V REK W FS+ +
Sbjct: 246 PQVSPAGEIIYYFPQLQVTARKQE--------------QQSVAPYLREKLWRFSEAGGGQ 291
Query: 379 RGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
+A+GLG IILG++L++ V G + FV+ I+ +L
Sbjct: 292 ITLAVGLGIAQFVLAIILGSLLRDYPVA-GGLIGFVSAIYWVL 333
>gi|86609086|ref|YP_477848.1| hypothetical protein CYB_1624 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86557628|gb|ABD02585.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 446
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/346 (37%), Positives = 204/346 (58%), Gaps = 34/346 (9%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
MDAV+ RVT+GD+A GL L+EA++ L LA +G L+VS +G++ YVFP ++R+
Sbjct: 8 MDAVEQLGLRVTLGDIASSTGLALHEAKQELLNLAQKAEGHLQVSRQGEITYVFPPDFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAI-LSSKSDDDDRG 206
L K + ++ + + Y +R+ FG L+ SIV++ A+IA+ ++S + D+R
Sbjct: 68 ILKRKEQQDRLAALRRRLWGGFLYGLRISFGILLVVSIVLIVLALIALQMASSREQDNRS 127
Query: 207 RRRRSFDSGFNIFISPSDLFW----YWDPY----YYRRRRVQTDDDDKKMNFIKSVFSFV 258
R R G + P+ W +WDPY Y R ++NF+++V+SF+
Sbjct: 128 RSR-----GIGFYYFPN--LWIGNPFWDPYPYYGSYSGRTRPRQPQKSELNFLEAVYSFL 180
Query: 259 FGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDID---RTMSDESYVLPVLLRF 315
FG+GDPN +EE+R+ LIG+ I +NGGV+ AE++ PYLD++ + E Y+LP+LL+F
Sbjct: 181 FGDGDPNANLEEERYALIGQLIRANGGVIAAEQVLPYLDVEPGSPALEYEDYMLPILLKF 240
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
DGQPE+ E+G+I+YRFP Q AAS+R + G+ ++ +EK W FS+
Sbjct: 241 DGQPEVSEDGDIVYRFPELQ-VAASERKPK-------------GIPEVLQEKPWVFSRAT 286
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
+ +A GLG LN F IL +E+A G F+A+I+PL+
Sbjct: 287 PGQLTLAGGLGVLNFFLAAILYGAREEVAAA-AGSNAFLAFIYPLI 331
>gi|428311615|ref|YP_007122592.1| hypothetical protein Mic7113_3457 [Microcoleus sp. PCC 7113]
gi|428253227|gb|AFZ19186.1| hypothetical protein Mic7113_3457 [Microcoleus sp. PCC 7113]
Length = 445
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 140/357 (39%), Positives = 210/357 (58%), Gaps = 36/357 (10%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNY 145
R M AV+ RVT+GDVA + GL +N A++ L ALA++ G L+V++ G++ Y+FP ++
Sbjct: 6 RIMKAVEQLGYRVTVGDVAAQVGLNINLAEQGLLALASEAGGHLQVAESGEIAYLFPKHF 65
Query: 146 RAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAI----IAILSSKSD 201
RA L K R++++ K Y IR+ FG LI SIV++F AI IAI +S SD
Sbjct: 66 RAILRNKFLRVRLQEWWQKIWRVLFYLIRISFGVILILSIVLIFVAIAVILIAISASNSD 125
Query: 202 DDDRGRRRRSFDSGFNIFISP-----SDLFWYWDPYY-------YRRRRVQTDDDDKKMN 249
+D G R SG IF P SD++W++ P Y R+R+ +M+
Sbjct: 126 NDSGGGDSRH-SSGGGIFFMPNFWLWSDMWWFFSPGYDPYDRRHRRQRKSSRSSSGSEMS 184
Query: 250 FIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL-DIDRTMSDES-- 306
F++S+FSF+FG+G+PN +E++RW+ IG I + GG V AE++APYL DI ES
Sbjct: 185 FLESIFSFLFGDGNPNADLEDRRWQEIGTVIRNFGGAVAAEQIAPYLDDIPEGYKRESED 244
Query: 307 YVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFRE 366
Y+LPVL RF+G+PE+ +G+I+Y FP Q TA+S W V +E
Sbjct: 245 YMLPVLTRFNGRPEVSPDGDIVYHFPDLQTTASS------------WQPQ--PVPPYLQE 290
Query: 367 KKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
K W FS+ + + +AIGLGG+N+ G ++LG++L++ +A G + FV I+ LL
Sbjct: 291 KLWRFSRASSGQNMLAIGLGGVNIIGALMLGSLLRDGLIAAQIGGLIAFVQSIYWLL 347
>gi|22298996|ref|NP_682243.1| hypothetical protein tll1453 [Thermosynechococcus elongatus BP-1]
gi|22295178|dbj|BAC09005.1| tll1453 [Thermosynechococcus elongatus BP-1]
Length = 435
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 144/350 (41%), Positives = 207/350 (59%), Gaps = 25/350 (7%)
Query: 83 VRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFP 142
V R M AVD RVT GDVA GL L A+K L ALAAD G L+V++ GD++YVFP
Sbjct: 3 VDRRLMQAVDRLGYRVTAGDVASTVGLPLQTAEKGLMALAADVGGTLQVAESGDIVYVFP 62
Query: 143 NNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKS-- 200
N+R+ L AK ++L++ V + Y IR+ FG LI SIV++F AI IL + S
Sbjct: 63 RNFRSILQAKYWQLRLREVAQRVWGIVFYLIRISFGIFLIISIVLIFLAIAIILIALSSQ 122
Query: 201 --DDDDRGRRRRSFDSGFNIFISPSDLFWYW----DPYYYRRRRVQTDDDDKKMNFIKSV 254
DDD+RG+ R G N ++ P FWY+ P R ++ ++MNF + V
Sbjct: 123 GRDDDNRGQDRSWGGGGINFWVFPD--FWYFFGDSSPRRSRSPSRGNGEEIEEMNFFEGV 180
Query: 255 FSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRT-MSDESYVLPVLL 313
+SF+FG+G+PN +EE+RW+LIG+ I +N G VTAE++APYLD++ DE Y+LPVL
Sbjct: 181 YSFLFGDGNPNADLEERRWQLIGQVIRNNSGAVTAEQIAPYLDLETDPGDDEWYMLPVLT 240
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
F+G+P++ E G I+Y FP Q TA + R G+++ + I E+ W FS+
Sbjct: 241 HFNGRPQVTETGQIIYCFPELQTTAQT-RQGQQQR-----------LPPILEEQLWRFSR 288
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
+ +AIGLG +NL G ++L +L + A+ G + FVA IF +L
Sbjct: 289 APAWQIMVAIGLGCVNLIGALMLWYLLGDGAIAQELGGLVAFVASIFWIL 338
>gi|282901601|ref|ZP_06309520.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281193527|gb|EFA68505.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 423
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 137/341 (40%), Positives = 203/341 (59%), Gaps = 23/341 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M A++ RVT+GDVA ++GL L EA + + LAAD G L+V+D GD++Y FP N+R
Sbjct: 8 MGAIEKLGYRVTVGDVAARSGLGLAEANEGMLVLAADAGGHLQVADTGDIVYQFPQNFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K +L+++ K Y IR+ FG LIASIVI+ II I+++ S + D
Sbjct: 68 ILRNKYVQLRLQEWWAKIWQVLFYIIRISFGVLLIASIVIITLTIIIIITASSSNRDEDN 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-YYYRRRRVQTDDD--DKKMNFIKSVFSFVFGEGDP 264
R F GF+IF P DLFWY+ P YY + RR++ ++ + ++NF+++VFSF+FG+G+P
Sbjct: 128 RDGGF-RGFDIFFFP-DLFWYFSPNYYSQERRIERKENRGNGELNFLEAVFSFLFGDGNP 185
Query: 265 NQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQPEI 321
N +EE+RW+ IG I +N G V AE++APYLD E Y+LPVL+R++G+P++
Sbjct: 186 NSKLEERRWQQIGSVITNNQGAVVAEQIAPYLDNLGEKYQQEYEDYMLPVLVRYNGKPQV 245
Query: 322 DEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
+G I+Y FP Q AS++I V+ E W+FS N + M
Sbjct: 246 SPDGQIVYYFPELQ-VQASKKIDEP-------------VDLFLEENPWQFSAANSGQIMM 291
Query: 382 AIGLGGLNLFGVIILGAML-QEMAVTPNGFLKFVAYIFPLL 421
+ GLG LNL G ++LG +L Q + + G + FV I+ LL
Sbjct: 292 SAGLGALNLVGALVLGNLLTQTVGLEAGGLVGFVQSIYWLL 332
>gi|86606445|ref|YP_475208.1| hypothetical protein CYA_1792 [Synechococcus sp. JA-3-3Ab]
gi|86554987|gb|ABC99945.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 447
Score = 228 bits (580), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 129/344 (37%), Positives = 201/344 (58%), Gaps = 29/344 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+DAV+ RVT+GD+A GL L EA++ L LA G L+VS +G++ YVFP ++R
Sbjct: 10 IDAVEQLGLRVTLGDIASSTGLALEEAKQELLKLAQKAGGHLQVSRQGEIAYVFPPDFRD 69
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L + + ++ + K Y +R+ FG L+ SI+++ A+IA+ + S ++ G
Sbjct: 70 ILKRREQQDRLAALRRKLWGGFLYGLRISFGILLVVSIILITLALIALQMASSREEGSGS 129
Query: 208 RRRSFDSGFNIFISPSDLFW----YWDPYYY---RRRRVQTDDDDKKMNFIKSVFSFVFG 260
R RS + GF F + W +W PY Y R Q + +MNF+++V+SF+FG
Sbjct: 130 RSRSREVGFFYFPN----LWVGNPFWSPYPYYGGYSGRAQPRREKSEMNFLEAVYSFLFG 185
Query: 261 EGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDID---RTMSDESYVLPVLLRFDG 317
+GDPN +EE+R+ LIG+ I +NGGV+ AE++ PYLD++ + E Y+LP+LL+FDG
Sbjct: 186 DGDPNANLEEERYALIGQVIRANGGVIAAEQVLPYLDVEPGSPALEYEDYMLPILLKFDG 245
Query: 318 QPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMS 377
QPE+ E+G+I+YRFP Q AAS+R +K + ++ +EK W FS+
Sbjct: 246 QPEVSEDGDIVYRFPELQ-VAASERRSKK-------------IPEVLQEKPWVFSRATPE 291
Query: 378 ERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
+ +A GLG LN IL +E+A G F+A+I+PL+
Sbjct: 292 QLTLAAGLGVLNFLLAAILYGAREEVAAA-AGSNAFLAFIYPLI 334
>gi|443325415|ref|ZP_21054111.1| hypothetical protein Xen7305DRAFT_00040910 [Xenococcus sp. PCC
7305]
gi|442794969|gb|ELS04360.1| hypothetical protein Xen7305DRAFT_00040910 [Xenococcus sp. PCC
7305]
Length = 437
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 131/350 (37%), Positives = 205/350 (58%), Gaps = 33/350 (9%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNY 145
+ MD+VD N RVT+GDVA +AGL LN AQ L ALAAD +G L+V++ GD++Y FP N+
Sbjct: 6 QIMDSVDNLNYRVTVGDVAAQAGLNLNVAQNGLLALAADANGNLQVAESGDIVYCFPKNF 65
Query: 146 RAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAI-IAILSSKSDDDD 204
RA L K ++L+ + K Y IR+ FG LI+SIV++ AI I ++S +D
Sbjct: 66 RAVLRNKYWQLRWQQWWSKVWQVLFYLIRISFGIVLISSIVLMLLAISIMVISLSYSSND 125
Query: 205 RGRRRRSFDSG------FNIFISPSDLFWYWDPYY----YRRRRV---QTDDDDKKMNFI 251
+RR ++ G +NI+ L W+ P Y Y++R Q +DD +M+F+
Sbjct: 126 NNQRRDNYRGGGGFISFYNIWSFSHFLRWF-QPNYSSRNYQQRNYQPRQNSNDDNEMSFL 184
Query: 252 KSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPV 311
++V+SF+FG+G+PN +EE R IG+ I N G + AE++AP+ ID+T E ++LP+
Sbjct: 185 EAVYSFLFGDGNPNADLEENRNANIGQLIKRNKGSIVAEQVAPF--IDKTDDSEDFMLPI 242
Query: 312 LLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEF 371
L+RF+G PE+ EG ++Y FP Q +A ++ + E+ W F
Sbjct: 243 LIRFNGYPEVSNEGGLIYYFPELQVSATQ--------------NSNISLPDFLEEQLWRF 288
Query: 372 SKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
S+ + S+ +AIGLG +NL ++LG++LQ + F+ FV+ I+ +L
Sbjct: 289 SQASRSKILLAIGLGAVNLILALMLGSLLQ--YEVSSAFVAFVSSIYGIL 336
>gi|307155143|ref|YP_003890527.1| hypothetical protein Cyan7822_5373 [Cyanothece sp. PCC 7822]
gi|306985371|gb|ADN17252.1| conserved hypothetical protein [Cyanothece sp. PCC 7822]
Length = 451
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 141/348 (40%), Positives = 212/348 (60%), Gaps = 16/348 (4%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA +AGL++N AQ+ L ALA+D G L+VSD G+++Y+FP N+R+
Sbjct: 8 MKSVEQLGYRVTVGDVASQAGLEINLAQQGLLALASDAGGHLQVSDTGEIVYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ +K Y IR+ FG LI SI+++ AI I+ + S D
Sbjct: 68 ILQNKYWKLRLKQWWEKVWKVLFYLIRISFGIVLIVSILLMMIAIAVIVIAISSSRDNDN 127
Query: 208 RRRSFDSGFNIFISP----SDLFWYWDP-----YYYRRRR--VQTDDDDKKMNFIKSVFS 256
S +SG I P SD+F+ + P YY R+RR + KKMNF+++VFS
Sbjct: 128 NSNSGNSGGGISFFPVFWFSDIFYVFTPDYEGNYYERQRRRANSAESSQKKMNFLEAVFS 187
Query: 257 FVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDR-TMSDESYVLPVLLR 314
F+FG+G+PN +EE+RW+ IG I +NGG V AE++APYLD IDR +E Y++PVL R
Sbjct: 188 FLFGDGNPNFNLEERRWQEIGTVIRNNGGSVIAEQIAPYLDNIDRFNQENEDYIIPVLAR 247
Query: 315 FDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKT 374
F+G P++ EG+I+Y FP Q TA +Q ++ R D++ V +EK W FS+
Sbjct: 248 FNGYPQVSPEGDIIYYFPELQVTAKNQTENFRQNNSSR-MDSL-PVASYLKEKLWRFSEA 305
Query: 375 NMSERGMAIGLGGLNLFGVIILGAMLQEMAVTP-NGFLKFVAYIFPLL 421
+ +A GLG N+ ++LG++L+E VT G + FV I+ +L
Sbjct: 306 ESGQILLATGLGAANIILALVLGSLLKEQIVTQLGGLVAFVHSIYWIL 353
>gi|218439325|ref|YP_002377654.1| hypothetical protein PCC7424_2365 [Cyanothece sp. PCC 7424]
gi|218172053|gb|ACK70786.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 447
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 212/351 (60%), Gaps = 26/351 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA +AGL++N AQ+ L ALA+D G L+VSD G+++Y+FP N+RA
Sbjct: 8 MKSVEQLGYRVTVGDVATQAGLEINLAQQGLLALASDAGGHLQVSDTGEIVYLFPQNFRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL---SSKSDDDD 204
L K ++L+ + +K Y IR+ FG LI SI+++ AI+ IL SS D D+
Sbjct: 68 ILQNKFWKLRFKQWWEKVWKVLFYLIRISFGIILILSILLMMIAIVVILIAMSSSRDGDN 127
Query: 205 RGRRRRSFDSGFNIFISP----SDLFWYWDP----YYYRRRRVQT---DDDDKKMNFIKS 253
R S SG I P SD+F+ + P +YY R+R ++ + KKMNF+++
Sbjct: 128 DSRSNYSGGSGGGITFFPYFWFSDIFYVFSPDYDAHYYERQRRKSNRGESSQKKMNFLEA 187
Query: 254 VFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD-ESYVLPV 311
VFSF+FG+G+PN +EE+RW+ IGE I +NGG V AE++APYLD ID T + E Y++PV
Sbjct: 188 VFSFLFGDGNPNLNLEERRWQEIGEVIRNNGGAVIAEQIAPYLDNIDWTNEENEDYIIPV 247
Query: 312 LLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEF 371
L F+G P++ +G I+Y FP Q TA +Q + + + V +EK W F
Sbjct: 248 LAHFNGYPQVSPQGEIIYYFPELQVTAKNQSRAKIQ---------LTPVPAYLKEKPWRF 298
Query: 372 SKTNMSERGMAIGLGGLNLFGVIILGAMLQ-EMAVTPNGFLKFVAYIFPLL 421
S+ + +AIGLG N+ ++LG++L+ +A G + FV I+ +L
Sbjct: 299 SEAESGQVMLAIGLGAANIVLALVLGSLLRGGVAAQLGGLVAFVNSIYGIL 349
>gi|172036335|ref|YP_001802836.1| hypothetical protein cce_1420 [Cyanothece sp. ATCC 51142]
gi|354553124|ref|ZP_08972431.1| hypothetical protein Cy51472DRAFT_1227 [Cyanothece sp. ATCC 51472]
gi|171697789|gb|ACB50770.1| hypothetical protein cce_1420 [Cyanothece sp. ATCC 51142]
gi|353554954|gb|EHC24343.1| hypothetical protein Cy51472DRAFT_1227 [Cyanothece sp. ATCC 51472]
Length = 440
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 209/350 (59%), Gaps = 33/350 (9%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA K+GL +N AQ+ L ALA+D G L+V++ GD++Y+FP N+R
Sbjct: 8 MKSVEQLGYRVTVGDVAAKSGLNINLAQQGLLALASDVSGHLQVAESGDIIYLFPENFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKS-----DD 202
L K ++L+++ DK Y IR+ FG LIASI+++ AI I+ S ++
Sbjct: 68 ILRNKYWKLQLKETWDKIWKVLFYIIRISFGIILIASIILMLIAITVIIIGISSSRDGEN 127
Query: 203 DDRGRRRRSFDSGFNIFISPSDLFWYWDP-YYYRR----RRVQTDDDDKKMNFIKSVFSF 257
+ R GF F + DLFW + P Y YRR RR QT D + +MNF++++FSF
Sbjct: 128 NSSSGGRSYRGGGFFFFPNFGDLFWIFYPDYGYRRYGSSRRHQTRDSN-EMNFLEAIFSF 186
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDR-TMSDESYVLPVLLRF 315
+FG+GDPN +EE+RWK IG I +N G + AE++APYLD IDR +E Y+LPVL RF
Sbjct: 187 LFGDGDPNYNLEERRWKTIGNVIRNNKGSIIAEQVAPYLDNIDRYNQENEDYILPVLTRF 246
Query: 316 DGQPEIDEEGNILYRFPSFQRTA--ASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
+G PE+ +G ++Y FP Q T +SQ+ + REK ++FS+
Sbjct: 247 NGTPEVSPQGELIYYFPELQVTVQESSQK----------------SIASYLREKLYKFSE 290
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
+ S+ +AIGLG +N ++LG+ L++ ++ GF+ F+ I+ LL
Sbjct: 291 ASSSQVMLAIGLGAINFILALVLGSFLRDPSLVAQFGGFIAFINSIYWLL 340
>gi|302851833|ref|XP_002957439.1| hypothetical protein VOLCADRAFT_98565 [Volvox carteri f.
nagariensis]
gi|300257243|gb|EFJ41494.1| hypothetical protein VOLCADRAFT_98565 [Volvox carteri f.
nagariensis]
Length = 516
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 134/331 (40%), Positives = 197/331 (59%), Gaps = 20/331 (6%)
Query: 74 VESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSD 133
++S+KL D+R R AV+ RVT+GDVA +AG+KL +A +AL+ALA DT L+VS
Sbjct: 10 LQSNKLHPDLRERVEAAVEQLGGRVTVGDVAARAGVKLADADEALKALAYDTQASLKVSS 69
Query: 134 EGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAII 193
EG V+Y F +++++L +S + P++ + Y +RV FGTALIAS+ +V+ AI
Sbjct: 70 EGAVVYGFAPDFQSRLRNRSVLIAAAPLLRRTVGVLSYLVRVAFGTALIASVALVWLAIA 129
Query: 194 AILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDD-DDKKMNFIK 252
+LSS++DD D RR F F+ P DLF YWDPYY R+R ++ + MNF++
Sbjct: 130 VLLSSRNDDRDNRRRGGGGGVTF--FMDPVDLFLYWDPYYERKRAARSAELRTGGMNFME 187
Query: 253 SVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMS--------- 303
+VFSFVFG+GDPN+ EE+RW+ +G I GGV+TAEE+AP+L+
Sbjct: 188 AVFSFVFGDGDPNESYEERRWQELGAMIRKKGGVLTAEEMAPFLEPPAPAPVPPAYSKEP 247
Query: 304 -----DESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIG 358
DES+VLP L++F G+ E+DE+G+ILY FP+ QRT Q+ ++ + R
Sbjct: 248 YIPYPDESFVLPALIKFGGEAEVDEQGHILYCFPALQRTGVQQQQQKRWRLRNRTESTSW 307
Query: 359 GVEKIFREKKWEFSKTNMSERGMAIGLGGLN 389
V E+ WE + + + LG N
Sbjct: 308 NVPL---ERTWELTAATPGQIAGVVTLGLFN 335
>gi|427706435|ref|YP_007048812.1| hypothetical protein Nos7107_1002 [Nostoc sp. PCC 7107]
gi|427358940|gb|AFY41662.1| hypothetical protein Nos7107_1002 [Nostoc sp. PCC 7107]
Length = 430
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 197/342 (57%), Gaps = 25/342 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVTIGDVA +AGL + EA + L ALAAD G L+V++ GD++Y+FP N+RA
Sbjct: 8 MKAVEQLGYRVTIGDVATQAGLNVAEAGQGLLALAADAGGHLQVAETGDIIYLFPPNFRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K F+L+++ K Y IR+ FG L+ SI ++ +I+ I+++ + D D
Sbjct: 68 ILRNKYFQLRLQEWWQKVWGILFYLIRISFGIFLVLSIALITISIVIIITAANSDRDNDN 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKK---MNFIKSVFSFVFGEGDP 264
R S GF F DLFWY+ P YY Q D ++ MNF ++VFSF+FG+G+P
Sbjct: 128 RGSSRSGGFFFF---PDLFWYFSPNYYDTSYQQRRKDSRQGSEMNFFEAVFSFLFGDGNP 184
Query: 265 NQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQPEI 321
N +EE+RW+ I I + G V AE++APYLD T E Y+LPVL+RF+GQP++
Sbjct: 185 NAKLEERRWQEIATVIRNQRGAVVAEQIAPYLDDLGPGYTQEYEDYMLPVLIRFNGQPKV 244
Query: 322 DEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
EG I+Y FP Q A++Q GR+ V E W FS+ + + +
Sbjct: 245 SSEGQIVYYFPELQVKASNQ--GRQS------------VPVYLEELPWRFSRADSGQIML 290
Query: 382 AIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
+ GLG LN G ++LG +L++ +A G + FV I+ LL
Sbjct: 291 SAGLGVLNFVGALVLGNLLRDGIVAAQLGGLVAFVQGIYWLL 332
>gi|413937128|gb|AFW71679.1| hypothetical protein ZEAMMB73_735454 [Zea mays]
Length = 256
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 122/187 (65%), Positives = 152/187 (81%), Gaps = 6/187 (3%)
Query: 68 VGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDG 127
V PG VE+D+LP+DVR+RAM+AVD RVTIGDVA +AGL+L +A++ALQALAADT+G
Sbjct: 75 VRPGGAVETDRLPSDVRDRAMEAVDHFGGRVTIGDVASRAGLQLAQAERALQALAADTEG 134
Query: 128 FLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVI 187
FLEVS++G+VLYVFP +YRAKLA KSFR++VEP++DKAK Y +RV FGTALIASIV+
Sbjct: 135 FLEVSEDGEVLYVFPKDYRAKLAGKSFRMRVEPLVDKAKQVGAYLVRVSFGTALIASIVL 194
Query: 188 VFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKK 247
V+T IIAILSS SD+D RGRRRRS+ S I P+D+FWY D YYRRRRV+ ++
Sbjct: 195 VYTTIIAILSSSSDEDGRGRRRRSYGS---TIIIPTDMFWYLDADYYRRRRVE---NENG 248
Query: 248 MNFIKSV 254
MNFI+SV
Sbjct: 249 MNFIESV 255
>gi|332711929|ref|ZP_08431859.1| hypothetical protein LYNGBM3L_68080 [Moorea producens 3L]
gi|332349257|gb|EGJ28867.1| hypothetical protein LYNGBM3L_68080 [Moorea producens 3L]
Length = 456
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 203/358 (56%), Gaps = 39/358 (10%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ + RVT+GDVA + GL +++A++ L LA++ G L+V++ GD+ Y+FP N+RA
Sbjct: 8 MKAVEQLDYRVTVGDVAAQVGLNIHQAEQGLLVLASEVGGHLQVAESGDIAYLFPKNFRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAI-IAILSSKSDDDDRG 206
L K +L+++ +K Y IR+ FG L+ SIV++ AI +LS K+ D+
Sbjct: 68 ILRNKFLKLRLQAWWEKVWGVLFYLIRISFGIFLVLSIVLMIVAIAFIVLSLKAGSDNDS 127
Query: 207 RRRRSFDSGF-----------NIFISPSDLFWYWDPYYYR-------RRRVQTDDDDKKM 248
+ +I P DL+ +++P Y R DDD +M
Sbjct: 128 GGGGGGGNRRSGGGGGILFLPRFWIGP-DLYRWFNPSYSHSNYRSRRRNSRYNQDDDYEM 186
Query: 249 NFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL-DIDRTMSDES- 306
NF+++VFSF+FG+G+PN +EE+RW+ IG I ++GG V AE++APYL D+ + ES
Sbjct: 187 NFLEAVFSFLFGDGNPNSDLEERRWQTIGAVIRNSGGAVAAEQIAPYLDDLGESYQRESE 246
Query: 307 -YVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFR 365
Y+LPVL+RFDG+PE+ G+I+Y FP Q TAA Q V R
Sbjct: 247 DYMLPVLIRFDGRPEVSPSGDIVYHFPKLQTTAAKQN--------------QQPVPAYLR 292
Query: 366 EKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQ--EMAVTPNGFLKFVAYIFPLL 421
K WEFS+ + +AIGLG +N ++LG++LQ E+A GF+ FV I+PLL
Sbjct: 293 AKIWEFSQARSEQIMLAIGLGSVNFVLALVLGSLLQGGEIAAQLGGFVLFVELIYPLL 350
>gi|119492348|ref|ZP_01623684.1| hypothetical protein L8106_28960 [Lyngbya sp. PCC 8106]
gi|119453128|gb|EAW34296.1| hypothetical protein L8106_28960 [Lyngbya sp. PCC 8106]
Length = 432
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 131/347 (37%), Positives = 198/347 (57%), Gaps = 27/347 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA +AG+ +N AQ+ L ALA+D G L+V++ G++ + F ++R
Sbjct: 1 MKSVEQLGYRVTVGDVAAQAGIDVNVAQQQLLALASDAGGHLQVAESGEIAFQFSQDFRT 60
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K +RLK++ +K Y IR+ FG L+ASIV++F +I IL + + D
Sbjct: 61 VLRNKFWRLKLQEWWEKVWRVLFYIIRISFGIFLLASIVLIFVSIAIILIAMNSSRDGDD 120
Query: 208 RRRSFDSGFNIFISPSDLF---WYW------DPYYYRRRRVQTDDDDKKMNFIKSVFSFV 258
S DSG P F WYW D YY++RR+ + D + NF+++VFSF+
Sbjct: 121 SGGSSDSGGGFIFVPRLWFGSNWYWFLYWNDDDYYHQRRQRSANPKDNQYNFLEAVFSFL 180
Query: 259 FGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD-ESYVLPVLLRFD 316
FG+G+PN +EE+RWK I I +N G + AE++APYLD I R + E Y+LPVL RF+
Sbjct: 181 FGDGNPNANLEERRWKTIATLIRNNQGAIIAEQVAPYLDEISRQEEEYEDYMLPVLTRFN 240
Query: 317 GQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNM 376
G P++ +G ++Y FP Q TA R + +E+ W+FS+ +
Sbjct: 241 GYPKVSPQGELVYHFPDLQTTANQYR--------------PQAIVSSLKEQLWKFSQASS 286
Query: 377 SERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
+ +A GLG N+ G IIL ++LQ+ + + G + FV IFPLL
Sbjct: 287 GQLMLAAGLGVANIVGAIILASLLQDQNLVQSLGGLVAFVDVIFPLL 333
>gi|17229098|ref|NP_485646.1| hypothetical protein all1606 [Nostoc sp. PCC 7120]
gi|17135426|dbj|BAB77972.1| all1606 [Nostoc sp. PCC 7120]
Length = 431
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 194/345 (56%), Gaps = 30/345 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA +AGL L EA + L ALA+D G L+V++ GD++Y+FP N+R
Sbjct: 8 MQAVEKLGYRVTVGDVATQAGLNLAEAGQGLLALASDAGGHLQVAETGDIVYLFPRNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K F+L+++ K Y IR+ FG L+ASI ++ AI I+++ + +D
Sbjct: 68 VLRNKYFKLRLQEWWQKIWKILFYLIRISFGIFLVASIALITIAIFLIITAMNSSNDNDD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY-----YRRRRVQTDDDDKKMNFIKSVFSFVFGEG 262
R + + F P DLFWY+ P Y R+R Q D +NF ++VFSF+FG+G
Sbjct: 128 RSSNSSGSWGFFYFP-DLFWYFSPNYGYSAPERQRHSQESSD---LNFFEAVFSFLFGDG 183
Query: 263 DPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL-DIDRTMSD--ESYVLPVLLRFDGQP 319
+PN +EE+RW+ I I S+ G V AE++APYL DI E Y+LPVLLRF+GQP
Sbjct: 184 NPNANLEERRWQEIATVIRSSRGAVVAEQIAPYLNDIGEVYQQEYEDYMLPVLLRFNGQP 243
Query: 320 EIDEEGNILYRFPSFQ-RTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSE 378
+ EG I+Y FP Q R + Q+ Y+ E W FS + +
Sbjct: 244 SVSPEGQIVYYFPELQVRASKKQQQPLSVYL---------------EEFPWRFSAASSGQ 288
Query: 379 RGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ GLG +NL G ++LG++L + +A G + FV I+ LL
Sbjct: 289 IMLSAGLGIVNLVGALVLGSLLVDGTVAAQLGGLVAFVQGIYWLL 333
>gi|434403756|ref|YP_007146641.1| hypothetical protein Cylst_1689 [Cylindrospermum stagnale PCC 7417]
gi|428258011|gb|AFZ23961.1| hypothetical protein Cylst_1689 [Cylindrospermum stagnale PCC 7417]
Length = 432
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 132/343 (38%), Positives = 194/343 (56%), Gaps = 26/343 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA AGL + EA + L ALA+D G L+V++ GD++Y+FP N+R
Sbjct: 8 MRSVEQLGYRVTVGDVASFAGLNVAEASQGLLALASDAGGHLQVAESGDIVYLFPQNFRG 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K +L+++ K Y IR+ FG LI SI ++ I+ I+++ + D D G
Sbjct: 68 ILRNKYLQLRLQEWWKKVWGVLFYLIRISFGVFLIVSIALITVTIMVIITAINSDRD-GD 126
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY----YRRRRVQTDDDDKKMNFIKSVFSFVFGEGD 263
R S G F DLFWY+ P Y RRR + ++ D +NF ++VFSF+FG+G+
Sbjct: 127 NRSSNSGGGGGFFFFPDLFWYFSPGYETNNQERRRERGENSD--LNFFEAVFSFLFGDGN 184
Query: 264 PNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQPE 320
PN +EE+RW+ IG I +N G V AE++APYLD E Y+LPVL+RF+GQP
Sbjct: 185 PNANLEERRWQEIGTVIRNNRGAVVAEQIAPYLDHIGEKYQQEYEDYMLPVLIRFNGQPT 244
Query: 321 IDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERG 380
+ EG I+Y FP Q +A +R R+ + E W FS + +
Sbjct: 245 VSPEGQIVYYFPELQVSATKKR--RQ------------AIATYLEEFPWRFSAASSGQVL 290
Query: 381 MAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
++ GLG LN G ++LG++L++ AV G + FV I+ LL
Sbjct: 291 LSAGLGVLNFVGALVLGSLLRDGAVAAQLGGLVAFVQGIYWLL 333
>gi|75910417|ref|YP_324713.1| hypothetical protein Ava_4219 [Anabaena variabilis ATCC 29413]
gi|75704142|gb|ABA23818.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 431
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 134/345 (38%), Positives = 193/345 (55%), Gaps = 30/345 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA +AGL L EA + L ALA+D G L+V++ GD++Y+FP N+R
Sbjct: 8 MQAVEKLGYRVTVGDVATQAGLNLAEAGQGLLALASDAGGHLQVAETGDIVYLFPRNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K F+L+++ K Y IR+ FG LIASI ++ AI I+++ + +D
Sbjct: 68 VLRNKYFKLRLQEWWQKIWKILFYLIRISFGIFLIASIALITIAIFLIITAMNSSNDNDD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY-----YRRRRVQTDDDDKKMNFIKSVFSFVFGEG 262
R + + F P DLFWY++P Y RRR Q D +NF ++VFSF+FG+G
Sbjct: 128 RSSNSSGSWGFFYFP-DLFWYFNPNYGYSAPERRRSSQESSD---LNFFEAVFSFLFGDG 183
Query: 263 DPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL-DIDRTMSD--ESYVLPVLLRFDGQP 319
+PN +EE+RW+ I I + G V AE++APYL DI E Y+LPVLLRF+GQP
Sbjct: 184 NPNANLEERRWQEIATVIRGSRGAVVAEQIAPYLDDIGEVYQQEYEDYMLPVLLRFNGQP 243
Query: 320 EIDEEGNILYRFPSFQ-RTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSE 378
+ EG I+Y FP Q R + Q+ Y+ E W FS + +
Sbjct: 244 SVSPEGQIVYYFPELQVRASKKQQQPLSVYL---------------EEFPWRFSAASSGQ 288
Query: 379 RGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ GLG +NL G ++LG +L + +A G + FV I+ LL
Sbjct: 289 VMLSAGLGIVNLVGALVLGNLLVDGTVAAQLGGLVAFVQGIYWLL 333
>gi|428315604|ref|YP_007113486.1| hypothetical protein Osc7112_0463 [Oscillatoria nigro-viridis PCC
7112]
gi|428239284|gb|AFZ05070.1| hypothetical protein Osc7112_0463 [Oscillatoria nigro-viridis PCC
7112]
Length = 438
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 134/348 (38%), Positives = 195/348 (56%), Gaps = 31/348 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA KAGL +N AQ+ L LA++ G L+V++ GD+ Y+FP N+R
Sbjct: 8 MQAVEQLGYRVTVGDVAAKAGLDVNFAQRELLTLASEAGGNLQVAESGDIAYLFPKNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILS-----SKSDD 202
L K RL+++ K Y IR+ FG L+ASI+++F AI +LS +
Sbjct: 68 ILRNKFLRLQLQEWWQKIWRVLFYLIRISFGIVLVASILLIFVAITILLSSGDSNNGGGG 127
Query: 203 DDRGRRRRSFDSGFNIF-ISPSDLFW--YW---DPYYYRRRRVQTDDDDKKMNFIKSVFS 256
G GF+ F +DL W YW +PYY +R R+ D +M+F+++VFS
Sbjct: 128 GGDGGGGGGRGGGFSFFPYFWNDLIWIFYWNHDEPYYQQRSRL--TDQKPQMSFLEAVFS 185
Query: 257 FVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLL 313
F+FG+G+PN +EE++W I I +N G V AE++APYLD + E Y+LP L
Sbjct: 186 FLFGDGNPNHNLEERKWSDIATAIRNNRGAVAAEQIAPYLDNLGQGYSREYEQYMLPALA 245
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RFDG+PE+ EG I+Y FP Q T A++R V RE W FS
Sbjct: 246 RFDGRPEVSPEGQIVYHFPQLQ-TTATERNSEP-------------VAAYLREMLWRFSN 291
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
+ + +A GLG +N+ G ++LG +L A+ GF+ FV+ I+P+L
Sbjct: 292 ASSGQIMLAAGLGAVNIVGALVLGNLLSNSAIA-GGFIGFVSAIYPIL 338
>gi|282897046|ref|ZP_06305048.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281197698|gb|EFA72592.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 423
Score = 221 bits (562), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 202/341 (59%), Gaps = 23/341 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M A++ RVT+GDVA ++GL L EA + + LAAD G L+V+D GD++Y FP N+R
Sbjct: 8 MGAIEKLGYRVTVGDVAARSGLGLAEANEGMLVLAADAGGHLQVADTGDIVYQFPQNFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K +L++ + Y IR+ FG LIASIVI+ II I+++ S + D
Sbjct: 68 ILRNKYVQLRLREWWARIWQVLFYIIRISFGVLLIASIVIITLTIIIIITASSSNRDEDN 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYYY-RRRRVQTDDD--DKKMNFIKSVFSFVFGEGDP 264
R F GF+IF P DLFWY+ P YY + R+++ ++ + ++NF+++VFSF+FG+G+P
Sbjct: 128 RDGGF-RGFDIFFFP-DLFWYFSPNYYSQERQIERKENRGNGELNFLEAVFSFLFGDGNP 185
Query: 265 NQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQPEI 321
N +EE+RW+ IG I +N G V AE++APYLD E Y+LPVL+R++G+P++
Sbjct: 186 NSKLEERRWQQIGTVITNNQGAVVAEQIAPYLDNLGEKYQQEYEDYMLPVLVRYNGKPQV 245
Query: 322 DEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
+G I+Y FP Q AS++I V+ E W+FS + + M
Sbjct: 246 SPDGQIVYYFPELQ-VQASKKIDEP-------------VDLFLEENPWQFSAASSGQIMM 291
Query: 382 AIGLGGLNLFGVIILGAML-QEMAVTPNGFLKFVAYIFPLL 421
+ GLG LNL G ++LG +L Q + + G + FV I+ LL
Sbjct: 292 SAGLGVLNLVGALVLGNLLTQTVGLEAGGLVGFVQSIYWLL 332
>gi|416391537|ref|ZP_11685703.1| hypothetical protein CWATWH0003_2517 [Crocosphaera watsonii WH
0003]
gi|357263817|gb|EHJ12777.1| hypothetical protein CWATWH0003_2517 [Crocosphaera watsonii WH
0003]
Length = 442
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 135/350 (38%), Positives = 202/350 (57%), Gaps = 30/350 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA K+GL +N AQ+ L ALA+D G L+V++ GD++Y+FP+N+R
Sbjct: 8 MQSVEQLGYRVTVGDVAAKSGLDINLAQRGLLALASDVSGHLQVAESGDIIYLFPDNFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAI---LSSKSDDDD 204
L K ++L+++ +K Y IR+ FG L+ASI+++ AI I L+S D +
Sbjct: 68 ILRNKYWKLQLKETWEKIWKVLFYIIRISFGIVLVASIILMLIAITVIIIGLNSSRDGNS 127
Query: 205 RG---RRRRSFDSGFNIFISPSDLFWYWDPYY-YRR-----RRVQTDDDDKKMNFIKSVF 255
R GF F + DLFW + P Y Y R R D +MNF+++VF
Sbjct: 128 NNSGSRGGSYRGGGFFFFPNFGDLFWIFYPNYGYNRYDHSSSRRSQSRDANEMNFLEAVF 187
Query: 256 SFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDR-TMSDESYVLPVLL 313
SF+FG+G+PN +EE+RWK IG I +N G + AE++APYLD IDR +E Y+LPVL
Sbjct: 188 SFLFGDGNPNYNLEERRWKAIGTVIKNNKGSIIAEQVAPYLDNIDRYNKENEDYILPVLT 247
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RF+G PE+ G ++Y FP Q T Q +K + RE+ ++FS+
Sbjct: 248 RFNGNPEVSPNGELIYHFPELQVTV--QESTQK------------SISTYLRERLYKFSE 293
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
++ +AIGLG LN +ILG+ L++ ++ GF+ F+ I+ LL
Sbjct: 294 AGSNKIMLAIGLGALNFILALILGSFLKDPSIVAQFGGFIAFINSIYWLL 343
>gi|170079211|ref|YP_001735849.1| hypothetical protein SYNPCC7002_A2617 [Synechococcus sp. PCC 7002]
gi|169886880|gb|ACB00594.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 440
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 129/350 (36%), Positives = 199/350 (56%), Gaps = 31/350 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ R T+G+VA +AGL+LN AQ L LAAD G L+V++ GDV++ FP N+R
Sbjct: 8 MRAVETLQYRATVGEVATQAGLELNVAQNGLYNLAADAGGHLQVAETGDVVFEFPKNFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDD---D 204
L K F+++++ +DK + IR+ FG ALI SI+++ AI+A++++ S D
Sbjct: 68 ILRNKYFKIRLQEWLDKTWRVVFFLIRISFGIALILSILLMTIAILALVTAASSQDNNNS 127
Query: 205 RGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRR---------VQTDDDDKKMNFIKSVF 255
RR SF+ + ++ P+ F + P YY RR ++ + MNF+++VF
Sbjct: 128 SRRRSSSFNLPWLVWWGPNP-FRVFSPNYYGPRRYGSSATLKNTTSNTEKGGMNFLEAVF 186
Query: 256 SFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD--IDRTMSDESYVLPVLL 313
SF+FG+GDPN +EE+RW+ IG+ I +N G V AE+LAPY D ++ESY+LPVL
Sbjct: 187 SFLFGDGDPNPNLEERRWQTIGQVIQNNDGAVIAEQLAPYFDELSPTEKNEESYMLPVLA 246
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RF+G PE+ G+++Y FP Q A +R V E +W F+
Sbjct: 247 RFNGYPEVSPTGDLVYYFPELQVKAQERR--------------QKSVPNYLEEHRWTFTL 292
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
++ +AI LGG+NL ++LG +LQ+ A+ G + ++PLL
Sbjct: 293 APTGQKFLAIALGGVNLVLALMLGVLLQDSALVAELGGLVGLAQALYPLL 342
>gi|159484054|ref|XP_001700075.1| predicted protein [Chlamydomonas reinhardtii]
gi|158272571|gb|EDO98369.1| predicted protein [Chlamydomonas reinhardtii]
Length = 409
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 136/356 (38%), Positives = 211/356 (59%), Gaps = 37/356 (10%)
Query: 74 VESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSD 133
++S++L D+R R A++ RVT+GDVA +AG+KL +A +AL+ALA DT L+VS
Sbjct: 28 LQSNRLEPDLRERVESAIERLGGRVTVGDVAARAGVKLAQADEALKALAYDTAAALQVSA 87
Query: 134 EGDVLYVFPNNYRAKLAAKSFRLK-VEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAI 192
GD++Y F ++R++L ++S + + P+ + A Y RV FGTAL+AS+V+V+ A+
Sbjct: 88 SGDLVYAFAPDFRSRLRSRSLLVSTLLPLGRRLGGALSYLARVAFGTALVASVVVVWLAV 147
Query: 193 IAILSSKSDDDDRGRRRRSFDSGFN----IFISPSDLFWYWDPYYYRRRRVQTDDDDKKM 248
+A+L + DD D R G +F+ +DLF YWDPYYY+ Q + +++
Sbjct: 148 MALLRGRGDDRDDRGYRGGGFGGGYYSGRMFMDVTDLFLYWDPYYYQNT-AQRVANGEQL 206
Query: 249 NFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD----------- 297
+F++S+FSFVFG+GDPN EE+RW+ +GE I + GGVVTAEE+APYLD
Sbjct: 207 SFVESIFSFVFGDGDPNADFEERRWQRLGEMIRAKGGVVTAEEMAPYLDPPEPADPPGRP 266
Query: 298 ------IDRT---MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEY 348
+ T DE++VLP L++F G+P +DE G ILYRFP+ Q T
Sbjct: 267 LYDSSGMTETYIPYPDENFVLPALIKFGGEPSVDEAGRILYRFPALQLTGVK-------- 318
Query: 349 VGRRWADAIGGVE--KIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQE 402
G+R A+ ++ E+ W+F+ + + + LG LNL GV +L +++ +
Sbjct: 319 -GKRPANRFSPQSSFEVPMERDWQFTAASGGQVAGTVFLGLLNLVGVAVLSSLMAD 373
>gi|414076942|ref|YP_006996260.1| hypothetical protein ANA_C11684 [Anabaena sp. 90]
gi|413970358|gb|AFW94447.1| hypothetical protein ANA_C11684 [Anabaena sp. 90]
Length = 427
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 202/343 (58%), Gaps = 28/343 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA + GL + E ++L ALAAD G L+V++ GDV+Y FP N+R
Sbjct: 8 MRSVEQLGYRVTVGDVATQIGLNIAEVNQSLLALAADAGGHLQVAESGDVVYQFPKNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
+ K +++++ K + Y IR+ FG L+ASI+++ II I+++ SD D+ R
Sbjct: 68 IIRNKYLQIRLQEWWKKVWSVLFYLIRISFGVLLVASILLITLTIIIIMTASSDRDNDNR 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY---YRRRRVQTDDDDKKMNFIKSVFSFVFGEGDP 264
S GFN F P DLFWY+ P + Y+ RR + +++ +MNF ++VFSF+FG+GDP
Sbjct: 128 SNNS--KGFNFFFFP-DLFWYFSPNHRDSYQERRGERGENN-EMNFFEAVFSFLFGDGDP 183
Query: 265 NQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLLRFDGQPEI 321
N +EE+RW+ IG I++N G V AE++APYLD E Y+LPVL+RF+G P++
Sbjct: 184 NANLEERRWEEIGAVISNNQGAVVAEQIAPYLDDIGAKYQQEYEDYMLPVLVRFNGMPQV 243
Query: 322 DEEGNILYRFPSFQ-RTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERG 380
+G I+Y FP Q R + QR EY+ E W+FS + +
Sbjct: 244 SSDGQIVYYFPELQVRASKKQRRSISEYL---------------HEFSWKFSSASSGQLM 288
Query: 381 MAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ GLGG+N G +ILG +L+ +A G + FV I+ LL
Sbjct: 289 LSAGLGGVNFVGALILGNLLKNGTIATQIGGLVAFVQGIYGLL 331
>gi|427732389|ref|YP_007078626.1| hypothetical protein Nos7524_5306 [Nostoc sp. PCC 7524]
gi|427368308|gb|AFY51029.1| hypothetical protein Nos7524_5306 [Nostoc sp. PCC 7524]
Length = 431
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 139/344 (40%), Positives = 202/344 (58%), Gaps = 28/344 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA +AGL + EA ++L ALA D G L+V++ GDV+Y+FP N+R
Sbjct: 8 MQAVEKLGYRVTVGDVATQAGLNIAEANQSLLALATDAGGHLQVAETGDVVYLFPRNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAI---LSSKSDDDD 204
L K +L+++ K Y IRV FG LIASI+++ II I ++S SD+DD
Sbjct: 68 ILRNKYLQLQLQEWWQKIWGVLFYLIRVSFGILLIASIILITITIIVIITAMNSSSDNDD 127
Query: 205 RGRRRRSFDSGFNIFISPSDLFWYWDPYY--YRRRRVQTDDDDKKMNFIKSVFSFVFGEG 262
RRS SG FI DLFWY++P Y Y ++R + +D +MNF+++VFSF+FG+G
Sbjct: 128 ----RRSHGSGGWGFIYFPDLFWYFNPNYQNYPQQRRRGRQEDSQMNFLEAVFSFLFGDG 183
Query: 263 DPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLLRFDGQP 319
+PN +EE+RW+ + I +N G V AE+++PYLD E Y+LPVLL+F+GQP
Sbjct: 184 NPNANLEERRWQEVAAVIRANRGAVVAEQISPYLDDIGAAYQQEYEDYMLPVLLKFNGQP 243
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ EG I+Y FP Q A+S+R V E W FS + +
Sbjct: 244 KVSPEGQIVYYFPELQVRASSKR--------------HESVPVYLEEVPWRFSAASSGQI 289
Query: 380 GMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ GLG NL G ++LG++L + +A G + FV I+ LL
Sbjct: 290 MLSAGLGVANLVGALVLGSLLADGTVAAELGGLVAFVQGIYWLL 333
>gi|354564624|ref|ZP_08983800.1| hypothetical protein FJSC11DRAFT_0006 [Fischerella sp. JSC-11]
gi|353549750|gb|EHC19189.1| hypothetical protein FJSC11DRAFT_0006 [Fischerella sp. JSC-11]
Length = 430
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 196/343 (57%), Gaps = 29/343 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA +AGL + A L ALA+D G ++V++ GD++Y+FP N+R
Sbjct: 8 MQAVERLGYRVTVGDVASQAGLNVELASAGLLALASDAGGHMQVAESGDIVYLFPRNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L +K RL+++ K + Y IR+ FG LIASI +++ I+ I+ S + D D G
Sbjct: 68 ILRSKHLRLQLQEWWRKIWSVLFYIIRISFGILLIASIALIYITILVIIMSANSDRDSGD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP----YYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGD 263
R F GF F +LFWY+ P YY R++ + D +NF+++VFSF+FG+G+
Sbjct: 128 RGSDFGGGFFYF---PNLFWYFSPNYDTYYPERQQERRQSD---LNFLEAVFSFLFGDGN 181
Query: 264 PNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL-DIDRTMSD--ESYVLPVLLRFDGQPE 320
PN +E++RW+ I I +N G V AE++APYL DI T E Y+LPVL RF+GQP+
Sbjct: 182 PNANLEKRRWRTIAAVIRNNKGAVVAEQIAPYLDDIGETYQQEYEDYMLPVLTRFNGQPK 241
Query: 321 IDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERG 380
+ EG I+Y FP Q +AA +R + E W FS + +
Sbjct: 242 VSPEGQIVYYFPDLQVSAAKKR--------------DRSISPYLEELPWRFSAASSGQIL 287
Query: 381 MAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ GLG +N+ G +ILG++L++ A G + F I+ LL
Sbjct: 288 LSAGLGVVNIVGALILGSLLRDGTAAAVLGGLVAFAQGIYWLL 330
>gi|443320931|ref|ZP_21050003.1| hypothetical protein GLO73106DRAFT_00028590 [Gloeocapsa sp. PCC
73106]
gi|442789354|gb|ELR99015.1| hypothetical protein GLO73106DRAFT_00028590 [Gloeocapsa sp. PCC
73106]
Length = 428
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 202/342 (59%), Gaps = 26/342 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M++V+ + RVT+GDVA KAGL++N A++ L ALAADT G L+V++ G+++Y+FP N+R
Sbjct: 8 MNSVETLDYRVTVGDVATKAGLEINLARQQLLALAADTGGNLQVTETGEIVYLFPRNFRG 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAI---LSSKSDDDD 204
L K ++L+ + K Y IR+ FG ALI S++++F AI I LS ++++
Sbjct: 68 ILRNKHWQLQWQQSWQKVWRVLFYLIRISFGIALILSVLLMFVAIAVIVIGLSVSGNNEE 127
Query: 205 RGRRRRSFDSGFNIFISPSDLFWYWDPYY----YRRRRVQTDDDDKKMNFIKSVFSFVFG 260
RR +SG + P FW+ ++ + R + +KK+NF++S+FSF+FG
Sbjct: 128 NSDRRD--NSGGGVLFLPR--FWFSPDFFGVFSWNRGYSPQSNPEKKINFLESIFSFLFG 183
Query: 261 EGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDGQPE 320
+G+PN +EE RW+ IG I + G + A+++APYLD +E Y+LPVL RF+G P+
Sbjct: 184 DGNPNSRLEEHRWQEIGTVIRNQKGAIVAQQIAPYLDNIADFEEEDYILPVLARFNGYPQ 243
Query: 321 IDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERG 380
+ + G I+Y FP Q TA ++ A++ V +EK W+FS+ +
Sbjct: 244 VSDTGEIIYYFPDLQVTATEKQ-----------AES---VSPYLKEKTWQFSQAGSGQII 289
Query: 381 MAIGLGGLNLFGVIILGAMLQ-EMAVTPNGFLKFVAYIFPLL 421
AI LG LN ++LG +LQ ++A F+ FVA I+ +L
Sbjct: 290 GAIALGCLNFILALVLGYLLQGDLATQLGEFVTFVASIYWVL 331
>gi|300863650|ref|ZP_07108589.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338358|emb|CBN53733.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 434
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 195/345 (56%), Gaps = 28/345 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA KAGL +N A++ L ALA+D G L+V++ G++ Y FP N+R
Sbjct: 8 MQAVEQLGYRVTVGDVAAKAGLDVNFARRELLALASDAGGNLQVAESGEIAYSFPQNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSS--KSDDDDR 205
L K FRL+V+ K Y IR+ FG L+AS++++F +I +LS+ SD+++
Sbjct: 68 ILRNKFFRLQVQEWWQKVWRILFYLIRISFGLLLLASLILIFVSITLLLSALNSSDNNNS 127
Query: 206 GRRRRSFDSGFNIFISPSDLFW--YW--DPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGE 261
G R F S +D W YW DPYY R R Q ++ +MNF++SVFSF+FG+
Sbjct: 128 GGREGGGFGAMPYFWS-NDWLWFFYWNSDPYYRRTR--QAAGEENQMNFLESVFSFIFGD 184
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQ 318
G+PN ++E RW+ I I +N G + AE++APYLD + E Y+LP L RFDG+
Sbjct: 185 GNPNYNLDEHRWQAIATVIRNNQGAIAAEQIAPYLDQFGKGYAVEYEEYMLPALARFDGR 244
Query: 319 PEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSE 378
PE+ EG I+Y FP Q TAA + V RE W FS+ + +
Sbjct: 245 PEVSPEGEIVYHFPELQTTAAQ--------------NHPQPVASYLRELLWRFSEASSGQ 290
Query: 379 RGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
+A GLG NL G +ILG +L + G + FV I+P+L
Sbjct: 291 ILLASGLGIANLGGALILGQLLASSPIVAKLGGLIIFVQAIYPIL 335
>gi|254411086|ref|ZP_05024864.1| hypothetical protein MC7420_564 [Coleofasciculus chthonoplastes PCC
7420]
gi|196182441|gb|EDX77427.1| hypothetical protein MC7420_564 [Coleofasciculus chthonoplastes PCC
7420]
Length = 441
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 140/352 (39%), Positives = 208/352 (59%), Gaps = 30/352 (8%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNY 145
+ M++V+ RVT+GDVA KAGL +N A++ L ALA+D G L+V++ G++ Y+FP N+
Sbjct: 6 KIMNSVEQLGYRVTVGDVAAKAGLNINLAERGLLALASDAGGHLQVAESGEIAYLFPKNF 65
Query: 146 RAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTA---IIAILSSKSDD 202
R L K RL+++ K Y IR+ FG L+ SIV++F A I+ +SS SD
Sbjct: 66 RTNLRNKFLRLRLQEWWSKIWRVLFYLIRISFGLLLLLSIVLIFLAISIIVISISSSSDS 125
Query: 203 DDRGRRRRSFDSGFNIFISPS-----DLFWYWDPYYYRRRRVQT---DDDDKKMNFIKSV 254
D G + G +F P D+FW++DP Y R R+ +T + K+MNF++S+
Sbjct: 126 DSGGGGNGGSNRGGGMFFMPRFWIGPDIFWFFDPSYERHRQRRTSWNSAESKQMNFLESI 185
Query: 255 FSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSD---ESYVLPV 311
FSF+FG+G+PN +EE+RW+ IG I ++GG VTAE++APYLD T E Y+LPV
Sbjct: 186 FSFLFGDGNPNADLEERRWQEIGTVIRNSGGAVTAEQIAPYLDDIGTGYQREYEDYMLPV 245
Query: 312 LLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEF 371
L RF+G+PE+ +G I+Y FP Q TA Q V +E+ W F
Sbjct: 246 LSRFNGRPEVSPDGEIIYHFPDLQTTAQQQHPQP--------------VPSYLQEQPWRF 291
Query: 372 SKTNMSERGMAIGLGGLNLFGVIILGAMLQ--EMAVTPNGFLKFVAYIFPLL 421
S ++ +AIGLGG+N+ G ++LG++L+ E+A G + FVA I+ L
Sbjct: 292 SAATSGQKILAIGLGGVNIVGALMLGSLLRGGEIAAQIGGLVAFVASIYWFL 343
>gi|428771289|ref|YP_007163079.1| hypothetical protein Cyan10605_2975 [Cyanobacterium aponinum PCC
10605]
gi|428685568|gb|AFZ55035.1| hypothetical protein Cyan10605_2975 [Cyanobacterium aponinum PCC
10605]
Length = 432
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 205/345 (59%), Gaps = 28/345 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ N RVT+GDVA ++GL + Q+ L LA DT G L+V++ GD++Y+F +N+R+
Sbjct: 9 MQSVEKLNYRVTVGDVATQSGLDVKIVQQELLQLANDTSGHLQVAETGDIVYLFSSNFRS 68
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAII----AILSSKSDDD 203
L K ++L+ + + KA Y I++ FG LI+SI+I+ AII AI SSK D+
Sbjct: 69 ILRNKYWQLRWKKWLQKAWDIVFYLIKISFGIILISSIIIMLLAIIVIVVAISSSKDGDN 128
Query: 204 DRGRRRRSFDSGF----NIFISPSDLFWYWDPYYYRRR--RVQTDDDDKKMNFIKSVFSF 257
+ G RR GF +ISP D FW + P Y RR R + + + ++NF++S++SF
Sbjct: 129 NGGDSRRG--GGFFFLPQFWISP-DFFWMFSPNYEERRYQRQRNNKTENELNFLESIYSF 185
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDG 317
+FG+G+PN+ +EE+RW+ I I +N G + AE++APYLD DE Y+LPVL+RF+G
Sbjct: 186 LFGDGNPNRNLEERRWREIATVIKNNNGAIIAEQVAPYLDNISNQEDEDYILPVLIRFNG 245
Query: 318 QPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMS 377
PE+ ++G I+Y FP Q TA + V +E W+FS +
Sbjct: 246 YPEVSDKGEIIYYFPELQVTAKERN--------------KASVAPYLKENLWQFSIASSG 291
Query: 378 ERGMAIGLGGLNLFGVIILGAMLQ-EMAVTPNGFLKFVAYIFPLL 421
++ AI LGG+N+ ++LG +L E+A GF+ FV I+ +L
Sbjct: 292 QKIGAIALGGVNIVLALMLGTLLTPELAQEIGGFILFVNSIYGIL 336
>gi|334121578|ref|ZP_08495642.1| hypothetical protein MicvaDRAFT_2618 [Microcoleus vaginatus FGP-2]
gi|333454867|gb|EGK83543.1| hypothetical protein MicvaDRAFT_2618 [Microcoleus vaginatus FGP-2]
Length = 438
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 191/348 (54%), Gaps = 31/348 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA KAGL ++ AQ+ L LA++ G L+V++ GD+ Y+FP N+R
Sbjct: 8 MQAVEQLGYRVTVGDVAAKAGLDIHFAQRELLTLASEAGGNLQVAESGDIAYLFPKNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDD---D 204
L K RL+++ K Y IR+ FG L+ASI+++F AI +LSS ++
Sbjct: 68 ILRNKFLRLQLQEWWQKIWRILFYLIRISFGIVLVASILLIFVAITILLSSGDSNNGGGG 127
Query: 205 RGRRRRSFDSGFNIFISP---SDLFW--YWD---PYYYRRRRVQTDDDDKKMNFIKSVFS 256
G G + P +DL W YW+ PYY +R R+ +M+F+++VFS
Sbjct: 128 GGEGGGGGGRGGSFLFFPYFWNDLIWIFYWNHDQPYYQQRSRL--TGQKPQMSFLEAVFS 185
Query: 257 FVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLL 313
F+FG+G+PN +EE++W I I +N G V AE++APYLD E Y+LP L
Sbjct: 186 FLFGDGNPNYNLEERKWIDIATAIGNNRGAVVAEQIAPYLDNLGQGYAREYEEYMLPALA 245
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RFDG+PE+ EG I+Y FP Q TA + V RE W FS
Sbjct: 246 RFDGRPEVSPEGQIIYHFPQLQTTAVQKN--------------PQPVAAYLREMLWRFSN 291
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
+ + +A GLG +NL G ++LG +L + GF+ FV+ ++P+L
Sbjct: 292 ASSGQIMLAAGLGAVNLVGALVLGNLLSNNLIA-GGFIGFVSPVYPML 338
>gi|126654854|ref|ZP_01726388.1| hypothetical protein CY0110_10472 [Cyanothece sp. CCY0110]
gi|126623589|gb|EAZ94293.1| hypothetical protein CY0110_10472 [Cyanothece sp. CCY0110]
Length = 441
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 203/349 (58%), Gaps = 29/349 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA K+GL +N AQ+ L ALA+D G L+V++ GD++Y+FP N+R
Sbjct: 8 MQSVEQLGYRVTVGDVAAKSGLNINLAQQGLLALASDVSGHLQVAESGDIIYLFPENFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL-----SSKSDD 202
L K ++L+ + +K Y IR+ FG LIASI+++ AI I+ S + D+
Sbjct: 68 ILRNKYWKLQFKETWEKIWKVLFYIIRISFGIILIASIILMLVAITVIVIGLNSSREGDN 127
Query: 203 DDRGRRRRSFDSGFNIFISPSDLFWYWDP-YYYRRR-----RVQTDDDDKKMNFIKSVFS 256
R R GF F + ++LFW + P Y YRR R + + +MNF++++FS
Sbjct: 128 SSGSRSRSYRGGGFFFFPNFNNLFWIFYPDYGYRRHGYNSSRRSNNRNSNEMNFLEAIFS 187
Query: 257 FVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDR-TMSDESYVLPVLLR 314
F+FG+GDPN +EE+RWK IG I +N G + AE++APYLD IDR +E Y+LPVL R
Sbjct: 188 FLFGDGDPNYNLEERRWKAIGTVIKNNKGSIIAEQVAPYLDNIDRYNQENEDYILPVLTR 247
Query: 315 FDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKT 374
F+G PE+ G ++Y FP Q T Q +K + RE+ ++FS+
Sbjct: 248 FNGSPEVSPNGELIYYFPELQVTV--QESTQK------------SISSYLRERLYKFSEA 293
Query: 375 NMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
S+ +AIGLG N ++LG+ L++ A+ GF+ FV I+ LL
Sbjct: 294 ASSQIMLAIGLGAFNFILALVLGSFLKDPAIVAQFGGFIAFVNSIYWLL 342
>gi|427738747|ref|YP_007058291.1| hypothetical protein Riv7116_5365 [Rivularia sp. PCC 7116]
gi|427373788|gb|AFY57744.1| hypothetical protein Riv7116_5365 [Rivularia sp. PCC 7116]
Length = 426
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 193/343 (56%), Gaps = 32/343 (9%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ N RVT+GDVA KAGL + + L ALA D G L+V+D GD++Y+FP N+R
Sbjct: 8 MQAVEKLNYRVTVGDVAQKAGLNIEITNQGLLALATDAGGHLQVADTGDIVYLFPRNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K F+L+++ +K Y IR+ FG LI SI ++ +I IL+S S DDD
Sbjct: 68 VLRNKYFKLQLQEWWNKVWKVLFYLIRISFGIVLILSIALIVISITLILTSLSGDDDDRG 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY---YR---RRRVQTDDDDKKMNFIKSVFSFVFGE 261
F DLFW++ P Y YR RRR ++D +NF+++VFSF+FG+
Sbjct: 128 GGFGGGI---GFFYFPDLFWFFSPNYNTGYRSHGRRREKSD-----LNFLEAVFSFLFGD 179
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD--ESYVLPVLLRFDGQ 318
G+PN +EE+RW+ I I +N G V AE++ PYLD I T E Y+LPVL +F+G+
Sbjct: 180 GNPNVNLEERRWQEIATVIRNNDGAVVAEQITPYLDNIGETYQQQYEDYMLPVLTKFNGK 239
Query: 319 PEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSE 378
PE+ EG I+Y FP Q +A + R+ + RE W FSK S+
Sbjct: 240 PEVSPEGEIVYYFPELQVSAKNNH--RQ------------SIPNFLREFSWRFSKATSSQ 285
Query: 379 RGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
++ GLG +N G ++LG +L++ + G + FV I+ LL
Sbjct: 286 VMLSTGLGVVNFVGALMLGGLLRDGTIA-GGLVGFVQGIYGLL 327
>gi|413937130|gb|AFW71681.1| hypothetical protein ZEAMMB73_735454 [Zea mays]
Length = 257
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 117/187 (62%), Positives = 147/187 (78%), Gaps = 5/187 (2%)
Query: 68 VGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDG 127
V PG VE+D+LP+DVR+RAM+AVD RVTIGDVA +AGL+L +A++ALQALAADT+G
Sbjct: 75 VRPGGAVETDRLPSDVRDRAMEAVDHFGGRVTIGDVASRAGLQLAQAERALQALAADTEG 134
Query: 128 FLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVI 187
FLEVS++G+VLYVFP +YRAKLA KSFR++VEP++DKAK Y +RV FGTALIASIV+
Sbjct: 135 FLEVSEDGEVLYVFPKDYRAKLAGKSFRMRVEPLVDKAKQVGAYLVRVSFGTALIASIVL 194
Query: 188 VFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKK 247
V+T IIAILSS S D+D R R S + I P+D+FWY D YYRRRRV+ ++
Sbjct: 195 VYTTIIAILSSSSSDED--GRGRRRRSYGSTIIIPTDMFWYLDADYYRRRRVE---NENG 249
Query: 248 MNFIKSV 254
MNFI+SV
Sbjct: 250 MNFIESV 256
>gi|443318784|ref|ZP_21048028.1| hypothetical protein Lep6406DRAFT_00006960 [Leptolyngbya sp. PCC
6406]
gi|442781610|gb|ELR91706.1| hypothetical protein Lep6406DRAFT_00006960 [Leptolyngbya sp. PCC
6406]
Length = 450
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 139/354 (39%), Positives = 201/354 (56%), Gaps = 31/354 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M+AV++ + RVT+GD+A AGL LN AQ+ L LAADT L+VS+ G++ Y FP N+RA
Sbjct: 8 MEAVESLDYRVTVGDLAASAGLDLNTAQRGLITLAADTQAHLQVSEAGEIAYEFPKNFRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL----SSKSDDD 203
L K +R++++ D+ Y IR+ FG L+ SI+I+ AI+ I+ SS+ DD+
Sbjct: 68 VLRNKYWRVRLQETWDRIWGVLFYLIRISFGILLLLSILIIVAAIVVIIVALNSSQKDDN 127
Query: 204 DRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQT----------DDDDKKMNFIKS 253
R F DLFW+ ++Y R + DDDD MNF+++
Sbjct: 128 RSSSRSGGGGMMFIPRFWVGDLFWF---FHYSPNRQRQQPQRQTGRSGDDDD--MNFLEA 182
Query: 254 VFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLP 310
VFSF+FG+GDPN +EE+RW+ IG I +NGG V AE++ PYLD D+ Y++P
Sbjct: 183 VFSFLFGDGDPNADLEERRWQTIGTVIRNNGGAVAAEQITPYLDGFGESWAKDDDDYMMP 242
Query: 311 VLLRFDGQPEIDEEGNILYRFPSFQRTA-ASQRIGRKEYVGRRWADAIGGVEKIFREKKW 369
VLLRF+G P++ +GNI+Y FP Q TA Q + + Y V +E W
Sbjct: 243 VLLRFNGVPQVSPQGNIIYHFPELQVTANEGQGVASRSYRKS------SSVAAYLKEASW 296
Query: 370 EFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
FS+ + + MA GLG N I+LGA++Q+ A+ GF+ FV I+ LL
Sbjct: 297 RFSQASSGKLTMAAGLGIFNFVAAIVLGALMQDPALVAELGGFIAFVNSIYWLL 350
>gi|218245546|ref|YP_002370917.1| hypothetical protein PCC8801_0676 [Cyanothece sp. PCC 8801]
gi|257058583|ref|YP_003136471.1| hypothetical protein Cyan8802_0695 [Cyanothece sp. PCC 8802]
gi|218166024|gb|ACK64761.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256588749|gb|ACU99635.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 442
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 131/351 (37%), Positives = 201/351 (57%), Gaps = 31/351 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA +AGL+LN AQ+ L ALA+D G L+V++ GD++Y+FP N+R
Sbjct: 9 MRSVEQLGYRVTVGDVAAQAGLELNLAQQGLLALASDAGGHLQVAESGDIVYLFPQNFRT 68
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVF----TAIIAILSSKSDDD 203
L K ++++++ +K Y IR+ FG LIASI+++ IAI SS+ D
Sbjct: 69 ILRNKYWKIRLQETWEKVWKVLFYLIRISFGVILIASIILMMITIAIIFIAISSSRDGDR 128
Query: 204 DRGRRRRSFDSGFNIFISP--SDLFWYWDPYY-YRRRRVQTDDDDKK------MNFIKSV 254
D R G + P SDLFW + P Y Y R ++ + KK +NF++++
Sbjct: 129 DSNSDRGGGYGGGGLIFFPNFSDLFWIFYPNYGYNRYEYESSQNYKKTPEKSELNFLEAI 188
Query: 255 FSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVL 312
FSF+FG+G+PN +E++RW+ IG I +N G V AE++APYLD ++E+ Y+LPVL
Sbjct: 189 FSFLFGDGNPNYNLEDRRWQDIGTVIRNNRGSVVAEQIAPYLDNITKYNEETEDYILPVL 248
Query: 313 LRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFS 372
RF+G P++ +G I+Y FP Q T +Q V +EK + FS
Sbjct: 249 TRFNGYPQVSPQGEIIYYFPELQVTVQNQ--------------TQRSVASYLKEKLYRFS 294
Query: 373 KTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
+ + + +AIGLG LN ++LG++L+E + G + FV I+ LL
Sbjct: 295 QASSGQIMIAIGLGALNFILALVLGSLLKEPDIVNQIGGLVAFVNSIYWLL 345
>gi|428307418|ref|YP_007144243.1| hypothetical protein Cri9333_3924 [Crinalium epipsammum PCC 9333]
gi|428248953|gb|AFZ14733.1| hypothetical protein Cri9333_3924 [Crinalium epipsammum PCC 9333]
Length = 436
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 134/347 (38%), Positives = 203/347 (58%), Gaps = 31/347 (8%)
Query: 90 AVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKL 149
AV+ RVT+GDV +AGL +N AQ L ALA++ G L+V+D GD++Y+FP N+RA L
Sbjct: 10 AVEQLGYRVTVGDVVTQAGLNVNIAQAGLLALASEAGGHLQVADTGDLVYLFPKNFRAVL 69
Query: 150 AAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL--SSKSDDDDRGR 207
K FRL+++ +K Y IR+ FG LIASI I+ I IL S+ S D++
Sbjct: 70 RNKFFRLRLKEWWEKIWRVLFYIIRISFGIVLIASIFIIIVTITVILIASNSSGDNNDNS 129
Query: 208 RRRSFDSGFNIFISP----SDLFWYWDPYY---YRRRRVQTDDDDKKMNFIKSVFSFVFG 260
G IF+ SD+FW++DP Y Y + ++ + + ++MNF+++VFSF+FG
Sbjct: 130 SSDRSGGGGMIFMPNFWFGSDIFWFFDPGYDNRYNQEQLSSLPEQRRMNFLEAVFSFLFG 189
Query: 261 EGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-----IDRTMSDESYVLPVLLRF 315
+G+PN +E++RW I I +N G V AE++APYLD R E Y+LPVL RF
Sbjct: 190 DGNPNADLEDRRWHDIATVIRNNQGAVVAEQIAPYLDSIGEGFHREY--EEYMLPVLTRF 247
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G PE+ EG I+Y FP Q T A++R + VE +E++W FS+ +
Sbjct: 248 NGLPEVSPEGQIVYHFPELQ-TTATERNSQP-------------VETYLQERRWLFSQAD 293
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQE-MAVTPNGFLKFVAYIFPLL 421
+ ++IGLG LN+ G ++LG++L+ +A G + F I+ +L
Sbjct: 294 SGQILLSIGLGALNIVGALVLGSLLKGVIAQEAVGLVAFTNSIYWIL 340
>gi|427718520|ref|YP_007066514.1| hypothetical protein Cal7507_3274 [Calothrix sp. PCC 7507]
gi|427350956|gb|AFY33680.1| hypothetical protein Cal7507_3274 [Calothrix sp. PCC 7507]
Length = 445
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/361 (35%), Positives = 197/361 (54%), Gaps = 24/361 (6%)
Query: 70 PGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFL 129
G + + +L N + AV+ RVT+GDVA + GL + EA + L +LA+D G L
Sbjct: 3 SGFLTMNARLDLKRHNGVVQAVEQLGYRVTVGDVATQTGLNVAEAGQGLLSLASDAGGHL 62
Query: 130 EVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVF 189
+V++ GD++Y+FP N RA L K ++L+++ K Y IR+ FG LI SI ++
Sbjct: 63 QVAESGDIVYLFPQNLRAILRNKYWQLRLQEWWQKLWGVLFYLIRISFGVFLIISIALIT 122
Query: 190 TAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYY----YRRRRVQTDDDD 245
II I+++ + D D R + G F P DLFWY+ P Y +RR + +
Sbjct: 123 VTIIMIMTAANSDRDSDNRGSNSSRGGGFFFFP-DLFWYFSPDYRTNRQQRRSERRQGQE 181
Query: 246 KKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTM 302
+NF ++VFSF+FG+G+PN + E+RW+ I I +N G V AE+++PYLD +
Sbjct: 182 SDLNFFEAVFSFLFGDGNPNDNLAERRWQEIATVIRNNRGAVVAEQISPYLDDIGEEYKR 241
Query: 303 SDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEK 362
E Y+LPVL+RF+GQPE+ EG I+Y FP Q A + R+ V
Sbjct: 242 EYEDYMLPVLIRFNGQPEVSPEGQIVYYFPELQVRATKKH--RQS------------VTA 287
Query: 363 IFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPL 420
E W FS + + ++ GLG LN G ++LG++L++ +A G + FV I+ L
Sbjct: 288 YLEEFLWRFSAASSGQVMLSAGLGVLNFVGALVLGSLLRDGTVAAQLGGLVAFVQGIYWL 347
Query: 421 L 421
L
Sbjct: 348 L 348
>gi|428224266|ref|YP_007108363.1| hypothetical protein GEI7407_0813 [Geitlerinema sp. PCC 7407]
gi|427984167|gb|AFY65311.1| hypothetical protein GEI7407_0813 [Geitlerinema sp. PCC 7407]
Length = 428
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/331 (39%), Positives = 187/331 (56%), Gaps = 22/331 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ N R T+GDVA +AGL + AQ+ L ALA+D G L+V++ GD+ Y F ++R
Sbjct: 8 MKAVEQLNYRATVGDVAAQAGLDIKVAQRELLALASDAGGHLQVAESGDIAYQFSKDFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K FRL+++ K Y IR+ FG L+A I + AIIAI+ S + D D
Sbjct: 68 ILRNKYFRLQLQEWWQKVWRVLFYLIRISFGIVLVALIALTLLAIIAIVMSLNRDGDGDS 127
Query: 208 RRRSFDSGF----NIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGD 263
S ++I+P DLFW++DP Y R + + MNF ++FSF+FG+G+
Sbjct: 128 SSDSDSGSGFGLPRVWITP-DLFWFFDPGYGYEARPERQQSESGMNFFVAIFSFLFGDGN 186
Query: 264 PNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI--DRT-MSDESYVLPVLLRFDGQPE 320
PN +EE+RWKLI I ++ G V AE++APYLD DR+ S E Y+LPVLLRFDG+PE
Sbjct: 187 PNAILEERRWKLIATTIRNHQGAVVAEQIAPYLDELGDRSAQSTEDYMLPVLLRFDGRPE 246
Query: 321 IDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERG 380
+ +EG ++Y FP Q AA R V +E W FS + +
Sbjct: 247 VSQEGGLIYHFPELQTMAAQTR--------------TQSVAAYLKEMSWRFSLASSGQIM 292
Query: 381 MAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
AI LGG L ++LG+++ E AV+ + F+
Sbjct: 293 GAIALGGTLLVLSLVLGSLIAEAAVSSSLFV 323
>gi|428302125|ref|YP_007140431.1| hypothetical protein Cal6303_5582 [Calothrix sp. PCC 6303]
gi|428238669|gb|AFZ04459.1| hypothetical protein Cal6303_5582 [Calothrix sp. PCC 6303]
Length = 435
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/342 (38%), Positives = 191/342 (55%), Gaps = 26/342 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVTIGDVA +AGL + A + L ALAAD G ++VS+ GDV+Y+FP N+R
Sbjct: 8 MQAVEKLGYRVTIGDVATQAGLNVELASQGLLALAADAGGNMQVSNSGDVVYLFPKNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K F++K++ K Y IR+ FG LIAS+V++ +IAI + + +D
Sbjct: 68 VLRNKYFQIKLQEWWQKIWGVLFYLIRLSFGIFLIASMVLISITLIAISIAATFSNDNDN 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY---YRRRRVQTDDDDKKMNFIKSVFSFVFGEGDP 264
+ F D +WY P Y Y +R+ + K++NF ++VFSF+FG+G+P
Sbjct: 128 GGSGGGGNWGNFFFFPDFWWYTSPNYGNDYEKRQ----REKKELNFFEAVFSFLFGDGNP 183
Query: 265 NQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD--ESYVLPVLLRFDGQPEI 321
N +EEKRWK IG I SN G V AE+++PYLD I T E Y+LPVL+RFDG+PE+
Sbjct: 184 NARLEEKRWKQIGAVIRSNRGAVIAEQISPYLDNIGETYQQEYEDYMLPVLIRFDGKPEV 243
Query: 322 DEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
EG ++Y FP Q A ++ R E ++FS + +
Sbjct: 244 SPEGQLVYYFPKLQVGAVKRQQQR--------------FSTFLEEIPFKFSNATSGQLFL 289
Query: 382 AIGLGGLNLFGVIILGAMLQ--EMAVTPNGFLKFVAYIFPLL 421
+ GLG LNL G ++LG++L+ +A G + FV I+ +L
Sbjct: 290 SAGLGILNLGGALVLGSLLKGGALAAQLGGLVAFVQSIYWVL 331
>gi|449018655|dbj|BAM82057.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 522
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 193/344 (56%), Gaps = 22/344 (6%)
Query: 83 VRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFP 142
V +R + AV++ RVT+ DVA + GL L AQ+ L ALA G LEVS +G+++Y FP
Sbjct: 93 VDDRVLGAVESLGGRVTVADVAARTGLGLPTAQRCLVALANVAGGHLEVSRDGELVYAFP 152
Query: 143 NNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSS--KS 200
+N R L +S + +K Y +R+ FG LIASI I++T+I+AI SS
Sbjct: 153 SNVRRVLLQESIIFALRNRWEKISPFLNYLVRISFGILLIASIFIIYTSIVAISSSVQSR 212
Query: 201 DDDDRGRRRRSFDSG-FNIFISPS--DLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSF 257
+DD R RS+ G +++ PS DL +Y + Y RRR + M+F++S+FSF
Sbjct: 213 EDDRRSSYNRSYGGGSLYVWMGPSPFDLIFYANHGYSGRRR------EGGMSFLESIFSF 266
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDR-------TMSDESYVLP 310
+FG+G+PN +E RW+ IGE I GG VTAE++AP LD+ DES++LP
Sbjct: 267 LFGDGNPNADLETYRWRRIGETIRRLGGAVTAEQIAPLLDVPEPPESRGAVYVDESFMLP 326
Query: 311 VLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWE 370
VL RF G+PE+ ++G+I+Y FP T + + G +E E+++
Sbjct: 327 VLERFGGRPEVTDDGDIVYVFPDLLNTVSGSGSTSGSASSEQSHYDKGYIE----EQEYV 382
Query: 371 FSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFV 414
FSK + + A GLG LNL G + LG + ++ P FLK++
Sbjct: 383 FSKASSGQILGAAGLGVLNLLGAVTLGEYMAQLRAVPLPFLKWL 426
>gi|425467289|ref|ZP_18846573.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389829960|emb|CCI28313.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 433
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 119/336 (35%), Positives = 195/336 (58%), Gaps = 31/336 (9%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFG----TALIASIVIVFTAIIAILSSKSDDD 203
L K ++L+++ K Y IR+ FG +++ + + +I ++ SD D
Sbjct: 68 ILLNKYWQLQLKAFASKIWQVVFYVIRISFGIVLIISILILLFAILAILIGLMFKDSDSD 127
Query: 204 DRGRRRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSF 257
+ S DS NI P+D FW + P YY RR R V+ + KMNF+++VFSF
Sbjct: 128 N-----NSGDSKININFFPTDFFWIFYPDFGNNYYERRDREVKAEKPPSKMNFLEAVFSF 182
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRF 315
+FG+GDPN +EE+RW+ IG+ I +N G + A ++ PYLD +E+ Y+LPVL RF
Sbjct: 183 LFGDGDPNFNLEERRWQTIGKVIQNNRGSIVAPQIVPYLDSITPSQEETEDYILPVLTRF 242
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G P++ + G+I+Y FP Q T +++ V +EK+W+FS+ +
Sbjct: 243 NGLPQVSDRGDIIYYFPDLQVTTKERKLQT--------------VSPYLKEKRWKFSEAD 288
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+ +A+GLG +N ++LG +L +V +G L
Sbjct: 289 SGQIILALGLGAVNFILALVLGYLLNSDSVDLSGSL 324
>gi|67922560|ref|ZP_00516067.1| hypothetical protein CwatDRAFT_3897 [Crocosphaera watsonii WH 8501]
gi|67855569|gb|EAM50821.1| hypothetical protein CwatDRAFT_3897 [Crocosphaera watsonii WH 8501]
Length = 442
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 135/350 (38%), Positives = 202/350 (57%), Gaps = 30/350 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA K+GL +N AQ+ L ALA+D G L+V++ GD++Y+FP+N+R
Sbjct: 8 MQSVEQLGYRVTVGDVAAKSGLDINLAQRGLLALASDVSGHLQVAESGDIIYLFPDNFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAI---LSSKSDDDD 204
L K ++L+++ +K Y IR+ FG L+ASI+++ AI I L+S D +
Sbjct: 68 ILRNKYWKLQLKETWEKIWKVLFYIIRISFGIILVASIILILIAITVIIIGLNSSRDGNS 127
Query: 205 RG---RRRRSFDSGFNIFISPSDLFWYWDPYY-YRR-----RRVQTDDDDKKMNFIKSVF 255
R GF F + DLFW + P Y Y R R D +MNF+++VF
Sbjct: 128 NNSGSRGGSYRGGGFFFFPNFGDLFWIFYPNYGYNRYDHSSSRRSQSRDANEMNFLEAVF 187
Query: 256 SFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDR-TMSDESYVLPVLL 313
SF+FG+G+PN +EE+RWK IG I +N G + AE++APYLD IDR +E Y+LPVL
Sbjct: 188 SFLFGDGNPNYNLEERRWKAIGTVIKNNKGSIIAEQVAPYLDNIDRYNKENEDYILPVLT 247
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RF+G PE+ G ++Y FP Q T Q +K + RE+ ++FS+
Sbjct: 248 RFNGNPEVSPNGELIYHFPELQVTV--QESTQK------------SISTYLRERLYKFSE 293
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
++ +AIGLG LN +ILG+ L++ ++ GF+ F+ I+ LL
Sbjct: 294 AGSNKIMLAIGLGALNFILALILGSFLKDPSIVAQFGGFIAFINSIYWLL 343
>gi|443312537|ref|ZP_21042154.1| hypothetical protein Syn7509DRAFT_00014950 [Synechocystis sp. PCC
7509]
gi|442777515|gb|ELR87791.1| hypothetical protein Syn7509DRAFT_00014950 [Synechocystis sp. PCC
7509]
Length = 431
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 129/350 (36%), Positives = 196/350 (56%), Gaps = 38/350 (10%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M+AV+ RVT+GDVA +AG +N A++ L ALA D G L+V++ G+++Y+FP N RA
Sbjct: 8 MEAVEQLGYRVTVGDVATQAGFNVNLAEQGLLALATDVGGHLQVAETGEIVYLFPKNLRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSD--DDDR 205
L K RL+++ K Y IR+ FG LIASI ++ I IL++ + DDD
Sbjct: 68 VLRNKFLRLRLQEWWKKVWRVLFYLIRISFGVLLIASIALIIVTIFVILTAANQNNDDDN 127
Query: 206 GRRRRSFDSGFNIF-----ISPSDLFWY------WDPYYYRRRRVQTDDDDKKMNFIKSV 254
RS+ G + F I P+ W+ +D +Y RR +++ ++NF +++
Sbjct: 128 SNSDRSYGGGSSFFMPYYWIGPN---WFGVFSPDYDRHYQERR-----EEESQLNFFEAI 179
Query: 255 FSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPV 311
FSF+FG+G+PN +EE+R K I I +N G V E++APYLD E Y+LPV
Sbjct: 180 FSFLFGDGNPNVDLEERRQKEIAAVIRTNRGAVVGEQIAPYLDNIGSGYVQEYEDYMLPV 239
Query: 312 LLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEF 371
L RF+GQP + EG+I+Y FP Q A+ Q+ + + +E +F
Sbjct: 240 LTRFNGQPTVSPEGHIVYVFPELQSYASRQQ-------------QLLAIPPYLQELPQQF 286
Query: 372 SKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
S + + G++IGLG LNL G + LG++L ++A T G + FV I+ LL
Sbjct: 287 SAASSGQLGLSIGLGVLNLGGALFLGSLLSDIA-TAGGLVGFVQAIYWLL 335
>gi|427714016|ref|YP_007062640.1| hypothetical protein Syn6312_3044 [Synechococcus sp. PCC 6312]
gi|427378145|gb|AFY62097.1| hypothetical protein Syn6312_3044 [Synechococcus sp. PCC 6312]
Length = 435
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 201/342 (58%), Gaps = 24/342 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA KAGL LN +QK L ALA+ G L+V++ GD+ YV P ++R
Sbjct: 12 MQAVEQLGYRVTVGDVAAKAGLDLNTSQKGLLALASAVGGDLQVAESGDIAYVLPRDFRG 71
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDD-DDRG 206
L +K R++++ K A Y IR+ FG LI SIV++F AII I+ + S +D
Sbjct: 72 ILRSKYLRIQLQEAWAKVWAVLFYLIRMSFGIFLILSIVLIFVAIIVIVIAASSSRNDNN 131
Query: 207 RRRRSFDSGFNIFISPSDLFWYW--DPYYYRRRRVQTDDDD-KKMNFIKSVFSFVFGEGD 263
R FD F FI D +W++ DPY RR+ + + ++MNF++ ++SF+FG+G+
Sbjct: 132 NRSGGFD--FPRFIFFPDFWWFFGSDPYRPRRQSSRNQRQNPEEMNFLEGIYSFLFGDGN 189
Query: 264 PNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI--DRTMSDESYVLPVLLRFDGQPEI 321
PN +E +RW+ +G+ I + G V AE++APYLDI S E++++PVL RF+G P++
Sbjct: 190 PNVNLESRRWQEVGQQIINQKGAVVAEQIAPYLDIPPGPLPSGENFMIPVLSRFNGVPQV 249
Query: 322 DEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
E+G I+Y FP Q TA QR +K V E W+F++ + + +
Sbjct: 250 TEQGEIIYHFPDLQATAKQQR--QKP------------VSPYLEEITWKFTRASSGQVML 295
Query: 382 AIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
AIGLG +NL G ++L +L + +V G + FV IF +L
Sbjct: 296 AIGLGCINLVGALVLWNLLGDGSVATQLGGIVAFVQSIFWVL 337
>gi|376003682|ref|ZP_09781490.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
gi|375327980|emb|CCE17243.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
Length = 451
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 132/350 (37%), Positives = 191/350 (54%), Gaps = 30/350 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
++AV+ RV+ DVA +AGL++N AQ+ L ALA++ G L+VS+ GD+ ++FP N+RA
Sbjct: 14 IEAVEKLGYRVSSADVAIQAGLEVNLAQQQLLALASEAGGHLQVSESGDIAFLFPQNFRA 73
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL---SSKSDDDD 204
+ + +RL+V K Y IR+ FG L+ SIV++F I AI+ S
Sbjct: 74 VMRNRYWRLRVYEWWQKVWRILFYLIRISFGIMLLVSIVLIFITIAAIVFFGDSNRSGGG 133
Query: 205 RGRRRRSFDSGFNIFI-----SPSDL-FWYWD--PYYYRRRRVQTDDDDKKMNFIKSVFS 256
G IFI P+ L F WD YY R++++ +KMNF + VFS
Sbjct: 134 GGGGGGGSRGRGIIFIPHFWFHPNSLSFLSWDNGNYYRRQQKIANKLKKEKMNFFEVVFS 193
Query: 257 FVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD--ESYVLPVLL 313
F+FG+G+PN +E +RW+ I I +N G V AE++APYLD + S E Y+LPVL
Sbjct: 194 FLFGDGNPNYNLEAQRWQAIATVIRNNQGAVVAEQIAPYLDNLGNAYSQEFEDYMLPVLT 253
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RF+GQPE+ EG ++Y FP Q TA R V +E W+FS
Sbjct: 254 RFNGQPEVSPEGQLVYHFPELQTTATQYR--------------PQPVPAYLKENNWKFSN 299
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ +A GLG +N G +ILG +L++ MA G + FV I+PLL
Sbjct: 300 ATSNQLMLAAGLGAVNFVGALILGHLLEDGAMAAQMGGLVAFVEMIYPLL 349
>gi|428211752|ref|YP_007084896.1| hypothetical protein Oscil6304_1262 [Oscillatoria acuminata PCC
6304]
gi|428000133|gb|AFY80976.1| hypothetical protein Oscil6304_1262 [Oscillatoria acuminata PCC
6304]
Length = 436
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 199/348 (57%), Gaps = 29/348 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M V+ + RVT+GDVA AGL++N A++ L ALA++ G L+V+D GD++Y FP +RA
Sbjct: 8 MQTVEKLDYRVTVGDVATSAGLQVNLAERELLALASEAGGHLQVADTGDIVYQFPKQFRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K FR++++ +K Y IR+ FG LI SI I+F +I +L+ + D R
Sbjct: 68 ILRNKYFRIQLQEWWEKVWRILFYLIRISFGIILILSIFIIFASIFILLTMANKDSSSNR 127
Query: 208 RRRSFDSGF---NIFISPSDLFWYWDPYY--YRRRRVQTDD----DDKKMNFIKSVFSFV 258
R +F P D+FW+++P Y Y+R+R Q ++ MNF++SVFSF+
Sbjct: 128 SRSRGGGMIFFPRMFWGP-DIFWFFNPNYGRYQRQRRQKAQASSSSEESMNFLESVFSFL 186
Query: 259 FGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLLRF 315
FG+G+PN +EE+RW+ I I +N G V AE++APYLD + E Y+LPVL RF
Sbjct: 187 FGDGNPNANLEERRWQAIAATIRNNQGAVVAEQIAPYLDDLGTGYSNEYEDYMLPVLTRF 246
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G+PE+ +G I+Y FP Q T QR V RE W+FS+ +
Sbjct: 247 NGRPEVSPDGQIVYHFPELQ-TTVQQR-------------NTLAVPAYLRELPWKFSEAS 292
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
++ +A GLG LN+ G I LG +LQ+ A+ G + F A I+ LL
Sbjct: 293 SNQLSLAAGLGVLNIGGGIFLGYLLQDGAIAAQIGGLVAFAASIYWLL 340
>gi|165968223|gb|ABY75921.1| At5g03900 [Arabidopsis thaliana]
gi|165968225|gb|ABY75922.1| At5g03900 [Arabidopsis thaliana]
gi|165968227|gb|ABY75923.1| At5g03900 [Arabidopsis thaliana]
gi|165968229|gb|ABY75924.1| At5g03900 [Arabidopsis thaliana]
gi|165968231|gb|ABY75925.1| At5g03900 [Arabidopsis thaliana]
gi|165968233|gb|ABY75926.1| At5g03900 [Arabidopsis thaliana]
gi|165968235|gb|ABY75927.1| At5g03900 [Arabidopsis thaliana]
gi|165968237|gb|ABY75928.1| At5g03900 [Arabidopsis thaliana]
gi|165968239|gb|ABY75929.1| At5g03900 [Arabidopsis thaliana]
gi|165968241|gb|ABY75930.1| At5g03900 [Arabidopsis thaliana]
gi|165968243|gb|ABY75931.1| At5g03900 [Arabidopsis thaliana]
gi|165968245|gb|ABY75932.1| At5g03900 [Arabidopsis thaliana]
gi|165968247|gb|ABY75933.1| At5g03900 [Arabidopsis thaliana]
gi|165968249|gb|ABY75934.1| At5g03900 [Arabidopsis thaliana]
Length = 198
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 112/181 (61%), Positives = 137/181 (75%), Gaps = 7/181 (3%)
Query: 25 INLKPPD---SFPRIQPLPFPRISGKIPGSRVLVPVAKAST--DVAVGVGPGRIVESDKL 79
I L+ P SFPR+ L +S + +R + V KA++ V+ + PG +VESDKL
Sbjct: 20 IRLRSPVDRYSFPRM--LTERCLSTRRKFNRHGIAVVKAASLDKVSGAIKPGGLVESDKL 77
Query: 80 PADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLY 139
P DVR RAMDAVD C RRVT+GDVA + GLK+ EAQ ALQA+AADTDGFLEVSDEGDVLY
Sbjct: 78 PTDVRKRAMDAVDECGRRVTVGDVASRGGLKVTEAQTALQAIAADTDGFLEVSDEGDVLY 137
Query: 140 VFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSK 199
VFP +YR KLAAKS R+++EP ++KAK A +Y RV FGTALIASIVIV+T+IIA+LSSK
Sbjct: 138 VFPRDYRTKLAAKSLRIQIEPFLEKAKGAVDYLARVSFGTALIASIVIVYTSIIALLSSK 197
Query: 200 S 200
S
Sbjct: 198 S 198
>gi|209526900|ref|ZP_03275419.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|423063841|ref|ZP_17052631.1| hypothetical protein SPLC1_S130880 [Arthrospira platensis C1]
gi|209492679|gb|EDZ93015.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|406714690|gb|EKD09851.1| hypothetical protein SPLC1_S130880 [Arthrospira platensis C1]
Length = 450
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/349 (37%), Positives = 191/349 (54%), Gaps = 29/349 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
++AV+ RV+ DVA +AGL++N AQ+ L ALA++ G L+VS+ GD+ ++FP N+RA
Sbjct: 14 IEAVEKLGYRVSSADVAIQAGLEVNLAQQQLLALASEAGGHLQVSESGDIAFLFPQNFRA 73
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL--SSKSDDDDR 205
+ + +RL+V K Y IR+ FG L+ SIV++F I AI+ +
Sbjct: 74 VMRNRYWRLRVYEWWQKVWRILFYLIRISFGIMLLVSIVLIFITIAAIVFFGDSNRSGGG 133
Query: 206 GRRRRSFDSGFNIFI-----SPSDL-FWYWD--PYYYRRRRVQTDDDDKKMNFIKSVFSF 257
G IFI P+ L F WD YY R++++ +KMNF + VFSF
Sbjct: 134 GGGGGGSRGRGLIFIPHFWFHPNSLSFLSWDNGNYYRRQQKIANKLKKEKMNFFEVVFSF 193
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD--ESYVLPVLLR 314
+FG+G+PN +E +RW+ I I +N G V AE++APYLD + S E Y+LPVL R
Sbjct: 194 LFGDGNPNYNLEAQRWQAIATVIRNNQGAVVAEQIAPYLDNLGNAYSQEFEDYMLPVLTR 253
Query: 315 FDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKT 374
F+GQPE+ EG ++Y FP Q TA R V +E W+FS
Sbjct: 254 FNGQPEVSPEGQLVYHFPELQTTATQYR--------------PQPVPTYLKENNWKFSNA 299
Query: 375 NMSERGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ +A GLG +N G +ILG +L++ MA G + FV I+PLL
Sbjct: 300 TSNQLMLAAGLGAVNFVGALILGHLLEDGAMAAQMGGLVAFVEMIYPLL 348
>gi|422305103|ref|ZP_16392440.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9806]
gi|389789660|emb|CCI14389.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9806]
Length = 434
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 116/336 (34%), Positives = 196/336 (58%), Gaps = 31/336 (9%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLF----GTALIASIVIVFTAIIAILSSKSDDD 203
L K ++L+++ K Y IR+ F +++ ++ + +I ++ SD D
Sbjct: 68 ILLNKYWQLQLKAFASKIWKVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 204 DRGRRRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSF 257
+ S + NI P+D FW + P YY +R R V+ + KMNF+++VFSF
Sbjct: 128 N-----NSGNKNININFFPTDFFWIFYPDFGNNYYEKRDREVKAEKPPSKMNFLEAVFSF 182
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRF 315
+FG+GDPN +EE+RW++IG+ I +N G + A ++APYLD + +E+ Y+LPVL RF
Sbjct: 183 LFGDGDPNFNLEERRWQIIGKVIQNNRGSIIAPQIAPYLDSINPIQEETEDYILPVLTRF 242
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G P++ + G+I+Y FP Q T ++ + V +EK W+FS+ +
Sbjct: 243 NGLPQVSDRGDIIYYFPDLQVTTKERK--------------VQAVSPYLKEKPWKFSEAD 288
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+ +A+GLG +N ++LG +L +V +G L
Sbjct: 289 SGQIILALGLGAVNFILALVLGYLLNSDSVDLSGSL 324
>gi|425439346|ref|ZP_18819674.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389720468|emb|CCH95857.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
Length = 434
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/336 (35%), Positives = 195/336 (58%), Gaps = 31/336 (9%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLF----GTALIASIVIVFTAIIAILSSKSDDD 203
L K ++L+++ K Y IR+ F +++ ++ + +I ++ SD D
Sbjct: 68 ILLNKYWQLQLKAFGSKIWQVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 204 DRGRRRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSF 257
+ S DS NI P+D FW + P YY RR R V+ + KMNF+++VFSF
Sbjct: 128 N-----NSGDSKININFFPTDFFWIFYPDFGNNYYERRDREVKAEKPPSKMNFLEAVFSF 182
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRF 315
+FG+GDPN +EE+RW+ IG+ I +N G + A ++APYLD +E+ Y+LPVL RF
Sbjct: 183 LFGDGDPNFNLEERRWQTIGKVIQNNRGSIIAPQIAPYLDSITPSQEETEDYILPVLTRF 242
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G P++ + G+I+Y FP Q T ++ + + REK W+FS+ +
Sbjct: 243 NGLPQVSDRGDIIYYFPDLQVTTKERK--------------VQALSPYLREKSWKFSEAD 288
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+ +A+GLG +N ++LG +L +V +G L
Sbjct: 289 SGQIILALGLGVVNFILALVLGFLLNSDSVDLSGSL 324
>gi|186685321|ref|YP_001868517.1| hypothetical protein Npun_F5248 [Nostoc punctiforme PCC 73102]
gi|186467773|gb|ACC83574.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 432
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 126/341 (36%), Positives = 190/341 (55%), Gaps = 23/341 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA +AGL + EA ++L ALA+D G L+V+D GD++Y+FP N+RA
Sbjct: 8 MQAVEKLGYRVTVGDVATQAGLNVAEANQSLLALASDAGGHLQVADSGDIVYLFPQNFRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K F+L+++ K + Y IR+ FG L SI ++ I I+++ + D D
Sbjct: 68 ILRNKYFQLRLQEWWKKVWSVLFYLIRISFGIFLTVSIALITITIFIIITAANSDRDGDN 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY--YRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPN 265
R + G + + FWY P Y Y + R + ++ +NF ++VFSF+FG+G+PN
Sbjct: 128 RGSNSGGGGFFYF--PNFFWYLSPNYDTYYQERRRETREESNLNFFEAVFSFLFGDGNPN 185
Query: 266 QGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLLRFDGQPEID 322
++E+RW+ I I SN G V AE++APYLD T E Y+LPVL RF+GQP +
Sbjct: 186 ANLDERRWQEIATVIRSNRGAVVAEQIAPYLDDISQGYTREYEDYMLPVLTRFNGQPAVS 245
Query: 323 EEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMA 382
EG I+Y FP Q +A +R + E W FS + + ++
Sbjct: 246 PEGQIVYYFPELQVSAVKKR--------------RHSISSYLEELPWRFSAASSGQIMLS 291
Query: 383 IGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
GLG LN G ++LG++L++ +A G + FV I+ LL
Sbjct: 292 AGLGALNFVGALVLGSLLRDGTVAAQLGGLVAFVQGIYWLL 332
>gi|411118128|ref|ZP_11390509.1| hypothetical protein OsccyDRAFT_1985 [Oscillatoriales
cyanobacterium JSC-12]
gi|410711852|gb|EKQ69358.1| hypothetical protein OsccyDRAFT_1985 [Oscillatoriales
cyanobacterium JSC-12]
Length = 431
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 194/349 (55%), Gaps = 37/349 (10%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ N RVT+GDVA +AGL +N A+ L ALA++ G L+VS+ GD++Y+FP +R+
Sbjct: 8 MKAVEHLNYRVTVGDVAMQAGLDINLAESGLLALASEAGGHLQVSESGDIVYLFPQQFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL----SSKSDDD 203
L K RL+++ +K Y IR+ FG LI SI+++F AI AI+ +S+ D
Sbjct: 68 ILRNKYLRLQLQEWWEKVWRILFYIIRISFGIILIVSIILIFLAIFAIIIASTASRDGDS 127
Query: 204 DRGRRRRSFDSGFNIFISPSDLFWYWDPYY------YRRRRVQTDDDDKKMNFIKSVFSF 257
D G S NI+ P DLFW + P Y YRRR + D MNF++++FSF
Sbjct: 128 DSG----GSVSMPNIWFGP-DLFWIFYPSYNERPTAYRRR---STGQDSSMNFLEAIFSF 179
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLLR 314
+FG+GDPN +EE+RW+ IG I +N G V AE++APYLD E Y+LPVL R
Sbjct: 180 LFGDGDPNADLEERRWRAIGTVIRNNRGAVIAEQIAPYLDNVGQGYAREYEEYMLPVLTR 239
Query: 315 FDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKT 374
F+G+PE+ EG ++Y F Q TA R V +E +FS
Sbjct: 240 FNGRPEVSPEGGLVYHFADLQTTAVENRFQP--------------VSAYLKEFPRKFSNA 285
Query: 375 NMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
+ MA+GLG LNL G + LG +L A+ G + FV I+ LL
Sbjct: 286 TAGQILMAVGLGSLNLIGALALGRLLAGGAIAAQMGGLVAFVQSIYWLL 334
>gi|166364486|ref|YP_001656759.1| hypothetical protein MAE_17450 [Microcystis aeruginosa NIES-843]
gi|166086859|dbj|BAG01567.1| hypothetical protein MAE_17450 [Microcystis aeruginosa NIES-843]
Length = 434
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 129/335 (38%), Positives = 197/335 (58%), Gaps = 29/335 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSS---KSDDDD 204
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL K D D
Sbjct: 68 ILLNKYWQLQLKAFGSKIWQVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDGDSD 127
Query: 205 RGRRRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFV 258
S DS NI P+D FW + P YY RR R V+ + KMNF+++VFSF+
Sbjct: 128 NN----SGDSKININFFPTDFFWIFYPDFSNNYYERRDREVKAEKPPSKMNFLEAVFSFI 183
Query: 259 FGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFD 316
FG+GDPN +EE+RW+ IG+ I +N G + A ++ PYLD +E+ Y+LPVL RF+
Sbjct: 184 FGDGDPNFNLEERRWQTIGKVIQNNRGSIIAPQIVPYLDSITPSQEETEDYILPVLTRFN 243
Query: 317 GQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNM 376
G P++ + G+I+Y FP Q T +++ V REK W+FS+ +
Sbjct: 244 GLPQVSDRGDIIYYFPDLQVTTKERKLQT--------------VSPYLREKSWKFSEADS 289
Query: 377 SERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+ +A+GLG +N ++LG +L +V +G L
Sbjct: 290 GQIILALGLGVVNFILALVLGFLLNSDSVDLSGSL 324
>gi|16329601|ref|NP_440329.1| hypothetical protein slr1603 [Synechocystis sp. PCC 6803]
gi|383321342|ref|YP_005382195.1| hypothetical protein SYNGTI_0433 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324512|ref|YP_005385365.1| hypothetical protein SYNPCCP_0433 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490396|ref|YP_005408072.1| hypothetical protein SYNPCCN_0433 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435662|ref|YP_005650386.1| hypothetical protein SYNGTS_0433 [Synechocystis sp. PCC 6803]
gi|451813760|ref|YP_007450212.1| hypothetical protein MYO_14380 [Synechocystis sp. PCC 6803]
gi|1652084|dbj|BAA17009.1| slr1603 [Synechocystis sp. PCC 6803]
gi|339272694|dbj|BAK49181.1| hypothetical protein SYNGTS_0433 [Synechocystis sp. PCC 6803]
gi|359270661|dbj|BAL28180.1| hypothetical protein SYNGTI_0433 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359273832|dbj|BAL31350.1| hypothetical protein SYNPCCN_0433 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277002|dbj|BAL34519.1| hypothetical protein SYNPCCP_0433 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957482|dbj|BAM50722.1| hypothetical protein BEST7613_1791 [Synechocystis sp. PCC 6803]
gi|451779729|gb|AGF50698.1| hypothetical protein MYO_14380 [Synechocystis sp. PCC 6803]
Length = 440
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 114/325 (35%), Positives = 188/325 (57%), Gaps = 25/325 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ + VT+GDVA +AGL++N+ Q+ L LA++ +G L+V++ G++++ FP +R
Sbjct: 19 MTAVEQLDYVVTVGDVASQAGLEINQTQQGLLTLASEVEGHLQVAESGEIVFAFPKQFRT 78
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFG---TALIASIVIVFTAIIAILSSKSDDDD 204
L K +RL+ + + Y IR+ FG I +++ AII ++SKSD+D+
Sbjct: 79 ILRNKYWRLRFQSWLQSIWQVLFYLIRISFGIILILSILLMLVAIIAIIIAVNSKSDNDN 138
Query: 205 RGRRRRS----FDSGFNIFISPSDLFWYWDPYYYRRRR--VQTDDDDKKMNFIKSVFSFV 258
G + S G IF P D+FW P R++R ++D + ++NF++++FSF+
Sbjct: 139 DGGFKFSGDGNRGGGGGIFFWPGDIFWLLSPDGGRQKRDKKRSDKNKNELNFLEAIFSFL 198
Query: 259 FGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD--IDRTMSDESYVLPVLLRFD 316
FG+G+PN+ +EEKRW+ +G I NGG + AE+ APYLD +E Y++PVL RF+
Sbjct: 199 FGDGNPNEDLEEKRWQTLGSLIRHNGGAIAAEQAAPYLDNVTKFNRDNEDYIIPVLARFN 258
Query: 317 GQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNM 376
G P++ G ++Y FP+ Q +A ++ + REK W+FS+ +
Sbjct: 259 GYPQVSPSGELIYTFPNLQVSAQERQSTT--------------LSAYLREKPWKFSQASS 304
Query: 377 SERGMAIGLGGLNLFGVIILGAMLQ 401
++ AI LGG+NL + LG ML+
Sbjct: 305 GQKIAAIALGGVNLILALSLGVMLK 329
>gi|428772634|ref|YP_007164422.1| hypothetical protein Cyast_0801 [Cyanobacterium stanieri PCC 7202]
gi|428686913|gb|AFZ46773.1| hypothetical protein Cyast_0801 [Cyanobacterium stanieri PCC 7202]
Length = 431
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 124/356 (34%), Positives = 203/356 (57%), Gaps = 36/356 (10%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ + RVT+GDVA ++GL + AQ+ L ALA++ G L+V++ GD++Y FPN++R
Sbjct: 8 MKSVEKLDYRVTVGDVASESGLSVALAQQELLALASEAGGHLQVAETGDMVYAFPNDFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL----SSKSDDD 203
L + ++L+ + + K Y IR+ FG L+ASI+I+ AI I+ +S+ D++
Sbjct: 68 ILRNRYWKLRAKQWLSKVWEVVFYLIRISFGILLVASIIILAVAIAVIVIALSASRGDNN 127
Query: 204 DRGRRRRS-----FDSGFNIFISPSDLF---WYWDPYYYRRRRV----QTDDDDKKMNFI 251
+R R F F + +P +F ++ + YYY++ Q ++NF+
Sbjct: 128 NRSSNSRGGGGMMFMPNFWLMPNPFSIFSPRYHRNNYYYQQYNRGNIPQQPKQKSELNFL 187
Query: 252 KSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMS--DESYVL 309
+S++SF+FG+G+PN +EE+RW+ I I +N G V AE++APYLD + DE Y+L
Sbjct: 188 ESIYSFLFGDGNPNPNLEERRWQEIAAVIQNNQGAVIAEQVAPYLDNINIYNERDEDYML 247
Query: 310 PVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKW 369
PVL RF+G PE+ E GNI+Y FP Q A + K + E KW
Sbjct: 248 PVLTRFNGYPEVSETGNIIYYFPELQVKA---NVKEK-----------SSISPYLEENKW 293
Query: 370 EFSKTNMSERGMAIGLGGLNLFGVIILGAMLQE--MAVTPN--GFLKFVAYIFPLL 421
+FS S++ MAI LGG+ V++ ++ Q+ ++P+ F+ FV +I+P+L
Sbjct: 294 QFSIATSSQKIMAIALGGIYFILVLVFYSLAQQDITTLSPDLLYFIDFVRFIYPIL 349
>gi|113474496|ref|YP_720557.1| hypothetical protein Tery_0643 [Trichodesmium erythraeum IMS101]
gi|110165544|gb|ABG50084.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 449
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 128/360 (35%), Positives = 197/360 (54%), Gaps = 43/360 (11%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ + RVT+GDVA +AGL + A++AL ALA D G L+VS+ G++ Y+FPNN+R+
Sbjct: 8 MTAVENLDYRVTLGDVAAQAGLDIKVAEQALLALATDAGGHLQVSESGEIAYLFPNNFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFG-----TALIASIVIVFTAIIAILSSKSDD 202
L K +RL++ +K Y IRV FG + +I + I I SS+ +D
Sbjct: 68 ILRNKFWRLRLREWWEKVWRILFYLIRVSFGFLLILSIIIIMVAIAIIVISINSSSEQND 127
Query: 203 DDRGRRRRSFDSGF--NIFISPSDLFWYW--------DPYYYRRRRVQTDD------DDK 246
G GF +P +W W PY Y+R+ + + +
Sbjct: 128 SCEGENNSGASMGFFPRFGFNP---YWLWLFDWGYYDQPYRYQRKNQRKTNKSFIPKKEN 184
Query: 247 KMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMS 303
+MNF++++FSF+FG+G+PN ++E+RW+ I I +N G V AE++APYLD + +
Sbjct: 185 EMNFLEAIFSFLFGDGNPNYNLDEQRWQAIATVIKNNQGAVVAEQIAPYLDDLGKEYAVE 244
Query: 304 DESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKI 363
E Y+LPVL+RF+G+PE+ +G I+Y FP Q TA ++W V
Sbjct: 245 YEEYILPVLIRFNGRPEVSPDGQIVYHFPDLQTTA------------KQW--HFEPVSTY 290
Query: 364 FREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
EKK FSK N ++ +AIGLG +N G ++LG++L++ M + FV I+P+L
Sbjct: 291 LIEKKLRFSKANSNQIMLAIGLGAVNFIGALVLGSLLEDDRMIEQIGWIINFVEVIYPVL 350
>gi|298492734|ref|YP_003722911.1| hypothetical protein Aazo_4514 ['Nostoc azollae' 0708]
gi|298234652|gb|ADI65788.1| conserved hypothetical protein ['Nostoc azollae' 0708]
Length = 424
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 194/340 (57%), Gaps = 23/340 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT+GDVA +AGL + EA + L ALA+D G L+V++ GD++Y FP N+R
Sbjct: 8 MRSVEQLGYRVTVGDVATQAGLNVGEANQGLLALASDAGGHLQVAESGDIVYQFPQNFRN 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K +L+++ K Y IR+ FG LIASI ++ +I I+++ + D D G
Sbjct: 68 ILRNKYLQLRLQEWWKKIWGVLFYLIRISFGLLLIASIALITVTVIIIITAANSDRD-GD 126
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQG 267
R S SGFN F P DLFWY+ YY + R + +D K NF +SVFSF+FG+G+PN
Sbjct: 127 NRSSSSSGFNFFFFP-DLFWYFSLDYYSQERRRERREDSKFNFFESVFSFLFGDGNPNAN 185
Query: 268 IEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQPEIDEE 324
+EE+RW+ IG I ++ G V AE++APYLD E Y+LPVL++F+G+P++ +
Sbjct: 186 LEERRWQEIGTLIRNHKGAVVAEQIAPYLDNLGEKYQQEHEDYMLPVLVQFNGKPQVSPD 245
Query: 325 GNILYRFPSFQ-RTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGMAI 383
G I+Y FP Q + + QR Y+ E W FS+ + + ++
Sbjct: 246 GQIVYYFPELQVKASKKQRQSIAPYL---------------EELSWRFSEASSEQIMLSA 290
Query: 384 GLGGLNLFGVIILGAMLQ--EMAVTPNGFLKFVAYIFPLL 421
GLG +NL ++LG++L A + FV I+ LL
Sbjct: 291 GLGAVNLVAALMLGSLLSGATAAAKIGVLVGFVQGIYWLL 330
>gi|284929774|ref|YP_003422296.1| hypothetical protein UCYN_12490 [cyanobacterium UCYN-A]
gi|284810218|gb|ADB95915.1| hypothetical protein UCYN_12490 [cyanobacterium UCYN-A]
Length = 423
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 118/339 (34%), Positives = 193/339 (56%), Gaps = 24/339 (7%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNY 145
+ +D+V+ + RVT+GDVA +G+ + QK L +LA+DT G L+V++ G ++Y+F N
Sbjct: 6 KIIDSVEKLDYRVTVGDVASYSGISPSVVQKELLSLASDTFGNLQVTESGTIIYIFSKNL 65
Query: 146 RAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDR 205
R K+ ++ + K A Y IR+ FG LI+SI++V AI+ I++S D +
Sbjct: 66 RTIFFKKNLISRLNVIWIKVWAVLFYLIRISFGVILISSILVVLAAIVIIITSLQSDREE 125
Query: 206 GRRRRSFDSGFNIFISPS--DLFWYWDPY----YYRRRRVQTDDDDKKMNFIKSVFSFVF 259
R + G + F P+ DLFW + P + + + D +K+ F+++VFSF+F
Sbjct: 126 NNRYSNQSRGISFFFLPNFGDLFWIFYPTRSYNSFSSSKYKEDTTSEKITFLEAVFSFLF 185
Query: 260 GEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD-ESYVLPVLLRFDG 317
G+G+PN +EE RW++I I +N GVV AE+LAPYLD I++ D E YVLPVL RF G
Sbjct: 186 GDGNPNLNLEEIRWEIISRVIYNNKGVVIAEQLAPYLDGINKYNEDNEDYVLPVLKRFAG 245
Query: 318 QPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMS 377
P + +G ++Y FP Q ++ + K V F EK + F++ + S
Sbjct: 246 IPNVTSKGELIYSFPELQ--TFTESLPEKH------------VNTNFEEKLYNFTRISSS 291
Query: 378 ERGMAIGLGGLNLFGVIILGAML--QEMAVTPNGFLKFV 414
++ + LG +N V++LG++L Q + + +GF+ F+
Sbjct: 292 QKVSVVSLGVINFILVLVLGSLLKDQNIILELDGFIGFI 330
>gi|409993764|ref|ZP_11276894.1| hypothetical protein APPUASWS_21678 [Arthrospira platensis str.
Paraca]
gi|409935369|gb|EKN76903.1| hypothetical protein APPUASWS_21678 [Arthrospira platensis str.
Paraca]
Length = 451
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 128/353 (36%), Positives = 192/353 (54%), Gaps = 36/353 (10%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
++AV+ RV+ DVA +AGL++N AQ+ L ALA++ G L+V++ GD+ ++FP N+RA
Sbjct: 14 IEAVEKLGYRVSSADVAIQAGLEVNLAQQQLLALASEAGGHLQVAESGDIAFLFPQNFRA 73
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL---SSKSDDDD 204
+ + +RL+V K Y IR+ FG L+ SIV++F I AIL S +
Sbjct: 74 VMRNRYWRLRVYEWWQKVWRILFYLIRISFGIMLLVSIVLIFITIAAILFFGDSNRNGGG 133
Query: 205 RGRRRRSFDSGFNIFISPSDLFWY---------WD--PYYYRRRRVQTDDDDKKMNFIKS 253
G IFI FW+ WD YY +++++ + +KMNF +
Sbjct: 134 GGGGGGGSRGRGLIFIPH---FWFHPNSFSFLSWDNGNYYRQQQKITKKQNKEKMNFFEV 190
Query: 254 VFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD--ESYVLP 310
VFSF+FG+G+PN +E +RW+ I I +N G V AE++APYLD + S E Y+LP
Sbjct: 191 VFSFLFGDGNPNYNLEAQRWQEIATVIRNNQGAVVAEQIAPYLDNLGNAYSQEFEDYMLP 250
Query: 311 VLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWE 370
VL RF+GQPE+ +G ++Y FP Q TA R V +E W+
Sbjct: 251 VLTRFNGQPEVSPQGQLVYHFPELQTTATKYR--------------PQPVPAYLKENNWK 296
Query: 371 FSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
FS ++ +A GLG +N G +ILG +L++ A+ G + FV I+PLL
Sbjct: 297 FSNATSNQLMLAAGLGAVNFVGALILGHLLEDGAIAAQMGGLVAFVEIIYPLL 349
>gi|37522358|ref|NP_925735.1| hypothetical protein glr2789 [Gloeobacter violaceus PCC 7421]
gi|35213358|dbj|BAC90730.1| glr2789 [Gloeobacter violaceus PCC 7421]
Length = 432
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 126/341 (36%), Positives = 192/341 (56%), Gaps = 23/341 (6%)
Query: 84 RNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPN 143
R + + AV+ N R T+GDVA ++GL+L Q+ L +LA D DG L+VS+ G+++Y F
Sbjct: 14 REQILGAVETLNYRATVGDVATRSGLELTTVQQELNSLAQDGDGHLQVSNTGEIVYAFEA 73
Query: 144 NYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDD 203
+ R +L K +++ ++ KA A Y +R+ FG L+ S++++ I +LSS+ D D
Sbjct: 74 DVRDRLLRKDRNARLKALLKKAWQVAFYLVRISFGILLVLSLLLIAIGIYVVLSSR-DGD 132
Query: 204 DRGRRRRSFDSGFNIFISPSDLFW--YWDPY-YYRRRRVQTDDDDKKMNFIKSVFSFVFG 260
G G + F+ YW PY YY RR++ +D M F++SV+SF+FG
Sbjct: 133 SGGEGESRGGGGGMPGFFWGNFFYIFYWPPYGYYEERRLKDPND---MGFLESVYSFLFG 189
Query: 261 EGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDGQPE 320
+G+PN +E +RW+ + I +N GVV AE+LAPYL+ R S E +VLP L+++DG PE
Sbjct: 190 DGNPNADLESRRWRTVAGLIQANRGVVVAEQLAPYLEAARVDS-EDFVLPALVKYDGVPE 248
Query: 321 IDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERG 380
+ E+G I+YRFP Q A RR A + G +EK+W+FSK +
Sbjct: 249 VSEQGEIVYRFPQLQVQAEE----------RRPAAPLPG---FLQEKRWQFSKAPGGKLV 295
Query: 381 MAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
+A LG N G L +LQ + P+ L F A + P+L
Sbjct: 296 LAGALGVANFVGAWFLYFLLQSTNLPPD--LGFFAALAPVL 334
>gi|434395039|ref|YP_007129986.1| hypothetical protein Glo7428_4383 [Gloeocapsa sp. PCC 7428]
gi|428266880|gb|AFZ32826.1| hypothetical protein Glo7428_4383 [Gloeocapsa sp. PCC 7428]
Length = 443
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 191/351 (54%), Gaps = 43/351 (12%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA ++GL ++ AQ+ L LA++ G ++V++ G+++Y FP N R
Sbjct: 8 MQAVEQLGYRVTVGDVATQSGLNVSLAQQGLLMLASEAGGHMQVAESGEIVYQFPKNLRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSS-KSDDDDRG 206
L K +RL+++ + Y IR+ FG L+ SI I+F +I I+S D+DDRG
Sbjct: 68 VLRNKFWRLRLQAWWQRVWRILFYLIRISFGIVLLLSIAIIFLSIFIIISMINRDNDDRG 127
Query: 207 RRRRSFDSGFNIF-----ISPSDLFWYW------DPYYYRRRRVQTDDDDKKMNFIKSVF 255
+ SG ++ I P+ WYW D Y +RRR + ++NF++++F
Sbjct: 128 ----NHSSGGMVYMPHFWIGPN---WYWFLYPDYDTRYQQRRR-----EKNELNFLEAIF 175
Query: 256 SFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVL 312
SF+FG+G+PN +EE+RW+ I I +N G V AE++APYLD E Y+LPVL
Sbjct: 176 SFLFGDGNPNANLEEQRWQSIATVIRNNSGAVVAEQIAPYLDDIGSGYAQEYEDYMLPVL 235
Query: 313 LRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFS 372
RF+GQP + +G ++Y FP Q +AA QR V +E W FS
Sbjct: 236 TRFNGQPAVSPQGQLVYHFPELQVSAAQQR--------------SRAVAPFLQESLWRFS 281
Query: 373 KTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
+ + +A GLG LN G + LG +L + A+ G + F I+ LL
Sbjct: 282 AASSGQIALAAGLGLLNFVGAVFLGYLLADGAIAAELGGLVAFAQGIYWLL 332
>gi|425434699|ref|ZP_18815163.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9432]
gi|389675795|emb|CCH95120.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9432]
Length = 433
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/332 (37%), Positives = 197/332 (59%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++LK++ K Y IR+ FG LI SI+I+ AI+AIL D
Sbjct: 68 ILLNKYWQLKLKAFGSKIWQVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY RR R V+ D KMNF+++VFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYERRDREVKADKPPSKMNFLEAVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW++IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQIIGKVIQNNRGSIIAPQIAPYLDSITPSHEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T +++ V +EK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTTKERKLQT--------------VSPYLKEKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG++L +V +G L
Sbjct: 293 ILALGLGVVNFILALVLGSLLNSDSVDLSGSL 324
>gi|390441911|ref|ZP_10229939.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
gi|389834809|emb|CCI34065.1| conserved membrane hypothetical protein [Microcystis sp. T1-4]
Length = 434
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 195/332 (58%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L LAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLVLAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL + D
Sbjct: 68 ILLNKYWQLQLKAFASKIWQVVFYIIRISFGIVLIISILILLFAILAILIASMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY +R R V+ + KMNF++SVFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYEKRDREVKAEKPPSKMNFLESVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW+ IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQTIGKVIQNNRGSIIAPQIAPYLDSITPSQEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T +++ V REK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTTKQRKLQT--------------VSPYLREKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG +L +V +G L
Sbjct: 293 ILALGLGAVNFILALVLGYLLNSDSVDLSGSL 324
>gi|440684463|ref|YP_007159258.1| hypothetical protein Anacy_5008 [Anabaena cylindrica PCC 7122]
gi|428681582|gb|AFZ60348.1| hypothetical protein Anacy_5008 [Anabaena cylindrica PCC 7122]
Length = 423
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 188/344 (54%), Gaps = 29/344 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M +V+ RVT GDVA +AGL + EA K L ALA+D G L+V++ GD++Y FP N+R
Sbjct: 8 MRSVEQLGYRVTTGDVAAQAGLNVAEANKGLLALASDAGGHLQVAETGDIVYQFPQNFRT 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIAS---IVIVFTAIIAILSSKSDDDD 204
L K +L+++ K Y IR+ FG LI S I + I+ +S D+D+
Sbjct: 68 ILRNKYLQLRLQEWWKKVWGVLFYLIRISFGVFLILSIALITVTIIIIVTAANSDRDNDN 127
Query: 205 RGRRRRSFDSGFNIFISPSDLFWYWDPYY--YRRRRVQTDDDDKKMNFIKSVFSFVFGEG 262
RG R G N F P DLFWY+ P Y + R + ++ +NF ++VFSF+FG+G
Sbjct: 128 RGSGSR----GLNFFFFP-DLFWYFSPDYRSRSQERRRERGENSNLNFFEAVFSFLFGDG 182
Query: 263 DPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQP 319
+PN+ +EE+RW+ I I ++ G V AE++ PY+D E Y+LPV+LRF+GQP
Sbjct: 183 NPNENLEERRWQEIATVIRNHQGSVVAEQITPYMDHLGEKYQQEYEDYMLPVMLRFNGQP 242
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ +G I+Y FP Q A+ ++ R+ + E W FS +
Sbjct: 243 QVSPDGQIVYYFPELQVKASKKQ--RQS------------IAPYLEEFPWRFSAAGSGQI 288
Query: 380 GMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ GLG LN +ILG++L++ A G + FV I+ LL
Sbjct: 289 MLSAGLGVLNFVAALILGSLLRDGTAAAQIGGLVGFVQGIYWLL 332
>gi|443656397|ref|ZP_21131674.1| hypothetical protein C789_2214 [Microcystis aeruginosa DIANCHI905]
gi|159028305|emb|CAO87203.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333423|gb|ELS47984.1| hypothetical protein C789_2214 [Microcystis aeruginosa DIANCHI905]
Length = 433
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 124/332 (37%), Positives = 196/332 (59%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAKAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL D
Sbjct: 68 ILLNKYWQLQLKAFASKIWKVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY RR R V+ D KMNF+++VFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYERRDREVKADKPPSKMNFLEAVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW++IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQIIGKVIQNNRGSIIAPQIAPYLDSITPSHEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T +++ V +EK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTTKERKLQT--------------VSPYLKEKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG++L +V +G L
Sbjct: 293 ILALGLGVVNFILALVLGSLLNSDSVDLSGSL 324
>gi|425461017|ref|ZP_18840497.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440756613|ref|ZP_20935813.1| hypothetical protein O53_5021 [Microcystis aeruginosa TAIHU98]
gi|389826189|emb|CCI23481.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|440172642|gb|ELP52126.1| hypothetical protein O53_5021 [Microcystis aeruginosa TAIHU98]
Length = 434
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 124/332 (37%), Positives = 197/332 (59%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL D
Sbjct: 68 ILLNKYWQLQLKAFGSKIWKVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY RR R V+ D KMNF+++VFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYERRDREVKADKPPSKMNFLEAVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW++IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQIIGKVIQNNRGSIIAPQIAPYLDSITPSHEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T +++ V +EK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTTKERKLQT--------------VSPYLKEKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG++L +V +G L
Sbjct: 293 ILALGLGVVNFILALVLGSLLNSDSVDLSGSL 324
>gi|428221285|ref|YP_007105455.1| hypothetical protein Syn7502_01217 [Synechococcus sp. PCC 7502]
gi|427994625|gb|AFY73320.1| hypothetical protein Syn7502_01217 [Synechococcus sp. PCC 7502]
Length = 427
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 118/311 (37%), Positives = 175/311 (56%), Gaps = 30/311 (9%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ AV+ N RVTIGDVA ++GL +N AQ+ L ALA T G L+VS+ G+++YVF R
Sbjct: 8 ITAVEKLNYRVTIGDVAAQSGLDVNLAQRELLALATQTSGNLQVSETGEIVYVFSPQVRQ 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFT----AIIAILSSKSDDD 203
L +++F+L+++ + + Y IR+ FG L+ SI IV A+IA+ S SD+D
Sbjct: 68 ILWSRNFKLRLQAWLGQVWKWVFYLIRISFGILLVVSIAIVVIGIVLAVIALQSRSSDND 127
Query: 204 DRGRRRRSFDSGFNIFISPSDLFW------YWDPYYYRRRRVQTDDDDKKMNFIKSVFSF 257
+R R D GFN FI + FW + YY R+++ +D M F++S+FSF
Sbjct: 128 NRRSDDR--DGGFN-FI--PNFFWIDFGNIFAPSYYEHPDRIKSKANDSNMGFLESIFSF 182
Query: 258 VFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD--IDRTMSDESYVLPVLLRF 315
+FG+G+PN +EE+R + I I S+ GVV AE++APYLD D E Y++PVL +F
Sbjct: 183 LFGDGNPNYDLEERRSQEISALIRSHQGVVIAEQVAPYLDEIADTQEGFEDYMIPVLTKF 242
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G P++ E G + Y FP Q+ AA + +EK W+FSK
Sbjct: 243 NGLPQVTEIGTLAYSFPDLQKVAADRSKSNPS-------------NSFLQEKIWQFSKAG 289
Query: 376 MSERGMAIGLG 386
+ +AI LG
Sbjct: 290 AGKITLAIALG 300
>gi|427419595|ref|ZP_18909778.1| hypothetical protein Lepto7375DRAFT_5441 [Leptolyngbya sp. PCC
7375]
gi|425762308|gb|EKV03161.1| hypothetical protein Lepto7375DRAFT_5441 [Leptolyngbya sp. PCC
7375]
Length = 427
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 122/346 (35%), Positives = 194/346 (56%), Gaps = 29/346 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDVA K GL +++A+ + ALA++T ++VS+ GD+ Y F +RA
Sbjct: 8 MKAVETLRYRVTVGDVAAKTGLNISQAEAGVLALASETQAHMQVSETGDIAYEFSPQFRA 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAI---LSSKSDDDD 204
L + ++L+ + +K A Y IR+ FG LI ++V+VF AIIAI + S+ +DD+
Sbjct: 68 ILRNQYWQLRWQETWEKIWAVLFYLIRISFGLMLILAVVLVFAAIIAIQISMQSQREDDN 127
Query: 205 RGRRRRSFDSGFNIFISPSDLFWYWDPY-----YYRRRRVQTDDDDKK---MNFIKSVFS 256
RG S GFN ++ ++YW Y Y R+ + D +++ +NF++ V+S
Sbjct: 128 RG-GSYSGGIGFNPWLWIGRDWFYWFSYGPRRQYGRQYDYSSRDRNRQGSELNFLEGVYS 186
Query: 257 FVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSD-ESYVLPVLLRF 315
F+FG+GDPN+ ++E+RW I I +N G V+AE+LAPYL+ D E YV+P L RF
Sbjct: 187 FLFGDGDPNKDLDERRWGAIATVIRNNRGAVSAEQLAPYLENANLDDDLEDYVIPALSRF 246
Query: 316 DGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTN 375
+G+P + +G+++Y FP Q TA +Q+ V +E K F+ +
Sbjct: 247 NGRPNVSPQGDLVYYFPELQVTAKNQK--------------PQSVPAFLKEAKRRFTSAS 292
Query: 376 MSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIFPLL 421
++ +AIGLG L + G I L + + T GF+ V + L
Sbjct: 293 SNQVAIAIGLGTLLVVGSIYLSFIAADF--TDGGFVDLVYSLCDLF 336
>gi|425451176|ref|ZP_18830998.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|389767657|emb|CCI07015.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
7941]
Length = 434
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 124/332 (37%), Positives = 196/332 (59%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL D
Sbjct: 68 ILLNKYWQLQLKAFGSKIWKVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY RR R V+ D KMNF+++VFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYERRDREVKADKPPSKMNFLEAVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW++IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQIIGKVIQNNRGSIIAPQIAPYLDSITPSHEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T +++ V +EK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTTKERKLQT--------------VSPYLKEKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG++L +V G L
Sbjct: 293 ILALGLGVVNFILALVLGSLLNSDSVDLRGSL 324
>gi|119513583|ref|ZP_01632597.1| hypothetical protein N9414_10795 [Nodularia spumigena CCY9414]
gi|119461765|gb|EAW42788.1| hypothetical protein N9414_10795 [Nodularia spumigena CCY9414]
Length = 438
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 131/345 (37%), Positives = 186/345 (53%), Gaps = 25/345 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
M AV+ RVT+GDV+ +AGL L EA + L ALA+D G L+V++ GD++Y FP N+R
Sbjct: 8 MQAVEKLGYRVTVGDVSSQAGLNLAEAGQGLLALASDAGGHLQVAETGDIVYEFPRNFRD 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K F++++ K Y IR+ F LI SIV++ I I+++ + D D
Sbjct: 68 VLRNKYFQMRLREWWKKVWEVLFYLIRISFAIFLILSIVLITITIFIIVTASNSDRDGNS 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYYYRR------RRVQTDDDDKKMNFIKSVFSFVFGE 261
R S G F DLFWY+ P Y R +R Q + MNF ++VFSF+FG+
Sbjct: 128 RGSSSRGGGLFFFPFPDLFWYFSPNYRDRQQERQYQRHQQTSQESNMNFFEAVFSFLFGD 187
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---DRTMSDESYVLPVLLRFDGQ 318
G+PN +EE+RW+ IG I +N G V AE++APYLD T E Y+LPVL+RF+GQ
Sbjct: 188 GNPNANLEERRWQEIGAVIRNNQGAVVAEQIAPYLDNLGESYTQDYEDYMLPVLIRFNGQ 247
Query: 319 PEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSE 378
PE+ EG I+Y FP Q A++ + V E W FS + +
Sbjct: 248 PEVSPEGQIVYYFPELQ-VKATKTLHHS-------------VPMHLEEYPWRFSAASSGQ 293
Query: 379 RGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
++ GLG LN +ILG ++ + A G + FV I+ LL
Sbjct: 294 IMLSAGLGVLNFVAALILGGLIADGTAAAQLGGLVAFVEGIYWLL 338
>gi|425457609|ref|ZP_18837312.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9807]
gi|389800994|emb|CCI19785.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9807]
Length = 434
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 124/332 (37%), Positives = 196/332 (59%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL D
Sbjct: 68 ILLNKYWQLQLKAFGSKIWQVVFYIIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY RR R V+ D KMNF+++VFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYERRDREVKADKPPSKMNFLEAVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW+ IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQTIGKVIQNNRGSIIAPQIAPYLDSITPSHEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T +++ V +EK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTTKERKLQT--------------VSPYLKEKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG++L +V +G L
Sbjct: 293 ILALGLGVVNFILALVLGSLLNSDSVDLSGSL 324
>gi|443475414|ref|ZP_21065364.1| hypothetical protein Pse7429DRAFT_1004 [Pseudanabaena biceps PCC
7429]
gi|443019721|gb|ELS33769.1| hypothetical protein Pse7429DRAFT_1004 [Pseudanabaena biceps PCC
7429]
Length = 431
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 129/345 (37%), Positives = 196/345 (56%), Gaps = 29/345 (8%)
Query: 84 RNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPN 143
R M+AV+ N RVTIGDVA ++GL LN AQ+ + ALA++T G ++V++ G++ Y F
Sbjct: 4 RTAVMEAVEKLNYRVTIGDVAAQSGLSLNAAQREILALASETGGNIQVAESGEIAYKFAP 63
Query: 144 NYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVI----VFTAIIAILSSK 199
N+R L +SF L+V+ + Y+IR+ FG LI SI+I + A IA+ S
Sbjct: 64 NFRQILINRSFWLQVKEWLQGVWKWVFYAIRISFGILLILSILIVVVGIIAATIALQSQG 123
Query: 200 SDDDDRGRRRRSFDSGFNIFISP-----SDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSV 254
+D+DR RR G IF+ + F + P YY ++++ D D +MNF++SV
Sbjct: 124 RNDNDRNDRRSDNRGGGFIFLGGWGNPFGNPFIMFAPDYYEPQQLKQRDPD-EMNFLESV 182
Query: 255 FSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL-DIDRTMSDESYVLPVLL 313
FSF+FG+G+PN +EE+RW+ I I SN GVV AE++APYL DI +DE +V+PVL
Sbjct: 183 FSFLFGDGNPNADLEERRWREIAAMIRSNNGVVIAEQIAPYLDDITYKENDEYFVIPVLA 242
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
+F+G PE+ E G + Y+FP Q+ A+ ++ K +EK W+FS+
Sbjct: 243 KFNGFPEVSEAGTLAYKFPELQKVASERKAKTK--------------SAYLKEKVWQFSQ 288
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQE----MAVTPNGFLKFV 414
+ + ++IGLG L ++L +L + GFL V
Sbjct: 289 ASQGKITLSIGLGIFYLVSSLVLSNLLGNPRLSSVIASGGFLGLV 333
>gi|425443998|ref|ZP_18824059.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9443]
gi|389732042|emb|CCI03958.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9443]
Length = 434
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 196/332 (59%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL D
Sbjct: 68 ILLNKYWQLQLKAFASKIWKVVFYIIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY RR R V+ + KMNF+++VFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYERRDREVKAEKPPSKMNFLEAVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW+ IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQTIGKVIQNNRGSIIAPQIAPYLDSITPSHEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T +++ V +EK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTIKKRKLQT--------------VSPYLKEKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG++L +V +G L
Sbjct: 293 ILALGLGVVNFILALVLGSLLNSDSVDLSGSL 324
>gi|425472459|ref|ZP_18851300.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389881460|emb|CCI37992.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 434
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/332 (36%), Positives = 196/332 (59%), Gaps = 23/332 (6%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ +++ RVT+GDVA K+GL++N AQ+ L ALAA+ G L+V+D G+++Y+FP N+R+
Sbjct: 8 LQSIEKLGYRVTVGDVAAKSGLEINAAQQGLLALAAEAGGHLQVADTGEIIYLFPENFRS 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K ++L+++ K Y IR+ FG LI SI+I+ AI+AIL D
Sbjct: 68 ILLNKYWQLQLKAFASKIWKVVFYVIRISFGIVLIISILILLLAILAILIGLMFKDSDSD 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDP-----YYYRR-RRVQTDDDDKKMNFIKSVFSFVFGE 261
++ F P+D FW + P YY +R R V+ + KMNF++SVFSF+FG+
Sbjct: 128 NNSGNNNININFF-PTDFFWIFYPDFGNNYYEKRDREVKAEKPPSKMNFLESVFSFLFGD 186
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDES--YVLPVLLRFDGQP 319
GDPN +EE+RW+ IG+ I +N G + A ++APYLD +E+ Y+LPVL RF+G P
Sbjct: 187 GDPNFNLEERRWQTIGKVIQNNRGSIIAPQIAPYLDSITPSQEETEDYILPVLTRFNGLP 246
Query: 320 EIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSER 379
++ + G+I+Y FP Q T ++ + + +EK+W+FS+ + +
Sbjct: 247 QVSDRGDIIYYFPDLQVTTKERK--------------VQALSPYLKEKRWKFSEADSGQI 292
Query: 380 GMAIGLGGLNLFGVIILGAMLQEMAVTPNGFL 411
+A+GLG +N ++LG +L+ +V +G L
Sbjct: 293 ILALGLGVVNFILALVLGYLLKSDSVDLSGSL 324
>gi|428206235|ref|YP_007090588.1| hypothetical protein Chro_1192 [Chroococcidiopsis thermalis PCC
7203]
gi|428008156|gb|AFY86719.1| hypothetical protein Chro_1192 [Chroococcidiopsis thermalis PCC
7203]
Length = 429
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 116/309 (37%), Positives = 172/309 (55%), Gaps = 24/309 (7%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ AV+ RVT+GDVA + GL + +AQ+ L ALA+D G ++V++ G++ Y FP N R
Sbjct: 8 VQAVEQLGYRVTVGDVAAQIGLDVGQAQQGLLALASDVGGHMQVANSGEIAYQFPQNLRG 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
L K +RL+++ K Y IR+ FG L+ SIV+VF I I+++ + D D
Sbjct: 68 VLRNKYWRLRLQEWWQKIWKVLFYLIRISFGIVLLVSIVLVFLTIAIIITATNRDGDDRG 127
Query: 208 RRRSFDSGFNIFISPSDLFWYWDPYY---YRRRRVQTDDDDKKMNFIKSVFSFVFGEGDP 264
R F D FW + P Y YR+RR +T + +NF++++FSF+FG+G+P
Sbjct: 128 DRGGGGFYMPYFWIGPDWFWVFYPDYDTRYRQRRRETSN----LNFLEAIFSFLFGDGNP 183
Query: 265 NQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLPVLLRFDGQPEI 321
N +E+KR+ LI I +N GVVTAE++APYLD E Y+LPVL RF+GQP +
Sbjct: 184 NIDLEDKRFSLIAATIRNNRGVVTAEQIAPYLDDLGEGYAQEYEDYMLPVLTRFNGQPLV 243
Query: 322 DEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSKTNMSERGM 381
EG+++Y FP Q +A Q+ R V +E + FS + + +
Sbjct: 244 SPEGHLVYHFPELQVSATQQQSRR--------------VPAFLQELPYRFSAASSGQIML 289
Query: 382 AIGLGGLNL 390
IGLG LN
Sbjct: 290 GIGLGALNF 298
>gi|254421842|ref|ZP_05035560.1| hypothetical protein S7335_1992 [Synechococcus sp. PCC 7335]
gi|196189331|gb|EDX84295.1| hypothetical protein S7335_1992 [Synechococcus sp. PCC 7335]
Length = 445
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 130/369 (35%), Positives = 199/369 (53%), Gaps = 48/369 (13%)
Query: 82 DVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVF 141
++ M AV++ + RVT+GDVA KAGL + AQ+ L ALA++T G ++VS+ G++ Y F
Sbjct: 2 ELNKEVMQAVESLDYRVTVGDVATKAGLNVEVAQQGLLALASETQGHMQVSETGEIAYEF 61
Query: 142 PNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL---SS 198
P N+R L K ++L+ + + K +A Y IR+ FG L+ SI I+ AII ++ SS
Sbjct: 62 PRNFRGVLRNKYWQLRAQETLSKVWSALFYVIRISFGIMLMVSIAIISIAIIVLVIAASS 121
Query: 199 KSDDDDRGRRRRSFDSGFNIFISPSDLFWYWD-----------------------PYYYR 235
+ + R RR GF F P++ F+ +D Y
Sbjct: 122 QGGGNSRDNRRGG--GGFVFF--PTNFFYLFDFNYGRGRYGRGRGRYPNRGTSRYGGSYS 177
Query: 236 RRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPY 295
R ++ +K+ F++++FSF+FG+GDPN+ +EE+RW+ IG I +NGG + AE++ PY
Sbjct: 178 GGRGRSSPSGEKLPFLEAIFSFLFGDGDPNEDLEERRWQAIGSVITNNGGAIAAEQVTPY 237
Query: 296 LDIDRTMSD---ESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRR 352
LD SD E Y+LPVL RF+GQPE+ G+++Y FP Q +A QR
Sbjct: 238 LDDLGEGSDREYEDYMLPVLTRFNGQPEVSPTGDMVYYFPELQ-VSAMQR---------- 286
Query: 353 WADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLK 412
V +E K +F+ + ++I LG LNL G ++L +LQ V F+
Sbjct: 287 ---GKAAVSAYLKETKRKFTSATSDQVMISIALGALNLVGALVLWNLLQTATVDIE-FIA 342
Query: 413 FVAYIFPLL 421
FV I+ LL
Sbjct: 343 FVESIYWLL 351
>gi|291572170|dbj|BAI94442.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 451
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 189/350 (54%), Gaps = 30/350 (8%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
++AV+ RV+ DVA +AGL++N AQ+ L ALA++ G L+V++ GD+ ++FP N+RA
Sbjct: 14 IEAVEKLGYRVSSADVAIQAGLEVNLAQQQLLALASEAGGHLQVAESGDIAFLFPQNFRA 73
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGR 207
+ + +RL+V K Y IR+ FG L+ SIV++F I AIL + + G
Sbjct: 74 VMRNRYWRLRVYEWWQKVWRILFYLIRISFGIMLLVSIVLIFITIAAILFFGDSNRNGGG 133
Query: 208 RRRSFDSGFNIFISPSDLFWY---------WD--PYYYRRRRVQTDDDDKKMNFIKSVFS 256
+ FW+ WD YY +++++ +KMNF + VFS
Sbjct: 134 GGGGGGGSRGRGLIFIPHFWFHPNSFSFLSWDNGNYYRQQQKITKKQKKEKMNFFEVVFS 193
Query: 257 FVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD--ESYVLPVLL 313
F+FG+G+PN +E +RW+ I I +N G V AE++APYLD + S E Y+LPVL
Sbjct: 194 FLFGDGNPNYNLEAQRWQEIATVIRNNQGAVVAEQIAPYLDNLGNAYSQEFEDYMLPVLT 253
Query: 314 RFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEFSK 373
RF+GQPE+ +G ++Y FP Q TA R V +E W+FS
Sbjct: 254 RFNGQPEVSPQGQLVYHFPELQTTATKYR--------------PQPVPAYLKENNWKFSN 299
Query: 374 TNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
++ +A GLG +N G +ILG +L++ A+ G + FV I+PLL
Sbjct: 300 ATSNQLMLAAGLGAVNFVGALILGHLLEDGAIAAQMGGLVAFVEIIYPLL 349
>gi|428164099|gb|EKX33139.1| hypothetical protein GUITHDRAFT_98431 [Guillardia theta CCMP2712]
Length = 555
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 153/457 (33%), Positives = 234/457 (51%), Gaps = 71/457 (15%)
Query: 17 FFTPLRPSINLKPPDSFPRIQPLPFPRISGKIPGSRVLVPVAKASTDVAVGVGPGRIVES 76
F +PL I+ +P R Q P + G R + AS+ A+GV E
Sbjct: 22 FLSPLSLGIS-RPQRGLSRCQRAPGGSWT---RGGRSRL---HASSRGALGV----FSEP 70
Query: 77 DKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGD 136
D P ++ + AVD RVT+ DVA AG+ LNEA+K L LA DG LEVS +G+
Sbjct: 71 DPPPENI----LKAVDRAGGRVTVADVATSAGVSLNEAKKQLNVLAQLADGNLEVSKDGE 126
Query: 137 VLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL 196
++YVF +R +L ++S + +++ DK Y ++V FG ALI+SI ++FTAI+ +L
Sbjct: 127 LVYVFSAGFRNELLSRSAKKRIQQGWDKVAPILFYMVKVSFGIALISSIALIFTAIL-VL 185
Query: 197 SSKSDDDDRGRRRRSFDSGFNIFISPSDLFWY----WDPYYYRRRR---VQTDDDDKKMN 249
S S DD RR S GF+ ++ PS +W+ +D +Y+ V D+ +++
Sbjct: 186 QSSSRDDRDDRRGYSSGGGFSFYMGPS--YWWGPSPFDIFYFSSSSPYGVNRYDNPSQLS 243
Query: 250 FIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI-----DRTMSD 304
F++SVFSF+FG+G+PN E++R+K + I + GGVVTAEE+A YLD + + D
Sbjct: 244 FLESVFSFLFGDGNPNANFEQRRYKALANLIRAKGGVVTAEEVAGYLDPPAEPDEDKLVD 303
Query: 305 ESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAAS--------------QRIGRKEYVG 350
ES+VLP L + G PE+ E G+I+Y F Q +AA+ ++ + E +
Sbjct: 304 ESFVLPALTQLGGMPEVTESGDIVYVFEDLQLSAAAANDQGWEKLSLQELTKLAKSEGIS 363
Query: 351 RRWA---DAIGGVEKIFR-------------------EKKWEFSKTNMSERGMAIG-LGG 387
R D + V + R EK ++FS ++ +A+G L
Sbjct: 364 TRGVYEKDDLLAVIRAARETRSVKSNQEGDRVATYLDEKPFQFSLATSGQQ-LAVGALAA 422
Query: 388 LNLFGVIILGAMLQEMAVTPN---GFLKFVAYIFPLL 421
+NLFG + LG + + G L FV I+PLL
Sbjct: 423 VNLFGALYLGNLFASPYLVGRELIGLLGFVKAIYPLL 459
>gi|165968251|gb|ABY75935.1| At5g03900-like protein [Arabidopsis lyrata]
Length = 198
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 103/148 (69%), Positives = 123/148 (83%), Gaps = 2/148 (1%)
Query: 55 VPVAKAST--DVAVGVGPGRIVESDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLN 112
+ V KA++ V+ + PG +VESDKLP DVR RAMDAVD C RRVT+GDVA +AGLK+
Sbjct: 51 IAVVKAASLDKVSGAIKPGGLVESDKLPTDVRKRAMDAVDECGRRVTVGDVASRAGLKVT 110
Query: 113 EAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYS 172
EAQ ALQALAADTDGFLEVSDEGDVLYVFP +YR KLAAKS R+++EP ++KAK A +Y
Sbjct: 111 EAQTALQALAADTDGFLEVSDEGDVLYVFPRDYRTKLAAKSLRIQIEPYLEKAKGAIDYL 170
Query: 173 IRVLFGTALIASIVIVFTAIIAILSSKS 200
RV FGTALIASIVIV+T+IIA+LSS+S
Sbjct: 171 ARVSFGTALIASIVIVYTSIIALLSSRS 198
>gi|434389186|ref|YP_007099797.1| hypothetical protein Cha6605_5383 [Chamaesiphon minutus PCC 6605]
gi|428020176|gb|AFY96270.1| hypothetical protein Cha6605_5383 [Chamaesiphon minutus PCC 6605]
Length = 421
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 117/334 (35%), Positives = 174/334 (52%), Gaps = 27/334 (8%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNY 145
+ M AV+ RVT GDVA +AG+ L +A+ L ALA T G L+V+D G+V+YVF N+
Sbjct: 6 QLMTAVEQLGYRVTSGDVATQAGIHLEQARSGLLALANRTGGHLQVTDSGEVIYVFAPNF 65
Query: 146 RAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDR 205
R L KS +L++ +D+ Y +++ FG LI SI V+ I+AI + +D
Sbjct: 66 RQILLRKSVKLQIRAWLDRLWTIGFYLVKISFGILLITSISAVYLTILAITLAALFSNDS 125
Query: 206 GRRRRSFDSGFNIF----ISPSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGE 261
G + IF S S+ D ++ K +NF+++VFS +FG+
Sbjct: 126 GGDCGDGNCVLAIFDWGGNSSSNSNSSIDQNISSLAPIRAKTQRKPLNFLEAVFSVLFGD 185
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-----IDR------TMSDESYVLP 310
G+PN +E++RW+ I I GV E++ PYLD DR +E Y+LP
Sbjct: 186 GNPNADLEQRRWRYIANLIYHQQGVAIGEQILPYLDNIGAEFDRRAFLPENAGNEDYMLP 245
Query: 311 VLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWE 370
VL +F+G PE+ G ++Y FP Q T K+ G V + RE+KW+
Sbjct: 246 VLTKFNGIPEVSPTGQLVYHFPDLQTTL-------KDEPGLN-----NRVPQSLRERKWK 293
Query: 371 FSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMA 404
F+K + G IGL LNL G+IILG ML+ ++
Sbjct: 294 FTKATPEQTGWTIGLFALNLVGIIILGLMLRGIS 327
>gi|428217636|ref|YP_007102101.1| hypothetical protein Pse7367_1381 [Pseudanabaena sp. PCC 7367]
gi|427989418|gb|AFY69673.1| hypothetical protein Pse7367_1381 [Pseudanabaena sp. PCC 7367]
Length = 433
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 194/352 (55%), Gaps = 29/352 (8%)
Query: 84 RNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPN 143
R M A++ + RVT GD+A ++GL L A L +LA+DT ++VS+ GD+ + F
Sbjct: 4 RQATMQAIEKLDYRVTPGDIAAQSGLNLEIANSQLLSLASDTSAHMQVSEAGDLAFEFDR 63
Query: 144 NYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTA------IIAILS 197
+++ L A+SF+L+++ + K Y RV FG LI +IVIV A +++
Sbjct: 64 SFKNTLLARSFKLRLQEWLSKVWKWLFYLFRVSFGILLIVAIVIVVLAIIVAWQVLSRSG 123
Query: 198 SKSDDDDRGRRRRSFDSGFNIF--ISPSDLFWYWDPYYYR--RRRVQTDDDDKKMNFIKS 253
++ R R R F F P D + +DP YYR RRRV++ + D +M +++S
Sbjct: 124 DNNNGGGRRRSSRGGGMNFFFFPRFYPWDFWAVFDPNYYRPSRRRVRSAEQD-EMGYLES 182
Query: 254 VFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL-DIDRTMSD-ESYVLPV 311
V+SF+FG+GDPN +EE++W+ I I + GVVTAE++ P+L DI E Y++PV
Sbjct: 183 VYSFLFGDGDPNYDLEERKWQSIASIIRKHDGVVTAEQITPFLGDIGFNAEGYEDYMVPV 242
Query: 312 LLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWEF 371
L RF+GQP + +EG ++Y FP Q T AS+R K +E+KW+F
Sbjct: 243 LARFNGQPHVSDEGTLVYSFPELQ-TVASERKSNSN-------------PKYLQEEKWQF 288
Query: 372 SKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPN--GFLKFVAYIFPLL 421
S+ + + + + +G L G + LG +L A+ G+L FV I+ LL
Sbjct: 289 SQASSGKITLIVVMGIAYLIGTVYLGVLLGNPALAGELAGYLGFVESIYGLL 340
>gi|219116903|ref|XP_002179246.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409137|gb|EEC49069.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 357
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/335 (34%), Positives = 188/335 (56%), Gaps = 11/335 (3%)
Query: 77 DKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGD 136
+KLP+ +D V +V DVA AG+ L++A+K L ALA+ + G + V +G+
Sbjct: 9 EKLPSKA---VIDVVSMKPEKVVASDVATAAGISLSQARKDLTALASISRGDISVDKDGE 65
Query: 137 VLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAI--IA 194
++Y FP + + LA+ S + + + K A + +RV FG L+ S+V VFT I I
Sbjct: 66 LIYSFPRDLNSVLASNSVKYQTLQLARKVWPAVFWGVRVSFGVTLLVSLVAVFTTIFFIT 125
Query: 195 ILSSKSDDDDRGRRRRSFDSGFNIFISPSDL-FWYWDPYYYRRRRVQTDDDDKKMNFIKS 253
SS +DD R R G PS L F+++ PY Q + D ++M F +S
Sbjct: 126 SSSSNNDDRRRDDRGGGMSFGMGGMWGPSPLDFFFYRPYGSYGYYGQPERDPEEMGFFES 185
Query: 254 VFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD---IDRTMSDESYVLP 310
VFS++FG+GDPN G+EEKR L+ I N G VTAE+LAPY D + ++ +++VLP
Sbjct: 186 VFSYIFGDGDPNAGLEEKRLGLVASMIRENKGAVTAEQLAPYCDGAPNPQELASKTFVLP 245
Query: 311 VLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKKWE 370
++ +G+P + E+G+I+Y FP Q +AA+ ++ + + +G + +E++W+
Sbjct: 246 IVTALNGEPRVTEDGSIVYTFPELQMSAATVKV--IPAASKEEMEMMGQNPALLQEREWK 303
Query: 371 FSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAV 405
FS R +A GLG +NL G + LG +L + A+
Sbjct: 304 FSLAPEINRFLAGGLGVVNLGGALYLGNLLGQYAM 338
>gi|452823986|gb|EME30992.1| hypothetical protein Gasu_17540 [Galdieria sulphuraria]
Length = 523
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 107/335 (31%), Positives = 175/335 (52%), Gaps = 47/335 (14%)
Query: 97 RVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSFRL 156
R T D A +G+ ++ A K L +A++ +G L+VS++G++LY P N+ A L KS
Sbjct: 90 RFTAADFAVASGISVDNATKQLLRIASEVEGALDVSEQGEILYRIPVNFEAILQRKSMLN 149
Query: 157 KVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAI--LSSKSDDDDRGRRRRSFDS 214
++ + +A Y +R+ FG L+ S++IV AII + L+S+ ++D+R SF S
Sbjct: 150 SLQKLWQPVWSALFYLVRISFGIFLLVSLLIVLGAIIFVNSLNSRQNEDNRS---SSFHS 206
Query: 215 -GFNIFISPSDLFWYW-----------------DPYYYRRRRVQTDDDDKK--MNFIKSV 254
F I S SDLF+++ P +RR + + K F+++
Sbjct: 207 PRFYIGPSISDLFYFFRGPSYYNYHYYQYETDNTPSENGKRRTKENQSQKDGPKGFLEAA 266
Query: 255 FSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYL---------DIDRTMSDE 305
+SF+FG+GDPN E ++WK + I +N VVTAE+LAPYL D + + +E
Sbjct: 267 YSFLFGDGDPNADFETRKWKAVAFVIRANDCVVTAEQLAPYLLCSHDQATSDENSILVNE 326
Query: 306 SYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFR 365
S+++P L+RF G P++ E G I+Y FP ++TA + +I I +
Sbjct: 327 SFMIPALVRFGGIPQVLENGEIIYVFPELRKTAKNVQI-------------ISKLPSFPV 373
Query: 366 EKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAML 400
EK+ FS+ + ++ M + LNL G + LG M
Sbjct: 374 EKEIPFSRASSTDLWMVALIASLNLVGSLTLGNMF 408
>gi|224008062|ref|XP_002292990.1| hypothetical protein THAPSDRAFT_263883 [Thalassiosira pseudonana
CCMP1335]
gi|220971116|gb|EED89451.1| hypothetical protein THAPSDRAFT_263883 [Thalassiosira pseudonana
CCMP1335]
Length = 472
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 129/369 (34%), Positives = 198/369 (53%), Gaps = 28/369 (7%)
Query: 77 DKLPADVRNRAMDAVDACNRRVTIG-DVAGKAGLKLNEAQKALQALAADTDGFLEVSDEG 135
++LP+ + +DAV+ N I D+A KAG+ L++A+K L LA+ T G + VS +G
Sbjct: 2 ERLPS---KKVIDAVEKSNGSPIIASDLATKAGISLSQARKDLTTLASLTRGDIAVSSDG 58
Query: 136 DVLYVFPNNYRAKLAAKSFRLK-VEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIA 194
D+LY FP+N L+ S + + + +K Y+ +V FG L+AS+V VF+ I
Sbjct: 59 DLLYTFPSNINGVLSTNSAKYRALSTWNEKIVPPLFYATKVGFGVVLVASLVAVFSTIFF 118
Query: 195 ILSSKSDDDDRGRRRRSFDSGFNIFI----SPSDLFWYWDPY--------YYRRRRVQTD 242
S S DDD R RR SP D F+Y Y Y R R Q
Sbjct: 119 ATSGISRDDDDRRERRGGGMPMGFGGFWGPSPFDFFYYRPYYSRYYYSPAYDTRGRQQRS 178
Query: 243 DDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD----I 298
D +M F++SVFS+VFG+G+PN +EE+R L+ E I SNGG V+AE+LAP+ D
Sbjct: 179 QDPDEMGFLESVFSYVFGDGNPNGDVEERRIALVAEMIRSNGGAVSAEQLAPFCDDVPMP 238
Query: 299 DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAA---SQRIGRKEYVGRRWAD 355
+R DES+VLP + + +G+P++ E+G+I+Y FP +A+ + + KE R
Sbjct: 239 NRAYVDESFVLPFVTQLNGEPQVTEDGDIVYIFPELMASASKSPTSSMDSKEMARMRRES 298
Query: 356 AIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMA---VTPNGFLK 412
+ + E++++FS + + +A LG +NL G + LG +L + A V ++
Sbjct: 299 HVDD-PTLLLEREYKFSLASSFQTVLAGLLGVVNLGGALYLGNILGQYALYGVRLPSYMG 357
Query: 413 FVAYIFPLL 421
V FPLL
Sbjct: 358 LVQQFFPLL 366
>gi|299115766|emb|CBN74331.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 550
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 115/344 (33%), Positives = 173/344 (50%), Gaps = 35/344 (10%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNY 145
R + AV+ R DVA AG+ LN A++ L LA G LEVS +GDV+Y FP+N+
Sbjct: 23 RILKAVEKAGNRAVPSDVAALAGVDLNVAKRGLVNLANIVGGDLEVSKDGDVVYNFPSNF 82
Query: 146 RAKLAAKS-FRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAII--AILSSKSD- 201
RA L +KS +R VE V A Y IRV FG L+ASI ++F+ I A ++ S
Sbjct: 83 RASLLSKSAYRRGVEAV-RAAWPVIFYGIRVSFGVVLLASIALIFSTIFFAATYANSSSD 141
Query: 202 --------DDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRR-----RVQTDDDDKKM 248
GFN + PS +++ YY R D ++M
Sbjct: 142 DNRDRRRGGGGFDGGYGGRGIGFNTYFGPSPFDFFYYRPYYGYYGTPVGRGGQRGDPEEM 201
Query: 249 NFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDID------RTM 302
++S+FSF+FG+GDPN ++++R + I NGG V+AE+LAP L + ++
Sbjct: 202 GILESIFSFIFGDGDPNADMDQRRVEAAATLIRGNGGAVSAEQLAPLLTPEALRSSDSSV 261
Query: 303 SDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTA-----ASQRIGRKEYV------GR 351
DES+VLP+L DG PE+ +G+I+Y FPS Q +A A+ +IG KE
Sbjct: 262 VDESFVLPILTALDGAPEVTADGDIVYVFPSLQTSAMGTGTAATQIGSKESARLAGLKSL 321
Query: 352 RWADAIGGVEKIFREKKWEFSKTNMSERGMAIGLGGLNLFGVII 395
AD +E + F ++ + + MA G+ L+ +++
Sbjct: 322 STADLRVALEAVGVRAMSMFERSELIDAAMAAGIAKLDNGNIVL 365
>gi|397576264|gb|EJK50155.1| hypothetical protein THAOC_30900, partial [Thalassiosira oceanica]
Length = 662
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 100/296 (33%), Positives = 160/296 (54%), Gaps = 32/296 (10%)
Query: 68 VGPGRIVESDKLPADVRNRAMDAVDA-CNRRVTIGDVAGKAGLKLNEAQKALQALAADTD 126
+ P + + +KLP+ + +DA ++ R + D+A KAG+ +++A+K L ALA T
Sbjct: 114 IAPFQKILEEKLPS---QKIIDAAESFGGRPIVASDLAAKAGVSISQARKDLTALATLTR 170
Query: 127 GFLEVSDEGDVLYVFPNNYRAKLAAKSFRLK-VEPVIDKAKAAAEYSIRVLFGTALIASI 185
G + VS++GD++Y FP + + +A+ S + + + DK K Y+ +V FG AL+AS+
Sbjct: 171 GDIAVSNDGDLIYTFPRDIGSVIASTSAKYRALSTWNDKLKEPLFYATKVGFGVALLASL 230
Query: 186 VIVFTAIIAI-------LSSKSDDDDRGRRRRSFDSGFNIFISPS--DLFWYWDPYYYRR 236
V +++ I I + D G GF F PS D F+Y Y
Sbjct: 231 VAIYSTIFFIGSSSSSDRDDRDDRRGYGGGGGGMPMGFGGFWGPSPFDFFYYRPYYSRYY 290
Query: 237 RRVQTDD-----DDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEE 291
D D +M F++SVFS+VFG+G+PN +EE+R L E I NGG VTAE+
Sbjct: 291 YSPAYDTGGRRRDPDEMGFLESVFSYVFGDGNPNGDVEERRLGLAAEMIRMNGGAVTAEQ 350
Query: 292 LAPYLD-------------IDRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSF 334
LAP++D +R DE++VLP++ + DG+P++ E+G+I+Y FP
Sbjct: 351 LAPFVDEAPAPLDADGRLGEERAYVDEAFVLPIVTQLDGEPQVTEDGDIVYTFPEL 406
>gi|427703002|ref|YP_007046224.1| hypothetical protein Cyagr_1735 [Cyanobium gracile PCC 6307]
gi|427346170|gb|AFY28883.1| hypothetical protein Cyagr_1735 [Cyanobium gracile PCC 6307]
Length = 435
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 174/374 (46%), Gaps = 86/374 (22%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+D + RR ++G+VA GL L E + L AL AD G L+V+++G +L+VFP R
Sbjct: 8 LDWIGNRGRRCSVGEVAAGTGLSLQEVEPGLLALVADVSGRLQVAEDGTLLFVFPPALRM 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGT---------------------------- 179
+L A S R +++ + + A IR+ FG
Sbjct: 68 RLLALSGRRRLQARLQASLRLAARLIRLSFGLVLVLVTTLVVVILSVLLIVRLFTSDDAD 127
Query: 180 ----ALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLF-WYWDPYYY 234
AL+ + V +++ +L+S R R+ S +SP L+ WD
Sbjct: 128 DAGLALLQGLAQVPLSLLDLLASGL----RAPARQGSAS-----VSPQQLWSLLWD---- 174
Query: 235 RRRRVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAP 294
D + F+ +VFS +FG+GDPN +E RW+ I ++ GGVV AE+LAP
Sbjct: 175 ---LPGEGADPASLGFLSAVFSILFGDGDPNARLEPLRWRRIAAFLRQRGGVVIAEDLAP 231
Query: 295 YLDIDRTMSD--------ESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRK 346
LD+ SD ++ +LPVLLRFDG+PE+ E+G+++Y FPS AA
Sbjct: 232 LLDLPDCPSDPDRRRDLADAAMLPVLLRFDGRPEVSEDGDLIYGFPSLPARAA------- 284
Query: 347 EYVGRRWADAIGGVEKIFREKKWEFSKTNMSER---GMAIGLGGLNLFGVIILGAMLQEM 403
+ G RE+ + FS+ +R G+A+G +++L L +
Sbjct: 285 ---------GVDGPVPPLRERAFRFSRAGAGQRIAYGIAVG-------ALLVLSPWL--L 326
Query: 404 AVTPNGFLKFVAYI 417
A++P +L VA++
Sbjct: 327 AISP-AWLPPVAWL 339
>gi|376005490|ref|ZP_09782991.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375326133|emb|CCE18744.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 267
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 101/179 (56%), Gaps = 19/179 (10%)
Query: 248 MNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLD-IDRTMSD-- 304
MNF + VFSF+FG+G+PN +E +RW+ I I +N G V AE++APYLD + S
Sbjct: 1 MNFFEVVFSFLFGDGNPNYNLEAQRWQAIATVIRNNQGAVVAEQIAPYLDNLGNAYSQEF 60
Query: 305 ESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIF 364
E Y+LPVL RF+GQPE+ EG ++Y FP Q TA R V
Sbjct: 61 EDYMLPVLTRFNGQPEVSPEGQLVYHFPELQTTATQYRPQP--------------VPAYL 106
Query: 365 REKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQE--MAVTPNGFLKFVAYIFPLL 421
+E W+FS ++ +A GLG +N G +ILG +L++ MA G + FV I+PLL
Sbjct: 107 KENNWKFSNATSNQLMLAAGLGAVNFVGALILGHLLEDGAMAAQMGGLVAFVEMIYPLL 165
>gi|339500230|ref|YP_004698265.1| hypothetical protein Spica_1614 [Spirochaeta caldaria DSM 7334]
gi|338834579|gb|AEJ19757.1| hypothetical protein Spica_1614 [Spirochaeta caldaria DSM 7334]
Length = 503
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 86/359 (23%), Positives = 155/359 (43%), Gaps = 42/359 (11%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADT-DGFLEVSDEGDVLYVFPNN 144
+ +D + + + +TI D+ K L L E K L +A+D G L+V++ G++LY FP
Sbjct: 13 KIVDVLKSQRQGITIADITAKTALPL-ETVKELVVIASDEFSGRLQVTESGEILYSFPRG 71
Query: 145 YRAKLAAK--SFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFT-AIIAILSS--- 198
+++K +F+ V+ + AK + + L+ ++ A++A+L+S
Sbjct: 72 FQSKYRGPMVAFKKLVKALKKGAKTVGTWLFKTWIMLMLVGYFILFMAIALVALLASTVI 131
Query: 199 --KSDDDDRGRRRRSFDSGFNIFISPSDLF------WYWDP---------YYYRRRRVQT 241
D+R RR + G F + S + W++ Y Y R +
Sbjct: 132 SVSGSSDNRSNSRR--NDGIGSFYAVSYILDLIIRIWFYSEVSKSIDRSVYGYNYRAAK- 188
Query: 242 DDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRT 301
K +VFSFVFG+GDPN EE+ K + YI +N G+++ E L
Sbjct: 189 ---PKGRPLHHAVFSFVFGDGDPNADWEERERKAVISYIQANKGIISLPEFM-ILTGAAP 244
Query: 302 MSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVE 361
E + + F G PE E+G +LYRF + + +Q R +A ++
Sbjct: 245 QDAEQKISRYCVEFGGMPEASEDGTVLYRFEALLLRSDTQ--------DRSFAGLSAPIK 296
Query: 362 K--IFREKKWEFSKTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIF 418
+ +F + + + G+ + G L+ LG ++ V + +L +A+I
Sbjct: 297 RLQVFSKNSKKMNSWFAIINGVNLVFGSYFLYFASTLGPIVSNTQVRGSSYLYAIAFIL 355
>gi|255077187|ref|XP_002502242.1| predicted protein [Micromonas sp. RCC299]
gi|226517507|gb|ACO63500.1| predicted protein [Micromonas sp. RCC299]
Length = 506
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 66/128 (51%), Gaps = 35/128 (27%)
Query: 243 DDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---- 298
D +++++FI+SVF+FVFG GDPN+ +E +RW+ +G + +N GVV AE+LAP+LD
Sbjct: 301 DKNRELSFIESVFAFVFGRGDPNEDLEHRRWRAVGLLLRANQGVVYAEQLAPFLDSYLLR 360
Query: 299 -------------------------------DRTMSDESYVLPVLLRFDGQPEIDEEGNI 327
D + E YVL L +F G E E+G +
Sbjct: 361 DHRGGGLDALAATVRNIIRNIIRRKEDDARGDASRMHEGYVLEALAKFGGHAESSEDGRL 420
Query: 328 LYRFPSFQ 335
+Y FP+ Q
Sbjct: 421 VYVFPALQ 428
>gi|333994649|ref|YP_004527262.1| hypothetical protein TREAZ_0585 [Treponema azotonutricium ZAS-9]
gi|333735236|gb|AEF81185.1| conserved hypothetical protein [Treponema azotonutricium ZAS-9]
Length = 510
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 117/265 (44%), Gaps = 26/265 (9%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNY 145
+ ++A + T+ D+A L L ++ + A + G LEV++ G++LY FP +
Sbjct: 13 KIVEAFKGKRKGATVADIAAVTALPLYTVKELVPVAADEFSGRLEVTESGEILYSFPRGF 72
Query: 146 --RAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAIL------S 197
R K A + E +I A +V LI ++ +A L +
Sbjct: 73 ASRFKGLAPFLKKTSEKLIHFLGLAGTGIFKVWIMVMLIGYFLVFLAIALASLFLSMAAN 132
Query: 198 SKSDDDDRGRRRRSF--DSGFNIFISPSDLFWYW-------DPYYYRRRRVQTDDDDKKM 248
S++++D R + S FN+ I W++ +P Y R + + K
Sbjct: 133 SRNNNDSRSGHGGMYLGSSIFNLIIR----LWFYSELTKSVNPRYGRYGGFENNSRPKGR 188
Query: 249 NFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSD--ES 306
K++FSFVFG+GDPN + K Y+ + GVV+ EL + I SD ES
Sbjct: 189 PLHKAIFSFVFGDGDPNAQWPTEEKKTFIAYVQEHRGVVSLPEL---MAISGIPSDRAES 245
Query: 307 YVLPVLLRFDGQPEIDEEGNILYRF 331
+ + F G PE E+G ++YRF
Sbjct: 246 EITALCAEFGGSPEATEDGTVVYRF 270
>gi|383791992|ref|YP_005476566.1| hypothetical protein [Spirochaeta africana DSM 8902]
gi|383108526|gb|AFG38859.1| hypothetical protein Spiaf_2835 [Spirochaeta africana DSM 8902]
Length = 531
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 168/399 (42%), Gaps = 79/399 (19%)
Query: 83 VRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFP 142
V +R + A R T+ D+ GL + ++ L + D G L V++ G++LY FP
Sbjct: 13 VESRLVQAFRKRGREATVADLIAATGLPRLQIEETLPQVVRDYRGHLRVTESGEILYHFP 72
Query: 143 NN-YRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAII-------A 194
+ + + + ++ ++ I + AA + L+ ++ ++F A++
Sbjct: 73 HGVHHRERSPRARAARLLRRIGRGAAAVGKLVFKLWIMLMLVGYFVLFVALVIAALMASL 132
Query: 195 ILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYW---------------DPYYYRRRRV 239
S+KS+ RG RS + + +FW W R
Sbjct: 133 AASAKSEGRSRG---RSIGGTYMMTRLIQQVFWLWMFSGRRRGYGYRGAAIGSGIGGSRT 189
Query: 240 QTDDDDKKMN--------FIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEE 291
+++ D + +SVF+FVFG DP + E + + E + ++ GVVT +E
Sbjct: 190 RSNGLDWSASRSEGDGPPLHQSVFAFVFGSDDPAREWETRERQAFVELVQTHKGVVTLDE 249
Query: 292 LAPYLDIDRTMS----DESYVL--PVLLRFDGQPEIDEEGNILYRFPSFQRTA--ASQRI 343
L R MS DE++ L +LL +DG+P++ E+G+++YRFP RTA + +R
Sbjct: 250 L-------RAMSGRSLDEAHDLMNRMLLEYDGEPDVTEQGSLIYRFPELLRTAEPSDRRG 302
Query: 344 GRKEYV---------------GRRWADAIGGVEKIFREKKWEFSKTNMSERGMAIGL--G 386
GR + + RW + GV F F G+A GL
Sbjct: 303 GRIDRLLPAMPTVPFNDNSPGLNRWIGVLNGVNLGFGAYFTGF--------GLAGGLPEE 354
Query: 387 GLNLFGVIILGAMLQEMAVTPNGF----LKFVAYIFPLL 421
G LF I++ + E+A P G L V +F LL
Sbjct: 355 GFGLFYSIVV-VLFSEIAANPLGLVLIGLGVVPLVFSLL 392
>gi|308813161|ref|XP_003083887.1| Mitochondrial Fe-S cluster biosynthesis protein ISA2 (contains a
HesB-like domain) (ISS) [Ostreococcus tauri]
gi|116055769|emb|CAL57854.1| Mitochondrial Fe-S cluster biosynthesis protein ISA2 (contains a
HesB-like domain) (ISS) [Ostreococcus tauri]
Length = 597
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 46/198 (23%)
Query: 243 DDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---- 298
D +++ F++S+F+FVFG GDPN+ +E +RW+ +G + N G V AE++AP+LD
Sbjct: 225 DRGRELTFVESIFAFVFGRGDPNERLETRRWRAVGALLRVNKGCVYAEQVAPFLDSYLLS 284
Query: 299 -----------------------------------DRTMSDESYVLPVLLRFDGQPEIDE 323
D + E Y+L VL RF G E +
Sbjct: 285 KEDHNEVKNGLFASIFDVVSHARRIFRKKQAENERDTSKMHEGYMLEVLTRFGGHAEASD 344
Query: 324 EGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKIFREKK--WEFSKTNMSERGM 381
+G ++Y FPS Q T ++ R A I+ + WE S M +
Sbjct: 345 DGKLIYVFPSLQVTTIAEAAASSRSTPGR-AVPPPTPPPIYERVRPLWE-SGPKMP---L 399
Query: 382 AIGLGGLNLFGVIILGAM 399
+ LG LN+F + + A+
Sbjct: 400 VVALGFLNIFMIYVFHAL 417
>gi|145355506|ref|XP_001422002.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582241|gb|ABP00296.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 573
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 63/134 (47%), Gaps = 38/134 (28%)
Query: 243 DDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI---- 298
D D++++F +S+F+FVFG GDPN +E +RW+ +G + N G V AE++AP+LD
Sbjct: 253 DRDRELSFFESIFAFVFGRGDPNDNLETRRWRAVGALLRVNKGCVFAEQVAPFLDTYLLT 312
Query: 299 ----------------------------------DRTMSDESYVLPVLLRFDGQPEIDEE 324
D E Y+L VL RF G E +
Sbjct: 313 KEDHSEVRNGLFAVVFDLVAHARRLFRRKADAERDVRRMHEGYMLEVLTRFGGFAEASDA 372
Query: 325 GNILYRFPSFQRTA 338
G ++Y FPS Q TA
Sbjct: 373 GELIYVFPSLQVTA 386
>gi|162453881|ref|YP_001616248.1| hypothetical protein sce5605 [Sorangium cellulosum So ce56]
gi|161164463|emb|CAN95768.1| hypothetical protein sce5605 [Sorangium cellulosum So ce56]
Length = 515
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 108/250 (43%), Gaps = 19/250 (7%)
Query: 98 VTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKS-FRL 156
+T+ D + K+GL L +A+ L L ++ G L+ + EG++L+ FP + ++
Sbjct: 52 LTLADASAKSGLPLRDAESGLHLLVSEHRGHLKATSEGELLFRFPYGFTKPWETRTRLTR 111
Query: 157 KVEPVIDKAKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKSDDDDRGRR-------- 208
++ V+ A A + +R LIA +VI +IA L ++S D R
Sbjct: 112 ALQAVLRVAAGVARFVVRAWIAIVLIAYVVIFVGILIAQLFARSSSDSRNHDGVPGSFAG 171
Query: 209 ----RRSFDSGFNIFISPSDLFWYWDPYY--YRRRRVQTDDDDKKMNFIKSVFSFVFGEG 262
R D+ F F S W +P + R + F + V F FG
Sbjct: 172 YFLFRLVLDAIFWTFHPFSPFVWTSEPAWSSSHGHRGALGRRRDETPFYEKVNRFFFGPT 231
Query: 263 -DPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDGQPEI 321
+P +E + KLI I + G + ++ + R +D ++L +DG ++
Sbjct: 232 PEPRDPLEAE--KLILAEIRAQRGRIGLADVMRVTGLSRDEADPRMAR-LMLDYDGTVDV 288
Query: 322 DEEGNILYRF 331
EEG I+YRF
Sbjct: 289 SEEGGIVYRF 298
>gi|374815345|ref|ZP_09719082.1| hypothetical protein TpriZ_15905 [Treponema primitia ZAS-1]
Length = 496
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 107/250 (42%), Gaps = 19/250 (7%)
Query: 98 VTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAA--KSFR 155
VT+ D+ K L LN ++ + A + L+V++ G++LY FP + +K F
Sbjct: 24 VTVADIVAKTALPLNTVRELVPRAADEYSARLQVTESGEILYSFPRGFTSKYRGFKAGFS 83
Query: 156 LKVEPVIDKAKAAAEYSIR-----------VLFGTALIASIVIVFTAIIAILSSKSDDDD 204
+E K AAE+ + VLF + S+VI A + S +
Sbjct: 84 RFMEKFGKALKIAAEFVFKIWIMVMLVGYFVLFMVIALGSLVISVAASNSNNRSSNRSGG 143
Query: 205 RGRRRRSFDSGFNIFIS---PSDLFWYWDPYYYRRRRVQTDDDDKKMNFIKSVFSFVFGE 261
S FN+ I S+L D Y R VQ + + K++FSFVFG+
Sbjct: 144 GIGGMYFASSIFNMIIRIWFYSELTKSMDRRYGYGRSVQARPKGRPL--YKAIFSFVFGD 201
Query: 262 GDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDGQPEI 321
GDPN + + + YI +N GV++ E L + E + F G PE
Sbjct: 202 GDPNADWPRREKQAVIAYIQANRGVISLPEFMT-LTGNSPAEAEERITGYCAEFGGLPEA 260
Query: 322 DEEGNILYRF 331
++G ++YRF
Sbjct: 261 SDDGTVVYRF 270
>gi|333997871|ref|YP_004530483.1| hypothetical protein TREPR_2607 [Treponema primitia ZAS-2]
gi|333740869|gb|AEF86359.1| conserved hypothetical protein [Treponema primitia ZAS-2]
Length = 500
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 80/346 (23%), Positives = 146/346 (42%), Gaps = 36/346 (10%)
Query: 95 NRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKSF 154
+ T+ D+ K L LN ++ + A + G LEV++ G++LY FP + +K + F
Sbjct: 22 QKGATVADIVAKTALPLNTVRELVPQAADEYSGRLEVTESGEILYSFPRGFISKY--RGF 79
Query: 155 RLKVEPVIDK-AKAAAEYSI---RVLFGTALIASIVIVFTAIIA--ILSSKSDDDDRGRR 208
+ + ++K KA +S+ +V L+ + +A +LS + R
Sbjct: 80 KASLNRFLEKFGKALKIFSVGAFKVWIMVMLVGYFALFMLIALASLMLSVAASSSSRSDN 139
Query: 209 RRSFDSGFNIFISPSDLF------WYWD--------PYYYRRRRVQTDDDDKKMNFIKSV 254
R S G +F + S +F W++ Y Y R Q + + K++
Sbjct: 140 RSSSRGGGGLFFA-SGIFNMIIRIWFYSELTKSIDRSYGYGRGSSQPRPKGRPL--YKAI 196
Query: 255 FSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLR 314
FSFVFG+GDPN + + + YI +N GV++ E + T ++E +
Sbjct: 197 FSFVFGDGDPNADWISREKQGVIAYIQANAGVISLPEFMALTGKESTEAEEG-ITGYCAE 255
Query: 315 FDGQPEIDEEGNILYRFPSFQRTAASQRIGRKEYVGRRWADAIGGVEKI--FREKKWEFS 372
F G PE ++G ++YRF + + R + R +A ++++ F K + +
Sbjct: 256 FGGLPEATDDGTVVYRF--------DELLLRADKKSRSFAGFSAPLKRLLSFSSNKQKMN 307
Query: 373 KTNMSERGMAIGLGGLNLFGVIILGAMLQEMAVTPNGFLKFVAYIF 418
G + G LF GA+ + +L V+Y+
Sbjct: 308 GWFSVINGANLAFGSYFLFNAFTTGAITTQAQFDAASYLYKVSYLL 353
>gi|412986081|emb|CCO17281.1| predicted protein [Bathycoccus prasinos]
Length = 639
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 66/135 (48%), Gaps = 40/135 (29%)
Query: 248 MNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI--------- 298
++FI+S+++FVFG GDPN+ +E +R K I + + +N GVV AE+LA + D
Sbjct: 310 LSFIESIYAFVFGRGDPNEFLEHERSKAISQLLRANRGVVFAEQLAAFTDAFLLSKKDRK 369
Query: 299 -------------------------DRTMSD------ESYVLPVLLRFDGQPEIDEEGNI 327
R++ D E YVL +L +F G E D+ G +
Sbjct: 370 GMRGGGSLFSSFFGGNRRDVKKDDGSRSLDDREERQHEGYVLRILEQFYGHAESDDYGRL 429
Query: 328 LYRFPSFQRTAASQR 342
+Y FPSFQ TA +
Sbjct: 430 IYIFPSFQVTAEENK 444
>gi|303289423|ref|XP_003063999.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226454315|gb|EEH51621.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 537
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 68/297 (22%), Positives = 110/297 (37%), Gaps = 94/297 (31%)
Query: 129 LEVSDEGDVLYVFPNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIV 188
+E + G+V+YVFP+ RA + A+ R + A R LF T LI S+ ++
Sbjct: 64 VESAPNGEVIYVFPHRARAAVLARDARENSAAFRRRTWRGALALFRGLFATFLIVSVCVI 123
Query: 189 FTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPS-------------------DLFWYW 229
F A++AI + + +RG R D +F +P D FW+
Sbjct: 124 FLALVAI-TIIALTQNRGGGGRGGDDVLPVFFTPGGGGGGGGFGGGPYYRHHGVDNFWF- 181
Query: 230 DPYYYRRR----RVQTDDDDKKMNFIKSVFSFVFGEG----------------------- 262
Y Y R + + ++M + + ++ +G
Sbjct: 182 --YLYMRDIWWFTYWNEHEHRRMIYERGMYGGRYGAHVGRINVVDENGVAIGVPVRPHGG 239
Query: 263 ---------DPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDI--------------- 298
DPN +E +RW+ I + +N G V AE++AP+LD
Sbjct: 240 KGGPGRGGGDPNDTLEPRRWRAIALLLRANKGAVFAEQVAPFLDTFLLGAEGSNARDAGV 299
Query: 299 --------------------DRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQ 335
D + E Y+L V RF G E ++G ++Y FPS Q
Sbjct: 300 FDALRRRVGLGAKKRGDRERDPSRMHEGYMLRVCSRFGGHAESSDDGKLVYVFPSLQ 356
>gi|379731250|ref|YP_005323446.1| hypothetical protein SGRA_3134 [Saprospira grandis str. Lewin]
gi|378576861|gb|AFC25862.1| hypothetical protein SGRA_3134 [Saprospira grandis str. Lewin]
Length = 490
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 122/280 (43%), Gaps = 45/280 (16%)
Query: 94 CNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRAKLAAKS 153
N++ T+ D A G+ + E++ A++ L D L+V++ GD++Y F +R KS
Sbjct: 19 SNQQFTLEDAAASTGMPIIESESAIRELMQRYDCKLKVTENGDLIYDFTGLHRR--TEKS 76
Query: 154 FRLKVEPVIDKAKAAAEYSIRVLFGTALIAS----IVIVFTAIIAILSS--KSDDDDR-G 206
F ++ D + R++ L+ +VI+ I+ +LS SD+D + G
Sbjct: 77 FGERLAEFADWFWKGFKIFYRIVTAVFLLIYFVLFVVILIALIVGLLSQGGNSDNDSKSG 136
Query: 207 RRRRSFDSGFNIFISP---SDLFWYWDPYYYRRR--------------------RVQTDD 243
+ F F +FIS S + Y Y YR R R
Sbjct: 137 GIGKLFLVLFRVFISIFEWSTILGY--DYTYRSRDDYGYPYKHYVEKPGQLAKLRKNRKS 194
Query: 244 DDKKMNFIKSVFSFVFG----EGDPNQGIEEKRWKLIGEYIASNGGVVTAEELAPYLDID 299
++ F+ SV+ F+FG + DP +E + ++ N G+V+ EL
Sbjct: 195 SKEEKGFVASVYDFIFGPVRYQPDPYANHKE-----VASFLKENKGLVSTAELQALAGWR 249
Query: 300 RTMSDESYVLPVLLRFDGQPEIDEEGNILY-RFPSFQRTA 338
R + E+++ +L R+DGQ +I E +LY F R+A
Sbjct: 250 RDEA-ENFMTELLGRYDGQAKISERSAVLYGDFSQLNRSA 288
>gi|424841171|ref|ZP_18265796.1| hypothetical protein SapgrDRAFT_0547 [Saprospira grandis DSM 2844]
gi|395319369|gb|EJF52290.1| hypothetical protein SapgrDRAFT_0547 [Saprospira grandis DSM 2844]
Length = 490
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 127/292 (43%), Gaps = 45/292 (15%)
Query: 82 DVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVF 141
+V R + N++ T+ D G+ + E++ A++ L D L+V++ GD++Y F
Sbjct: 7 EVNLRLEKYLLKSNQQFTLEDATASTGMPIIESESAIRELMQRYDCKLKVTENGDLIYDF 66
Query: 142 PNNYRAKLAAKSFRLKVEPVIDKAKAAAEYSIRVLFGTALIAS----IVIVFTAIIAILS 197
+R KSF +++ D + R++ L+ +VI+ I+ +LS
Sbjct: 67 TGLHRR--TEKSFGERLQEFADWFWKGFKIFYRIVTAVFLLIYFVLFVVILIALIVGLLS 124
Query: 198 S--KSDDDDR-GRRRRSFDSGFNIFISP---SDLFWYWDPYYYRRR-------------- 237
SD+D + G + F F +F+S S + Y Y YR R
Sbjct: 125 QGGNSDNDSKSGGVGKLFLVLFRVFLSIFEWSTILGY--DYTYRSRDDYGYPYKHYVEKP 182
Query: 238 ------RVQTDDDDKKMNFIKSVFSFVFG----EGDPNQGIEEKRWKLIGEYIASNGGVV 287
R + ++ F+ S++ F+FG + DP +E + ++ N G+V
Sbjct: 183 GQLAKLRKKRKSSKEEKGFVASIYDFIFGPVRYQPDPYANHKE-----VASFLKENKGLV 237
Query: 288 TAEELAPYLDIDRTMSDESYVLPVLLRFDGQPEIDEEGNILY-RFPSFQRTA 338
+ EL R + E+++ +L R+DGQ +I E +LY F R+A
Sbjct: 238 STAELQALAGWRRDEA-ENFMTELLGRYDGQAKISERSAVLYGNFSQLNRSA 288
>gi|56750095|ref|YP_170796.1| hypothetical protein syc0086_c [Synechococcus elongatus PCC 6301]
gi|56685054|dbj|BAD78276.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 321
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 54/90 (60%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ A++ RVT+GDVA ++GL++ + L LAA + L+VSD GD+ + FP+N R
Sbjct: 8 VQAIERLGYRVTVGDVAAQSGLRVALVEHQLLQLAAASQATLQVSDRGDIAFQFPHNLRQ 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLF 177
+L +S+R +++ + + Y IR+ F
Sbjct: 68 RLQQESWRQQLQLLGQQLWRLLFYLIRISF 97
>gi|81300436|ref|YP_400644.1| hypothetical protein Synpcc7942_1627 [Synechococcus elongatus PCC
7942]
gi|81169317|gb|ABB57657.1| hypothetical protein Synpcc7942_1627 [Synechococcus elongatus PCC
7942]
Length = 432
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 31/90 (34%), Positives = 54/90 (60%)
Query: 88 MDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNNYRA 147
+ A++ RVT+GDVA ++GL++ + L LAA + L+VSD GD+ + FP+N R
Sbjct: 8 VQAIERLGYRVTVGDVAAQSGLRVALVEHQLLQLAAASQATLQVSDRGDIAFQFPHNLRQ 67
Query: 148 KLAAKSFRLKVEPVIDKAKAAAEYSIRVLF 177
+L +S+R +++ + + Y IR+ F
Sbjct: 68 RLQQESWRQQLQLLGQQLWRLLFYLIRISF 97
>gi|347755989|ref|YP_004863552.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
B]
gi|347588506|gb|AEP13035.1| hypothetical protein Cabther_B0027 [Candidatus Chloracidobacterium
thermophilum B]
Length = 482
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 74/288 (25%), Positives = 122/288 (42%), Gaps = 59/288 (20%)
Query: 96 RRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVF-PNNYRAKLAAKSF 154
+R+T+ + A GL ++EA+ AL+ L D L+V++ GD++Y F P +R KSF
Sbjct: 22 KRLTLTEAAATTGLSIDEAEAALRELLMRYDCVLQVTENGDLIYDFGPRLHRRD--EKSF 79
Query: 155 RLKVEPVIDKAKAAAEYSIRVLFGTALIASIVIVF---------TAIIAILSSKSDDDDR 205
R ++ ++ + VLF A IA ++V+ A+IA+L + + +
Sbjct: 80 REYLDEFLELL-----WKGFVLFFKAWIAITLVVYFLIFLTILVMAVIALLFA-GGNKSK 133
Query: 206 GRRRRSFDSGFNIFISPSDLF--------------WYWDPYYYRR--------------- 236
GRR +S + F++ + LF + D + YR
Sbjct: 134 GRRSQS-AAPFHLIYYAARLFVAIFDWGTAAATTTYQTDQHGYRYATYQPKSAPLSPFQP 192
Query: 237 RRVQTDDDDKKMNFIKSVFSFVFG----EGDPNQGIEEKRWKLIGEYIASNGGVVTAEEL 292
+ QT KK +FI SV+ FVFG DP + +E I Y+ N G++ E+
Sbjct: 193 KNAQTAQPAKK-SFIASVYDFVFGPPRVTRDPLENQKE-----IAAYVRRNKGILFVSEI 246
Query: 293 APYLDIDRTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAAS 340
+D +M + RF G I + + F RTA
Sbjct: 247 KALTGLD-SMPAALLFSDCIGRFQGDIRISDRKLMYGLFDRLLRTAGE 293
>gi|168988199|gb|ACA35269.1| FtsZ2 [Cucumis sativus]
Length = 169
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 20/21 (95%), Positives = 21/21 (100%)
Query: 304 DESYVLPVLLRFDGQPEIDEE 324
DESY+LPVLLRFDGQPEIDEE
Sbjct: 2 DESYILPVLLRFDGQPEIDEE 22
>gi|443328286|ref|ZP_21056886.1| hypothetical protein Xen7305DRAFT_00013240 [Xenococcus sp. PCC
7305]
gi|442792132|gb|ELS01619.1| hypothetical protein Xen7305DRAFT_00013240 [Xenococcus sp. PCC
7305]
Length = 190
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/67 (34%), Positives = 39/67 (58%)
Query: 76 SDKLPADVRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEG 135
S K ++RN + V +T+ D+A A L +A+KAL+ A + D EV+++G
Sbjct: 14 SKKSDREIRNIFFNLVQNNQGSITVMDLAMAANLSGTDAKKALEKFAVEFDASFEVTEKG 73
Query: 136 DVLYVFP 142
++LY+FP
Sbjct: 74 NILYLFP 80
>gi|224069428|ref|XP_002302976.1| predicted protein [Populus trichocarpa]
gi|222844702|gb|EEE82249.1| predicted protein [Populus trichocarpa]
Length = 198
Score = 42.7 bits (99), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 18/27 (66%), Positives = 23/27 (85%)
Query: 275 LIGEYIASNGGVVTAEELAPYLDIDRT 301
+IG+ I+SNGGVV AEELAP+LD+ T
Sbjct: 75 MIGQCISSNGGVVAAEELAPFLDVKTT 101
>gi|427718787|ref|YP_007066781.1| hypothetical protein Cal7507_3555 [Calothrix sp. PCC 7507]
gi|427351223|gb|AFY33947.1| hypothetical protein Cal7507_3555 [Calothrix sp. PCC 7507]
Length = 141
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 37/66 (56%), Gaps = 2/66 (3%)
Query: 269 EEKRWKLI-GEYIASNGGVVTAEELAPYLDIDRTMSDESYVLPVLLRFDGQPEIDEEGNI 327
E+KR +L+ E I N G +T +LA +I T S + Y+ + E++E+GNI
Sbjct: 77 EQKRLQLLFLELIEQNAGTITVLQLAKNAEIS-TQSSKQYLDDKAKELNASFEVNEDGNI 135
Query: 328 LYRFPS 333
LYRF S
Sbjct: 136 LYRFSS 141
>gi|399054444|ref|ZP_10742942.1| hypothetical protein PMI08_04541 [Brevibacillus sp. CF112]
gi|433546185|ref|ZP_20502520.1| hypothetical protein D478_20971 [Brevibacillus agri BAB-2500]
gi|398047763|gb|EJL40270.1| hypothetical protein PMI08_04541 [Brevibacillus sp. CF112]
gi|432182557|gb|ELK40123.1| hypothetical protein D478_20971 [Brevibacillus agri BAB-2500]
Length = 150
Score = 42.0 bits (97), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 22/57 (38%), Positives = 30/57 (52%)
Query: 86 RAMDAVDACNRRVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFP 142
+ +D + VT+ D+A K+ L L EAQK L L L V+ GD+LY FP
Sbjct: 82 KVLDIARENDGYVTVVDIATKSTLSLEEAQKILDDLQKKGHADLSVTKSGDILYYFP 138
>gi|84997770|ref|XP_953606.1| SfiI-subtelomeric fragment related protein family member [Theileria
annulata]
gi|65304603|emb|CAI72928.1| SfiI-subtelomeric fragment related protein family member, putative
[Theileria annulata]
Length = 400
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 53/96 (55%), Gaps = 6/96 (6%)
Query: 109 LKLNEAQKALQALAADTDGFLEVSDEGDVLYVFPNN--YRAK--LAAKSFRLKVEPVIDK 164
+ N + LQ + DG +E++ E +++Y P N Y A L SF+L E V+ K
Sbjct: 195 INYNIVKNVLQLKPVEVDGDIEMTTE-NIVYREPFNKIYEANRGLGICSFKLNNE-VVWK 252
Query: 165 AKAAAEYSIRVLFGTALIASIVIVFTAIIAILSSKS 200
AK +EY++++++ L + I++++ I + SKS
Sbjct: 253 AKDTSEYAVKIIYSEYLTSKILLIYNNIDYTVLSKS 288
>gi|302338873|ref|YP_003804079.1| hypothetical protein Spirs_2370 [Spirochaeta smaragdinae DSM 11293]
gi|301636058|gb|ADK81485.1| hypothetical protein Spirs_2370 [Spirochaeta smaragdinae DSM 11293]
Length = 129
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 29/46 (63%)
Query: 97 RVTIGDVAGKAGLKLNEAQKALQALAADTDGFLEVSDEGDVLYVFP 142
++T+ DV + G+ + EA+K LQA+ + +EV D+G + Y FP
Sbjct: 73 KLTVSDVVIETGIAVQEAEKILQAMVDNQHVRMEVRDDGIIYYEFP 118
>gi|124002463|ref|ZP_01687316.1| hypothetical protein M23134_05166 [Microscilla marina ATCC 23134]
gi|123992292|gb|EAY31660.1| hypothetical protein M23134_05166 [Microscilla marina ATCC 23134]
Length = 536
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 54/114 (47%), Gaps = 15/114 (13%)
Query: 236 RRRVQTDDDDKKMNFIKSVFSFVFGEGDP---NQGIEEKRWKLIGEYIASNGGVVTAEEL 292
RR + +DD+ + +FS++FGE P + IE KL+ +I N G + A E+
Sbjct: 168 RRPSEEQEDDEGA--VHQMFSYIFGETTPKPDDLAIE----KLLLNFIVQNKGKIVAAEI 221
Query: 293 APYLDID-RTMSDESYVLPVLLRFDGQPEIDEEGNILYRFPSFQRTAASQRIGR 345
R +E+ L + + G E+ +EG I+Y FP + SQ I +
Sbjct: 222 VQLTGWSIRKAQEETAQL--MASYHGDAEVTDEGVIVYSFPDLE---DSQEINK 270
>gi|315122738|ref|YP_004063227.1| malic enzyme [Candidatus Liberibacter solanacearum CLso-ZC1]
gi|313496140|gb|ADR52739.1| malic enzyme [Candidatus Liberibacter solanacearum CLso-ZC1]
Length = 779
Score = 38.1 bits (87), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 38/138 (27%), Positives = 63/138 (45%), Gaps = 11/138 (7%)
Query: 60 ASTDVAVGVGPGRIVESDKLPAD-VRNRAMDAVDACNRRVTIGDVAGKAGLKLNEAQKAL 118
A T V+V I E+ L + VR+ M+ + + R G K+ LK++EA + +
Sbjct: 613 ADTHVSVDPSAREIAENTVLASQAVRSFGMNPLVSLLSRSNFGSHNTKSSLKMHEALEQI 672
Query: 119 QALAADTDGFLEVSDEGDVLYVFPNN-YRAKLAAKSFRLKVEPVIDKAKAAAE------- 170
+ L+ D ++ E D +F NN R ++ +L + P ID A + E
Sbjct: 673 RELSRDLKVDEKIQGEADFSEIFCNNAVRDTSLSQDAKLLIFPNIDSANISLEMVKSITN 732
Query: 171 --YSIRVLFGTALIASIV 186
Y +VL G+AL I+
Sbjct: 733 GLYIGKVLLGSALPVHIL 750
>gi|221210324|ref|ZP_03583304.1| helicase domain protein [Burkholderia multivorans CGD1]
gi|221169280|gb|EEE01747.1| helicase domain protein [Burkholderia multivorans CGD1]
Length = 1126
Score = 38.1 bits (87), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 59/149 (39%), Gaps = 24/149 (16%)
Query: 178 GTALIASIVIVFTAIIAILSSKSDDDDRGRRRRSFDSGFNIFISPSDLFWYWDPYYYRRR 237
G LI +V + +++A +K +DD + I P +L W+ Y +R R
Sbjct: 274 GGVLIGDVVGLGKSMMATALAKVYEDDYNTE--------TLIICPKNLVSMWESYVHRYR 325
Query: 238 ---------RVQTDDDDKKMNFIKSVFSFVFGEGDPNQGIEEKRWKLIGEYIASNGGVVT 288
+V T+ +K + + E + E KRWK+I EYI N +
Sbjct: 326 LHAKVLPLSKVLTELPEKTRRY----RLIIIDESHNLRNKEGKRWKVIREYIERNDSLTV 381
Query: 289 AEELAPYLDIDRTMSDESYVLPVLLRFDG 317
PY ++T D S L + L D
Sbjct: 382 LLSATPY---NKTYLDLSAQLALFLNEDA 407
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.139 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,702,357,935
Number of Sequences: 23463169
Number of extensions: 294630704
Number of successful extensions: 731522
Number of sequences better than 100.0: 171
Number of HSP's better than 100.0 without gapping: 153
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 730597
Number of HSP's gapped (non-prelim): 231
length of query: 423
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 278
effective length of database: 8,957,035,862
effective search space: 2490055969636
effective search space used: 2490055969636
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)