BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016737
(383 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297829368|ref|XP_002882566.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp. lyrata]
gi|297328406|gb|EFH58825.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp. lyrata]
Length = 374
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 276/374 (73%), Positives = 313/374 (83%), Gaps = 6/374 (1%)
Query: 10 NSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEA 69
N+T +P+L I K +S TKP F + T + F R S+ ESSLS+ KE
Sbjct: 7 NTTRIQTPSLPR---IPKPSSFTKPIKTHHLFSSETLLKRCRFVSR-SLPESSLSITKEQ 62
Query: 70 DAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLS 129
+ E + EDDPT ELSYLD E+D +SI EWELDFCSRPILD RGKKIWELVVCD SLS
Sbjct: 63 EVANEVE--EDDPTSELSYLDPESDADSIKEWELDFCSRPILDSRGKKIWELVVCDASLS 120
Query: 130 LQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIP 189
LQ TKYFPNNVINSITLK+AIV I DLGVP+PEKIRFFRSQMQTIITKACKEL IK +P
Sbjct: 121 LQVTKYFPNNVINSITLKDAIVTITQDLGVPLPEKIRFFRSQMQTIITKACKELAIKAVP 180
Query: 190 SKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS 249
SKRCLSL LWL+ERY+TVYTRHPGFQKGS PLL+LDNPFPM LP+NLFG+KWAFVQLP+S
Sbjct: 181 SKRCLSLFLWLQERYDTVYTRHPGFQKGSLPLLSLDNPFPMNLPENLFGEKWAFVQLPYS 240
Query: 250 AVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE 309
AV+EE+S E KFVFGA+LDLDLLGIEVD+ TLIPGL+VA+SRAKPLAAWMNGLEVCSIE
Sbjct: 241 AVREEISDFEEKFVFGATLDLDLLGIEVDENTLIPGLSVATSRAKPLAAWMNGLEVCSIE 300
Query: 310 TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDC 369
D+++G LILSVGI+TRY+YA YKK PVTT EAEAWE+AKKA GGLHFLAIQ++LDS+DC
Sbjct: 301 ADSSKGCLILSVGIATRYVYATYKKTPVTTDEAEAWESAKKASGGLHFLAIQDDLDSDDC 360
Query: 370 VGFWLLLDLPPPPV 383
VGFWLL+DLPPPPV
Sbjct: 361 VGFWLLIDLPPPPV 374
>gi|255553548|ref|XP_002517815.1| conserved hypothetical protein [Ricinus communis]
gi|223543087|gb|EEF44622.1| conserved hypothetical protein [Ricinus communis]
Length = 377
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 291/371 (78%), Positives = 328/371 (88%), Gaps = 5/371 (1%)
Query: 11 STSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQ----HFRPRPSVSESSLSVP 66
+T + +PT HKPISK TS +KPT V F ++ PP+ HF+ + SVS
Sbjct: 2 ATLSFNPTRIPHKPISKITSFSKPTKVYFP-VSQKPPKTHQKQLHFQSKLSVSTQEQVEV 60
Query: 67 KEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDG 126
++ D E E ++V+DDPT E+SYLD ETDP+SI EWELDFCSRPILDIRGKK+WELVVCD
Sbjct: 61 EDYDNEEEEEEVDDDPTAEVSYLDPETDPDSIVEWELDFCSRPILDIRGKKVWELVVCDD 120
Query: 127 SLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIK 186
SLSLQ+TKYFPNNVINSITLK+A+V++ +DLGVP+PEKIRFFRSQMQTIITKACKEL+IK
Sbjct: 121 SLSLQFTKYFPNNVINSITLKDALVSVSEDLGVPLPEKIRFFRSQMQTIITKACKELNIK 180
Query: 187 PIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL 246
P+PSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELP+NLFG+KWAFVQL
Sbjct: 181 PVPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPENLFGEKWAFVQL 240
Query: 247 PFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVC 306
PFSAVQEEVSSLE++F+FGASLDLDLLGIE+ +KTLIPGLAVASSRAKPLAAWMNGLEVC
Sbjct: 241 PFSAVQEEVSSLETRFMFGASLDLDLLGIEIGEKTLIPGLAVASSRAKPLAAWMNGLEVC 300
Query: 307 SIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDS 366
SIE DT+R LILSVG+STRYIYA YKKNPVTT+EAEAWEAAKK CGGLHFLAIQE+LDS
Sbjct: 301 SIEADTSRACLILSVGLSTRYIYATYKKNPVTTAEAEAWEAAKKTCGGLHFLAIQEDLDS 360
Query: 367 EDCVGFWLLLD 377
EDCVGFWLLLD
Sbjct: 361 EDCVGFWLLLD 371
>gi|18398129|ref|NP_566327.1| RNA binding protein [Arabidopsis thaliana]
gi|6648213|gb|AAF21211.1|AC013483_35 unknown protein [Arabidopsis thaliana]
gi|18252181|gb|AAL61923.1| unknown protein [Arabidopsis thaliana]
gi|24899681|gb|AAN65055.1| unknown protein [Arabidopsis thaliana]
gi|332641109|gb|AEE74630.1| RNA binding protein [Arabidopsis thaliana]
Length = 374
Score = 553 bits (1424), Expect = e-155, Method: Compositional matrix adjust.
Identities = 274/374 (73%), Positives = 311/374 (83%), Gaps = 6/374 (1%)
Query: 10 NSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEA 69
N+ +P+L I K +S TKP F + T + F R S+ ESSLS+ KE
Sbjct: 7 NTRRIQTPSLPR---IPKPSSFTKPIKTHHLFSSETLLKRCRFVSR-SLPESSLSITKEQ 62
Query: 70 DAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLS 129
+ E + EDDPT ELSYLD E+D +SI EWELDFCSRPILD RGKKIWELVVCD SLS
Sbjct: 63 EVANEVE--EDDPTSELSYLDPESDADSIKEWELDFCSRPILDSRGKKIWELVVCDASLS 120
Query: 130 LQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIP 189
LQ TKYFPNNVINSITLK+AIV I DLGVP+PEKIRFFRSQMQTIITKACKEL IK +P
Sbjct: 121 LQVTKYFPNNVINSITLKDAIVTITQDLGVPLPEKIRFFRSQMQTIITKACKELAIKAVP 180
Query: 190 SKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS 249
SKRCLSL LWL+ERY+TVYTRHPGFQKGS PLL+LDNPFPM LP+NLFG+KWAFVQLP+S
Sbjct: 181 SKRCLSLFLWLQERYDTVYTRHPGFQKGSLPLLSLDNPFPMNLPENLFGEKWAFVQLPYS 240
Query: 250 AVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE 309
AV+EE+S + KFVFGASLDLDLLGIEVD+ TLIPGL+VA+SRAKPLAAWMNGLEVCSIE
Sbjct: 241 AVREEISDFDEKFVFGASLDLDLLGIEVDENTLIPGLSVATSRAKPLAAWMNGLEVCSIE 300
Query: 310 TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDC 369
D+++G LILSVGI+TRY+YA YKK PVTT EAEAWE+AKK GGLHFLAIQ++LDS+DC
Sbjct: 301 ADSSKGCLILSVGIATRYVYATYKKTPVTTDEAEAWESAKKTSGGLHFLAIQDDLDSDDC 360
Query: 370 VGFWLLLDLPPPPV 383
VGFWLL+DLPPPPV
Sbjct: 361 VGFWLLIDLPPPPV 374
>gi|225450083|ref|XP_002278058.1| PREDICTED: uncharacterized protein LOC100243060 [Vitis vinifera]
Length = 378
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 302/377 (80%), Positives = 328/377 (87%), Gaps = 9/377 (2%)
Query: 4 AALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTN---TPPRLQHFRPRPSVSE 60
A LSLN T +PTL SHKPI +F SLT PT F TN T P+L HFR SVSE
Sbjct: 2 AGLSLN-PTKITTPTLQSHKPIYRFNSLTNPTKTQLKFPTNPAKTHPKLLHFR-HSSVSE 59
Query: 61 SSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWE 120
SS+SVPKE + + E DD PT E++YLD ETDPESI+EWELDFCSRPILDIRGKKIWE
Sbjct: 60 SSVSVPKEVEVDDEEDD----PTSEMNYLDRETDPESISEWELDFCSRPILDIRGKKIWE 115
Query: 121 LVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC 180
L+VCD SLSLQYTKYFPNNVINS+TLK AI +I D+L VP+PEKIRFFRSQMQTI+TKAC
Sbjct: 116 LLVCDSSLSLQYTKYFPNNVINSVTLKNAIESISDELDVPLPEKIRFFRSQMQTIVTKAC 175
Query: 181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
KEL IKPIPSKRCLSL+LWLEERYETVYTRHPGFQ+GSKPLL LDNPFPM+LP+NLFG+K
Sbjct: 176 KELGIKPIPSKRCLSLILWLEERYETVYTRHPGFQQGSKPLLTLDNPFPMQLPENLFGEK 235
Query: 241 WAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWM 300
WAFVQLPFSAVQEEVSSLE++ VFGASLDLDLLGIEVD TLIPGLAVASSRAKPLAAWM
Sbjct: 236 WAFVQLPFSAVQEEVSSLETRLVFGASLDLDLLGIEVDANTLIPGLAVASSRAKPLAAWM 295
Query: 301 NGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAI 360
NGLEVCSIE DTAR LILSVGISTRYIYA YKK PVTTSEAEAWEAAKKACGGLHFLAI
Sbjct: 296 NGLEVCSIEADTARACLILSVGISTRYIYATYKKTPVTTSEAEAWEAAKKACGGLHFLAI 355
Query: 361 QEELDSEDCVGFWLLLD 377
Q++L+S+DCVGFWLLLD
Sbjct: 356 QDDLNSDDCVGFWLLLD 372
>gi|118489335|gb|ABK96472.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 376
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 275/371 (74%), Positives = 316/371 (85%), Gaps = 6/371 (1%)
Query: 11 STSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRP----RPSVSESSLSVP 66
+T + +PT HKPISK S +K + + F F + P H +P +++ S+S
Sbjct: 2 ATLSFNPTRIPHKPISKTASFSKTSEMPFPF--SLKPSKHHVKPLHLQSNIITKLSVSTQ 59
Query: 67 KEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDG 126
+E + D EDDPT E YLD+ETDP+SI EWELDFCSRPILD+RGKK+WELVVCD
Sbjct: 60 EEEVETEKEDLEEDDPTAETVYLDQETDPDSILEWELDFCSRPILDVRGKKVWELVVCDD 119
Query: 127 SLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIK 186
SLSLQ+TKYFPNNVINSITLK+AIV+I DLGVP+PE+IRFFRSQMQTIITKACKE+ IK
Sbjct: 120 SLSLQFTKYFPNNVINSITLKDAIVSISVDLGVPLPERIRFFRSQMQTIITKACKEIGIK 179
Query: 187 PIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL 246
PIPSKRC+SLLLWLEERYETVYTRHPGFQKG+KPLLALDNPFPMELPDNLFG+KWAFVQL
Sbjct: 180 PIPSKRCISLLLWLEERYETVYTRHPGFQKGAKPLLALDNPFPMELPDNLFGEKWAFVQL 239
Query: 247 PFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVC 306
PFSAV+EE++S E+ F FGASLDLDLLGIE+DDKT+IPGLAVASSRA+PLAAWMNGLEVC
Sbjct: 240 PFSAVREEIASFETSFFFGASLDLDLLGIEIDDKTMIPGLAVASSRAEPLAAWMNGLEVC 299
Query: 307 SIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDS 366
+IE DT+R LILSVGI+TRY+YA YKK PVTT+EAEAWEAAKKACGGLHFLAIQ +LDS
Sbjct: 300 AIEADTSRACLILSVGIATRYVYATYKKTPVTTAEAEAWEAAKKACGGLHFLAIQNDLDS 359
Query: 367 EDCVGFWLLLD 377
+DCVGFWLLLD
Sbjct: 360 DDCVGFWLLLD 370
>gi|449436313|ref|XP_004135937.1| PREDICTED: uncharacterized protein LOC101208052 [Cucumis sativus]
gi|449488836|ref|XP_004158187.1| PREDICTED: uncharacterized protein LOC101230638 [Cucumis sativus]
Length = 379
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 270/353 (76%), Positives = 311/353 (88%), Gaps = 8/353 (2%)
Query: 23 KPI-SKFTSLTKPTN-VSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEIEADDVED 80
KPI S F+ K N S N + P L FR SVSESS++ P+E +E ++ ED
Sbjct: 23 KPIYSPFSQSIKTANRFSANGRISQQP-LPRFRSN-SVSESSVTAPEE----VELNEDED 76
Query: 81 DPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNV 140
DPT E++YLD ETDPESITEWELDFCSRPILDIRGKK+WELVVCD SLSLQYTKYFPNNV
Sbjct: 77 DPTLEMAYLDSETDPESITEWELDFCSRPILDIRGKKVWELVVCDNSLSLQYTKYFPNNV 136
Query: 141 INSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWL 200
INSITL++A+ +I ++LGVP+P+KIRFFRSQMQTIITKAC EL IKPIPSKRCLSLLLWL
Sbjct: 137 INSITLRDAVSSIAEELGVPLPDKIRFFRSQMQTIITKACTELGIKPIPSKRCLSLLLWL 196
Query: 201 EERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLES 260
EERYETVYTRHPGFQKGSKPLLALDNPFPMELP+NLFG++WAFVQLPFSAVQEE+S+L+
Sbjct: 197 EERYETVYTRHPGFQKGSKPLLALDNPFPMELPENLFGERWAFVQLPFSAVQEEISNLKE 256
Query: 261 KFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILS 320
F+FG+SLDLDLLGIE+DDKT+IPGL+VA+SRA+PLAAWMNG+EV S+E DT+R SLILS
Sbjct: 257 TFMFGSSLDLDLLGIEIDDKTMIPGLSVATSRAQPLAAWMNGMEVYSVEADTSRASLILS 316
Query: 321 VGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 373
VGI+TRY+YA YKK PVT++EAEAWEAAKKACGGLHFLAIQ++LDSEDCVGFW
Sbjct: 317 VGIATRYVYATYKKTPVTSAEAEAWEAAKKACGGLHFLAIQDDLDSEDCVGFW 369
>gi|224104083|ref|XP_002313311.1| predicted protein [Populus trichocarpa]
gi|222849719|gb|EEE87266.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 514 bits (1324), Expect = e-143, Method: Compositional matrix adjust.
Identities = 258/312 (82%), Positives = 289/312 (92%), Gaps = 1/312 (0%)
Query: 67 KEADAEIEADDVE-DDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCD 125
+E + E E D E DDPT E+ YLD ETDP+SI EWELDFCSRPILD+RGKK+WELVVCD
Sbjct: 8 QEEEVETEKKDYEEDDPTTEMVYLDPETDPDSIVEWELDFCSRPILDVRGKKVWELVVCD 67
Query: 126 GSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDI 185
SLSLQ+TKYFPNNVINSITLK+AIV+I +DLGVP+PE+IRFFRSQMQTIITKACKE+ I
Sbjct: 68 DSLSLQFTKYFPNNVINSITLKDAIVSISEDLGVPLPERIRFFRSQMQTIITKACKEIGI 127
Query: 186 KPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ 245
KPIPSKRC+SLLLWLEERYETVYTRHPGFQKG+KPLLALDNPFPMELPDNLFG+KWAFVQ
Sbjct: 128 KPIPSKRCISLLLWLEERYETVYTRHPGFQKGAKPLLALDNPFPMELPDNLFGEKWAFVQ 187
Query: 246 LPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEV 305
LP+SAV+EE++SLE+ F FGASLDLDLLGIE+DDKT+IPGLAVASSRA+PLAAWMNGLEV
Sbjct: 188 LPYSAVREEIASLETSFFFGASLDLDLLGIEIDDKTMIPGLAVASSRAEPLAAWMNGLEV 247
Query: 306 CSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELD 365
+IE DT+R LILSVGI+TRY+YA YKK PVTT+EAEAWEAAKKACGGLHFLAIQ +LD
Sbjct: 248 VAIEADTSRACLILSVGIATRYVYATYKKTPVTTAEAEAWEAAKKACGGLHFLAIQNDLD 307
Query: 366 SEDCVGFWLLLD 377
S+DCVGFWLLLD
Sbjct: 308 SDDCVGFWLLLD 319
>gi|388502160|gb|AFK39146.1| unknown [Lotus japonicus]
Length = 382
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 260/377 (68%), Positives = 316/377 (83%), Gaps = 5/377 (1%)
Query: 4 AALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSF--NFLTNTPPRLQHFRPRPSVSES 61
A LS N T +PT N P +K TS +KP + + + ++ +L HFR SVSE+
Sbjct: 2 ATLSFN-PTRIRTPTFNRSNPSTKLTSSSKPIRIPCIPSSINHSHQKLIHFRAN-SVSET 59
Query: 62 SLSVPKEADAEIEADDVEDD-PTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWE 120
SLS KE + E D+ EDD PT E+S+LD ETDP++I++WELDFCSRPILD RGKK+WE
Sbjct: 60 SLSTQKEEEQETLGDEEEDDDPTAEMSFLDPETDPDAISDWELDFCSRPILDARGKKLWE 119
Query: 121 LVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC 180
LVVCD +LSLQ+TKYFPNNVINSITLK+A+V++CDDLG+P+P+KIRFFRSQMQTIIT+AC
Sbjct: 120 LVVCDSTLSLQFTKYFPNNVINSITLKDAVVSVCDDLGLPLPKKIRFFRSQMQTIITRAC 179
Query: 181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
EL IKP+PSKRCLSLLLWLEERYETVY +HPGFQKG PLLALDNPFP +LP++LFG++
Sbjct: 180 NELGIKPVPSKRCLSLLLWLEERYETVYKKHPGFQKGFTPLLALDNPFPTKLPEDLFGER 239
Query: 241 WAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWM 300
WAFVQLPFSAV+EE++SL++ +FG+ LDLDL+GIE+DDKT+IPGLAV SSRA L+A M
Sbjct: 240 WAFVQLPFSAVREELTSLQTNMIFGSGLDLDLMGIEIDDKTMIPGLAVGSSRATVLSAIM 299
Query: 301 NGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAI 360
N E+C++E DTARGSLILSVGISTRY+YA YKK P TTSEAEAWEAAKKACGGLHFLAI
Sbjct: 300 NSFELCTVEADTARGSLILSVGISTRYVYATYKKTPTTTSEAEAWEAAKKACGGLHFLAI 359
Query: 361 QEELDSEDCVGFWLLLD 377
Q++++SE+C GFWLLLD
Sbjct: 360 QQDIESEECAGFWLLLD 376
>gi|224104081|ref|XP_002313310.1| predicted protein [Populus trichocarpa]
gi|222849718|gb|EEE87265.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 509 bits (1312), Expect = e-142, Method: Compositional matrix adjust.
Identities = 254/299 (84%), Positives = 282/299 (94%)
Query: 79 EDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPN 138
EDDPT E YLD+ETDP+SI EWELDFCSRPILD+RGKK+WELVVCD SLSLQ+TKYFPN
Sbjct: 21 EDDPTAETVYLDQETDPDSIVEWELDFCSRPILDVRGKKVWELVVCDDSLSLQFTKYFPN 80
Query: 139 NVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLL 198
NVINSITLK+AIV+I DLGVP+PE+IRFFRSQM TIITKACKE+ IKPIPSKRC+SLLL
Sbjct: 81 NVINSITLKDAIVSISVDLGVPLPERIRFFRSQMLTIITKACKEIGIKPIPSKRCISLLL 140
Query: 199 WLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSL 258
WLEERYETVYTRHPGFQKG+KPLLALDNPFPMELPDNLFG+KWAFVQLPFSAV+EE++SL
Sbjct: 141 WLEERYETVYTRHPGFQKGAKPLLALDNPFPMELPDNLFGEKWAFVQLPFSAVREEIASL 200
Query: 259 ESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLI 318
E++F FGASLDLDLLGIE+DDKT+IPGLAVASSRA+PLAAWMNGLEV +IE DT+R LI
Sbjct: 201 ETRFFFGASLDLDLLGIEIDDKTMIPGLAVASSRAEPLAAWMNGLEVVAIEADTSRACLI 260
Query: 319 LSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
LSVGI+TRY+YA YKK PVTT+EAEAWEAAKKACGGLHFLAIQ +LDS+DCVGFWLLLD
Sbjct: 261 LSVGIATRYVYATYKKTPVTTAEAEAWEAAKKACGGLHFLAIQNDLDSDDCVGFWLLLD 319
>gi|297736276|emb|CBI24914.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 256/292 (87%), Positives = 276/292 (94%)
Query: 86 LSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSIT 145
++YLD ETDPESI+EWELDFCSRPILDIRGKKIWEL+VCD SLSLQYTKYFPNNVINS+T
Sbjct: 1 MNYLDRETDPESISEWELDFCSRPILDIRGKKIWELLVCDSSLSLQYTKYFPNNVINSVT 60
Query: 146 LKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYE 205
LK AI +I D+L VP+PEKIRFFRSQMQTI+TKACKEL IKPIPSKRCLSL+LWLEERYE
Sbjct: 61 LKNAIESISDELDVPLPEKIRFFRSQMQTIVTKACKELGIKPIPSKRCLSLILWLEERYE 120
Query: 206 TVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFG 265
TVYTRHPGFQ+GSKPLL LDNPFPM+LP+NLFG+KWAFVQLPFSAVQEEVSSLE++ VFG
Sbjct: 121 TVYTRHPGFQQGSKPLLTLDNPFPMQLPENLFGEKWAFVQLPFSAVQEEVSSLETRLVFG 180
Query: 266 ASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGIST 325
ASLDLDLLGIEVD TLIPGLAVASSRAKPLAAWMNGLEVCSIE DTAR LILSVGIST
Sbjct: 181 ASLDLDLLGIEVDANTLIPGLAVASSRAKPLAAWMNGLEVCSIEADTARACLILSVGIST 240
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
RYIYA YKK PVTTSEAEAWEAAKKACGGLHFLAIQ++L+S+DCVGFWLLLD
Sbjct: 241 RYIYATYKKTPVTTSEAEAWEAAKKACGGLHFLAIQDDLNSDDCVGFWLLLD 292
>gi|357457965|ref|XP_003599263.1| hypothetical protein MTR_3g030950 [Medicago truncatula]
gi|355488311|gb|AES69514.1| hypothetical protein MTR_3g030950 [Medicago truncatula]
Length = 380
Score = 474 bits (1219), Expect = e-131, Method: Compositional matrix adjust.
Identities = 259/376 (68%), Positives = 307/376 (81%), Gaps = 5/376 (1%)
Query: 4 AALSLNNSTSTNSPTLNSHKPI--SKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSES 61
A LS N ST +P+ N PI +K +S +KP + F F +N L+ S + S
Sbjct: 2 ATLSFN-STRIKTPSFNYTNPIITTKLSS-SKPI-IKFPFSSNKNHFLKLQISSVSETSS 58
Query: 62 SLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWEL 121
+ + K+ + E E ++ ++DPT E YLD E DP+SI WELDFCSRPILD RGKK+WEL
Sbjct: 59 TTTTQKDIEEEEEEEEEKEDPTAETCYLDPEADPDSILSWELDFCSRPILDARGKKLWEL 118
Query: 122 VVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACK 181
VVCD SLSLQYTKYFPNNVINSITLK++IVAICDDL +P+P IRFFRSQMQTIITKACK
Sbjct: 119 VVCDKSLSLQYTKYFPNNVINSITLKDSIVAICDDLDLPVPRNIRFFRSQMQTIITKACK 178
Query: 182 ELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKW 241
EL I+ +PSKRCLSLLLWLEERYETVYT+HPGFQKGSKPLL LDNPF +LP++LFG++W
Sbjct: 179 ELGIRALPSKRCLSLLLWLEERYETVYTKHPGFQKGSKPLLPLDNPFATKLPEDLFGERW 238
Query: 242 AFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMN 301
AFVQLP+SAV+ E S+ E +F +G+ LDLDLLGIE+D+KTLIPGLAVASSRAK L+A+MN
Sbjct: 239 AFVQLPYSAVRAEASASEERFGYGSGLDLDLLGIEIDEKTLIPGLAVASSRAKILSAFMN 298
Query: 302 GLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQ 361
GLE+CSIETDTAR +L LSVGISTRY+YA YKK+P +T EAEAWEAAKKA GGLHFLAIQ
Sbjct: 299 GLELCSIETDTARSNLTLSVGISTRYVYATYKKSPTSTKEAEAWEAAKKASGGLHFLAIQ 358
Query: 362 EELDSEDCVGFWLLLD 377
+ELDSEDC+GFWLLLD
Sbjct: 359 DELDSEDCIGFWLLLD 374
>gi|363807199|ref|NP_001242607.1| uncharacterized protein LOC100795572 [Glycine max]
gi|255640179|gb|ACU20380.1| unknown [Glycine max]
Length = 377
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 264/383 (68%), Positives = 309/383 (80%), Gaps = 10/383 (2%)
Query: 4 AALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSL 63
A LS N SPT SK T+ +K + +N+ P+L HFRPR SVSES+
Sbjct: 2 ATLSFN-PVRIKSPTFKH----SKLTTPSKRITIPCTTPSNSHPKLLHFRPR-SVSESTQ 55
Query: 64 SVPKEA---DAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWE 120
EA + E E +D +DDP+ ELSY+D TDPESITEWELDFCSRPILD RGKK+WE
Sbjct: 56 KEAPEAVLGEEEEEEEDDDDDPSAELSYVDPVTDPESITEWELDFCSRPILDARGKKVWE 115
Query: 121 LVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC 180
LVVC +LSLQYTKYFPNNVINSITLK+AIVA+ D LGVP+P IRFFRSQMQTIIT AC
Sbjct: 116 LVVCGKTLSLQYTKYFPNNVINSITLKDAIVAVSDQLGVPLPRNIRFFRSQMQTIITNAC 175
Query: 181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
EL I+P+PSKRC+S++LWLEERYETVY +HPGFQ+GSKPLLALDNPFP ELPD L+G++
Sbjct: 176 NELRIRPVPSKRCVSIILWLEERYETVYKKHPGFQEGSKPLLALDNPFPTELPDILYGER 235
Query: 241 WAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWM 300
WAFVQLP+SAV+EE+S+ E + V G+ LDLDLLG+++DDKTLIPGL+VASS + LAA +
Sbjct: 236 WAFVQLPYSAVREEISTFE-RGVCGSGLDLDLLGLDIDDKTLIPGLSVASSNSTALAALI 294
Query: 301 NGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAI 360
NGLEVC++E DTAR LILS GISTRYIY+ YKK P TTSEAEAWEAAKKACGGLHFLA+
Sbjct: 295 NGLEVCAVEADTARARLILSSGISTRYIYSTYKKTPETTSEAEAWEAAKKACGGLHFLAV 354
Query: 361 QEELDSEDCVGFWLLLDLPPPPV 383
Q +LDSEDCVGF+LLLDLP PPV
Sbjct: 355 QPDLDSEDCVGFFLLLDLPFPPV 377
>gi|226508054|ref|NP_001150851.1| tab2 protein [Zea mays]
gi|194702852|gb|ACF85510.1| unknown [Zea mays]
gi|195642376|gb|ACG40656.1| tab2 protein [Zea mays]
gi|413937739|gb|AFW72290.1| Tab2 protein [Zea mays]
Length = 390
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 221/309 (71%), Positives = 269/309 (87%), Gaps = 1/309 (0%)
Query: 69 ADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSL 128
AD E+EA++ + DP E+ YLD + DPESI EWELDFCSRPILD RGKK+WELVVCD +L
Sbjct: 77 ADEEVEAEN-KVDPQAEVCYLDPDVDPESIREWELDFCSRPILDARGKKVWELVVCDATL 135
Query: 129 SLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPI 188
SLQ+T+YFPNN INS+TL++A+ ++ + LGVP+P+++RFFRSQMQTIIT+AC +L +K +
Sbjct: 136 SLQFTRYFPNNAINSVTLRDALASVSEALGVPMPDRVRFFRSQMQTIITRACGDLGVKAV 195
Query: 189 PSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF 248
PS+RC+SLLLWLEERYE VY+RHPGFQ G++PLLALDNPFP LP+NLFGDKWAFVQLPF
Sbjct: 196 PSRRCVSLLLWLEERYEVVYSRHPGFQAGTRPLLALDNPFPTTLPENLFGDKWAFVQLPF 255
Query: 249 SAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSI 308
SAV+EEV SLE ++ FGA LDL+LLG E+DD TL+PG+AV SSRAKPLAAWMNGLE+C++
Sbjct: 256 SAVREEVESLERRYAFGAGLDLELLGFELDDTTLVPGVAVESSRAKPLAAWMNGLEICAM 315
Query: 309 ETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSED 368
E DT R SLILS G+STRY+Y+ Y+K +T EAEAWEAAKKACGGLHFLAIQE L+S+
Sbjct: 316 EADTGRASLILSAGVSTRYVYSGYQKTAASTQEAEAWEAAKKACGGLHFLAIQENLNSDG 375
Query: 369 CVGFWLLLD 377
CVGFWLLLD
Sbjct: 376 CVGFWLLLD 384
>gi|356534594|ref|XP_003535838.1| PREDICTED: uncharacterized protein LOC100803590 [Glycine max]
Length = 378
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 255/360 (70%), Positives = 300/360 (83%), Gaps = 9/360 (2%)
Query: 29 TSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEI-----EADDVEDDPT 83
T+ +KP + +N+ P+L HFR R SVSES+ KEA + E +D +DDPT
Sbjct: 23 TTPSKPITIPCTTPSNSHPKLLHFRTR-SVSESTHQ--KEAPEAVLGEHEEEEDDDDDPT 79
Query: 84 QELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINS 143
ELSY+D ETDPESITEWELDFCSRPILD+RGKKIWELVVCD +LSLQYTKYFPNNVINS
Sbjct: 80 SELSYVDPETDPESITEWELDFCSRPILDVRGKKIWELVVCDKTLSLQYTKYFPNNVINS 139
Query: 144 ITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEER 203
ITLK+AIVA+ D LGVP+P IRFFRSQMQTIIT AC EL I+P+PSKRC+S++LWLEER
Sbjct: 140 ITLKDAIVAVSDQLGVPLPRNIRFFRSQMQTIITNACNELRIRPVPSKRCVSIILWLEER 199
Query: 204 YETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
YETVY +HPGFQ+GSKPLLALDNPFP ELPD L+G++WAFVQLP+SAV+EE+S+ E + V
Sbjct: 200 YETVYRKHPGFQEGSKPLLALDNPFPTELPDILYGERWAFVQLPYSAVREEISTFE-RGV 258
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGI 323
G+ LDL+LLG+++DDKTLIPGL+VASS A LAA +NGLEV ++E D R LILS GI
Sbjct: 259 CGSGLDLELLGLDIDDKTLIPGLSVASSNATALAALINGLEVSAVEADAPRARLILSAGI 318
Query: 324 STRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
STRYIY+ YKK P TTSEAEAWEAAKKACGGLHF+A+Q +LDSEDCVGF+LLLDLP PPV
Sbjct: 319 STRYIYSTYKKTPETTSEAEAWEAAKKACGGLHFIAVQPDLDSEDCVGFFLLLDLPFPPV 378
>gi|115447245|ref|NP_001047402.1| Os02g0610800 [Oryza sativa Japonica Group]
gi|47497182|dbj|BAD19229.1| putative Tab2 protein [Oryza sativa Japonica Group]
gi|113536933|dbj|BAF09316.1| Os02g0610800 [Oryza sativa Japonica Group]
gi|215704647|dbj|BAG94275.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 392
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 218/304 (71%), Positives = 261/304 (85%)
Query: 74 EADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYT 133
++++ E DP E+ YLD E D E I EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T
Sbjct: 83 DSEEEEMDPLAEVCYLDPEADAEGIREWELDFCSRPILDARGKKVWELVVCDATLSLQFT 142
Query: 134 KYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRC 193
++FPN INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC
Sbjct: 143 RFFPNTSINSVTLRDALASVATSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRC 202
Query: 194 LSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQE 253
+SLLLWLEERYETVY+RHPGFQ G+KPLL LDNPFP LP+NLFGDKWAFVQLPFSAV+E
Sbjct: 203 VSLLLWLEERYETVYSRHPGFQSGTKPLLTLDNPFPTSLPENLFGDKWAFVQLPFSAVRE 262
Query: 254 EVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA 313
EV SLE ++ FGA LDLDLLG E+D+ TLIPG+AV SSRAKPLAAWMNGLE+CS+E DT
Sbjct: 263 EVESLERRYAFGAGLDLDLLGFELDENTLIPGVAVESSRAKPLAAWMNGLEICSMEVDTG 322
Query: 314 RGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 373
R +LILS G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFW
Sbjct: 323 RANLILSAGVSTRYVYAGYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDGCVGFW 382
Query: 374 LLLD 377
LLLD
Sbjct: 383 LLLD 386
>gi|125540251|gb|EAY86646.1| hypothetical protein OsI_08028 [Oryza sativa Indica Group]
Length = 392
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 218/304 (71%), Positives = 261/304 (85%)
Query: 74 EADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYT 133
++++ E DP E+ YLD E D E I EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T
Sbjct: 83 DSEEEEMDPLAEVCYLDPEADAEGIREWELDFCSRPILDARGKKVWELVVCDATLSLQFT 142
Query: 134 KYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRC 193
++FPN INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC
Sbjct: 143 RFFPNTSINSVTLRDALASVATSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRC 202
Query: 194 LSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQE 253
+SLLLWLEERYETVY+RHPGFQ G+KPLL LDNPFP LP+NLFGDKWAFVQLPFSAV+E
Sbjct: 203 VSLLLWLEERYETVYSRHPGFQSGTKPLLTLDNPFPTSLPENLFGDKWAFVQLPFSAVRE 262
Query: 254 EVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA 313
EV SLE ++ FGA LDLDLLG E+D+ TLIPG+AV SSRAKPLAAWMNGLE+CS+E DT
Sbjct: 263 EVESLERRYAFGAGLDLDLLGFELDENTLIPGVAVESSRAKPLAAWMNGLEICSMEVDTG 322
Query: 314 RGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 373
R +LILS G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFW
Sbjct: 323 RANLILSAGVSTRYVYAGYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDGCVGFW 382
Query: 374 LLLD 377
LLLD
Sbjct: 383 LLLD 386
>gi|125582848|gb|EAZ23779.1| hypothetical protein OsJ_07488 [Oryza sativa Japonica Group]
Length = 304
Score = 450 bits (1157), Expect = e-124, Method: Compositional matrix adjust.
Identities = 217/297 (73%), Positives = 256/297 (86%)
Query: 81 DPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNV 140
DP E+ YLD E D E I EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T++FPN
Sbjct: 2 DPLAEVCYLDPEADAEGIREWELDFCSRPILDARGKKVWELVVCDATLSLQFTRFFPNTS 61
Query: 141 INSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWL 200
INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC+SLLLWL
Sbjct: 62 INSVTLRDALASVATSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRCVSLLLWL 121
Query: 201 EERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLES 260
EERYETVY+RHPGFQ G+KPLL LDNPFP LP+NLFGDKWAFVQLPFSAV+EEV SLE
Sbjct: 122 EERYETVYSRHPGFQSGTKPLLTLDNPFPTSLPENLFGDKWAFVQLPFSAVREEVESLER 181
Query: 261 KFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILS 320
++ FGA LDLDLLG E+D+ TLIPG+AV SSRAKPLAAWMNGLE+CS+E DT R +LILS
Sbjct: 182 RYAFGAGLDLDLLGFELDENTLIPGVAVESSRAKPLAAWMNGLEICSMEVDTGRANLILS 241
Query: 321 VGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFWLLLD
Sbjct: 242 AGVSTRYVYAGYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDGCVGFWLLLD 298
>gi|242062284|ref|XP_002452431.1| hypothetical protein SORBIDRAFT_04g025690 [Sorghum bicolor]
gi|241932262|gb|EES05407.1| hypothetical protein SORBIDRAFT_04g025690 [Sorghum bicolor]
Length = 399
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 215/293 (73%), Positives = 255/293 (87%)
Query: 85 ELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI 144
E+ YLD + DPESI EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T+YFPNN INS+
Sbjct: 101 EVCYLDPDADPESIREWELDFCSRPILDARGKKVWELVVCDATLSLQFTRYFPNNAINSV 160
Query: 145 TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
TL++A+ ++ + LGVP+P+++RFFRSQMQTIIT+AC EL +K +PS+RC+SLLLWLEERY
Sbjct: 161 TLRDALSSVSEALGVPMPDRVRFFRSQMQTIITRACGELGVKAVPSRRCVSLLLWLEERY 220
Query: 205 ETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVF 264
E VY+RHPGFQ G++PLLALDNPFP LP+NLFGDKWAFVQLPFSAV+EEV SL ++ F
Sbjct: 221 EVVYSRHPGFQAGTRPLLALDNPFPTTLPENLFGDKWAFVQLPFSAVREEVESLGRRYAF 280
Query: 265 GASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGIS 324
GA LDLDLLG E+DD TL+PG+AV SSRAKPLAAWMNGLE+ ++E DT R SLILS G+S
Sbjct: 281 GAGLDLDLLGFELDDSTLVPGVAVESSRAKPLAAWMNGLEISAMEVDTGRASLILSAGVS 340
Query: 325 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
TRYIY+ Y+K P T EAEAWEAAKKA GGLHFLAIQE L+S+ CVGFWLLLD
Sbjct: 341 TRYIYSGYQKTPAATQEAEAWEAAKKASGGLHFLAIQENLNSDGCVGFWLLLD 393
>gi|357150079|ref|XP_003575334.1| PREDICTED: uncharacterized protein LOC100846528 [Brachypodium distachyon]
Length = 394
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 220/358 (61%), Positives = 269/358 (75%), Gaps = 17/358 (4%)
Query: 33 KPTNVSFNFLTNTPPRLQHFRPRP-----SVSESSLSVPKEADAEIEADDVED------- 80
KP++ SF+ P + P P S+S S + AD DD
Sbjct: 27 KPSSASFSARPYPHPHYRLAVPTPRRPCRSISSESPTASAAADTAEGEDDPAAATIEEEE 86
Query: 81 -----DPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKY 135
DP E+ YLD E D E I EWE+DFCSRPILD RGKK+WELVVCD +LSLQ+T++
Sbjct: 87 EEEELDPLAEVCYLDPEADAEGIREWEVDFCSRPILDARGKKVWELVVCDATLSLQFTRF 146
Query: 136 FPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLS 195
FPN INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC+S
Sbjct: 147 FPNTSINSVTLRDALASVSTSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRCVS 206
Query: 196 LLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEV 255
LLLWLEERYETVY+RHPGFQ+G+KPLL LDNPF LPDNLFGDKWAFVQLPF+ V+EEV
Sbjct: 207 LLLWLEERYETVYSRHPGFQQGTKPLLTLDNPFASNLPDNLFGDKWAFVQLPFADVREEV 266
Query: 256 SSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG 315
L ++ FGA LDLDLLG E+D+ TL+PG+AV SSRA+PLAAWMNGLE+CS+E DT R
Sbjct: 267 ELLGRRYAFGAGLDLDLLGFELDETTLVPGVAVESSRARPLAAWMNGLEICSMEVDTDRA 326
Query: 316 SLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 373
+LILS G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFW
Sbjct: 327 NLILSAGVSTRYVYAAYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDSCVGFW 384
>gi|168032007|ref|XP_001768511.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680224|gb|EDQ66662.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 338
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/295 (63%), Positives = 233/295 (78%)
Query: 89 LDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKE 148
L E++D +SI+EWELDFCSRPILD RGKK+WELVVCD LQ+T++FPNNVINS+TL++
Sbjct: 44 LAEDSDVDSISEWELDFCSRPILDARGKKLWELVVCDSRRQLQFTRFFPNNVINSVTLRD 103
Query: 149 AIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
A++ I D LGVP PEKIRFFRSQMQTIITKACKELDI+P+PS+RC++L+ WLEER+ETVY
Sbjct: 104 ALMYIMDTLGVPKPEKIRFFRSQMQTIITKACKELDIQPVPSQRCVALIKWLEERFETVY 163
Query: 209 TRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
++HPG+Q+G+ PLL P++LPD L G++WAFVQLPF AV EE+ + VFG+ L
Sbjct: 164 SQHPGYQEGASPLLLQQQSLPLDLPDALRGEEWAFVQLPFEAVLEEMEGVVRGDVFGSVL 223
Query: 269 DLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYI 328
DL L I++ +IPG+AVASSRA PLAAW N LE+ +E DT R L+LS G++ R+
Sbjct: 224 DLGTLNIDLSGDIMIPGVAVASSRATPLAAWTNALELACLEVDTQRSCLVLSTGVADRWR 283
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
YA Y+K+ T +E EAWEAAKK CGGLHFLA+Q LDSE C GFWLLLD P PV
Sbjct: 284 YAFYRKSRQTDAEGEAWEAAKKKCGGLHFLAVQSSLDSELCTGFWLLLDTPISPV 338
>gi|168015159|ref|XP_001760118.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688498|gb|EDQ74874.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 290
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 180/292 (61%), Positives = 227/292 (77%), Gaps = 2/292 (0%)
Query: 92 ETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIV 151
+ D +SI EWELDFCSRPILD RGKK+WELVVCD LQ+T++FPNNVINS+TL++A++
Sbjct: 1 DADVDSIYEWELDFCSRPILDSRGKKLWELVVCDSRRQLQFTRFFPNNVINSVTLRDALL 60
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
I D L VP PEKIRFFRSQMQTIITKACKELDI+P+PS+RC++L+ WLEER+ETVY++H
Sbjct: 61 YIMDTLQVPKPEKIRFFRSQMQTIITKACKELDIQPVPSQRCVTLIKWLEERFETVYSQH 120
Query: 212 PGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLD 271
PG+Q+G+ PLL P++LPD L G++WAF L +AV EE+ + VFG+ LDLD
Sbjct: 121 PGYQEGASPLLLQQQSLPLDLPDALRGEEWAF--LALAAVLEEMEGVSKGDVFGSVLDLD 178
Query: 272 LLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYAN 331
L I++ +IPG+AVASSRA PLAAW N LE+ S+E DT R L+LS G++ R+ YA
Sbjct: 179 RLNIDLSPGIMIPGVAVASSRATPLAAWTNALELASLEVDTQRSCLVLSTGVADRWRYAF 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
Y+K+ T +E EAWEAAK+ CGGLHFLA+Q LDSE C GFWLL+D P PV
Sbjct: 239 YRKSRQTDAEGEAWEAAKRKCGGLHFLAVQSSLDSELCTGFWLLIDTPISPV 290
>gi|413937738|gb|AFW72289.1| hypothetical protein ZEAMMB73_111177 [Zea mays]
Length = 320
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 166/247 (67%), Positives = 206/247 (83%), Gaps = 3/247 (1%)
Query: 69 ADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSL 128
AD E+EA++ + DP E+ YLD + DPESI EWELDFCSRPILD RGKK+WELVVCD +L
Sbjct: 77 ADEEVEAEN-KVDPQAEVCYLDPDVDPESIREWELDFCSRPILDARGKKVWELVVCDATL 135
Query: 129 SLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPI 188
SLQ+T+YFPNN INS+TL++A+ ++ + LGVP+P+++RFFRSQMQTIIT+AC +L +K +
Sbjct: 136 SLQFTRYFPNNAINSVTLRDALASVSEALGVPMPDRVRFFRSQMQTIITRACGDLGVKAV 195
Query: 189 PSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF 248
PS+RC+SLLLWLEERYE VY+RHPGFQ G++PLLALDNPFP LP+NLFGDKWAFVQLPF
Sbjct: 196 PSRRCVSLLLWLEERYEVVYSRHPGFQAGTRPLLALDNPFPTTLPENLFGDKWAFVQLPF 255
Query: 249 SAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSI 308
SAV+EEV SLE ++ FGA LDL+LLG E+DD TL+PG+AV SSRAKPLA + L C +
Sbjct: 256 SAVREEVESLERRYAFGAGLDLELLGFELDDTTLVPGVAVESSRAKPLAETVPSL--CFL 313
Query: 309 ETDTARG 315
RG
Sbjct: 314 SPPFQRG 320
>gi|357457971|ref|XP_003599266.1| Cc-nbs-lrr resistance protein [Medicago truncatula]
gi|355488314|gb|AES69517.1| Cc-nbs-lrr resistance protein [Medicago truncatula]
Length = 1528
Score = 315 bits (807), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 163/206 (79%), Positives = 188/206 (91%)
Query: 172 MQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPME 231
MQTIITKACKEL I+ +PSKRCLSLLLWLEERYETVYT+HPGFQKGSKPLL LDNPF +
Sbjct: 2 MQTIITKACKELGIRALPSKRCLSLLLWLEERYETVYTKHPGFQKGSKPLLPLDNPFATK 61
Query: 232 LPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASS 291
LP++LFG++WAFVQLP+SAV+ E S+ E +F +G+ LDLDLLGIE+D+KTLIPGLAVASS
Sbjct: 62 LPEDLFGERWAFVQLPYSAVRAEASASEERFGYGSGLDLDLLGIEIDEKTLIPGLAVASS 121
Query: 292 RAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKA 351
RAK L+A+MNGLE+CSIETDTAR +L LSVGISTRY+YA YKK+P +T EAEAWEAAKKA
Sbjct: 122 RAKILSAFMNGLELCSIETDTARSNLTLSVGISTRYVYATYKKSPTSTKEAEAWEAAKKA 181
Query: 352 CGGLHFLAIQEELDSEDCVGFWLLLD 377
GGLHFLAIQ+ELDSEDC+GFWLLLD
Sbjct: 182 SGGLHFLAIQDELDSEDCIGFWLLLD 207
>gi|302763879|ref|XP_002965361.1| hypothetical protein SELMODRAFT_65804 [Selaginella moellendorffii]
gi|300167594|gb|EFJ34199.1| hypothetical protein SELMODRAFT_65804 [Selaginella moellendorffii]
Length = 290
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 147/291 (50%), Positives = 199/291 (68%), Gaps = 1/291 (0%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVA 152
D SI EW+LDFCSRPI D RGK++WEL++CD L++ +++P+NVINS TLK AI
Sbjct: 1 ADLASIVEWQLDFCSRPIFDDRGKRMWELIICDAKRQLEFARFYPSNVINSTTLKNAIAE 60
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ + +P P ++R+FRSQ++TII+KAC EL I+ S+RC +L+ WL+ERY+ VY +HP
Sbjct: 61 VIETFDLPRPTRVRYFRSQVKTIISKACGELGIQVTSSQRCTALVRWLQERYDQVYRQHP 120
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
GFQ+ + +L++ P E+P N G+KWAFVQL F A+QEE+ +E FG + LD+
Sbjct: 121 GFQENAPSILSMGVSVPKEVPPNYRGEKWAFVQLSFQALQEEIKLVEKGSNFG-EVSLDM 179
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 332
L TLIPG+AVASSR LAAW N LE+ S+ D +L+LS G S ++ Y+ Y
Sbjct: 180 LTELPSPDTLIPGVAVASSRDLALAAWTNSLELASLSVDKKNSALVLSSGASRQWFYSYY 239
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
KK+ EA+ WE+AKKA GGLHFLAIQ L+S C G W+L D P PPV
Sbjct: 240 KKSKQADEEADLWESAKKAAGGLHFLAIQPSLESNSCSGLWILYDFPAPPV 290
>gi|302790880|ref|XP_002977207.1| hypothetical protein SELMODRAFT_55779 [Selaginella moellendorffii]
gi|300155183|gb|EFJ21816.1| hypothetical protein SELMODRAFT_55779 [Selaginella moellendorffii]
Length = 290
Score = 310 bits (795), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 146/291 (50%), Positives = 197/291 (67%), Gaps = 1/291 (0%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVA 152
D SI EW+LDFCSRPI D RGK++WEL++CD L++ +++P+NVINS TLK AI
Sbjct: 1 ADLASIVEWQLDFCSRPIFDDRGKRMWELIICDAKRQLEFARFYPSNVINSTTLKNAIAE 60
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ + +P P ++R+FRSQ++TII+KAC EL I+ S+RC +L+ WL ERY+ VY +HP
Sbjct: 61 VIETFDLPRPTRVRYFRSQVKTIISKACGELGIQVTSSQRCTALVRWLHERYDQVYRQHP 120
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
GFQ+ + +L++ P E+P N G+KWAFVQL F A+QEE+ +E FG + LD+
Sbjct: 121 GFQENAPSILSMGVNVPKEVPPNYRGEKWAFVQLSFQALQEEIKLVEKGSNFG-EVSLDM 179
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 332
L TLIPG+AVASSR LAAW N LE+ S+ D +L+L G S ++ Y+ Y
Sbjct: 180 LTELPSPDTLIPGVAVASSRDLALAAWTNSLELASLSVDKKNSALVLLSGASRQWFYSYY 239
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
KK+ EA+ WE+AKKA GGLHFLAIQ L+S C G W+L D P PPV
Sbjct: 240 KKSKQADEEADLWESAKKAAGGLHFLAIQPSLESNSCSGLWILYDFPAPPV 290
>gi|116783338|gb|ABK22899.1| unknown [Picea sitchensis]
Length = 362
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 143/213 (67%), Positives = 176/213 (82%)
Query: 85 ELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI 144
E++ L + DPESITEWELDFCSRPILDIRGKKIWELVVCD +L++T+++PNNVINSI
Sbjct: 80 EVTKLAADIDPESITEWELDFCSRPILDIRGKKIWELVVCDSKRALEFTRFYPNNVINSI 139
Query: 145 TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
TLK+AI++I LGVP P+ IRFFRSQM+TI++KAC EL I+P+PSKRCLSL+ WLEERY
Sbjct: 140 TLKDAIMSIVQTLGVPKPQTIRFFRSQMKTIVSKACNELGIRPVPSKRCLSLIRWLEERY 199
Query: 205 ETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVF 264
E VY RHPGFQKG+K LL L+ P+ELPDNL G+KWAFVQLP + VQEE++ ++ + F
Sbjct: 200 EPVYMRHPGFQKGAKALLTLEQSSPLELPDNLCGEKWAFVQLPLAVVQEELAIVQEESSF 259
Query: 265 GASLDLDLLGIEVDDKTLIPGLAVASSRAKPLA 297
G+ LDLD LGI + D LIPG+A+ASSRA LA
Sbjct: 260 GSVLDLDTLGISLSDDALIPGVAIASSRAIGLA 292
>gi|307108142|gb|EFN56383.1| hypothetical protein CHLNCDRAFT_35116 [Chlorella variabilis]
Length = 388
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 138/283 (48%), Positives = 192/283 (67%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDFCSRPILD RGKK+WEL++CD + +Y ++ PNN INS LK A+ AI G
Sbjct: 106 WELDFCSRPILDERGKKVWELIICDPQRTFEYAQFIPNNKINSSELKRALEAILAQPGAV 165
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P RFFR QMQTII++A +L I P+PS+RC +L+ WLE+R +VY HPG+ + +
Sbjct: 166 RPTTARFFRGQMQTIISRALSDLGITPMPSRRCFTLMNWLEDRMGSVYEAHPGYNEKAST 225
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
L ++ P +LPD L G+KW+FVQLP + +Q+E+ ++ + FGA+LDL + ++
Sbjct: 226 LFTVEMGAPEDLPDALRGEKWSFVQLPLATLQQELEAVAAGKAFGATLDLGAMRQQLAPD 285
Query: 281 TLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTS 340
TL+PG+AV S RA PLAAW NGL++ ++ DT R LIL G + R+ Y Y++ TT+
Sbjct: 286 TLVPGVAVYSRRADPLAAWTNGLDLSAVVADTDRAFLILETGFNQRWRYGAYRRTLETTA 345
Query: 341 EAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
EA+AWE AK+A GGLHFL + + ++E+ G WLLLD PP V
Sbjct: 346 EAQAWEEAKQAVGGLHFLVVMSDEEAENSSGLWLLLDRKPPNV 388
>gi|145350231|ref|XP_001419517.1| psaB translation factor [Ostreococcus lucimarinus CCE9901]
gi|144579749|gb|ABO97810.1| psaB translation factor [Ostreococcus lucimarinus CCE9901]
Length = 379
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 189/282 (67%), Gaps = 4/282 (1%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
T+W++DFCSRP+ D RGKK+WEL+V D + + ++ +YFPNN INS+ L A+ + +
Sbjct: 99 TDWQIDFCSRPLRDDRGKKVWELLVTDDARTFEHAEYFPNNRINSVELARALERVMAEKK 158
Query: 159 VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGS 218
P + +FFR+QMQTII++AC E+D++P+ S+RC ++ WL ER E VY +HPG+ +
Sbjct: 159 EK-PRRFKFFRAQMQTIISRACNEVDVQPLASRRCQTMTKWLNERVENVYKKHPGYDASA 217
Query: 219 KPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVD 278
PL+A + P LPD L G+ WAFV LP V+EE+ ++ VFGA+L++D +
Sbjct: 218 PPLMAFEATAPKRLPDALRGESWAFVALPLVGVREEMEQVKRGRVFGATLEIDE---NLP 274
Query: 279 DKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVT 338
D TLIPG+AV +SRA LA W GLE+ I +DT S++L G++ + YA ++K+P
Sbjct: 275 DDTLIPGIAVYTSRAAALAGWTKGLELACISSDTQTSSIVLETGVNDSWSYAFFRKSPEL 334
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPP 380
T EA+ WE K+AC GLHFLAIQ + ++E GFW+L D P
Sbjct: 335 TKEAKEWEEVKRACNGLHFLAIQTDEEAEATDGFWILQDSDP 376
>gi|384248807|gb|EIE22290.1| PsaB RNA binding protein [Coccomyxa subellipsoidea C-169]
Length = 304
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 137/280 (48%), Positives = 186/280 (66%), Gaps = 1/280 (0%)
Query: 97 SITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
+ + WELDF SRPILD RGKK WEL++C S Y+K+FPNN INS LK A+ I +
Sbjct: 17 TFSTWELDFSSRPILDARGKKRWELLICSPDRSWVYSKWFPNNRINSTQLKAALQEIIEA 76
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
G P+ +RFFR QMQTII++A +LDIKP+PS+RC SL+ LEER ETVY R G+
Sbjct: 77 EGAVKPQTVRFFRGQMQTIISRALADLDIKPVPSRRCFSLIGLLEERLETVYKRAAGYSD 136
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGI- 275
+ L LD P +LPD L G+ W FVQLP ++EE+ +++++ FGA+ L G+
Sbjct: 137 KATSLFTLDLGPPQDLPDALRGESWLFVQLPLGLLREELRAVDTRQTFGANFALASAGLA 196
Query: 276 EVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKN 335
++ D T IPG+AV S RA PLAAW +GLEV ++ D R L+L G++ R+ Y NY++
Sbjct: 197 DLPDDTPIPGVAVYSRRAVPLAAWTSGLEVANVAADADRACLVLETGVNQRWRYGNYQRT 256
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
P T++A AWEAAK A GLHFL +Q + +++ G WLL
Sbjct: 257 PENTADARAWEAAKIAARGLHFLVVQADEEADTSAGLWLL 296
>gi|159466814|ref|XP_001691593.1| PsaB RNA binding protein [Chlamydomonas reinhardtii]
gi|33235187|emb|CAE17328.1| Tab2 protein [Chlamydomonas reinhardtii]
gi|33235189|emb|CAE17329.1| Tab2 protein [Chlamydomonas reinhardtii]
gi|158278939|gb|EDP04701.1| PsaB RNA binding protein [Chlamydomonas reinhardtii]
Length = 358
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 133/284 (46%), Positives = 185/284 (65%), Gaps = 1/284 (0%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WE+DFCSRP+LD RGKK+WEL++CD + +Y++YFPN+ INS LK I I G
Sbjct: 75 WEIDFCSRPLLDERGKKVWELLICDPERNFEYSEYFPNSKINSAELKRTIERILAQAGAE 134
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
PEK RFFRSQMQTIITKA + IK +PS+RC +++ W+ ER E+VY + P F ++
Sbjct: 135 RPEKARFFRSQMQTIITKALTDCQIKAVPSRRCFTVMSWINERLESVYKQDPRFSDKAQS 194
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGI-EVDD 279
L LD P LPD L G++WAFVQLP + + + ++ +FG+ L +G+ ++
Sbjct: 195 LFQLDLGPPEALPDALRGEQWAFVQLPLGTLLQMLKRVDDAEIFGSGFTLGTVGLADLPA 254
Query: 280 KTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTT 339
LIPG+ V S RA PLAAW NGLE+ +++ D AR LIL G++ R+ Y +++ N +
Sbjct: 255 DILIPGVVVFSRRALPLAAWTNGLEIAAVKADVARSCLILETGVNQRWKYGSWRPNEDSI 314
Query: 340 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
EAE WE AK+ G+HFLA+Q + DSE+ G WLL D PP +
Sbjct: 315 GEAEGWEIAKQGVKGVHFLAVQPDPDSEELNGLWLLQDCEPPTI 358
>gi|302836193|ref|XP_002949657.1| hypothetical protein VOLCADRAFT_74347 [Volvox carteri f.
nagariensis]
gi|300265016|gb|EFJ49209.1| hypothetical protein VOLCADRAFT_74347 [Volvox carteri f.
nagariensis]
Length = 365
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 134/286 (46%), Positives = 183/286 (63%), Gaps = 1/286 (0%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
T WE+DFCSRP+LD RGKK+WEL++CD +Y++YFPN+ INS LK AI I G
Sbjct: 80 TVWEIDFCSRPLLDERGKKVWELLICDPERKFEYSEYFPNSKINSAELKRAIERILAQAG 139
Query: 159 VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGS 218
PEK RFFRSQMQTIITKA + IK +PS+RC +++ W+ ER ++VY P + +
Sbjct: 140 AQRPEKARFFRSQMQTIITKALTDCQIKAVPSRRCFTVMSWINERLDSVYKTDPRYSDKA 199
Query: 219 KPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIE-V 277
+ L LD P LPD L G++WAFVQLP + + + +E +FG + L G++ +
Sbjct: 200 QSLFQLDLGPPEALPDALRGEQWAFVQLPLGTLLQMLRKVEEGEIFGGTFSLGTAGLQDL 259
Query: 278 DDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPV 337
LIPG+ V S RA PLAAW NGLE+ +++ D R LIL G++ R+ Y +++ N
Sbjct: 260 PMDILIPGVVVFSRRALPLAAWTNGLEIAAVKADVQRSCLILETGVNQRWKYGSWRPNED 319
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 383
+ EAE WE AK+ GLHFLA+Q + DSE+ G WLL D PP +
Sbjct: 320 SIGEAEGWEIAKEGVKGLHFLAVQPDPDSEELNGLWLLQDCEPPSI 365
>gi|412990938|emb|CCO18310.1| predicted protein [Bathycoccus prasinos]
Length = 393
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 127/283 (44%), Positives = 179/283 (63%), Gaps = 10/283 (3%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DFCSRP+ D RGKK+WEL++ D + ++ ++FPNN INS+ L +A+ +
Sbjct: 111 WQIDFCSRPLKDDRGKKVWELLITDEDRTFEHAEFFPNNRINSVELSKALQKVVSKRTEE 170
Query: 161 I---PEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
P +++FFRSQM TIIT+ACKE +++P+PS+RC ++L WLEER ETVY +HPG+
Sbjct: 171 TGEGPRRVKFFRSQMMTIITRACKECELEPLPSRRCQTMLNWLEERMETVYKKHPGYDAN 230
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
S PL+ D P LPD L G+ WAFV LP V+EE+ S+ FG DLL I+
Sbjct: 231 SAPLMTFDAQAPKPLPDALRGESWAFVALPLVGVKEEMESVARGKAFG-----DLLNIDP 285
Query: 278 D--DKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKN 335
D D TLIPG+ V ++RA L+ W GLE+ +I D S++L G++ + YA +++
Sbjct: 286 DLPDDTLIPGVVVYTARAAALSGWTKGLELSAITVDLESSSIVLETGVNESWNYAFFRRT 345
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
EA WE K+ GLHFLAIQ + DSE GFW+L D+
Sbjct: 346 KELREEAREWEGVKRQTKGLHFLAIQTDADSETTDGFWVLQDV 388
>gi|255088429|ref|XP_002506137.1| predicted protein [Micromonas sp. RCC299]
gi|226521408|gb|ACO67395.1| predicted protein [Micromonas sp. RCC299]
Length = 274
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 128/278 (46%), Positives = 177/278 (63%), Gaps = 5/278 (1%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+LDFCSRP+ D RGKK+WEL++CD + S +++++FPNN INS+ L +AI + G
Sbjct: 1 WQLDFCSRPMKDERGKKMWELLICDETRSFEHSEFFPNNRINSVELAKAIDRVFVARG-E 59
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P + +FFRSQMQTIIT+AC E+ + P+PS+RC ++ WL+ER ETVY HPG+ + P
Sbjct: 60 RPRRFKFFRSQMQTIITRACGEVGVNPLPSRRCQTMSRWLDERLETVYKTHPGYDGSAAP 119
Query: 221 LLALDNPF-PMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDD 279
+ + P LPD L G+ WAFV LP V+EE + + VFG L++D ++D
Sbjct: 120 NMGFEGGGGPRPLPDALRGESWAFVALPLVGVREEAEQVRANRVFGDLLEIDPT---LED 176
Query: 280 KTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTT 339
TLIPG+AV + RA LA W GLE+ I D G+L+L G+S + YA +++
Sbjct: 177 DTLIPGIAVYTRRAAALAGWTKGLELGGISVDFDMGTLLLDTGVSDSWQYARFRQTKELM 236
Query: 340 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
EA WE K A GLHFLAIQ + D+E GFW+L D
Sbjct: 237 KEAREWEEVKAAVNGLHFLAIQTDEDAETTDGFWILQD 274
>gi|303274889|ref|XP_003056755.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461107|gb|EEH58400.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 270
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 122/267 (45%), Positives = 172/267 (64%), Gaps = 4/267 (1%)
Query: 112 DIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQ 171
D RGKK+WEL++CD S S Q+ ++FPNN INS+ L +AI + ++ G P++ +FFRSQ
Sbjct: 3 DERGKKMWELLICDESRSFQHAEFFPNNRINSVELSKAIQRVLNEQGAR-PKRFKFFRSQ 61
Query: 172 MQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPME 231
MQTIIT+AC ++ + P+PS+RC +L WL++R E VY +HPG+ S P++ + P
Sbjct: 62 MQTIITRACNDVGVPPLPSRRCQTLTRWLDQRAEEVYKKHPGYDGSSSPMMGFETSAPKP 121
Query: 232 LPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASS 291
LPD L G+ WAFV LP V+EE + + VFG LD+D + D TL+PG+AV +
Sbjct: 122 LPDALRGESWAFVALPLIGVKEEAMQVSANRVFGDLLDIDE---ALPDDTLVPGIAVYTR 178
Query: 292 RAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKA 351
RA LA W GLE+ I D G+LIL G++ + YA +++ T EA+ WE K A
Sbjct: 179 RAAALAGWTKGLELGGISVDLDMGTLILDTGVADSWQYARFRQTKELTREAKEWEDVKAA 238
Query: 352 CGGLHFLAIQEELDSEDCVGFWLLLDL 378
GGLHFLAIQ + ++E GFW+L D
Sbjct: 239 AGGLHFLAIQTDEEAESTDGFWILQDF 265
>gi|308807645|ref|XP_003081133.1| Tab2 protein (ISS) [Ostreococcus tauri]
gi|116059595|emb|CAL55302.1| Tab2 protein (ISS) [Ostreococcus tauri]
Length = 300
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 127/330 (38%), Positives = 176/330 (53%), Gaps = 55/330 (16%)
Query: 53 RPRPS-VSESSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPIL 111
R RP+ VS S S P A +P L L ++ W++DFCSRP+
Sbjct: 23 RERPAAVSPFSRSTPTSARRLHTRASATQEPAATLKKLTKD--------WQIDFCSRPLR 74
Query: 112 DIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQ 171
D RGKK+WEL+V D S ++ +YFPNN INS+ L A+ + G P + +FFR+Q
Sbjct: 75 DDRGKKVWELLVTDDERSFEHAEYFPNNRINSVELARALERVMASKGEK-PRRFKFFRAQ 133
Query: 172 MQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPME 231
MQTIIT+AC E+D++ + S+RC ++ WL+ER E+VY +HPG+ + PL+A + P
Sbjct: 134 MQTIITRACTEVDVEALASRRCQTMTNWLDERVESVYKKHPGYDANAPPLMAFEPTAPKR 193
Query: 232 LPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASS 291
LPD L G+ WAFV LP VFGA LD+D + D TLIPG+AV +S
Sbjct: 194 LPDALRGESWAFVALPLVG------------VFGALLDIDE---NLPDDTLIPGIAVYTS 238
Query: 292 RAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKA 351
RA AA ++L G++ + YA ++K P T E + WE K+A
Sbjct: 239 RAAVSAA-----------------HIVLETGVNDSWSYAFFRKTPELTKEPKEWEQVKRA 281
Query: 352 CGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
CGG+ GFW+L D PPP
Sbjct: 282 CGGV-------------TDGFWILRDAPPP 298
>gi|119510299|ref|ZP_01629435.1| hypothetical protein N9414_16117 [Nodularia spumigena CCY9414]
gi|119465043|gb|EAW45944.1| hypothetical protein N9414_16117 [Nodularia spumigena CCY9414]
Length = 287
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 119/287 (41%), Positives = 168/287 (58%), Gaps = 16/287 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD + KK+WE++VC+ + +Y KY P+ +NS+ L+ A+
Sbjct: 5 WEIDFYSRPILDEKQKKVWEVLVCESPSDISTKPESLFRYAKYCPSTQVNSVWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P +IRFFR QM +ITKAC+++ I PS+R L L WL +R E VY + P
Sbjct: 65 AIDKAG-EAPIRIRFFRRQMSNMITKACQDVGIPAQPSRRILVLNQWLRQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ P + LD+P P LPD L G +WAFV L E E FG + L+L
Sbjct: 124 GYQGGTNPSVRLDSPLPQRLPDALEGKQWAFVSL---QAAEFADMSEWDIGFGEAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYAN 331
V +T IPG+ + S RA P+A WM+GLE+ + DT +G L+L G + +I AN
Sbjct: 181 --ANVSPETRIPGVLIFSPRALPIAGWMSGLELACLNFDTKQGQRLVLETGATESWILAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
NP T +EA+ +E AK+ G+HF+ +Q + +E GFWLL +L
Sbjct: 239 I-TNPQTLAEAKGYEQAKEKANGVHFIGVQSDPQAESFTGFWLLQNL 284
>gi|300863927|ref|ZP_07108842.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338047|emb|CBN53988.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 286
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 123/289 (42%), Positives = 172/289 (59%), Gaps = 16/289 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELD+ SRPI+D + KK+WE+++C+ L++ +Y+++ P++ +NS+ L AI
Sbjct: 3 TIWELDYYSRPIVDEQQKKLWEVLICESPLNVGDKSESLFRYSQFCPSSTVNSLWLAAAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P PEKIRFFR QM +I KAC+EL I PS+R +L WL ER E VY
Sbjct: 63 KEAIASSPSP-PEKIRFFRRQMTNMIVKACEELHIPAAPSRRTYALQQWLRERMEDVYPT 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
HPGFQ G P + + P LP+ L G+KW+FV LP A EE+S E + FG + L
Sbjct: 122 HPGFQSGLTPSVQYSSEIPQALPEALLGEKWSFVTLPVEAF-EEMSEWEIE--FGEAFGL 178
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIY 329
+ G++ +T IPGL + SSRA LAAWM+GLE+ + D L+L G S R+I
Sbjct: 179 EAFGLK--PQTPIPGLIIFSSRATALAAWMSGLELAFVTFDGGPPARLVLETGASDRWIL 236
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AN + + +E + +E+AK A +HFLAIQ +SE GFWLL +
Sbjct: 237 ANLRDLSI-VAEVKGFESAKVAANQVHFLAIQSHPESESFAGFWLLQEF 284
>gi|17232380|ref|NP_488928.1| hypothetical protein alr4888 [Nostoc sp. PCC 7120]
gi|17134025|dbj|BAB76587.1| alr4888 [Nostoc sp. PCC 7120]
Length = 286
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/287 (41%), Positives = 167/287 (58%), Gaps = 16/287 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+V+C+ L ++ Y +Y P+ +NS L+ AI
Sbjct: 5 WELDFYSRPILDENQKKVWEVVICESPLDIRTKTDSLFRYAQYCPSTEVNSAWLRTAIQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +I KAC++ I +PS+R L+L WL++R E VY + P
Sbjct: 65 AISKAG-KAPIKIRFFRRQMNNMIVKACEDAGIPALPSRRTLALNQWLKQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q + P + LD+P P LPD L G +W FV L +A E+ E F LD
Sbjct: 124 GYQGVTTPSVRLDSPLPQRLPDALEGQQWVFVSLS-AADLAEMPDWEIGFSEAFPLDF-- 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTRYIYAN 331
++V +T IPG+ + S RA P+A WM+GLE+ + DT++G L+L G + +I AN
Sbjct: 181 --VQVSPETRIPGVLIFSPRALPIAGWMSGLELAFLRVDTSQGMRLVLETGATESWILAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
KNP T EA +E AK+ G+HF+ +Q ++E GFWLL +L
Sbjct: 239 I-KNPTTVQEARGFEEAKQKANGVHFIGVQSNPEAESFAGFWLLQEL 284
>gi|428207859|ref|YP_007092212.1| hypothetical protein Chro_2874 [Chroococcidiopsis thermalis PCC
7203]
gi|428009780|gb|AFY88343.1| protein of unknown function DUF1092 [Chroococcidiopsis thermalis
PCC 7203]
Length = 287
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 120/290 (41%), Positives = 170/290 (58%), Gaps = 16/290 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE+VVC+ L +Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDENQKKVWEVVVCESPLDTRTDPTRLFRYAQYCPSTQVNSAWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P K RFFR QM +ITKACK+L I PS+R L+LL L+ER + VY + P
Sbjct: 65 AMAKAGT-APTKFRFFRRQMNNMITKACKDLGIPAQPSRRTLALLQLLKERMDEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q P + +++ P LPD L G +WAFV L +A+ + E + FG + L +
Sbjct: 124 GYQPTPNPSVKMESSPPQRLPDALTGQQWAFVNLEATALAD---MDEWEIAFGEAFPLQM 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYAN 331
+G+ +T IPGL + S RA PLA WM+GLE+ I +T+ L+L G S +I AN
Sbjct: 181 VGL--SPETTIPGLLIFSERALPLAGWMSGLELAFIRVETSPVARLLLETGASESWILAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
KNP T +EA+A+ +AK+ G+HF+A+Q +E GFWLL ++ P
Sbjct: 239 L-KNPQTVAEAQAFVSAKQQANGVHFIAVQSNPQTESFAGFWLLQEVSIP 287
>gi|434394708|ref|YP_007129655.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
gi|428266549|gb|AFZ32495.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
Length = 309
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 119/285 (41%), Positives = 173/285 (60%), Gaps = 18/285 (6%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS---------LQYTKYFPNNVINSITLKEAIV 151
WE+DF SRPILD KKIWE++VC+ SL+ ++ KY P+ +NS+ L+ A+
Sbjct: 27 WEIDFYSRPILDENQKKIWEVLVCE-SLTDIRTKPDSLFRFAKYCPSTQVNSVWLRTALE 85
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
GV P K RFFR QM +ITKAC++L I PS+R L+L WL++R E VY
Sbjct: 86 EAIAAAGVS-PVKFRFFRRQMNNMITKACEDLGIPAQPSRRTLALNQWLQQRMEEVYPHE 144
Query: 212 PGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLD 271
PG+Q + P + ++ P P LPD L G +WAFV L +A + E + FG + L+
Sbjct: 145 PGYQATTNPSVRMEVPLPQRLPDALIGQQWAFVTLEAAAFAD---MPEWEIGFGEAFPLE 201
Query: 272 LLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETD-TARGSLILSVGISTRYIYA 330
+ G++ + K IPG+ V S RA PLA WM+GLE+ +I+ D T L+L G++ +I A
Sbjct: 202 IAGVKPETK--IPGVIVLSPRAMPLAGWMSGLELANIKFDSTETPQLLLETGVTESWILA 259
Query: 331 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++ K+P +EA+ +E+AK+ G+HFLA+Q + E GFWLL
Sbjct: 260 SF-KDPQMIAEAKGFESAKQQANGVHFLAVQANPEVEAFAGFWLL 303
>gi|427718386|ref|YP_007066380.1| hypothetical protein Cal7507_3136 [Calothrix sp. PCC 7507]
gi|427350822|gb|AFY33546.1| protein of unknown function DUF1092 [Calothrix sp. PCC 7507]
Length = 287
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/290 (41%), Positives = 170/290 (58%), Gaps = 16/290 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD + KK+WE++VC+ ++ Y +Y + +NS L+ A+
Sbjct: 5 WELDFYSRPILDEKQKKVWEVLVCESPSDIRTKTDSLFRYAQYCSSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +ITKAC+++ I PS+R L L WL++R E VY + P
Sbjct: 65 AITTAG-EAPIKIRFFRRQMNNMITKACEDVGIPAQPSRRTLVLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ + L+ P P LPD L G +WAFV L E E + FG S LD
Sbjct: 124 GYQGGANASVRLERPLPQRLPDALEGQQWAFVTLEAGDFAE---MPEWEIGFGESFPLDF 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYAN 331
++ +T IPG+ + S RA PLA WM+GLE+ + DT++G+ L+L G++ +I AN
Sbjct: 181 --AKITPETRIPGVLIFSPRALPLAGWMSGLELAFLRFDTSQGARLLLETGVTESWILAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
KNP T SEAE +EA+K+ G+HF+ +Q ++ GFWLL ++ P
Sbjct: 239 I-KNPQTLSEAEGFEASKQKANGVHFIGVQSNPQAQSFAGFWLLQEVNLP 287
>gi|56750056|ref|YP_170757.1| hypothetical protein syc0047_c [Synechococcus elongatus PCC 6301]
gi|81300399|ref|YP_400607.1| hypothetical protein Synpcc7942_1590 [Synechococcus elongatus PCC
7942]
gi|7328458|dbj|BAA92865.1| ORF285 [Synechococcus elongatus PCC 6301]
gi|22002499|gb|AAM82651.1| unknown [Synechococcus elongatus PCC 7942]
gi|56685015|dbj|BAD78237.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169280|gb|ABB57620.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 285
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 117/286 (40%), Positives = 170/286 (59%), Gaps = 14/286 (4%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDG-------SLSLQYTKYFPNNVINSITLKEAIVAI 153
WELDF SRPILD GKK+WE+ + + +++ +Y + + +NS+TL++A+ +
Sbjct: 5 WELDFYSRPILDEAGKKLWEVAIAETVTTVEAPAVTFRYADFVTGDQVNSVTLQDALKSA 64
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
+ G P P++IR+FR M +I KAC +L + S+R +SL WLEER + VY HPG
Sbjct: 65 IAEAGTP-PDRIRYFRRPMNNMIRKACTDLGLPCQLSRRTVSLHNWLEERRQQVYATHPG 123
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLL 273
+ G + + + P LPD L GD+WAFV LPF+A+ E E FG + L
Sbjct: 124 YNPGPVAGVQMPDEAPQPLPDALRGDRWAFVDLPFAALAEHG---EWGIDFGEA--FPLA 178
Query: 274 GIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 333
GI++ D+T IPGL + +SRA P+AAW++GLE + D+ L+L G S R+ A
Sbjct: 179 GIDLPDETPIPGLIIFASRAMPIAAWLSGLEPAWLTYDSPAKQLLLETGGSERWTLAALN 238
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
P EA + AAK+A GLHFLA+Q + +S+ GFWLL +LP
Sbjct: 239 V-PALQQEATQFNAAKQAAKGLHFLAVQVDPNSDRFAGFWLLRELP 283
>gi|427730243|ref|YP_007076480.1| hypothetical protein Nos7524_3080 [Nostoc sp. PCC 7524]
gi|427366162|gb|AFY48883.1| Protein of unknown function (DUF1092) [Nostoc sp. PCC 7524]
Length = 287
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 171/290 (58%), Gaps = 16/290 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE++VC+ L ++ Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDENQKKVWEVLVCESPLDIRTNLDSLFRYAQYCPSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P KIRFFR QM +ITKAC++L I + S+R L L WLE+R VY + P
Sbjct: 65 AIDKAG-EAPIKIRFFRRQMNNMITKACQDLGIPALSSRRTLVLNQWLEQRMIEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ P + L+NP P LPD L G KW FV L + + E E F + L+L
Sbjct: 124 GYQGGANPSVRLENPLPQRLPDALEGQKWVFVSLSAAELAE---MPEWDIGFREAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYAN 331
++ +T IPG+ + S RA P+A WM+GLE+ + D ++G+ L+L G + +I AN
Sbjct: 181 --AQLSPETRIPGVLIFSPRALPVAGWMSGLELAFLRVDQSQGTRLVLETGTAESWILAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
KN T +EA+ +EAAK+ G+HF+ +Q + +E GFWLL ++ P
Sbjct: 239 I-KNSTTLAEAQGFEAAKQNANGVHFIGVQSDPQAEAFAGFWLLQEVNLP 287
>gi|75908378|ref|YP_322674.1| hypothetical protein Ava_2159 [Anabaena variabilis ATCC 29413]
gi|75702103|gb|ABA21779.1| Protein of unknown function DUF1092 [Anabaena variabilis ATCC
29413]
Length = 286
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 169/287 (58%), Gaps = 16/287 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+V+C+ L ++ Y +Y P+ +NS+ L+ AI
Sbjct: 5 WELDFYSRPILDENQKKVWEVVICESPLDIRTKTDSLFRYAQYCPSTEVNSVWLRTAIQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +I KAC++ I + S+R L+L L++R E VY + P
Sbjct: 65 AISKAG-EAPIKIRFFRRQMNNMIVKACEDAGIPALASRRTLALNQLLKQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ P + LD+P P LPD L G +W FV L + + E E + F + LD
Sbjct: 124 GYQGGTTPSVRLDSPLPQRLPDALEGQQWVFVSLSAADLAE---MPEWEIGFSEAFPLDF 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYAN 331
++V ++ IPG+ + S RA P+A WM+GLE+ + DT++G+ L+L G + +I AN
Sbjct: 181 --VQVSPESRIPGVLIFSPRALPIAGWMSGLELAFLRVDTSQGTRLVLETGATESWILAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
KNP T EA +E AK+ G+HF+ +Q ++E GFWLL ++
Sbjct: 239 I-KNPTTLQEARGFEEAKQKANGVHFIGVQSNPEAESFAGFWLLQEV 284
>gi|414077821|ref|YP_006997139.1| hypothetical protein ANA_C12609 [Anabaena sp. 90]
gi|413971237|gb|AFW95326.1| hypothetical protein ANA_C12609 [Anabaena sp. 90]
Length = 291
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/290 (38%), Positives = 173/290 (59%), Gaps = 16/290 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ + + +Y KY P+ +NS L+ AI
Sbjct: 9 WELDFYSRPILDENQKKVWEMLVCESPVDIGTQTDSLFRYAKYCPSTQVNSGWLRTAIQE 68
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
++ G P KIRFFR QM +ITK+C+++ + +PS+R L L W+++R + VY + P
Sbjct: 69 AIEEAGAS-PTKIRFFRRQMNNMITKSCEDVGVPAVPSRRTLVLNQWIQQRMKEVYPQEP 127
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q + P + LD P P LPD L G +WAFV L S + + + + FG + L+L
Sbjct: 128 GYQGVANPSVRLDKPLPQRLPDALEGKQWAFVTLEASDLAQ---MPDWEIGFGEAFPLEL 184
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYAN 331
E+ +T IPG+ + S RA P+A WM+GLE+ + DT +G+ LIL G + ++ AN
Sbjct: 185 --AELRPETRIPGILIFSPRALPIAGWMSGLEMAYLHFDTKQGNRLILETGATESWVVAN 242
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
+ P +EA+ + AK+ G+HF+ +Q + S+D GFWLL ++ P
Sbjct: 243 I-RTPELLAEAQGFTVAKEQANGVHFIGVQSDPQSQDFAGFWLLQEINLP 291
>gi|428311384|ref|YP_007122361.1| hypothetical protein Mic7113_3216 [Microcoleus sp. PCC 7113]
gi|428252996|gb|AFZ18955.1| Protein of unknown function (DUF1092) [Microcoleus sp. PCC 7113]
Length = 287
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/290 (43%), Positives = 167/290 (57%), Gaps = 20/290 (6%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIV- 151
WELDF SRPILD KKIWE++VC+ L QYT++ P+ +NSI L+EA+
Sbjct: 5 WELDFYSRPILDENQKKIWEILVCESPLDTRQSPDELFQYTQFCPSQQVNSIWLREALAE 64
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
AI P EKIRFFR QM +ITKAC+EL I+ IPS+R +L WLE+R Y +H
Sbjct: 65 AIAQSKQTP--EKIRFFRRQMTNMITKACEELGIQVIPSRRTYTLERWLEQRILGFYPKH 122
Query: 212 PGFQ--KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLD 269
PG++ + + P LPD L DKWAFV L A +E E F +
Sbjct: 123 PGYKPTAAASSFVQYQPQIPQPLPDALEYDKWAFVTLEAGAFEEMN---EWDIGFSEAFP 179
Query: 270 LDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYI 328
L ++G+ D T IPG+ + SSRA PLA WM+GLE+ + D+A + L+L G S +I
Sbjct: 180 LSMMGLAPD--TPIPGIIIFSSRATPLAGWMSGLELAFVRFDSAESARLLLETGASDSWI 237
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
A K + T +EA+ +E K+ G+HFLAIQ SE GFWLL +L
Sbjct: 238 LATLKDSQ-TLAEAQGFELTKQNAEGVHFLAIQSTPTSESFAGFWLLQEL 286
>gi|334120908|ref|ZP_08494985.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
gi|333455907|gb|EGK84547.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
Length = 286
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/293 (43%), Positives = 171/293 (58%), Gaps = 24/293 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSI----TL 146
T WELDF SRPILD R KK WE+++C+ L++ +Y+++ ++ +NS+ L
Sbjct: 3 TIWELDFYSRPILDEREKKKWEVLICESPLNVGDKAESLFRYSQFCSSSTVNSLWLAGAL 62
Query: 147 KEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYET 206
KEAI A PEKIRFFR QM +ITKAC++LDI S+R L+L LWLEER +
Sbjct: 63 KEAIAAAPKR-----PEKIRFFRRQMANMITKACEDLDIPAACSRRTLALSLWLEERMQD 117
Query: 207 VYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGA 266
VY PG+Q P + P+ LPD L G+KW FV LP +A E E FG
Sbjct: 118 VYPAEPGYQAVVNPSVQFVPETPVALPDALIGEKWTFVSLPIAAFDEMS---EWDIGFGE 174
Query: 267 SLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGIST 325
+ L + + +T IPGL + SSRA LA WM+GLE+ ++ ++ L+L G +
Sbjct: 175 AFGLPM--TRLAPETQIPGLIIYSSRATALAGWMSGLELAFLKFESGPPARLVLDTGAND 232
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
R+I AN ++ T EA+ +EAAKK +HFLAIQ DSE GFWLL +L
Sbjct: 233 RWILANL-RDAATEREAKGFEAAKKQAKQVHFLAIQSNPDSESFAGFWLLHEL 284
>gi|186682051|ref|YP_001865247.1| hypothetical protein Npun_R1620 [Nostoc punctiforme PCC 73102]
gi|186464503|gb|ACC80304.1| protein of unknown function DUF1092 [Nostoc punctiforme PCC 73102]
Length = 286
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/287 (40%), Positives = 168/287 (58%), Gaps = 16/287 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KKIWE++VC+ L + +Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDDNQKKIWEVLVCESPLDIGTKPDSLFRYAQYCPSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +ITKAC+++ I PS+R L L WLEER + VY + P
Sbjct: 65 AITQAG-KAPIKIRFFRRQMNNMITKACQDVGIPAQPSRRTLVLNQWLEERMKEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ P + L+ P P LPD L G +W FV L + + E E + FG + L+L
Sbjct: 124 GYQGGTNPSVRLEKPLPQRLPDALEGQQWVFVTLDAADLAE---MPEWEIGFGEAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYAN 331
+V + IPG+ + S RA PLA WM+GLE+ + DT+ L+L G++ +I AN
Sbjct: 181 --AKVSPEARIPGILIFSPRALPLAGWMSGLELAFLRFDTSEEARLLLETGVNESWIVAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
KK P +EA+ +E AK+ G+HF+ IQ + ++ GFWLL ++
Sbjct: 239 IKK-PQVLAEAKGFEEAKQKANGVHFIGIQSDPKAQSFAGFWLLQEV 284
>gi|427705901|ref|YP_007048278.1| hypothetical protein Nos7107_0455 [Nostoc sp. PCC 7107]
gi|427358406|gb|AFY41128.1| protein of unknown function DUF1092 [Nostoc sp. PCC 7107]
Length = 287
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 165/287 (57%), Gaps = 16/287 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE+VVC+ L ++ Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDENQKKVWEVVVCESPLDIRAQTDSLFRYAQYCPSTEVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P K+RFFR QM +ITKAC++L I PS+R L L WL++R E VY + P
Sbjct: 65 AIDKAG-EAPIKVRFFRRQMNNMITKACQDLGIPAQPSRRTLLLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ P + LD+P P LPD L G +W FV L + E E FG + L++
Sbjct: 124 GYQGGNNPSVRLDSPLPQRLPDALEGQQWVFVSL---SAGELAEMPEWDIGFGEAFPLEM 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYAN 331
++ + IPG+ + S RA PLA WM+GLE+ + D + G+ LIL G + +I AN
Sbjct: 181 --AQLSPEARIPGVLIFSPRALPLAGWMSGLELAFLRVDQSVGTRLILETGATESWIVAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
KNP EA+ + +K+ G+HF+ +Q +E GFWLL ++
Sbjct: 239 I-KNPQLLVEAKGFAESKQQANGVHFIGVQSSPQAESFAGFWLLQEV 284
>gi|428319661|ref|YP_007117543.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
gi|428243341|gb|AFZ09127.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
Length = 286
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 126/293 (43%), Positives = 173/293 (59%), Gaps = 24/293 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITL---- 146
T WELDF SRPI+D R KK WE+++C+ L++ +Y+++ ++ +NS+ L
Sbjct: 3 TIWELDFYSRPIIDEREKKKWEVLICESPLNVGDKAESLFRYSQFCSSSTVNSLWLAGAI 62
Query: 147 KEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYET 206
K+AI A PEKIRFFR QM +ITKAC+ELDI S+R L+L LWLEER +
Sbjct: 63 KDAIAAAPKR-----PEKIRFFRRQMANMITKACEELDIPAACSRRTLALSLWLEERMQD 117
Query: 207 VYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGA 266
VY PG+Q P + P+ LPD L G+KWAFV LP +A +E+S + F
Sbjct: 118 VYPAEPGYQPVVNPSVQFIPETPVALPDALIGEKWAFVSLPIAAF-DEMSEWDIGFGEAF 176
Query: 267 SLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGIST 325
L + LG KT IPGL + SSRA LA WM+GLE+ ++ ++ L+L G +
Sbjct: 177 GLPMTALG----PKTQIPGLIIYSSRATALAGWMSGLELAFLKFESGPPARLVLDTGAND 232
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
R+I AN ++ T EA+ +EAAK +HFLAIQ +SE GFWLL +L
Sbjct: 233 RWILANL-RDAATEREAKGFEAAKNQAKKVHFLAIQSNPESESFAGFWLLHEL 284
>gi|119490556|ref|ZP_01622998.1| hypothetical protein L8106_07991 [Lyngbya sp. PCC 8106]
gi|119453884|gb|EAW35040.1| hypothetical protein L8106_07991 [Lyngbya sp. PCC 8106]
Length = 286
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/289 (42%), Positives = 170/289 (58%), Gaps = 16/289 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRP+ D GKK+WE+++C L + +YT++ P+ +NSI L+ AI
Sbjct: 3 TIWELDFYSRPLRDEEGKKVWEVLICQTPLEIGDRAESLFRYTQFCPSTDVNSIWLQGAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
A + P++IRFFR M +I KACKEL I S+R +L WL+ER E VY
Sbjct: 63 QAAIKE-ADETPQRIRFFRRPMANMILKACKELAIPVTASRRTYALFQWLDERIENVYPT 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
P +Q+ + P + + P LPD L GD+WAFV L SA EE+S E FG + L
Sbjct: 122 LPNYQETANPSVQFASSPPQRLPDALQGDQWAFVSLEASAF-EEMS--EWNIGFGEAFGL 178
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIY 329
+LG+ +T IPGL V SSRA PLAAWM+GLE+ + + R SL+L G + +I
Sbjct: 179 PMLGL--SGETQIPGLIVFSSRATPLAAWMSGLELAFLRVNKGDRPSLLLETGENDSWIL 236
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AN + T +EAE +E AK+ +HFLA+Q + ++E G W+L +L
Sbjct: 237 ANL-TDAGTQAEAEQFEEAKRQAKNVHFLAVQSDPNTESFAGLWMLQEL 284
>gi|427740039|ref|YP_007059583.1| hypothetical protein Riv7116_6716 [Rivularia sp. PCC 7116]
gi|427375080|gb|AFY59036.1| Protein of unknown function (DUF1092) [Rivularia sp. PCC 7116]
Length = 284
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 170/287 (59%), Gaps = 16/287 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELD+ SRPILD KK+WE+++C+ L + +Y KY + +NS+ L+ A+
Sbjct: 5 WELDYYSRPILDENKKKVWEVLICETPLDISSKTDSLFRYAKYCSSATVNSVWLQTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +ITKAC+E+ I S+R L+L WL++R + VY +
Sbjct: 65 AIGKAG-EAPVKIRFFRRQMNNMITKACEEIGIPAQTSRRTLALNQWLQQRMDEVYPQEA 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ P + L++P P LPD L G++ FV L + + E FG + LDL
Sbjct: 124 GYQGGTNPSVRLESPLPQRLPDALEGEQLQFVTL---SAADFADMPEWNIDFGEAFPLDL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYAN 331
GI ++K IPG+ + S+RA P+AAWM+GLE+ + D+++ G L+L G + +I AN
Sbjct: 181 AGISSENK--IPGVLIFSNRALPIAAWMSGLELAWLRFDSSKTGRLLLETGATESWILAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
KNP EA+ +E AK+ G+HF+ +Q + SE GFWLL ++
Sbjct: 239 I-KNPQMLLEAQNFEQAKQKANGVHFIGVQSDPTSESFAGFWLLREI 284
>gi|220906218|ref|YP_002481529.1| hypothetical protein Cyan7425_0781 [Cyanothece sp. PCC 7425]
gi|219862829|gb|ACL43168.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7425]
Length = 288
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 118/283 (41%), Positives = 157/283 (55%), Gaps = 14/283 (4%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAIC----DD 156
WE+DF SRP+LD KKIWEL+VCD +Y + + N+ L+ +
Sbjct: 5 WEIDFYSRPLLDENQKKIWELLVCDPDRRFEYVQTCSGSQANARWLQTELATALPLWRQA 64
Query: 157 LGVP---IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
L +P +PEKIRFFR QM +IIT+AC +L I P PS+R +L WL+ER E VY + PG
Sbjct: 65 LELPETAMPEKIRFFRRQMNSIITRACTDLGIPPQPSRRTFTLYQWLKERSEKVYPQQPG 124
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLL 273
FQ + LA + P LPD L G W F L A E ++ E FG L L
Sbjct: 125 FQPLAMSPLAFEASPPQPLPDALMGQGWTFASL---AASEFAAATEWSITFGEVFPLSRL 181
Query: 274 GIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANY 332
G+ +T++PGL + SSRAKPLA WM+GLE+ + +T LIL G+S R+I A
Sbjct: 182 GL--SPETVVPGLIIFSSRAKPLAGWMSGLELACLTLETEPVPQLILETGVSDRWILARL 239
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ P E +E K+ G +HFLA+Q SED GFW+L
Sbjct: 240 -RTPQLLEEGRNFEQTKQQAGQVHFLAVQTNPQSEDFAGFWVL 281
>gi|434404855|ref|YP_007147740.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
gi|428259110|gb|AFZ25060.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
Length = 287
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 112/290 (38%), Positives = 169/290 (58%), Gaps = 16/290 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE++VC+ + +Y +Y P+ +NS L+ A+
Sbjct: 5 WEVDFYSRPILDENQKKVWEVLVCETPSGIGTNIDSLFRYAQYCPSTQVNSGWLRTALQQ 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P K+RFFR QM +ITKAC+++ + +PS+R L L WL++R E VY + P
Sbjct: 65 AINKAG-EAPIKVRFFRRQMNNMITKACEDVGVPALPSRRTLFLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G+ + LD P P LPD L G +WAFV L Q+ E + FG + L+L
Sbjct: 124 GYQGGANASVRLDRPLPQRLPDALEGKQWAFVTL---EAQDFADMPEWEIGFGEAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTRYIYAN 331
+ + + IPG+ + S RA PLA WM+GLE+ ++ DT+ G LIL G + +I AN
Sbjct: 181 AKLSPEAR--IPGILIFSPRALPLAGWMSGLELAYLKFDTSLGERLILETGATESWIVAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
+ P EA+ +E+ K+A G+HF+ +Q + ++ GFWLL ++ P
Sbjct: 239 I-RTPQLLVEAKGFESTKQAANGVHFIGVQSDAQAQSFAGFWLLQEINLP 287
>gi|428301149|ref|YP_007139455.1| hypothetical protein Cal6303_4583 [Calothrix sp. PCC 6303]
gi|428237693|gb|AFZ03483.1| protein of unknown function DUF1092 [Calothrix sp. PCC 6303]
Length = 287
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 116/292 (39%), Positives = 168/292 (57%), Gaps = 16/292 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WEL++C+ +Y +Y P+ +NS L+ AI
Sbjct: 3 TTWELDFYSRPILDENQKKVWELLLCESPKDSRTKVDSLFRYAQYCPSTEVNSAWLRTAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
G P +IRFFR QM +ITKAC++ I S+R L L WL++R + VY +
Sbjct: 63 QEAISKAG-EAPTRIRFFRRQMNNMITKACQDSGIPAQSSRRILVLHQWLQQRMDEVYPQ 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
PG+Q GS P + LD P P LPD L + WAFV+L ++ + E + FG L
Sbjct: 122 EPGYQGGSNPSVRLDAPVPQRLPDALELENWAFVRL---TAKDFLDMPEWEIGFGEGFPL 178
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIY 329
+L ++ D T I G+ + SSR+ PLAAWM+GLE+ ++ D + G L+L G + +I
Sbjct: 179 EL--AQISDDTPISGVLIFSSRSLPLAAWMSGLELGYLKFDQSEGGRLLLETGATESWIV 236
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
AN + + V +EA+ +E AK++ G+HF+ +Q SE GFWLL ++ P
Sbjct: 237 ANIRNSQV-INEAKNFEVAKQSANGVHFIGVQANPQSESFAGFWLLQEVTLP 287
>gi|354566488|ref|ZP_08985660.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
gi|353545504|gb|EHC14955.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
Length = 288
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 116/288 (40%), Positives = 163/288 (56%), Gaps = 17/288 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ L +Y +Y P+ +NS+ L+ A+
Sbjct: 5 WELDFYSRPILDENQKKVWEVLVCESPLDTRTKVDSLFRYAQYCPSTQVNSVWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P KIRFFR QM +ITKAC ++ I PS+R L L WL++R E VY + P
Sbjct: 65 AIDKAG-EAPIKIRFFRRQMNNMITKACGDIGIPAQPSRRTLVLNQWLQQRIEQVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q G P + L+ P P LPD L +W FV L S E + + FG L+L
Sbjct: 124 GYQGGVNPSVRLEAPLPQRLPDALEWQQWGFVTLLGS---EFADMPDWEIDFGEGFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA--RGSLILSVGISTRYIYA 330
+V +T IPG+ + S RA PLA WM+GL++ + D + G L+L G + +I A
Sbjct: 181 --AQVSPETSIPGILIFSPRALPLAGWMSGLDLAWLRFDDSPQGGRLLLETGATESWILA 238
Query: 331 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
N KNP +EA +E AK+ G+HF+ +Q + S+ GFWLL ++
Sbjct: 239 NL-KNPQILAEARNFEQAKQQANGVHFIGVQSDPQSQSFAGFWLLCEI 285
>gi|427419514|ref|ZP_18909697.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
gi|425762227|gb|EKV03080.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
Length = 285
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 114/282 (40%), Positives = 163/282 (57%), Gaps = 14/282 (4%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIVAIC 154
WELDF SRP+LD KK WE+++CDG+ S ++Y+K+ N +NSI L++AI
Sbjct: 5 WELDFYSRPVLDDNQKKRWEVLLCDGAQSVADSSRIRYSKFLSNKQVNSIELQQAIEEAI 64
Query: 155 DDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGF 214
+ G P +IRFFR QMQ +I +AC EL + S+R L+L WLE+R E Y + PG+
Sbjct: 65 EKAGES-PTQIRFFRYQMQNMIKRACDELGVSARLSRRTLTLQTWLEDRQENFYPQQPGY 123
Query: 215 QKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLG 274
Q+G P LPD L G +WA V LP +E E + FG + L+L G
Sbjct: 124 QEGKSPATVQPVEVARPLPDALIGQRWAMVSLP---AKEFADMPEWEIGFGEAFPLELAG 180
Query: 275 IEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYK 333
I D T++PG+ + S RA PLA WM+GLE+ ++ + S L+L G + +I A+
Sbjct: 181 IGPD--TMVPGILIFSERALPLAGWMSGLEMAYLDVQIDQISQLLLETGSNDTWIMASLN 238
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ P EAE + AAK+ +HF+A+Q+ DSE GFWL+
Sbjct: 239 R-PELKQEAERFMAAKEEANQVHFVAVQDNPDSESFAGFWLM 279
>gi|282901672|ref|ZP_06309588.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
gi|281193435|gb|EFA68416.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
Length = 289
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 107/290 (36%), Positives = 170/290 (58%), Gaps = 19/290 (6%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++C+ + +Y +Y P+ +NS+ L++A+
Sbjct: 5 WELDFYSRPILDANQKKVWEVLICESPTDVLTKVDSLFRYAQYCPSTQVNSVWLRQALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ GV P KIRFFR QM +ITKAC+++ I +PS++ L L W+++R E VY + P
Sbjct: 65 AIEKAGVA-PIKIRFFRRQMNNMITKACQDMGIPALPSRKTLVLNQWIQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+++ + + L+ P P LPD L G +W FV L S + + E + FG + L+L
Sbjct: 124 GYEQVTNSSVRLERPLPQRLPDALEGKQWTFVSLGASDITD---MPEWEIAFGEAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS----LILSVGISTRYI 328
G+ + IPG+ + S RA P+A WM+GLE+ + D+ R + L+L G + +I
Sbjct: 181 AGL--SPEIPIPGILIFSPRALPIAGWMSGLELAYLRLDSNRNNQGDRLVLETGGTESWI 238
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AN + P +EA+ +E AK+ G+HF+ +Q + S+ GFWLL ++
Sbjct: 239 LANL-RTPQLLAEAKGFEEAKQKADGVHFIGVQSDPQSQSFAGFWLLKEI 287
>gi|298490971|ref|YP_003721148.1| hypothetical protein Aazo_1936 ['Nostoc azollae' 0708]
gi|298232889|gb|ADI64025.1| protein of unknown function DUF1092 ['Nostoc azollae' 0708]
Length = 286
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 109/287 (37%), Positives = 166/287 (57%), Gaps = 16/287 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ + ++ Y +Y P+ +NS+ L+ A+
Sbjct: 5 WELDFYSRPILDANQKKVWEILVCESPVDVRTKTDSLFRYAQYCPSTQVNSVWLRTALEE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +ITKAC++ I +PS+R L L WL++R E VY +
Sbjct: 65 AINKAG-EAPIKIRFFRRQMNNMITKACQDAGIPALPSRRALVLNQWLQQRMEEVYPQEL 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q + P + LD P P LPD L G +WAFV L ++ V + + FG + L+L
Sbjct: 124 GYQGEANPSVRLDRPLPQRLPDALEGKQWAFVTL---EAKDFVDMPDWEIAFGEAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTRYIYAN 331
++ + IPG+ + S RA P+A WM+GLE+ + DT++G LIL G + ++ AN
Sbjct: 181 --AQLSPEIRIPGILIFSPRALPIAGWMSGLEMAYLRFDTSQGDRLILETGATESWVLAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ P EA+ +E K+ G+HF+ +Q + + GFWLL ++
Sbjct: 239 I-RTPQLLKEAQGFEETKQKANGVHFIGVQSDPQVQSFSGFWLLQEV 284
>gi|409993875|ref|ZP_11277002.1| hypothetical protein APPUASWS_22218 [Arthrospira platensis str.
Paraca]
gi|291566596|dbj|BAI88868.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409935287|gb|EKN76824.1| hypothetical protein APPUASWS_22218 [Arthrospira platensis str.
Paraca]
Length = 287
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 117/289 (40%), Positives = 170/289 (58%), Gaps = 16/289 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRP+ D GKK+WE+++C+ L ++ YT++ P+ +NSI L+ AI
Sbjct: 4 TIWELDFYSRPLRDEDGKKVWEVIICETPLDVRSRPESLFRYTQFCPSTQVNSIWLQGAI 63
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+P P KIRFFR M +I+KA + LDI S+R +L WL+ER + VY
Sbjct: 64 EEAIAQAPLP-PSKIRFFRRPMANMISKAAEGLDIPASASRRTYTLFQWLQERIDKVYPT 122
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
+P +Q+G+ P + + P LPD L G++WA V L +A Q+ E FG + L
Sbjct: 123 YPNYQEGTNPSVQFVSGEPQPLPDALQGEQWAIVSLEAAAFQDMP---EWDIGFGEAFSL 179
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIY 329
++G+ +TL+PGL + S+RA PLAAWM+GLE+ + +T R SLIL G + +I
Sbjct: 180 PMMGL--SPETLVPGLIIFSTRAIPLAAWMSGLELAFLRLLETPRPSLILETGENESWIL 237
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AN + T +EA +E AK + +HFLAIQ + +SE GFW+L L
Sbjct: 238 ANLTDSK-TQTEARNFEQAKLSAKNVHFLAIQSDPNSESFAGFWMLQQL 285
>gi|428211001|ref|YP_007084145.1| hypothetical protein Oscil6304_0478 [Oscillatoria acuminata PCC
6304]
gi|427999382|gb|AFY80225.1| Protein of unknown function (DUF1092) [Oscillatoria acuminata PCC
6304]
Length = 293
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 120/293 (40%), Positives = 169/293 (57%), Gaps = 22/293 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WELDF S+PILD GKK WE+++C+ L+++KY ++ +NSI L AI
Sbjct: 5 WELDFYSKPILDENGKKRWEVLICESPTDICSTTDELLRFSKYCSSSEVNSIWLGNAINE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P +IRFFR QM +ITKACK+L I PS+R ++L WL++R +TVY P
Sbjct: 65 AIATAGKS-PTQIRFFRRQMNNMITKACKDLGINSKPSRRTVALYRWLQDRMDTVYPLEP 123
Query: 213 GFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLD 271
GFQ G P + + P P LPD L GD+WAFV L + E+S E F S
Sbjct: 124 GFQGAGLNPSVQFETPKPERLPDALQGDRWAFVSLEAGSFA-EMSEWEIDF----SEAFP 178
Query: 272 LLGI-----EVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTAR-GSLILSVGIST 325
+LG ++ T+IPG+ V S+RAK +AAWM+GLE+ ++ + G ++L G +
Sbjct: 179 ILGEKSLVPQITPDTIIPGMIVFSNRAKAIAAWMSGLELGFLKPELEEPGQVVLETGFNE 238
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
R+I AN + T +EA+ + K G+HFLAIQ + +SE GFWLL +L
Sbjct: 239 RWILANL-TDKTTRAEAQGFAETKDKAQGVHFLAIQTDPNSESFAGFWLLQEL 290
>gi|440681970|ref|YP_007156765.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
gi|428679089|gb|AFZ57855.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
Length = 287
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 164/290 (56%), Gaps = 16/290 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ ++ Y +Y P+ +NS L+ A+
Sbjct: 5 WELDFYSRPILDENQKKVWEVLVCESPSDVRTKTDSLFRYAQYCPSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +ITKAC+ + I I S+R L L WL++R E VY + P
Sbjct: 65 AIEKAG-EAPIKIRFFRRQMNNMITKACEGVGIPAISSRRTLFLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q + P + LD P P LPD L G +WAFV L E E + FG + L+L
Sbjct: 124 GYQGIANPSVRLDKPLPQRLPDALEGKQWAFVTLDAGDFAE---MPEWEIGFGEAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTRYIYAN 331
+ + + IPG+ + S RA PLA WM+GLE+ + DT +G LIL G + +I AN
Sbjct: 181 AKLSPEAR--IPGILIFSPRALPLAGWMSGLEMAYLHFDTKQGDRLILETGATESWIVAN 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
K P +EA+ + AK+ G+HF+ +Q + ++ GFWLL ++ P
Sbjct: 239 I-KTPQLLAEAQGFAQAKEKANGVHFIGVQSDPQAQSFAGFWLLQEVNLP 287
>gi|411119159|ref|ZP_11391539.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711022|gb|EKQ68529.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
Length = 288
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 168/287 (58%), Gaps = 17/287 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD GKK+WE+V+C+ + ++ +Y + +NS L +A+
Sbjct: 3 TIWELDFYSRPILDEHGKKVWEVVLCESPTQIKAEPDRLFRFAEYCASTEVNSERLVQAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P +IRFFR M+ +ITKAC +L++ + S+R +L WL++R+ Y +
Sbjct: 63 QTAIAQAPSP-PSRIRFFRQAMKNMITKACNDLNLPSVLSRRTYALNQWLQQRFAEEYPK 121
Query: 211 HPGFQKGSKPLLALDNPFPME-LPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLD 269
HPGFQ GS P ++ ++ LPD L G KWAFV L + + EE+ E FG +
Sbjct: 122 HPGFQAGSNPSVSFAATTAVQSLPDALIGQKWAFVSLE-AGMLEEMD--EWAIDFGEAFP 178
Query: 270 LDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVGISTRYI 328
L L+ + D ++PG+ + S RA P+A WM+GLE+ S++ DT + L+L G S R+I
Sbjct: 179 LSLVNLSPD--AIVPGVIIFSPRAVPMAGWMSGLELGSLKLDTESTPRLLLETGGSDRWI 236
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
A+ V T EA+ +E AK+ G+HFLAIQ D+E GFWLL
Sbjct: 237 LASLNNAQVQT-EAQNFETAKQKANGVHFLAIQAAPDTETFAGFWLL 282
>gi|428304460|ref|YP_007141285.1| hypothetical protein Cri9333_0858 [Crinalium epipsammum PCC 9333]
gi|428245995|gb|AFZ11775.1| protein of unknown function DUF1092 [Crinalium epipsammum PCC 9333]
Length = 287
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 115/289 (39%), Positives = 161/289 (55%), Gaps = 16/289 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAI 150
T WELDF SRPI+D KKIWE++VC+ + +Y +Y P+ +NS++L+ A+
Sbjct: 3 TIWELDFYSRPIIDENQKKIWEVLVCESPVDTRQSVESLFRYAQYCPSTQVNSVSLQNAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +I KAC +L I PS+R ++ WL ER + VY
Sbjct: 63 TEAIEKSGQS-PQKIRFFRRQMNNMIVKACTDLGILAEPSRRTYAVHQWLRERMQDVYPS 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
HP +Q + P + + P LPD L G KW FV L SA E E F + L
Sbjct: 122 HPNYQPSNSPSVQFEVQPPQPLPDALIGQKWMFVSLDASAFAE---MHEWNIGFSEAFPL 178
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIY 329
++L + +T IPG+ + S RA P+AAWM+G+E I+ A + L+L G S +
Sbjct: 179 EML--HLSPQTRIPGIIILSPRAIPMAAWMSGIEPALIKFYPAPQARLLLETGGSDSWFL 236
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ N + +EA +EAAK+ G+HFLAIQ SED GFWLL +L
Sbjct: 237 VK-QLNGSSQTEAAGFEAAKQQAKGVHFLAIQSSPQSEDFAGFWLLQEL 284
>gi|209528431|ref|ZP_03276864.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
gi|376003070|ref|ZP_09780887.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067161|ref|ZP_17055951.1| hypothetical protein SPLC1_S532420 [Arthrospira platensis C1]
gi|209491136|gb|EDZ91558.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
gi|375328518|emb|CCE16640.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406711447|gb|EKD06648.1| hypothetical protein SPLC1_S532420 [Arthrospira platensis C1]
Length = 287
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 114/289 (39%), Positives = 171/289 (59%), Gaps = 16/289 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRP+ D GKK+WE+++C+ L ++ YT++ P+ +NSI L+ AI
Sbjct: 4 TIWELDFYSRPLRDEDGKKVWEVIICETPLDVRSRPESLFRYTQFCPSTQVNSIWLQGAI 63
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+P P KIRFFR M +I+KA + LDI S+R +L WL+ER + VY
Sbjct: 64 QEAIAQAPLP-PSKIRFFRRPMANMISKAAEGLDIPASASRRTYTLFQWLQERIDKVYPT 122
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
+P +Q+G+ P + + P LPD L G++WA V L +A ++ E FG + L
Sbjct: 123 YPNYQEGTNPSVQFVSGEPQPLPDALQGEQWAIVSLEAAAFED---MPEWDIGFGEAFSL 179
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIY 329
++G+ +T +PGL + ++RA PLAAWM+GLE+ + +T R +LIL G + +I
Sbjct: 180 PMMGL--SPETPVPGLIIFTTRAIPLAAWMSGLELAFLRLVETPRPNLILETGENESWIL 237
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AN +P T +EA+ +E AK + +HFLAIQ + +SE GFW+L L
Sbjct: 238 ANL-TDPKTQTEAKNFEQAKLSAKNVHFLAIQSDPNSESFAGFWMLQQL 285
>gi|332712125|ref|ZP_08432053.1| protein of unknown function, DUF1092 [Moorea producens 3L]
gi|332348931|gb|EGJ28543.1| protein of unknown function, DUF1092 [Moorea producens 3L]
Length = 287
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 117/292 (40%), Positives = 164/292 (56%), Gaps = 19/292 (6%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+++C+ L + QY + PN +NSI L +A+
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVLICESPLDINLSPETLFQYASWCPNQQVNSIWLGQAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P KIRFFR QM +ITKAC EL+I PS+R +L WL++R E Y
Sbjct: 63 ADAIAKAQQP-PSKIRFFRRQMNNMITKACNELNIPAQPSRRTYALERWLKQRIEDFYPN 121
Query: 211 HPGFQ--KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
PG+ + + +P P LPD L G KWA V L +A +E E + FG +
Sbjct: 122 QPGYDPAAAASSFVRYQSPIPKPLPDALQGQKWAVVSLQAAAFEEMN---EWEIDFGEAF 178
Query: 269 DLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA--RGSLILSVGISTR 326
+ ++ I + T IPG+ + S RAKPLAAWM+GLE+ + DT + L+L G +
Sbjct: 179 PVSIMDIAPE--TPIPGVIIFSQRAKPLAAWMSGLELSFVRLDTTDDKPKLLLETGANDS 236
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+I AN K+ + +EA+++E AK+ +HFLA+Q SE GFWL +L
Sbjct: 237 WILANLTKSQI-LAEAKSFEEAKQNANLVHFLAVQSSPTSEQFAGFWLCREL 287
>gi|282896250|ref|ZP_06304272.1| Putative uncharacterized protein [Raphidiopsis brookii D9]
gi|281198746|gb|EFA73625.1| Putative uncharacterized protein [Raphidiopsis brookii D9]
Length = 289
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 106/290 (36%), Positives = 169/290 (58%), Gaps = 19/290 (6%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD+ KK+WE+++C+ + +Y++Y P+ +NS+ L++A+
Sbjct: 5 WELDFYSRPILDVNQKKVWEVLICESPTDVITKVDSLFRYSQYCPSTQVNSVWLRQALEE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ GV P KIRFFR QM +ITKAC+++ I + S++ L L W+++R E VY + P
Sbjct: 65 AIEKAGVA-PIKIRFFRRQMNNMITKACQDMGIPALSSRKTLVLNQWIQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q+ + + L+ P P LPD L G +W FV L S + E + FG + L+L
Sbjct: 124 GYQQVTNSSVRLERPLPQRLPDALEGKQWTFVSLEASDFTD---MPEWEIAFGEAFPLEL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS----LILSVGISTRYI 328
G+ +T IPG+ + S RA P+A WM+GLE+ + D+ + L+L G + +I
Sbjct: 181 AGL--SPETPIPGILIFSPRALPIAGWMSGLELAYLRFDSNPNNQGDRLVLETGGTESWI 238
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AN + P +A+ +E AK+ G+HF+ +Q + S+ GFWLL ++
Sbjct: 239 LANL-RTPKLLEDAKGFEEAKQKANGVHFIGVQSDPQSQSFAGFWLLKEI 287
>gi|113478101|ref|YP_724162.1| hypothetical protein Tery_4728 [Trichodesmium erythraeum IMS101]
gi|110169149|gb|ABG53689.1| protein of unknown function DUF1092 [Trichodesmium erythraeum
IMS101]
Length = 286
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/289 (41%), Positives = 162/289 (56%), Gaps = 16/289 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD R KK+WEL++C + + +Y+++ + +NSI L+ AI
Sbjct: 3 TIWELDFYSRPILDERQKKLWELLICQSPIGINDTTDSLYRYSEFTNSQEVNSIWLRSAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P PE+IRFFR QM +ITKAC EL I S+R L WLE+R E VY
Sbjct: 63 EKAIAQAPEP-PERIRFFRRQMNNMITKACGELAIPIALSRRTYLLNQWLEQRMEEVYPT 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
+PG+Q G+ P N P LPD L G++W FV L A E E FG + L
Sbjct: 122 YPGYQPGTNPSGQYMNSAPQPLPDALIGERWTFVSLEAGAFTEMS---EWDIDFGEAFPL 178
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIY 329
++ + + IPGL + SSRA+ LAAWM+GLE+ I+ A L+L+ G + +I
Sbjct: 179 SMMNLA--PLSAIPGLIIYSSRAQALAAWMSGLELAFIKFSPASPARLLLNTGGNDCWIL 236
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AN NP T +EA+ + AK +HFLA+Q +SE GFWLL ++
Sbjct: 237 ANL-SNPSTIAEAKRFSEAKSKAKEVHFLAVQSNPESESFAGFWLLQEI 284
>gi|443327636|ref|ZP_21056256.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
gi|442792728|gb|ELS02195.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
Length = 289
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 126/292 (43%), Positives = 166/292 (56%), Gaps = 23/292 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVV------CDGSLS--LQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++ D SL +Y +Y + INS+ L EAI
Sbjct: 6 WELDFYSRPILDENQKKVWEVLIQESPTTTDRSLDDLFRYAQYTSSKTINSLWLSEAIEK 65
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +ITKAC+EL I I S+R +L W+E+R +VY
Sbjct: 66 AIAESGTK-PRKIRFFRRQMNNMITKACEELGIAAIASRRTYALAQWIEDRMTSVYPNET 124
Query: 213 GFQKGSKPLLALDNPFPME---LPDNLFGDK---WAFVQLPFSAVQEEVSSLESKFVFGA 266
G+ + + ++ P P+ LPD + GDK WAFV L SA E E + FG
Sbjct: 125 GYDQKAANSASVKYP-PLNAIPLPDAVRGDKNDRWAFVSLDCSAFAEMS---EWEINFGE 180
Query: 267 SLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETD-TARGSLILSVGIST 325
+ L L I + K IPGL S RA PLAAWM+GLE+ ++ + T+R L L G S
Sbjct: 181 AFPLSLANIAGETK--IPGLIFFSPRANPLAAWMSGLEMGYLQLEITSRPRLRLETGASD 238
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+I AN NP SEA+ +EA+KK G+HFLA+Q + +SE GFWLL D
Sbjct: 239 SWILANV-TNPQILSEAKGFEASKKEAQGVHFLAVQSDPESESFAGFWLLKD 289
>gi|428780588|ref|YP_007172374.1| hypothetical protein Dacsa_2415 [Dactylococcopsis salina PCC 8305]
gi|428694867|gb|AFZ51017.1| Protein of unknown function (DUF1092) [Dactylococcopsis salina PCC
8305]
Length = 287
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 118/293 (40%), Positives = 163/293 (55%), Gaps = 25/293 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRPI D KK+WE+++C+ L ++ Y K+ +NSI L+EAI
Sbjct: 3 TIWELDFYSRPIRDENNKKLWEVLICESPLDVETTEEQLFRYQKFCSAQTVNSIFLQEAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC ++ I +PS+R +L W+EER E VY +
Sbjct: 63 NEAIEASGKS-PKKIRFFRRQMSNMITKACDDIGITALPSRRTYALQRWIEERLENVYPQ 121
Query: 211 HPGFQKGSKPLLALDNPFPME----LPDNLFGDK---WAFVQLPFSAVQEEVSSLESKFV 263
G+ + + + + +P E LPD + GDK WAFV L QE E +
Sbjct: 122 QEGYDETAVSSVTVQ--YPAENAAILPDAIRGDKGDRWAFVTLEVQGFQE---MKEWEIS 176
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVG 322
FG L L ++ +T IPGL + S RA P A WM+G+E+ I+ +R LIL G
Sbjct: 177 FGEGFPLSLF--DLSPETKIPGLVIFSPRAMPFAGWMSGIELSQIQLQQGSRPRLILQTG 234
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I A+ NP T EA+ ++ AK+ G+HFLAIQ + SE GFWLL
Sbjct: 235 TSECWILADI-TNPDTLKEAQGFQQAKETAQGVHFLAIQSDPQSEAFAGFWLL 286
>gi|434388804|ref|YP_007099415.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
gi|428019794|gb|AFY95888.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
Length = 286
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 105/286 (36%), Positives = 165/286 (57%), Gaps = 17/286 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WE+DF SRP++D R KK+WEL++C+ + ++T+Y P++ +NS+ L EA+ A
Sbjct: 5 WEIDFYSRPLVDERQKKVWELLICESPATTDRSTEDLFRFTRYCPSDRVNSLWLAEALQA 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ P++IRFFR QM +ITKACK++ I S+R ++L W+++R E Y + P
Sbjct: 65 AMLE-AKQSPQRIRFFRRQMNNMITKACKDIGIPAAASRRTIALHQWIDDRMEHFYPQQP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
+Q + + + + P LP+ L G+KW FV L A + E + F + L +
Sbjct: 124 NYQAANTASVQMFSDPPQPLPEALLGEKWTFVSL---AASQFADMNEWQIGFSEAFPLAM 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYAN 331
+G V + IPGL + S R+ P+AAWM+GLE+ S+ A + +L+L G S +I A
Sbjct: 181 VG--VTPEMPIPGLILYSPRSVPMAAWMSGLEIVSVRYQPAPKSTLLLETGASESWILAR 238
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+ T EA +EA+K+ G+HF+AIQ D E+ GFWLL +
Sbjct: 239 LEGT--TQQEAARFEASKQQAKGVHFIAIQSSPDVEEFAGFWLLYE 282
>gi|298714858|emb|CBJ25757.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 310
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 105/281 (37%), Positives = 159/281 (56%), Gaps = 7/281 (2%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL- 157
EWELD SRP++ GKK+WEL++CD + + ++ P+N++NS ++ I + +
Sbjct: 34 NEWELDVYSRPVVGADGKKLWELLICDSTGNFRHVSPIPSNMVNSREVRRTIEGVIEAAP 93
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G P IRFFR+ M +I A KE+++ P + ++ WLEER VY GF+
Sbjct: 94 GGSKPTVIRFFRNAMFNMIDIALKEVEVAVKPCRTTYAMYQWLEERERDVYPAMAGFKPT 153
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
K D P LPD L G+++AFV +P S ++ + E+ V G LD +
Sbjct: 154 MKQPAFFDIRTPTPLPDALRGEQYAFVTMPVSEFRQGNINDENVGV-GRLCPLD---ASL 209
Query: 278 DDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPV 337
D +IPGLA+ ++RA+PLA WM GLEV + D L L GI+T+Y+ A + +
Sbjct: 210 PDDAMIPGLAMFTARAEPLATWMTGLEVAYFKADLKNRELALECGINTQYLVARVQGD-- 267
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
EA+ +E AK+A GG HF+A+Q D++D GFWLL ++
Sbjct: 268 QRKEAQGFEEAKRALGGFHFVAVQSNPDADDVAGFWLLKEV 308
>gi|257062177|ref|YP_003140065.1| hypothetical protein Cyan8802_4445 [Cyanothece sp. PCC 8802]
gi|256592343|gb|ACV03230.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8802]
Length = 293
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/292 (40%), Positives = 167/292 (57%), Gaps = 21/292 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+V+C+ L++ +Y+++ + +NS+ L+EAI
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVVICETPLTVDRSPDTLFKYSQFCSSQTVNSVWLREAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC++ I +PS+R +L WL ER + Y
Sbjct: 63 ESAIAQAG-ETPQKIRFFRRQMNNMITKACEDAGIAAVPSRRTYTLTHWLAERNQQFYPT 121
Query: 211 HPGFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFG 265
PG+ + ++ P + LPD + G DKWAFV L SA++E E + FG
Sbjct: 122 QPGYSVEAAQTSSVAYPELNAIPLPDAVRGDKADKWAFVTLEASALEE---MNEWEIGFG 178
Query: 266 ASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGIS 324
L LLG+ + + IPGL + S RA PLAAWM+GLE+ ++ + R + L G S
Sbjct: 179 EGFPLSLLGVTSEQR--IPGLIIFSDRALPLAAWMSGLELGFLKFEENPRPIVRLETGTS 236
Query: 325 TRYIYANYK-KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I N K+ T +EA+ +E AK+ +HFLAIQ D+E GFWLL
Sbjct: 237 DSWILVNISPKDAPTLAEAQGFETAKQNGQQVHFLAIQSSPDTESFAGFWLL 288
>gi|425436008|ref|ZP_18816449.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9432]
gi|389679353|emb|CCH91843.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9432]
Length = 291
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/293 (39%), Positives = 166/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ +++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPVTIDRSSDTIFKYASYCPNTMVNSQWLSEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHD---IKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNTETLKEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|218249090|ref|YP_002374461.1| hypothetical protein PCC8801_4383 [Cyanothece sp. PCC 8801]
gi|218169568|gb|ACK68305.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8801]
Length = 293
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 117/292 (40%), Positives = 167/292 (57%), Gaps = 21/292 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+V+C+ L++ +Y+++ + +NS+ L+EAI
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVVICETPLTVDRSPDTLFKYSQFCSSQTVNSVWLREAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC++ I +PS+R +L WL ER + Y
Sbjct: 63 ESAIAQAG-ETPQKIRFFRRQMNNMITKACEDAGIAAVPSRRTYTLTHWLAERDQQFYPT 121
Query: 211 HPGFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFG 265
PG+ + ++ P + LPD + G DKWAFV L SA++E E + FG
Sbjct: 122 QPGYSVEAAQTSSVAYPELNAIPLPDAVRGDKADKWAFVTLEASALEE---MNEWEIGFG 178
Query: 266 ASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGIS 324
L LLG+ + + IPGL + S RA PLAAWM+GLE+ ++ + R + L G S
Sbjct: 179 EGFPLSLLGVTSEQR--IPGLIIFSDRALPLAAWMSGLELGFLKFEENPRPIVRLETGTS 236
Query: 325 TRYIYANYK-KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I N K+ T +EA+ +E AK+ +HFLAIQ D+E GFWLL
Sbjct: 237 DSWILVNISPKDAPTLAEAQGFETAKQNGQQVHFLAIQSSPDTESFAGFWLL 288
>gi|427724036|ref|YP_007071313.1| hypothetical protein Lepto7376_2188 [Leptolyngbya sp. PCC 7376]
gi|427355756|gb|AFY38479.1| protein of unknown function DUF1092 [Leptolyngbya sp. PCC 7376]
Length = 285
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 164/293 (55%), Gaps = 25/293 (8%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVC---------DGSLSLQYTKYFPNNVINSITLKE 148
+T WELDF SRPILD KK+WE+++C DG L +Y+++ N +NSITLK+
Sbjct: 1 MTIWELDFYSRPILDDNQKKLWEVLICEAPTSIKQGDGDL-FRYSEFCTNTEVNSITLKK 59
Query: 149 AIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
AI + GV P KIRFFR QM +I+K C++ I PS+R +L+ W+++R VY
Sbjct: 60 AIEKAIAEAGVS-PSKIRFFRRQMNNMISKGCEDAGIPSAPSRRAYTLMQWIDQRTREVY 118
Query: 209 TRHPGFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQLPFSAVQEEVSSLESKFV 263
HP F + + ++ P + LPD + GDKWA V L SA E+ E
Sbjct: 119 PEHPNFDEQAARNTSVQYPSLNAVALPDAVRGDKGDKWAIVSLEASAF-EDFDDWE--ID 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVG 322
FG L+ L + T IPGL + S RA PLA WM+GLE+ + + R S++L G
Sbjct: 176 FGEPFPLNNL----NSDTKIPGLLIFSPRAVPLAGWMSGLELSFLHLNQQPRPSMVLETG 231
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+S +I A+ N T EA+ +E AKK G+HFLAIQ D E GFW+L
Sbjct: 232 VSDSWIVADL-PNKGTVKEAKNFETAKKKAEGIHFLAIQNSPDDERFAGFWML 283
>gi|422303161|ref|ZP_16390515.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9806]
gi|389791919|emb|CCI12318.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9806]
Length = 291
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 166/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A GV P+KIRFFR QM +ITKAC+++ I PS+R +L W++ER Y
Sbjct: 61 VTAAIKAAGV-TPKKIRFFRRQMNNMITKACEDIGIPASPSRRTHALTRWIKERMANFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETSSRPVLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|443649603|ref|ZP_21130311.1| hypothetical protein C789_851 [Microcystis aeruginosa DIANCHI905]
gi|159028601|emb|CAO90604.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334903|gb|ELS49392.1| hypothetical protein C789_851 [Microcystis aeruginosa DIANCHI905]
Length = 291
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 166/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMANFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHD---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETSSRPLLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLAIQ +S+ GFWLL
Sbjct: 234 ASDSWILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESQSFAGFWLL 285
>gi|158336667|ref|YP_001517841.1| hypothetical protein AM1_3535 [Acaryochloris marina MBIC11017]
gi|158306908|gb|ABW28525.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 287
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 108/286 (37%), Positives = 162/286 (56%), Gaps = 14/286 (4%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAIC------ 154
WE+DF SRPILD + KKIWEL+VCD + ++TK + N+ L+EA+
Sbjct: 5 WEIDFYSRPILDEQQKKIWELLVCDSQRNFEFTKVCSGSQANARWLQEALAEALPLWRQQ 64
Query: 155 DDLGVP-IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
+ G PE+IRFFR M++II +AC+ L+I PS+R + WL ER +TVY +HPG
Sbjct: 65 ANYGEQDFPERIRFFRRSMKSIIPRACEALEIPAQPSRRTFGVYQWLCEREQTVYPQHPG 124
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLL 273
+Q + + P LPD L G+ W V L SA +E E FGA + L L
Sbjct: 125 YQPMMAAPMTFEPTLPKPLPDALQGEGWRLVTLQLSAFEE---MDEWDIAFGAKIPLAQL 181
Query: 274 GIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYANY 332
+ +T IPGL + S R+ PLA WM+GLE+ ++ + + L+L G+S R++ A
Sbjct: 182 NL--PPETAIPGLLIFSERSTPLAGWMSGLELACLKLEMDPKPQLLLETGLSDRWVIAYL 239
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+P+ +E + +E K+A +HF+A+Q +SE GFWL+ ++
Sbjct: 240 NDDPL-VAEIQDFEKTKQAAQQVHFVAVQSSPESEQFAGFWLMQEI 284
>gi|390439073|ref|ZP_10227492.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389837496|emb|CCI31616.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 291
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 166/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP+LD KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVLDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A + GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKEAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ + +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLEAGSRPLLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNAETLNEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|443310782|ref|ZP_21040422.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
gi|442779136|gb|ELR89389.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
Length = 288
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 110/291 (37%), Positives = 161/291 (55%), Gaps = 17/291 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRP+LD KK+WE++VC+ LS+ +Y++Y ++ +NS LK A+
Sbjct: 5 WEIDFYSRPVLDENNKKLWEILVCESPLSIDTELDSLFKYSEYCSSSQVNSAWLKAALEK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ P K RFFR+ M +I KAC++L I PS+R L+L WL++R VY P
Sbjct: 65 AMEQ-SATTPLKFRFFRTSMNNMIVKACQDLGIPAQPSRRTLALHQWLQQRNLDVYPLEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL 272
G+Q + P + P LPD L G KW L + + + E + FG + L L
Sbjct: 124 GYQASTNPSVRGQKSDPQRLPDALIGQKWVVASLTGADLAQMP---EWEIGFGEAFPLPL 180
Query: 273 LGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTAR--GSLILSVGISTRYIYA 330
EV T++PG+ + S RA PLA WM+GLE+ +++ DT+ LIL G S ++ A
Sbjct: 181 --GEVASDTIVPGVIIYSPRAVPLAGWMSGLEIAALKVDTSVNPARLILETGASDSWLLA 238
Query: 331 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
N NP T A+ +E AK+ +HFLA+Q +SE GFWLL ++ P
Sbjct: 239 NV-TNPQTLQMAQDFEGAKQKANQVHFLAVQSSPESEVFAGFWLLQEINLP 288
>gi|425464328|ref|ZP_18843650.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389833706|emb|CCI21561.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 291
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 114/293 (38%), Positives = 167/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A + GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKEAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMANFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PL+ WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLSGWMSGLEMAYLKLETGSRPLLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNAETLNEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|425443217|ref|ZP_18823442.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715544|emb|CCI00112.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 291
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 166/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP+LD KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVLDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PL+ WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLSGWMSGLEMAYLKLETGSRPLLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|428775715|ref|YP_007167502.1| hypothetical protein PCC7418_1082 [Halothece sp. PCC 7418]
gi|428689994|gb|AFZ43288.1| protein of unknown function DUF1092 [Halothece sp. PCC 7418]
Length = 287
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 116/291 (39%), Positives = 164/291 (56%), Gaps = 21/291 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAI 150
T WELDF SRPI D KK+WE+++C+ L +Y+K+ +NSI L+EA+
Sbjct: 3 TIWELDFYSRPIRDENNKKLWEVLICESPLQANTTEGELFRYSKFCSAQNVNSIFLQEAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC++L+I +PS+R +L WL+ER + VY +
Sbjct: 63 NEAMEKSGT-TPKKIRFFRRQMNNMITKACEDLEITALPSRRTYALQKWLQERLDQVYPQ 121
Query: 211 HPGFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQLPFSAVQEEVSSLESKFVFG 265
G+ + + ++ P + LPD + GDKWAFV L A QE + FG
Sbjct: 122 QEGYDETAVTNASVQYPAENAVILPDAIRGDKGDKWAFVTLEAQAFQE---MEDWDISFG 178
Query: 266 ASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGIS 324
L L E+ +T +PGL + S RA P A WM+G+E+ I+ + + L+L G S
Sbjct: 179 EGFPLSLF--ELAPETKVPGLVIFSPRAMPFAGWMSGIELSQIQLQEGSLPRLVLQTGSS 236
Query: 325 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I A+ NP T EA+ + AAKK G+HFLAIQ + SE GFWLL
Sbjct: 237 DCWILADI-TNPETLKEAQGFAAAKKDAKGVHFLAIQTDPSSESFAGFWLL 286
>gi|166363644|ref|YP_001655917.1| hypothetical protein MAE_09030 [Microcystis aeruginosa NIES-843]
gi|166086017|dbj|BAG00725.1| hypothetical protein MAE_09030 [Microcystis aeruginosa NIES-843]
Length = 291
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 115/293 (39%), Positives = 167/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A + GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKEAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PL+ WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLSGWMSGLEMAYLKLETGSRPLLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|425445142|ref|ZP_18825178.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9443]
gi|389734932|emb|CCI01483.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9443]
Length = 291
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 115/293 (39%), Positives = 166/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRPSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHD---IKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLA+Q +S+ GFWLL
Sbjct: 234 ASDSWILVNV-TNAKTLNEAKNFEEAKQKANNLHFLAVQSNPESQSFAGFWLL 285
>gi|170077740|ref|YP_001734378.1| hypothetical protein SYNPCC7002_A1122 [Synechococcus sp. PCC 7002]
gi|169885409|gb|ACA99122.1| conserved hypothetical protein (DUF1092) [Synechococcus sp. PCC
7002]
Length = 285
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 117/292 (40%), Positives = 160/292 (54%), Gaps = 23/292 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEA 149
+T WELDF SRP+LD KK+WE+++C+ +Q Y+++ N +NSITLK A
Sbjct: 1 MTIWELDFYSRPLLDDNDKKLWEILICETPTRIQQDPTTLFRYSEFCSNTDVNSITLKTA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I G P KIRFFR QM +ITK C++ I PS+R +L+ W+ +R + VY
Sbjct: 61 IEKAIATSGQS-PTKIRFFRRQMNNMITKGCEDAGIPAAPSRRTYTLMTWITQREQEVYP 119
Query: 210 RHPGFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQLPFSAVQEEVSSLESKFVF 264
+ + + S ++ P + LPD + GDKWA V L SA + E F
Sbjct: 120 QEANYDEKSAKSSSVQYPALNAIALPDAVRGDKGDKWAIVSLEASAFSD---FDEWDIAF 176
Query: 265 GASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGI 323
G L +D T IPGL + S RA PLA WM+GLE+ + R SL+L G+
Sbjct: 177 GEPFPL----THLDPTTKIPGLLIFSPRAVPLAGWMSGLELGFLHLQKNPRSSLVLETGV 232
Query: 324 STRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I A+ N T EAE++EAAKKA G+HFLAIQ+ D E GFW+L
Sbjct: 233 SDSWIVADL-PNAQTLKEAESFEAAKKAAAGIHFLAIQKSPDEEQFAGFWML 283
>gi|428218630|ref|YP_007103095.1| hypothetical protein Pse7367_2406 [Pseudanabaena sp. PCC 7367]
gi|427990412|gb|AFY70667.1| protein of unknown function DUF1092 [Pseudanabaena sp. PCC 7367]
Length = 287
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 108/288 (37%), Positives = 156/288 (54%), Gaps = 21/288 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP+L+ KKIWEL++CD + +++ + P++ +NS L E + + G
Sbjct: 5 WELDFYSRPVLNQNKKKIWELLICDRTRQMEWVQECPSDRVNSAWLAEQLQTVIQKTG-Q 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+K+RFFR M IIT+ C + + P+ S+R +L WL+ER VY + GFQ
Sbjct: 64 TPQKVRFFRPSMANIITRGCNQAGLNPLASRRVFTLAAWLQERMAQVYPQQEGFQAADPN 123
Query: 221 LLALDNPFPM----ELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIE 276
L L P +PD L G+ WA V L + S+ + F DL L
Sbjct: 124 PLPLAVPMQQISTRPIPDALIGEGWAIVSL---RADQFASAGDWSIDFEELFDLSYLS-- 178
Query: 277 VDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE--TDTARGS--LILSVGISTRYIYANY 332
D TLIPGL + S RA PLAAWM G++ ++ T+ GS ++L R++ AN+
Sbjct: 179 --DDTLIPGLIIYSHRATPLAAWMAGVDPVFLKFVTNQNDGSSQMLLEANADARWLVANF 236
Query: 333 K-----KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ KN ++ +A+E AK+ +HFLAIQ+ DSED GFWLL
Sbjct: 237 QSAKAPKNAKAIADGQAFETAKQKAAQVHFLAIQDNPDSEDFAGFWLL 284
>gi|425453946|ref|ZP_18833695.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9807]
gi|389799877|emb|CCI20614.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9807]
Length = 291
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 115/293 (39%), Positives = 165/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRPSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ + PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGVPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHD---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNTETLKEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|443319275|ref|ZP_21048509.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
gi|442781102|gb|ELR91208.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
Length = 287
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 108/286 (37%), Positives = 159/286 (55%), Gaps = 16/286 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD R K+ WE+++ +G + +++++ N +NS+ LKE I
Sbjct: 3 TIWELDFYSRPILDERNKRRWEVLISEGLQRVDADPENLFRFSQFLANTDVNSLKLKEVI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P ++RFFR MQT+IT+AC++L + PS+R L+L W++ R VY +
Sbjct: 63 ETAIAQAPEP-PSRVRFFRFSMQTMITRACEDLGLAATPSRRTLALQDWIDYRQREVYPQ 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
PG+ P + P P LPD L G +WAFV LP ++ + FG L
Sbjct: 122 DPGYTDKPAPTVGAPPPSPRRLPDALVGQRWAFVTLP---ARDFADMPDWPMDFGEGFPL 178
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTRYIY 329
L GI D T IPG+ + S RA +A WM+GLE+ + +T++ LIL G + +I
Sbjct: 179 SLAGI--GDDTPIPGIIIFSPRAVAMAGWMSGLELSELRVETSKSPRLILETGAADSWIL 236
Query: 330 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ + + T EA+ +EAAK A +HFLA+QE +E GFWL+
Sbjct: 237 SPLGDSTLQT-EAKNFEAAKVAANQVHFLALQENPATEAFAGFWLM 281
>gi|425451748|ref|ZP_18831568.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|440751656|ref|ZP_20930859.1| hypothetical protein O53_19 [Microcystis aeruginosa TAIHU98]
gi|389766807|emb|CCI07649.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|440176149|gb|ELP55422.1| hypothetical protein O53_19 [Microcystis aeruginosa TAIHU98]
Length = 291
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 115/293 (39%), Positives = 165/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNTETLKEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|359459949|ref|ZP_09248512.1| hypothetical protein ACCM5_14568 [Acaryochloris sp. CCMEE 5410]
Length = 287
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 163/286 (56%), Gaps = 14/286 (4%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAI---VAICDDL 157
WE+DF SRPILD + KKIWEL+VCD + ++TK + N+ L+EA+ + +
Sbjct: 5 WEIDFYSRPILDEQQKKIWELLVCDSQRNFEFTKVCSGSQANARWLQEALAEALPLWRQQ 64
Query: 158 G----VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
G PE+IRFFR M++II +AC+ L+I PS+R + WL ER +TVY +HPG
Sbjct: 65 GNYGEQDFPERIRFFRRSMKSIIPRACEALEIPAQPSRRTFGVYQWLCEREQTVYPQHPG 124
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLL 273
+Q + + P LPD L G+ W V L SA ++ E FGA + L L
Sbjct: 125 YQPMMAAPMTFEPTLPKPLPDALQGEGWRLVTLQLSAFED---MDEWDIAFGAKIPLAQL 181
Query: 274 GIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYANY 332
+ +T IPGL + S R+ PLA WM+GLE+ ++ + + L+L G+S R++ A
Sbjct: 182 NL--PPETAIPGLLIFSERSTPLAGWMSGLELACLKLEMDPKPQLLLETGLSDRWVIAYL 239
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+P+ +E + +E K+A +HF+A+Q +SE GFWL+ ++
Sbjct: 240 NDDPL-VAEIQDFEKTKQAAQQIHFVAVQSSPESEQFAGFWLMQEI 284
>gi|428223761|ref|YP_007107858.1| hypothetical protein GEI7407_0302 [Geitlerinema sp. PCC 7407]
gi|427983662|gb|AFY64806.1| protein of unknown function DUF1092 [Geitlerinema sp. PCC 7407]
Length = 289
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 110/290 (37%), Positives = 164/290 (56%), Gaps = 16/290 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD R KK+WE++VC+ ++ +Y +Y + +NS+ L++A+
Sbjct: 3 TIWELDFYSRPILDEREKKVWEVLVCESPQTVNQAPETLFRYAEYCDSGEVNSVRLRQAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P+KIRFFR Q+ +ITKAC +L + P+PS+R ++L WLEER VY
Sbjct: 63 ERAIAQAPQP-PDKIRFFRRQLTNMITKACSDLGVLPLPSRRTVTLNQWLEERSRDVYPL 121
Query: 211 HPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLD 269
P +++G P + + P P LPD L D+ AFV L A + E FG +
Sbjct: 122 DPNYREGVVVPSVQFETPEPKRLPDALNYDRLAFVTLEAGAFADMT---EWSIDFGEAFP 178
Query: 270 LDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYI 328
L+ LG+ +T +PG+ + SSRA PLAAWM+GLE+ + +T L+L G + R++
Sbjct: 179 LEALGL--TPETRVPGVLLFSSRALPLAAWMSGLEMAFVRYEETPNPCLVLDTGANERWL 236
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
EA+ +E AK+A +HF+ +Q + SE GFWLL ++
Sbjct: 237 LRGNLAERSQQQEAKNFELAKQAAQNVHFIGVQSDPQSEAFSGFWLLQEV 286
>gi|425472349|ref|ZP_18851200.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881591|emb|CCI37866.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 291
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 165/293 (56%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A G P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAG-GTPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPLLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 234 ASDSWILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|218437072|ref|YP_002375401.1| hypothetical protein PCC7424_0060 [Cyanothece sp. PCC 7424]
gi|218169800|gb|ACK68533.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7424]
Length = 290
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 164/289 (56%), Gaps = 21/289 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++C +Y+++ N +NS+ L EAI
Sbjct: 5 WELDFYSRPILDENKKKLWEVLICQAPTESDQSPDSLFKYSEFCSNTTVNSLWLGEAIKK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P+KIRFFR QM +I+KAC++ I P PS+R +L W+EER VY +
Sbjct: 65 ATLEAG-EAPKKIRFFRRQMNNMISKACEDAGIDPAPSRRTYALNQWIEERMRDVYPQQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ + + +++ P + LPD + GDK+AFV L A + E FG +
Sbjct: 124 GYDENAAKPVSVQYPALNAVPLPDAIRGDKGDKYAFVSLEAEAFAQ---MKEWDIAFGEA 180
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTR 326
L ++G+ + K IPG+ + SSRA PLA WM+GLE+ ++ +++R L L G+S
Sbjct: 181 FPLSMVGVTSEVK--IPGVIIYSSRALPLAGWMSGLEMGYLKLEESSRPILRLETGVSDS 238
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I N NP T +EA+ +EA K+ +HFLA+Q +SE GFWLL
Sbjct: 239 WILLNV-TNPQTLAEAKGFEATKQKANNVHFLAVQSSPESESFSGFWLL 286
>gi|425459302|ref|ZP_18838788.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389823007|emb|CCI29141.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 291
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 115/293 (39%), Positives = 164/293 (55%), Gaps = 23/293 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN +NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTTVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ + +++ P P+ LPD + G DKWAFV L S+ + +
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFND---LKDWDIS 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVG 322
FG +L +LG+ +D+ IPGL + S RA PLA WM+GLE+ ++ +T +R L L G
Sbjct: 176 FGE--NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETG 233
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
S +I + N T EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 234 ASDSWILVSV-TNTETLKEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|428204149|ref|YP_007082738.1| hypothetical protein Ple7327_4040 [Pleurocapsa sp. PCC 7327]
gi|427981581|gb|AFY79181.1| Protein of unknown function (DUF1092) [Pleurocapsa sp. PCC 7327]
Length = 291
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 162/290 (55%), Gaps = 23/290 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVC----DGSLSL----QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++C D S +Y+++ N +NS+ L++ I
Sbjct: 5 WELDFYSRPILDENNKKLWEVLICETPTDSKQSFDSLFKYSQFCSNQSVNSLWLQQEIEK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
GV P+KIRFFR QM +I KAC++L I P PS+R +L WL +R + Y P
Sbjct: 65 AIAQAGV-APKKIRFFRRQMNNMIVKACEDLGIPPAPSRRTYALERWLSQRLDEFYPNQP 123
Query: 213 GFQKGSKPLLALDNPFPME---LPDNLF---GDKWAFVQLPFSAVQEEVSSLESKFVFGA 266
G+ + ++ P P+ LPD + GDKWAFV L SA +E E FG
Sbjct: 124 GYDAAAAKSASVQYP-PLNATPLPDAVRGDKGDKWAFVSLEASAFEEMN---EWDIAFGE 179
Query: 267 SLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLI-LSVGIST 325
+ L L G+ D K IPGL + SSRA PLA WM+GLE+ ++ + ++ L G S
Sbjct: 180 AFPLSLTGMTPDTK--IPGLIIFSSRALPLAGWMSGLELAFLKFEGGSRPIVRLETGASD 237
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I A+ +P +EA+ +E AK+ +HFLAIQ +S+ GFWLL
Sbjct: 238 SWILASL-TDPKMLAEAKGFEEAKQKAQQVHFLAIQSNPESQSFAGFWLL 286
>gi|434398597|ref|YP_007132601.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
gi|428269694|gb|AFZ35635.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
Length = 292
Score = 184 bits (466), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 167/293 (56%), Gaps = 23/293 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS---------LQYTKYFPNNVINSITLKEA 149
T WELDF SRPILD KK+WE+++C+ SL+ +Y++Y + +NS+ L+EA
Sbjct: 4 TIWELDFYSRPILDEENKKVWEVLICE-SLTDPERSPDEIFRYSQYCSSKTVNSLWLREA 62
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I G+ P+KIRFFR QM +ITKAC++ I PS R +L WL R + VY
Sbjct: 63 IEKAIAIAGI-TPKKIRFFRRQMNNMITKACEDAGIAAAPSSRTYALNHWLATRMKEVYP 121
Query: 210 RHPGFQKGSKPLLALDNP--FPMELPDNL---FGDKWAFVQLPFSAVQEEVSSLESKFVF 264
+ PG+ + + +++ P + LPD + GDKWAFV L SA E E + F
Sbjct: 122 QEPGYDQKTASSISVQYPDLNAIPLPDAVRGDRGDKWAFVSLEASAFAEMN---EWEIGF 178
Query: 265 GASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVGI 323
+ L LL + +T IPGL + S RA LAAW++GLE+ + ++ R + L+ G+
Sbjct: 179 KEAFPLSLLNL--SSETQIPGLIIFSPRATLLAAWLSGLEMGFLHLESDPRPRICLNTGL 236
Query: 324 STRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLL 376
S ++ N P T +EA+ +E AK+ G+HFLAIQ +SE GFWLLL
Sbjct: 237 SDSWVLVNL-TTPSTLTEAKEFELAKQKAQGVHFLAIQSSTESESFAGFWLLL 288
>gi|254425410|ref|ZP_05039128.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196192899|gb|EDX87863.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 301
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 110/301 (36%), Positives = 161/301 (53%), Gaps = 32/301 (10%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRP+LD + KK WE+++C+G S++ Y+KY N+ +NS TL+ AI
Sbjct: 3 TVWELDFYSRPVLDEQNKKRWEILICEGLQSVEDDPANLFRYSKYVSNSEVNSETLQAAI 62
Query: 151 ---VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
+A P K+R+FR QMQ +I +AC+E + PS+R L+L WLE+R V
Sbjct: 63 EEAIAQSASESADSPTKVRYFRYQMQNMIKRACEEAGLLSYPSRRTLALQQWLEDRKVNV 122
Query: 208 YTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
Y P ++ + +A LPD L G +WA V LP +E + F +
Sbjct: 123 YPNEPRYKPSASASVAKPIDVVNPLPDALIGQQWALVTLP---AKEFADMGDWDVAFKEA 179
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-------------TDTAR 314
L++ G+E D T IPG + S+RA PLAAWM+GLE+ + TDTAR
Sbjct: 180 FPLEIAGVEPD--TPIPGFIIYSNRATPLAAWMSGLEIAGVRAGKEESSNYVSKNTDTAR 237
Query: 315 GSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWL 374
L++ G ++ A+ P T +E +E AK A +HF+A+Q+ +SE G WL
Sbjct: 238 --LLMDTGTIETWLLADL-VTPETQAEGLRFENAKAAANNVHFIAVQDSPESETFAGMWL 294
Query: 375 L 375
+
Sbjct: 295 M 295
>gi|428772145|ref|YP_007163933.1| hypothetical protein Cyast_0304 [Cyanobacterium stanieri PCC 7202]
gi|428686424|gb|AFZ46284.1| protein of unknown function DUF1092 [Cyanobacterium stanieri PCC
7202]
Length = 288
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 115/289 (39%), Positives = 154/289 (53%), Gaps = 22/289 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPI D KK+WE+++C+ + +Y+++ N+ +NSITL AI +
Sbjct: 5 WELDFYSRPIFDENNKKLWEILICESPTDIDSDYDSLFRYSQFCSNSEVNSITLGGAIAS 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I KAC + I PS+ +L WL+ER Y
Sbjct: 65 AMEKAG-ETPSKIRFFRRQMNNMIIKACDDAGIPVFPSRHTYALNRWLDERETDFYPHQE 123
Query: 213 GFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+Q K ++ P + LPD + G DKWA V L Q+ E FG +
Sbjct: 124 GYQ-APKNTASVQYPQGNAVSLPDAVKGDRTDKWALVSLGSDDFQD---MREWAIAFGEA 179
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLI-LSVGISTR 326
L L IE D T IPGL + S RA PLAAWM+GLE+ + +T + I L G+S
Sbjct: 180 FPLSLADIE--DNTKIPGLIIFSKRALPLAAWMSGLELGYLRLETGQFPRICLETGVSDS 237
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I AN + T EAE +E K+ G+HFLAIQ +SE GFWLL
Sbjct: 238 WILANITDDK-TLGEAEGFETTKQQANGVHFLAIQSSPESESFEGFWLL 285
>gi|307150318|ref|YP_003885702.1| hypothetical protein Cyan7822_0382 [Cyanothece sp. PCC 7822]
gi|306980546|gb|ADN12427.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7822]
Length = 290
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 113/293 (38%), Positives = 160/293 (54%), Gaps = 25/293 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+++C+ QY+++ + +NS+ L E +
Sbjct: 3 TIWELDFYSRPILDEDEKKLWEVLICEAPTEPDLSPDSLFQYSEFCSSKTVNSLWLAETL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
G P+KIRFFR QM +ITKAC+E I PS+R +L W+E+R + Y +
Sbjct: 63 KKAIAQAG-KAPKKIRFFRRQMNNMITKACEEAGIDAAPSRRTYALNQWIEQRMKEFYPQ 121
Query: 211 HPGFQKGSKPLLALDNPFP----MELPDNLF---GDKWAFVQLPFSAVQEEVSSLESKFV 263
G+ + K L+ +P + LPD + GDK+AFV L A + E
Sbjct: 122 QEGYDQ--KAALSTSVQYPGLNAIPLPDAIRGDKGDKYAFVSLEAEAFAQ---LKEWDIA 176
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVG 322
FG + L +LGI +K IPGL + SSRA PLA WM+GLE+ ++ ++ R + L G
Sbjct: 177 FGEAFPLSMLGINPKNK--IPGLIIYSSRALPLAGWMSGLEMGYLKFEESDRPIVRLETG 234
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+S +I N NP SEA+ +E KK +HFLA+Q +SE GFWLL
Sbjct: 235 VSDSWIVINV-TNPQILSEAKGFEETKKRANNVHFLAVQSSPESESFAGFWLL 286
>gi|449016446|dbj|BAM79848.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 411
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 117/325 (36%), Positives = 166/325 (51%), Gaps = 54/325 (16%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP++ GK++WELVVCD S + + FPNN++NS L A+ + ++ V
Sbjct: 91 WELDFYSRPVVGADGKRLWELVVCDRDGSFVHVEAFPNNMVNSRELARAVKTLIEESSVR 150
Query: 161 IPEKIRFFRSQMQTIITKACKELD-IKPIPSKRCLSLLLWLEERYETVYTRHPGFQ---- 215
P IRFFR+QM+ +I A + + ++ PS+R +L L L R VY R PG++
Sbjct: 151 -PRIIRFFRAQMRNMIQIAMQNISGVETRPSRRTYALFLALAYRERNVYPRLPGYEGKSI 209
Query: 216 --------KGSKPLLA----------LDNPFPMELPDNLFGDKWAFVQL----------- 246
+G++ LA +D LPD L GD++AFV +
Sbjct: 210 GIGNRSGTRGAELSLAESIGNMLKTPVDLKVAARLPDELQGDRFAFVTILLRDVTQMNAA 269
Query: 247 ------PFSAVQEEV-------SSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRA 293
P S E + +S + A+LDL E TL+PG+ + S RA
Sbjct: 270 GFGELCPVSLSAESMNLDIQMRTSGNAGSTQSAALDLGAPSPE----TLVPGVVIYSRRA 325
Query: 294 KPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACG 353
PLAAW +G E+ I D + + L G+ Y++A + P +EA A+ AKKA
Sbjct: 326 LPLAAWFSGTELAYIIADEQQKEIYLECGLDAAYLFARIQ--PSLEAEARAFNEAKKAAR 383
Query: 354 GLHFLAIQEELDSEDCVGFWLLLDL 378
GLHFLAIQE+ D ED GFWLL D+
Sbjct: 384 GLHFLAIQEKPDDEDVCGFWLLRDV 408
>gi|428771014|ref|YP_007162804.1| hypothetical protein Cyan10605_2686 [Cyanobacterium aponinum PCC
10605]
gi|428685293|gb|AFZ54760.1| protein of unknown function DUF1092 [Cyanobacterium aponinum PCC
10605]
Length = 294
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 113/291 (38%), Positives = 155/291 (53%), Gaps = 21/291 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVC--------DGSLSLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPI+D KK WE+++C D S +Y+++ N +NSITL+ AI
Sbjct: 5 WELDFYSRPIIDENNKKRWEILICESPTTIDTDTSQLFRYSQFCANTEVNSITLQNAIAT 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I K C++ I + S+ +L WLEER + Y
Sbjct: 65 AIEKAG-ETPSKIRFFRRQMNNMILKGCEDAGIPALASRHTYTLNQWLEERMTSFYPLQE 123
Query: 213 GFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ + + ++ P P+ LPD L G DKWA V L ++E E F +
Sbjct: 124 GYDEKATIAASVQYPQTNPVNLPDALKGDKKDKWALVSLNGKDLEEMP---EWDIGFREA 180
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTR 326
L + I D K IPGL + SSRA PLA WM+GLE+ + D + S+ L G+S
Sbjct: 181 FPLKIANISPDTK--IPGLIIFSSRALPLAGWMSGLELGYLRLDRGKFPSICLETGVSDS 238
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+I N + T SEAE +E KK G+HFLAIQ +S+ FWLLL+
Sbjct: 239 WILVNL-TDKNTLSEAEGFENTKKQANGVHFLAIQSSPESQSFEAFWLLLE 288
>gi|16330318|ref|NP_441046.1| hypothetical protein sll2002 [Synechocystis sp. PCC 6803]
gi|383322059|ref|YP_005382912.1| hypothetical protein SYNGTI_1150 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325228|ref|YP_005386081.1| hypothetical protein SYNPCCP_1149 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491112|ref|YP_005408788.1| hypothetical protein SYNPCCN_1149 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436379|ref|YP_005651103.1| hypothetical protein SYNGTS_1150 [Synechocystis sp. PCC 6803]
gi|451814476|ref|YP_007450928.1| hypothetical protein MYO_111600 [Synechocystis sp. PCC 6803]
gi|1652807|dbj|BAA17726.1| sll2002 [Synechocystis sp. PCC 6803]
gi|339273411|dbj|BAK49898.1| hypothetical protein SYNGTS_1150 [Synechocystis sp. PCC 6803]
gi|359271378|dbj|BAL28897.1| hypothetical protein SYNGTI_1150 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274548|dbj|BAL32066.1| hypothetical protein SYNPCCN_1149 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277718|dbj|BAL35235.1| hypothetical protein SYNPCCP_1149 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451780445|gb|AGF51414.1| hypothetical protein MYO_111600 [Synechocystis sp. PCC 6803]
Length = 292
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 112/289 (38%), Positives = 161/289 (55%), Gaps = 21/289 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRP+LD KK+WE+++C+ S+Q Y++Y P++ +NS+ L++AI A
Sbjct: 5 WELDFYSRPLLDDEEKKVWEVLICESPQSVQQLPGDLFRYSQYCPSSTVNSVWLRQAIEA 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G +P+KIRFFR QM +I+KAC+E I P PS+R L WL +R E Y + P
Sbjct: 65 AIAEAGQ-MPQKIRFFRRQMNNMISKACEEAGIPPAPSRRTYVLEQWLGDRLENFYPQQP 123
Query: 213 GFQKGSKPLLALDNP--FPMELPDNLFGDK---WAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ ++ P + LPD + GD+ WA V L A + + + FG S
Sbjct: 124 GYDPKLASSTSVQYPELNAIALPDAVRGDRGDQWALVSL---AAADFNDLPDWEISFGES 180
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVGISTR 326
L + D + IPGL + S RA P AAW++GLE+ ++ +T R + L G S
Sbjct: 181 FPLSSYNLSPDSR--IPGLILFSPRALPFAAWLSGLELGYLQYNTDPRPIMRLETGASDS 238
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I AN + + EA+ +E KK G+HFLAIQ DSE GFWLL
Sbjct: 239 WIVANV-TDKTSEQEAQGFEQTKKLAQGIHFLAIQTSPDSETFAGFWLL 286
>gi|126659192|ref|ZP_01730330.1| hypothetical protein CY0110_04433 [Cyanothece sp. CCY0110]
gi|126619497|gb|EAZ90228.1| hypothetical protein CY0110_04433 [Cyanothece sp. CCY0110]
Length = 290
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/289 (38%), Positives = 154/289 (53%), Gaps = 21/289 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSL--------SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ +Y ++ P N +NSI L+EA+
Sbjct: 5 WELDFYSRPILDENNKKQWEVLICETQTDTTESLDKGFRYAQFCPPNTVNSIWLREALEI 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I KAC++ + PS+R +L WL +R++ Y
Sbjct: 65 AIEKAG-ENPSKIRFFRRQMNNMIVKACEDAGLVASPSRRTYTLNHWLNQRFQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ + + ++ P + LPD + G DKWAFV L SA ++ E FG
Sbjct: 124 GYDEKAATNASVAYPTLNAIALPDAVRGDKSDKWAFVSLEASAFED---MKEWDIRFGEG 180
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLE-VCSIETDTARGSLILSVGISTR 326
L+L+ + D K IPG + S RA PLA WM+GLE VC + R L L G+S
Sbjct: 181 FPLELVDLSPDTK--IPGFIIFSQRALPLAGWMSGLELVCLKVQEKPRPILSLETGLSDS 238
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+I AN + + +EA+ +E K G+HFLAIQ D E GFWLL
Sbjct: 239 WILANL-TDKSSVAEAQGFEDTKNKAKGVHFLAIQSRPDVETFSGFWLL 286
>gi|172036928|ref|YP_001803429.1| hypothetical protein cce_2013 [Cyanothece sp. ATCC 51142]
gi|354554731|ref|ZP_08974035.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
gi|171698382|gb|ACB51363.1| DUF1092-containing protein [Cyanothece sp. ATCC 51142]
gi|353553540|gb|EHC22932.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
Length = 289
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 110/292 (37%), Positives = 157/292 (53%), Gaps = 23/292 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSL--------SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ +Y ++ P + +NSI L+EA+
Sbjct: 5 WELDFYSRPILDENNKKQWEVLICETQTDTTESLDKGFRYAEFCPPSTVNSIWLREALET 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I KAC++ + PS+R +L W+ +R++ Y
Sbjct: 65 AIEKAG-ETPSKIRFFRRQMNNMIVKACEDAGLVASPSRRTYTLNHWINQRFQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFGA 266
G+ + + ++ P P++ LPD + G DKWAFV L S + E FG
Sbjct: 124 GYDEKAATNASVAYP-PLDAIALPDAVRGDKSDKWAFVSLEASGFAD---MKEWDIRFGE 179
Query: 267 SLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA-RGSLILSVGIST 325
L+L + D K IPG + S RA PLA WM+GLE+ S++ T +L L G+S
Sbjct: 180 GFPLELANLSPDTK--IPGFIIFSRRALPLAGWMSGLELVSLKFQTKPFPNLCLETGLSD 237
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+I AN + + +EAE +E +K G+HFLAIQ D E GFWLL D
Sbjct: 238 NWILANL-TDKSSVTEAEGFEQSKNKANGVHFLAIQSRPDVETFSGFWLLKD 288
>gi|67922272|ref|ZP_00515785.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
gi|67855848|gb|EAM51094.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
Length = 289
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 110/291 (37%), Positives = 155/291 (53%), Gaps = 21/291 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCD------GSL--SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ GSL +Y K+ P +NS+ L+EAI
Sbjct: 5 WELDFYSRPILDENKKKQWEVLICETQTDSQGSLEDGFRYAKFCPPKTVNSMWLREAIET 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P K+RFFR QM +I KAC++ + PS+R +L WL++R + Y
Sbjct: 65 AMEKTG-EAPSKVRFFRRQMNNMIVKACEDAGLVATPSRRTYTLNHWLKQRQQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ + + ++ P + LPD + G DKW FV L SA +E E FG
Sbjct: 124 GYNEAAATNASVAYPALDAIALPDAVRGDRSDKWTFVSLEASAFEE---MKEWDIRFGEG 180
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLI-LSVGISTR 326
L L + D K IPG + S RA PLAAWM+GLE+ +++ + ++ L G+S
Sbjct: 181 FPLALADLSPDTK--IPGFIIYSQRALPLAAWMSGLELVALKFKSKPLPILSLETGLSDS 238
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+I AN + +E + +E K G+HFLAIQ D E GFWLL D
Sbjct: 239 WILANL-TDQSGVAEGKGFEDTKNKAEGVHFLAIQPRPDVETFSGFWLLKD 288
>gi|416389975|ref|ZP_11685424.1| hypothetical protein CWATWH0003_2245 [Crocosphaera watsonii WH
0003]
gi|357264130|gb|EHJ13056.1| hypothetical protein CWATWH0003_2245 [Crocosphaera watsonii WH
0003]
Length = 289
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/291 (37%), Positives = 155/291 (53%), Gaps = 21/291 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCD------GSL--SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ GSL +Y ++ P +NS+ L+EAI
Sbjct: 5 WELDFYSRPILDENKKKQWEVLICETQTDSQGSLEDGFRYAQFCPPKTVNSMWLREAIET 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P K+RFFR QM +I KAC++ + PS+R +L WL++R + Y
Sbjct: 65 AMEKTG-EAPSKVRFFRRQMNNMIVKACEDAGLVATPSRRTYTLNHWLKQRQQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLFG---DKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ + + ++ P + LPD + G DKW FV L SA +E E FG
Sbjct: 124 GYNEAAATNASVAYPALDAIALPDAVRGDRSDKWTFVSLEASAFEE---MKEWDIRFGEG 180
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLI-LSVGISTR 326
L L + D K IPG + S RA PLAAWM+GLE+ +++ + ++ L G+S
Sbjct: 181 FPLALADLSPDTK--IPGFIIYSQRALPLAAWMSGLELVALKFKSKPLPILSLETGLSDS 238
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+I AN + +E + +E K G+HFLAIQ D E GFWLL D
Sbjct: 239 WILANL-TDQSGVAEGKGFEDTKNKAEGVHFLAIQPRPDVETFSGFWLLKD 288
>gi|428223149|ref|YP_007107319.1| hypothetical protein Syn7502_03320 [Synechococcus sp. PCC 7502]
gi|427996489|gb|AFY75184.1| Protein of unknown function (DUF1092) [Synechococcus sp. PCC 7502]
Length = 299
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/299 (37%), Positives = 159/299 (53%), Gaps = 30/299 (10%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDG----SLSLQYTKYFPNNVINSITLK-EAIVAICD 155
WELDF SRP+LD KKIWEL++C+ S Q+ K +NS L E +AI
Sbjct: 5 WELDFYSRPVLDENQKKIWELLICNSPDRSSQPFQWIKECNAQEVNSGWLATELKLAIAH 64
Query: 156 D--LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
+ LG P+K+RF+R M IIT+ CK+ ++ P PS+R +L WL+ R E++Y + G
Sbjct: 65 NASLGNRDPQKVRFYRPSMTNIITRGCKQAELIPQPSRRLFTLSSWLQTRMESIYPQREG 124
Query: 214 F-QKGSKPL---LALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLD 269
F +PL + + P PD L G+ W L + QE + E FG
Sbjct: 125 FIAPDPQPLPLKIGIQVPVAKPAPDALMGESWLVASLKVADFQE---ATEWSMDFGELFA 181
Query: 270 LDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG--SLILSVGISTRY 327
LD + D +TLI GL + SSRA LAAWM G++ +++ + + G LIL G +R+
Sbjct: 182 LDHIS---DPETLISGLIITSSRALALAAWMAGVDPVALKFEVSEGKIQLILEAGEESRW 238
Query: 328 IY-----ANYK------KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
I AN K + P S+A ++E AKK G+HF+AIQ L+ E GFWLL
Sbjct: 239 ILTTLNTANPKGQKSAERIPKVISQAGSFEQAKKNSNGIHFIAIQTSLEVEHFTGFWLL 297
>gi|219117107|ref|XP_002179348.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409239|gb|EEC49171.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 278
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 103/291 (35%), Positives = 158/291 (54%), Gaps = 27/291 (9%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGV 159
EWELD SRP+ + GKK+WE+++ D + S ++ + P+N +NS TL++ + + + V
Sbjct: 1 EWELDCYSRPVA-VAGKKLWEVLITDSAGSFRFRQTLPSNQVNSKTLRQIVDDLMERADV 59
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK--- 216
P IRFFR M +I A EL + PS+ +L WLE+R+E VY + GF
Sbjct: 60 K-PNTIRFFRGAMFNMINIALMELPVTSKPSRCTFALASWLEDRHENVYPQMEGFNANMV 118
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGI- 275
GS LD P+ LPD L G+K+AFV LP ++F+ G S+D +G+
Sbjct: 119 GSTIPSFLDVRTPVRLPDALRGEKYAFVALPV-----------AEFLPGGSVDATNIGVG 167
Query: 276 -------EVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYI 328
++ + G+ + ++RA+ LA+W+ G EV ++ D + L++ I T+Y+
Sbjct: 168 RICTIPRDIPADAFVQGVVILTNRAEALASWLAGTEVVALTADLRKRVLVMETDIDTQYL 227
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
A K N EA + E K GLHF+++QE DS D GFWLL +LP
Sbjct: 228 MA--KLNESQRVEAASLEEGKAGLKGLHFVSVQENEDS-DPTGFWLLRELP 275
>gi|427711791|ref|YP_007060415.1| hypothetical protein Syn6312_0651 [Synechococcus sp. PCC 6312]
gi|427375920|gb|AFY59872.1| Protein of unknown function (DUF1092) [Synechococcus sp. PCC 6312]
Length = 285
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 146/285 (51%), Gaps = 16/285 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITL----KEAIVAIC 154
T WELDF SRPILD + KK+WE+++C+ L+ Q+ KY N+ L +EA+
Sbjct: 3 TIWELDFYSRPILDAQQKKLWEVLICNRQLTFQFAKYCSGAEANARWLMSAIQEAVQQWQ 62
Query: 155 DDLGVP---IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
+ +P PE+IRFFR M +II + C+ I + S+R L WL ER E VY +
Sbjct: 63 QEFNLPESERPERIRFFRRPMNSIILRGCEAAGIPGLASRRTFGLYEWLAERQEQVYPQT 122
Query: 212 PGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLD 271
PG+Q P L + LPD L G KW FV LP E ++ E + FG L
Sbjct: 123 PGYQPLIAPPPELPQAKALPLPDALQGQKWQFVSLP---AGEFANATEWEIKFGEVFSLS 179
Query: 272 LLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTRYIYA 330
L D ++LIPG+ + S RA PLAAWM+GLE + + L+L G R+
Sbjct: 180 GL----DPESLIPGIIIYSQRALPLAAWMSGLEPACLSLELGPDPQLVLETGADDRWTLV 235
Query: 331 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ T+ AA+ LHFLA+Q + ED GFWL+
Sbjct: 236 TLPNKDLITAAEAF-MAAQAQVKNLHFLAVQASPEREDFAGFWLM 279
>gi|162606540|ref|XP_001713300.1| hypothetical protein GTHECHR2175 [Guillardia theta]
gi|12580766|emb|CAC27084.1| hypothetical protein [Guillardia theta]
Length = 323
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 155/278 (55%), Gaps = 13/278 (4%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP++ GKK+WEL++ + SLQ + PNN++NS L+ ++ I +
Sbjct: 48 WELDFFSRPVILDDGKKLWELIIVNKDKSLQIIESVPNNMVNSKELRRKLLNIINS-AEK 106
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQ---KG 217
P+ I+FFR+QM +I+ A +LDI PS+R +L + ER +T+Y G++ +
Sbjct: 107 KPDVIKFFRAQMFNMISIALSDLDINVKPSRRTYALFEIIREREKTIYPEMIGYKPYLRE 166
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
K L+L FP +PD L G+ ++FV ++++E L+ + V S +D ++
Sbjct: 167 YKEDLSLKR-FPQRMPDILLGENFSFV---LASLEEINVILKDQSVMKDSFKIDENKYDI 222
Query: 278 DDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPV 337
D IPG+ + S+RA LA W+NGLEV SI D + S++L + T++++A K +
Sbjct: 223 DK---IPGIVILSNRANSLANWINGLEVFSISFDQEKSSIVLDCSLDTKFLFA--KIDIK 277
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ +E K+ G HF+++ L GFWLL
Sbjct: 278 KIQDGTKFENQKRLNSGFHFISVMSGLPENKIYGFWLL 315
>gi|443322105|ref|ZP_21051138.1| Protein of unknown function (DUF1092) [Gloeocapsa sp. PCC 73106]
gi|442788158|gb|ELR97858.1| Protein of unknown function (DUF1092) [Gloeocapsa sp. PCC 73106]
Length = 297
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 111/304 (36%), Positives = 160/304 (52%), Gaps = 35/304 (11%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG--------SLSLQYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+++C+ L +Y ++ P+ +NS+ L EAI
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVLICESPQQISTNPDLIYKYAQFCPSTSVNSLWLAEAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G IP KIRFFR QM+ +ITKAC+E+ + P+PS+R +L W+ ER + Y
Sbjct: 63 KQAIAESG-QIPSKIRFFRRQMKNMITKACEEVAVIPVPSRRTHTLNHWIVERLKNHY-- 119
Query: 211 HPGFQKGSKPLLALDNPFP----MELPDNLF---GDKWAFVQLPFSAVQEEVSSLESKFV 263
P + +P + LPD + GDKW V LP VQ+ + +
Sbjct: 120 -PTLDNYDSQAINASVQYPPLNAIALPDAVRGDKGDKWTLVTLP---VQDFIEMDQWDIA 175
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEV--CSIE--TDTAR----- 314
FG + L L ++D + IPG+ + S+RA PLA W++GLE+ C +E T + R
Sbjct: 176 FGEAFPLSLY--DLDPQLSIPGVIIFSNRAIPLAGWLSGLEIGSCYVEDITPSTREIVRQ 233
Query: 315 -GSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 373
L L G+S +I A+ + SEA + AK +HFLAIQ +S+ G W
Sbjct: 234 LSRLRLETGLSDSWILADI-TDEQGQSEARGFTKAKNLVQQIHFLAIQSSPESDSFAGLW 292
Query: 374 LLLD 377
LL D
Sbjct: 293 LLKD 296
>gi|160331683|ref|XP_001712548.1| hypothetical protein HAN_3g413 [Hemiselmis andersenii]
gi|159765997|gb|ABW98223.1| hypothetical protein HAN_3g413 [Hemiselmis andersenii]
Length = 337
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/306 (32%), Positives = 160/306 (52%), Gaps = 32/306 (10%)
Query: 84 QELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINS 143
++S +E + E I WELDF SRP++D GKK+WE+++ D + ++ + PNN++NS
Sbjct: 45 NKISMKNELINEEII--WELDFFSRPVVDENGKKLWEIIIVDQKGNFEHIETVPNNLVNS 102
Query: 144 ITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEER 203
LK+ I + D P+ I+FFRSQM +I A +LD+ PS+R SL + ER
Sbjct: 103 KELKKRIKILLDKSDKK-PKVIKFFRSQMFNMINIALSDLDLIVRPSRRTFSLYNKISER 161
Query: 204 YETVYTRHPGFQKGSKPLL------ALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSS 257
E +Y KG +P + A P ++PD L G+K+ F L +E+SS
Sbjct: 162 EEKIYPN----MKGYRPFMRESDFNASLKKVPQKMPDALRGEKYIFASLS----SDELSS 213
Query: 258 LESKFVFGASLDLDLLGI-----EVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDT 312
+ S D+ G E D IPG+ + S RAK L+ W++G+E+C++ D
Sbjct: 214 INSS-------DIAFSGFCPLPAEFDKNQQIPGIVIYSERAKSLSGWLDGVELCNVFCDL 266
Query: 313 ARGSLILSVGISTRYIYANY---KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDC 369
+LIL G+ ++++A + K + + E + +E KK G+HF+A+Q +
Sbjct: 267 ENKNLILECGLDIQFLFAKFSETKNSKNSNFEPKFFEKNKKKSQGIHFVAVQSYSKQNEI 326
Query: 370 VGFWLL 375
G W L
Sbjct: 327 AGIWTL 332
>gi|86605930|ref|YP_474693.1| hypothetical protein CYA_1247 [Synechococcus sp. JA-3-3Ab]
gi|86554472|gb|ABC99430.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 285
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 96/276 (34%), Positives = 154/276 (55%), Gaps = 11/276 (3%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF + P+ D +G+++WEL+VCD S L+ KY N +NS + + + + P
Sbjct: 16 WQMDFNAVPLRDGQGRRVWELLVCDASGQLRQAKYCSNQEVNSTWVAQQLRGYLEAAPQP 75
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P IR FR++M +I+ +AC I +PS+R +L W+ ER E VY + F +P
Sbjct: 76 -PAAIRVFRARMSSILQRACNAAGIPMLPSRRVYALKAWMRERAEQVYPQETQFTYSPEP 134
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQE-EVSSLESKFVFGASLDLDLLGIEVDD 279
+ + P P+ LPD L G++WAFV L ++E E +E FG ++ D
Sbjct: 135 PVEPEPPDPIPLPDKLQGERWAFVTLRARDLREAETWPME----FGELFPVNWEAWAPD- 189
Query: 280 KTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTT 339
T+IPGL +AS RA P+AAW++G+E + A G L+ G++ Y++A K +
Sbjct: 190 -TIIPGLVIASRRALPIAAWLSGMEPAYLH--VAEGRLLFEAGLNDCYLFAQLKDEKL-R 245
Query: 340 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+EAE + ++ G+HFLAIQ + ++ GFWL+
Sbjct: 246 AEAEGFAQRQRQAQGIHFLAIQSDFRAQSFAGFWLM 281
>gi|224002018|ref|XP_002290681.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974103|gb|EED92433.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 359
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 164/305 (53%), Gaps = 29/305 (9%)
Query: 89 LDEETDPESITE-WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLK 147
++E T+ + ++E WELD SRP+L KK+WE+++ D S +++ + P+N +NS ++
Sbjct: 67 VEETTNWDKVSEEWELDCYSRPVLVDGKKKLWEILMTDSSGNMKVCRALPSNKVNSREVR 126
Query: 148 EAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
+ I D+ V P IRFFR M +I A E+D+ PS+ +L W+E+R V
Sbjct: 127 RVVEEIIDESEVK-PSTIRFFRGAMFNMINIALSEIDVIAKPSRCTFALAQWIEDRNRDV 185
Query: 208 YTRHPGFQKGSKPLLALDNPF-----PMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKF 262
Y + G++ + + F ++LPD L G+K+AFV LP ++F
Sbjct: 186 YPKMEGYRATMSGIGGIGGTFLDIRTAVKLPDALRGEKYAFVGLPL-----------AEF 234
Query: 263 VFGASLDLDLLGIE----VDD----KTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTAR 314
+ G +D + +G+ VD + + G+ + + RAK LA+W+ G EV ++ D +
Sbjct: 235 LPGGGIDNNNIGVGRLCPVDSTLAADSFVQGVVILTPRAKALASWLAGTEVAGLKADLRK 294
Query: 315 GSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWL 374
L++ I +Y+ A K N EA +E K A GLHF+++Q++ D +D GFWL
Sbjct: 295 RELVMETDIDNQYLMA--KLNDDQRREAAVYEEGKDALNGLHFISVQKDED-DDPAGFWL 351
Query: 375 LLDLP 379
L ++P
Sbjct: 352 LREIP 356
>gi|148242688|ref|YP_001227845.1| hypothetical protein SynRCC307_1589 [Synechococcus sp. RCC307]
gi|147850998|emb|CAK28492.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 283
Score = 158 bits (399), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 98/288 (34%), Positives = 160/288 (55%), Gaps = 23/288 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLK---EAIVAICDDL 157
WELDF SRP+LD GKK WE ++C G S Q+ ++ P + +NSI LK +A D+
Sbjct: 8 WELDFYSRPLLDENGKKRWEALICSGDGSFQWQRFCPADSVNSIWLKTALSDALAAADEA 67
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
P P+++R +RS M+T++ +A + + ++ +PS+RC +L+ WL+ER ++Y G G
Sbjct: 68 SSPAPKRLRCWRSSMRTMVQRAAEGVGLEMVPSRRCYALVEWLQEREASIYPEMEGHLNG 127
Query: 218 -SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGI- 275
P P+ LP+ + GD W + LP +++ E + +D G+
Sbjct: 128 PLAPPPQPLQAAPLPLPEAVRGDSWGWASLPAASLAE-----------ASEWPMDFSGLV 176
Query: 276 ---EVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYAN 331
+ ++PG+ + +SSRA LA W++GLE +E + L+L G+ R++ ++
Sbjct: 177 PLPNTKAEAMVPGVRLFSSSRALALAGWLSGLEPVRLEVCGQQ--LVLEAGLEDRWLVSD 234
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
+N S +A EAA++ GGL FLA+Q D+ + GFWLL DLP
Sbjct: 235 L-QNGEADSAQQALEAARQEAGGLQFLAVQSGPDATEFAGFWLLRDLP 281
>gi|399949996|gb|AFP65652.1| hypothetical protein CMESO_508 [Chroomonas mesostigmatica CCMP1168]
Length = 336
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 154/287 (53%), Gaps = 18/287 (6%)
Query: 97 SITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
S T WE+DF SRP+L+ GKK+WEL+V D + ++ + PNN+INS LK+ I A+ +
Sbjct: 58 SNTVWEIDFFSRPVLNEDGKKLWELIVVDQKGTFEHIEAIPNNLINSRELKKRINALIEK 117
Query: 157 LGVPIPEK---IRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
P+K I+FFRSQM +I A +L+I PS+R +L + ER E VY + G
Sbjct: 118 S----PQKPILIKFFRSQMFNMINIALSDLNINVRPSRRTFALFEKISEREENVYPKMSG 173
Query: 214 FQKGSKPLLALD--NPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLD 271
++ K + D P ++PD L G+K+ F + ++ E S + S FG L
Sbjct: 174 YRPFMKEVDVNDMLKKVPQKMPDTLRGEKYVFASI---SIPELESMVNSGINFGQMCPLP 230
Query: 272 LLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYAN 331
D IPG+ + S RAK L++W +G+E+ +I D ++++ G+ T+Y++
Sbjct: 231 K---NFDFNQKIPGIVILSERAKSLSSWFDGIELFNIICDLETKNIMIECGLDTQYLFGK 287
Query: 332 YKKNPV---TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ + + E + +E KK G+HF+A+QE + G W L
Sbjct: 288 FSEETIQDRVNLEPKLFEKNKKKSQGVHFIAVQEYSKKKPIYGIWTL 334
>gi|443478232|ref|ZP_21068010.1| protein of unknown function DUF1092 [Pseudanabaena biceps PCC 7429]
gi|443016503|gb|ELS31148.1| protein of unknown function DUF1092 [Pseudanabaena biceps PCC 7429]
Length = 284
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 142/284 (50%), Gaps = 17/284 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP+LD KK+WEL++CD ++ + P+ +NS L + + C
Sbjct: 5 WELDFYSRPLLDANNKKVWELLICDRDRQFEWVRECPSTEVNSEWLAKQLTD-CVATNGQ 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK---G 217
P KIRFFR M II + CK I S+R ++ WL ER ++Y GFQ
Sbjct: 64 TPIKIRFFRPSMTNIIMRGCKLAGITGQASRRVFTMSAWLAERMASIYPNRDGFQAVDPN 123
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
PL L P +PD L G++W V L S +E + E F LD+ L
Sbjct: 124 PLPLKVLAAQDPKPVPDALMGEQWISVSLKASDFEE---AKEWSMDFSELLDVSHL---- 176
Query: 278 DDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTA----RGSLILSVGISTRYIYANYK 333
D T++ G+ + S+RA LAAWM+G++ I+ + R + L R++ AN +
Sbjct: 177 DPDTIVAGIIIISARATALAAWMSGVDPVFIKFERNLLGDRTQMQLEASADARWVLANLQ 236
Query: 334 --KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
K+ + ++ +E AK+ G HFLAIQ + E GFW+L
Sbjct: 237 APKDKLAIAQGADFEKAKQKSQGFHFLAIQTNAEEEHFAGFWML 280
>gi|86608615|ref|YP_477377.1| hypothetical protein CYB_1137 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86557157|gb|ABD02114.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 293
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 156/279 (55%), Gaps = 9/279 (3%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF + P+ D + +++WEL+VCD + + +Y N +NS + + + + P
Sbjct: 16 WQMDFNAVPLRDEQNRRVWELLVCDPTGRFRQAQYCSNQEVNSTWVARQLRSYLEAAPQP 75
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P IR FR++M +I+ +AC + I +PS+R +L W+ ER E VY + F +P
Sbjct: 76 -PSAIRVFRARMSSILQRACDAVGIPMLPSRRVYTLKAWMRERAEQVYPQETQFTYSPEP 134
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQE-EVSSLESKFVFGASLDL---DLLGIE 276
+ D P P+ LPD L G++WAFV L ++E + +E +F + D D L
Sbjct: 135 PVDPDPPDPIRLPDKLQGERWAFVTLRAEDLREADAWPIEFGELFPVAWDTLTPDTLA-P 193
Query: 277 VDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNP 336
V TLIPGL + S RA P+AAWM+G+E + A G L+L G++ Y++A +
Sbjct: 194 VVRSTLIPGLVITSQRALPMAAWMSGMEPAYL--SVADGRLLLEAGLNDCYLFAQLRDET 251
Query: 337 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ T EAE + ++ GLHFLAIQ +L ++ GFWL+
Sbjct: 252 LRT-EAEVFAQRQQQAQGLHFLAIQTDLRAQSFAGFWLM 289
>gi|22299529|ref|NP_682776.1| hypothetical protein tlr1986 [Thermosynechococcus elongatus BP-1]
gi|22295712|dbj|BAC09538.1| tlr1986 [Thermosynechococcus elongatus BP-1]
Length = 287
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/289 (36%), Positives = 149/289 (51%), Gaps = 14/289 (4%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINS----ITLKEAIVAIC 154
T WELDF SRP++D KKIWEL+VCD Q++K N+ L+EA+
Sbjct: 3 TIWELDFYSRPLVDENNKKIWELLVCDRQQQFQFSKTCAGAEANARWLAAALEEAMDQWR 62
Query: 155 DDLGVP---IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
LG+ P+++RFFR M +IIT+ + + +PS+R +L WL +R Y
Sbjct: 63 QQLGLAEGVQPQRVRFFRRAMSSIITRGGEAAGLVMVPSRRTFALYDWLRDRATNFYPTL 122
Query: 212 PGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLD 271
P +Q L P P LP L GD+W LP ++ ++ E + FG L
Sbjct: 123 PNYQADLATPPQLPPPAPQPLPPALQGDRWQLSGLPLGEIK---TAAEWELPFGEVPPLP 179
Query: 272 LLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYA 330
L + +D TL+PGL + S RA PLA W++GLE S+ +T + LIL G S R+I
Sbjct: 180 FLTL--NDDTLLPGLIIYSQRALPLAGWLSGLEPASLSFEETPQPLLILETGASDRWILI 237
Query: 331 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
+NP E A++ A GLHF+A++E+ E GFWLL P
Sbjct: 238 R-GRNPQIQKELAAFKDACTQSQGLHFIAVKEQPTQETLQGFWLLQQTP 285
>gi|407958237|dbj|BAM51477.1| hypothetical protein BEST7613_2546 [Bacillus subtilis BEST7613]
Length = 271
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/272 (36%), Positives = 148/272 (54%), Gaps = 21/272 (7%)
Query: 118 IWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFR 169
+WE+++C+ S+Q Y++Y P++ +NS+ L++AI A + G +P+KIRFFR
Sbjct: 1 MWEVLICESPQSVQQLPGDLFRYSQYCPSSTVNSVWLRQAIEAAIAEAGQ-MPQKIRFFR 59
Query: 170 SQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNP-- 227
QM +I+KAC+E I P PS+R L WL +R E Y + PG+ ++ P
Sbjct: 60 RQMNNMISKACEEAGIPPAPSRRTYVLEQWLGDRLENFYPQQPGYDPKLASSTSVQYPEL 119
Query: 228 FPMELPDNLFGDK---WAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIP 284
+ LPD + GD+ WA V L A + + + FG S L + D + IP
Sbjct: 120 NAIALPDAVRGDRGDQWALVSL---AAADFNDLPDWEISFGESFPLSSYNLSPDSR--IP 174
Query: 285 GLAVASSRAKPLAAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYANYKKNPVTTSEAE 343
GL + S RA P AAW++GLE+ ++ +T R + L G S +I AN + + EA+
Sbjct: 175 GLILFSPRALPFAAWLSGLELGYLQYNTDPRPIMRLETGASDSWIVANV-TDKTSEQEAQ 233
Query: 344 AWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+E KK G+HFLAIQ DSE GFWLL
Sbjct: 234 GFEQTKKLAQGIHFLAIQTSPDSETFAGFWLL 265
>gi|397628715|gb|EJK69024.1| hypothetical protein THAOC_09759, partial [Thalassiosira oceanica]
Length = 382
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 150/278 (53%), Gaps = 28/278 (10%)
Query: 115 GKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQT 174
GKK+WE+++ D S +L+ + P+N +NS +++ + + + V P IRFFR M
Sbjct: 117 GKKLWEILITDSSGNLRVCRSLPSNKVNSREVRKVVEDVIGESEVK-PGTIRFFRGAMFN 175
Query: 175 IITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPF-----P 229
+I A E+D+ PS+ +L WLEER VY + G+Q L + F
Sbjct: 176 MINIALSEIDVVAKPSRCTFALAQWLEERNRDVYPQMEGYQAAKARLGGVGGTFLDIRTA 235
Query: 230 MELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIE----VDD----KT 281
++LPD L G+K+AFV LP ++F+ G S++ + +G+ VD +
Sbjct: 236 VKLPDALRGEKYAFVGLPL-----------AEFIEGGSVNNENIGVGRLCPVDSTLPADS 284
Query: 282 LIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSE 341
+ G+ + +SRAK LA+W+ G EV I+ D + L++ I +Y+ A K + E
Sbjct: 285 FVQGVVILTSRAKALASWLAGTEVGGIKADIRKRELVMETDIDNQYLMA--KLDDDQRRE 342
Query: 342 AEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
A +E K + GLHF+++QE+ +++D GFWLL ++P
Sbjct: 343 AANFEEGKDSLNGLHFVSVQED-ENDDPAGFWLLREIP 379
>gi|317969607|ref|ZP_07970997.1| hypothetical protein SCB02_08730 [Synechococcus sp. CB0205]
Length = 299
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 155/292 (53%), Gaps = 20/292 (6%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG------SLSLQYTKYFPNNVINSITLKEAI-- 150
+WELD+ SRPIL+ GKK WEL++C + Q+ P + +NS LK A+
Sbjct: 15 ADWELDYYSRPILEEDGKKRWELLICSSPNAENPGRAFQWVLKCPASSVNSQWLKSALEQ 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ D G P KIR +RS M+T++ +A ++L ++ +PS+RC +L+ WL+ER TVY
Sbjct: 75 ALEQADSEGFDPPRKIRCWRSSMRTMVQRASEQLGLELVPSRRCYALVEWLQEREATVYP 134
Query: 210 RHPGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
G+ G P P + LP+ GD W++ LP A++E + + ++ F
Sbjct: 135 EEEGYMAGPLAPPPQPIQPVAVPLPEAARGDSWSWASLPIGALREAM-TWDTSFA----- 188
Query: 269 DLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRY 327
L L +DD+ ++ GL + ++SR+ +A W++GLE +E + L+L G R+
Sbjct: 189 GLVPLPESLDDELMVSGLRLFSASRSLAIAGWVSGLEPVRLEVCGQQ--LVLEAGQEDRW 246
Query: 328 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
+ + + + A AA+ GG+ FLAIQ D GFW+L DLP
Sbjct: 247 LLGQLESDEAEAAAAAF-LAARGQVGGVQFLAIQSSPDQPGFDGFWILRDLP 297
>gi|87302524|ref|ZP_01085341.1| hypothetical protein WH5701_11459 [Synechococcus sp. WH 5701]
gi|87282868|gb|EAQ74825.1| hypothetical protein WH5701_11459 [Synechococcus sp. WH 5701]
Length = 299
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 100/293 (34%), Positives = 155/293 (52%), Gaps = 24/293 (8%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAI--- 150
+WELDF SRP+LD GKK W+L++ +S ++ K P + +NS+ L+ A+
Sbjct: 16 DWELDFFSRPVLDPGGKKRWDLLITATPVSEGSQPRFRWVKNCPASTVNSVWLQGALNEA 75
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
++ D G+ P ++R +R+ M+T++ +A + + ++ IPS+RC +L WL ER VY
Sbjct: 76 LSAAADQGLGAPRRLRCWRATMRTMVQRAAEAIGLEVIPSRRCYALAEWLSERERDVYPA 135
Query: 211 HPGFQKGSKPLLALDNP---FPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ G PL P P+ LP+ GD W +V LP A++ E S E F
Sbjct: 136 EEGYMAG--PLAPPPQPMRSLPLPLPEAARGDSWDWVSLPLGALR-EASEWEIGFEGLFP 192
Query: 268 LDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
L DL D ++PGL + S +R+ +A W+ GLE +E + SL+L G+ R
Sbjct: 193 LPADL-----PDDLMVPGLRLFSRTRSLAIAGWIAGLEPARLEMEGT--SLVLEAGLEDR 245
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
+ A + + AA++A GL F+A+Q E SE GFWLL D+P
Sbjct: 246 WRLATLAEQEASEVAEAF-AAAREAAAGLQFIAVQSEAQSERFDGFWLLRDMP 297
>gi|318041062|ref|ZP_07973018.1| hypothetical protein SCB01_05109 [Synechococcus sp. CB0101]
Length = 305
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 96/307 (31%), Positives = 156/307 (50%), Gaps = 26/307 (8%)
Query: 90 DEETDPESIT------EWELDFCSRPILDIRGKKIWELVVCD------GSLSLQYTKYFP 137
D+ DP T +WELD+ SRPIL+ GKK WEL++C ++ +
Sbjct: 6 DQVADPTRRTAAPLQLDWELDYYSRPILEPDGKKRWELLICSTPAPGASGPGFRFVQNCS 65
Query: 138 NNVINSITLKEAIVAICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCL 194
+ +NS LK+A+ + G P K+R +R+ M+T++++A ++L ++ IPS+RC
Sbjct: 66 ASSVNSQWLKQALEQAMEQAAAEGYAAPRKLRCWRASMRTMVSRAAEQLSLELIPSRRCY 125
Query: 195 SLLLWLEERYETVYTRHPGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQE 253
+L+ WL+ER TVY G+ G P P + LP+ GD W++ LP A++E
Sbjct: 126 ALVEWLQERQATVYPAEEGYMAGPLAPAPLPIQPVAVPLPEAARGDSWSWASLPLGALRE 185
Query: 254 EVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDT 312
+ E F + LD G DD ++ GL + +++R+ +A W+ GLE +E
Sbjct: 186 ---AAEWDVSFAGLVPLDGTG---DDDVMVSGLRLFSATRSLAIAGWIAGLEPVRLEVSG 239
Query: 313 ARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGF 372
L+L G+ R++ N + + AA++ GG+ FLA+Q GF
Sbjct: 240 --NQLVLEAGLEDRWLLGNLEAEEAEAAAQAF-RAARQQAGGVQFLAVQSSDAQNGFDGF 296
Query: 373 WLLLDLP 379
W+L DLP
Sbjct: 297 WVLRDLP 303
>gi|33863502|ref|NP_895062.1| hypothetical protein PMT1234 [Prochlorococcus marinus str. MIT
9313]
gi|33640951|emb|CAE21409.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 299
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 152/299 (50%), Gaps = 19/299 (6%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLK 147
TD T+WELDF SRPIL+ GKK WEL++ G+ ++ K P +NS+ L
Sbjct: 10 TDQHPKTDWELDFYSRPILESDGKKRWELLISSSQDPSGTAPFRWVKRCPAGEVNSLWLT 69
Query: 148 EAIVAICDD---LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
+A+ D G P ++R +R M+T++ +A EL I+ IPS+R +LL WL ER
Sbjct: 70 DALREALKDSQEQGWEAPLRLRCWRISMRTMVQRAAAELGIEVIPSRRTYALLDWLAERE 129
Query: 205 ETVYTRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
VY G+ G P P+ LP+ + GD W++ LP ++E + E
Sbjct: 130 RDVYPLEEGYMAGPLAPPPTPIPTPPVPLPEAVRGDAWSWASLPLGLLRE---AQEWPIG 186
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVG 322
FG L +G +D +PG+ + S +RA LA W+ GLE + D + L+L G
Sbjct: 187 FGGLLP---VGANDNDNIPVPGVRMFSQTRALALAGWLGGLEPVCLAVDGTQ--LMLEAG 241
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
R++ + T + EA ++A GGL F+++Q + + GFW+L DLP P
Sbjct: 242 QDDRWLVTDLDDKTATAVQQSLLEAREQA-GGLQFISVQTSPEEKRFAGFWMLRDLPQP 299
>gi|323451508|gb|EGB07385.1| hypothetical protein AURANDRAFT_27892 [Aureococcus anophagefferens]
Length = 345
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 94/286 (32%), Positives = 145/286 (50%), Gaps = 16/286 (5%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
+EWELD SRP+L ++GKK+WEL++ D S + P +NS+ +++AI +
Sbjct: 61 SEWELDCFSRPVL-VKGKKLWELLITDASGQWRDVVALPATGVNSVAVRKAIEDVIARAP 119
Query: 159 VPIPEKIRFFRSQMQTIITKACKEL-----DIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
V P IRFFR QM ++T A + ++ PS+ +L W+EER VY G
Sbjct: 120 VK-PTVIRFFRRQMLNMLTIALNGVAANRPTLRVTPSRATHALYDWIEEREADVYPGMEG 178
Query: 214 FQKGSKPLLALDNPFPM---ELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
+ G+ P+ LP+ L G+++AFV LP S V E G +++
Sbjct: 179 YSPGAGAATRDRMTAPVTASRLPEGLRGEQYAFVTLPLSEVLSGGGITEENVGVGKLINV 238
Query: 271 DLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYA 330
EVD L+PG+A+ + R+ LA + E+ + D A+ L+L V + ++ A
Sbjct: 239 KP-AYEVD--ALLPGIAILTRRSDALAMSLASTELAGVRADAAQRQLVLDVALDESFLVA 295
Query: 331 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQE-ELDSEDCVGFWLL 375
+ EA A+E AK+ GGLHF+ +Q E D + GFWLL
Sbjct: 296 KLDDD--QRVEAAAFEKAKQGLGGLHFVVVQSPEDDGVEPAGFWLL 339
>gi|284928976|ref|YP_003421498.1| hypothetical protein UCYN_04030 [cyanobacterium UCYN-A]
gi|284809435|gb|ADB95140.1| Protein of unknown function (DUF1092) [cyanobacterium UCYN-A]
Length = 293
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/291 (32%), Positives = 147/291 (50%), Gaps = 22/291 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSL--------SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRP KK+WE+++C+ + ++++ P++ +NSI L++AI
Sbjct: 5 WELDFYSRPNFFKHNKKLWEVLICETPMYSNKSFNDCFKFSQLCPSSTVNSIWLRQAIEK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P+ IRFFR QMQ +I KACK+ +I+ IPS+R +L W+++R +
Sbjct: 65 AMKKAGES-PDLIRFFRFQMQNMIIKACKDAEIEAIPSRRTFALNYWIDKREKQFKLVKN 123
Query: 213 GFQKGSKPLLALDNPFPM-ELPDNLFGDKWAFVQLPFSAVQEEVSSL----ESKFVFGAS 267
+ D M LPD L ++++ + V +VS E FG +
Sbjct: 124 RINNTVSTINRTDTDSQMVSLPDTLKDNQFS----KYFCVDLKVSDFNHIDEWDIGFGEN 179
Query: 268 LDLDLLGIEVDDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTR 326
+ G+ T+IPGL S RA P+AAW++G E+ S+ D S L L G++ +
Sbjct: 180 YAISPYGLS--SHTIIPGLVFFSPRALPIAAWLSGFELVSLRFDRKNSSTLYLETGLNDK 237
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+ N + EA+ +E K+ G+HFLAIQ D E GFWLL D
Sbjct: 238 SVLINL-NDIRLIQEAKNFERKKENSKGIHFLAIQPSPDVELFSGFWLLKD 287
>gi|427701381|ref|YP_007044603.1| hypothetical protein Cyagr_0042 [Cyanobium gracile PCC 6307]
gi|427344549|gb|AFY27262.1| Protein of unknown function (DUF1092) [Cyanobium gracile PCC 6307]
Length = 296
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 97/292 (33%), Positives = 149/292 (51%), Gaps = 22/292 (7%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLSLQ-------YTKYFPNNVINSITLKEAIVA 152
+WELD+ SRPIL+ GKK WEL++C + LQ ++ P +NS L+ AI A
Sbjct: 13 DWELDYYSRPILEADGKKRWELLICS-TAGLQPTPDPFRWSMDCPAASVNSQWLRGAIEA 71
Query: 153 ICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
G P ++R +R M+ ++ +A + L ++ +PS+RC L+ WL ER +VY
Sbjct: 72 ALAAAAEQGYGPPRRLRCWRGSMRAMVQRAAEGLGLELVPSRRCYGLVEWLRERQASVYP 131
Query: 210 RHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
PG+ G P P + LP+ GD+W++ L +A E E F
Sbjct: 132 LEPGYMAGPLAPPPQPIPPVALPLPEAARGDRWSWATL-TAATLAEAGGWEIAFP----- 185
Query: 269 DLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRY 327
L L +D T +PG+ + S RA +A W++GLE +E G L+L G+ R+
Sbjct: 186 GLVALPSAIDPATPVPGIRLFSRRRALAIAGWLSGLEPTRLEVSA--GQLVLEAGLEDRW 243
Query: 328 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
I A + ++ +A+ A++ GGL F+AIQ ++ GFWLL DLP
Sbjct: 244 ILARLPEEEARLAQ-QAFAEARERAGGLQFIAIQASEEASTLEGFWLLRDLP 294
>gi|124022483|ref|YP_001016790.1| hypothetical protein P9303_07741 [Prochlorococcus marinus str. MIT
9303]
gi|123962769|gb|ABM77525.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 299
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 100/300 (33%), Positives = 153/300 (51%), Gaps = 21/300 (7%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLK 147
TD T+WELDF SRPIL+ GKK WEL++ G+ ++ K P +NS+ L
Sbjct: 10 TDQHPKTDWELDFYSRPILESDGKKRWELLISSSQDPSGTAPFRWVKRCPAGEVNSLWLT 69
Query: 148 EAIVAICDD---LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
+A+ D G P ++R +R M+T++ +A EL I+ IPS+R +LL WL ER
Sbjct: 70 DALREALKDSQGQGWEAPLRLRCWRISMRTMVQRAAAELGIEVIPSRRTYALLDWLAERE 129
Query: 205 ETVYTRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
VY G+ G P P+ LP+ + GD W++ LP ++E + E
Sbjct: 130 RDVYPLEEGYMAGPLAPPPTPIPTPPVPLPEAVRGDAWSWASLPLGLLRE---AQEWPIG 186
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLE-VCSIETDTARGSLILSV 321
FG L +G +D +PG+ + S +RA LA W+ GLE VC + T L+L
Sbjct: 187 FGGLLP---VGANDNDNIPVPGVRMFSQTRALALAGWLGGLEPVCLVVDGT---QLMLEA 240
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
G R++ + + + E+ ++A GGL F+++Q + + GFW+L DLP P
Sbjct: 241 GQDDRWLVTDLDEKTAKAVQQSLLESREQA-GGLQFISVQTSPEEKRFAGFWMLRDLPQP 299
>gi|82799327|gb|ABB92253.1| conserved hypothetical protein [uncultured marine type-A
Synechococcus 5B2]
Length = 293
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 159/292 (54%), Gaps = 22/292 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVV-----CDGS-LSLQYTKYFPNNVINSITLKEAIVA 152
+WELDF SRPIL+ G+K WEL++ D S + ++ K P+ +NS+ L A+
Sbjct: 9 ADWELDFYSRPILEADGRKCWELLITATPAADASEQTFRFAKRCPSGEVNSLWLSTALKE 68
Query: 153 ICD---DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
D + G P ++R +RS M+T++ +A +LD++ I S+R SLL WL++R + VY
Sbjct: 69 ARDRAVEAGWSEPRRLRCWRSSMRTMVQRAAADLDLEMIASRRTYSLLDWLQQREQEVYP 128
Query: 210 RHPGFQKG--SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
+ GF G + P + + P + LP+ + GD W++ LP + +++ + + F
Sbjct: 129 QEEGFMAGPLAPPPVPIATP-AVPLPEEVQGDAWSWASLPAALLRD---ACDWPIGFSGL 184
Query: 268 LDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
L L + ++D +PGL + ++SRA +A W+ GLE + + + L+L G R
Sbjct: 185 LPLP---VALEDDQAVPGLRLFSNSRALAMAGWLGGLEPVRLMVEGRQ--LVLEAGQDDR 239
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
++ ++ + + +AE +K+ GL F+AIQ + + GFW++ D+
Sbjct: 240 WLVSDLDPSTAASIKAEL-NQSKEHAKGLQFIAIQSSPEEQAFAGFWMMRDI 290
>gi|116075331|ref|ZP_01472591.1| hypothetical protein RS9916_27264 [Synechococcus sp. RS9916]
gi|116067528|gb|EAU73282.1| hypothetical protein RS9916_27264 [Synechococcus sp. RS9916]
Length = 299
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 93/295 (31%), Positives = 149/295 (50%), Gaps = 23/295 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLKEAI--- 150
+WELDF SRPIL+ GKK WEL++ G + +Y + P +NS L EA+
Sbjct: 16 ADWELDFYSRPILEPDGKKRWELLISSTPELGGGEAFRYARRCPAGEVNSTWLTEALRDA 75
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ + G P ++R +RS M+T++ +A LD++ +PS+R +L+ W+ ER VY +
Sbjct: 76 MTAAEADGWRAPRRLRSWRSAMRTMVQRAAAALDLEMVPSRRTYALIDWMAERDREVYPK 135
Query: 211 HPGFQKG--SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
G+ G + P +A+ P + LP+ + GD ++ LP ++ E + E F L
Sbjct: 136 EEGYMAGPLAPPPVAVSTPA-IPLPEAVRGDALSWANLPLGSLAE---AKEWPLGFNGLL 191
Query: 269 DLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRY 327
+ +D IPGL + +S+RA LA W+ GLE + D + LIL G +
Sbjct: 192 PIP---EGLDPAQPIPGLRLFSSTRALALAGWLGGLEPVRLRIDGRQ--LILDAGQDDSW 246
Query: 328 IYANYKKNPVTTSEA-EAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
+ + +P + A +A + GL F+A+Q D GFW+L D P P
Sbjct: 247 LVTDL--DPASAEAAKQALAETRTTASGLQFIAVQTTPDHPRFEGFWMLRDQPEP 299
>gi|33239980|ref|NP_874922.1| hypothetical protein Pro0529 [Prochlorococcus marinus subsp.
marinus str. CCMP1375]
gi|33237506|gb|AAP99574.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
str. CCMP1375]
Length = 297
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 154/294 (52%), Gaps = 19/294 (6%)
Query: 95 PESITEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLKEA 149
P + +WE+DF SRP+++I GKK WEL++ G+ + ++ K P N +NSI L EA
Sbjct: 10 PLNKADWEVDFYSRPVIEIDGKKRWELLISSTQDFSGAETFRWEKKCPANEVNSIWLSEA 69
Query: 150 IVAICDD---LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYET 206
+ +D G P+++R +R+ M+T+ITKA +++ I+ I S+R SL WL +R +
Sbjct: 70 LKEALEDSSKQGWAFPKRLRCWRTSMKTMITKASEKVGIEVIESRRTFSLHEWLLQRDKD 129
Query: 207 VYTRHPGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFG 265
VY G+ P ++D P LP+ L GD W+F L A++ + E F
Sbjct: 130 VYPNEEGYISAPIPPNPSIDFTQPEPLPEALRGDAWSFSSLSIEAIR---GAREWPMEFN 186
Query: 266 ASLDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGIS 324
A L + ++ IPGL + S +RA PL+AW++GLE + + L+L G
Sbjct: 187 ALLPIKK---SLEGNIEIPGLRMFSKTRALPLSAWLSGLEPVRLLVEN--NQLLLESGQE 241
Query: 325 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ ++ + K+ ++ K G+ F+AIQ + E GFW+L D+
Sbjct: 242 SLWLVTDMSKD-YAEKVKDSLINGKANADGIQFIAIQTSPEEESFTGFWMLKDI 294
>gi|78185205|ref|YP_377640.1| hypothetical protein Syncc9902_1638 [Synechococcus sp. CC9902]
gi|78169499|gb|ABB26596.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 293
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 93/294 (31%), Positives = 152/294 (51%), Gaps = 24/294 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVC------DGSLSLQYTKYFPNNVINSITLKEAIV- 151
++WELDF SRPILD G+K WEL++ DG ++ K P++ +NSI L A+
Sbjct: 9 SDWELDFYSRPILDADGRKRWELLITTTPSSEDGDTPFRFAKVCPSSEVNSIWLNTALAE 68
Query: 152 ----AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
A+ + G P+ ++R +RS M+T++ +A E DI+ I S+R +LL WLE R V
Sbjct: 69 ARESALQEGYGAPV--RLRCWRSSMRTMVQRAATEQDIEVISSRRTFALLDWLEHREREV 126
Query: 208 YTRHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGA 266
Y + GF P A P+ LP+ + GD W++ LP +++ + + F
Sbjct: 127 YPKEEGFMAGPLAPPPAPVVTPPIPLPEEVQGDAWSWATLPAGLLRD---AGDWPMSFSG 183
Query: 267 SLDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGIST 325
L + ++D+ +PGL + S +R+ +A W+ GLE + + + LIL G
Sbjct: 184 LLPVP---TNLEDEAQVPGLRLFSRTRSLAMAGWLGGLEPVRLLVEGRQ--LILEAGQDD 238
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
R++ ++ S A E + + GL F+AIQ D + GFW++ D+P
Sbjct: 239 RWLVSDL-DGEAAKSITSALETCQTSVRGLQFIAIQASPDEQAFAGFWMMRDIP 291
>gi|116072198|ref|ZP_01469465.1| hypothetical protein BL107_10441 [Synechococcus sp. BL107]
gi|116064720|gb|EAU70479.1| hypothetical protein BL107_10441 [Synechococcus sp. BL107]
Length = 293
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/294 (31%), Positives = 153/294 (52%), Gaps = 24/294 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVC------DGSLSLQYTKYFPNNVINSITLKEAIV- 151
++WELDF SRPIL G+K WEL++ DG ++ K P+ +NS+ L A+
Sbjct: 9 SDWELDFYSRPILGADGRKRWELLITTTPSSEDGDSPFRFAKVCPSTEVNSLWLSSALSE 68
Query: 152 ----AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
A+ G P+ ++R +RS M+T++ +A E DI+ I S+R +LL WLE+R V
Sbjct: 69 AREQALQAGYGAPV--RLRCWRSSMRTMVQRAATEQDIEVISSRRTFALLDWLEQREREV 126
Query: 208 YTRHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGA 266
Y + GF P A P+ LP+ + GD W++ LP +++ + + F
Sbjct: 127 YPKEEGFMAGPLAPPPAPVQTPPIPLPEEVQGDAWSWATLPAGLLRD---ADDWPMSFSG 183
Query: 267 SLDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGIST 325
L + ++D+ +PGL + S +R+ +A W+ GLE + + + LIL G
Sbjct: 184 LLPVP---TNLEDEAQVPGLRLFSQTRSLAMAGWLGGLEPVRLLVEGRQ--LILEAGQDD 238
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
R++ ++ + S A A E ++ + GL F+AIQ D + GFW++ D+P
Sbjct: 239 RWLVSDL-DGEASKSIASALETSQTSVRGLQFIAIQASPDEQAFAGFWMMRDIP 291
>gi|452822989|gb|EME30003.1| hypothetical protein Gasu_25920 [Galdieria sulphuraria]
Length = 366
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 94/284 (33%), Positives = 144/284 (50%), Gaps = 12/284 (4%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
T WELDF SRP+ K+IWEL+V D S L + + PN++INS L++ + + + +
Sbjct: 88 TVWELDFYSRPVYGKDNKRIWELIVVDESFLLCHVESVPNDMINSAELRKRVERLLEQVT 147
Query: 159 VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGS 218
V P+ ++F R M +I+ A K+L + PS+R L L +R +Y++ PG++ S
Sbjct: 148 VK-PKVVKFSRMPMFNMISLALKDLGFEVKPSRRTYRLYHVLRDREANIYSKMPGYR--S 204
Query: 219 KPLLALDNPFPME-LPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
+ L+ + E LPD L G+K+AF +S + E SS + D+ G +
Sbjct: 205 ENTLSTSYLYSTERLPDALRGEKFAFCTADYSFLYELQSSDTIPYC-----DIFNTGDSI 259
Query: 278 DDKTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPV 337
+ +PG+ V S RA LA+W G EV I+ L+L GI++ Y A ++
Sbjct: 260 LLEKELPGIIVYSERADSLASWTAGAEVSFIKFREEELELVLECGINSHYRLAKIAEDHR 319
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQ---EELDSEDCVGFWLLLDL 378
EA+ +E K G HF AIQ E + G WLL D
Sbjct: 320 LVEEAKTFEQMKWHMKGFHFYAIQSLKETSGTSHIKGLWLLNDF 363
>gi|90655540|gb|ABD96379.1| unknown [uncultured marine type-A Synechococcus GOM 3O12]
Length = 293
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/291 (31%), Positives = 148/291 (50%), Gaps = 20/291 (6%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIV- 151
+WELDF SRPIL+ G+K WEL++ + +++K P+ +NSI L A+
Sbjct: 9 ADWELDFYSRPILESDGRKRWELLITATPAADARETPFRFSKCCPSGEVNSIWLSSALAE 68
Query: 152 --AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
D G P P ++R +RS M+T++ +A ELD++ I S+R +LL WL++R + VY
Sbjct: 69 ARQCAVDAGWPAPRRLRCWRSSMRTMVQRAATELDLEMIASRRTYALLDWLQQREQEVYP 128
Query: 210 RHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
GF P A P+ LP+ + GD W++ LP +++ S F L
Sbjct: 129 LEEGFMAGPLAPPPAPIATPPVPLPEEVQGDAWSWASLPADLLRDAADWPTS---FSGLL 185
Query: 269 DLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRY 327
L G++ D +PGL + +SSRA +A W+ GLE + + + L+L G R+
Sbjct: 186 PLP-KGLDTDQP--VPGLRLFSSSRALAMAGWLGGLEPVRLLVEGRQ--LVLEAGQDDRW 240
Query: 328 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ ++ + +K+ GL F+AIQ D + GFW++ D+
Sbjct: 241 LVSDLDSAAADAIAGDL-GRSKERGKGLQFIAIQTSPDEQAFAGFWMMRDI 290
>gi|88807699|ref|ZP_01123211.1| hypothetical protein WH7805_14148 [Synechococcus sp. WH 7805]
gi|88788913|gb|EAR20068.1| hypothetical protein WH7805_14148 [Synechococcus sp. WH 7805]
Length = 304
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 153/302 (50%), Gaps = 25/302 (8%)
Query: 91 EETDPESITEWELDFCSRPILDIRGKKIWELVVCDGS-----LSLQYTKYFPNNVINSI- 144
E++ + +WELDF SRPIL+ GKK WEL++ + ++ K P +NS
Sbjct: 13 EQSSAQKQADWELDFYSRPILEADGKKRWELLITSTPTPTEPVCFRFEKRCPAGDVNSTW 72
Query: 145 ---TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLE 201
L+EA+ A ++ G P+++R +RS M+T++ +A EL ++ IPS+R +LL WLE
Sbjct: 73 LTSALREALTA-ANEQGWLQPKRLRTWRSAMRTMVQRAASELGLEMIPSRRTYALLDWLE 131
Query: 202 ERYETVYTRHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLES 260
ER +VY GF P A P+ LP+ + GD W + LP ++ E
Sbjct: 132 ERERSVYPLDEGFMAGPIAPPPAPIATPPLPLPEAVRGDAWCWAALPLGSLLE-----AG 186
Query: 261 KFVFGASLDLDLLGI--EVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSL 317
++ G + DLL I +D + +PGL + S +RA LA W+ GLE + + L
Sbjct: 187 EWPMGFN---DLLPIPEGMDPELPVPGLRLFSQTRALALAGWLGGLEPVRLRVSNQQ--L 241
Query: 318 ILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+L G ++ ++ + ++ + GL F+++Q DS+ GFW+L D
Sbjct: 242 VLDAGQDDSWLVSDLGQMEANQCREALMDSVSRG-RGLQFISVQTTPDSQRFDGFWMLRD 300
Query: 378 LP 379
P
Sbjct: 301 RP 302
>gi|148238987|ref|YP_001224374.1| hypothetical protein SynWH7803_0651 [Synechococcus sp. WH 7803]
gi|147847526|emb|CAK23077.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 304
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/293 (32%), Positives = 147/293 (50%), Gaps = 23/293 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSL-----SLQYTKYFPNNVINSITLKEAIVAI 153
+WELDF SRPIL+ GKK WEL++ ++ K P +NS L A+
Sbjct: 21 ADWELDFYSRPILEADGKKRWELLITSTPTPSAPDCFRFEKRCPAGDVNSTWLASALREA 80
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D G P ++R +RS M+T++ +A EL+++ IPS+R +LL WLEER +Y
Sbjct: 81 LDTAQAHGWMSPRRLRTWRSAMRTMVQRAASELELEMIPSRRTYALLDWLEERERDLYPL 140
Query: 211 HPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLD 269
G+ P A P+ LP+ + GD W + LP +++E S++ G +
Sbjct: 141 DKGYMAGPLAPPPAPIATPPLPLPEAVRGDAWCWAALPLGSLRE-----ASEWPMGFN-- 193
Query: 270 LDLLGI--EVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
DLL I +D + +PGL + S +RA LA W+ GLE + + + LIL G
Sbjct: 194 -DLLPIPEAMDPELPVPGLRLFSQTRALALAGWLGGLEPVRLRMNAQQ--LILDAGQDDS 250
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
++ ++ + +A E + GL F+++Q DS+ GFW+L D P
Sbjct: 251 WLVSDLGQTEAVECR-DALEDSVHRSRGLQFISVQATPDSQRFDGFWMLRDQP 302
>gi|90655491|gb|ABD96331.1| unknown [uncultured marine type-A Synechococcus GOM 3O6]
Length = 293
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 95/293 (32%), Positives = 154/293 (52%), Gaps = 24/293 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVV-----CDGS-LSLQYTKYFPNNVINSITLKEAI-- 150
+WELDF SRPIL+ G+K WEL+V D + + +++K P+ +NS+ L A+
Sbjct: 9 ADWELDFYSRPILEADGRKRWELLVTATPAADATEIPFRFSKCCPSGEVNSLWLTAALGE 68
Query: 151 VAICD-DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
C + G P P ++R +RS M+T++ +A ELD++ I S+R +LL WL++R + VY
Sbjct: 69 ARQCALEAGWPAPRRLRCWRSSMRTMVQRAATELDLEMIASRRTYALLEWLQQREQEVYP 128
Query: 210 RHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
+ GF P A P+ LP+ + GD W++ LP + + + S + F
Sbjct: 129 QEEGFMAGPLAPPPAPVATPPVPLPEEVQGDAWSWASLP-ADLLGDASDWPTSFS----- 182
Query: 269 DLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRY 327
L L +D +PGL + S SRA +A W+ GLE + + + L+L G R+
Sbjct: 183 GLLPLPAGLDSNQPVPGLRLFSNSRALAMAGWLGGLEPVRLLVEGRQ--LVLEAGQDDRW 240
Query: 328 IYANYKKNPVTTSEAEAWEAA--KKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ ++ +EA A E A K+ GL F+AIQ + + GFW++ D+
Sbjct: 241 LVSDLDS---AAAEAIAGELAQSKERGKGLQFIAIQASPEEQAFAGFWMMRDI 290
>gi|434392898|ref|YP_007127845.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
gi|428264739|gb|AFZ30685.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
Length = 279
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 138/277 (49%), Gaps = 9/277 (3%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL+VCD + ++++ P + +N+ L E + + D +
Sbjct: 7 WQADFYRRPLRDAAGQTLWELLVCDLTRTVEFVALCPQSQVNAHWLVEQLQHVADKM--- 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++IT A ++L I ++R +L WL+ER ++Y + +
Sbjct: 64 -PDTIQVFRPQSLSLITAAGEQLGITVEATRRTDALKQWLQER-SSLYRSMDNYTGEAYD 121
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
LL L+ P P LP+ L+G++W F L V+E + L L + +
Sbjct: 122 LLTLEKPPPTPLPEKLWGEQWRFAALSAKDVEEAFQERPIP-ILNMPPALMPLQLGLASN 180
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 338
IPG+ + R + LA W+ S+ A L+L G+ R+I A ++ V+
Sbjct: 181 IAIPGVIIYGGRQSMRLARWLQEANPVSLNYIAGAPDGLVLEAGLVDRWIVATFEDREVS 240
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
TS A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 241 TS-AQNYEQRKQQSKGLHFLLVQPDNSDITFSGFWLL 276
>gi|159903073|ref|YP_001550417.1| hypothetical protein P9211_05321 [Prochlorococcus marinus str. MIT
9211]
gi|159888249|gb|ABX08463.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 295
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 147/292 (50%), Gaps = 23/292 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLKEAIVAI 153
+WELDF SRP+++ GKK WEL++ G ++ K P N +NSI L +A+
Sbjct: 12 ADWELDFYSRPVIEADGKKRWELLISSTENLSGKEPFRWEKKCPANEVNSIWLSKALKEA 71
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D G P+ +R +R+ M+T+I KA + L ++ S+R SLL WL R + VY
Sbjct: 72 LKDAQSQGWGKPKIVRCWRAPMKTMIKKAAESLGLEVKESRRTYSLLDWLAHREKEVYPL 131
Query: 211 HPGFQKG---SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ G P L+ P P LP+ + GD +F L +++E + E F
Sbjct: 132 QSGYLNGPIAPPPARILNQPTP--LPEAIRGDALSFASLEVRSLRE---AREWPIEFQGL 186
Query: 268 LDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
L + +++ IPGL + S +RA L+AW++GLE + + + LIL G R
Sbjct: 187 LP---IAPSIEENISIPGLRLFSKNRAFALSAWLSGLEPVKLIVE--KNQLILEAGQEDR 241
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
++ + + ++ E +++ GL F++IQ + + GFW+L DL
Sbjct: 242 WLVTDMPQASADNAKKEL-SNSRENANGLQFISIQTSPNEQKFSGFWMLRDL 292
>gi|33866273|ref|NP_897832.1| hypothetical protein SYNW1741 [Synechococcus sp. WH 8102]
gi|33639248|emb|CAE08256.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 293
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 95/293 (32%), Positives = 153/293 (52%), Gaps = 24/293 (8%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVV-----CDGS-LSLQYTKYFPNNVINSITLKEAI-- 150
+WELDF SRPIL+ G+K WEL+V D + + +++K P+ +NS+ L A+
Sbjct: 9 ADWELDFYSRPILEADGRKRWELLVTATPAADATEIPFRFSKCCPSGEVNSLWLSAALGE 68
Query: 151 VAICD-DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
C + G P P ++R +RS M+T++ +A ELD++ I S+R +LL WL+ R + VY
Sbjct: 69 ARQCALEAGWPAPRRLRCWRSSMRTMVQRAATELDLEMIASRRTYALLEWLQHREQEVYP 128
Query: 210 RHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
+ GF P A P+ LP+ + GD W++ LP + + + S + F
Sbjct: 129 QEEGFMAGPLAPPPAPVATPPVPLPEEVQGDAWSWASLP-ADLLGDASDWPTSFS----- 182
Query: 269 DLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRY 327
L L +D +PGL + S SRA +A W+ GLE + + + L+L G R+
Sbjct: 183 GLLPLPAGLDSNQPVPGLRLFSNSRALAVAGWLGGLEPVRLLVEGRQ--LVLEAGQDDRW 240
Query: 328 IYANYKKNPVTTSEAEAWEAA--KKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ ++ +EA A E A K+ GL F+AIQ + + GFW++ D+
Sbjct: 241 LVSDLDS---AAAEAIAGELAQSKERGKGLQFIAIQTSPEEQAFAGFWMMRDI 290
>gi|87123919|ref|ZP_01079769.1| hypothetical protein RS9917_09926 [Synechococcus sp. RS9917]
gi|86168488|gb|EAQ69745.1| hypothetical protein RS9917_09926 [Synechococcus sp. RS9917]
Length = 304
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 148/297 (49%), Gaps = 27/297 (9%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLKEAI--- 150
+WELDF SRPIL+ GKK WEL++ G +Y + P +NS L A+
Sbjct: 21 ADWELDFYSRPILEADGKKRWELLITGSPDRSGRPPFRYERRCPAGEVNSTWLASALRDA 80
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ + G P+++R +RS M+T++ +A EL ++ PS+R +L+ WL +R VY
Sbjct: 81 LDLAQSEGWSPPQRLRCWRSAMRTMVQRAGTELGLEVRPSRRTYALIDWLAQREREVYPT 140
Query: 211 HPGFQKGSKPL---LALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
GF G PL A + LP+ + GD W++ LP ++++ + G
Sbjct: 141 EEGFMAG--PLAPSPAPTPTPALPLPEAVRGDAWSWASLPLGSLRD-----AEDWPLGFH 193
Query: 268 LDLDLLGI--EVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGIS 324
DLL I + +PGL + S SRA LA W+ GLE + + + L+L G
Sbjct: 194 ---DLLPIPNALAADQPVPGLRLFSRSRALALAGWLGGLEPVRLRVEGCQ--LVLDAGQD 248
Query: 325 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
++ + + ++ E +AA++ GGL F+A+Q ++ GFW+L D P P
Sbjct: 249 DAWLVTDLEPEAANITQREL-DAAREQIGGLQFIAVQTTPETPRFEGFWMLRDQPEP 304
>gi|260434334|ref|ZP_05788304.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260412208|gb|EEX05504.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 294
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/293 (31%), Positives = 151/293 (51%), Gaps = 24/293 (8%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIV-- 151
+WELDF SRPIL+ G+K WEL++ + ++ K P+ +NS+ L +A+
Sbjct: 11 DWELDFYSRPILEADGRKRWELLITSTPAATGDTEPFRFAKVCPSGDVNSLWLSQALAEA 70
Query: 152 ---AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
+ G P+ ++R +RS M+T++ +A E D++ IPS+R +LL WL++R VY
Sbjct: 71 KQASASGGWGSPV--RLRCWRSSMRTMVQRAAAEQDLEVIPSRRTFALLDWLQQREREVY 128
Query: 209 TRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
GF G P A P LP+ + GD W++ LP S + E + E F
Sbjct: 129 PEEEGFMAGPLAPPPAPVPTPPAPLPEEVQGDAWSWAALPASLLLE---ASEWPMSFSGL 185
Query: 268 LDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
L + +D + +PGL + S SR+ +A W+ GLE + + + L+L G R
Sbjct: 186 LPVP---DGIDPEASVPGLRLFSQSRSVAMAGWLGGLEPVRMIVEDRQ--LVLEAGQDDR 240
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
++ ++ + V +EA +++ GL F+AIQ + + GFW+L D+P
Sbjct: 241 WLVSDLEPG-VAAEISEALATSQQQVRGLQFIAIQSIPEEQTFGGFWMLRDIP 292
>gi|124025296|ref|YP_001014412.1| hypothetical protein NATL1_05851 [Prochlorococcus marinus str.
NATL1A]
gi|123960364|gb|ABM75147.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 295
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 150/292 (51%), Gaps = 23/292 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGS-----LSLQYTKYFPNNVINSITLKEAIVAI 153
T+WE+DF SRPI+D GKK WEL++ + + ++ K P + +NSI LK+A
Sbjct: 12 TDWEIDFYSRPIIDENGKKRWELLITSTNNFKDKKTFKWEKICPASSVNSIWLKDAFDEA 71
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D+ G P IR +RS M+T+I +A ++ I+ I S+R SLL WL ER + Y +
Sbjct: 72 IDEAYSQGWDKPSVIRCWRSSMKTMIKRAADQIGIELISSRRTYSLLEWLIERERSFYPQ 131
Query: 211 HPGFQKGSKPLLALDNPFPME---LPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ + L NP + LP+ + G+ W+F L + ++ E E +F
Sbjct: 132 QKGYTGVN--LAPPSNPITNQAIPLPEEVRGESWSFASLSLNTLR-EADEWEIEFS---- 184
Query: 268 LDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
+L + +++ IPG+ + S R+ LAAW+ GLE + + + +IL G + R
Sbjct: 185 -NLIPIKDSINENISIPGIRLFSPKRSLALAAWLGGLEPAKLLIEGTQ--IILEAGQADR 241
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
++ + ++ E ++ K GL F+++Q+ + GFW+L D+
Sbjct: 242 WLVTDVEEEAKKVIE-NNFQNTKLYADGLQFISVQKSPEENSLDGFWMLKDI 292
>gi|194477333|ref|YP_002049512.1| hypothetical protein PCC_0893 [Paulinella chromatophora]
gi|171192340|gb|ACB43302.1| hypothetical protein PCC_0893 [Paulinella chromatophora]
Length = 306
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 146/290 (50%), Gaps = 20/290 (6%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG-SLSLQY-TKYF------PNNVINSITLKEAI 150
++WELDF SR +D KK WEL++C S+S+ + YF P+ +NS+ LKEA+
Sbjct: 18 SDWELDFYSRSPIDTNDKKCWELIICSTPSISITGPSAYFRWEMPCPSESVNSLWLKEAL 77
Query: 151 VAICD---DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
D + G P ++R +RS M+ +I +A + I+ +PS+RC +L+ W+++R +
Sbjct: 78 GQAIDSALEQGFSSPRRLRSWRSSMRIMIQRAVESFGIEFVPSRRCYTLMEWIKDREIQI 137
Query: 208 YTRHPGFQKGSKPLLALDNPF-PMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGA 266
Y+ + + F + LP GD W++ LP + +Q E S+ E F
Sbjct: 138 YSSQKNMSTNIGVIPSTRTQFRAIPLPTAAQGDSWSWASLPMNILQ-EASNWE--ISFSG 194
Query: 267 SLDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGIST 325
L L + E + +IPG+ + S SR+ +A W+ GLE +E L+L G+
Sbjct: 195 LLPLPIFN-EKQKEIMIPGVRLLSLSRSLAIAGWIQGLEPVRLE--ICETQLVLEAGLED 251
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
R++ + + EA+ A+ G+ FLA+Q + + G W+L
Sbjct: 252 RWLLTDLPIEEALVAN-EAFTKARMNAFGVQFLAVQSDPNQRGFDGLWML 300
>gi|113953228|ref|YP_731197.1| hypothetical protein sync_1994 [Synechococcus sp. CC9311]
gi|113880579|gb|ABI45537.1| Uncharacterized protein [Synechococcus sp. CC9311]
Length = 304
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 147/297 (49%), Gaps = 29/297 (9%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDG-----SLSLQYTKYFPNNVINSITLKEAI---V 151
+WELDF SRPIL+ GKK WEL++ + S ++ K P +NS L A+ +
Sbjct: 22 DWELDFYSRPILEPDGKKRWELLIISSPSEGTTSSFRFEKRCPAGSVNSTWLTSALTEAI 81
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
A G P K+R +RS M+T++ +A EL ++ +PS+R +LL W+ ER + +Y
Sbjct: 82 AAAQQQGWSEPRKLRSWRSSMRTMVQRAASELGLEMVPSRRTYALLDWIAEREQDLYPNE 141
Query: 212 PGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
G+ G P AL + P LP+++ GD W + +LP SA++E A +
Sbjct: 142 EGYMAGPLAPPPALISTPPRPLPESVRGDAWNWAELPASALRE-----------AAGWPI 190
Query: 271 DLLG-----IEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGIS 324
G I + D +IPGL + S +R LA + G+E ++ + L+L G
Sbjct: 191 GFRGLLPVPITIKDDQVIPGLRLFSQTRGLALAGLLGGIEPVRLKVSGTQ--LLLEAGQD 248
Query: 325 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
++ ++ ++ + A + GL F+A+Q ++E GFW+L D P
Sbjct: 249 DCWLVSDLSSEEAKHV-SDLMKGASEHAEGLQFIAVQTSPEAERFEGFWMLRDQAEP 304
>gi|352094718|ref|ZP_08955889.1| protein of unknown function DUF1092 [Synechococcus sp. WH 8016]
gi|351681058|gb|EHA64190.1| protein of unknown function DUF1092 [Synechococcus sp. WH 8016]
Length = 303
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 151/299 (50%), Gaps = 33/299 (11%)
Query: 100 EWELDFCSRPILDIRGKKIWELVV----CDGSLS-LQYTKYFPNNVINSITLKEAI---V 151
+WELDF SRPIL+ GKK WEL++ C+G+ S ++ K P + +NS L A+ +
Sbjct: 21 DWELDFYSRPILEPDGKKRWELLIVSSPCEGTTSSFRFEKRCPASSVNSTWLTSALTEAM 80
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
A G +P K+R +RS M+T++ +A EL ++ +PS+R +L W+ ER + +Y +
Sbjct: 81 AAAQQQGWAVPRKLRSWRSSMRTMVQRAASELGLEMVPSRRTYALFDWIAEREQDLYPKE 140
Query: 212 PGFQKGSKPL---LALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
G+ G PL + P LP+++ GD W + +LP ++++E
Sbjct: 141 EGYMAG--PLAPPPVPVSTPPRPLPESVRGDAWNWAELPAASLRE-----------ATGW 187
Query: 269 DLDLLGI-----EVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVG 322
+ G+ ++D +IPGL + S +R LA + G+E + + L+L G
Sbjct: 188 PIGFRGLLPVPNTINDDQIIPGLRLFSQTRGLALAGLLGGIEPVRLRVSGTQ--LLLEAG 245
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 381
++ ++ A +AA+ A GL F+A+Q D+E GFW+L D P
Sbjct: 246 QDDCWLVSDLSSEEAVHVSALMTQAAEHA-DGLQFIAVQTSPDAERFEGFWMLRDQAEP 303
>gi|72383696|ref|YP_293051.1| hypothetical protein PMN2A_1860 [Prochlorococcus marinus str.
NATL2A]
gi|72003546|gb|AAZ59348.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 295
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 149/292 (51%), Gaps = 23/292 (7%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGS-----LSLQYTKYFPNNVINSITLKEAIVAI 153
T+WE+DF SRPI+D GKK WEL++ + + ++ K P + +NSI LK+A
Sbjct: 12 TDWEIDFYSRPIIDENGKKRWELLITSTNNFKDKKTFKWEKICPASSVNSIWLKDAFDEA 71
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D+ G P IR +RS M+T+I +A ++ I+ I S+R SLL WL ER + Y +
Sbjct: 72 IDEAYLQGWDKPSVIRCWRSSMKTMIKRAADQIGIELISSRRTYSLLEWLIERERSFYPQ 131
Query: 211 HPGFQKGSKPLLALDNPFPME---LPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
G+ + L NP + LP+ + G+ W+F L + ++ E E +F
Sbjct: 132 QKGYTGVN--LAPPSNPITNQAIPLPEEVRGESWSFASLSLNTLR-EADEWEIEFS---- 184
Query: 268 LDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
+L + +++ IPG+ + S R+ LAAW+ GLE + + + +IL G + R
Sbjct: 185 -NLIPIKDSINENISIPGIRLFSPKRSLALAAWLGGLEPAKLLIEGTQ--IILEAGQADR 241
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
++ + ++ E + K GL F+++Q+ + GFW+L D+
Sbjct: 242 WLVTDVEEEAKKVIE-NNFLNTKLYADGLQFISVQKSPEENSLHGFWMLKDI 292
>gi|78212273|ref|YP_381052.1| hypothetical protein Syncc9605_0725 [Synechococcus sp. CC9605]
gi|78196732|gb|ABB34497.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 294
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 148/293 (50%), Gaps = 24/293 (8%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIV-- 151
+WELDF SRPIL+ G+K WEL++ + ++ K P+ +NS+ L +A+
Sbjct: 11 DWELDFYSRPILEADGRKRWELLITSTPAASGDAEPFRFAKVCPSGDVNSLWLSQALAEA 70
Query: 152 ---AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
+ G P+ ++R +RS M+T++ +A E D++ IPS+R +LL WL++R VY
Sbjct: 71 KQASASGGWGSPV--RLRCWRSSMRTMVQRAAAEQDLEVIPSRRTFALLDWLQQREREVY 128
Query: 209 TRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGAS 267
GF G P A P+ LP+ + GD W++ LP S + E + E F
Sbjct: 129 PEEEGFMAGPLAPPPAPVPTPPVPLPEEVQGDAWSWAALPASLLLE---ASEWPMSFSGL 185
Query: 268 LDLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTR 326
L + +D + +PGL + S SR+ +A W+ GLE + + + L+L G R
Sbjct: 186 LPVP---DGIDPEASVPGLRLFSQSRSLAMAGWLGGLEPVRMIVEDRQ--LVLEAGQDDR 240
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
++ ++ + +++ GL F+AIQ + + GFW+L D+P
Sbjct: 241 WLVSDLEPGIAAEIAEAL-ATSQQQVRGLQFIAIQSSPEEQTFGGFWMLRDIP 292
>gi|113477160|ref|YP_723221.1| hypothetical protein Tery_3687 [Trichodesmium erythraeum IMS101]
gi|110168208|gb|ABG52748.1| protein of unknown function DUF1092 [Trichodesmium erythraeum
IMS101]
Length = 283
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 146/292 (50%), Gaps = 28/292 (9%)
Query: 96 ESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICD 155
++IT W++D+ RP+ D +G+K+WEL++C + SL++ P + + + L + +
Sbjct: 5 DTITIWQVDYYRRPLQDKQGQKLWELLICTPTRSLEFIAMCPQSEVKASWLVAQLQKMAQ 64
Query: 156 DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQ 215
G +P+ I+ FR Q +I A + L +K P++R +L WL ER + Y +
Sbjct: 65 GQG--LPDVIQVFRPQSLGLIEVAAQMLGLKIEPTRRTTALKEWLLERVQQ-YQDMEAYT 121
Query: 216 KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS----AVQEEVSSLES-KFVFGASLDL 270
L LD P P+ L +NL+GD+W F LP ++ + LE+ +F+ +L L
Sbjct: 122 GEFYEPLVLDVPPPVPLAENLWGDRWRFASLPAGNIGDIIERPIPVLEAPEFLLPLNLGL 181
Query: 271 DLLGIEVDDKTL-IPGLAVASSR-AKPLAAWMNG-----LEVCSIETDTARGSLILSVGI 323
TL IPG+ + R + LA W+ L+ S + D LIL G+
Sbjct: 182 --------SSTLPIPGVVIDGGRQSMKLARWLETTRPYLLKYISGDPD----GLILETGL 229
Query: 324 STRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
R++ A + V+ + A+A+E K+ GLHFL +Q + GFWLL
Sbjct: 230 VDRWVVATFADQEVSGA-AQAYEQRKQQSEGLHFLLVQPDDSGMTYSGFWLL 280
>gi|186681562|ref|YP_001864758.1| hypothetical protein Npun_F1089 [Nostoc punctiforme PCC 73102]
gi|186464014|gb|ACC79815.1| protein of unknown function DUF1092 [Nostoc punctiforme PCC 73102]
Length = 264
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 132/277 (47%), Gaps = 22/277 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF RP D G+ +WEL++CD + S +Y + NS + + G
Sbjct: 4 WQVDFYRRPSQDASGQILWELLICDATRSFEYEATCLQSAANSNWVAAQLELAA---GEK 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++I A + L I P++ L+L WL+E+ ++P
Sbjct: 61 LPDVIQVFRPQSLSLIEVAGRNLSINVEPTRHTLALKQWLQEK------QYPS------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
ALD P P LP+NL+G++W F L S V+ S + L + + +
Sbjct: 108 --ALDKPPPAPLPENLWGEQWRFATLAASDVETRFSDRPIP-ILHIPEHLKPINLGLAST 164
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 338
+PG+ + R + LA W+ ++ A L+L G+ R+I A + ++P
Sbjct: 165 VPVPGVVIYGGRQSMRLARWLQQARPVALNYISGAPDGLVLEAGLVDRWIVATF-EDPEV 223
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
T+ A+ ++ KK C GLHFL +Q + GFWLL
Sbjct: 224 TTAAQTYQQRKKHCRGLHFLLVQPDDSGMTYSGFWLL 260
>gi|90655437|gb|ABD96278.1| unknown [uncultured marine type-A Synechococcus GOM 3M9]
Length = 288
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 151/291 (51%), Gaps = 20/291 (6%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVC------DGSLSLQYTKYFPNNVINSITLKEAI-- 150
+WELDF SRPIL+ G+K WEL++ D ++ K P+ +NS+ L +A+
Sbjct: 4 ADWELDFYSRPILEPDGRKRWELLITSTPTLSDPIAPFRFIKCCPSGEVNSLWLTQALRE 63
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
A +D G P+++R +RS M+T++ +A EL ++ IPS+R +LL WL++R VY
Sbjct: 64 AGAAAEDAGWSAPQRLRCWRSSMRTMVQRAAAELSLEVIPSRRTYALLDWLQQRQREVYP 123
Query: 210 RHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL 268
GF G P A P+ LP+ + GD W + LP +QE ++ G S
Sbjct: 124 SLEGFMAGPLAPPPAPVPTPPVPLPEEVQGDAWTWAALPGGLLQE-----AGEWPMGFS- 177
Query: 269 DLDLLGIEVDDKTLIPGLAVAS-SRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRY 327
L L ++ + +PGL + S SRA +A W+ GLE + + + L+L G R+
Sbjct: 178 GLIPLPPDLSSEAPVPGLRLFSRSRALAMAGWLGGLEPVRLLVEERQ--LLLEAGQDDRW 235
Query: 328 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ ++ + E E+++ GL F+AIQ + + GFW++ D+
Sbjct: 236 LVSDLESGAADAIETALRESSEH-MHGLQFIAIQSSPEEQSFAGFWMMRDI 285
>gi|300869097|ref|ZP_07113697.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300332913|emb|CBN58893.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 281
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 138/277 (49%), Gaps = 7/277 (2%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF RP+ D G+K+WEL +CD + ++ + P + NS L E + + G
Sbjct: 4 WQVDFYRRPLKDDAGEKLWELSICDLDRNFTFSTFCPQSQANSGWLTEQLQQVSQ--GKN 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +I A + LD++ ++R +L LEER + Y + + +
Sbjct: 62 LPDLIQVFRPQSLGLIEAAAQVLDVEVEATRRTFALKRLLEERAKQ-YQKMANYTGEAYH 120
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
L L++P P+ LP+NL+GD+W F LP +++ S + L L + +
Sbjct: 121 PLMLESPPPVPLPENLWGDRWRFAALPAGDIEDAFKSRPIP-ILEMPELLLPLNLALAST 179
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 338
+PG+ + R + LA W+ + ++ + LIL G+ R+I A + ++P
Sbjct: 180 VSVPGVIIDGGRQSMRLARWLQAAKPVALNYIPGSPDGLILEAGLVDRWIVATF-EDPDV 238
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ E ++ ++ GLHFL +Q + GFWLL
Sbjct: 239 KAAGEIYQQRQQLSHGLHFLLVQPDDSGMTYTGFWLL 275
>gi|119511451|ref|ZP_01630562.1| hypothetical protein N9414_16559 [Nodularia spumigena CCY9414]
gi|119463916|gb|EAW44842.1| hypothetical protein N9414_16559 [Nodularia spumigena CCY9414]
Length = 265
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 135/280 (48%), Gaps = 22/280 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
I W++DF RP+ D G+ +WEL++CD + S +YT P + NS L I +D
Sbjct: 2 IKIWQVDFYRRPVQDKSGQILWELLICDATRSFEYTATCPQSAANSHWLATQIQLADND- 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
+P+ I+ FR Q ++I A LDI P++ L+L WLEE+ ++P
Sbjct: 61 --NLPDTIQVFRPQSLSLIQAAANNLDIDVEPTRYTLALKQWLEEK------QYP----- 107
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
LALD P P LP+NL+G++W F L + + + + V L + + +
Sbjct: 108 ----LALDKPPPTPLPENLWGEEWRFATLSAGELADVFAQRQIPIVSIPEF-LKPINLGL 162
Query: 278 DDKTLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKN 335
+PG+ + R + LA W+ + ++ LIL G+ R+I A ++
Sbjct: 163 ASTVPVPGVIIYGGRKSMYLARWLEQAQPFTLNYIAGEPNGLILEAGLVDRWIVATFEDA 222
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
V + A+ ++ ++ GLHFL +Q + GFWLL
Sbjct: 223 EVEAA-AKVYQQRQQQSQGLHFLLVQPDDSGMTYTGFWLL 261
>gi|242075630|ref|XP_002447751.1| hypothetical protein SORBIDRAFT_06g015032 [Sorghum bicolor]
gi|241938934|gb|EES12079.1| hypothetical protein SORBIDRAFT_06g015032 [Sorghum bicolor]
Length = 159
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 56/85 (65%), Positives = 66/85 (77%), Gaps = 2/85 (2%)
Query: 185 IKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFV 244
I+ + S R +SLLLWLE+RYE VY+RHP FQ G++PLLALDNPFP LP+NLFGDKWAFV
Sbjct: 71 IELLLSGRSVSLLLWLEKRYEVVYSRHPEFQAGTRPLLALDNPFPTTLPENLFGDKWAFV 130
Query: 245 QLPFSAVQEEVSSLESKFVFGASLD 269
QLPFSA EV SL + +GA L
Sbjct: 131 QLPFSAFWCEVESLGRR--YGAGLG 153
>gi|242040489|ref|XP_002467639.1| hypothetical protein SORBIDRAFT_01g031365 [Sorghum bicolor]
gi|242092100|ref|XP_002436540.1| hypothetical protein SORBIDRAFT_10g004402 [Sorghum bicolor]
gi|241914763|gb|EER87907.1| hypothetical protein SORBIDRAFT_10g004402 [Sorghum bicolor]
gi|241921493|gb|EER94637.1| hypothetical protein SORBIDRAFT_01g031365 [Sorghum bicolor]
Length = 136
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 55/91 (60%), Positives = 66/91 (72%), Gaps = 4/91 (4%)
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
P P RF + + +++ EL + S RC+SLLLWLEERYE VY+RHP FQ G++
Sbjct: 49 PHPHGRRFAYATYELLLSPDRIELLL----SGRCVSLLLWLEERYEVVYSRHPEFQAGTR 104
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSA 250
PLLALDNPFP LP+NLFGDKWAFVQLPFS
Sbjct: 105 PLLALDNPFPTTLPENLFGDKWAFVQLPFSG 135
>gi|428211452|ref|YP_007084596.1| hypothetical protein Oscil6304_0944 [Oscillatoria acuminata PCC
6304]
gi|427999833|gb|AFY80676.1| Protein of unknown function (DUF1092) [Oscillatoria acuminata PCC
6304]
Length = 277
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 134/277 (48%), Gaps = 8/277 (2%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ G+ +WEL +CD + + Q+++ + NS L E + + +
Sbjct: 4 WQADFYRRPLQSATGEPLWELCLCDPTGNFQWSRCCSQSEANSTWLAEQLQIVAEGR--- 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+PE I FR Q +++ A ++L +K PS+R +L WL E+ + Y P +
Sbjct: 61 LPEAIAVFRPQSLSLMVAAGEKLGVKIEPSRRTPALKSWLVEKAQE-YRNAPNYTCEPYE 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
L D P P LP+ L+GD+W F + +A EV + + + +L + + +
Sbjct: 120 PLVSDRPPPGPLPEALWGDRWRFASVS-AAYLMEVFAQRAIRIRHIPEELTPVALGLPST 178
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKKNPVT 338
+IPG+ + R + +A W+ +I + LIL G+ R+I A ++ V
Sbjct: 179 AVIPGVVLDGGRQSMKIAQWLQEASPVAINYNPGPPNGLILEAGLVDRWIMATFEDTEVA 238
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ + ++ K+A GLHFL IQ + GFWLL
Sbjct: 239 EA-GQTFQQRKQATQGLHFLLIQPDDSGMTYSGFWLL 274
>gi|428318463|ref|YP_007116345.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
gi|428242143|gb|AFZ07929.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
Length = 279
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 132/281 (46%), Gaps = 15/281 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D GK +WEL +CD S Q++ NS L +
Sbjct: 4 WQADFYRRPLQDETGKPLWELFICDSEGSFQFSAVCSQGAANSNWLASQLQQQAQTHN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +I A K L +K ++R +L L L++R + Y+ P + +
Sbjct: 62 LPDLIQVFRPQSLGLIEAAGKVLGVKVEATRRTPALKLLLQQRAKE-YSSMPNYTGETYS 120
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
+ALD+P P+ LP+NL+GD W F LP ++E + L L + +
Sbjct: 121 AIALDSPPPVPLPENLWGDGWRFASLPAGDIEEAFQGRPLPILEMPEFLLP-LNLGLAST 179
Query: 281 TLIPGLAVASSR-AKPLAAWMN-----GLEVCSIETDTARGSLILSVGISTRYIYANYKK 334
+PG+ + R + LA W+ L + E D LIL G+ R++ A ++
Sbjct: 180 VPVPGVVIDGGRQSMRLARWLQDAKPFALNYIAGEPD----GLILEAGLVDRWVVATFED 235
Query: 335 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ V + A+ ++ K+ GLHFL +Q + GFWLL
Sbjct: 236 SEVKAA-AQIYQQRKQLSKGLHFLLVQPDDSGMTYTGFWLL 275
>gi|427420079|ref|ZP_18910262.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
gi|425762792|gb|EKV03645.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
Length = 285
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 135/283 (47%), Gaps = 11/283 (3%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+LDF RP+ + + +WEL+VC ++ Y + P +++ L+ I G
Sbjct: 5 WQLDFYRRPLKNTDNQPLWELLVCTPNMDFSYGETCPQPEADAMWLRHQIKQAIHRAGY- 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ ++ FR Q QT+ AC+ELDI +R +L WL +R Y + +
Sbjct: 64 RPKVLQVFRPQTQTLTEVACRELDIPVETQRRLPTLKQWLRQR-NAWYPNLKTYTGEAYS 122
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV--D 278
A++ P+ LPDNL+G+ W F L ++ + + + S+ +LL +E+
Sbjct: 123 PFAIERSTPIPLPDNLWGETWRFAGL----SNADLLRFQYEAIPVRSIPKELLPLEIGLS 178
Query: 279 DKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNP 336
LIPG+ + R+ L W++ ++ ++ + LIL G+ R++ ++ +
Sbjct: 179 STVLIPGVVIDGGQRSMALTQWLDSVQPAFLKYIAGQPDGLILEAGLCDRFVLTTFEDSD 238
Query: 337 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
V + A A+E K GLHFL I+ + G WLL + P
Sbjct: 239 VRGA-ANAFEQRKVTSKGLHFLLIRPDDSGMTYSGLWLLQESP 280
>gi|427718052|ref|YP_007066046.1| hypothetical protein Cal7507_2795 [Calothrix sp. PCC 7507]
gi|427350488|gb|AFY33212.1| protein of unknown function DUF1092 [Calothrix sp. PCC 7507]
Length = 265
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 131/279 (46%), Gaps = 26/279 (9%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF D G+ +WEL++CD + S +YT P + NS L I G
Sbjct: 5 WQADFYRSSQRDTAGQVLWELLLCDATRSFEYTATCPQSAANSNWLTSQIELAA---GGK 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
PE I+ FR Q ++I A + L I P++R L++ WL+E+ ++P
Sbjct: 62 FPEVIQVFRPQSLSLIEAAGRNLGINVEPTRRTLAVKQWLKEK------QYP-------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
LALD P P LP+NL+G++W F L + + + + L + + +
Sbjct: 108 -LALDKPPPSPLPENLWGEQWRFATLQAGELVDVFAERPIPILHIPEF-LQPINLGLAST 165
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTARG---SLILSVGISTRYIYANYKKNP 336
+PG+ + R + LA W+N E +E + G L+L + R+I A + +
Sbjct: 166 VPVPGVVIYGGRQSMRLARWLN--EASPVELNYIAGEPDGLVLEAALVDRWIVATFADSE 223
Query: 337 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
VT + A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 224 VTAA-AKLYEQRKQQSLGLHFLLVQPDDSGMTYSGFWLL 261
>gi|37523856|ref|NP_927233.1| hypothetical protein glr4287 [Gloeobacter violaceus PCC 7421]
gi|35214862|dbj|BAC92228.1| glr4287 [Gloeobacter violaceus PCC 7421]
Length = 272
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 129/277 (46%), Gaps = 15/277 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF P++ G+ WEL+VC L ++ P + N + L+ + + G P
Sbjct: 4 WELDFYRCPLVGADGQVRWELLVCTAEGGLLRAQFCPADAANVVWLEAQLAELVASRGGP 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P ++R FR+ + AC+ L I S+R +++ ER E++Y + P ++ P
Sbjct: 64 -PLQMRAFRTAAFNLAGPACRRLGIPLRHSRRAIAVQRRRAEREESLYPQMPDYRP-LPP 121
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLL-GIEVDD 279
+ P +PD D+W F LP + E+ L + A L++ LL GI+
Sbjct: 122 GVPQQKAVPAPIPDARLPDRWGFSALPGA----ELGQLRQLPI--AYLEVPLLAGIDAP- 174
Query: 280 KTLIPGLAVASSRAKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVT 338
+PG+ + S R + LA+W+ E S++ A LIL G+ R+I A + +P
Sbjct: 175 ---VPGVFLFSRRDRDLASWLAAREPVSLQYTRAEIDGLILEAGLDERWILATF-DDPGM 230
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ GLHFLA+Q S GFWLL
Sbjct: 231 RERGRQFAERLAGSRGLHFLAVQPAEGSPQIAGFWLL 267
>gi|242085770|ref|XP_002443310.1| hypothetical protein SORBIDRAFT_08g017335 [Sorghum bicolor]
gi|241944003|gb|EES17148.1| hypothetical protein SORBIDRAFT_08g017335 [Sorghum bicolor]
Length = 136
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 52/91 (57%), Positives = 66/91 (72%), Gaps = 4/91 (4%)
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
P P RF + + +++ EL + S RC+SLLLWLEERYE VY+RHP FQ G++
Sbjct: 49 PHPHGRRFAYATYELLLSPDRIELLL----SGRCVSLLLWLEERYEVVYSRHPEFQAGTR 104
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSA 250
P+LALDNP+P LP+NLFGDKWA+VQLPFS
Sbjct: 105 PMLALDNPYPTTLPENLFGDKWAYVQLPFSG 135
>gi|119486760|ref|ZP_01620735.1| hypothetical protein L8106_10937 [Lyngbya sp. PCC 8106]
gi|119456053|gb|EAW37186.1| hypothetical protein L8106_10937 [Lyngbya sp. PCC 8106]
Length = 277
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 138/284 (48%), Gaps = 22/284 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD S +++Y + P + NS L + +
Sbjct: 4 WQADFYRRPLQDTTGQPLWELLICDQSRNIEYLAFCPQSHANSTWLTQQLQQATQTEK-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I FR Q ++I A L I+ P++R ++L WL++R + Y + G+
Sbjct: 62 -PDLIWVFRPQSLSLIQTAATALGIRVEPNRRTVTLKQWLQQRSQD-YPQLAGYTNEPYK 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAV-----QEEVSSLE-SKFVFGASLDLDLLG 274
+ LD P P+ +P+NL+GD W F LP + + LE F++ +L L
Sbjct: 120 PVELDKPPPVPIPENLWGDVWRFATLPAGDIVDGFRDRPIPILEMPDFLYPINLGL---- 175
Query: 275 IEVDDKTL-IPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYAN 331
TL +PG+ + R + L+ W+ + S+ + LIL G+ R++ A
Sbjct: 176 ----PSTLPVPGIVINGGRQSMQLSRWLAEKKPVSLHYIPGSPDGLILEAGLVDRWVLAT 231
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++ VT + A+ + K+ GLHFL +Q + GFWLL
Sbjct: 232 FEDAEVTEA-AKMFTERKQLTKGLHFLLVQPDDSGITYTGFWLL 274
>gi|440681085|ref|YP_007155880.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
gi|428678204|gb|AFZ56970.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
Length = 265
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 131/278 (47%), Gaps = 24/278 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ D G+ +WEL++CD + +Y P + NS L E +
Sbjct: 5 WQADFYRSPLRDAAGQILWELLICDATRKFEYVATCPQSQANSNWLTEQFQTAGAE---K 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRC-LSLLLWLEERYETVYTRHPGFQKGSK 219
+PE I+ FR Q +IT A L IK + + RC L+L WL+E+ ++P
Sbjct: 62 LPEIIQVFRPQSLGLITAAGNNLSIK-VEATRCTLALKQWLQEK------QYP------- 107
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDD 279
+A+D P P LP+NL+G++W F +P + +E + + L + + +
Sbjct: 108 --IAVDKPPPAPLPENLWGEEWRFATIPAGDIVDEFTERPIPILQIPEF-LKPINLGLAS 164
Query: 280 KTLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPV 337
+PG+ + R + LA W+ S+ A LIL G++ R+I A ++ V
Sbjct: 165 TVPVPGVVIYGGRQSMRLARWLQEANPVSLNYIAGAPDGLILEAGLADRWILATFEDEEV 224
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ A+ + K+ GLHFL IQ + GFWLL
Sbjct: 225 AAA-AKVYAQRKQVSKGLHFLLIQPDDSGMTYSGFWLL 261
>gi|17229810|ref|NP_486358.1| hypothetical protein all2318 [Nostoc sp. PCC 7120]
gi|17131410|dbj|BAB74017.1| all2318 [Nostoc sp. PCC 7120]
Length = 264
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 126/277 (45%), Gaps = 22/277 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P D+ GK +WEL++CD + +YT P + NS L I G
Sbjct: 5 WQADFYRSPRQDLDGKILWELLICDVNRGFEYTATCPQSEANSSWLTTQIQLAA---GEK 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++I A + L I P ++ +L WL+E+ ++
Sbjct: 62 LPDIIQVFRPQSLSLIEAAGRNLGINVEPQRQTPALKQWLQEKQYSI------------- 108
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
A+D P P LPDNL+GD+W F + + + S + L + + +
Sbjct: 109 --AIDKPPPTPLPDNLWGDEWRFASIQAGDIVDLFSDRPIP-ILSLPEPLKPINLGLAST 165
Query: 281 TLIPGLAV-ASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 338
IPG+ + R+ LA W+ ++ A LIL G+ R+I ++ VT
Sbjct: 166 VAIPGVVIYGGKRSLNLARWIAQTRPVALNYIAGAPDGLILEAGLVDRWILVTFEDAEVT 225
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ A+ +E +K GLHFL +Q + GFWLL
Sbjct: 226 AA-AKVYEQRQKQSRGLHFLLVQPDDSGMTYTGFWLL 261
>gi|428314314|ref|YP_007125291.1| hypothetical protein Mic7113_6296 [Microcoleus sp. PCC 7113]
gi|428255926|gb|AFZ21885.1| Protein of unknown function (DUF1092) [Microcoleus sp. PCC 7113]
Length = 278
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 133/277 (48%), Gaps = 8/277 (2%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD + Y + + +N+ L + + G
Sbjct: 4 WQADFYRRPLRDATGQVLWELLICDATRHFTYQAWCAQSEVNANWL---VAQLRQAAGDN 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q +++ A ++L I P++ +L WL++R Y + G+ +
Sbjct: 61 WPDVIQVFRPQSLSLMEAAAQQLGIAVEPTRGTTTLKQWLQQR-ALQYPKQEGYTAEAYN 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
+A+D P P+ LP+NL+GD+W F +P ++E + L L + +
Sbjct: 120 PIAIDKPPPLPLPENLWGDRWRFASIPAGNIEEAFGDRPIP-ILEMPESLLPLNLGLAST 178
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 338
+PG+ + R + LA W+ + S+ A LIL G+ R++ A ++ V
Sbjct: 179 VAVPGVIIDGGRKSMQLARWLQNVTPVSLNYIAGAPDGLILEAGLVDRWVVATFEDTEVA 238
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
T+ A +E + GLHFL +Q + GFWLL
Sbjct: 239 TA-ARMYEQRQSLSQGLHFLLVQPDDSGMTYTGFWLL 274
>gi|126695893|ref|YP_001090779.1| hypothetical protein P9301_05551 [Prochlorococcus marinus str. MIT
9301]
gi|126542936|gb|ABO17178.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 301
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 146/296 (49%), Gaps = 28/296 (9%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++C + K P N +NS+ L A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIICSTRSYKTEDVFLWNKKCPANEVNSVWLTRALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P +RF+RS M++II K+ + I+ I S+R +LL +E + +Y
Sbjct: 75 AISEAKKQGWEKPSIVRFWRSSMKSIIKKSLDAVSIEAIVSRRTYNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ +G +LA ++NP P LP+ + GD ++ ++ E S+
Sbjct: 135 KEKGYVRG---VLAPTFTSKMENP-PTPLPEAVRGDALTISEI---SIGELKSAQNWPME 187
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASS-RAKPLAAWMNGLEVCSIETDTARGSLILSVG 322
FG D+ + ++DD LIPGL + S R+ L+AW + LE I+ + LIL
Sbjct: 188 FG---DIFPIQQDIDDNYLIPGLRLFSKDRSLALSAWFSCLE--PIKLVVNKNQLILEAS 242
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+++ + + + E KK G F++IQ E GFW+L D+
Sbjct: 243 EDDKWLVTDLPEKDANILNTKFLE-NKKNSFGYQFISIQSTPFIEKFAGFWILRDI 297
>gi|427710618|ref|YP_007052995.1| hypothetical protein Nos7107_5360 [Nostoc sp. PCC 7107]
gi|427363123|gb|AFY45845.1| protein of unknown function DUF1092 [Nostoc sp. PCC 7107]
Length = 265
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 132/281 (46%), Gaps = 30/281 (10%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF D G+ +WEL++CD + S +YT P + NS + E I G
Sbjct: 4 WQADFYRSSQQDKSGQVLWELLICDVNRSFEYTAACPQSEANSSWVIEQIQQAA---GEK 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P I+ FR Q ++I A + L I ++R L+L WL+ER+ V
Sbjct: 61 LPNVIQVFRPQSLSLIETAGRNLGIVVEATRRTLALKQWLQERHSAV------------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSS----LESKFVFGASLDLDLLGIE 276
+L+ P P+ LP+NL+G++W L ++ E S + S F ++L L
Sbjct: 108 --SLEKPAPLPLPENLWGEQWRLATLAAGDLETEFSDRPIPILSMPEFLTPINLGL---- 161
Query: 277 VDDKTLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKK 334
+PG+ + R + LA W+ + ++ A LIL G+ R++ A ++
Sbjct: 162 -ASTIPVPGVVIYGGRQSMRLARWLATAKPVALNYIAGAPDGLILEAGLVDRWVLATFED 220
Query: 335 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
VTT+ A+ +E K+ GLHFL IQ + GFWLL
Sbjct: 221 AEVTTA-AKIYEQRKQQSRGLHFLLIQPDDSGMTYSGFWLL 260
>gi|434405323|ref|YP_007148208.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
gi|428259578|gb|AFZ25528.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
Length = 264
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 76/280 (27%), Positives = 133/280 (47%), Gaps = 22/280 (7%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W+ DF RP D + +WEL +CD + S ++ P + NS + + +
Sbjct: 1 MTIWQADFYKRPQKDATEQVLWELSICDQTRSFEFAATCPQSQANSTWVATQLQLAANK- 59
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
+P+ I+ FR Q +I A + L I P++R L+L WL+++ + P
Sbjct: 60 --KLPDVIQVFRPQSLNLIAAAGRTLGINVEPNRRTLALKQWLQQK------QFP----- 106
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
LA++ P P LP+NL+G++W F +LP + ++ + + L + + +
Sbjct: 107 ----LAVEKPPPAPLPENLWGEEWRFAKLPAGDI-ADIFTERPIPILQVPEFLKPINLGL 161
Query: 278 DDKTLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKN 335
+PG+ + R + LA W+ + ++ A LIL G+ R+I A + +
Sbjct: 162 ASTVSVPGVIIYGGRQSMRLARWLQEADPVALNYMSGAPDGLILEAGLQDRWIVATFDDS 221
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
VT + A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 222 EVTDA-AKVYEQRKQQSRGLHFLLVQPDDSGMTYTGFWLL 260
>gi|75906361|ref|YP_320657.1| hypothetical protein Ava_0136 [Anabaena variabilis ATCC 29413]
gi|75700086|gb|ABA19762.1| Protein of unknown function DUF1092 [Anabaena variabilis ATCC
29413]
Length = 264
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 125/277 (45%), Gaps = 22/277 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P D+ GK +WEL++CD + +YT P + NS L I G
Sbjct: 5 WQADFYRSPQQDLDGKILWELLICDVNRGFEYTATCPQSEANSSWLTSQIQLAA---GEK 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++I A + L I P ++ +L WL+E+ ++
Sbjct: 62 LPDIIQVFRPQSLSLIEAAGRNLGINVEPQRQTPALKQWLQEKQYSI------------- 108
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
A+D P P LPDNL+GD+W F + V + S + L + + +
Sbjct: 109 --AIDKPPPTPLPDNLWGDEWRFASIQAGDVVDLFSDRPIP-ILSLPEPLKPINLGLAST 165
Query: 281 TLIPGLAV-ASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 338
IPG+ + R+ LA W+ ++ A LIL G+ R+I ++ V
Sbjct: 166 VAIPGVVIYGGRRSLNLARWIAQTRPVALNYIAGAPDGLILEAGLVDRWILVTFEDAEVK 225
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ A+ +E +K GLHFL +Q + GFWLL
Sbjct: 226 AA-AKVYEQRQKQSRGLHFLLVQPDDSGMTYTGFWLL 261
>gi|123968120|ref|YP_001008978.1| hypothetical protein A9601_05851 [Prochlorococcus marinus str.
AS9601]
gi|123198230|gb|ABM69871.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
Length = 301
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 147/296 (49%), Gaps = 28/296 (9%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++C + K P N +NS+ L +A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIICSTRSYKTEDVFLWNKKCPANEVNSVWLTKALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P +RF+RS M++II ++ + + I+ I S+R +LL +E + +Y
Sbjct: 75 AISEAKKQGWEKPSIVRFWRSSMKSIIKRSLEAVSIEAIVSRRTFNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ +G +LA ++NP P LP+ + GD ++ ++ E S+
Sbjct: 135 KEKGYVRG---VLAPTFTSKMENP-PTPLPEAVRGDALTISEI---SIGELKSAENWPME 187
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASS-RAKPLAAWMNGLEVCSIETDTARGSLILSVG 322
FG D+ + V+D L+PGL + S R+ L+AW + LE I+ + LIL
Sbjct: 188 FG---DIFPIQQNVNDNYLVPGLRLFSKDRSLALSAWFSCLE--PIKLVVNKNQLILEAA 242
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+++ + + + E KK G F++IQ E GFW+L D+
Sbjct: 243 EDDKWLVTDLPEKDANILNTKFLE-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|334120429|ref|ZP_08494510.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
gi|333456776|gb|EGK85406.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
Length = 279
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 130/277 (46%), Gaps = 7/277 (2%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D GK +WEL++CD S Q++ NS L +
Sbjct: 4 WQADFYRRPLQDETGKPLWELLICDSEGSFQFSAVCRQGDANSNWLASQLQQQAQTQN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P I+ FR Q +I A K L +K ++R +L L L++R + Y P + +
Sbjct: 62 LPALIQVFRPQSLGLIEAAGKVLGVKVEATRRTGALKLLLQQRAKE-YLSMPNYTGETYS 120
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
+ALD+P P+ LP+NL+GD W F LP ++E + L L + +
Sbjct: 121 AIALDSPPPVPLPENLWGDGWRFASLPAGDIEEAFQGRPLP-ILEMPELLLPLNLGLAST 179
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVT 338
+PG+ + R + LA W+ + ++ LIL G+ R++ A ++ + V
Sbjct: 180 VPVPGVVIDGGRQSMRLARWLQDAKPFAVNYIAGEPDGLILEAGLVDRWVVATFEDSEVK 239
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ A+ ++ K+ GLHFL +Q + GFWLL
Sbjct: 240 AA-AQIYQQRKQLSKGLHFLLVQPDDSGMTYTGFWLL 275
>gi|428225981|ref|YP_007110078.1| hypothetical protein GEI7407_2551 [Geitlerinema sp. PCC 7407]
gi|427985882|gb|AFY67026.1| protein of unknown function DUF1092 [Geitlerinema sp. PCC 7407]
Length = 283
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 134/282 (47%), Gaps = 10/282 (3%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T WE DF RP+ + G+ +WEL++CD L + P + L + +
Sbjct: 1 MTIWEADFYRRPLRNAAGQPLWELLLCDQQRQLILSAMCPQPDATAAWLTGQLRSHFAA- 59
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
GV PE++R FR Q +++ AC+ L I ++R ++ L R + Y + P +
Sbjct: 60 GVTPPERLRVFRPQSLSLLQVACEPLGIAVEGTRRTPAIKAALLAR-ASAYAQMPEYSSE 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
+ L ++ P LP+ L+GD+W F + A + +S + V + +LL +++
Sbjct: 119 AYQPLYIEKAPPAPLPETLWGDRWRFGAM---AAGDLISVFRHRPVPILEMPTELLPVQL 175
Query: 278 D--DKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYK 333
T IPG+ + R+ +A W+ + S+ T LIL G+S R++ A
Sbjct: 176 GLASTTPIPGVILEGGRRSLQIARWLQAHQPVSLHYRTGDPDGLILEAGLSDRWVIAT-T 234
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+P + A +E ++A GLHFL I+ + + FWLL
Sbjct: 235 TDPDMAAAARTYEERQQASQGLHFLLIEPDDSGQTSTAFWLL 276
>gi|411116983|ref|ZP_11389470.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
gi|410713086|gb|EKQ70587.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
Length = 285
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 129/282 (45%), Gaps = 11/282 (3%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
++ WE+D RP+ D G +WELVVCD + +T IN+ + I + D
Sbjct: 1 MSVWEVDCYRRPLQDEAGNPLWELVVCDTEGAFTWTALCQQAQINADWVAAQIRDLVRDR 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
P+P+ I FR Q ++ C +L I P++ L +L+E T Y +PG+
Sbjct: 61 --PLPQIIHVFRPQTLHLLEPVCTQLGISIEPTRHTPYLKTYLQE-LATQYPNYPGYTGQ 117
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
LALD P+ L L G+ W F L A + + + + + LL + +
Sbjct: 118 LYDPLALDQSPPLPLDATLLGNHWQFATL---AAGDIADAFTGRMIPILEMPEFLLPLNL 174
Query: 278 DDKTL--IPGLAV-ASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYK 333
++ +PG+ + A R+ LA W+ ++ + L+L G+ R+I A +
Sbjct: 175 GLASMVPVPGVVIEAGRRSLRLAQWLKQTRPVALNYIPGSPNGLVLQAGLVDRWIIATF- 233
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++P + A +E K A GLHFL +Q + GFWLL
Sbjct: 234 EDPHVAASATEFEQRKIASRGLHFLLVQPDDSGMTYSGFWLL 275
>gi|157412945|ref|YP_001483811.1| hypothetical protein P9215_06101 [Prochlorococcus marinus str. MIT
9215]
gi|157387520|gb|ABV50225.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
Length = 301
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 147/296 (49%), Gaps = 28/296 (9%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++C + K P N +NS+ L +A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIICSTRSYKTEDVFLWNKKCPANEVNSVWLTKALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P +RF+RS M++II K+ + + I+ I S+R +LL +E + +Y
Sbjct: 75 AISEAKKQGWEKPSIVRFWRSSMKSIIKKSLEAVSIEAIVSRRTYNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ +G +LA ++N P LP+ + GD ++ ++ E S+
Sbjct: 135 KEKGYVRG---VLAPAFTSKIENS-PTPLPEAVRGDALTISEI---SIGELKSAENWPME 187
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASS-RAKPLAAWMNGLEVCSIETDTARGSLILSVG 322
FG D+ + ++DD L+PGL + S R+ L+AW + LE I+ LIL
Sbjct: 188 FG---DIFPIKQDLDDNYLVPGLRLFSKDRSLALSAWFSCLE--PIKLVVNENQLILEAS 242
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+++ + K ++ + KK G F++IQ E GFW+L D+
Sbjct: 243 EDDKWLVTDLPKKDANILNSKFLD-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|78778914|ref|YP_397026.1| hypothetical protein PMT9312_0529 [Prochlorococcus marinus str. MIT
9312]
gi|78712413|gb|ABB49590.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 301
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 148/300 (49%), Gaps = 21/300 (7%)
Query: 91 EETDPE-SITEWELDFCSRPILDIRGKKIWELVVC-----DGSLSLQYTKYFPNNVINSI 144
+ET PE I++WELDF SRPI++ GKK WEL++C + + K P + +NSI
Sbjct: 7 KETSPELKISDWELDFYSRPIIEANGKKRWELIICSTRSYETKDIFLWNKKCPASEVNSI 66
Query: 145 TLKEAIVAICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLE 201
L +A+ ++ G P +RF+RS M++II K+ + +I+ + S+R +L +E
Sbjct: 67 WLTKALNEALNEARKEGWAKPSIVRFWRSSMKSIIKKSLEATNIEALVSRRTYNLFDRIE 126
Query: 202 ERYETVYTRHPGFQKG--SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLE 259
+ +Y + G+ +G + + P LP+ + GD ++ +V E S+
Sbjct: 127 FLEKDIYPKEKGYVRGVLAPTFTSTMESSPTPLPEAVRGDALTISEI---SVGELKSAQN 183
Query: 260 SKFVFGASLDLDLLGIEVDDKTLIPGLAVASS-RAKPLAAWMNGLEVCSIETDTARGSLI 318
FG D+ + +D+ LIPGL + S R+ L+AW + LE I+ ++ LI
Sbjct: 184 WPIEFG---DIFPIHQPLDNNELIPGLRLFSKERSLALSAWFSSLE--PIKLIISKNQLI 238
Query: 319 LSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
L +++ + + + E KK G F++IQ E GFW+L D+
Sbjct: 239 LEASEDDKWLVTDLPEKDANILSTKFLE-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|254421948|ref|ZP_05035666.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196189437|gb|EDX84401.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 300
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 129/282 (45%), Gaps = 9/282 (3%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+LDF RP+ D +G +WEL++CD +LS Y ++ + N+ ++ + I D
Sbjct: 16 WQLDFYRRPLKDSQGNPLWELLICDETLSFTYGEFCIQSEANAPWIRHQL-EIASDRAGG 74
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P I FR Q +++ AC+ L +K + +L WL +R Y + K S
Sbjct: 75 WPNDIEIFRPQTVSLVEVACRNLPVKVRSRRDVPTLKRWLLQR-AAWYPTLKSYTKQSYE 133
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
+AL+ P P+ + ++L G+ W F + +Q S E V +L + + +
Sbjct: 134 PIALERPAPVPIAEHLMGEGWQFAAISTDELQR--LSYEPIPVQTVPAELMPIRLGLPST 191
Query: 281 TLIPGLAVASSRAK-PLAAWMNGLEVCSIE--TDTARGSLILSVGISTRYIYANYKKNPV 337
LIPG+ + R LA W+ + ++ T G L+L G+ R+I A ++ V
Sbjct: 192 LLIPGVVIDGGRQSLGLAQWLQSVNPVMLQYIAGTPDG-LLLEAGLVERWIMATFEDEAV 250
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
+ A + K A GLH L ++ + G WLL P
Sbjct: 251 AEA-ARTFTERKIAANGLHLLLVRPDDSGLTYTGLWLLQSTP 291
>gi|218438370|ref|YP_002376699.1| hypothetical protein PCC7424_1387 [Cyanothece sp. PCC 7424]
gi|218171098|gb|ACK69831.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7424]
Length = 271
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 131/284 (46%), Gaps = 24/284 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF R + D +G+ +WELV+ D ++ + P + NS L +
Sbjct: 4 WQGDFYKRSLFDQQGEMLWELVITDQQGTMIHEAKCPQSQANSDWLIRQLQQATQK---N 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
IP+ I+ FR Q ++T A ++L IK +P++R +L L+ R +
Sbjct: 61 IPDLIQVFRPQSIGLLTSAAEKLGIKVVPTRRTSALKEVLKRRSTNT----------TID 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVD-- 278
+ LD P P LP+NL+G++W F+ L + + + + + DLL I ++
Sbjct: 111 VSTLDRPPPQGLPENLWGEQWGFISLKAGDL---IQFFRDRPIPIVDMPEDLLPINLNLP 167
Query: 279 DKTLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTAR----GSLILSVGISTRYIYANYK 333
IPG+ + R + LA W+ + SI + G L+L G+ R+I A +
Sbjct: 168 STVFIPGIVIYGGRKSMYLARWLEEQQPVSISYIPTQIGLSGGLVLESGLVDRWILATF- 226
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
++P A+ +E K GLHFL +Q + GFWLL D
Sbjct: 227 EDPEMAQAAQKYEDRKVMSKGLHFLTVQPDDSGITYTGFWLLND 270
>gi|254526095|ref|ZP_05138147.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9202]
gi|221537519|gb|EEE39972.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9202]
Length = 301
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/296 (29%), Positives = 145/296 (48%), Gaps = 28/296 (9%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++ + K P N +NS+ L +A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIISSTRSYKTEDVFLWNKKCPANEVNSVWLTKALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P RF+RS M++II K+ + + I+ + S+R +LL +E + +Y
Sbjct: 75 ALSEAKKQGWEKPSIARFWRSSMKSIIKKSLEAVSIEAVVSRRTYNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
+ G+ +G +LA ++N P LP+ + GD ++ ++ E S+
Sbjct: 135 KEKGYVRG---VLAPTFTSKMENS-PTPLPEAVRGDALTISEI---SIGELKSAENWPME 187
Query: 264 FGASLDLDLLGIEVDDKTLIPGLAVASS-RAKPLAAWMNGLEVCSIETDTARGSLILSVG 322
FG D+ + ++DDK L+PGL + S R+ LAAW + LE I+ LIL
Sbjct: 188 FG---DIFPIQQDLDDKNLVPGLRLFSKDRSLALAAWFSCLE--PIKLVVNENQLILEAS 242
Query: 323 ISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+++ + K + E KK G F++IQ E GFW+L D+
Sbjct: 243 EDDKWLVTDLPKKDANILNTKFLE-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|427734622|ref|YP_007054166.1| hypothetical protein Riv7116_1045 [Rivularia sp. PCC 7116]
gi|427369663|gb|AFY53619.1| Protein of unknown function (DUF1092) [Rivularia sp. PCC 7116]
Length = 262
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 134/281 (47%), Gaps = 24/281 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF R + G+ +W+L +CD +L L+Y P + NS + I D
Sbjct: 3 WQIDFYRRSQPEKSGQVLWDLSICDSTLELKYEATCPQSEANSSWVVSQIQQAASD---S 59
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ ++ FR Q ++I +A K L IK ++R ++L WL+++ +
Sbjct: 60 LPDVMQVFRPQSLSLIEQAGKILGIKVEATRRTIALKTWLKQKQQ--------------- 104
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL-LGIEVDD 279
ALD P P+ L +N++GDKW+F L + + S E + DL L + + +
Sbjct: 105 FTALDKPPPVPLSENIWGDKWSFATLRAGDIGDFFS--ERPIPILETPDLLLPINMGLAS 162
Query: 280 KTLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPV 337
+PG+ + R + LA W+ ++ A L+L G+ R+I A ++ V
Sbjct: 163 TVPVPGVVIYGGRKSMLLARWLKENRPVALNYIAGAPDGLVLEAGLVDRWIVATFEDEEV 222
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ + A ++ K+ GLHFL +Q + GFWLL ++
Sbjct: 223 SQA-AALYQQRKQQSQGLHFLLVQPDDSGMTYTGFWLLQEV 262
>gi|33861086|ref|NP_892647.1| hypothetical protein PMM0529 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33639818|emb|CAE18988.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 301
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 147/301 (48%), Gaps = 38/301 (12%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYF------PNNVINSITLKEAIV 151
I++WELDF SRPI++ GKK WEL++ S S + K F P N +NSI L +A+
Sbjct: 15 ISDWELDFYSRPIIETNGKKRWELIIS-SSKSFKTEKIFLWNKVCPANEVNSIWLTKALN 73
Query: 152 AICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
+D G P KIRF+R+ M++II K+ + + I+ + S+R L +E +Y
Sbjct: 74 EALNDAEIEGWAKPLKIRFWRASMKSIIKKSIENIGIEALVSRRTYELFDRIEFLEREIY 133
Query: 209 TRHPGFQKGSKPLLA-------LDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESK 261
G+ +G +LA L++P P LP+ + GD + E+S E K
Sbjct: 134 PLEQGYVRG---VLAPTFTSNILNDPKP--LPEAVRGD---------ALTISEISIEELK 179
Query: 262 FVFGASLDL-DLLGIE--VDDKTLIPGLAVASS-RAKPLAAWMNGLEVCSIETDTARGSL 317
++ D+ I+ + + L+PGL + S R+ LAAW + LE ++ + L
Sbjct: 180 LAKNWPIEFGDIFPIQSSIKNDNLVPGLRLFSKDRSLALAAWFSSLE--PVKLLIKQNQL 237
Query: 318 ILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
IL +++ + ++ + + +KK G F++IQ E GFW+L D
Sbjct: 238 ILEASEDDKWLVTDLQEKDAKVLN-DKFTQSKKDSYGYQFISIQATPFIEKFAGFWILKD 296
Query: 378 L 378
+
Sbjct: 297 V 297
>gi|307151401|ref|YP_003886785.1| hypothetical protein Cyan7822_1516 [Cyanothece sp. PCC 7822]
gi|306981629|gb|ADN13510.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7822]
Length = 278
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 134/282 (47%), Gaps = 24/282 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF R ++ G+ +WEL++ D + Y + P ++ NS L + +
Sbjct: 4 WQADFYKRQQMNQAGEILWELLITDSLGKIIYERQCPQSMANSDWLLVQLQQATEQFS-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++T ++L + + ++R +L L++R P
Sbjct: 62 -PDVIQVFRPQSLALLTSCAEKLGLTVVATRRTWALKKVLQQRAAAT----------KDP 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVD-D 279
LD P P LP NL+G++W F + A + + + + + ++ +LL I +
Sbjct: 111 QDILDKPPPQPLPANLWGEEWRFAHV---AAGDLIEFFKDRPIPLLNIPEELLPINLGLA 167
Query: 280 KTL-IPGLAVASSR-AKPLAAWMNG---LEVCSIETDTAR-GSLILSVGISTRYIYANYK 333
TL IPG+ + R + LA W+N + + I T+ + G L+L G+ R+I A ++
Sbjct: 168 STLPIPGMVIYGGRTSMYLARWLNQENPVAINYISTEVGKSGGLVLESGLVNRWILATFE 227
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+P AE +E K+ C GLHFL IQ + GFWLL
Sbjct: 228 -DPEVVVAAEKYEQRKQLCRGLHFLTIQPDSSGMTYSGFWLL 268
>gi|434388752|ref|YP_007099363.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
gi|428019742|gb|AFY95836.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
Length = 273
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 137/284 (48%), Gaps = 20/284 (7%)
Query: 97 SITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAI-VAICD 155
+I W+ D SRP + RG+ +WELV+C +T P +N+ + I +A D
Sbjct: 2 TIMLWQADISSRPQQNDRGETLWELVICAADGGWFHTAICPQKQVNAEWIAAQIKLAATD 61
Query: 156 DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQ 215
L P I+ FR Q +I A ++L I+ ++R ++L L+++ + + +P +Q
Sbjct: 62 KL----PTAIQVFRPQSLGLIQTAAQKLGIEVEATRRTIALKKLLQQQTQNYH--NPNYQ 115
Query: 216 KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLL-- 273
LA+++P P +PD L G+KW FV L + V+ + + S+ LL
Sbjct: 116 P-----LAIESPPPQPIPDYLMGEKWQFVTL---TAGQLVADFADRPIPIVSMPDYLLPP 167
Query: 274 GIEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYAN 331
+ IPG+ + ++++ LA W+ E S+ G L+L VG++ R++
Sbjct: 168 HWGLGANVAIPGVIIYGATQSMRLARWIADTEPVSLNYLGDDPGGLVLDVGLADRWVMVT 227
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ V+ + A +EA K+ GLHFL + + G WLL
Sbjct: 228 FNDAEVSQA-ARLYEARKRLVHGLHFLLVTPDDSGITYSGIWLL 270
>gi|428306984|ref|YP_007143809.1| hypothetical protein Cri9333_3474 [Crinalium epipsammum PCC 9333]
gi|428248519|gb|AFZ14299.1| protein of unknown function DUF1092 [Crinalium epipsammum PCC 9333]
Length = 277
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 129/283 (45%), Gaps = 22/283 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF RP+ + +G+ WELV+CD + S Y + N + + +
Sbjct: 4 WQVDFYRRPLKNQQGEVWWELVICDLTRSFTYEVQCRQSEANVTWIVSQLQEAAGN-AKH 62
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +I A ++L+IK ++ +L L+++ E T +
Sbjct: 63 LPDIIQVFRPQSFNLIQLAGQQLNIKVEATRHTYALKELLQDKAEYYSTNGDNYNP---- 118
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSS-----LES-KFVFGASLDLDLLG 274
LALD P P LP+NL G++W F LP + E + LE +F+ +L L
Sbjct: 119 -LALDKPPPTPLPENLLGEQWRFATLPAGDLVEAFAERPIPVLEMPEFLLPINLGL---- 173
Query: 275 IEVDDKTLIPGLAVASSRAK-PLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANY 332
+PG+ + R LA W+ + S+ L+L G+ R++ A +
Sbjct: 174 ---ASTVAVPGVIIYGGRQSLRLARWLEEAKPVSLHFIIGEPAGLVLEAGLVDRWVVATF 230
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ V S A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 231 EDQEVVKS-AQTYEQRKQQSKGLHFLLVQPDDSGVTYSGFWLL 272
>gi|427731600|ref|YP_007077837.1| hypothetical protein Nos7524_4487 [Nostoc sp. PCC 7524]
gi|427367519|gb|AFY50240.1| Protein of unknown function (DUF1092) [Nostoc sp. PCC 7524]
Length = 268
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 129/279 (46%), Gaps = 26/279 (9%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P D G+ +WEL++C+ + S +Y + NS L I G
Sbjct: 5 WQADFYRSPQQDAAGQALWELLICNVNRSFEYVATCFQSEANSSWLTAQIQQAA---GEN 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +++ A + L I P +R +L WL+E+ ++P
Sbjct: 62 LPDVIQVFRPQSLSLMEVAGRNLGITVEPQRRTSALKQWLQEK------KYP-------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASL--DLDLLGIEVD 278
+A+D P P LPDNL+G++W F + A + V + + S+ L + + +
Sbjct: 108 -IAIDKPPPAPLPDNLWGEEWRFATI---AAGDLVDLFSDRPIPMLSVPESLQPINLGLA 163
Query: 279 DKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNP 336
+PG+ + R+ LA W+ S+ A L+L G++ R+I ++
Sbjct: 164 STIAVPGVIIYGGRRSLRLAQWIQQTRPVSLNYIAGAPDGLVLEAGLADRWIVVTFEDAE 223
Query: 337 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
V + A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 224 VAAA-AKVYEQRKQQSRGLHFLIVQPDDSGMTYSGFWLL 261
>gi|428206227|ref|YP_007090580.1| hypothetical protein Chro_1184 [Chroococcidiopsis thermalis PCC
7203]
gi|428008148|gb|AFY86711.1| protein of unknown function DUF1092 [Chroococcidiopsis thermalis
PCC 7203]
Length = 327
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 137/336 (40%), Gaps = 76/336 (22%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCD--GSL---------------SLQYTKYFPNNVINS 143
W+ DF RP D G+ +WEL++CD G + + +Y P N+
Sbjct: 8 WQADFYRRPWQDDTGQVLWELLICDAEGGMLFDHATQTRSDHRTGNFRYEAICPQAAANA 67
Query: 144 ITLKEAIV-------------------------------AICDDLGVPIPEKIRFFRSQM 172
L E + ++ + +P+ I+ FR Q
Sbjct: 68 SWLVEQLQLAASNSSEFFSTTPKSISPSPPYQGGLGGSESVTGQTELALPDIIQVFRPQS 127
Query: 173 QTIITKACKELDIKPIPSKRCLSLLLWLEER---YETVYTRHPGFQKGSKPLLALDNPFP 229
++I A ++L I P++R +L WL R Y T +P LA+D P P
Sbjct: 128 LSLIATAGQKLGITVEPTRRTGALKQWLRSRIPQYSTTGAYNP---------LAVDKPPP 178
Query: 230 MELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL------LGIEVDDKTLI 283
+ LP+NL+GD+W F LP LE+ F LD+ L + + +
Sbjct: 179 VPLPENLWGDRWRFASLP-------ARDLEAAFKDRPLPILDMPEFLLPLNLGLASTIAV 231
Query: 284 PGLAVASSR-AKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTSE 341
PG+ + R + LA W+ + ++ LIL G++ R++ A + + S
Sbjct: 232 PGIIIYGGRKSMQLARWLQAAQPIALNYVPGELAGLILEAGLADRWVVATFSDSEAIAS- 290
Query: 342 AEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
A+ + ++ GLHFL +Q + S GFWLL D
Sbjct: 291 AQTYAQRQQQSQGLHFLLVQPDDSSVTYTGFWLLRD 326
>gi|354569034|ref|ZP_08988193.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
gi|353539038|gb|EHC08534.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
Length = 264
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 123/281 (43%), Gaps = 24/281 (8%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+ W+ DF G+ +WEL++CD + S Q+ P + +NS V +
Sbjct: 1 MVTWQADFYHHRRQQAAGRVLWELLICDRNRSFQFEASCPQSEVNS---NWVAVQLQLAG 57
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEER-YETVYTRHPGFQK 216
G +P+ I+ FR Q +I +A + L I P++R +L WL+E+ Y TV
Sbjct: 58 GGNLPDVIQVFRPQCLGLIEQAGRSLGINVEPTRRTFALKQWLQEKQYPTV--------- 108
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIE 276
+D P P LP+NL+G++W F L V E + + L + +
Sbjct: 109 -------VDKPPPAPLPENLWGEEWRFATLSAGKVVEVFTEQPIPILVMPEF-LQPINLG 160
Query: 277 VDDKTLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKK 334
+ +PG+ + R + LA W+ ++ A L+L G+ R+I +
Sbjct: 161 LASMVSVPGVVIYGGRQSMRLARWLQEARPAALNYVAGAPDGLVLEAGLVDRWILVTF-T 219
Query: 335 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+P + +E K+ GLHFL +Q + GFWLL
Sbjct: 220 DPEVVAAGRVYEQRKQESRGLHFLLVQPDDSGMTFSGFWLL 260
>gi|123965828|ref|YP_001010909.1| hypothetical protein P9515_05931 [Prochlorococcus marinus str. MIT
9515]
gi|123200194|gb|ABM71802.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 301
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 143/297 (48%), Gaps = 32/297 (10%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAIVAI 153
++WELDF SRPI++ GKK WEL++ + K P N +NSI L +++
Sbjct: 16 SDWELDFYSRPIIEKNGKKRWELIISSSKTFKTEDIFLWNKICPANEVNSIWLTKSLNEA 75
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+D G P KIRF+R+ M++II K+ + + I+ + S+R L +E + VY
Sbjct: 76 LNDAERKGWEKPSKIRFWRASMKSIIKKSIENIGIEALVSRRTYELFDRIEFLEKEVYPL 135
Query: 211 HPGFQKGSKPLLA-------LDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV 263
G+ +G +LA ++P P LP+ + GD ++ EE+ S E+ +
Sbjct: 136 ENGYVRG---VLAPTFTSRIANDPTP--LPEAVRGDALTISEISI----EELKSAENWPI 186
Query: 264 -FGASLDLDLLGIEVDDKTLIPGLAVASS-RAKPLAAWMNGLEVCSIETDTARGSLILSV 321
FG D+ + + ++ L+PGL + S R+ LAAW + LE + + + LIL
Sbjct: 187 EFG---DIFPIKKSLKNENLVPGLRLFSKERSLALAAWFSSLEPVKLHIE--KNQLILEA 241
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+++ + + V + K G F++IQ E GFW+L D+
Sbjct: 242 SEDNKWLVTDLSEK-VAKELNNKFTQNKNDSFGYQFISIQSTPFIEKFAGFWILRDI 297
>gi|298492811|ref|YP_003722988.1| hypothetical protein Aazo_4636 ['Nostoc azollae' 0708]
gi|298234729|gb|ADI65865.1| protein of unknown function DUF1092 ['Nostoc azollae' 0708]
Length = 265
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 133/284 (46%), Gaps = 36/284 (12%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAI-VAICDDLGV 159
W+ DF P+ D G+ +WEL++CD + L+Y P + NS L E +A + L
Sbjct: 5 WQTDFYRSPLRDSAGQVLWELLICDPTRKLEYVATCPQSQANSNWLTEQFQLAGAEKL-- 62
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
P+ I+ FR Q ++I+ A L I P++ L+L WL+E+ ++P
Sbjct: 63 --PDIIQVFRPQSLSLISAAASNLGINIEPTRSTLALKQWLQEK------KYP------- 107
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV----FGASLDLDLLGI 275
+ +D P L +NL+G++W F + + +E + + F ++L L
Sbjct: 108 --ILIDKLPPEPLLENLWGEEWRFANISAGDIVDEFTDRPIPILQIPEFVQPINLGLA-- 163
Query: 276 EVDDKTLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTARGS---LILSVGISTRYIYAN 331
IPG+ + R + LA W+ E ++ + G+ LIL G++ R+I A
Sbjct: 164 ---STVRIPGVVIYGGRQSMRLAKWLQ--EANAVSLNYIAGTPDGLILDAGLADRWILAT 218
Query: 332 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ + V + A+ + K+ GLHFL +Q + GFWLL
Sbjct: 219 FDDDEVAAA-AKVYTQRKQVSKGLHFLLVQPDDSRMTYSGFWLL 261
>gi|359459254|ref|ZP_09247817.1| hypothetical protein ACCM5_11029 [Acaryochloris sp. CCMEE 5410]
Length = 281
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 134/286 (46%), Gaps = 12/286 (4%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W++DF RP+ + +WEL V D + + P +S L + + +
Sbjct: 1 MTIWQVDFDRRPLKNTEDYPLWELTVYDPQTQMACHRLCPEPNASSEWLMAELQELFTLM 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G P P + + FR + T + + L+I +++ L L+ R + Y + P +
Sbjct: 61 GPP-PTQFQVFRPRSLTFLEDVGRTLNIAVEATRQTPGLKRVLQVRTQ-AYAQLPEYTGQ 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL---LG 274
S LA++ P +P++L+GD+W FV L +A + E L+ ++ L LG
Sbjct: 119 SYDPLAIEPLPPQPMPEHLWGDQWQFVTL--AASELESVLLQRPIPLRTVPEMLLPSQLG 176
Query: 275 IEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANY 332
+ D T IPG+ + R+ LA W+ + SI+ A S LI++ G++ RY+ Y
Sbjct: 177 VAAD--TRIPGVLINGGRRSMQLAQWLQKQQPASIQAMRAELSGLIMAAGLNERYVLVTY 234
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ S A+ +E K+ GLHFL +Q + G WLL L
Sbjct: 235 DDADI-VSAAQGFEQGKQGSQGLHFLLVQPDDSGVTYTGLWLLSSL 279
>gi|428300978|ref|YP_007139284.1| hypothetical protein Cal6303_4407 [Calothrix sp. PCC 6303]
gi|428237522|gb|AFZ03312.1| protein of unknown function DUF1092 [Calothrix sp. PCC 6303]
Length = 259
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/261 (27%), Positives = 122/261 (46%), Gaps = 24/261 (9%)
Query: 118 IWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIIT 177
IW L +CD + +Y P + NS L ++ +P+KI+ FR Q +++
Sbjct: 19 IWNLSICDANGDFRYKASCPQSEANSTWLTSQFKLAGNE---RLPDKIQVFRPQSLSLVE 75
Query: 178 KACKELDIKPIPSKRCLSLLLWLE-ERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNL 236
A L+I ++R +L LWL+ E+Y T + P PM LP+ L
Sbjct: 76 LAASHLNISVEATRRTDALKLWLQAEKYATTVEKLP----------------PMPLPEKL 119
Query: 237 FGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSR-AKP 295
+G+KW F P + +E S + L + + + T IPG+ + R +
Sbjct: 120 WGEKWQFATFPAGGIVDEFSDRLIP-ILDIPDYLQPINLGIASTTAIPGVIIYGGRQSMQ 178
Query: 296 LAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGG 354
+A W+ ++ S+ A LIL G++ R++ A ++ + VT + A+ +++ ++ G
Sbjct: 179 IARWLKQVQPVSLNYIAGAPDGLILEAGLADRWVIATFEDSEVTIA-AKNYQSRQQQSHG 237
Query: 355 LHFLAIQEELDSEDCVGFWLL 375
LHFL IQ + GFWLL
Sbjct: 238 LHFLLIQPDDSGMTYSGFWLL 258
>gi|254415147|ref|ZP_05028909.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196177953|gb|EDX72955.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 278
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 139/279 (49%), Gaps = 9/279 (3%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPN-NVINSITLKEAIVAICDDLGV 159
W+ DF RP+ D G+ +WEL++CD + ++ Y + P +V + + V++
Sbjct: 4 WQADFYRRPLQDETGQILWELLICDTTGNVIYQSFCPQPDVTRDWLVSQVQVSVAK---T 60
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
+P+ I+ FR Q + + ++L IK ++R +L L+ER Y +H + +
Sbjct: 61 GLPDAIQVFRPQSFNLFQEVGQQLGIKVEATRRTPALKQRLQER-TLEYPQHENYTGEAY 119
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDD 279
L+LD P P+ LP+NL+GD+W F +P ++E + + +L L + +
Sbjct: 120 NPLSLDKPPPLPLPENLWGDRWRFASIPAGDIEEGFAQRPIP-ILQMPNELLPLQLGLAS 178
Query: 280 KTLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPV 337
+PG+ + R + PLA W+ ++ ++ A LIL G+ R++ A ++ V
Sbjct: 179 TVAVPGVVIDGGRQSMPLARWLQEVQPVALNYIPGAPDGLILEAGLVERWVMATFEDKEV 238
Query: 338 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLL 376
+ A +E K+ GLHFL +Q + GFWLL+
Sbjct: 239 AAA-ARLYEQRKQTSQGLHFLLVQPDDSGMTYTGFWLLM 276
>gi|409989581|ref|ZP_11273128.1| hypothetical protein APPUASWS_02193 [Arthrospira platensis str.
Paraca]
gi|291570627|dbj|BAI92899.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409939557|gb|EKN80674.1| hypothetical protein APPUASWS_02193 [Arthrospira platensis str.
Paraca]
Length = 277
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 126/281 (44%), Gaps = 16/281 (5%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI----TLKEAIVAICDD 156
W+ DF RP+ D RG+ +WEL+VCD P + NS LKE V
Sbjct: 4 WQADFYRRPLEDERGQPLWELLVCDQLGDRLLVATCPQSEANSTWLLNQLKEMFVT---- 59
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
P+ I+ FR ++ K+L + ++R L L L E +Y + G+
Sbjct: 60 ---DQPDIIQVFRPACLSLFEVVGKQLGVTVQATRRTLGLKKLLAEMM-LIYPQMTGYTG 115
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIE 276
+ LA+D P+ LP+NL+GD+W F LP +QE + S+ L L +
Sbjct: 116 QNYDPLAIDKLPPLPLPENLWGDRWRFATLPAGDLQEVFGDRPIPILDMPSILLP-LNLG 174
Query: 277 VDDKTLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKK 334
+ I G+ + R + LA W+ ++ + LIL G+S R++ A +
Sbjct: 175 LASTVAISGVVIDGGRQSMGLARWLQSVKPVGFNYIPGQPDGLILEAGLSDRWVVATFDD 234
Query: 335 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ V + A +E K+ GLHFL +Q + GFWLL
Sbjct: 235 DDVAQA-ARMFETRKRLAKGLHFLLVQPDDSGVTYTGFWLL 274
>gi|209524029|ref|ZP_03272580.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
gi|209495404|gb|EDZ95708.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
Length = 277
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 124/277 (44%), Gaps = 8/277 (2%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD P + NS L + + I D
Sbjct: 4 WQADFYRRPLRDDSGQPLWELLLCDEFGDRLLVATCPQSEANSTWLLKQLEEIWD---TD 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR + K+L + ++R L L L E +Y + PG+
Sbjct: 61 QPDLIQVFRPACLNLFEVVGKQLGVTVQGTRRTLGLKKLLAEMM-LIYPQMPGYTGEDYD 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
LA+D P+ LP+NL+G +W F LP +QE + S L L + +
Sbjct: 120 PLAIDKLPPLPLPENLWGTRWRFATLPAGDLQEVFGDRPIPILDMPSFLLP-LNLGLAST 178
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVT 338
I G+ + R + LA W+ ++ + + LIL G+S R++ A + + V
Sbjct: 179 VAISGVVIDGGRQSMRLARWLQSVKPVGLNYIPGQPDGLILEAGLSDRWVVATFDDDDVA 238
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ A +E K+ GLHFL IQ + GFWLL
Sbjct: 239 QA-ARMFETRKRLAKGLHFLLIQPDDSGVTYTGFWLL 274
>gi|376004228|ref|ZP_09781975.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423065962|ref|ZP_17054752.1| hypothetical protein SPLC1_S370220 [Arthrospira platensis C1]
gi|375327434|emb|CCE17728.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406712461|gb|EKD07646.1| hypothetical protein SPLC1_S370220 [Arthrospira platensis C1]
Length = 277
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 124/277 (44%), Gaps = 8/277 (2%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD P + NS L + + I D
Sbjct: 4 WQADFYRRPLRDDSGQPLWELLLCDELGDRLLVATCPQSEANSTWLLKQLEEIWD---TD 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR + K+L + ++R L L L E +Y + PG+
Sbjct: 61 QPDLIQVFRPACLNLFEVVGKQLGVTVQGTRRTLGLKKLLAEMM-LIYPQMPGYTGEDYD 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
LA+D P+ LP+NL+G +W F LP +QE + S L L + +
Sbjct: 120 PLAIDKLPPLPLPENLWGTRWRFATLPAGDLQEVFGDRPIPILDMPSFLLP-LNLGLAST 178
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVT 338
I G+ + R + LA W+ ++ + + LIL G+S R++ A + + V
Sbjct: 179 VAISGVVIDGGRQSMRLARWLQSVKPVGLNYIPGQPDGLILEAGLSDRWVVATFDDDDVA 238
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ A +E K+ GLHFL IQ + GFWLL
Sbjct: 239 QA-ARMFETRKRLAKGLHFLLIQPDDSGVTYTGFWLL 274
>gi|443315479|ref|ZP_21044967.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
gi|442784905|gb|ELR94757.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
Length = 278
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 127/283 (44%), Gaps = 12/283 (4%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T WE+DF RP D +G +WEL++CD + Y L+ +
Sbjct: 1 MTRWEVDFYRRPCEDGQGTPLWELLICDRAFDFTYGAMVSQPEATVDWLQGQLKTAIAKA 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G+P P++I FR ++ A L I IP+++ +L WL R Y P +
Sbjct: 61 GIP-PDEICAFRPPAVALLQAAAPPLGIAVIPTRQTPTLKQWLVTR-SRWYPTLPTYSGA 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
LA+D P P+ +P++L+G++W F L + QEE L + + SL LD L +++
Sbjct: 119 PYDPLAVDRPAPVPVPESLWGEQWRFGALSAADFQEE---LTQEPIPIQSLPLDWLPLQM 175
Query: 278 DDKTL--IPGLAVASSRAKPLAAWMNGLEVCSIETDTARG---SLILSVGISTRYIYANY 332
+ IPG+ + R + LA + + G LIL G+ R++ +
Sbjct: 176 GLASTIPIPGVIIDGGR-RALALAQWLAAQDPVALNPMVGNPAGLILEAGLCDRWVLTTF 234
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++P + A + + GLHFL ++ + G WLL
Sbjct: 235 -EDPQVQAAARTFGERQLQAQGLHFLLVRPDDSGITYTGLWLL 276
>gi|443312305|ref|ZP_21041923.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
gi|442777543|gb|ELR87818.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
Length = 272
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 123/279 (44%), Gaps = 17/279 (6%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ + G+ +WEL++CD Y P + NS L E + +
Sbjct: 4 WQADFYRRPLQNEAGEVLWELLICDRDRLFTYEALCPQSQANSKWLIEQLQIAAKNQK-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q +I A + L I ++R +L WL ER ++P
Sbjct: 62 -PDLIQVFRPQSLNLIQLAAENLGIAVEATRRTFALKQWLTER------QYPSNNGEPYN 114
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL--LGIEVD 278
LA+D P L +NL+G++W F L + V S + + + + L L + +
Sbjct: 115 PLAIDKAPPTPLTENLWGEQWRFASLSAGDI---VESFKERLIPIKEMPEFLLPLNLGLA 171
Query: 279 DKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 336
IPG+ + ++ LA W+ + ++ S L+L G+S R++ +
Sbjct: 172 STITIPGVVIDGGKKSMQLARWLQSIHPVALNYIAGDPSGLVLEAGLSERWVVNTFTDKE 231
Query: 337 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
V + A + ++ GLHFL +Q + GFWLL
Sbjct: 232 VIAA-AVTYTQRQQLTKGLHFLLVQPDNSGMTYSGFWLL 269
>gi|427711582|ref|YP_007060206.1| hypothetical protein Syn6312_0434 [Synechococcus sp. PCC 6312]
gi|427375711|gb|AFY59663.1| Protein of unknown function (DUF1092) [Synechococcus sp. PCC 6312]
Length = 281
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 133/289 (46%), Gaps = 24/289 (8%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCD--GSLSLQYTKYFPNNVINSITLKEAIVAICD 155
+T W++DF +RP+ + +G+ +WEL++ D G + Q ++ + + + IC
Sbjct: 1 MTLWQVDFSARPLTNPQGQTLWELLIVDPLGQILHQAQCSQAQARLDWLIRQ---LEICI 57
Query: 156 DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLS---LLLWLEERYETV--YTR 210
PE+I+ FR Q ++ A EL++ P++ + LL E Y T YT
Sbjct: 58 QRTGSCPERIQLFRPQCLSLFEVAANELNLMVEPTRHTPALKRLLAAQAEHYPTAANYTG 117
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDL 270
P +PL P P+ LPD L+G+ W F L +E + L ++ + SL +
Sbjct: 118 EP-----YQPLHITSLP-PVPLPDYLWGEGWQFTGL---MAEELETHLITQPIPILSLRM 168
Query: 271 DLL--GIEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTAR-GSLILSVGISTR 326
DLL + + +IPG+ + R+ LA W +E + LI+S G+ R
Sbjct: 169 DLLPSQLGLAASVVIPGIIIYGGRRSMALARWCQEQNPAEVEFIAGQPDGLIMSAGLWER 228
Query: 327 YIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++ + +P A+ + + A GLHFL IQ + G WLL
Sbjct: 229 WVLVTF-DDPQVKQSAQGFMTRRAAAQGLHFLMIQPDESGVTYTGLWLL 276
>gi|414078911|ref|YP_006998229.1| hypothetical protein ANA_C13764 [Anabaena sp. 90]
gi|413972327|gb|AFW96416.1| hypothetical protein ANA_C13764 [Anabaena sp. 90]
Length = 265
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 130/279 (46%), Gaps = 22/279 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ ++ + +WEL+VCD + S ++T P + NS + + + +
Sbjct: 5 WQADFYRIPLQNVEEQILWELLVCDPTRSFEFTASCPQSQANSTWVAQQLQLAGQE---K 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++IT A L I ++R L+L WL + ++P
Sbjct: 62 LPDVIQVFRPQSLSLITTAGNNLGIYVEATRRTLALKQWLTAK------QYP-------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
+ +D P+ LP+NL+G++W F +P + +E + F+ L + + +
Sbjct: 108 -VIVDKLPPLPLPENLWGEEWRFATIPSGDIVDEFTERPIPFLQIPDF-LKPINLGLAST 165
Query: 281 TLIPGLAVASSR-AKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 338
IPG+ + R + LA W+ S+ A L+L G+ R++ A + VT
Sbjct: 166 VPIPGVVIYGGRKSMRLAQWLKESNPVSLNYIGGAPDGLVLEAGLLDRWVLATFTDEEVT 225
Query: 339 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+ + ++ K+ GLHFL +Q + G WLL D
Sbjct: 226 AA-GKLYQERKQLSQGLHFLLVQPDDSGMTYSGLWLLQD 263
>gi|158336954|ref|YP_001518129.1| hypothetical protein AM1_3825 [Acaryochloris marina MBIC11017]
gi|158307195|gb|ABW28812.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 281
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 137/286 (47%), Gaps = 12/286 (4%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W++DF RP+ + +WEL V D + + P ++ L + + +
Sbjct: 1 MTIWQVDFDRRPLKNTEDYPLWELTVYDPQTQMACHRLCPEPNVSPDWLIAELKELFTLM 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G P P + + FR + T + + ++LDI +++ L L L+ R + Y + P +
Sbjct: 61 GPP-PTQFQVFRPRSLTFMEEVRQKLDISVEATRQTLGLKRVLQVRTQA-YAQLPEYTGQ 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL---LG 274
S LA++ P +P++L+GD+W FV L +A + E L+ ++ L LG
Sbjct: 119 SYDPLAIEPLPPQPMPEHLWGDQWQFVTL--AASELESVLLQRPIPLRTVPEMLLPSQLG 176
Query: 275 IEVDDKTLIPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANY 332
+ D T +PG+ + R+ LA W+ + SI+ A S LI++ G++ RY+ Y
Sbjct: 177 LAAD--TRLPGVLINGGRRSMQLAQWLQQQQPASIQAMRAELSGLIMAAGLNERYVLVTY 234
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ + A+ +E K+ GLHFL +Q + G WLL L
Sbjct: 235 DDADIVPA-AQGFEQGKQGSQGLHFLLVQPDDSGVTYTGLWLLSSL 279
>gi|428775356|ref|YP_007167143.1| hypothetical protein PCC7418_0709 [Halothece sp. PCC 7418]
gi|428689635|gb|AFZ42929.1| protein of unknown function DUF1092 [Halothece sp. PCC 7418]
Length = 273
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/282 (25%), Positives = 120/282 (42%), Gaps = 23/282 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF P + + +WELVVCD ++ T + T+ I +
Sbjct: 8 WQVDFYRLPQANASQESVWELVVCD---EVEKTVKTQSCFQAEATVDWLITHLRAIAQGS 64
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
PEKI+ FR + ++ A +L+I +E T + R +G +
Sbjct: 65 FPEKIKVFRPESLQLLQLAGDKLEIS-------------VEGTRHTPFLRQVLRDRGGEE 111
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
+ +++P P LP+ ++G++W F L ++ + F +L + +
Sbjct: 112 RVKVESPPPQPLPEEIWGEQWQFASLNAEEIEYRLPERPIPFR-EIPPELSPFQLNLGST 170
Query: 281 TLIPGLAVASSRAK-PLAAWMNGLEVCSIE----TDTARGSLILSVGISTRYIYANYKKN 335
TLIPG+ + R LA W E +IE G L+L G+ R++ ++ +
Sbjct: 171 TLIPGIIIYGGRQSWQLAQWFAETEPMAIEYIPTAVGESGGLVLEAGLRDRWVIITFE-D 229
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
P AE ++ K+ GLHFL IQ + GFWLL D
Sbjct: 230 PEVAKAAEKFQQRKQNSNGLHFLLIQPDNSGMTDTGFWLLAD 271
>gi|422302945|ref|ZP_16390303.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389792167|emb|CCI12098.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 265
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSSPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D+P P +PD G +W F + P F + + SL F
Sbjct: 107 ---------NIDSPPPQPIPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFY--- 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L + + +IPG+ + ++ +A W+ N + + I T+T R G L+L
Sbjct: 155 -----PLKLGLASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A A++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVAPA-ANAYQATKEESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|22299400|ref|NP_682647.1| hypothetical protein tll1857 [Thermosynechococcus elongatus BP-1]
gi|22295583|dbj|BAC09409.1| tll1857 [Thermosynechococcus elongatus BP-1]
Length = 276
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 131/280 (46%), Gaps = 10/280 (3%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
++ W++D RP+ G +WELV+CD YT + P +++S + +
Sbjct: 1 MSRWQVDLYRRPLRTPSGLDLWELVICDPEDHFYYTTFCPEPLVSSAWVATEF----NSC 56
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G P+PE+++ FR Q ++ AC++L+I P++R +L +L +R + Y +
Sbjct: 57 GQPLPERVQVFRPQSLGLVEGACQQLNIPLEPTRRTAALKHYLCQRAQE-YPSLKTYTGE 115
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
+ LA++ P P+ LPD+++G+ W F + +Q+ + + + LG+
Sbjct: 116 AYDPLAIEQPPPLPLPDDIWGESWQFAAIAPPDLQQLMQYPLRILALEMEMLPESLGLAA 175
Query: 278 DDKTLIPGLAVASSRAK-PLAAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKN 335
D TLIPG+ + R LA W +E + ++L G+ R+++ ++ +
Sbjct: 176 D--TLIPGIILYGGRKSLKLARWFQEQVPYRLEFVPGQPCGVLLHSGLRDRWVFLTFQDS 233
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ + + + + GLHFL IQ WLL
Sbjct: 234 EIAQA-GDVFRDRLQKSQGLHFLLIQPTPRDTTYTALWLL 272
>gi|428780442|ref|YP_007172228.1| hypothetical protein Dacsa_2255 [Dactylococcopsis salina PCC 8305]
gi|428694721|gb|AFZ50871.1| Protein of unknown function (DUF1092) [Dactylococcopsis salina PCC
8305]
Length = 273
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 126/285 (44%), Gaps = 23/285 (8%)
Query: 96 ESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICD 155
+S + W++DF P +G+ WELV+CD S T+ L E++ +
Sbjct: 3 QSQSSWQVDFYRLPQPTTKGESQWELVICDQSTKEVKTRSCLQKEATVDWLVESLQGLAT 62
Query: 156 DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQ 215
+ +P K+R FR + ++ A + L + ++ L L +R
Sbjct: 63 E---ELPLKMRVFRPESLQLLQLAGERLGVIVEGTRHTYLLKQVLRDR------------ 107
Query: 216 KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGI 275
G + + +++P P LP+ ++G++W F +L ++ + F + +L +
Sbjct: 108 -GGEERIKVESPPPQPLPEFIWGEQWQFARLNADEIEYRMPERPIPFCEMPT-ELTPFQL 165
Query: 276 EVDDKTLIPGLAVASSR-AKPLAAWMNGLEVCSIE----TDTARGSLILSVGISTRYIYA 330
+ TL+PG+ + R ++ LA W + ++ T G L+L G+ R++
Sbjct: 166 NLGSTTLVPGIIIYGGRQSRQLAQWFMEAQPMAVNYMPTTVGESGGLVLEAGLRDRWVII 225
Query: 331 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++ V T+ E +E K+ GLHFL +Q + GFWLL
Sbjct: 226 TFEDTEVATA-GEKYEQRKQESNGLHFLLLQPDDSGMTDTGFWLL 269
>gi|425463363|ref|ZP_18842702.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389833791|emb|CCI21409.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 265
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDPLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D P P +PD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDYPPPQPVPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IPG+ + ++ +A W+ N + + I T+T R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A A++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVARA-ANAYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425440676|ref|ZP_18820974.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9717]
gi|389718833|emb|CCH97263.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9717]
Length = 265
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D+P P LPD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IPG+ + ++ +A W+ N + + I T+ R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPGVVITGGKKSLAIARWLGEINPVFIDHIPTERGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVALA-ANVYQATKQESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425434023|ref|ZP_18814495.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389678222|emb|CCH92899.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 265
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 127/294 (43%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D+P P LPD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDSPPPQPLPDRFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IPG+ + ++ +A W+ N + + I T+T R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVALA-ANVYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|440753361|ref|ZP_20932564.1| hypothetical protein O53_1739 [Microcystis aeruginosa TAIHU98]
gi|440177854|gb|ELP57127.1| hypothetical protein O53_1739 [Microcystis aeruginosa TAIHU98]
Length = 265
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D+P P LPD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDSPPPQPLPDRFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IPG+ + ++ +A W+ N + + I T+T R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVALA-ANVYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|390440582|ref|ZP_10228809.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis sp. T1-4]
gi|389836112|emb|CCI32935.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis sp. T1-4]
Length = 265
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDSLGHLIYENSCPQSQANSDWLTQQLRQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIRLTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D+P P LPD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IPG+ + ++ +A W+ N + + I T+T R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPGVVITGGKKSLAIARWLGEINPVFIDHIPTETGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A ++A K+ G HFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVARA-ANVYQATKEESQGWHFLLIQPDDSGRTFTGFWLL 262
>gi|425458730|ref|ZP_18838218.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9808]
gi|389824876|emb|CCI25820.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9808]
Length = 265
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D+P P LPD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IPG+ + ++ +A W+ N + + I T+ R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G+S R+I+ Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLSERWIFLTYEDEEVALA-ANIYQATKEESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|443663863|ref|ZP_21133251.1| hypothetical protein C789_3791 [Microcystis aeruginosa DIANCHI905]
gi|159028218|emb|CAO88028.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331745|gb|ELS46389.1| hypothetical protein C789_3791 [Microcystis aeruginosa DIANCHI905]
Length = 265
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 123/290 (42%), Gaps = 40/290 (13%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W+ DF + +W+L++ D L Y P + NS L + + C
Sbjct: 1 MTIWQADFY-KSSSSPSLSTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQACQ-- 57
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 58 -VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI---------- 106
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGASLDL 270
+D+P P LPD G +W F + P F + + SL F + L L
Sbjct: 107 -----NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---SPLKL 158
Query: 271 DLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSVGIST 325
L +IPG+ + ++ +A W+ N + + I T+ R G L+L G++
Sbjct: 159 GL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLNE 213
Query: 326 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
R+I+ Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 214 RWIFLTYEDEEVALA-ANVYQATKQESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425470238|ref|ZP_18849108.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9701]
gi|389884213|emb|CCI35473.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9701]
Length = 265
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 116/270 (42%), Gaps = 39/270 (14%)
Query: 118 IWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIIT 177
+W+L++ D L Y P + NS L + + C V PE I+ FR Q +
Sbjct: 20 VWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLQQACQ---VSSPEIIQVFRPQCANLFL 76
Query: 178 KACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLF 237
A + L IK ++ +L LE R + +D+P P LPD
Sbjct: 77 LAGQNLQIKIELTRHVNALKKQLELRQIPI---------------NIDSPPPQPLPDQFL 121
Query: 238 GDKWAFVQLP-------FSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAV-A 289
G +W F + P F + + SL F L + + +IPG+ +
Sbjct: 122 GQEWRFARFPAVDLVNFFCDRRIPILSLPEAFY--------PLKLGLASTLMIPGVVITG 173
Query: 290 SSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTSEAEAW 345
++ +A W+ N + + I T+ R G L+L G++ R+I+ Y+ V + A A+
Sbjct: 174 GKKSLAIARWLGEINPVFIDHIPTERGRSGGLVLESGLNERWIFLTYEDEEVARA-ANAY 232
Query: 346 EAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+A K+ GLHFL IQ + GFWLL
Sbjct: 233 QATKQESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|254413499|ref|ZP_05027269.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179606|gb|EDX74600.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 153
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 79/151 (52%), Gaps = 7/151 (4%)
Query: 229 PMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAV 288
P LPD + G KWA V L E + FG + L ++ + D T IPG+ +
Sbjct: 8 PQPLPDAIQGQKWALVSL---EAAAFAEMEEWEIDFGEAFPLSMMNLAPD--TRIPGVII 62
Query: 289 ASSRAKPLAAWMNGLEVCSIETDTARG-SLILSVGISTRYIYANYKKNPVTTSEAEAWEA 347
S RAK LAAWM+GLE+ ++ L+L G S + AN + T +EA+ +E+
Sbjct: 63 FSDRAKALAAWMSGLELAFVKFQGGVTPRLLLETGASDSWALANLT-DAQTLAEAQGFES 121
Query: 348 AKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
AK+ +HFLA+Q SE GFWLL +L
Sbjct: 122 AKENAQSIHFLAVQSTPTSETFAGFWLLQEL 152
>gi|425451962|ref|ZP_18831781.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 7941]
gi|389766454|emb|CCI07907.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 7941]
Length = 265
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 126/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D+P P LPD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IPG+ + ++ +A W+ N + + I T+ R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVALA-ANIYQATKEESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425444579|ref|ZP_18824626.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9443]
gi|389735645|emb|CCI00880.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9443]
Length = 267
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 122/291 (41%), Gaps = 40/291 (13%)
Query: 98 ITEWELDFCSRPILDIRG-KKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
+T W+ DF +W+L++ D L Y P + NS L + + C
Sbjct: 1 MTIWQADFYKSSSSSSPSLGTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQACQ- 59
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 60 --VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI--------- 108
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGASLD 269
+D+P P LPD G +W F + P F + + SL F + L
Sbjct: 109 ------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---SPLK 159
Query: 270 LDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSVGIS 324
L L +IPG+ + ++ +A W+ N + + I T+ R G L+L G++
Sbjct: 160 LGL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLN 214
Query: 325 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
R+I+ Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 215 ERWIFLTYEDEEVALA-ANVYQATKQESQGLHFLLIQPDDSGRTFTGFWLL 264
>gi|166364113|ref|YP_001656386.1| hypothetical protein MAE_13720 [Microcystis aeruginosa NIES-843]
gi|166086486|dbj|BAG01194.1| hypothetical protein MAE_13720 [Microcystis aeruginosa NIES-843]
Length = 265
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 125/294 (42%), Gaps = 48/294 (16%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDPLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGA 266
+D P P +PD G +W F + P F + + SL F +
Sbjct: 107 ---------NIDYPPPQPVPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---S 154
Query: 267 SLDLDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSV 321
L L L +IP + + ++ +A W+ N + + I T+T R G L+L
Sbjct: 155 PLKLGL-----ASTLMIPSVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLES 209
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G++ R+I+ Y+ V + A A++A K+ GLHFL IQ + GFWLL
Sbjct: 210 GLNERWIFLTYEDEEVARA-ANAYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|428204653|ref|YP_007083242.1| hypothetical protein Ple7327_4590 [Pleurocapsa sp. PCC 7327]
gi|427982085|gb|AFY79685.1| Protein of unknown function (DUF1092) [Pleurocapsa sp. PCC 7327]
Length = 271
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 125/282 (44%), Gaps = 20/282 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF + GK +WEL++CD + P + N L I +
Sbjct: 4 WQADFYKHDRKNKEGKHLWELLICDPQGHIIQEAKCPQSQANPDWL---ISQLQQANRGN 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P++I+ FR Q ++++ A ++L I+ ++R +L L +R + P
Sbjct: 61 LPDRIQVFRLQSLSLLSIAAEKLGIQVEATRRTGALKAELRKRI---------IDENYDP 111
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
+ L+ P P LP+NL+G+ W F + + S + L + + +
Sbjct: 112 V-KLEKPPPQALPENLWGESWRFATFRAGDLVDYFSD-RPLPILHMPESLLPINLGIAST 169
Query: 281 TLIPGLAVASSR-AKPLAAWMNG---LEVCSIETDTAR-GSLILSVGISTRYIYANYKKN 335
+PG+ + R + LA W+ + I T+ + G L+L G+ R+I A ++
Sbjct: 170 ISVPGVIIYGGRKSMYLAKWLQEAKPFSLSYIPTEIGKSGGLVLESGLVDRWILATFEDE 229
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 377
+ + A+ +E K+A GLHFL +Q + GFWLL D
Sbjct: 230 EIAQA-AQNYEQRKQASLGLHFLLVQPDDSGMTYTGFWLLKD 270
>gi|443324165|ref|ZP_21053109.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
gi|442796049|gb|ELS05375.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
Length = 271
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 128/283 (45%), Gaps = 18/283 (6%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W+ DF P ++ + W+LV+C L + +N+ L + +
Sbjct: 1 MTIWQSDFYHYPKIEPQ----WQLVICSSDGKLIHETNCSAAQVNAKWLTKQLQQAAQG- 55
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
+P KI+ FR Q+ + A +EL I+ ++R +L L+ Y + ++
Sbjct: 56 --KLPTKIQVFRPQIVGLFEIATQELGIELETTRRTNALKEKLQ-NYSPINSKDKSKNNN 112
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEV 277
S ++ P P +P++L+G+ W F+ + + + F L+ + + +
Sbjct: 113 S---FDVEKPPPQGVPEDLWGENWNFISMSANDLINFTGDRPIPIKFAPEF-LNPIKLGI 168
Query: 278 DDKTLIPGLAVASSR-AKPLAAWMNG---LEVCSIETDTAR-GSLILSVGISTRYIYANY 332
LIPG+ V R + LA W++ + + I T+ + G L+L G+ R+I+A +
Sbjct: 169 ASDALIPGIVVYGGRKSMVLARWLDQQKPVALNYIPTEIGKSGGLVLESGLVDRWIFATF 228
Query: 333 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ + + A ++E K+ GLHFL IQ + G WLL
Sbjct: 229 ESEAIAQA-ARSYEQRKQDSKGLHFLLIQPDDSGMTNTGIWLL 270
>gi|425455145|ref|ZP_18834870.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9807]
gi|389804026|emb|CCI17121.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9807]
Length = 267
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 121/291 (41%), Gaps = 40/291 (13%)
Query: 98 ITEWELDFCSRPILDIRG-KKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
+T W+ DF +W+L++ D L Y P + NS L + + C
Sbjct: 1 MTIWQADFYKSSSSSSPSLGTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQACQ- 59
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
V PE I+ FR + + A + L IK ++ +L LE R +
Sbjct: 60 --VSPPEIIQVFRPECANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI--------- 108
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------FSAVQEEVSSLESKFVFGASLD 269
+D+P P LPD G +W F + P F + + SL F + L
Sbjct: 109 ------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAF---SPLK 159
Query: 270 LDLLGIEVDDKTLIPGLAV-ASSRAKPLAAWM---NGLEVCSIETDTAR-GSLILSVGIS 324
L L +IPG+ + ++ +A W+ N + + I T+ R G L+L G++
Sbjct: 160 LGL-----ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLN 214
Query: 325 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
R+I+ Y+ V + A ++A K+ G HFL IQ + GFWLL
Sbjct: 215 ERWIFLTYEDEEVALA-ANIYQATKQESQGWHFLLIQPDDSGRTFTGFWLL 264
>gi|434399732|ref|YP_007133736.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
gi|428270829|gb|AFZ36770.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
Length = 269
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 124/280 (44%), Gaps = 22/280 (7%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF L+ +W+L++CD Q T + N + I I G
Sbjct: 4 WQADFYKFS-LNQNNSWLWKLLICDLE---QNTVFEQNCQQEDASANWLIHQINQAAGDK 59
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q + T A ++L IK + + R +L +Y T +P
Sbjct: 60 LPDVIQIFRPQALGLFTVAAQQLGIK-VEATRRTKILKQQLNKYITD-ANYP-------- 109
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDK 280
LA+D P P LP++L+G++W F + ++ +S + + L + + +
Sbjct: 110 -LAIDRPPPQPLPESLWGEQWNFATITADSLSNLISDRPIPILDTPTFLLP-INLGIAST 167
Query: 281 TLIPGLAV-ASSRAKPLAAWMNG---LEVCSIETDTAR-GSLILSVGISTRYIYANYKKN 335
+PG+ + A ++ LA W+ + I+T+ + G LIL G+ R+I ++
Sbjct: 168 INLPGVVIYAGKQSLKLARWLAAEKPFSLNYIDTEAGKSGGLILESGLVDRWIMTTFEDE 227
Query: 336 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
V + + +E K+ GLHFL IQ + G WLL
Sbjct: 228 KVAQA-GKIYEQRKQLSKGLHFLLIQPDDSGMTYTGLWLL 266
>gi|218248800|ref|YP_002374171.1| hypothetical protein PCC8801_4079 [Cyanothece sp. PCC 8801]
gi|218169278|gb|ACK68015.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8801]
Length = 273
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 124/282 (43%), Gaps = 24/282 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ + W+L++CD + + NS L + I
Sbjct: 4 WQADFYKNPLDHEKPNPQWQLIICDDQGQIICQENCQQKEANSNWLISQLKPIFQQNN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++T A KEL +K ++R L L+++ K
Sbjct: 62 -PDFIQVFRPQSLNLLTLAVKELGVKIQATRRTPELKAILKQQAA----------KTGAN 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSS--LESKFVFGASLDLDLLGIEVD 278
L LD P P LP NL+G+KW FV + E S + + + A ++L +
Sbjct: 111 SLKLDQPPPQPLPQNLWGEKWRFVSFRGGDMIEFFSDRPIPIRDIPEALFPINL---GIA 167
Query: 279 DKTLIPGLAVASSR-AKPLAAWMNGLE-VC--SIETDTAR-GSLILSVGISTRYIYANYK 333
IPG+ + + + LA W+ ++ VC I T+ G LIL G+ R+I A +
Sbjct: 168 STVNIPGIIIYGGKTSMYLARWLADIKPVCLNYIPTEMGHSGGLILEAGLVDRWILATF- 226
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++P A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 227 EDPEMAQAAQQYETQKQTSKGLHFLVVQPDDSEITYSGFWLL 268
>gi|257061859|ref|YP_003139747.1| hypothetical protein Cyan8802_4118 [Cyanothece sp. PCC 8802]
gi|256592025|gb|ACV02912.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8802]
Length = 273
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 124/282 (43%), Gaps = 24/282 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ + W+L++CD + + NS L + I
Sbjct: 4 WQADFYKNPLDHEKPNPQWQLIICDDQGQIICQENCRQKEANSNWLISQLKPIFQQNN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++T A KEL +K ++R L L+++ K
Sbjct: 62 -PDFIQVFRPQSLNLLTLAVKELGVKIQATRRTPQLKAILKQQAA----------KTGAN 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSS--LESKFVFGASLDLDLLGIEVD 278
L LD P P LP NL+G+KW FV + E S + + + A ++L +
Sbjct: 111 SLKLDQPPPQPLPQNLWGEKWRFVSFRGGDMIEFFSDRPIPIRDIPEALFPINL---GIA 167
Query: 279 DKTLIPGLAVASSR-AKPLAAWMNGLE-VC--SIETDTAR-GSLILSVGISTRYIYANYK 333
IPG+ + + + LA W+ ++ VC I T+ G LIL G+ R+I A +
Sbjct: 168 STVNIPGIIIYGGKTSMYLARWLADIKPVCLNYIPTEMGHSGGLILEAGLVDRWILATF- 226
Query: 334 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++P A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 227 EDPEMAQAAQQYETQKQTSKGLHFLVVQPDDSEITYSGFWLL 268
>gi|357521231|ref|XP_003630904.1| hypothetical protein MTR_8g104810 [Medicago truncatula]
gi|355524926|gb|AET05380.1| hypothetical protein MTR_8g104810 [Medicago truncatula]
Length = 108
Score = 71.2 bits (173), Expect = 7e-10, Method: Composition-based stats.
Identities = 30/32 (93%), Positives = 31/32 (96%)
Query: 105 FCSRPILDIRGKKIWELVVCDGSLSLQYTKYF 136
FCSRPILD+RGKKIWELVVCD SLSLQYTKYF
Sbjct: 43 FCSRPILDVRGKKIWELVVCDKSLSLQYTKYF 74
>gi|416393935|ref|ZP_11686049.1| hypothetical protein CWATWH0003_2850 [Crocosphaera watsonii WH
0003]
gi|357263417|gb|EHJ12432.1| hypothetical protein CWATWH0003_2850 [Crocosphaera watsonii WH
0003]
Length = 269
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 71/294 (24%), Positives = 120/294 (40%), Gaps = 52/294 (17%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF W L++CD + S+ + + NS L + ++
Sbjct: 4 WQADFYKHLSQTNENNTTWNLIICDQNSSIIHEASCQQSEANSNWLIAELESLVKQYS-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ ++ FR Q ++ K L I ++R L L++++ +
Sbjct: 62 -PDVVKVFRPQCLSLFQLLGKALGIYIEATRRTPQLKQILKDKFPSS------------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQ--------------LPFSAVQEEVSSLESKFVFGA 266
+ L+ P +P+NL+GDKW +P + EE++ ++
Sbjct: 108 -VKLEQSPPQAVPENLWGDKWRLATFKAGDFLDYFSDRPIPIKDLPEELNPID------- 159
Query: 267 SLDLDLLGIEVDDKTLIPGLAVASSR-AKPLAAWMNGLEVCS---IETDTAR-GSLILSV 321
LGI D K IPGL + R + LA W+ + S I TD + G LIL
Sbjct: 160 ------LGIASDIK--IPGLVIYGGRQSMYLARWLADNQPVSLNYIPTDVEKSGGLILES 211
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G+ R++ ++ + + S A+ +E K+ GLHFL IQ + G WLL
Sbjct: 212 GLVDRWVLLTFEDSEMAQS-AQKYEQQKEDSQGLHFLLIQPDDSGMTETGIWLL 264
>gi|67924121|ref|ZP_00517567.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
gi|67854046|gb|EAM49359.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
Length = 269
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 71/294 (24%), Positives = 120/294 (40%), Gaps = 52/294 (17%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF W L++CD + S+ + + NS L + ++
Sbjct: 4 WQADFYKHLSQTNENNTTWNLIICDQNSSIIHEASCQQSEANSNWLIAELESLVKQYS-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ ++ FR Q ++ K L I ++R L L++++ +
Sbjct: 62 -PDVVKVFRPQCLSLFQLLGKALGIYIEATRRTSQLKQILKDKFPSS------------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQ--------------LPFSAVQEEVSSLESKFVFGA 266
+ L+ P +P+NL+GDKW +P + EE++ ++
Sbjct: 108 -VKLEQSPPQAVPENLWGDKWRLATFKAGDFLDYFRDRPIPIKDLPEELNPID------- 159
Query: 267 SLDLDLLGIEVDDKTLIPGLAVASSR-AKPLAAWMNGLEVCS---IETDTAR-GSLILSV 321
LGI D K IPGL + R + LA W+ + S I TD + G LIL
Sbjct: 160 ------LGIASDIK--IPGLVIYGGRQSMYLARWLADNQPVSLNYIPTDVEKSGGLILES 211
Query: 322 GISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G+ R++ ++ + + S A+ +E K+ GLHFL IQ + G WLL
Sbjct: 212 GLVDRWVLLTFEDSEMAQS-AQKYEQQKEDSQGLHFLLIQPDDSGMTETGIWLL 264
>gi|172039290|ref|YP_001805791.1| hypothetical protein cce_4377 [Cyanothece sp. ATCC 51142]
gi|171700744|gb|ACB53725.1| DUF1092-containing protein [Cyanothece sp. ATCC 51142]
Length = 275
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 74/295 (25%), Positives = 121/295 (41%), Gaps = 38/295 (12%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCD------GSLSLQYTKYFPNNVINSITL 146
T +S+ W+ DF + W L+VCD S Q ++ N +I+ +
Sbjct: 2 TVSKSMIIWQADFYKHLSQEHENNTKWNLIVCDQQGVIIHQASCQQSEATSNWLISEL-- 59
Query: 147 KEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYET 206
E +V P+ I+ FR Q ++ K L+IK ++R L L+E+Y
Sbjct: 60 -EPLVKQYS------PDIIKVFRPQCLSLFALVGKRLEIKIEGTRRTPQLKQILQEKYPN 112
Query: 207 VYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV-FG 265
+ L+ P +P++L+GDKW F + S
Sbjct: 113 S--------------VKLEQSPPQAIPESLWGDKWHFATFKAGDFFDYFSDRPIPMKELP 158
Query: 266 ASLDLDLLGIEVDDKTLIPGLAVASSR-AKPLAAWMNGLEVCS---IETDTAR-GSLILS 320
+L+ LGI D IPG+ + R + LA W+ + S I T+ + G LIL
Sbjct: 159 EALNPIHLGIASD--VNIPGVVIYGGRQSMYLARWLADNQPVSLNYIPTEVNKSGGLILE 216
Query: 321 VGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
G+ R++ ++ N + A+ +E K+ GLHF +Q + G WLL
Sbjct: 217 SGLVDRWVLLTFE-NAEMSQSAQQYEKQKERTQGLHFFLLQPDDSGMTQTGIWLL 270
>gi|354552442|ref|ZP_08971750.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
gi|353555764|gb|EHC25152.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
Length = 269
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/287 (25%), Positives = 117/287 (40%), Gaps = 38/287 (13%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCD------GSLSLQYTKYFPNNVINSITLKEAIVAIC 154
W+ DF + W L+VCD S Q ++ N +I+ + E +V
Sbjct: 4 WQADFYKHLSQEHENNTKWNLIVCDQQGVIIHQASCQQSEATSNWLISEL---EPLVKQY 60
Query: 155 DDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGF 214
P+ I+ FR Q ++ K L+IK ++R L L+E+Y
Sbjct: 61 S------PDIIKVFRPQCLSLFALVGKRLEIKIEGTRRTPQLKQILQEKYPNS------- 107
Query: 215 QKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV-FGASLDLDLL 273
+ L+ P +P++L+GDKW F + S +L+ L
Sbjct: 108 -------VKLEQSPPQAIPESLWGDKWHFATFKAGDFFDYFSDRPIPMKELPEALNPIHL 160
Query: 274 GIEVDDKTLIPGLAVASSR-AKPLAAWMNGLEVCS---IETDTAR-GSLILSVGISTRYI 328
GI D IPG+ + R + LA W+ + S I T+ + G LIL G+ R++
Sbjct: 161 GIASD--VNIPGVVIYGGRQSMYLARWLADNQPVSLNYIPTEVNKSGGLILESGLVDRWV 218
Query: 329 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++ N + A+ +E K+ GLHF +Q + G WLL
Sbjct: 219 LLTFE-NAEMSQSAQQYEKQKERTQGLHFFLLQPDDSGMTQTGIWLL 264
>gi|126658961|ref|ZP_01730103.1| hypothetical protein CY0110_26702 [Cyanothece sp. CCY0110]
gi|126619759|gb|EAZ90486.1| hypothetical protein CY0110_26702 [Cyanothece sp. CCY0110]
Length = 270
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 66/281 (23%), Positives = 116/281 (41%), Gaps = 25/281 (8%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS-LQYTKYFPNNVINSITLKEAIVAICDDLGV 159
W+ DF + + W L++C+ + Y + NS L + +
Sbjct: 4 WQADFYKHLSQENKQNTTWNLIICNEQKGEIVYQSSCQQSEANSSWLIGQLEPFIKEYS- 62
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
P+ I+ FR Q ++ ++L +K ++R L L+E+Y
Sbjct: 63 --PDIIKVFRPQCLSLFQLVEEKLGVKIEGTRRTPQLKQILKEKYPNS------------ 108
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDD 279
+ L+ P +P++L+GDKW F + S + S +L+ + + +
Sbjct: 109 --IKLEQAPPQPIPESLWGDKWRFAAFKAGDFFDYFSDRPIP-IKDLSEELNPINLGIAS 165
Query: 280 KTLIPGLAVASSR-AKPLAAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIYANYKK 334
IPG+ + R + LA W + + I TD + G LIL G+ R+I ++
Sbjct: 166 DINIPGVVIYGGRQSMYLARWFAENQPVSLNYIPTDINQSGGLILESGLVDRWILLTFED 225
Query: 335 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ + S A+ +E K+ GLHFL IQ + G WLL
Sbjct: 226 SEMAES-AQQYEQQKEESQGLHFLLIQPDDSGMTQTGIWLL 265
>gi|282901430|ref|ZP_06309355.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
gi|281193709|gb|EFA68681.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
Length = 155
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 72/153 (47%), Gaps = 6/153 (3%)
Query: 229 PMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFV-FGASLDLDLLGIEVDDKTLIPGLA 287
PM LP++L+G++W FV + + EE S F S LG+ V IPG+
Sbjct: 6 PMPLPESLWGEQWCFVSVSAGDILEEFGSRSIPFKKITDSFVPAKLGLAV--TVSIPGVI 63
Query: 288 V-ASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAW 345
+ ++ LA W+N S+ A LIL + +I A + VT + + +
Sbjct: 64 IYGGKQSLRLARWLNENNPVSLNYIPGAPDGLILQSSSTNPWIVATFTDIDVTAA-GKVY 122
Query: 346 EAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 378
+ KK GG+HFL +Q + GFWLL D+
Sbjct: 123 QQRKKVSGGVHFLLVQPDHSGITFTGFWLLKDI 155
>gi|254432298|ref|ZP_05046001.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197626751|gb|EDY39310.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 97
Score = 57.0 bits (136), Expect = 1e-05, Method: Composition-based stats.
Identities = 34/98 (34%), Positives = 53/98 (54%), Gaps = 4/98 (4%)
Query: 283 IPGLAV-ASSRAKPLAAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSE 341
+PGL + ++SRA LA W+ GLE +E L+L G+ R++ A + P +
Sbjct: 1 MPGLRLFSASRALALAGWLAGLEPVRLEM--VDRQLVLEAGLEDRWLLATLPE-PEADAA 57
Query: 342 AEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 379
+A+ A+ GGL F+A+Q + GFW+L DLP
Sbjct: 58 RQAFAEARLRAGGLQFIAVQARESDQRFEGFWMLRDLP 95
>gi|357488599|ref|XP_003614587.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355515922|gb|AES97545.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 133
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 31/43 (72%), Gaps = 1/43 (2%)
Query: 198 LWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
LWL+E YETVY HPGFQ GSKPL DN F M+L + + G+K
Sbjct: 92 LWLDEHYETVYI-HPGFQIGSKPLFPFDNLFDMKLQNIIHGEK 133
>gi|282896203|ref|ZP_06304226.1| Protein of unknown function DUF1092 [Raphidiopsis brookii D9]
gi|281198892|gb|EFA73770.1| Protein of unknown function DUF1092 [Raphidiopsis brookii D9]
Length = 160
Score = 46.2 bits (108), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 43/151 (28%), Positives = 70/151 (46%), Gaps = 8/151 (5%)
Query: 229 PMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDL-LGIEVDDKTLIPGLA 287
PM LP++L+G++W F + + EE S F L + LG+ V IPG+
Sbjct: 6 PMPLPESLWGEQWCFASVSAGDILEEFGSRSIPFKKIPDSFLPVKLGLAV--TVSIPGVI 63
Query: 288 V-ASSRAKPLAAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAW 345
+ ++ LA W++ S+ A LIL + R+I A + VT + A+ +
Sbjct: 64 IYGGKQSLRLARWLSENNPVSLNYIAGAPDGLILQSSSTNRWIVATFTDTDVTAA-AKVY 122
Query: 346 EAAKKACGGLHFLAIQEELDSEDCVGFWLLL 376
+ KK G+HFL +Q D+ W L+
Sbjct: 123 QQRKKVSEGVHFLLVQP--DNSGMTFSWFLV 151
>gi|16329371|ref|NP_440099.1| hypothetical protein slr1110 [Synechocystis sp. PCC 6803]
gi|383321112|ref|YP_005381965.1| hypothetical protein SYNGTI_0203 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324282|ref|YP_005385135.1| hypothetical protein SYNPCCP_0203 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490166|ref|YP_005407842.1| hypothetical protein SYNPCCN_0203 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435432|ref|YP_005650156.1| hypothetical protein SYNGTS_0203 [Synechocystis sp. PCC 6803]
gi|451813530|ref|YP_007449982.1| hypothetical protein MYO_12030 [Synechocystis sp. PCC 6803]
gi|1651852|dbj|BAA16779.1| slr1110 [Synechocystis sp. PCC 6803]
gi|339272464|dbj|BAK48951.1| hypothetical protein SYNGTS_0203 [Synechocystis sp. PCC 6803]
gi|359270431|dbj|BAL27950.1| hypothetical protein SYNGTI_0203 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359273602|dbj|BAL31120.1| hypothetical protein SYNPCCN_0203 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359276772|dbj|BAL34289.1| hypothetical protein SYNPCCP_0203 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451779499|gb|AGF50468.1| hypothetical protein MYO_12030 [Synechocystis sp. PCC 6803]
Length = 192
Score = 42.0 bits (97), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 59/148 (39%), Gaps = 9/148 (6%)
Query: 234 DNLFGDKWAFVQLPFSAVQEEVSSLESKF-VFGASLDLDLLGIEVDDKTLIPGLAVASSR 292
D L G+ W FV LP + ++ L LG+ D IPG+ + R
Sbjct: 46 DRLLGESWQFVALPAQDIWPYFGDRPMRYQAMPEHLSPLRLGLAAD--LPIPGVVIYGGR 103
Query: 293 -AKPLAAWMNGLEVCSI----ETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEA 347
+ + W+ + S+ E G L+L R++ +K + T+ A +
Sbjct: 104 QCRFIGEWLAEQQPKSLVYIAEDPQQSGGLVLHTQNGDRWVMVTFKDGEMATA-AGVFSQ 162
Query: 348 AKKACGGLHFLAIQEELDSEDCVGFWLL 375
++ GLHFL +Q + G WLL
Sbjct: 163 RQQKAKGLHFLWLQPDNSGVTTTGVWLL 190
>gi|407957245|dbj|BAM50485.1| hypothetical protein BEST7613_1554 [Synechocystis sp. PCC 6803]
Length = 160
Score = 41.2 bits (95), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 39/161 (24%), Positives = 62/161 (38%), Gaps = 35/161 (21%)
Query: 234 DNLFGDKWAFVQLP--------------FSAVQEEVSSLESKFVFGASLDLDLLGIEVDD 279
D L G+ W FV LP + A+ E +S L G + DL
Sbjct: 14 DRLLGESWQFVALPAQDIWPYFGDRPMRYQAMPEHLSPLR----LGLAADLP-------- 61
Query: 280 KTLIPGLAVASSR-AKPLAAWMNGLEVCSI----ETDTARGSLILSVGISTRYIYANYKK 334
IPG+ + R + + W+ + S+ E G L+L R++ +K
Sbjct: 62 ---IPGVVIYGGRQCRFIGEWLAEQQPKSLVYIAEDPQQSGGLVLHTQNGDRWVMVTFKD 118
Query: 335 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
+ T+ A + ++ GLHFL +Q + G WLL
Sbjct: 119 GEMATA-AGVFSQRQQKAKGLHFLWLQPDNSGVTTTGVWLL 158
>gi|170078426|ref|YP_001735064.1| hypothetical protein SYNPCC7002_A1820 [Synechococcus sp. PCC 7002]
gi|157811858|gb|ABV80279.1| unknown [Synechococcus sp. PCC 7002]
gi|169886095|gb|ACA99808.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 160
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 33/152 (21%), Positives = 62/152 (40%), Gaps = 7/152 (4%)
Query: 229 PMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAV 288
P LPD L+G+ W F +P + + +L + + + LIPG +
Sbjct: 9 PQPLPDKLWGENWRFGSIPAGDFWDLFGDRPIP-ILSLPEELQPVKLGLASNVLIPGTII 67
Query: 289 ASSR-AKPLAAWMNGLEVCSI---ETD-TARGSLILSVGISTRYIYANYKKNPVTTSEAE 343
R + LA W+ + ++ ET+ G +L+ + R++ + + S +
Sbjct: 68 YGGRQSMQLAQWLQEQQPQTVFYQETEANLAGGFVLTGADTQRWVIMTFHDQAI-ASAGQ 126
Query: 344 AWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 375
++ + GLHFL +Q + WLL
Sbjct: 127 RYQQRLQQAQGLHFLLVQPDDSDVTHTALWLL 158
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.134 0.399
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,398,244,539
Number of Sequences: 23463169
Number of extensions: 272403890
Number of successful extensions: 790376
Number of sequences better than 100.0: 255
Number of HSP's better than 100.0 without gapping: 196
Number of HSP's successfully gapped in prelim test: 59
Number of HSP's that attempted gapping in prelim test: 789437
Number of HSP's gapped (non-prelim): 268
length of query: 383
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 239
effective length of database: 8,980,499,031
effective search space: 2146339268409
effective search space used: 2146339268409
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 78 (34.7 bits)