BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019808
(335 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297829368|ref|XP_002882566.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp.
lyrata]
gi|297328406|gb|EFH58825.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp.
lyrata]
Length = 374
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 238/374 (63%), Positives = 269/374 (71%), Gaps = 54/374 (14%)
Query: 10 NSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEA 69
N+T +P+L I K +S TKP F + T + F R S+ ESSLS+ KE
Sbjct: 7 NTTRIQTPSLPR---IPKPSSFTKPIKTHHLFSSETLLKRCRFVSR-SLPESSLSITKEQ 62
Query: 70 DAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLS 129
+ E + EDDPT ELSYLD E+D +SI EWELDFCSRPILD RGKKIWELVVCD SLS
Sbjct: 63 EVANEVE--EDDPTSELSYLDPESDADSIKEWELDFCSRPILDSRGKKIWELVVCDASLS 120
Query: 130 LQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIP 189
LQ TKYFPNNVINSITLK+AIV I DLGVP+PEKIRFFRSQMQTIITKACKEL IK +P
Sbjct: 121 LQVTKYFPNNVINSITLKDAIVTITQDLGVPLPEKIRFFRSQMQTIITKACKELAIKAVP 180
Query: 190 SKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS 249
SKRCLSL LWL+ERY+TVYTRHPGFQKGS PLL+LDNPFPM LP+NLFG+KWAFVQLP+S
Sbjct: 181 SKRCLSLFLWLQERYDTVYTRHPGFQKGSLPLLSLDNPFPMNLPENLFGEKWAFVQLPYS 240
Query: 250 ------------------------------------------------AWMNGLEVCSIE 261
AWMNGLEVCSIE
Sbjct: 241 AVREEISDFEEKFVFGATLDLDLLGIEVDENTLIPGLSVATSRAKPLAAWMNGLEVCSIE 300
Query: 262 TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDC 321
D+++G LILSVGI+TRY+YA YKK PVTT EAEAWE+AKKA GGLHFLAIQ++LDS+DC
Sbjct: 301 ADSSKGCLILSVGIATRYVYATYKKTPVTTDEAEAWESAKKASGGLHFLAIQDDLDSDDC 360
Query: 322 VGFWLLLDLPPPPV 335
VGFWLL+DLPPPPV
Sbjct: 361 VGFWLLIDLPPPPV 374
>gi|18398129|ref|NP_566327.1| RNA binding protein [Arabidopsis thaliana]
gi|6648213|gb|AAF21211.1|AC013483_35 unknown protein [Arabidopsis thaliana]
gi|18252181|gb|AAL61923.1| unknown protein [Arabidopsis thaliana]
gi|24899681|gb|AAN65055.1| unknown protein [Arabidopsis thaliana]
gi|332641109|gb|AEE74630.1| RNA binding protein [Arabidopsis thaliana]
Length = 374
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 236/374 (63%), Positives = 267/374 (71%), Gaps = 54/374 (14%)
Query: 10 NSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEA 69
N+ +P+L I K +S TKP F + T + F R S+ ESSLS+ KE
Sbjct: 7 NTRRIQTPSLPR---IPKPSSFTKPIKTHHLFSSETLLKRCRFVSR-SLPESSLSITKEQ 62
Query: 70 DAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLS 129
+ E + EDDPT ELSYLD E+D +SI EWELDFCSRPILD RGKKIWELVVCD SLS
Sbjct: 63 EVANEVE--EDDPTSELSYLDPESDADSIKEWELDFCSRPILDSRGKKIWELVVCDASLS 120
Query: 130 LQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIP 189
LQ TKYFPNNVINSITLK+AIV I DLGVP+PEKIRFFRSQMQTIITKACKEL IK +P
Sbjct: 121 LQVTKYFPNNVINSITLKDAIVTITQDLGVPLPEKIRFFRSQMQTIITKACKELAIKAVP 180
Query: 190 SKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS 249
SKRCLSL LWL+ERY+TVYTRHPGFQKGS PLL+LDNPFPM LP+NLFG+KWAFVQLP+S
Sbjct: 181 SKRCLSLFLWLQERYDTVYTRHPGFQKGSLPLLSLDNPFPMNLPENLFGEKWAFVQLPYS 240
Query: 250 ------------------------------------------------AWMNGLEVCSIE 261
AWMNGLEVCSIE
Sbjct: 241 AVREEISDFDEKFVFGASLDLDLLGIEVDENTLIPGLSVATSRAKPLAAWMNGLEVCSIE 300
Query: 262 TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDC 321
D+++G LILSVGI+TRY+YA YKK PVTT EAEAWE+AKK GGLHFLAIQ++LDS+DC
Sbjct: 301 ADSSKGCLILSVGIATRYVYATYKKTPVTTDEAEAWESAKKTSGGLHFLAIQDDLDSDDC 360
Query: 322 VGFWLLLDLPPPPV 335
VGFWLL+DLPPPPV
Sbjct: 361 VGFWLLIDLPPPPV 374
>gi|255553548|ref|XP_002517815.1| conserved hypothetical protein [Ricinus communis]
gi|223543087|gb|EEF44622.1| conserved hypothetical protein [Ricinus communis]
Length = 377
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 245/367 (66%), Positives = 277/367 (75%), Gaps = 53/367 (14%)
Query: 11 STSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQ----HFRPRPSVSESSLSVP 66
+T + +PT HKPISK TS +KPT V F ++ PP+ HF+ + SVS
Sbjct: 2 ATLSFNPTRIPHKPISKITSFSKPTKVYFP-VSQKPPKTHQKQLHFQSKLSVSTQEQVEV 60
Query: 67 KEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDG 126
++ D E E ++V+DDPT E+SYLD ETDP+SI EWELDFCSRPILDIRGKK+WELVVCD
Sbjct: 61 EDYDNEEEEEEVDDDPTAEVSYLDPETDPDSIVEWELDFCSRPILDIRGKKVWELVVCDD 120
Query: 127 SLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIK 186
SLSLQ+TKYFPNNVINSITLK+A+V++ +DLGVP+PEKIRFFRSQMQTIITKACKEL+IK
Sbjct: 121 SLSLQFTKYFPNNVINSITLKDALVSVSEDLGVPLPEKIRFFRSQMQTIITKACKELNIK 180
Query: 187 PIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL 246
P+PSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELP+NLFG+KWAFVQL
Sbjct: 181 PVPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPENLFGEKWAFVQL 240
Query: 247 PFS------------------------------------------------AWMNGLEVC 258
PFS AWMNGLEVC
Sbjct: 241 PFSAVQEEVSSLETRFMFGASLDLDLLGIEIGEKTLIPGLAVASSRAKPLAAWMNGLEVC 300
Query: 259 SIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDS 318
SIE DT+R LILSVG+STRYIYA YKKNPVTT+EAEAWEAAKK CGGLHFLAIQE+LDS
Sbjct: 301 SIEADTSRACLILSVGLSTRYIYATYKKNPVTTAEAEAWEAAKKTCGGLHFLAIQEDLDS 360
Query: 319 EDCVGFW 325
EDCVGFW
Sbjct: 361 EDCVGFW 367
>gi|225450083|ref|XP_002278058.1| PREDICTED: uncharacterized protein LOC100243060 [Vitis vinifera]
Length = 378
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 255/373 (68%), Positives = 279/373 (74%), Gaps = 57/373 (15%)
Query: 4 AALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTN---TPPRLQHFRPRPSVSE 60
A LSLN T +PTL SHKPI +F SLT PT F TN T P+L HFR SVSE
Sbjct: 2 AGLSLN-PTKITTPTLQSHKPIYRFNSLTNPTKTQLKFPTNPAKTHPKLLHFR-HSSVSE 59
Query: 61 SSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWE 120
SS+SVPKE + + E DD PT E++YLD ETDPESI+EWELDFCSRPILDIRGKKIWE
Sbjct: 60 SSVSVPKEVEVDDEEDD----PTSEMNYLDRETDPESISEWELDFCSRPILDIRGKKIWE 115
Query: 121 LVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC 180
L+VCD SLSLQYTKYFPNNVINS+TLK AI +I D+L VP+PEKIRFFRSQMQTI+TKAC
Sbjct: 116 LLVCDSSLSLQYTKYFPNNVINSVTLKNAIESISDELDVPLPEKIRFFRSQMQTIVTKAC 175
Query: 181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
KEL IKPIPSKRCLSL+LWLEERYETVYTRHPGFQ+GSKPLL LDNPFPM+LP+NLFG+K
Sbjct: 176 KELGIKPIPSKRCLSLILWLEERYETVYTRHPGFQQGSKPLLTLDNPFPMQLPENLFGEK 235
Query: 241 WAFVQLPFS------------------------------------------------AWM 252
WAFVQLPFS AWM
Sbjct: 236 WAFVQLPFSAVQEEVSSLETRLVFGASLDLDLLGIEVDANTLIPGLAVASSRAKPLAAWM 295
Query: 253 NGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAI 312
NGLEVCSIE DTAR LILSVGISTRYIYA YKK PVTTSEAEAWEAAKKACGGLHFLAI
Sbjct: 296 NGLEVCSIEADTARACLILSVGISTRYIYATYKKTPVTTSEAEAWEAAKKACGGLHFLAI 355
Query: 313 QEELDSEDCVGFW 325
Q++L+S+DCVGFW
Sbjct: 356 QDDLNSDDCVGFW 368
>gi|118489335|gb|ABK96472.1| unknown [Populus trichocarpa x Populus deltoides]
Length = 376
Score = 432 bits (1112), Expect = e-119, Method: Compositional matrix adjust.
Identities = 233/367 (63%), Positives = 267/367 (72%), Gaps = 54/367 (14%)
Query: 11 STSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRP----RPSVSESSLSVP 66
+T + +PT HKPISK S +K + + F F + P H +P +++ S+S
Sbjct: 2 ATLSFNPTRIPHKPISKTASFSKTSEMPFPF--SLKPSKHHVKPLHLQSNIITKLSVSTQ 59
Query: 67 KEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDG 126
+E + D EDDPT E YLD+ETDP+SI EWELDFCSRPILD+RGKK+WELVVCD
Sbjct: 60 EEEVETEKEDLEEDDPTAETVYLDQETDPDSILEWELDFCSRPILDVRGKKVWELVVCDD 119
Query: 127 SLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIK 186
SLSLQ+TKYFPNNVINSITLK+AIV+I DLGVP+PE+IRFFRSQMQTIITKACKE+ IK
Sbjct: 120 SLSLQFTKYFPNNVINSITLKDAIVSISVDLGVPLPERIRFFRSQMQTIITKACKEIGIK 179
Query: 187 PIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL 246
PIPSKRC+SLLLWLEERYETVYTRHPGFQKG+KPLLALDNPFPMELPDNLFG+KWAFVQL
Sbjct: 180 PIPSKRCISLLLWLEERYETVYTRHPGFQKGAKPLLALDNPFPMELPDNLFGEKWAFVQL 239
Query: 247 PFS------------------------------------------------AWMNGLEVC 258
PFS AWMNGLEVC
Sbjct: 240 PFSAVREEIASFETSFFFGASLDLDLLGIEIDDKTMIPGLAVASSRAEPLAAWMNGLEVC 299
Query: 259 SIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDS 318
+IE DT+R LILSVGI+TRY+YA YKK PVTT+EAEAWEAAKKACGGLHFLAIQ +LDS
Sbjct: 300 AIEADTSRACLILSVGIATRYVYATYKKTPVTTAEAEAWEAAKKACGGLHFLAIQNDLDS 359
Query: 319 EDCVGFW 325
+DCVGFW
Sbjct: 360 DDCVGFW 366
>gi|449436313|ref|XP_004135937.1| PREDICTED: uncharacterized protein LOC101208052 [Cucumis sativus]
gi|449488836|ref|XP_004158187.1| PREDICTED: uncharacterized protein LOC101230638 [Cucumis sativus]
Length = 379
Score = 429 bits (1104), Expect = e-118, Method: Compositional matrix adjust.
Identities = 234/353 (66%), Positives = 265/353 (75%), Gaps = 56/353 (15%)
Query: 23 KPI-SKFTSLTKPTN-VSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEIEADDVED 80
KPI S F+ K N S N + P L FR SVSESS++ P+E +E ++ ED
Sbjct: 23 KPIYSPFSQSIKTANRFSANGRISQQP-LPRFRSN-SVSESSVTAPEE----VELNEDED 76
Query: 81 DPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNV 140
DPT E++YLD ETDPESITEWELDFCSRPILDIRGKK+WELVVCD SLSLQYTKYFPNNV
Sbjct: 77 DPTLEMAYLDSETDPESITEWELDFCSRPILDIRGKKVWELVVCDNSLSLQYTKYFPNNV 136
Query: 141 INSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWL 200
INSITL++A+ +I ++LGVP+P+KIRFFRSQMQTIITKAC EL IKPIPSKRCLSLLLWL
Sbjct: 137 INSITLRDAVSSIAEELGVPLPDKIRFFRSQMQTIITKACTELGIKPIPSKRCLSLLLWL 196
Query: 201 EERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS----------- 249
EERYETVYTRHPGFQKGSKPLLALDNPFPMELP+NLFG++WAFVQLPFS
Sbjct: 197 EERYETVYTRHPGFQKGSKPLLALDNPFPMELPENLFGERWAFVQLPFSAVQEEISNLKE 256
Query: 250 -------------------------------------AWMNGLEVCSIETDTARGSLILS 272
AWMNG+EV S+E DT+R SLILS
Sbjct: 257 TFMFGSSLDLDLLGIEIDDKTMIPGLSVATSRAQPLAAWMNGMEVYSVEADTSRASLILS 316
Query: 273 VGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
VGI+TRY+YA YKK PVT++EAEAWEAAKKACGGLHFLAIQ++LDSEDCVGFW
Sbjct: 317 VGIATRYVYATYKKTPVTSAEAEAWEAAKKACGGLHFLAIQDDLDSEDCVGFW 369
>gi|388502160|gb|AFK39146.1| unknown [Lotus japonicus]
Length = 382
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 225/373 (60%), Positives = 270/373 (72%), Gaps = 53/373 (14%)
Query: 4 AALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSF--NFLTNTPPRLQHFRPRPSVSES 61
A LS N T +PT N P +K TS +KP + + + ++ +L HFR SVSE+
Sbjct: 2 ATLSFN-PTRIRTPTFNRSNPSTKLTSSSKPIRIPCIPSSINHSHQKLIHFRAN-SVSET 59
Query: 62 SLSVPKEADAEIEADDVEDD-PTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWE 120
SLS KE + E D+ EDD PT E+S+LD ETDP++I++WELDFCSRPILD RGKK+WE
Sbjct: 60 SLSTQKEEEQETLGDEEEDDDPTAEMSFLDPETDPDAISDWELDFCSRPILDARGKKLWE 119
Query: 121 LVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC 180
LVVCD +LSLQ+TKYFPNNVINSITLK+A+V++CDDLG+P+P+KIRFFRSQMQTIIT+AC
Sbjct: 120 LVVCDSTLSLQFTKYFPNNVINSITLKDAVVSVCDDLGLPLPKKIRFFRSQMQTIITRAC 179
Query: 181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
EL IKP+PSKRCLSLLLWLEERYETVY +HPGFQKG PLLALDNPFP +LP++LFG++
Sbjct: 180 NELGIKPVPSKRCLSLLLWLEERYETVYKKHPGFQKGFTPLLALDNPFPTKLPEDLFGER 239
Query: 241 WAFVQLPF------------------------------------------------SAWM 252
WAFVQLPF SA M
Sbjct: 240 WAFVQLPFSAVREELTSLQTNMIFGSGLDLDLMGIEIDDKTMIPGLAVGSSRATVLSAIM 299
Query: 253 NGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAI 312
N E+C++E DTARGSLILSVGISTRY+YA YKK P TTSEAEAWEAAKKACGGLHFLAI
Sbjct: 300 NSFELCTVEADTARGSLILSVGISTRYVYATYKKTPTTTSEAEAWEAAKKACGGLHFLAI 359
Query: 313 QEELDSEDCVGFW 325
Q++++SE+C GFW
Sbjct: 360 QQDIESEECAGFW 372
>gi|224104083|ref|XP_002313311.1| predicted protein [Populus trichocarpa]
gi|222849719|gb|EEE87266.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 215/308 (69%), Positives = 239/308 (77%), Gaps = 49/308 (15%)
Query: 67 KEADAEIEADDVE-DDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCD 125
+E + E E D E DDPT E+ YLD ETDP+SI EWELDFCSRPILD+RGKK+WELVVCD
Sbjct: 8 QEEEVETEKKDYEEDDPTTEMVYLDPETDPDSIVEWELDFCSRPILDVRGKKVWELVVCD 67
Query: 126 GSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDI 185
SLSLQ+TKYFPNNVINSITLK+AIV+I +DLGVP+PE+IRFFRSQMQTIITKACKE+ I
Sbjct: 68 DSLSLQFTKYFPNNVINSITLKDAIVSISEDLGVPLPERIRFFRSQMQTIITKACKEIGI 127
Query: 186 KPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ 245
KPIPSKRC+SLLLWLEERYETVYTRHPGFQKG+KPLLALDNPFPMELPDNLFG+KWAFVQ
Sbjct: 128 KPIPSKRCISLLLWLEERYETVYTRHPGFQKGAKPLLALDNPFPMELPDNLFGEKWAFVQ 187
Query: 246 LPFS------------------------------------------------AWMNGLEV 257
LP+S AWMNGLEV
Sbjct: 188 LPYSAVREEIASLETSFFFGASLDLDLLGIEIDDKTMIPGLAVASSRAEPLAAWMNGLEV 247
Query: 258 CSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELD 317
+IE DT+R LILSVGI+TRY+YA YKK PVTT+EAEAWEAAKKACGGLHFLAIQ +LD
Sbjct: 248 VAIEADTSRACLILSVGIATRYVYATYKKTPVTTAEAEAWEAAKKACGGLHFLAIQNDLD 307
Query: 318 SEDCVGFW 325
S+DCVGFW
Sbjct: 308 SDDCVGFW 315
>gi|224104081|ref|XP_002313310.1| predicted protein [Populus trichocarpa]
gi|222849718|gb|EEE87265.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust.
Identities = 211/295 (71%), Positives = 231/295 (78%), Gaps = 48/295 (16%)
Query: 79 EDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPN 138
EDDPT E YLD+ETDP+SI EWELDFCSRPILD+RGKK+WELVVCD SLSLQ+TKYFPN
Sbjct: 21 EDDPTAETVYLDQETDPDSIVEWELDFCSRPILDVRGKKVWELVVCDDSLSLQFTKYFPN 80
Query: 139 NVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLL 198
NVINSITLK+AIV+I DLGVP+PE+IRFFRSQM TIITKACKE+ IKPIPSKRC+SLLL
Sbjct: 81 NVINSITLKDAIVSISVDLGVPLPERIRFFRSQMLTIITKACKEIGIKPIPSKRCISLLL 140
Query: 199 WLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS--------- 249
WLEERYETVYTRHPGFQKG+KPLLALDNPFPMELPDNLFG+KWAFVQLPFS
Sbjct: 141 WLEERYETVYTRHPGFQKGAKPLLALDNPFPMELPDNLFGEKWAFVQLPFSAVREEIASL 200
Query: 250 ---------------------------------------AWMNGLEVCSIETDTARGSLI 270
AWMNGLEV +IE DT+R LI
Sbjct: 201 ETRFFFGASLDLDLLGIEIDDKTMIPGLAVASSRAEPLAAWMNGLEVVAIEADTSRACLI 260
Query: 271 LSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
LSVGI+TRY+YA YKK PVTT+EAEAWEAAKKACGGLHFLAIQ +LDS+DCVGFW
Sbjct: 261 LSVGIATRYVYATYKKTPVTTAEAEAWEAAKKACGGLHFLAIQNDLDSDDCVGFW 315
>gi|363807199|ref|NP_001242607.1| uncharacterized protein LOC100795572 [Glycine max]
gi|255640179|gb|ACU20380.1| unknown [Glycine max]
Length = 377
Score = 409 bits (1051), Expect = e-112, Method: Compositional matrix adjust.
Identities = 234/382 (61%), Positives = 269/382 (70%), Gaps = 56/382 (14%)
Query: 4 AALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSL 63
A LS N SPT SK T+ +K + +N+ P+L HFRPR SVSES+
Sbjct: 2 ATLSFN-PVRIKSPTFKH----SKLTTPSKRITIPCTTPSNSHPKLLHFRPR-SVSESTQ 55
Query: 64 SVPKEA---DAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWE 120
EA + E E +D +DDP+ ELSY+D TDPESITEWELDFCSRPILD RGKK+WE
Sbjct: 56 KEAPEAVLGEEEEEEEDDDDDPSAELSYVDPVTDPESITEWELDFCSRPILDARGKKVWE 115
Query: 121 LVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC 180
LVVC +LSLQYTKYFPNNVINSITLK+AIVA+ D LGVP+P IRFFRSQMQTIIT AC
Sbjct: 116 LVVCGKTLSLQYTKYFPNNVINSITLKDAIVAVSDQLGVPLPRNIRFFRSQMQTIITNAC 175
Query: 181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
EL I+P+PSKRC+S++LWLEERYETVY +HPGFQ+GSKPLLALDNPFP ELPD L+G++
Sbjct: 176 NELRIRPVPSKRCVSIILWLEERYETVYKKHPGFQEGSKPLLALDNPFPTELPDILYGER 235
Query: 241 WAFVQLPFS-----------------------------------------------AWMN 253
WAFVQLP+S A +N
Sbjct: 236 WAFVQLPYSAVREEISTFERGVCGSGLDLDLLGLDIDDKTLIPGLSVASSNSTALAALIN 295
Query: 254 GLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQ 313
GLEVC++E DTAR LILS GISTRYIY+ YKK P TTSEAEAWEAAKKACGGLHFLA+Q
Sbjct: 296 GLEVCAVEADTARARLILSSGISTRYIYSTYKKTPETTSEAEAWEAAKKACGGLHFLAVQ 355
Query: 314 EELDSEDCVGFWLLLDLPPPPV 335
+LDSEDCVGF+LLLDLP PPV
Sbjct: 356 PDLDSEDCVGFFLLLDLPFPPV 377
>gi|356534594|ref|XP_003535838.1| PREDICTED: uncharacterized protein LOC100803590 [Glycine max]
Length = 378
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 225/359 (62%), Positives = 260/359 (72%), Gaps = 55/359 (15%)
Query: 29 TSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEI-----EADDVEDDPT 83
T+ +KP + +N+ P+L HFR R SVSES+ KEA + E +D +DDPT
Sbjct: 23 TTPSKPITIPCTTPSNSHPKLLHFRTR-SVSESTHQ--KEAPEAVLGEHEEEEDDDDDPT 79
Query: 84 QELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINS 143
ELSY+D ETDPESITEWELDFCSRPILD+RGKKIWELVVCD +LSLQYTKYFPNNVINS
Sbjct: 80 SELSYVDPETDPESITEWELDFCSRPILDVRGKKIWELVVCDKTLSLQYTKYFPNNVINS 139
Query: 144 ITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEER 203
ITLK+AIVA+ D LGVP+P IRFFRSQMQTIIT AC EL I+P+PSKRC+S++LWLEER
Sbjct: 140 ITLKDAIVAVSDQLGVPLPRNIRFFRSQMQTIITNACNELRIRPVPSKRCVSIILWLEER 199
Query: 204 YETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS-------------- 249
YETVY +HPGFQ+GSKPLLALDNPFP ELPD L+G++WAFVQLP+S
Sbjct: 200 YETVYRKHPGFQEGSKPLLALDNPFPTELPDILYGERWAFVQLPYSAVREEISTFERGVC 259
Query: 250 ---------------------------------AWMNGLEVCSIETDTARGSLILSVGIS 276
A +NGLEV ++E D R LILS GIS
Sbjct: 260 GSGLDLELLGLDIDDKTLIPGLSVASSNATALAALINGLEVSAVEADAPRARLILSAGIS 319
Query: 277 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
TRYIY+ YKK P TTSEAEAWEAAKKACGGLHF+A+Q +LDSEDCVGF+LLLDLP PPV
Sbjct: 320 TRYIYSTYKKTPETTSEAEAWEAAKKACGGLHFIAVQPDLDSEDCVGFFLLLDLPFPPV 378
>gi|297736276|emb|CBI24914.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 209/288 (72%), Positives = 227/288 (78%), Gaps = 48/288 (16%)
Query: 86 LSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSIT 145
++YLD ETDPESI+EWELDFCSRPILDIRGKKIWEL+VCD SLSLQYTKYFPNNVINS+T
Sbjct: 1 MNYLDRETDPESISEWELDFCSRPILDIRGKKIWELLVCDSSLSLQYTKYFPNNVINSVT 60
Query: 146 LKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYE 205
LK AI +I D+L VP+PEKIRFFRSQMQTI+TKACKEL IKPIPSKRCLSL+LWLEERYE
Sbjct: 61 LKNAIESISDELDVPLPEKIRFFRSQMQTIVTKACKELGIKPIPSKRCLSLILWLEERYE 120
Query: 206 TVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS---------------- 249
TVYTRHPGFQ+GSKPLL LDNPFPM+LP+NLFG+KWAFVQLPFS
Sbjct: 121 TVYTRHPGFQQGSKPLLTLDNPFPMQLPENLFGEKWAFVQLPFSAVQEEVSSLETRLVFG 180
Query: 250 --------------------------------AWMNGLEVCSIETDTARGSLILSVGIST 277
AWMNGLEVCSIE DTAR LILSVGIST
Sbjct: 181 ASLDLDLLGIEVDANTLIPGLAVASSRAKPLAAWMNGLEVCSIEADTARACLILSVGIST 240
Query: 278 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
RYIYA YKK PVTTSEAEAWEAAKKACGGLHFLAIQ++L+S+DCVGFW
Sbjct: 241 RYIYATYKKTPVTTSEAEAWEAAKKACGGLHFLAIQDDLNSDDCVGFW 288
>gi|357457965|ref|XP_003599263.1| hypothetical protein MTR_3g030950 [Medicago truncatula]
gi|355488311|gb|AES69514.1| hypothetical protein MTR_3g030950 [Medicago truncatula]
Length = 380
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 222/372 (59%), Positives = 262/372 (70%), Gaps = 53/372 (14%)
Query: 4 AALSLNNSTSTNSPTLNSHKPI--SKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSES 61
A LS N ST +P+ N PI +K +S +KP + F F +N L+ S + S
Sbjct: 2 ATLSFN-STRIKTPSFNYTNPIITTKLSS-SKPI-IKFPFSSNKNHFLKLQISSVSETSS 58
Query: 62 SLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWEL 121
+ + K+ + E E ++ ++DPT E YLD E DP+SI WELDFCSRPILD RGKK+WEL
Sbjct: 59 TTTTQKDIEEEEEEEEEKEDPTAETCYLDPEADPDSILSWELDFCSRPILDARGKKLWEL 118
Query: 122 VVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACK 181
VVCD SLSLQYTKYFPNNVINSITLK++IVAICDDL +P+P IRFFRSQMQTIITKACK
Sbjct: 119 VVCDKSLSLQYTKYFPNNVINSITLKDSIVAICDDLDLPVPRNIRFFRSQMQTIITKACK 178
Query: 182 ELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKW 241
EL I+ +PSKRCLSLLLWLEERYETVYT+HPGFQKGSKPLL LDNPF +LP++LFG++W
Sbjct: 179 ELGIRALPSKRCLSLLLWLEERYETVYTKHPGFQKGSKPLLPLDNPFATKLPEDLFGERW 238
Query: 242 AFVQLPF------------------------------------------------SAWMN 253
AFVQLP+ SA+MN
Sbjct: 239 AFVQLPYSAVRAEASASEERFGYGSGLDLDLLGIEIDEKTLIPGLAVASSRAKILSAFMN 298
Query: 254 GLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQ 313
GLE+CSIETDTAR +L LSVGISTRY+YA YKK+P +T EAEAWEAAKKA GGLHFLAIQ
Sbjct: 299 GLELCSIETDTARSNLTLSVGISTRYVYATYKKSPTSTKEAEAWEAAKKASGGLHFLAIQ 358
Query: 314 EELDSEDCVGFW 325
+ELDSEDC+GFW
Sbjct: 359 DELDSEDCIGFW 370
>gi|226508054|ref|NP_001150851.1| tab2 protein [Zea mays]
gi|194702852|gb|ACF85510.1| unknown [Zea mays]
gi|195642376|gb|ACG40656.1| tab2 protein [Zea mays]
gi|413937739|gb|AFW72290.1| Tab2 protein [Zea mays]
Length = 390
Score = 372 bits (956), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/305 (60%), Positives = 224/305 (73%), Gaps = 49/305 (16%)
Query: 69 ADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSL 128
AD E+EA++ + DP E+ YLD + DPESI EWELDFCSRPILD RGKK+WELVVCD +L
Sbjct: 77 ADEEVEAEN-KVDPQAEVCYLDPDVDPESIREWELDFCSRPILDARGKKVWELVVCDATL 135
Query: 129 SLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPI 188
SLQ+T+YFPNN INS+TL++A+ ++ + LGVP+P+++RFFRSQMQTIIT+AC +L +K +
Sbjct: 136 SLQFTRYFPNNAINSVTLRDALASVSEALGVPMPDRVRFFRSQMQTIITRACGDLGVKAV 195
Query: 189 PSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF 248
PS+RC+SLLLWLEERYE VY+RHPGFQ G++PLLALDNPFP LP+NLFGDKWAFVQLPF
Sbjct: 196 PSRRCVSLLLWLEERYEVVYSRHPGFQAGTRPLLALDNPFPTTLPENLFGDKWAFVQLPF 255
Query: 249 S------------------------------------------------AWMNGLEVCSI 260
S AWMNGLE+C++
Sbjct: 256 SAVREEVESLERRYAFGAGLDLELLGFELDDTTLVPGVAVESSRAKPLAAWMNGLEICAM 315
Query: 261 ETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSED 320
E DT R SLILS G+STRY+Y+ Y+K +T EAEAWEAAKKACGGLHFLAIQE L+S+
Sbjct: 316 EADTGRASLILSAGVSTRYVYSGYQKTAASTQEAEAWEAAKKACGGLHFLAIQENLNSDG 375
Query: 321 CVGFW 325
CVGFW
Sbjct: 376 CVGFW 380
>gi|115447245|ref|NP_001047402.1| Os02g0610800 [Oryza sativa Japonica Group]
gi|47497182|dbj|BAD19229.1| putative Tab2 protein [Oryza sativa Japonica Group]
gi|113536933|dbj|BAF09316.1| Os02g0610800 [Oryza sativa Japonica Group]
gi|215704647|dbj|BAG94275.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 392
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 179/300 (59%), Positives = 216/300 (72%), Gaps = 48/300 (16%)
Query: 74 EADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYT 133
++++ E DP E+ YLD E D E I EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T
Sbjct: 83 DSEEEEMDPLAEVCYLDPEADAEGIREWELDFCSRPILDARGKKVWELVVCDATLSLQFT 142
Query: 134 KYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRC 193
++FPN INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC
Sbjct: 143 RFFPNTSINSVTLRDALASVATSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRC 202
Query: 194 LSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS---- 249
+SLLLWLEERYETVY+RHPGFQ G+KPLL LDNPFP LP+NLFGDKWAFVQLPFS
Sbjct: 203 VSLLLWLEERYETVYSRHPGFQSGTKPLLTLDNPFPTSLPENLFGDKWAFVQLPFSAVRE 262
Query: 250 --------------------------------------------AWMNGLEVCSIETDTA 265
AWMNGLE+CS+E DT
Sbjct: 263 EVESLERRYAFGAGLDLDLLGFELDENTLIPGVAVESSRAKPLAAWMNGLEICSMEVDTG 322
Query: 266 RGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
R +LILS G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFW
Sbjct: 323 RANLILSAGVSTRYVYAGYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDGCVGFW 382
>gi|125540251|gb|EAY86646.1| hypothetical protein OsI_08028 [Oryza sativa Indica Group]
Length = 392
Score = 364 bits (934), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 179/300 (59%), Positives = 216/300 (72%), Gaps = 48/300 (16%)
Query: 74 EADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYT 133
++++ E DP E+ YLD E D E I EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T
Sbjct: 83 DSEEEEMDPLAEVCYLDPEADAEGIREWELDFCSRPILDARGKKVWELVVCDATLSLQFT 142
Query: 134 KYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRC 193
++FPN INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC
Sbjct: 143 RFFPNTSINSVTLRDALASVATSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRC 202
Query: 194 LSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS---- 249
+SLLLWLEERYETVY+RHPGFQ G+KPLL LDNPFP LP+NLFGDKWAFVQLPFS
Sbjct: 203 VSLLLWLEERYETVYSRHPGFQSGTKPLLTLDNPFPTSLPENLFGDKWAFVQLPFSAVRE 262
Query: 250 --------------------------------------------AWMNGLEVCSIETDTA 265
AWMNGLE+CS+E DT
Sbjct: 263 EVESLERRYAFGAGLDLDLLGFELDENTLIPGVAVESSRAKPLAAWMNGLEICSMEVDTG 322
Query: 266 RGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
R +LILS G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFW
Sbjct: 323 RANLILSAGVSTRYVYAGYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDGCVGFW 382
>gi|125582848|gb|EAZ23779.1| hypothetical protein OsJ_07488 [Oryza sativa Japonica Group]
Length = 304
Score = 360 bits (923), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 178/293 (60%), Positives = 211/293 (72%), Gaps = 48/293 (16%)
Query: 81 DPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNV 140
DP E+ YLD E D E I EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T++FPN
Sbjct: 2 DPLAEVCYLDPEADAEGIREWELDFCSRPILDARGKKVWELVVCDATLSLQFTRFFPNTS 61
Query: 141 INSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWL 200
INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC+SLLLWL
Sbjct: 62 INSVTLRDALASVATSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRCVSLLLWL 121
Query: 201 EERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS----------- 249
EERYETVY+RHPGFQ G+KPLL LDNPFP LP+NLFGDKWAFVQLPFS
Sbjct: 122 EERYETVYSRHPGFQSGTKPLLTLDNPFPTSLPENLFGDKWAFVQLPFSAVREEVESLER 181
Query: 250 -------------------------------------AWMNGLEVCSIETDTARGSLILS 272
AWMNGLE+CS+E DT R +LILS
Sbjct: 182 RYAFGAGLDLDLLGFELDENTLIPGVAVESSRAKPLAAWMNGLEICSMEVDTGRANLILS 241
Query: 273 VGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFW
Sbjct: 242 AGVSTRYVYAGYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDGCVGFW 294
>gi|357150079|ref|XP_003575334.1| PREDICTED: uncharacterized protein LOC100846528 [Brachypodium
distachyon]
Length = 394
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 192/358 (53%), Positives = 233/358 (65%), Gaps = 65/358 (18%)
Query: 33 KPTNVSFNFLTNTPPRLQHFRPRP-----SVSESSLSVPKEAD-AEIEADDVED------ 80
KP++ SF+ P + P P S+S S + AD AE E D
Sbjct: 27 KPSSASFSARPYPHPHYRLAVPTPRRPCRSISSESPTASAAADTAEGEDDPAAATIEEEE 86
Query: 81 -----DPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKY 135
DP E+ YLD E D E I EWE+DFCSRPILD RGKK+WELVVCD +LSLQ+T++
Sbjct: 87 EEEELDPLAEVCYLDPEADAEGIREWEVDFCSRPILDARGKKVWELVVCDATLSLQFTRF 146
Query: 136 FPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLS 195
FPN INS+TL++A+ ++ LGVP+P++ RFFRSQMQTII++AC EL +K +PS+RC+S
Sbjct: 147 FPNTSINSVTLRDALASVSTSLGVPLPDRARFFRSQMQTIISRACNELGVKAVPSRRCVS 206
Query: 196 LLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF------- 248
LLLWLEERYETVY+RHPGFQ+G+KPLL LDNPF LPDNLFGDKWAFVQLPF
Sbjct: 207 LLLWLEERYETVYSRHPGFQQGTKPLLTLDNPFASNLPDNLFGDKWAFVQLPFADVREEV 266
Query: 249 -----------------------------------------SAWMNGLEVCSIETDTARG 267
+AWMNGLE+CS+E DT R
Sbjct: 267 ELLGRRYAFGAGLDLDLLGFELDETTLVPGVAVESSRARPLAAWMNGLEICSMEVDTDRA 326
Query: 268 SLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
+LILS G+STRY+YA Y+K+ TT EAEAWEAAKKACGGLHFLAIQE L+S+ CVGFW
Sbjct: 327 NLILSAGVSTRYVYAAYQKSAATTQEAEAWEAAKKACGGLHFLAIQENLNSDSCVGFW 384
>gi|242062284|ref|XP_002452431.1| hypothetical protein SORBIDRAFT_04g025690 [Sorghum bicolor]
gi|241932262|gb|EES05407.1| hypothetical protein SORBIDRAFT_04g025690 [Sorghum bicolor]
Length = 399
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 177/289 (61%), Positives = 211/289 (73%), Gaps = 48/289 (16%)
Query: 85 ELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI 144
E+ YLD + DPESI EWELDFCSRPILD RGKK+WELVVCD +LSLQ+T+YFPNN INS+
Sbjct: 101 EVCYLDPDADPESIREWELDFCSRPILDARGKKVWELVVCDATLSLQFTRYFPNNAINSV 160
Query: 145 TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
TL++A+ ++ + LGVP+P+++RFFRSQMQTIIT+AC EL +K +PS+RC+SLLLWLEERY
Sbjct: 161 TLRDALSSVSEALGVPMPDRVRFFRSQMQTIITRACGELGVKAVPSRRCVSLLLWLEERY 220
Query: 205 ETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS--------------- 249
E VY+RHPGFQ G++PLLALDNPFP LP+NLFGDKWAFVQLPFS
Sbjct: 221 EVVYSRHPGFQAGTRPLLALDNPFPTTLPENLFGDKWAFVQLPFSAVREEVESLGRRYAF 280
Query: 250 ---------------------------------AWMNGLEVCSIETDTARGSLILSVGIS 276
AWMNGLE+ ++E DT R SLILS G+S
Sbjct: 281 GAGLDLDLLGFELDDSTLVPGVAVESSRAKPLAAWMNGLEISAMEVDTGRASLILSAGVS 340
Query: 277 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFW 325
TRYIY+ Y+K P T EAEAWEAAKKA GGLHFLAIQE L+S+ CVGFW
Sbjct: 341 TRYIYSGYQKTPAATQEAEAWEAAKKASGGLHFLAIQENLNSDGCVGFW 389
>gi|168032007|ref|XP_001768511.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680224|gb|EDQ66662.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 338
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 162/295 (54%), Positives = 202/295 (68%), Gaps = 48/295 (16%)
Query: 89 LDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKE 148
L E++D +SI+EWELDFCSRPILD RGKK+WELVVCD LQ+T++FPNNVINS+TL++
Sbjct: 44 LAEDSDVDSISEWELDFCSRPILDARGKKLWELVVCDSRRQLQFTRFFPNNVINSVTLRD 103
Query: 149 AIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
A++ I D LGVP PEKIRFFRSQMQTIITKACKELDI+P+PS+RC++L+ WLEER+ETVY
Sbjct: 104 ALMYIMDTLGVPKPEKIRFFRSQMQTIITKACKELDIQPVPSQRCVALIKWLEERFETVY 163
Query: 209 TRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF-------------------- 248
++HPG+Q+G+ PLL P++LPD L G++WAFVQLPF
Sbjct: 164 SQHPGYQEGASPLLLQQQSLPLDLPDALRGEEWAFVQLPFEAVLEEMEGVVRGDVFGSVL 223
Query: 249 ----------------------------SAWMNGLEVCSIETDTARGSLILSVGISTRYI 280
+AW N LE+ +E DT R L+LS G++ R+
Sbjct: 224 DLGTLNIDLSGDIMIPGVAVASSRATPLAAWTNALELACLEVDTQRSCLVLSTGVADRWR 283
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
YA Y+K+ T +E EAWEAAKK CGGLHFLA+Q LDSE C GFWLLLD P PV
Sbjct: 284 YAFYRKSRQTDAEGEAWEAAKKKCGGLHFLAVQSSLDSELCTGFWLLLDTPISPV 338
>gi|168015159|ref|XP_001760118.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162688498|gb|EDQ74874.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 290
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 155/290 (53%), Positives = 196/290 (67%), Gaps = 46/290 (15%)
Query: 92 ETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIV 151
+ D +SI EWELDFCSRPILD RGKK+WELVVCD LQ+T++FPNNVINS+TL++A++
Sbjct: 1 DADVDSIYEWELDFCSRPILDSRGKKLWELVVCDSRRQLQFTRFFPNNVINSVTLRDALL 60
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
I D L VP PEKIRFFRSQMQTIITKACKELDI+P+PS+RC++L+ WLEER+ETVY++H
Sbjct: 61 YIMDTLQVPKPEKIRFFRSQMQTIITKACKELDIQPVPSQRCVTLIKWLEERFETVYSQH 120
Query: 212 PGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------- 246
PG+Q+G+ PLL P++LPD L G++WAF+ L
Sbjct: 121 PGYQEGASPLLLQQQSLPLDLPDALRGEEWAFLALAAVLEEMEGVSKGDVFGSVLDLDRL 180
Query: 247 ---------------------PFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
P +AW N LE+ S+E DT R L+LS G++ R+ YA Y+
Sbjct: 181 NIDLSPGIMIPGVAVASSRATPLAAWTNALELASLEVDTQRSCLVLSTGVADRWRYAFYR 240
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
K+ T +E EAWEAAK+ CGGLHFLA+Q LDSE C GFWLL+D P PV
Sbjct: 241 KSRQTDAEGEAWEAAKRKCGGLHFLAVQSSLDSELCTGFWLLIDTPISPV 290
>gi|413937738|gb|AFW72289.1| hypothetical protein ZEAMMB73_111177 [Zea mays]
Length = 320
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 135/200 (67%), Positives = 168/200 (84%), Gaps = 3/200 (1%)
Query: 69 ADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSL 128
AD E+EA++ + DP E+ YLD + DPESI EWELDFCSRPILD RGKK+WELVVCD +L
Sbjct: 77 ADEEVEAEN-KVDPQAEVCYLDPDVDPESIREWELDFCSRPILDARGKKVWELVVCDATL 135
Query: 129 SLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPI 188
SLQ+T+YFPNN INS+TL++A+ ++ + LGVP+P+++RFFRSQMQTIIT+AC +L +K +
Sbjct: 136 SLQFTRYFPNNAINSVTLRDALASVSEALGVPMPDRVRFFRSQMQTIITRACGDLGVKAV 195
Query: 189 PSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF 248
PS+RC+SLLLWLEERYE VY+RHPGFQ G++PLLALDNPFP LP+NLFGDKWAFVQLPF
Sbjct: 196 PSRRCVSLLLWLEERYEVVYSRHPGFQAGTRPLLALDNPFPTTLPENLFGDKWAFVQLPF 255
Query: 249 SAWMNGLEVCSIETDTARGS 268
SA EV S+E A G+
Sbjct: 256 SAVRE--EVESLERRYAFGA 273
>gi|116783338|gb|ABK22899.1| unknown [Picea sitchensis]
Length = 362
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 120/186 (64%), Positives = 152/186 (81%), Gaps = 2/186 (1%)
Query: 85 ELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI 144
E++ L + DPESITEWELDFCSRPILDIRGKKIWELVVCD +L++T+++PNNVINSI
Sbjct: 80 EVTKLAADIDPESITEWELDFCSRPILDIRGKKIWELVVCDSKRALEFTRFYPNNVINSI 139
Query: 145 TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
TLK+AI++I LGVP P+ IRFFRSQM+TI++KAC EL I+P+PSKRCLSL+ WLEERY
Sbjct: 140 TLKDAIMSIVQTLGVPKPQTIRFFRSQMKTIVSKACNELGIRPVPSKRCLSLIRWLEERY 199
Query: 205 ETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDT 264
E VY RHPGFQKG+K LL L+ P+ELPDNL G+KWAFVQLP + L + ++ ++
Sbjct: 200 EPVYMRHPGFQKGAKALLTLEQSSPLELPDNLCGEKWAFVQLPLAVVQEELAI--VQEES 257
Query: 265 ARGSLI 270
+ GS++
Sbjct: 258 SFGSVL 263
>gi|302763879|ref|XP_002965361.1| hypothetical protein SELMODRAFT_65804 [Selaginella moellendorffii]
gi|300167594|gb|EFJ34199.1| hypothetical protein SELMODRAFT_65804 [Selaginella moellendorffii]
Length = 290
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 124/290 (42%), Positives = 171/290 (58%), Gaps = 47/290 (16%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVA 152
D SI EW+LDFCSRPI D RGK++WEL++CD L++ +++P+NVINS TLK AI
Sbjct: 1 ADLASIVEWQLDFCSRPIFDDRGKRMWELIICDAKRQLEFARFYPSNVINSTTLKNAIAE 60
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ + +P P ++R+FRSQ++TII+KAC EL I+ S+RC +L+ WL+ERY+ VY +HP
Sbjct: 61 VIETFDLPRPTRVRYFRSQVKTIISKACGELGIQVTSSQRCTALVRWLQERYDQVYRQHP 120
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF------------------------ 248
GFQ+ + +L++ P E+P N G+KWAFVQL F
Sbjct: 121 GFQENAPSILSMGVSVPKEVPPNYRGEKWAFVQLSFQALQEEIKLVEKGSNFGEVSLDML 180
Query: 249 -----------------------SAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+AW N LE+ S+ D +L+LS G S ++ Y+ YK
Sbjct: 181 TELPSPDTLIPGVAVASSRDLALAAWTNSLELASLSVDKKNSALVLSSGASRQWFYSYYK 240
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
K+ EA+ WE+AKKA GGLHFLAIQ L+S C G W+L D P PPV
Sbjct: 241 KSKQADEEADLWESAKKAAGGLHFLAIQPSLESNSCSGLWILYDFPAPPV 290
>gi|302790880|ref|XP_002977207.1| hypothetical protein SELMODRAFT_55779 [Selaginella moellendorffii]
gi|300155183|gb|EFJ21816.1| hypothetical protein SELMODRAFT_55779 [Selaginella moellendorffii]
Length = 290
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 123/290 (42%), Positives = 169/290 (58%), Gaps = 47/290 (16%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVA 152
D SI EW+LDFCSRPI D RGK++WEL++CD L++ +++P+NVINS TLK AI
Sbjct: 1 ADLASIVEWQLDFCSRPIFDDRGKRMWELIICDAKRQLEFARFYPSNVINSTTLKNAIAE 60
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ + +P P ++R+FRSQ++TII+KAC EL I+ S+RC +L+ WL ERY+ VY +HP
Sbjct: 61 VIETFDLPRPTRVRYFRSQVKTIISKACGELGIQVTSSQRCTALVRWLHERYDQVYRQHP 120
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF------------------------ 248
GFQ+ + +L++ P E+P N G+KWAFVQL F
Sbjct: 121 GFQENAPSILSMGVNVPKEVPPNYRGEKWAFVQLSFQALQEEIKLVEKGSNFGEVSLDML 180
Query: 249 -----------------------SAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+AW N LE+ S+ D +L+L G S ++ Y+ YK
Sbjct: 181 TELPSPDTLIPGVAVASSRDLALAAWTNSLELASLSVDKKNSALVLLSGASRQWFYSYYK 240
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
K+ EA+ WE+AKKA GGLHFLAIQ L+S C G W+L D P PPV
Sbjct: 241 KSKQADEEADLWESAKKAAGGLHFLAIQPSLESNSCSGLWILYDFPAPPV 290
>gi|307108142|gb|EFN56383.1| hypothetical protein CHLNCDRAFT_35116 [Chlorella variabilis]
Length = 388
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 118/283 (41%), Positives = 160/283 (56%), Gaps = 48/283 (16%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDFCSRPILD RGKK+WEL++CD + +Y ++ PNN INS LK A+ AI G
Sbjct: 106 WELDFCSRPILDERGKKVWELIICDPQRTFEYAQFIPNNKINSSELKRALEAILAQPGAV 165
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P RFFR QMQTII++A +L I P+PS+RC +L+ WLE+R +VY HPG+ + +
Sbjct: 166 RPTTARFFRGQMQTIISRALSDLGITPMPSRRCFTLMNWLEDRMGSVYEAHPGYNEKAST 225
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQL---------------------------------- 246
L ++ P +LPD L G+KW+FVQL
Sbjct: 226 LFTVEMGAPEDLPDALRGEKWSFVQLPLATLQQELEAVAAGKAFGATLDLGAMRQQLAPD 285
Query: 247 --------------PFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTS 292
P +AW NGL++ ++ DT R LIL G + R+ Y Y++ TT+
Sbjct: 286 TLVPGVAVYSRRADPLAAWTNGLDLSAVVADTDRAFLILETGFNQRWRYGAYRRTLETTA 345
Query: 293 EAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
EA+AWE AK+A GGLHFL + + ++E+ G WLLLD PP V
Sbjct: 346 EAQAWEEAKQAVGGLHFLVVMSDEEAENSSGLWLLLDRKPPNV 388
>gi|159466814|ref|XP_001691593.1| PsaB RNA binding protein [Chlamydomonas reinhardtii]
gi|33235187|emb|CAE17328.1| Tab2 protein [Chlamydomonas reinhardtii]
gi|33235189|emb|CAE17329.1| Tab2 protein [Chlamydomonas reinhardtii]
gi|158278939|gb|EDP04701.1| PsaB RNA binding protein [Chlamydomonas reinhardtii]
Length = 358
Score = 236 bits (603), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 118/284 (41%), Positives = 159/284 (55%), Gaps = 49/284 (17%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WE+DFCSRP+LD RGKK+WEL++CD + +Y++YFPN+ INS LK I I G
Sbjct: 75 WEIDFCSRPLLDERGKKVWELLICDPERNFEYSEYFPNSKINSAELKRTIERILAQAGAE 134
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
PEK RFFRSQMQTIITKA + IK +PS+RC +++ W+ ER E+VY + P F ++
Sbjct: 135 RPEKARFFRSQMQTIITKALTDCQIKAVPSRRCFTVMSWINERLESVYKQDPRFSDKAQS 194
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQ----------------------------------- 245
L LD P LPD L G++WAFVQ
Sbjct: 195 LFQLDLGPPEALPDALRGEQWAFVQLPLGTLLQMLKRVDDAEIFGSGFTLGTVGLADLPA 254
Query: 246 --------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTT 291
LP +AW NGLE+ +++ D AR LIL G++ R+ Y +++ N +
Sbjct: 255 DILIPGVVVFSRRALPLAAWTNGLEIAAVKADVARSCLILETGVNQRWKYGSWRPNEDSI 314
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
EAE WE AK+ G+HFLA+Q + DSE+ G WLL D PP +
Sbjct: 315 GEAEGWEIAKQGVKGVHFLAVQPDPDSEELNGLWLLQDCEPPTI 358
>gi|302836193|ref|XP_002949657.1| hypothetical protein VOLCADRAFT_74347 [Volvox carteri f.
nagariensis]
gi|300265016|gb|EFJ49209.1| hypothetical protein VOLCADRAFT_74347 [Volvox carteri f.
nagariensis]
Length = 365
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 118/286 (41%), Positives = 158/286 (55%), Gaps = 49/286 (17%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
T WE+DFCSRP+LD RGKK+WEL++CD +Y++YFPN+ INS LK AI I G
Sbjct: 80 TVWEIDFCSRPLLDERGKKVWELLICDPERKFEYSEYFPNSKINSAELKRAIERILAQAG 139
Query: 159 VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGS 218
PEK RFFRSQMQTIITKA + IK +PS+RC +++ W+ ER ++VY P + +
Sbjct: 140 AQRPEKARFFRSQMQTIITKALTDCQIKAVPSRRCFTVMSWINERLDSVYKTDPRYSDKA 199
Query: 219 KPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------------- 245
+ L LD P LPD L G++WAFVQ
Sbjct: 200 QSLFQLDLGPPEALPDALRGEQWAFVQLPLGTLLQMLRKVEEGEIFGGTFSLGTAGLQDL 259
Query: 246 ----------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPV 289
LP +AW NGLE+ +++ D R LIL G++ R+ Y +++ N
Sbjct: 260 PMDILIPGVVVFSRRALPLAAWTNGLEIAAVKADVQRSCLILETGVNQRWKYGSWRPNED 319
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV 335
+ EAE WE AK+ GLHFLA+Q + DSE+ G WLL D PP +
Sbjct: 320 SIGEAEGWEIAKEGVKGLHFLAVQPDPDSEELNGLWLLQDCEPPSI 365
>gi|145350231|ref|XP_001419517.1| psaB translation factor [Ostreococcus lucimarinus CCE9901]
gi|144579749|gb|ABO97810.1| psaB translation factor [Ostreococcus lucimarinus CCE9901]
Length = 379
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 108/279 (38%), Positives = 158/279 (56%), Gaps = 46/279 (16%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
T+W++DFCSRP+ D RGKK+WEL+V D + + ++ +YFPNN INS+ L A+ + +
Sbjct: 99 TDWQIDFCSRPLRDDRGKKVWELLVTDDARTFEHAEYFPNNRINSVELARALERVMAEKK 158
Query: 159 VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGS 218
P + +FFR+QMQTII++AC E+D++P+ S+RC ++ WL ER E VY +HPG+ +
Sbjct: 159 EK-PRRFKFFRAQMQTIISRACNEVDVQPLASRRCQTMTKWLNERVENVYKKHPGYDASA 217
Query: 219 KPLLALDNPFPMELPDNLFGDKWAFVQLP------------------------------- 247
PL+A + P LPD L G+ WAFV LP
Sbjct: 218 PPLMAFEATAPKRLPDALRGESWAFVALPLVGVREEMEQVKRGRVFGATLEIDENLPDDT 277
Query: 248 --------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSE 293
+ W GLE+ I +DT S++L G++ + YA ++K+P T E
Sbjct: 278 LIPGIAVYTSRAAALAGWTKGLELACISSDTQTSSIVLETGVNDSWSYAFFRKSPELTKE 337
Query: 294 AEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPP 332
A+ WE K+AC GLHFLAIQ + ++E GFW+L D P
Sbjct: 338 AKEWEEVKRACNGLHFLAIQTDEEAEATDGFWILQDSDP 376
>gi|384248807|gb|EIE22290.1| PsaB RNA binding protein [Coccomyxa subellipsoidea C-169]
Length = 304
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 117/280 (41%), Positives = 154/280 (55%), Gaps = 49/280 (17%)
Query: 97 SITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
+ + WELDF SRPILD RGKK WEL++C S Y+K+FPNN INS LK A+ I +
Sbjct: 17 TFSTWELDFSSRPILDARGKKRWELLICSPDRSWVYSKWFPNNRINSTQLKAALQEIIEA 76
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
G P+ +RFFR QMQTII++A +LDIKP+PS+RC SL+ LEER ETVY R G+
Sbjct: 77 EGAVKPQTVRFFRGQMQTIISRALADLDIKPVPSRRCFSLIGLLEERLETVYKRAAGYSD 136
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------------ 246
+ L LD P +LPD L G+ W FVQL
Sbjct: 137 KATSLFTLDLGPPQDLPDALRGESWLFVQLPLGLLREELRAVDTRQTFGANFALASAGLA 196
Query: 247 -------------------PFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKN 287
P +AW +GLEV ++ D R L+L G++ R+ Y NY++
Sbjct: 197 DLPDDTPIPGVAVYSRRAVPLAAWTSGLEVANVAADADRACLVLETGVNQRWRYGNYQRT 256
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
P T++A AWEAAK A GLHFL +Q + +++ G WLL
Sbjct: 257 PENTADARAWEAAKIAARGLHFLVVQADEEADTSAGLWLL 296
>gi|357457971|ref|XP_003599266.1| Cc-nbs-lrr resistance protein [Medicago truncatula]
gi|355488314|gb|AES69517.1| Cc-nbs-lrr resistance protein [Medicago truncatula]
Length = 1528
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/206 (63%), Positives = 147/206 (71%), Gaps = 48/206 (23%)
Query: 172 MQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPME 231
MQTIITKACKEL I+ +PSKRCLSLLLWLEERYETVYT+HPGFQKGSKPLL LDNPF +
Sbjct: 2 MQTIITKACKELGIRALPSKRCLSLLLWLEERYETVYTKHPGFQKGSKPLLPLDNPFATK 61
Query: 232 LPDNLFGDKWAFVQLPFSA----------------------------------------- 250
LP++LFG++WAFVQLP+SA
Sbjct: 62 LPEDLFGERWAFVQLPYSAVRAEASASEERFGYGSGLDLDLLGIEIDEKTLIPGLAVASS 121
Query: 251 -------WMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKA 303
+MNGLE+CSIETDTAR +L LSVGISTRY+YA YKK+P +T EAEAWEAAKKA
Sbjct: 122 RAKILSAFMNGLELCSIETDTARSNLTLSVGISTRYVYATYKKSPTSTKEAEAWEAAKKA 181
Query: 304 CGGLHFLAIQEELDSEDCVGFWLLLD 329
GGLHFLAIQ+ELDSEDC+GFWLLLD
Sbjct: 182 SGGLHFLAIQDELDSEDCIGFWLLLD 207
>gi|412990938|emb|CCO18310.1| predicted protein [Bathycoccus prasinos]
Length = 393
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 108/279 (38%), Positives = 154/279 (55%), Gaps = 50/279 (17%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAIC----DD 156
W++DFCSRP+ D RGKK+WEL++ D + ++ ++FPNN INS+ L +A+ + ++
Sbjct: 111 WQIDFCSRPLKDDRGKKVWELLITDEDRTFEHAEFFPNNRINSVELSKALQKVVSKRTEE 170
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
G P +++FFRSQM TIIT+ACKE +++P+PS+RC ++L WLEER ETVY +HPG+
Sbjct: 171 TGEG-PRRVKFFRSQMMTIITRACKECELEPLPSRRCQTMLNWLEERMETVYKKHPGYDA 229
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------------- 247
S PL+ D P LPD L G+ WAFV LP
Sbjct: 230 NSAPLMTFDAQAPKPLPDALRGESWAFVALPLVGVKEEMESVARGKAFGDLLNIDPDLPD 289
Query: 248 ----------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTT 291
S W GLE+ +I D S++L G++ + YA +++
Sbjct: 290 DTLIPGVVVYTARAAALSGWTKGLELSAITVDLESSSIVLETGVNESWNYAFFRRTKELR 349
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
EA WE K+ GLHFLAIQ + DSE GFW+L D+
Sbjct: 350 EEAREWEGVKRQTKGLHFLAIQTDADSETTDGFWVLQDV 388
>gi|255088429|ref|XP_002506137.1| predicted protein [Micromonas sp. RCC299]
gi|226521408|gb|ACO67395.1| predicted protein [Micromonas sp. RCC299]
Length = 274
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 108/275 (39%), Positives = 149/275 (54%), Gaps = 47/275 (17%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+LDFCSRP+ D RGKK+WEL++CD + S +++++FPNN INS+ L +AI + G
Sbjct: 1 WQLDFCSRPMKDERGKKMWELLICDETRSFEHSEFFPNNRINSVELAKAIDRVFVARG-E 59
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P + +FFRSQMQTIIT+AC E+ + P+PS+RC ++ WL+ER ETVY HPG+ + P
Sbjct: 60 RPRRFKFFRSQMQTIITRACGEVGVNPLPSRRCQTMSRWLDERLETVYKTHPGYDGSAAP 119
Query: 221 LLALD-NPFPMELPDNLFGDKWAFVQLP-------------------------------- 247
+ + P LPD L G+ WAFV LP
Sbjct: 120 NMGFEGGGGPRPLPDALRGESWAFVALPLVGVREEAEQVRANRVFGDLLEIDPTLEDDTL 179
Query: 248 -------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEA 294
+ W GLE+ I D G+L+L G+S + YA +++ EA
Sbjct: 180 IPGIAVYTRRAAALAGWTKGLELGGISVDFDMGTLLLDTGVSDSWQYARFRQTKELMKEA 239
Query: 295 EAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
WE K A GLHFLAIQ + D+E GFW+L D
Sbjct: 240 REWEEVKAAVNGLHFLAIQTDEDAETTDGFWILQD 274
>gi|303274889|ref|XP_003056755.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461107|gb|EEH58400.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 270
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 102/264 (38%), Positives = 145/264 (54%), Gaps = 46/264 (17%)
Query: 112 DIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQ 171
D RGKK+WEL++CD S S Q+ ++FPNN INS+ L +AI + ++ G P++ +FFRSQ
Sbjct: 3 DERGKKMWELLICDESRSFQHAEFFPNNRINSVELSKAIQRVLNEQGAR-PKRFKFFRSQ 61
Query: 172 MQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPME 231
MQTIIT+AC ++ + P+PS+RC +L WL++R E VY +HPG+ S P++ + P
Sbjct: 62 MQTIITRACNDVGVPPLPSRRCQTLTRWLDQRAEEVYKKHPGYDGSSSPMMGFETSAPKP 121
Query: 232 LPDNLFGDKWAFVQLP-------------------------------------------- 247
LPD L G+ WAFV LP
Sbjct: 122 LPDALRGESWAFVALPLIGVKEEAMQVSANRVFGDLLDIDEALPDDTLVPGIAVYTRRAA 181
Query: 248 -FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGG 306
+ W GLE+ I D G+LIL G++ + YA +++ T EA+ WE K A GG
Sbjct: 182 ALAGWTKGLELGGISVDLDMGTLILDTGVADSWQYARFRQTKELTREAKEWEDVKAAAGG 241
Query: 307 LHFLAIQEELDSEDCVGFWLLLDL 330
LHFLAIQ + ++E GFW+L D
Sbjct: 242 LHFLAIQTDEEAESTDGFWILQDF 265
>gi|308807645|ref|XP_003081133.1| Tab2 protein (ISS) [Ostreococcus tauri]
gi|116059595|emb|CAL55302.1| Tab2 protein (ISS) [Ostreococcus tauri]
Length = 300
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/298 (37%), Positives = 161/298 (54%), Gaps = 39/298 (13%)
Query: 53 RPRPS-VSESSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPIL 111
R RP+ VS S S P A +P L L ++ W++DFCSRP+
Sbjct: 23 RERPAAVSPFSRSTPTSARRLHTRASATQEPAATLKKLTKD--------WQIDFCSRPLR 74
Query: 112 DIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQ 171
D RGKK+WEL+V D S ++ +YFPNN INS+ L A+ + G P + +FFR+Q
Sbjct: 75 DDRGKKVWELLVTDDERSFEHAEYFPNNRINSVELARALERVMASKGEK-PRRFKFFRAQ 133
Query: 172 MQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPME 231
MQTIIT+AC E+D++ + S+RC ++ WL+ER E+VY +HPG+ + PL+A + P
Sbjct: 134 MQTIITRACTEVDVEALASRRCQTMTNWLDERVESVYKKHPGYDANAPPLMAFEPTAPKR 193
Query: 232 LPDNLFGDKWAFVQLP----FSAWMN------------GLEVCSIETDTARGSLILSVGI 275
LPD L G+ WAFV LP F A ++ G+ V + + ++L G+
Sbjct: 194 LPDALRGESWAFVALPLVGVFGALLDIDENLPDDTLIPGIAVYTSRAAVSAAHIVLETGV 253
Query: 276 STRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+ + YA ++K P T E + WE K+ACGG+ GFW+L D PPP
Sbjct: 254 NDSWSYAFFRKTPELTKEPKEWEQVKRACGGV-------------TDGFWILRDAPPP 298
>gi|119510299|ref|ZP_01629435.1| hypothetical protein N9414_16117 [Nodularia spumigena CCY9414]
gi|119465043|gb|EAW45944.1| hypothetical protein N9414_16117 [Nodularia spumigena CCY9414]
Length = 287
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 148/282 (52%), Gaps = 54/282 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD + KK+WE++VC+ + +Y KY P+ +NS+ L+ A+
Sbjct: 5 WEIDFYSRPILDEKQKKVWEVLVCESPSDISTKPESLFRYAKYCPSTQVNSVWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P +IRFFR QM +ITKAC+++ I PS+R L L WL +R E VY + P
Sbjct: 65 AIDKAG-EAPIRIRFFRRQMSNMITKACQDVGIPAQPSRRILVLNQWLRQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+Q G+ P + LD+P P LPD L G +WAFV
Sbjct: 124 GYQGGTNPSVRLDSPLPQRLPDALEGKQWAFVSLQAAEFADMSEWDIGFGEAFPLELANV 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT +G L+L G + +I AN NP
Sbjct: 184 SPETRIPGVLIFSPRALPIAGWMSGLELACLNFDTKQGQRLVLETGATESWILANI-TNP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
T +EA+ +E AK+ G+HF+ +Q + +E GFWLL +L
Sbjct: 243 QTLAEAKGYEQAKEKANGVHFIGVQSDPQAESFTGFWLLQNL 284
>gi|427730243|ref|YP_007076480.1| hypothetical protein Nos7524_3080 [Nostoc sp. PCC 7524]
gi|427366162|gb|AFY48883.1| Protein of unknown function (DUF1092) [Nostoc sp. PCC 7524]
Length = 287
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 105/285 (36%), Positives = 149/285 (52%), Gaps = 54/285 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE++VC+ L ++ Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDENQKKVWEVLVCESPLDIRTNLDSLFRYAQYCPSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P KIRFFR QM +ITKAC++L I + S+R L L WLE+R VY + P
Sbjct: 65 AIDKAG-EAPIKIRFFRRQMNNMITKACQDLGIPALSSRRTLVLNQWLEQRMIEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+Q G+ P + L+NP P LPD L G KW FV
Sbjct: 124 GYQGGANPSVRLENPLPQRLPDALEGQKWVFVSLSAAELAEMPEWDIGFREAFPLELAQL 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + D ++G+ L+L G + +I AN KN
Sbjct: 184 SPETRIPGVLIFSPRALPVAGWMSGLELAFLRVDQSQGTRLVLETGTAESWILANI-KNS 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
T +EA+ +EAAK+ G+HF+ +Q + +E GFWLL ++ P
Sbjct: 243 TTLAEAQGFEAAKQNANGVHFIGVQSDPQAEAFAGFWLLQEVNLP 287
>gi|17232380|ref|NP_488928.1| hypothetical protein alr4888 [Nostoc sp. PCC 7120]
gi|17134025|dbj|BAB76587.1| alr4888 [Nostoc sp. PCC 7120]
Length = 286
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 145/282 (51%), Gaps = 54/282 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+V+C+ L ++ Y +Y P+ +NS L+ AI
Sbjct: 5 WELDFYSRPILDENQKKVWEVVICESPLDIRTKTDSLFRYAQYCPSTEVNSAWLRTAIQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +I KAC++ I +PS+R L+L WL++R E VY + P
Sbjct: 65 AISKAG-KAPIKIRFFRRQMNNMIVKACEDAGIPALPSRRTLALNQWLKQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+Q + P + LD+P P LPD L G +W FV
Sbjct: 124 GYQGVTTPSVRLDSPLPQRLPDALEGQQWVFVSLSAADLAEMPDWEIGFSEAFPLDFVQV 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDTARG-SLILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT++G L+L G + +I AN KNP
Sbjct: 184 SPETRIPGVLIFSPRALPIAGWMSGLELAFLRVDTSQGMRLVLETGATESWILANI-KNP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
T EA +E AK+ G+HF+ +Q ++E GFWLL +L
Sbjct: 243 TTVQEARGFEEAKQKANGVHFIGVQSNPEAESFAGFWLLQEL 284
>gi|75908378|ref|YP_322674.1| hypothetical protein Ava_2159 [Anabaena variabilis ATCC 29413]
gi|75702103|gb|ABA21779.1| Protein of unknown function DUF1092 [Anabaena variabilis ATCC
29413]
Length = 286
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/282 (36%), Positives = 146/282 (51%), Gaps = 54/282 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+V+C+ L ++ Y +Y P+ +NS+ L+ AI
Sbjct: 5 WELDFYSRPILDENQKKVWEVVICESPLDIRTKTDSLFRYAQYCPSTEVNSVWLRTAIQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +I KAC++ I + S+R L+L L++R E VY + P
Sbjct: 65 AISKAG-EAPIKIRFFRRQMNNMIVKACEDAGIPALASRRTLALNQLLKQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+Q G+ P + LD+P P LPD L G +W FV
Sbjct: 124 GYQGGTTPSVRLDSPLPQRLPDALEGQQWVFVSLSAADLAEMPEWEIGFSEAFPLDFVQV 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT++G+ L+L G + +I AN KNP
Sbjct: 184 SPESRIPGVLIFSPRALPIAGWMSGLELAFLRVDTSQGTRLVLETGATESWILANI-KNP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
T EA +E AK+ G+HF+ +Q ++E GFWLL ++
Sbjct: 243 TTLQEARGFEEAKQKANGVHFIGVQSNPEAESFAGFWLLQEV 284
>gi|427718386|ref|YP_007066380.1| hypothetical protein Cal7507_3136 [Calothrix sp. PCC 7507]
gi|427350822|gb|AFY33546.1| protein of unknown function DUF1092 [Calothrix sp. PCC 7507]
Length = 287
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 149/285 (52%), Gaps = 54/285 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD + KK+WE++VC+ ++ Y +Y + +NS L+ A+
Sbjct: 5 WELDFYSRPILDEKQKKVWEVLVCESPSDIRTKTDSLFRYAQYCSSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +ITKAC+++ I PS+R L L WL++R E VY + P
Sbjct: 65 AITTAG-EAPIKIRFFRRQMNNMITKACEDVGIPAQPSRRTLVLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q G+ + L+ P P LPD L G +WAFV
Sbjct: 124 GYQGGANASVRLERPLPQRLPDALEGQQWAFVTLEAGDFAEMPEWEIGFGESFPLDFAKI 183
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT++G+ L+L G++ +I AN KNP
Sbjct: 184 TPETRIPGVLIFSPRALPLAGWMSGLELAFLRFDTSQGARLLLETGVTESWILANI-KNP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
T SEAE +EA+K+ G+HF+ +Q ++ GFWLL ++ P
Sbjct: 243 QTLSEAEGFEASKQKANGVHFIGVQSNPQAQSFAGFWLLQEVNLP 287
>gi|414077821|ref|YP_006997139.1| hypothetical protein ANA_C12609 [Anabaena sp. 90]
gi|413971237|gb|AFW95326.1| hypothetical protein ANA_C12609 [Anabaena sp. 90]
Length = 291
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 99/285 (34%), Positives = 149/285 (52%), Gaps = 54/285 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ + + +Y KY P+ +NS L+ AI
Sbjct: 9 WELDFYSRPILDENQKKVWEMLVCESPVDIGTQTDSLFRYAKYCPSTQVNSGWLRTAIQE 68
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
++ G P KIRFFR QM +ITK+C+++ + +PS+R L L W+++R + VY + P
Sbjct: 69 AIEEAGAS-PTKIRFFRRQMNNMITKSCEDVGVPAVPSRRTLVLNQWIQQRMKEVYPQEP 127
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q + P + LD P P LPD L G +WAFV
Sbjct: 128 GYQGVANPSVRLDKPLPQRLPDALEGKQWAFVTLEASDLAQMPDWEIGFGEAFPLELAEL 187
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT +G+ LIL G + ++ AN + P
Sbjct: 188 RPETRIPGILIFSPRALPIAGWMSGLEMAYLHFDTKQGNRLILETGATESWVVANI-RTP 246
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+EA+ + AK+ G+HF+ +Q + S+D GFWLL ++ P
Sbjct: 247 ELLAEAQGFTVAKEQANGVHFIGVQSDPQSQDFAGFWLLQEINLP 291
>gi|427705901|ref|YP_007048278.1| hypothetical protein Nos7107_0455 [Nostoc sp. PCC 7107]
gi|427358406|gb|AFY41128.1| protein of unknown function DUF1092 [Nostoc sp. PCC 7107]
Length = 287
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/282 (36%), Positives = 144/282 (51%), Gaps = 54/282 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE+VVC+ L ++ Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDENQKKVWEVVVCESPLDIRAQTDSLFRYAQYCPSTEVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P K+RFFR QM +ITKAC++L I PS+R L L WL++R E VY + P
Sbjct: 65 AIDKAG-EAPIKVRFFRRQMNNMITKACQDLGIPAQPSRRTLLLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+Q G+ P + LD+P P LPD L G +W FV
Sbjct: 124 GYQGGNNPSVRLDSPLPQRLPDALEGQQWVFVSLSAGELAEMPEWDIGFGEAFPLEMAQL 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + D + G+ LIL G + +I AN KNP
Sbjct: 184 SPEARIPGVLIFSPRALPLAGWMSGLELAFLRVDQSVGTRLILETGATESWIVANI-KNP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
EA+ + +K+ G+HF+ +Q +E GFWLL ++
Sbjct: 243 QLLVEAKGFAESKQQANGVHFIGVQSSPQAESFAGFWLLQEV 284
>gi|434394708|ref|YP_007129655.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
gi|428266549|gb|AFZ32495.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
Length = 309
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 103/280 (36%), Positives = 148/280 (52%), Gaps = 56/280 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS---------LQYTKYFPNNVINSITLKEAIV 151
WE+DF SRPILD KKIWE++VC+ SL+ ++ KY P+ +NS+ L+ A+
Sbjct: 27 WEIDFYSRPILDENQKKIWEVLVCE-SLTDIRTKPDSLFRFAKYCPSTQVNSVWLRTALE 85
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
GV P K RFFR QM +ITKAC++L I PS+R L+L WL++R E VY
Sbjct: 86 EAIAAAGVS-PVKFRFFRRQMNNMITKACEDLGIPAQPSRRTLALNQWLQQRMEEVYPHE 144
Query: 212 PGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------- 246
PG+Q + P + ++ P P LPD L G +WAFV L
Sbjct: 145 PGYQATTNPSVRMEVPLPQRLPDALIGQQWAFVTLEAAAFADMPEWEIGFGEAFPLEIAG 204
Query: 247 ------------------PFSAWMNGLEVCSIETD-TARGSLILSVGISTRYIYANYKKN 287
P + WM+GLE+ +I+ D T L+L G++ +I A++ K+
Sbjct: 205 VKPETKIPGVIVLSPRAMPLAGWMSGLELANIKFDSTETPQLLLETGVTESWILASF-KD 263
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
P +EA+ +E+AK+ G+HFLA+Q + E GFWLL
Sbjct: 264 PQMIAEAKGFESAKQQANGVHFLAVQANPEVEAFAGFWLL 303
>gi|186682051|ref|YP_001865247.1| hypothetical protein Npun_R1620 [Nostoc punctiforme PCC 73102]
gi|186464503|gb|ACC80304.1| protein of unknown function DUF1092 [Nostoc punctiforme PCC 73102]
Length = 286
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 102/282 (36%), Positives = 145/282 (51%), Gaps = 54/282 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KKIWE++VC+ L + +Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDDNQKKIWEVLVCESPLDIGTKPDSLFRYAQYCPSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +ITKAC+++ I PS+R L L WLEER + VY + P
Sbjct: 65 AITQAG-KAPIKIRFFRRQMNNMITKACQDVGIPAQPSRRTLVLNQWLEERMKEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q G+ P + L+ P P LPD L G +W FV
Sbjct: 124 GYQGGTNPSVRLEKPLPQRLPDALEGQQWVFVTLDAADLAEMPEWEIGFGEAFPLELAKV 183
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT+ L+L G++ +I AN KK P
Sbjct: 184 SPEARIPGILIFSPRALPLAGWMSGLELAFLRFDTSEEARLLLETGVNESWIVANIKK-P 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+EA+ +E AK+ G+HF+ IQ + ++ GFWLL ++
Sbjct: 243 QVLAEAKGFEEAKQKANGVHFIGIQSDPKAQSFAGFWLLQEV 284
>gi|428207859|ref|YP_007092212.1| hypothetical protein Chro_2874 [Chroococcidiopsis thermalis PCC
7203]
gi|428009780|gb|AFY88343.1| protein of unknown function DUF1092 [Chroococcidiopsis thermalis
PCC 7203]
Length = 287
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 104/285 (36%), Positives = 145/285 (50%), Gaps = 54/285 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE+VVC+ L +Y +Y P+ +NS L+ A+
Sbjct: 5 WEIDFYSRPILDENQKKVWEVVVCESPLDTRTDPTRLFRYAQYCPSTQVNSAWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P K RFFR QM +ITKACK+L I PS+R L+LL L+ER + VY + P
Sbjct: 65 AMAKAGT-APTKFRFFRRQMNNMITKACKDLGIPAQPSRRTLALLQLLKERMDEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+Q P + +++ P LPD L G +WAFV
Sbjct: 124 GYQPTPNPSVKMESSPPQRLPDALTGQQWAFVNLEATALADMDEWEIAFGEAFPLQMVGL 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ I +T+ L+L G S +I AN KNP
Sbjct: 184 SPETTIPGLLIFSERALPLAGWMSGLELAFIRVETSPVARLLLETGASESWILANL-KNP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
T +EA+A+ +AK+ G+HF+A+Q +E GFWLL ++ P
Sbjct: 243 QTVAEAQAFVSAKQQANGVHFIAVQSNPQTESFAGFWLLQEVSIP 287
>gi|300863927|ref|ZP_07108842.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300338047|emb|CBN53988.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 286
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 145/284 (51%), Gaps = 54/284 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELD+ SRPI+D + KK+WE+++C+ L++ +Y+++ P++ +NS+ L AI
Sbjct: 3 TIWELDYYSRPIVDEQQKKLWEVLICESPLNVGDKSESLFRYSQFCPSSTVNSLWLAAAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P PEKIRFFR QM +I KAC+EL I PS+R +L WL ER E VY
Sbjct: 63 KEAIASSPSP-PEKIRFFRRQMTNMIVKACEELHIPAAPSRRTYALQQWLRERMEDVYPT 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------- 247
HPGFQ G P + + P LP+ L G+KW+FV LP
Sbjct: 122 HPGFQSGLTPSVQYSSEIPQALPEALLGEKWSFVTLPVEAFEEMSEWEIEFGEAFGLEAF 181
Query: 248 --------------------FSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKK 286
+AWM+GLE+ + D L+L G S R+I AN +
Sbjct: 182 GLKPQTPIPGLIIFSSRATALAAWMSGLELAFVTFDGGPPARLVLETGASDRWILANLRD 241
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ +E + +E+AK A +HFLAIQ +SE GFWLL +
Sbjct: 242 LSI-VAEVKGFESAKVAANQVHFLAIQSHPESESFAGFWLLQEF 284
>gi|434404855|ref|YP_007147740.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
gi|428259110|gb|AFZ25060.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
Length = 287
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 98/285 (34%), Positives = 147/285 (51%), Gaps = 54/285 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRPILD KK+WE++VC+ + +Y +Y P+ +NS L+ A+
Sbjct: 5 WEVDFYSRPILDENQKKVWEVLVCETPSGIGTNIDSLFRYAQYCPSTQVNSGWLRTALQQ 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P K+RFFR QM +ITKAC+++ + +PS+R L L WL++R E VY + P
Sbjct: 65 AINKAG-EAPIKVRFFRRQMNNMITKACEDVGVPALPSRRTLFLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q G+ + LD P P LPD L G +WAFV
Sbjct: 124 GYQGGANASVRLDRPLPQRLPDALEGKQWAFVTLEAQDFADMPEWEIGFGEAFPLELAKL 183
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ ++ DT+ G LIL G + +I AN + P
Sbjct: 184 SPEARIPGILIFSPRALPLAGWMSGLELAYLKFDTSLGERLILETGATESWIVANI-RTP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
EA+ +E+ K+A G+HF+ +Q + ++ GFWLL ++ P
Sbjct: 243 QLLVEAKGFESTKQAANGVHFIGVQSDAQAQSFAGFWLLQEINLP 287
>gi|334120908|ref|ZP_08494985.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
gi|333455907|gb|EGK84547.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
Length = 286
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 110/288 (38%), Positives = 150/288 (52%), Gaps = 62/288 (21%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSI----TL 146
T WELDF SRPILD R KK WE+++C+ L++ +Y+++ ++ +NS+ L
Sbjct: 3 TIWELDFYSRPILDEREKKKWEVLICESPLNVGDKAESLFRYSQFCSSSTVNSLWLAGAL 62
Query: 147 KEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYET 206
KEAI A PEKIRFFR QM +ITKAC++LDI S+R L+L LWLEER +
Sbjct: 63 KEAIAAAPKR-----PEKIRFFRRQMANMITKACEDLDIPAACSRRTLALSLWLEERMQD 117
Query: 207 VYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSA---------------- 250
VY PG+Q P + P+ LPD L G+KW FV LP +A
Sbjct: 118 VYPAEPGYQAVVNPSVQFVPETPVALPDALIGEKWTFVSLPIAAFDEMSEWDIGFGEAFG 177
Query: 251 ---------------------------WMNGLEVCSIETDTA-RGSLILSVGISTRYIYA 282
WM+GLE+ ++ ++ L+L G + R+I A
Sbjct: 178 LPMTRLAPETQIPGLIIYSSRATALAGWMSGLELAFLKFESGPPARLVLDTGANDRWILA 237
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
N ++ T EA+ +EAAKK +HFLAIQ DSE GFWLL +L
Sbjct: 238 NL-RDAATEREAKGFEAAKKQAKQVHFLAIQSNPDSESFAGFWLLHEL 284
>gi|428311384|ref|YP_007122361.1| hypothetical protein Mic7113_3216 [Microcoleus sp. PCC 7113]
gi|428252996|gb|AFZ18955.1| Protein of unknown function (DUF1092) [Microcoleus sp. PCC 7113]
Length = 287
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 109/285 (38%), Positives = 144/285 (50%), Gaps = 58/285 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIV- 151
WELDF SRPILD KKIWE++VC+ L QYT++ P+ +NSI L+EA+
Sbjct: 5 WELDFYSRPILDENQKKIWEILVCESPLDTRQSPDELFQYTQFCPSQQVNSIWLREALAE 64
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
AI P EKIRFFR QM +ITKAC+EL I+ IPS+R +L WLE+R Y +H
Sbjct: 65 AIAQSKQTP--EKIRFFRRQMTNMITKACEELGIQVIPSRRTYTLERWLEQRILGFYPKH 122
Query: 212 PGFQ--KGSKPLLALDNPFPMELPDNLFGDKWAFVQL----------------------- 246
PG++ + + P LPD L DKWAFV L
Sbjct: 123 PGYKPTAAASSFVQYQPQIPQPLPDALEYDKWAFVTLEAGAFEEMNEWDIGFSEAFPLSM 182
Query: 247 --------------------PFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYK 285
P + WM+GLE+ + D+A + L+L G S +I A K
Sbjct: 183 MGLAPDTPIPGIIIFSSRATPLAGWMSGLELAFVRFDSAESARLLLETGASDSWILATLK 242
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ T +EA+ +E K+ G+HFLAIQ SE GFWLL +L
Sbjct: 243 DSQ-TLAEAQGFELTKQNAEGVHFLAIQSTPTSESFAGFWLLQEL 286
>gi|354566488|ref|ZP_08985660.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
gi|353545504|gb|EHC14955.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
Length = 288
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 100/283 (35%), Positives = 141/283 (49%), Gaps = 55/283 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ L +Y +Y P+ +NS+ L+ A+
Sbjct: 5 WELDFYSRPILDENQKKVWEVLVCESPLDTRTKVDSLFRYAQYCPSTQVNSVWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
D G P KIRFFR QM +ITKAC ++ I PS+R L L WL++R E VY + P
Sbjct: 65 AIDKAG-EAPIKIRFFRRQMNNMITKACGDIGIPAQPSRRTLVLNQWLQQRIEQVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q G P + L+ P P LPD L +W FV
Sbjct: 124 GYQGGVNPSVRLEAPLPQRLPDALEWQQWGFVTLLGSEFADMPDWEIDFGEGFPLELAQV 183
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTA--RGSLILSVGISTRYIYANYKKN 287
LP + WM+GL++ + D + G L+L G + +I AN KN
Sbjct: 184 SPETSIPGILIFSPRALPLAGWMSGLDLAWLRFDDSPQGGRLLLETGATESWILANL-KN 242
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
P +EA +E AK+ G+HF+ +Q + S+ GFWLL ++
Sbjct: 243 PQILAEARNFEQAKQQANGVHFIGVQSDPQSQSFAGFWLLCEI 285
>gi|298490971|ref|YP_003721148.1| hypothetical protein Aazo_1936 ['Nostoc azollae' 0708]
gi|298232889|gb|ADI64025.1| protein of unknown function DUF1092 ['Nostoc azollae' 0708]
Length = 286
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/282 (34%), Positives = 143/282 (50%), Gaps = 54/282 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ + ++ Y +Y P+ +NS+ L+ A+
Sbjct: 5 WELDFYSRPILDANQKKVWEILVCESPVDVRTKTDSLFRYAQYCPSTQVNSVWLRTALEE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +ITKAC++ I +PS+R L L WL++R E VY +
Sbjct: 65 AINKAG-EAPIKIRFFRRQMNNMITKACQDAGIPALPSRRALVLNQWLQQRMEEVYPQEL 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q + P + LD P P LPD L G +WAFV
Sbjct: 124 GYQGEANPSVRLDRPLPQRLPDALEGKQWAFVTLEAKDFVDMPDWEIAFGEAFPLELAQL 183
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTARG-SLILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT++G LIL G + ++ AN + P
Sbjct: 184 SPEIRIPGILIFSPRALPIAGWMSGLEMAYLRFDTSQGDRLILETGATESWVLANI-RTP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
EA+ +E K+ G+HF+ +Q + + GFWLL ++
Sbjct: 243 QLLKEAQGFEETKQKANGVHFIGVQSDPQVQSFSGFWLLQEV 284
>gi|428301149|ref|YP_007139455.1| hypothetical protein Cal6303_4583 [Calothrix sp. PCC 6303]
gi|428237693|gb|AFZ03483.1| protein of unknown function DUF1092 [Calothrix sp. PCC 6303]
Length = 287
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 104/288 (36%), Positives = 147/288 (51%), Gaps = 56/288 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG---------SLSLQYTKYFPNNVINSITLKEA 149
T WELDF SRPILD KK+WEL++C+ SL +Y +Y P+ +NS L+ A
Sbjct: 3 TTWELDFYSRPILDENQKKVWELLLCESPKDSRTKVDSL-FRYAQYCPSTEVNSAWLRTA 61
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I G P +IRFFR QM +ITKAC++ I S+R L L WL++R + VY
Sbjct: 62 IQEAISKAG-EAPTRIRFFRRQMNNMITKACQDSGIPAQSSRRILVLHQWLQQRMDEVYP 120
Query: 210 RHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ------------------------ 245
+ PG+Q GS P + LD P P LPD L + WAFV+
Sbjct: 121 QEPGYQGGSNPSVRLDAPVPQRLPDALELENWAFVRLTAKDFLDMPEWEIGFGEGFPLEL 180
Query: 246 -------------------LPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYK 285
LP +AWM+GLE+ ++ D + G L+L G + +I AN +
Sbjct: 181 AQISDDTPISGVLIFSSRSLPLAAWMSGLELGYLKFDQSEGGRLLLETGATESWIVANIR 240
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+ V +EA+ +E AK++ G+HF+ +Q SE GFWLL ++ P
Sbjct: 241 NSQV-INEAKNFEVAKQSANGVHFIGVQANPQSESFAGFWLLQEVTLP 287
>gi|440681970|ref|YP_007156765.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
gi|428679089|gb|AFZ57855.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
Length = 287
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 143/285 (50%), Gaps = 54/285 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE++VC+ ++ Y +Y P+ +NS L+ A+
Sbjct: 5 WELDFYSRPILDENQKKVWEVLVCESPSDVRTKTDSLFRYAQYCPSTQVNSGWLRTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +ITKAC+ + I I S+R L L WL++R E VY + P
Sbjct: 65 AIEKAG-EAPIKIRFFRRQMNNMITKACEGVGIPAISSRRTLFLNQWLQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q + P + LD P P LPD L G +WAFV
Sbjct: 124 GYQGIANPSVRLDKPLPQRLPDALEGKQWAFVTLDAGDFAEMPEWEIGFGEAFPLELAKL 183
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTARG-SLILSVGISTRYIYANYKKNP 288
LP + WM+GLE+ + DT +G LIL G + +I AN K P
Sbjct: 184 SPEARIPGILIFSPRALPLAGWMSGLEMAYLHFDTKQGDRLILETGATESWIVANI-KTP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+EA+ + AK+ G+HF+ +Q + ++ GFWLL ++ P
Sbjct: 243 QLLAEAQGFAQAKEKANGVHFIGVQSDPQAQSFAGFWLLQEVNLP 287
>gi|428319661|ref|YP_007117543.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
gi|428243341|gb|AFZ09127.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
Length = 286
Score = 171 bits (432), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 107/288 (37%), Positives = 150/288 (52%), Gaps = 62/288 (21%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSI----TL 146
T WELDF SRPI+D R KK WE+++C+ L++ +Y+++ ++ +NS+ +
Sbjct: 3 TIWELDFYSRPIIDEREKKKWEVLICESPLNVGDKAESLFRYSQFCSSSTVNSLWLAGAI 62
Query: 147 KEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYET 206
K+AI A PEKIRFFR QM +ITKAC+ELDI S+R L+L LWLEER +
Sbjct: 63 KDAIAAAPKR-----PEKIRFFRRQMANMITKACEELDIPAACSRRTLALSLWLEERMQD 117
Query: 207 VYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSA---------------- 250
VY PG+Q P + P+ LPD L G+KWAFV LP +A
Sbjct: 118 VYPAEPGYQPVVNPSVQFIPETPVALPDALIGEKWAFVSLPIAAFDEMSEWDIGFGEAFG 177
Query: 251 ---------------------------WMNGLEVCSIETDTA-RGSLILSVGISTRYIYA 282
WM+GLE+ ++ ++ L+L G + R+I A
Sbjct: 178 LPMTALGPKTQIPGLIIYSSRATALAGWMSGLELAFLKFESGPPARLVLDTGANDRWILA 237
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
N ++ T EA+ +EAAK +HFLAIQ +SE GFWLL +L
Sbjct: 238 NL-RDAATEREAKGFEAAKNQAKKVHFLAIQSNPESESFAGFWLLHEL 284
>gi|282901672|ref|ZP_06309588.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
gi|281193435|gb|EFA68416.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
Length = 289
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 147/285 (51%), Gaps = 57/285 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++C+ + +Y +Y P+ +NS+ L++A+
Sbjct: 5 WELDFYSRPILDANQKKVWEVLICESPTDVLTKVDSLFRYAQYCPSTQVNSVWLRQALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ GV P KIRFFR QM +ITKAC+++ I +PS++ L L W+++R E VY + P
Sbjct: 65 AIEKAGVA-PIKIRFFRRQMNNMITKACQDMGIPALPSRKTLVLNQWIQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+++ + + L+ P P LPD L G +W FV
Sbjct: 124 GYEQVTNSSVRLERPLPQRLPDALEGKQWTFVSLGASDITDMPEWEIAFGEAFPLELAGL 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDTARGS----LILSVGISTRYIYANYK 285
LP + WM+GLE+ + D+ R + L+L G + +I AN
Sbjct: 184 SPEIPIPGILIFSPRALPIAGWMSGLELAYLRLDSNRNNQGDRLVLETGGTESWILANL- 242
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ P +EA+ +E AK+ G+HF+ +Q + S+ GFWLL ++
Sbjct: 243 RTPQLLAEAKGFEEAKQKADGVHFIGVQSDPQSQSFAGFWLLKEI 287
>gi|427740039|ref|YP_007059583.1| hypothetical protein Riv7116_6716 [Rivularia sp. PCC 7116]
gi|427375080|gb|AFY59036.1| Protein of unknown function (DUF1092) [Rivularia sp. PCC 7116]
Length = 284
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 98/282 (34%), Positives = 146/282 (51%), Gaps = 54/282 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELD+ SRPILD KK+WE+++C+ L + +Y KY + +NS+ L+ A+
Sbjct: 5 WELDYYSRPILDENKKKVWEVLICETPLDISSKTDSLFRYAKYCSSATVNSVWLQTALQE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P KIRFFR QM +ITKAC+E+ I S+R L+L WL++R + VY +
Sbjct: 65 AIGKAG-EAPVKIRFFRRQMNNMITKACEEIGIPAQTSRRTLALNQWLQQRMDEVYPQEA 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFV---------------------------- 244
G+Q G+ P + L++P P LPD L G++ FV
Sbjct: 124 GYQGGTNPSVRLESPLPQRLPDALEGEQLQFVTLSAADFADMPEWNIDFGEAFPLDLAGI 183
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNP 288
LP +AWM+GLE+ + D+++ G L+L G + +I AN KNP
Sbjct: 184 SSENKIPGVLIFSNRALPIAAWMSGLELAWLRFDSSKTGRLLLETGATESWILANI-KNP 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
EA+ +E AK+ G+HF+ +Q + SE GFWLL ++
Sbjct: 243 QMLLEAQNFEQAKQKANGVHFIGVQSDPTSESFAGFWLLREI 284
>gi|56750056|ref|YP_170757.1| hypothetical protein syc0047_c [Synechococcus elongatus PCC 6301]
gi|81300399|ref|YP_400607.1| hypothetical protein Synpcc7942_1590 [Synechococcus elongatus PCC
7942]
gi|7328458|dbj|BAA92865.1| ORF285 [Synechococcus elongatus PCC 6301]
gi|22002499|gb|AAM82651.1| unknown [Synechococcus elongatus PCC 7942]
gi|56685015|dbj|BAD78237.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169280|gb|ABB57620.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 285
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/281 (34%), Positives = 143/281 (50%), Gaps = 52/281 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDG-------SLSLQYTKYFPNNVINSITLKEAIVAI 153
WELDF SRPILD GKK+WE+ + + +++ +Y + + +NS+TL++A+ +
Sbjct: 5 WELDFYSRPILDEAGKKLWEVAIAETVTTVEAPAVTFRYADFVTGDQVNSVTLQDALKSA 64
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
+ G P P++IR+FR M +I KAC +L + S+R +SL WLEER + VY HPG
Sbjct: 65 IAEAGTP-PDRIRYFRRPMNNMIRKACTDLGLPCQLSRRTVSLHNWLEERRQQVYATHPG 123
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPF------------------------- 248
+ G + + + P LPD L GD+WAFV LPF
Sbjct: 124 YNPGPVAGVQMPDEAPQPLPDALRGDRWAFVDLPFAALAEHGEWGIDFGEAFPLAGIDLP 183
Query: 249 ------------------SAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVT 290
+AW++GLE + D+ L+L G S R+ A P
Sbjct: 184 DETPIPGLIIFASRAMPIAAWLSGLEPAWLTYDSPAKQLLLETGGSERWTLAALNV-PAL 242
Query: 291 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
EA + AAK+A GLHFLA+Q + +S+ GFWLL +LP
Sbjct: 243 QQEATQFNAAKQAAKGLHFLAVQVDPNSDRFAGFWLLRELP 283
>gi|411119159|ref|ZP_11391539.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711022|gb|EKQ68529.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
Length = 288
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 101/282 (35%), Positives = 144/282 (51%), Gaps = 55/282 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRPILD GKK+WE+V+C+ ++ + +Y + +NS L +A+
Sbjct: 3 TIWELDFYSRPILDEHGKKVWEVVLCESPTQIKAEPDRLFRFAEYCASTEVNSERLVQAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P +IRFFR M+ +ITKAC +L++ + S+R +L WL++R+ Y +
Sbjct: 63 QTAIAQAPSP-PSRIRFFRQAMKNMITKACNDLNLPSVLSRRTYALNQWLQQRFAEEYPK 121
Query: 211 HPGFQKGSKPLLALDNPFPME-LPDNLFGDKWAFVQL----------------------- 246
HPGFQ GS P ++ ++ LPD L G KWAFV L
Sbjct: 122 HPGFQAGSNPSVSFAATTAVQSLPDALIGQKWAFVSLEAGMLEEMDEWAIDFGEAFPLSL 181
Query: 247 --------------------PFSAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYANYK 285
P + WM+GLE+ S++ DT + L+L G S R+I A+
Sbjct: 182 VNLSPDAIVPGVIIFSPRAVPMAGWMSGLELGSLKLDTESTPRLLLETGGSDRWILASLN 241
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
V T EA+ +E AK+ G+HFLAIQ D+E GFWLL
Sbjct: 242 NAQVQT-EAQNFETAKQKANGVHFLAIQAAPDTETFAGFWLL 282
>gi|428211001|ref|YP_007084145.1| hypothetical protein Oscil6304_0478 [Oscillatoria acuminata PCC
6304]
gi|427999382|gb|AFY80225.1| Protein of unknown function (DUF1092) [Oscillatoria acuminata PCC
6304]
Length = 293
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/288 (35%), Positives = 144/288 (50%), Gaps = 60/288 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WELDF S+PILD GKK WE+++C+ L+++KY ++ +NSI L AI
Sbjct: 5 WELDFYSKPILDENGKKRWEVLICESPTDICSTTDELLRFSKYCSSSEVNSIWLGNAINE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
G P +IRFFR QM +ITKACK+L I PS+R ++L WL++R +TVY P
Sbjct: 65 AIATAGKS-PTQIRFFRRQMNNMITKACKDLGINSKPSRRTVALYRWLQDRMDTVYPLEP 123
Query: 213 GFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------- 246
GFQ G P + + P P LPD L GD+WAFV L
Sbjct: 124 GFQGAGLNPSVQFETPKPERLPDALQGDRWAFVSLEAGSFAEMSEWEIDFSEAFPILGEK 183
Query: 247 -----------------------PFSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYA 282
+AWM+GLE+ ++ + G ++L G + R+I A
Sbjct: 184 SLVPQITPDTIIPGMIVFSNRAKAIAAWMSGLELGFLKPELEEPGQVVLETGFNERWILA 243
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
N + T +EA+ + K G+HFLAIQ + +SE GFWLL +L
Sbjct: 244 NL-TDKTTRAEAQGFAETKDKAQGVHFLAIQTDPNSESFAGFWLLQEL 290
>gi|428304460|ref|YP_007141285.1| hypothetical protein Cri9333_0858 [Crinalium epipsammum PCC 9333]
gi|428245995|gb|AFZ11775.1| protein of unknown function DUF1092 [Crinalium epipsammum PCC 9333]
Length = 287
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 139/284 (48%), Gaps = 54/284 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAI 150
T WELDF SRPI+D KKIWE++VC+ + +Y +Y P+ +NS++L+ A+
Sbjct: 3 TIWELDFYSRPIIDENQKKIWEVLVCESPVDTRQSVESLFRYAQYCPSTQVNSVSLQNAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +I KAC +L I PS+R ++ WL ER + VY
Sbjct: 63 TEAIEKSGQS-PQKIRFFRRQMNNMIVKACTDLGILAEPSRRTYAVHQWLRERMQDVYPS 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------ 246
HP +Q + P + + P LPD L G KW FV L
Sbjct: 122 HPNYQPSNSPSVQFEVQPPQPLPDALIGQKWMFVSLDASAFAEMHEWNIGFSEAFPLEML 181
Query: 247 -------------------PFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKK 286
P +AWM+G+E I+ A + L+L G S + +
Sbjct: 182 HLSPQTRIPGIIILSPRAIPMAAWMSGIEPALIKFYPAPQARLLLETGGSDSWFLVK-QL 240
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
N + +EA +EAAK+ G+HFLAIQ SED GFWLL +L
Sbjct: 241 NGSSQTEAAGFEAAKQQAKGVHFLAIQSSPQSEDFAGFWLLQEL 284
>gi|282896250|ref|ZP_06304272.1| Putative uncharacterized protein [Raphidiopsis brookii D9]
gi|281198746|gb|EFA73625.1| Putative uncharacterized protein [Raphidiopsis brookii D9]
Length = 289
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 145/285 (50%), Gaps = 57/285 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD+ KK+WE+++C+ + +Y++Y P+ +NS+ L++A+
Sbjct: 5 WELDFYSRPILDVNQKKVWEVLICESPTDVITKVDSLFRYSQYCPSTQVNSVWLRQALEE 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ GV P KIRFFR QM +ITKAC+++ I + S++ L L W+++R E VY + P
Sbjct: 65 AIEKAGVA-PIKIRFFRRQMNNMITKACQDMGIPALSSRKTLVLNQWIQQRMEEVYPQEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ--------------------------- 245
G+Q+ + + L+ P P LPD L G +W FV
Sbjct: 124 GYQQVTNSSVRLERPLPQRLPDALEGKQWTFVSLEASDFTDMPEWEIAFGEAFPLELAGL 183
Query: 246 ----------------LPFSAWMNGLEVCSIETDT----ARGSLILSVGISTRYIYANYK 285
LP + WM+GLE+ + D+ L+L G + +I AN
Sbjct: 184 SPETPIPGILIFSPRALPIAGWMSGLELAYLRFDSNPNNQGDRLVLETGGTESWILANL- 242
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ P +A+ +E AK+ G+HF+ +Q + S+ GFWLL ++
Sbjct: 243 RTPKLLEDAKGFEEAKQKANGVHFIGVQSDPQSQSFAGFWLLKEI 287
>gi|427419514|ref|ZP_18909697.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
gi|425762227|gb|EKV03080.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
Length = 285
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 96/277 (34%), Positives = 138/277 (49%), Gaps = 52/277 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIVAIC 154
WELDF SRP+LD KK WE+++CDG+ S ++Y+K+ N +NSI L++AI
Sbjct: 5 WELDFYSRPVLDDNQKKRWEVLLCDGAQSVADSSRIRYSKFLSNKQVNSIELQQAIEEAI 64
Query: 155 DDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGF 214
+ G P +IRFFR QMQ +I +AC EL + S+R L+L WLE+R E Y + PG+
Sbjct: 65 EKAGES-PTQIRFFRYQMQNMIKRACDELGVSARLSRRTLTLQTWLEDRQENFYPQQPGY 123
Query: 215 QKGSKPLLALDNPFPMELPDNLFGDKWAFVQ----------------------------- 245
Q+G P LPD L G +WA V
Sbjct: 124 QEGKSPATVQPVEVARPLPDALIGQRWAMVSLPAKEFADMPEWEIGFGEAFPLELAGIGP 183
Query: 246 --------------LPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNPVT 290
LP + WM+GLE+ ++ + S L+L G + +I A+ + P
Sbjct: 184 DTMVPGILIFSERALPLAGWMSGLEMAYLDVQIDQISQLLLETGSNDTWIMASLNR-PEL 242
Query: 291 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
EAE + AAK+ +HF+A+Q+ DSE GFWL+
Sbjct: 243 KQEAERFMAAKEEANQVHFVAVQDNPDSESFAGFWLM 279
>gi|209528431|ref|ZP_03276864.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
gi|376003070|ref|ZP_09780887.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423067161|ref|ZP_17055951.1| hypothetical protein SPLC1_S532420 [Arthrospira platensis C1]
gi|209491136|gb|EDZ91558.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
gi|375328518|emb|CCE16640.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406711447|gb|EKD06648.1| hypothetical protein SPLC1_S532420 [Arthrospira platensis C1]
Length = 287
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 146/284 (51%), Gaps = 54/284 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRP+ D GKK+WE+++C+ L ++ YT++ P+ +NSI L+ AI
Sbjct: 4 TIWELDFYSRPLRDEDGKKVWEVIICETPLDVRSRPESLFRYTQFCPSTQVNSIWLQGAI 63
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+P P KIRFFR M +I+KA + LDI S+R +L WL+ER + VY
Sbjct: 64 QEAIAQAPLP-PSKIRFFRRPMANMISKAAEGLDIPASASRRTYTLFQWLQERIDKVYPT 122
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------ 246
+P +Q+G+ P + + P LPD L G++WA V L
Sbjct: 123 YPNYQEGTNPSVQFVSGEPQPLPDALQGEQWAIVSLEAAAFEDMPEWDIGFGEAFSLPMM 182
Query: 247 -------------------PFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKK 286
P +AWM+GLE+ + +T R +LIL G + +I AN
Sbjct: 183 GLSPETPVPGLIIFTTRAIPLAAWMSGLELAFLRLVETPRPNLILETGENESWILANL-T 241
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+P T +EA+ +E AK + +HFLAIQ + +SE GFW+L L
Sbjct: 242 DPKTQTEAKNFEQAKLSAKNVHFLAIQSDPNSESFAGFWMLQQL 285
>gi|434388804|ref|YP_007099415.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
gi|428019794|gb|AFY95888.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
Length = 286
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 144/281 (51%), Gaps = 55/281 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WE+DF SRP++D R KK+WEL++C+ + ++T+Y P++ +NS+ L EA+ A
Sbjct: 5 WEIDFYSRPLVDERQKKVWELLICESPATTDRSTEDLFRFTRYCPSDRVNSLWLAEALQA 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ P++IRFFR QM +ITKACK++ I S+R ++L W+++R E Y + P
Sbjct: 65 AMLE-AKQSPQRIRFFRRQMNNMITKACKDIGIPAAASRRTIALHQWIDDRMEHFYPQQP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL-------------------------- 246
+Q + + + + P LP+ L G+KW FV L
Sbjct: 124 NYQAANTASVQMFSDPPQPLPEALLGEKWTFVSLAASQFADMNEWQIGFSEAFPLAMVGV 183
Query: 247 -----------------PFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKKNP 288
P +AWM+GLE+ S+ A + +L+L G S +I A +
Sbjct: 184 TPEMPIPGLILYSPRSVPMAAWMSGLEIVSVRYQPAPKSTLLLETGASESWILARLEGT- 242
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
T EA +EA+K+ G+HF+AIQ D E+ GFWLL +
Sbjct: 243 -TQQEAARFEASKQQAKGVHFIAIQSSPDVEEFAGFWLLYE 282
>gi|119490556|ref|ZP_01622998.1| hypothetical protein L8106_07991 [Lyngbya sp. PCC 8106]
gi|119453884|gb|EAW35040.1| hypothetical protein L8106_07991 [Lyngbya sp. PCC 8106]
Length = 286
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 143/284 (50%), Gaps = 54/284 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRP+ D GKK+WE+++C L + +YT++ P+ +NSI L+ AI
Sbjct: 3 TIWELDFYSRPLRDEEGKKVWEVLICQTPLEIGDRAESLFRYTQFCPSTDVNSIWLQGAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
A + P++IRFFR M +I KACKEL I S+R +L WL+ER E VY
Sbjct: 63 QAAIKE-ADETPQRIRFFRRPMANMILKACKELAIPVTASRRTYALFQWLDERIENVYPT 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------ 246
P +Q+ + P + + P LPD L GD+WAFV L
Sbjct: 122 LPNYQETANPSVQFASSPPQRLPDALQGDQWAFVSLEASAFEEMSEWNIGFGEAFGLPML 181
Query: 247 -------------------PFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKK 286
P +AWM+GLE+ + + R SL+L G + +I AN
Sbjct: 182 GLSGETQIPGLIVFSSRATPLAAWMSGLELAFLRVNKGDRPSLLLETGENDSWILANL-T 240
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ T +EAE +E AK+ +HFLA+Q + ++E G W+L +L
Sbjct: 241 DAGTQAEAEQFEEAKRQAKNVHFLAVQSDPNTESFAGLWMLQEL 284
>gi|220906218|ref|YP_002481529.1| hypothetical protein Cyan7425_0781 [Cyanothece sp. PCC 7425]
gi|219862829|gb|ACL43168.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7425]
Length = 288
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/278 (35%), Positives = 132/278 (47%), Gaps = 52/278 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAIC----DD 156
WE+DF SRP+LD KKIWEL+VCD +Y + + N+ L+ +
Sbjct: 5 WEIDFYSRPLLDENQKKIWELLVCDPDRRFEYVQTCSGSQANARWLQTELATALPLWRQA 64
Query: 157 LGVP---IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
L +P +PEKIRFFR QM +IIT+AC +L I P PS+R +L WL+ER E VY + PG
Sbjct: 65 LELPETAMPEKIRFFRRQMNSIITRACTDLGIPPQPSRRTFTLYQWLKERSEKVYPQQPG 124
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL--------------------------- 246
FQ + LA + P LPD L G W F L
Sbjct: 125 FQPLAMSPLAFEASPPQPLPDALMGQGWTFASLAASEFAAATEWSITFGEVFPLSRLGLS 184
Query: 247 ----------------PFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYANYKKNPV 289
P + WM+GLE+ + +T LIL G+S R+I A + P
Sbjct: 185 PETVVPGLIIFSSRAKPLAGWMSGLELACLTLETEPVPQLILETGVSDRWILARL-RTPQ 243
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
E +E K+ G +HFLA+Q SED GFW+L
Sbjct: 244 LLEEGRNFEQTKQQAGQVHFLAVQTNPQSEDFAGFWVL 281
>gi|332712125|ref|ZP_08432053.1| protein of unknown function, DUF1092 [Moorea producens 3L]
gi|332348931|gb|EGJ28543.1| protein of unknown function, DUF1092 [Moorea producens 3L]
Length = 287
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 101/287 (35%), Positives = 139/287 (48%), Gaps = 57/287 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+++C+ L + QY + PN +NSI L +A+
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVLICESPLDINLSPETLFQYASWCPNQQVNSIWLGQAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P KIRFFR QM +ITKAC EL+I PS+R +L WL++R E Y
Sbjct: 63 ADAIAKAQQP-PSKIRFFRRQMNNMITKACNELNIPAQPSRRTYALERWLKQRIEDFYPN 121
Query: 211 HPGFQ--KGSKPLLALDNPFPMELPDNLFGDKWAFVQL---------------------- 246
PG+ + + +P P LPD L G KWA V L
Sbjct: 122 QPGYDPAAAASSFVRYQSPIPKPLPDALQGQKWAVVSLQAAAFEEMNEWEIDFGEAFPVS 181
Query: 247 ---------------------PFSAWMNGLEVCSIETDTA--RGSLILSVGISTRYIYAN 283
P +AWM+GLE+ + DT + L+L G + +I AN
Sbjct: 182 IMDIAPETPIPGVIIFSQRAKPLAAWMSGLELSFVRLDTTDDKPKLLLETGANDSWILAN 241
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
K+ + +EA+++E AK+ +HFLA+Q SE GFWL +L
Sbjct: 242 LTKSQI-LAEAKSFEEAKQNANLVHFLAVQSSPTSEQFAGFWLCREL 287
>gi|113478101|ref|YP_724162.1| hypothetical protein Tery_4728 [Trichodesmium erythraeum IMS101]
gi|110169149|gb|ABG53689.1| protein of unknown function DUF1092 [Trichodesmium erythraeum
IMS101]
Length = 286
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 140/284 (49%), Gaps = 54/284 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD R KK+WEL++C + + +Y+++ + +NSI L+ AI
Sbjct: 3 TIWELDFYSRPILDERQKKLWELLICQSPIGINDTTDSLYRYSEFTNSQEVNSIWLRSAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P PE+IRFFR QM +ITKAC EL I S+R L WLE+R E VY
Sbjct: 63 EKAIAQAPEP-PERIRFFRRQMNNMITKACGELAIPIALSRRTYLLNQWLEQRMEEVYPT 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------ 246
+PG+Q G+ P N P LPD L G++W FV L
Sbjct: 122 YPGYQPGTNPSGQYMNSAPQPLPDALIGERWTFVSLEAGAFTEMSEWDIDFGEAFPLSMM 181
Query: 247 -------------------PFSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKK 286
+AWM+GLE+ I+ A L+L+ G + +I AN
Sbjct: 182 NLAPLSAIPGLIIYSSRAQALAAWMSGLELAFIKFSPASPARLLLNTGGNDCWILANL-S 240
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
NP T +EA+ + AK +HFLA+Q +SE GFWLL ++
Sbjct: 241 NPSTIAEAKRFSEAKSKAKEVHFLAVQSNPESESFAGFWLLQEI 284
>gi|443327636|ref|ZP_21056256.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
gi|442792728|gb|ELS02195.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
Length = 289
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 107/287 (37%), Positives = 145/287 (50%), Gaps = 61/287 (21%)
Query: 101 WELDFCSRPILDIRGKKIWELVV------CDGSLS--LQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++ D SL +Y +Y + INS+ L EAI
Sbjct: 6 WELDFYSRPILDENQKKVWEVLIQESPTTTDRSLDDLFRYAQYTSSKTINSLWLSEAIEK 65
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +ITKAC+EL I I S+R +L W+E+R +VY
Sbjct: 66 AIAESGTK-PRKIRFFRRQMNNMITKACEELGIAAIASRRTYALAQWIEDRMTSVYPNET 124
Query: 213 GFQKGSKPLLALDNPFPME---LPDNLFGDK---WAFVQL-------------------- 246
G+ + + ++ P P+ LPD + GDK WAFV L
Sbjct: 125 GYDQKAANSASVKYP-PLNAIPLPDAVRGDKNDRWAFVSLDCSAFAEMSEWEINFGEAFP 183
Query: 247 -----------------------PFSAWMNGLEVCSIETD-TARGSLILSVGISTRYIYA 282
P +AWM+GLE+ ++ + T+R L L G S +I A
Sbjct: 184 LSLANIAGETKIPGLIFFSPRANPLAAWMSGLEMGYLQLEITSRPRLRLETGASDSWILA 243
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
N NP SEA+ +EA+KK G+HFLA+Q + +SE GFWLL D
Sbjct: 244 NV-TNPQILSEAKGFEASKKEAQGVHFLAVQSDPESESFAGFWLLKD 289
>gi|409993875|ref|ZP_11277002.1| hypothetical protein APPUASWS_22218 [Arthrospira platensis str.
Paraca]
gi|291566596|dbj|BAI88868.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409935287|gb|EKN76824.1| hypothetical protein APPUASWS_22218 [Arthrospira platensis str.
Paraca]
Length = 287
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 144/284 (50%), Gaps = 54/284 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRP+ D GKK+WE+++C+ L ++ YT++ P+ +NSI L+ AI
Sbjct: 4 TIWELDFYSRPLRDEDGKKVWEVIICETPLDVRSRPESLFRYTQFCPSTQVNSIWLQGAI 63
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+P P KIRFFR M +I+KA + LDI S+R +L WL+ER + VY
Sbjct: 64 EEAIAQAPLP-PSKIRFFRRPMANMISKAAEGLDIPASASRRTYTLFQWLQERIDKVYPT 122
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------ 246
+P +Q+G+ P + + P LPD L G++WA V L
Sbjct: 123 YPNYQEGTNPSVQFVSGEPQPLPDALQGEQWAIVSLEAAAFQDMPEWDIGFGEAFSLPMM 182
Query: 247 -------------------PFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKK 286
P +AWM+GLE+ + +T R SLIL G + +I AN
Sbjct: 183 GLSPETLVPGLIIFSTRAIPLAAWMSGLELAFLRLLETPRPSLILETGENESWILANLTD 242
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ T +EA +E AK + +HFLAIQ + +SE GFW+L L
Sbjct: 243 SK-TQTEARNFEQAKLSAKNVHFLAIQSDPNSESFAGFWMLQQL 285
>gi|427724036|ref|YP_007071313.1| hypothetical protein Lepto7376_2188 [Leptolyngbya sp. PCC 7376]
gi|427355756|gb|AFY38479.1| protein of unknown function DUF1092 [Leptolyngbya sp. PCC 7376]
Length = 285
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 101/286 (35%), Positives = 143/286 (50%), Gaps = 59/286 (20%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVC---------DGSLSLQYTKYFPNNVINSITLKE 148
+T WELDF SRPILD KK+WE+++C DG L +Y+++ N +NSITLK+
Sbjct: 1 MTIWELDFYSRPILDDNQKKLWEVLICEAPTSIKQGDGDL-FRYSEFCTNTEVNSITLKK 59
Query: 149 AIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
AI + GV P KIRFFR QM +I+K C++ I PS+R +L+ W+++R VY
Sbjct: 60 AIEKAIAEAGVS-PSKIRFFRRQMNNMISKGCEDAGIPSAPSRRAYTLMQWIDQRTREVY 118
Query: 209 TRHPGFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQL----------------- 246
HP F + + ++ P + LPD + GDKWA V L
Sbjct: 119 PEHPNFDEQAARNTSVQYPSLNAVALPDAVRGDKGDKWAIVSLEASAFEDFDDWEIDFGE 178
Query: 247 ------------------------PFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIY 281
P + WM+GLE+ + + R S++L G+S +I
Sbjct: 179 PFPLNNLNSDTKIPGLLIFSPRAVPLAGWMSGLELSFLHLNQQPRPSMVLETGVSDSWIV 238
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
A+ N T EA+ +E AKK G+HFLAIQ D E GFW+L
Sbjct: 239 ADL-PNKGTVKEAKNFETAKKKAEGIHFLAIQNSPDDERFAGFWML 283
>gi|428780588|ref|YP_007172374.1| hypothetical protein Dacsa_2415 [Dactylococcopsis salina PCC 8305]
gi|428694867|gb|AFZ51017.1| Protein of unknown function (DUF1092) [Dactylococcopsis salina PCC
8305]
Length = 287
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 103/288 (35%), Positives = 144/288 (50%), Gaps = 63/288 (21%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRPI D KK+WE+++C+ L ++ Y K+ +NSI L+EAI
Sbjct: 3 TIWELDFYSRPIRDENNKKLWEVLICESPLDVETTEEQLFRYQKFCSAQTVNSIFLQEAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC ++ I +PS+R +L W+EER E VY +
Sbjct: 63 NEAIEASGKS-PKKIRFFRRQMSNMITKACDDIGITALPSRRTYALQRWIEERLENVYPQ 121
Query: 211 HPGFQKGSKPLLALDNPFPME----LPDNLFGDK---WAFVQL----------------- 246
G+ + + + + +P E LPD + GDK WAFV L
Sbjct: 122 QEGYDETAVSSVTVQ--YPAENAAILPDAIRGDKGDRWAFVTLEVQGFQEMKEWEISFGE 179
Query: 247 --------------------------PFSAWMNGLEVCSIE-TDTARGSLILSVGISTRY 279
PF+ WM+G+E+ I+ +R LIL G S +
Sbjct: 180 GFPLSLFDLSPETKIPGLVIFSPRAMPFAGWMSGIELSQIQLQQGSRPRLILQTGTSECW 239
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I A+ NP T EA+ ++ AK+ G+HFLAIQ + SE GFWLL
Sbjct: 240 ILADI-TNPDTLKEAQGFQQAKETAQGVHFLAIQSDPQSEAFAGFWLL 286
>gi|443310782|ref|ZP_21040422.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
gi|442779136|gb|ELR89389.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
Length = 288
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 95/286 (33%), Positives = 138/286 (48%), Gaps = 55/286 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WE+DF SRP+LD KK+WE++VC+ LS+ +Y++Y ++ +NS LK A+
Sbjct: 5 WEIDFYSRPVLDENNKKLWEILVCESPLSIDTELDSLFKYSEYCSSSQVNSAWLKAALEK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ P K RFFR+ M +I KAC++L I PS+R L+L WL++R VY P
Sbjct: 65 AMEQ-SATTPLKFRFFRTSMNNMIVKACQDLGIPAQPSRRTLALHQWLQQRNLDVYPLEP 123
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL-------------------------- 246
G+Q + P + P LPD L G KW L
Sbjct: 124 GYQASTNPSVRGQKSDPQRLPDALIGQKWVVASLTGADLAQMPEWEIGFGEAFPLPLGEV 183
Query: 247 -----------------PFSAWMNGLEVCSIETDTAR--GSLILSVGISTRYIYANYKKN 287
P + WM+GLE+ +++ DT+ LIL G S ++ AN N
Sbjct: 184 ASDTIVPGVIIYSPRAVPLAGWMSGLEIAALKVDTSVNPARLILETGASDSWLLANV-TN 242
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
P T A+ +E AK+ +HFLA+Q +SE GFWLL ++ P
Sbjct: 243 PQTLQMAQDFEGAKQKANQVHFLAVQSSPESEVFAGFWLLQEINLP 288
>gi|170077740|ref|YP_001734378.1| hypothetical protein SYNPCC7002_A1122 [Synechococcus sp. PCC 7002]
gi|169885409|gb|ACA99122.1| conserved hypothetical protein (DUF1092) [Synechococcus sp. PCC
7002]
Length = 285
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 141/285 (49%), Gaps = 57/285 (20%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEA 149
+T WELDF SRP+LD KK+WE+++C+ +Q Y+++ N +NSITLK A
Sbjct: 1 MTIWELDFYSRPLLDDNDKKLWEILICETPTRIQQDPTTLFRYSEFCSNTDVNSITLKTA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I G P KIRFFR QM +ITK C++ I PS+R +L+ W+ +R + VY
Sbjct: 61 IEKAIATSGQS-PTKIRFFRRQMNNMITKGCEDAGIPAAPSRRTYTLMTWITQREQEVYP 119
Query: 210 RHPGFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQL------------------ 246
+ + + S ++ P + LPD + GDKWA V L
Sbjct: 120 QEANYDEKSAKSSSVQYPALNAIALPDAVRGDKGDKWAIVSLEASAFSDFDEWDIAFGEP 179
Query: 247 -----------------------PFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYA 282
P + WM+GLE+ + R SL+L G+S +I A
Sbjct: 180 FPLTHLDPTTKIPGLLIFSPRAVPLAGWMSGLELGFLHLQKNPRSSLVLETGVSDSWIVA 239
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ N T EAE++EAAKKA G+HFLAIQ+ D E GFW+L
Sbjct: 240 DL-PNAQTLKEAESFEAAKKAAAGIHFLAIQKSPDEEQFAGFWML 283
>gi|428775715|ref|YP_007167502.1| hypothetical protein PCC7418_1082 [Halothece sp. PCC 7418]
gi|428689994|gb|AFZ43288.1| protein of unknown function DUF1092 [Halothece sp. PCC 7418]
Length = 287
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/286 (35%), Positives = 145/286 (50%), Gaps = 59/286 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAI 150
T WELDF SRPI D KK+WE+++C+ L +Y+K+ +NSI L+EA+
Sbjct: 3 TIWELDFYSRPIRDENNKKLWEVLICESPLQANTTEGELFRYSKFCSAQNVNSIFLQEAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC++L+I +PS+R +L WL+ER + VY +
Sbjct: 63 NEAMEKSGT-TPKKIRFFRRQMNNMITKACEDLEITALPSRRTYALQKWLQERLDQVYPQ 121
Query: 211 HPGFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQL------------------- 246
G+ + + ++ P + LPD + GDKWAFV L
Sbjct: 122 QEGYDETAVTNASVQYPAENAVILPDAIRGDKGDKWAFVTLEAQAFQEMEDWDISFGEGF 181
Query: 247 ------------------------PFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIY 281
PF+ WM+G+E+ I+ + + L+L G S +I
Sbjct: 182 PLSLFELAPETKVPGLVIFSPRAMPFAGWMSGIELSQIQLQEGSLPRLVLQTGSSDCWIL 241
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
A+ NP T EA+ + AAKK G+HFLAIQ + SE GFWLL
Sbjct: 242 ADI-TNPETLKEAQGFAAAKKDAKGVHFLAIQTDPSSESFAGFWLL 286
>gi|443319275|ref|ZP_21048509.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
gi|442781102|gb|ELR91208.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
Length = 287
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/281 (33%), Positives = 139/281 (49%), Gaps = 54/281 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD R K+ WE+++ +G + +++++ N +NS+ LKE I
Sbjct: 3 TIWELDFYSRPILDERNKRRWEVLISEGLQRVDADPENLFRFSQFLANTDVNSLKLKEVI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P ++RFFR MQT+IT+AC++L + PS+R L+L W++ R VY +
Sbjct: 63 ETAIAQAPEP-PSRVRFFRFSMQTMITRACEDLGLAATPSRRTLALQDWIDYRQREVYPQ 121
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------- 247
PG+ P + P P LPD L G +WAFV LP
Sbjct: 122 DPGYTDKPAPTVGAPPPSPRRLPDALVGQRWAFVTLPARDFADMPDWPMDFGEGFPLSLA 181
Query: 248 --------------------FSAWMNGLEVCSIETDTARG-SLILSVGISTRYIYANYKK 286
+ WM+GLE+ + +T++ LIL G + +I +
Sbjct: 182 GIGDDTPIPGIIIFSPRAVAMAGWMSGLELSELRVETSKSPRLILETGAADSWILSPLGD 241
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + T EA+ +EAAK A +HFLA+QE +E GFWL+
Sbjct: 242 STLQT-EAKNFEAAKVAANQVHFLALQENPATEAFAGFWLM 281
>gi|298714858|emb|CBJ25757.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 310
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/276 (32%), Positives = 133/276 (48%), Gaps = 47/276 (17%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL-G 158
EWELD SRP++ GKK+WEL++CD + + ++ P+N++NS ++ I + + G
Sbjct: 35 EWELDVYSRPVVGADGKKLWELLICDSTGNFRHVSPIPSNMVNSREVRRTIEGVIEAAPG 94
Query: 159 VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGS 218
P IRFFR+ M +I A KE+++ P + ++ WLEER VY GF+
Sbjct: 95 GSKPTVIRFFRNAMFNMIDIALKEVEVAVKPCRTTYAMYQWLEERERDVYPAMAGFKPTM 154
Query: 219 KPLLALDNPFPMELPDNLFGDKWAFVQLPFS----------------------------- 249
K D P LPD L G+++AFV +P S
Sbjct: 155 KQPAFFDIRTPTPLPDALRGEQYAFVTMPVSEFRQGNINDENVGVGRLCPLDASLPDDAM 214
Query: 250 ---------------AWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEA 294
WM GLEV + D L L GI+T+Y+ A + + EA
Sbjct: 215 IPGLAMFTARAEPLATWMTGLEVAYFKADLKNRELALECGINTQYLVARVQGD--QRKEA 272
Query: 295 EAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ +E AK+A GG HF+A+Q D++D GFWLL ++
Sbjct: 273 QGFEEAKRALGGFHFVAVQSNPDADDVAGFWLLKEV 308
>gi|428218630|ref|YP_007103095.1| hypothetical protein Pse7367_2406 [Pseudanabaena sp. PCC 7367]
gi|427990412|gb|AFY70667.1| protein of unknown function DUF1092 [Pseudanabaena sp. PCC 7367]
Length = 287
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 136/281 (48%), Gaps = 55/281 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP+L+ KKIWEL++CD + +++ + P++ +NS L E + + G
Sbjct: 5 WELDFYSRPVLNQNKKKIWELLICDRTRQMEWVQECPSDRVNSAWLAEQLQTVIQKTG-Q 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+K+RFFR M IIT+ C + + P+ S+R +L WL+ER VY + GFQ
Sbjct: 64 TPQKVRFFRPSMANIITRGCNQAGLNPLASRRVFTLAAWLQERMAQVYPQQEGFQAADPN 123
Query: 221 LLALDNPFPM----ELPDNLFGDKWAFVQL------------------------------ 246
L L P +PD L G+ WA V L
Sbjct: 124 PLPLAVPMQQISTRPIPDALIGEGWAIVSLRADQFASAGDWSIDFEELFDLSYLSDDTLI 183
Query: 247 -----------PFSAWMNGLEVCSIE--TDTARGS--LILSVGISTRYIYANYK-----K 286
P +AWM G++ ++ T+ GS ++L R++ AN++ K
Sbjct: 184 PGLIIYSHRATPLAAWMAGVDPVFLKFVTNQNDGSSQMLLEANADARWLVANFQSAKAPK 243
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
N ++ +A+E AK+ +HFLAIQ+ DSED GFWLL
Sbjct: 244 NAKAIADGQAFETAKQKAAQVHFLAIQDNPDSEDFAGFWLL 284
>gi|425464328|ref|ZP_18843650.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389833706|emb|CCI21561.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 291
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 100/288 (34%), Positives = 143/288 (49%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A + GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKEAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMANFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP S WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLSGWMSGLEMAYLKLETGSRPLLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 239 ILVNV-TNAETLNEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|425443217|ref|ZP_18823442.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715544|emb|CCI00112.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 291
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/288 (35%), Positives = 142/288 (49%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP+LD KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVLDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP S WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLSGWMSGLEMAYLKLETGSRPLLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 239 ILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|166363644|ref|YP_001655917.1| hypothetical protein MAE_09030 [Microcystis aeruginosa NIES-843]
gi|166086017|dbj|BAG00725.1| hypothetical protein MAE_09030 [Microcystis aeruginosa NIES-843]
Length = 291
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 101/288 (35%), Positives = 143/288 (49%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A + GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKEAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP S WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLSGWMSGLEMAYLKLETGSRPLLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 239 ILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|425436008|ref|ZP_18816449.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9432]
gi|389679353|emb|CCH91843.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9432]
Length = 291
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 101/288 (35%), Positives = 142/288 (49%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ +++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPVTIDRSSDTIFKYASYCPNTMVNSQWLSEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHDIKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 239 ILVNV-TNTETLKEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|422303161|ref|ZP_16390515.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9806]
gi|389791919|emb|CCI12318.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9806]
Length = 291
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 100/288 (34%), Positives = 142/288 (49%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A GV P+KIRFFR QM +ITKAC+++ I PS+R +L W++ER Y
Sbjct: 61 VTAAIKAAGV-TPKKIRFFRRQMNNMITKACEDIGIPASPSRRTHALTRWIKERMANFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETSSRPVLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 239 ILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|428223761|ref|YP_007107858.1| hypothetical protein GEI7407_0302 [Geitlerinema sp. PCC 7407]
gi|427983662|gb|AFY64806.1| protein of unknown function DUF1092 [Geitlerinema sp. PCC 7407]
Length = 289
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 94/285 (32%), Positives = 141/285 (49%), Gaps = 54/285 (18%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD R KK+WE++VC+ ++ +Y +Y + +NS+ L++A+
Sbjct: 3 TIWELDFYSRPILDEREKKVWEVLVCESPQTVNQAPETLFRYAEYCDSGEVNSVRLRQAL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
P P+KIRFFR Q+ +ITKAC +L + P+PS+R ++L WLEER VY
Sbjct: 63 ERAIAQAPQP-PDKIRFFRRQLTNMITKACSDLGVLPLPSRRTVTLNQWLEERSRDVYPL 121
Query: 211 HPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFV------------------------- 244
P +++G P + + P P LPD L D+ AFV
Sbjct: 122 DPNYREGVVVPSVQFETPEPKRLPDALNYDRLAFVTLEAGAFADMTEWSIDFGEAFPLEA 181
Query: 245 ------------------QLPFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYK 285
LP +AWM+GLE+ + +T L+L G + R++
Sbjct: 182 LGLTPETRVPGVLLFSSRALPLAAWMSGLEMAFVRYEETPNPCLVLDTGANERWLLRGNL 241
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
EA+ +E AK+A +HF+ +Q + SE GFWLL ++
Sbjct: 242 AERSQQQEAKNFELAKQAAQNVHFIGVQSDPQSEAFSGFWLLQEV 286
>gi|390439073|ref|ZP_10227492.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389837496|emb|CCI31616.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 291
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 100/288 (34%), Positives = 141/288 (48%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP+LD KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVLDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A + GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKEAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDTA-RGSLILSVGISTRY 279
LP + WM+GLE+ ++ + R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLEAGSRPLLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 239 ILVNV-TNAETLNEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|359459949|ref|ZP_09248512.1| hypothetical protein ACCM5_14568 [Acaryochloris sp. CCMEE 5410]
Length = 287
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 52/281 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAIC------ 154
WE+DF SRPILD + KKIWEL+VCD + ++TK + N+ L+EA+
Sbjct: 5 WEIDFYSRPILDEQQKKIWELLVCDSQRNFEFTKVCSGSQANARWLQEALAEALPLWRQQ 64
Query: 155 DDLGVP-IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
+ G PE+IRFFR M++II +AC+ L+I PS+R + WL ER +TVY +HPG
Sbjct: 65 GNYGEQDFPERIRFFRRSMKSIIPRACEALEIPAQPSRRTFGVYQWLCEREQTVYPQHPG 124
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL--------------------------- 246
+Q + + P LPD L G+ W V L
Sbjct: 125 YQPMMAAPMTFEPTLPKPLPDALQGEGWRLVTLQLSAFEDMDEWDIAFGAKIPLAQLNLP 184
Query: 247 ----------------PFSAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYANYKKNPV 289
P + WM+GLE+ ++ + + L+L G+S R++ A +P+
Sbjct: 185 PETAIPGLLIFSERSTPLAGWMSGLELACLKLEMDPKPQLLLETGLSDRWVIAYLNDDPL 244
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+E + +E K+A +HF+A+Q +SE GFWL+ ++
Sbjct: 245 -VAEIQDFEKTKQAAQQIHFVAVQSSPESEQFAGFWLMQEI 284
>gi|443649603|ref|ZP_21130311.1| hypothetical protein C789_851 [Microcystis aeruginosa DIANCHI905]
gi|159028601|emb|CAO90604.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334903|gb|ELS49392.1| hypothetical protein C789_851 [Microcystis aeruginosa DIANCHI905]
Length = 291
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 100/288 (34%), Positives = 142/288 (49%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMANFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETSSRPLLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLAIQ +S+ GFWLL
Sbjct: 239 ILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESQSFAGFWLL 285
>gi|16330318|ref|NP_441046.1| hypothetical protein sll2002 [Synechocystis sp. PCC 6803]
gi|383322059|ref|YP_005382912.1| hypothetical protein SYNGTI_1150 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383325228|ref|YP_005386081.1| hypothetical protein SYNPCCP_1149 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383491112|ref|YP_005408788.1| hypothetical protein SYNPCCN_1149 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384436379|ref|YP_005651103.1| hypothetical protein SYNGTS_1150 [Synechocystis sp. PCC 6803]
gi|451814476|ref|YP_007450928.1| hypothetical protein MYO_111600 [Synechocystis sp. PCC 6803]
gi|1652807|dbj|BAA17726.1| sll2002 [Synechocystis sp. PCC 6803]
gi|339273411|dbj|BAK49898.1| hypothetical protein SYNGTS_1150 [Synechocystis sp. PCC 6803]
gi|359271378|dbj|BAL28897.1| hypothetical protein SYNGTI_1150 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274548|dbj|BAL32066.1| hypothetical protein SYNPCCN_1149 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277718|dbj|BAL35235.1| hypothetical protein SYNPCCP_1149 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|451780445|gb|AGF51414.1| hypothetical protein MYO_111600 [Synechocystis sp. PCC 6803]
Length = 292
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 143/284 (50%), Gaps = 59/284 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVA 152
WELDF SRP+LD KK+WE+++C+ S+Q Y++Y P++ +NS+ L++AI A
Sbjct: 5 WELDFYSRPLLDDEEKKVWEVLICESPQSVQQLPGDLFRYSQYCPSSTVNSVWLRQAIEA 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G +P+KIRFFR QM +I+KAC+E I P PS+R L WL +R E Y + P
Sbjct: 65 AIAEAGQ-MPQKIRFFRRQMNNMISKACEEAGIPPAPSRRTYVLEQWLGDRLENFYPQQP 123
Query: 213 GFQKGSKPLLALDNP--FPMELPDNLFGDK---WAFVQ---------------------- 245
G+ ++ P + LPD + GD+ WA V
Sbjct: 124 GYDPKLASSTSVQYPELNAIALPDAVRGDRGDQWALVSLAAADFNDLPDWEISFGESFPL 183
Query: 246 ---------------------LPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYAN 283
LPF+AW++GLE+ ++ +T R + L G S +I AN
Sbjct: 184 SSYNLSPDSRIPGLILFSPRALPFAAWLSGLELGYLQYNTDPRPIMRLETGASDSWIVAN 243
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + EA+ +E KK G+HFLAIQ DSE GFWLL
Sbjct: 244 V-TDKTSEQEAQGFEQTKKLAQGIHFLAIQTSPDSETFAGFWLL 286
>gi|158336667|ref|YP_001517841.1| hypothetical protein AM1_3535 [Acaryochloris marina MBIC11017]
gi|158306908|gb|ABW28525.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 287
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 139/281 (49%), Gaps = 52/281 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAIC------ 154
WE+DF SRPILD + KKIWEL+VCD + ++TK + N+ L+EA+
Sbjct: 5 WEIDFYSRPILDEQQKKIWELLVCDSQRNFEFTKVCSGSQANARWLQEALAEALPLWRQQ 64
Query: 155 DDLGVP-IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
+ G PE+IRFFR M++II +AC+ L+I PS+R + WL ER +TVY +HPG
Sbjct: 65 ANYGEQDFPERIRFFRRSMKSIIPRACEALEIPAQPSRRTFGVYQWLCEREQTVYPQHPG 124
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL--------------------------- 246
+Q + + P LPD L G+ W V L
Sbjct: 125 YQPMMAAPMTFEPTLPKPLPDALQGEGWRLVTLQLSAFEEMDEWDIAFGAKIPLAQLNLP 184
Query: 247 ----------------PFSAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYANYKKNPV 289
P + WM+GLE+ ++ + + L+L G+S R++ A +P+
Sbjct: 185 PETAIPGLLIFSERSTPLAGWMSGLELACLKLEMDPKPQLLLETGLSDRWVIAYLNDDPL 244
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+E + +E K+A +HF+A+Q +SE GFWL+ ++
Sbjct: 245 -VAEIQDFEKTKQAAQQVHFVAVQSSPESEQFAGFWLMQEI 284
>gi|425445142|ref|ZP_18825178.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9443]
gi|389734932|emb|CCI01483.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9443]
Length = 291
Score = 150 bits (380), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 99/288 (34%), Positives = 141/288 (48%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRPSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHDIKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDTA-RGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLA+Q +S+ GFWLL
Sbjct: 239 ILVNV-TNAKTLNEAKNFEEAKQKANNLHFLAVQSNPESQSFAGFWLL 285
>gi|425453946|ref|ZP_18833695.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9807]
gi|389799877|emb|CCI20614.1| Similar to tr|P73680|P73680 [Microcystis aeruginosa PCC 9807]
Length = 291
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 99/288 (34%), Positives = 141/288 (48%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRPSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ + PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGVPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFHDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 239 ILVNV-TNTETLKEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|425451748|ref|ZP_18831568.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|440751656|ref|ZP_20930859.1| hypothetical protein O53_19 [Microcystis aeruginosa TAIHU98]
gi|389766807|emb|CCI07649.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|440176149|gb|ELP55422.1| hypothetical protein O53_19 [Microcystis aeruginosa TAIHU98]
Length = 291
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 99/288 (34%), Positives = 141/288 (48%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 VTAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 239 ILVNV-TNTETLKEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|434398597|ref|YP_007132601.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
gi|428269694|gb|AFZ35635.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
Length = 292
Score = 150 bits (379), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 99/288 (34%), Positives = 147/288 (51%), Gaps = 61/288 (21%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS---------LQYTKYFPNNVINSITLKEA 149
T WELDF SRPILD KK+WE+++C+ SL+ +Y++Y + +NS+ L+EA
Sbjct: 4 TIWELDFYSRPILDEENKKVWEVLICE-SLTDPERSPDEIFRYSQYCSSKTVNSLWLREA 62
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I G+ P+KIRFFR QM +ITKAC++ I PS R +L WL R + VY
Sbjct: 63 IEKAIAIAGI-TPKKIRFFRRQMNNMITKACEDAGIAAAPSSRTYALNHWLATRMKEVYP 121
Query: 210 RHPGFQKGSKPLLALDNP--FPMELPDNL---FGDKWAFV-------------------- 244
+ PG+ + + +++ P + LPD + GDKWAFV
Sbjct: 122 QEPGYDQKTASSISVQYPDLNAIPLPDAVRGDRGDKWAFVSLEASAFAEMNEWEIGFKEA 181
Query: 245 ------------QLP-----------FSAWMNGLEVCSIETDT-ARGSLILSVGISTRYI 280
Q+P +AW++GLE+ + ++ R + L+ G+S ++
Sbjct: 182 FPLSLLNLSSETQIPGLIIFSPRATLLAAWLSGLEMGFLHLESDPRPRICLNTGLSDSWV 241
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLL 328
N P T +EA+ +E AK+ G+HFLAIQ +SE GFWLLL
Sbjct: 242 LVNL-TTPSTLTEAKEFELAKQKAQGVHFLAIQSSTESESFAGFWLLL 288
>gi|425472349|ref|ZP_18851200.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881591|emb|CCI37866.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 291
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 100/288 (34%), Positives = 140/288 (48%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN ++NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTIFKYASYCPNTMVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A G P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAG-GTPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDTA-RGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPLLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N N T +EA+ +E AK+ LHFLAIQ +SE GFWLL
Sbjct: 239 ILVNV-TNAETLNEAKNFEEAKQKANNLHFLAIQSNPESESFAGFWLL 285
>gi|218437072|ref|YP_002375401.1| hypothetical protein PCC7424_0060 [Cyanothece sp. PCC 7424]
gi|218169800|gb|ACK68533.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7424]
Length = 290
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 141/284 (49%), Gaps = 59/284 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS--------LQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++C +Y+++ N +NS+ L EAI
Sbjct: 5 WELDFYSRPILDENKKKLWEVLICQAPTESDQSPDSLFKYSEFCSNTTVNSLWLGEAIKK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P+KIRFFR QM +I+KAC++ I P PS+R +L W+EER VY +
Sbjct: 65 ATLEAG-EAPKKIRFFRRQMNNMISKACEDAGIDPAPSRRTYALNQWIEERMRDVYPQQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLF---GDKWAFVQ---------------------- 245
G+ + + +++ P + LPD + GDK+AFV
Sbjct: 124 GYDENAAKPVSVQYPALNAVPLPDAIRGDKGDKYAFVSLEAEAFAQMKEWDIAFGEAFPL 183
Query: 246 ---------------------LPFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYAN 283
LP + WM+GLE+ ++ +++R L L G+S +I N
Sbjct: 184 SMVGVTSEVKIPGVIIYSSRALPLAGWMSGLEMGYLKLEESSRPILRLETGVSDSWILLN 243
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
NP T +EA+ +EA K+ +HFLA+Q +SE GFWLL
Sbjct: 244 V-TNPQTLAEAKGFEATKQKANNVHFLAVQSSPESESFSGFWLL 286
>gi|257062177|ref|YP_003140065.1| hypothetical protein Cyan8802_4445 [Cyanothece sp. PCC 8802]
gi|256592343|gb|ACV03230.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8802]
Length = 293
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 98/287 (34%), Positives = 142/287 (49%), Gaps = 59/287 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+V+C+ L++ +Y+++ + +NS+ L+EAI
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVVICETPLTVDRSPDTLFKYSQFCSSQTVNSVWLREAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC++ I +PS+R +L WL ER + Y
Sbjct: 63 ESAIAQAG-ETPQKIRFFRRQMNNMITKACEDAGIAAVPSRRTYTLTHWLAERNQQFYPT 121
Query: 211 HPGFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFV--------------------- 244
PG+ + ++ P + LPD + G DKWAFV
Sbjct: 122 QPGYSVEAAQTSSVAYPELNAIPLPDAVRGDKADKWAFVTLEASALEEMNEWEIGFGEGF 181
Query: 245 ----------------------QLPFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIY 281
LP +AWM+GLE+ ++ + R + L G S +I
Sbjct: 182 PLSLLGVTSEQRIPGLIIFSDRALPLAAWMSGLELGFLKFEENPRPIVRLETGTSDSWIL 241
Query: 282 ANYK-KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
N K+ T +EA+ +E AK+ +HFLAIQ D+E GFWLL
Sbjct: 242 VNISPKDAPTLAEAQGFETAKQNGQQVHFLAIQSSPDTESFAGFWLL 288
>gi|425459302|ref|ZP_18838788.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389823007|emb|CCI29141.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 291
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 99/288 (34%), Positives = 140/288 (48%), Gaps = 61/288 (21%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEA 149
+T WELDF SRP++D KK WEL++C+ ++ +Y Y PN +NS L EA
Sbjct: 1 MTIWELDFYSRPVVDENNKKRWELLICETPATIDRSSDTLFKYASYCPNTTVNSQWLGEA 60
Query: 150 IVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
I A GV P+KIRFFR QM +I+KAC+++ I PS+R +L W+EER Y
Sbjct: 61 ITAAIKAAGV-TPKKIRFFRRQMNNMISKACEDIGIPASPSRRTHALTRWIEERMVNFYP 119
Query: 210 RHPGFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFV------------------- 244
+ G+ + +++ P P+ LPD + G DKWAFV
Sbjct: 120 QEVGYDQNLTKTASVNYP-PLNAVPLPDAVRGDKADKWAFVTLELSSFNDLKDWDISFGE 178
Query: 245 ------------------------QLPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRY 279
LP + WM+GLE+ ++ +T +R L L G S +
Sbjct: 179 NLPILGMGLDENLKIPGLVIFSPRALPLAGWMSGLEMAYLKLETGSRPVLRLETGASDSW 238
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I + N T EA+ +E AK+ LHFLA+Q +SE GFWLL
Sbjct: 239 ILVSV-TNTETLKEAKNFEEAKQKANNLHFLAVQSNPESESFAGFWLL 285
>gi|218249090|ref|YP_002374461.1| hypothetical protein PCC8801_4383 [Cyanothece sp. PCC 8801]
gi|218169568|gb|ACK68305.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8801]
Length = 293
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/287 (34%), Positives = 142/287 (49%), Gaps = 59/287 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+V+C+ L++ +Y+++ + +NS+ L+EAI
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVVICETPLTVDRSPDTLFKYSQFCSSQTVNSVWLREAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G P+KIRFFR QM +ITKAC++ I +PS+R +L WL ER + Y
Sbjct: 63 ESAIAQAG-ETPQKIRFFRRQMNNMITKACEDAGIAAVPSRRTYTLTHWLAERDQQFYPT 121
Query: 211 HPGFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFV--------------------- 244
PG+ + ++ P + LPD + G DKWAFV
Sbjct: 122 QPGYSVEAAQTSSVAYPELNAIPLPDAVRGDKADKWAFVTLEASALEEMNEWEIGFGEGF 181
Query: 245 ----------------------QLPFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIY 281
LP +AWM+GLE+ ++ + R + L G S +I
Sbjct: 182 PLSLLGVTSEQRIPGLIIFSDRALPLAAWMSGLELGFLKFEENPRPIVRLETGTSDSWIL 241
Query: 282 ANYK-KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
N K+ T +EA+ +E AK+ +HFLAIQ D+E GFWLL
Sbjct: 242 VNISPKDAPTLAEAQGFETAKQNGQQVHFLAIQSSPDTESFAGFWLL 288
>gi|428771014|ref|YP_007162804.1| hypothetical protein Cyan10605_2686 [Cyanobacterium aponinum PCC
10605]
gi|428685293|gb|AFZ54760.1| protein of unknown function DUF1092 [Cyanobacterium aponinum PCC
10605]
Length = 294
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 96/286 (33%), Positives = 134/286 (46%), Gaps = 59/286 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVC--------DGSLSLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPI+D KK WE+++C D S +Y+++ N +NSITL+ AI
Sbjct: 5 WELDFYSRPIIDENNKKRWEILICESPTTIDTDTSQLFRYSQFCANTEVNSITLQNAIAT 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I K C++ I + S+ +L WLEER + Y
Sbjct: 65 AIEKAG-ETPSKIRFFRRQMNNMILKGCEDAGIPALASRHTYTLNQWLEERMTSFYPLQE 123
Query: 213 GFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFVQ---------------------- 245
G+ + + ++ P P+ LPD L G DKWA V
Sbjct: 124 GYDEKATIAASVQYPQTNPVNLPDALKGDKKDKWALVSLNGKDLEEMPEWDIGFREAFPL 183
Query: 246 ---------------------LPFSAWMNGLEVCSIETDTARG-SLILSVGISTRYIYAN 283
LP + WM+GLE+ + D + S+ L G+S +I N
Sbjct: 184 KIANISPDTKIPGLIIFSSRALPLAGWMSGLELGYLRLDRGKFPSICLETGVSDSWILVN 243
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
+ T SEAE +E KK G+HFLAIQ +S+ FWLLL+
Sbjct: 244 L-TDKNTLSEAEGFENTKKQANGVHFLAIQSSPESQSFEAFWLLLE 288
>gi|254425410|ref|ZP_05039128.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196192899|gb|EDX87863.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 301
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 138/296 (46%), Gaps = 70/296 (23%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAI 150
T WELDF SRP+LD + KK WE+++C+G S++ Y+KY N+ +NS TL+ AI
Sbjct: 3 TVWELDFYSRPVLDEQNKKRWEILICEGLQSVEDDPANLFRYSKYVSNSEVNSETLQAAI 62
Query: 151 ---VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
+A P K+R+FR QMQ +I +AC+E + PS+R L+L WLE+R V
Sbjct: 63 EEAIAQSASESADSPTKVRYFRYQMQNMIKRACEEAGLLSYPSRRTLALQQWLEDRKVNV 122
Query: 208 YTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQL--------------------- 246
Y P ++ + +A LPD L G +WA V L
Sbjct: 123 YPNEPRYKPSASASVAKPIDVVNPLPDALIGQQWALVTLPAKEFADMGDWDVAFKEAFPL 182
Query: 247 ----------------------PFSAWMNGLEVCSIE-------------TDTARGSLIL 271
P +AWM+GLE+ + TDTAR L++
Sbjct: 183 EIAGVEPDTPIPGFIIYSNRATPLAAWMSGLEIAGVRAGKEESSNYVSKNTDTAR--LLM 240
Query: 272 SVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
G ++ A+ P T +E +E AK A +HF+A+Q+ +SE G WL+
Sbjct: 241 DTGTIETWLLADL-VTPETQAEGLRFENAKAAANNVHFIAVQDSPESETFAGMWLM 295
>gi|428204149|ref|YP_007082738.1| hypothetical protein Ple7327_4040 [Pleurocapsa sp. PCC 7327]
gi|427981581|gb|AFY79181.1| Protein of unknown function (DUF1092) [Pleurocapsa sp. PCC 7327]
Length = 291
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 94/285 (32%), Positives = 138/285 (48%), Gaps = 61/285 (21%)
Query: 101 WELDFCSRPILDIRGKKIWELVVC----DGSLSL----QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK+WE+++C D S +Y+++ N +NS+ L++ I
Sbjct: 5 WELDFYSRPILDENNKKLWEVLICETPTDSKQSFDSLFKYSQFCSNQSVNSLWLQQEIEK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
GV P+KIRFFR QM +I KAC++L I P PS+R +L WL +R + Y P
Sbjct: 65 AIAQAGV-APKKIRFFRRQMNNMIVKACEDLGIPPAPSRRTYALERWLSQRLDEFYPNQP 123
Query: 213 GFQKGSKPLLALDNPFPME---LPDNLF---GDKWAFVQ--------------------- 245
G+ + ++ P P+ LPD + GDKWAFV
Sbjct: 124 GYDAAAAKSASVQYP-PLNATPLPDAVRGDKGDKWAFVSLEASAFEEMNEWDIAFGEAFP 182
Query: 246 ----------------------LPFSAWMNGLEVCSIETDTARGSLI-LSVGISTRYIYA 282
LP + WM+GLE+ ++ + ++ L G S +I A
Sbjct: 183 LSLTGMTPDTKIPGLIIFSSRALPLAGWMSGLELAFLKFEGGSRPIVRLETGASDSWILA 242
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ +P +EA+ +E AK+ +HFLAIQ +S+ GFWLL
Sbjct: 243 SL-TDPKMLAEAKGFEEAKQKAQQVHFLAIQSNPESQSFAGFWLL 286
>gi|172036928|ref|YP_001803429.1| hypothetical protein cce_2013 [Cyanothece sp. ATCC 51142]
gi|354554731|ref|ZP_08974035.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
gi|171698382|gb|ACB51363.1| DUF1092-containing protein [Cyanothece sp. ATCC 51142]
gi|353553540|gb|EHC22932.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
Length = 289
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 94/287 (32%), Positives = 138/287 (48%), Gaps = 61/287 (21%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSL--------SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ +Y ++ P + +NSI L+EA+
Sbjct: 5 WELDFYSRPILDENNKKQWEVLICETQTDTTESLDKGFRYAEFCPPSTVNSIWLREALET 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I KAC++ + PS+R +L W+ +R++ Y
Sbjct: 65 AIEKAG-ETPSKIRFFRRQMNNMIVKACEDAGLVASPSRRTYTLNHWINQRFQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPFPME---LPDNLFG---DKWAFVQ--------------------- 245
G+ + + ++ P P++ LPD + G DKWAFV
Sbjct: 124 GYDEKAATNASVAYP-PLDAIALPDAVRGDKSDKWAFVSLEASGFADMKEWDIRFGEGFP 182
Query: 246 ----------------------LPFSAWMNGLEVCSIETDTA-RGSLILSVGISTRYIYA 282
LP + WM+GLE+ S++ T +L L G+S +I A
Sbjct: 183 LELANLSPDTKIPGFIIFSRRALPLAGWMSGLELVSLKFQTKPFPNLCLETGLSDNWILA 242
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
N + + +EAE +E +K G+HFLAIQ D E GFWLL D
Sbjct: 243 NL-TDKSSVTEAEGFEQSKNKANGVHFLAIQSRPDVETFSGFWLLKD 288
>gi|219117107|ref|XP_002179348.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409239|gb|EEC49171.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 278
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 93/280 (33%), Positives = 135/280 (48%), Gaps = 53/280 (18%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGV 159
EWELD SRP+ + GKK+WE+++ D + S ++ + P+N +NS TL++ + + + V
Sbjct: 1 EWELDCYSRPVA-VAGKKLWEVLITDSAGSFRFRQTLPSNQVNSKTLRQIVDDLMERADV 59
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK--- 216
P IRFFR M +I A EL + PS+ +L WLE+R+E VY + GF
Sbjct: 60 K-PNTIRFFRGAMFNMINIALMELPVTSKPSRCTFALASWLEDRHENVYPQMEGFNANMV 118
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------------- 247
GS LD P+ LPD L G+K+AFV LP
Sbjct: 119 GSTIPSFLDVRTPVRLPDALRGEKYAFVALPVAEFLPGGSVDATNIGVGRICTIPRDIPA 178
Query: 248 ----------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTT 291
++W+ G EV ++ D + L++ I T+Y+ A K N
Sbjct: 179 DAFVQGVVILTNRAEALASWLAGTEVVALTADLRKRVLVMETDIDTQYLMA--KLNESQR 236
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
EA + E K GLHF+++QE DS D GFWLL +LP
Sbjct: 237 VEAASLEEGKAGLKGLHFVSVQENEDS-DPTGFWLLRELP 275
>gi|428772145|ref|YP_007163933.1| hypothetical protein Cyast_0304 [Cyanobacterium stanieri PCC 7202]
gi|428686424|gb|AFZ46284.1| protein of unknown function DUF1092 [Cyanobacterium stanieri PCC
7202]
Length = 288
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 96/284 (33%), Positives = 133/284 (46%), Gaps = 60/284 (21%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL--------QYTKYFPNNVINSITLKEAIVA 152
WELDF SRPI D KK+WE+++C+ + +Y+++ N+ +NSITL AI +
Sbjct: 5 WELDFYSRPIFDENNKKLWEILICESPTDIDSDYDSLFRYSQFCSNSEVNSITLGGAIAS 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I KAC + I PS+ +L WL+ER Y
Sbjct: 65 AMEKAG-ETPSKIRFFRRQMNNMIIKACDDAGIPVFPSRHTYALNRWLDERETDFYPHQE 123
Query: 213 GFQKGSKPLLALDNP--FPMELPDNLFG---DKWAFVQ---------------------- 245
G+Q K ++ P + LPD + G DKWA V
Sbjct: 124 GYQ-APKNTASVQYPQGNAVSLPDAVKGDRTDKWALVSLGSDDFQDMREWAIAFGEAFPL 182
Query: 246 ---------------------LPFSAWMNGLEVCSIETDTARGSLI-LSVGISTRYIYAN 283
LP +AWM+GLE+ + +T + I L G+S +I AN
Sbjct: 183 SLADIEDNTKIPGLIIFSKRALPLAAWMSGLELGYLRLETGQFPRICLETGVSDSWILAN 242
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ T EAE +E K+ G+HFLAIQ +SE GFWLL
Sbjct: 243 ITDDK-TLGEAEGFETTKQQANGVHFLAIQSSPESESFEGFWLL 285
>gi|126659192|ref|ZP_01730330.1| hypothetical protein CY0110_04433 [Cyanothece sp. CCY0110]
gi|126619497|gb|EAZ90228.1| hypothetical protein CY0110_04433 [Cyanothece sp. CCY0110]
Length = 290
Score = 140 bits (353), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 94/284 (33%), Positives = 132/284 (46%), Gaps = 59/284 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSL--------SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ +Y ++ P N +NSI L+EA+
Sbjct: 5 WELDFYSRPILDENNKKQWEVLICETQTDTTESLDKGFRYAQFCPPNTVNSIWLREALEI 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P KIRFFR QM +I KAC++ + PS+R +L WL +R++ Y
Sbjct: 65 AIEKAG-ENPSKIRFFRRQMNNMIVKACEDAGLVASPSRRTYTLNHWLNQRFQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLFG---DKWAFVQ---------------------- 245
G+ + + ++ P + LPD + G DKWAFV
Sbjct: 124 GYDEKAATNASVAYPTLNAIALPDAVRGDKSDKWAFVSLEASAFEDMKEWDIRFGEGFPL 183
Query: 246 ---------------------LPFSAWMNGLE-VCSIETDTARGSLILSVGISTRYIYAN 283
LP + WM+GLE VC + R L L G+S +I AN
Sbjct: 184 ELVDLSPDTKIPGFIIFSQRALPLAGWMSGLELVCLKVQEKPRPILSLETGLSDSWILAN 243
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + +EA+ +E K G+HFLAIQ D E GFWLL
Sbjct: 244 L-TDKSSVAEAQGFEDTKNKAKGVHFLAIQSRPDVETFSGFWLL 286
>gi|307150318|ref|YP_003885702.1| hypothetical protein Cyan7822_0382 [Cyanothece sp. PCC 7822]
gi|306980546|gb|ADN12427.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7822]
Length = 290
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 139/288 (48%), Gaps = 63/288 (21%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG----SLS----LQYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+++C+ LS QY+++ + +NS+ L E +
Sbjct: 3 TIWELDFYSRPILDEDEKKLWEVLICEAPTEPDLSPDSLFQYSEFCSSKTVNSLWLAETL 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
G P+KIRFFR QM +ITKAC+E I PS+R +L W+E+R + Y +
Sbjct: 63 KKAIAQAG-KAPKKIRFFRRQMNNMITKACEEAGIDAAPSRRTYALNQWIEQRMKEFYPQ 121
Query: 211 HPGFQKGSKPLLALDNPFP----MELPDNLF---GDKWAFVQ------------------ 245
G+ + K L+ +P + LPD + GDK+AFV
Sbjct: 122 QEGYDQ--KAALSTSVQYPGLNAIPLPDAIRGDKGDKYAFVSLEAEAFAQLKEWDIAFGE 179
Query: 246 -------------------------LPFSAWMNGLEVCSIE-TDTARGSLILSVGISTRY 279
LP + WM+GLE+ ++ ++ R + L G+S +
Sbjct: 180 AFPLSMLGINPKNKIPGLIIYSSRALPLAGWMSGLEMGYLKFEESDRPIVRLETGVSDSW 239
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
I N NP SEA+ +E KK +HFLA+Q +SE GFWLL
Sbjct: 240 IVINV-TNPQILSEAKGFEETKKRANNVHFLAVQSSPESESFAGFWLL 286
>gi|67922272|ref|ZP_00515785.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
gi|67855848|gb|EAM51094.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
Length = 289
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 92/286 (32%), Positives = 135/286 (47%), Gaps = 59/286 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCD------GSL--SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ GSL +Y K+ P +NS+ L+EAI
Sbjct: 5 WELDFYSRPILDENKKKQWEVLICETQTDSQGSLEDGFRYAKFCPPKTVNSMWLREAIET 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P K+RFFR QM +I KAC++ + PS+R +L WL++R + Y
Sbjct: 65 AMEKTG-EAPSKVRFFRRQMNNMIVKACEDAGLVATPSRRTYTLNHWLKQRQQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLFG---DKWAFVQ---------------------- 245
G+ + + ++ P + LPD + G DKW FV
Sbjct: 124 GYNEAAATNASVAYPALDAIALPDAVRGDRSDKWTFVSLEASAFEEMKEWDIRFGEGFPL 183
Query: 246 ---------------------LPFSAWMNGLEVCSIETDTARGSLI-LSVGISTRYIYAN 283
LP +AWM+GLE+ +++ + ++ L G+S +I AN
Sbjct: 184 ALADLSPDTKIPGFIIYSQRALPLAAWMSGLELVALKFKSKPLPILSLETGLSDSWILAN 243
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
+ +E + +E K G+HFLAIQ D E GFWLL D
Sbjct: 244 L-TDQSGVAEGKGFEDTKNKAEGVHFLAIQPRPDVETFSGFWLLKD 288
>gi|148242688|ref|YP_001227845.1| hypothetical protein SynRCC307_1589 [Synechococcus sp. RCC307]
gi|147850998|emb|CAK28492.1| Conserved hypothetical protein [Synechococcus sp. RCC307]
Length = 283
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 138/277 (49%), Gaps = 49/277 (17%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLK---EAIVAICDDL 157
WELDF SRP+LD GKK WE ++C G S Q+ ++ P + +NSI LK +A D+
Sbjct: 8 WELDFYSRPLLDENGKKRWEALICSGDGSFQWQRFCPADSVNSIWLKTALSDALAAADEA 67
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
P P+++R +RS M+T++ +A + + ++ +PS+RC +L+ WL+ER ++Y G G
Sbjct: 68 SSPAPKRLRCWRSSMRTMVQRAAEGVGLEMVPSRRCYALVEWLQEREASIYPEMEGHLNG 127
Query: 218 -SKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------------- 247
P P+ LP+ + GD W + LP
Sbjct: 128 PLAPPPQPLQAAPLPLPEAVRGDSWGWASLPAASLAEASEWPMDFSGLVPLPNTKAEAMV 187
Query: 248 -------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEA 294
+ W++GLE +E + L+L G+ R++ ++ +N S
Sbjct: 188 PGVRLFSSSRALALAGWLSGLEPVRLEVCGQQ--LVLEAGLEDRWLVSDL-QNGEADSAQ 244
Query: 295 EAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+A EAA++ GGL FLA+Q D+ + GFWLL DLP
Sbjct: 245 QALEAARQEAGGLQFLAVQSGPDATEFAGFWLLRDLP 281
>gi|416389975|ref|ZP_11685424.1| hypothetical protein CWATWH0003_2245 [Crocosphaera watsonii WH
0003]
gi|357264130|gb|EHJ13056.1| hypothetical protein CWATWH0003_2245 [Crocosphaera watsonii WH
0003]
Length = 289
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 135/286 (47%), Gaps = 59/286 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCD------GSL--SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRPILD KK WE+++C+ GSL +Y ++ P +NS+ L+EAI
Sbjct: 5 WELDFYSRPILDENKKKQWEVLICETQTDSQGSLEDGFRYAQFCPPKTVNSMWLREAIET 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ G P K+RFFR QM +I KAC++ + PS+R +L WL++R + Y
Sbjct: 65 AMEKTG-EAPSKVRFFRRQMNNMIVKACEDAGLVATPSRRTYTLNHWLKQRQQDFYPSQE 123
Query: 213 GFQKGSKPLLALDNPF--PMELPDNLFG---DKWAFVQ---------------------- 245
G+ + + ++ P + LPD + G DKW FV
Sbjct: 124 GYNEAAATNASVAYPALDAIALPDAVRGDRSDKWTFVSLEASAFEEMKEWDIRFGEGFPL 183
Query: 246 ---------------------LPFSAWMNGLEVCSIETDTARGSLI-LSVGISTRYIYAN 283
LP +AWM+GLE+ +++ + ++ L G+S +I AN
Sbjct: 184 ALADLSPDTKIPGFIIYSQRALPLAAWMSGLELVALKFKSKPLPILSLETGLSDSWILAN 243
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
+ +E + +E K G+HFLAIQ D E GFWLL D
Sbjct: 244 L-TDQSGVAEGKGFEDTKNKAEGVHFLAIQPRPDVETFSGFWLLKD 288
>gi|224002018|ref|XP_002290681.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220974103|gb|EED92433.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 359
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 141/294 (47%), Gaps = 55/294 (18%)
Query: 89 LDEETDPESITE-WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLK 147
++E T+ + ++E WELD SRP+L KK+WE+++ D S +++ + P+N +NS ++
Sbjct: 67 VEETTNWDKVSEEWELDCYSRPVLVDGKKKLWEILMTDSSGNMKVCRALPSNKVNSREVR 126
Query: 148 EAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
+ I D+ V P IRFFR M +I A E+D+ PS+ +L W+E+R V
Sbjct: 127 RVVEEIIDESEVK-PSTIRFFRGAMFNMINIALSEIDVIAKPSRCTFALAQWIEDRNRDV 185
Query: 208 YTRHPGFQKGSKPLLALDNPF-----PMELPDNLFGDKWAFVQLP--------------- 247
Y + G++ + + F ++LPD L G+K+AFV LP
Sbjct: 186 YPKMEGYRATMSGIGGIGGTFLDIRTAVKLPDALRGEKYAFVGLPLAEFLPGGGIDNNNI 245
Query: 248 ------------------------------FSAWMNGLEVCSIETDTARGSLILSVGIST 277
++W+ G EV ++ D + L++ I
Sbjct: 246 GVGRLCPVDSTLAADSFVQGVVILTPRAKALASWLAGTEVAGLKADLRKRELVMETDIDN 305
Query: 278 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+Y+ A K N EA +E K A GLHF+++Q++ D +D GFWLL ++P
Sbjct: 306 QYLMA--KLNDDQRREAAVYEEGKDALNGLHFISVQKDED-DDPAGFWLLREIP 356
>gi|162606540|ref|XP_001713300.1| hypothetical protein GTHECHR2175 [Guillardia theta]
gi|12580766|emb|CAC27084.1| hypothetical protein [Guillardia theta]
Length = 323
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 133/272 (48%), Gaps = 49/272 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP++ GKK+WEL++ + SLQ + PNN++NS L+ ++ I +
Sbjct: 48 WELDFFSRPVILDDGKKLWELIIVNKDKSLQIIESVPNNMVNSKELRRKLLNIINS-AEK 106
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQ---KG 217
P+ I+FFR+QM +I+ A +LDI PS+R +L + ER +T+Y G++ +
Sbjct: 107 KPDVIKFFRAQMFNMISIALSDLDINVKPSRRTYALFEIIREREKTIYPEMIGYKPYLRE 166
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFV-------------------------------QL 246
K L+L FP +PD L G+ ++FV ++
Sbjct: 167 YKEDLSLKR-FPQRMPDILLGENFSFVLASLEEINVILKDQSVMKDSFKIDENKYDIDKI 225
Query: 247 P-----------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAE 295
P + W+NGLEV SI D + S++L + T++++A K + +
Sbjct: 226 PGIVILSNRANSLANWINGLEVFSISFDQEKSSIVLDCSLDTKFLFA--KIDIKKIQDGT 283
Query: 296 AWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+E K+ G HF+++ L GFWLL
Sbjct: 284 KFENQKRLNSGFHFISVMSGLPENKIYGFWLL 315
>gi|428223149|ref|YP_007107319.1| hypothetical protein Syn7502_03320 [Synechococcus sp. PCC 7502]
gi|427996489|gb|AFY75184.1| Protein of unknown function (DUF1092) [Synechococcus sp. PCC 7502]
Length = 299
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 94/293 (32%), Positives = 136/293 (46%), Gaps = 66/293 (22%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDG----SLSLQYTKYFPNNVINSITLK-EAIVAICD 155
WELDF SRP+LD KKIWEL++C+ S Q+ K +NS L E +AI
Sbjct: 5 WELDFYSRPVLDENQKKIWELLICNSPDRSSQPFQWIKECNAQEVNSGWLATELKLAIAH 64
Query: 156 D--LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
+ LG P+K+RF+R M IIT+ CK+ ++ P PS+R +L WL+ R E++Y + G
Sbjct: 65 NASLGNRDPQKVRFYRPSMTNIITRGCKQAELIPQPSRRLFTLSSWLQTRMESIYPQREG 124
Query: 214 F-QKGSKPL---LALDNPFPMELPDNLFGDKWAFVQ------------------------ 245
F +PL + + P PD L G+ W
Sbjct: 125 FIAPDPQPLPLKIGIQVPVAKPAPDALMGESWLVASLKVADFQEATEWSMDFGELFALDH 184
Query: 246 ------------------LPFSAWMNGLEVCSIETDTARG--SLILSVGISTRYIY---- 281
L +AWM G++ +++ + + G LIL G +R+I
Sbjct: 185 ISDPETLISGLIITSSRALALAAWMAGVDPVALKFEVSEGKIQLILEAGEESRWILTTLN 244
Query: 282 -ANYK------KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
AN K + P S+A ++E AKK G+HF+AIQ L+ E GFWLL
Sbjct: 245 TANPKGQKSAERIPKVISQAGSFEQAKKNSNGIHFIAIQTSLEVEHFTGFWLL 297
>gi|160331683|ref|XP_001712548.1| hypothetical protein HAN_3g413 [Hemiselmis andersenii]
gi|159765997|gb|ABW98223.1| hypothetical protein HAN_3g413 [Hemiselmis andersenii]
Length = 337
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 140/294 (47%), Gaps = 58/294 (19%)
Query: 85 ELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI 144
++S +E + E I WELDF SRP++D GKK+WE+++ D + ++ + PNN++NS
Sbjct: 46 KISMKNELINEEII--WELDFFSRPVVDENGKKLWEIIIVDQKGNFEHIETVPNNLVNSK 103
Query: 145 TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
LK+ I + D P+ I+FFRSQM +I A +LD+ PS+R SL + ER
Sbjct: 104 ELKKRIKILLDKSDKK-PKVIKFFRSQMFNMINIALSDLDLIVRPSRRTFSLYNKISERE 162
Query: 205 ETVYTRHPGFQKGSKPLL------ALDNPFPMELPDNLFGDKWAFV-------------- 244
E +Y KG +P + A P ++PD L G+K+ F
Sbjct: 163 EKIYPN----MKGYRPFMRESDFNASLKKVPQKMPDALRGEKYIFASLSSDELSSINSSD 218
Query: 245 -----------------QLP-----------FSAWMNGLEVCSIETDTARGSLILSVGIS 276
Q+P S W++G+E+C++ D +LIL G+
Sbjct: 219 IAFSGFCPLPAEFDKNQQIPGIVIYSERAKSLSGWLDGVELCNVFCDLENKNLILECGLD 278
Query: 277 TRYIYANY---KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
++++A + K + + E + +E KK G+HF+A+Q + G W L
Sbjct: 279 IQFLFAKFSETKNSKNSNFEPKFFEKNKKKSQGIHFVAVQSYSKQNEIAGIWTL 332
>gi|427711791|ref|YP_007060415.1| hypothetical protein Syn6312_0651 [Synechococcus sp. PCC 6312]
gi|427375920|gb|AFY59872.1| Protein of unknown function (DUF1092) [Synechococcus sp. PCC 6312]
Length = 285
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 123/278 (44%), Gaps = 50/278 (17%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITL----KEAIVAIC 154
T WELDF SRPILD + KK+WE+++C+ L+ Q+ KY N+ L +EA+
Sbjct: 3 TIWELDFYSRPILDAQQKKLWEVLICNRQLTFQFAKYCSGAEANARWLMSAIQEAVQQWQ 62
Query: 155 DDLGVP---IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
+ +P PE+IRFFR M +II + C+ I + S+R L WL ER E VY +
Sbjct: 63 QEFNLPESERPERIRFFRRPMNSIILRGCEAAGIPGLASRRTFGLYEWLAERQEQVYPQT 122
Query: 212 PGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQ-------------------------- 245
PG+Q P L + LPD L G KW FV
Sbjct: 123 PGYQPLIAPPPELPQAKALPLPDALQGQKWQFVSLPAGEFANATEWEIKFGEVFSLSGLD 182
Query: 246 ---------------LPFSAWMNGLEVCSIETDTARG-SLILSVGISTRYIYANYKKNPV 289
LP +AWM+GLE + + L+L G R+ +
Sbjct: 183 PESLIPGIIIYSQRALPLAAWMSGLEPACLSLELGPDPQLVLETGADDRWTLVTLPNKDL 242
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
T+ AA+ LHFLA+Q + ED GFWL+
Sbjct: 243 ITAAEAF-MAAQAQVKNLHFLAVQASPEREDFAGFWLM 279
>gi|86605930|ref|YP_474693.1| hypothetical protein CYA_1247 [Synechococcus sp. JA-3-3Ab]
gi|86554472|gb|ABC99430.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 285
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 131/270 (48%), Gaps = 47/270 (17%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF + P+ D +G+++WEL+VCD S L+ KY N +NS + + + + P
Sbjct: 16 WQMDFNAVPLRDGQGRRVWELLVCDASGQLRQAKYCSNQEVNSTWVAQQLRGYLEAAPQP 75
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P IR FR++M +I+ +AC I +PS+R +L W+ ER E VY + F +P
Sbjct: 76 -PAAIRVFRARMSSILQRACNAAGIPMLPSRRVYALKAWMRERAEQVYPQETQFTYSPEP 134
Query: 221 LLALDNPFPMELPDNLFGDKWAFV------------------------------------ 244
+ + P P+ LPD L G++WAFV
Sbjct: 135 PVEPEPPDPIPLPDKLQGERWAFVTLRARDLREAETWPMEFGELFPVNWEAWAPDTIIPG 194
Query: 245 -------QLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAW 297
LP +AW++G+E + A G L+ G++ Y++A K + +EAE +
Sbjct: 195 LVIASRRALPIAAWLSGMEPAYLH--VAEGRLLFEAGLNDCYLFAQLKDEKL-RAEAEGF 251
Query: 298 EAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
++ G+HFLAIQ + ++ GFWL+
Sbjct: 252 AQRQRQAQGIHFLAIQSDFRAQSFAGFWLM 281
>gi|399949996|gb|AFP65652.1| hypothetical protein CMESO_508 [Chroomonas mesostigmatica CCMP1168]
Length = 336
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 134/281 (47%), Gaps = 54/281 (19%)
Query: 97 SITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
S T WE+DF SRP+L+ GKK+WEL+V D + ++ + PNN+INS LK+ I A+ +
Sbjct: 58 SNTVWEIDFFSRPVLNEDGKKLWELIVVDQKGTFEHIEAIPNNLINSRELKKRINALIEK 117
Query: 157 LGVPIPEK---IRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
P+K I+FFRSQM +I A +L+I PS+R +L + ER E VY + G
Sbjct: 118 S----PQKPILIKFFRSQMFNMINIALSDLNINVRPSRRTFALFEKISEREENVYPKMSG 173
Query: 214 FQKGSKPLLALD--NPFPMELPDNLFGDKWAFVQL------------------------- 246
++ K + D P ++PD L G+K+ F +
Sbjct: 174 YRPFMKEVDVNDMLKKVPQKMPDTLRGEKYVFASISIPELESMVNSGINFGQMCPLPKNF 233
Query: 247 -----------------PFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPV 289
S+W +G+E+ +I D ++++ G+ T+Y++ + + +
Sbjct: 234 DFNQKIPGIVILSERAKSLSSWFDGIELFNIICDLETKNIMIECGLDTQYLFGKFSEETI 293
Query: 290 ---TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
E + +E KK G+HF+A+QE + G W L
Sbjct: 294 QDRVNLEPKLFEKNKKKSQGVHFIAVQEYSKKKPIYGIWTL 334
>gi|407958237|dbj|BAM51477.1| hypothetical protein BEST7613_2546 [Bacillus subtilis BEST7613]
Length = 271
Score = 127 bits (319), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 130/267 (48%), Gaps = 59/267 (22%)
Query: 118 IWELVVCDGSLSLQ--------YTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFR 169
+WE+++C+ S+Q Y++Y P++ +NS+ L++AI A + G +P+KIRFFR
Sbjct: 1 MWEVLICESPQSVQQLPGDLFRYSQYCPSSTVNSVWLRQAIEAAIAEAGQ-MPQKIRFFR 59
Query: 170 SQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNP-- 227
QM +I+KAC+E I P PS+R L WL +R E Y + PG+ ++ P
Sbjct: 60 RQMNNMISKACEEAGIPPAPSRRTYVLEQWLGDRLENFYPQQPGYDPKLASSTSVQYPEL 119
Query: 228 FPMELPDNLFGDK---WAFVQ--------------------------------------- 245
+ LPD + GD+ WA V
Sbjct: 120 NAIALPDAVRGDRGDQWALVSLAAADFNDLPDWEISFGESFPLSSYNLSPDSRIPGLILF 179
Query: 246 ----LPFSAWMNGLEVCSIETDT-ARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAA 300
LPF+AW++GLE+ ++ +T R + L G S +I AN + + EA+ +E
Sbjct: 180 SPRALPFAAWLSGLELGYLQYNTDPRPIMRLETGASDSWIVANV-TDKTSEQEAQGFEQT 238
Query: 301 KKACGGLHFLAIQEELDSEDCVGFWLL 327
KK G+HFLAIQ DSE GFWLL
Sbjct: 239 KKLAQGIHFLAIQTSPDSETFAGFWLL 265
>gi|443478232|ref|ZP_21068010.1| protein of unknown function DUF1092 [Pseudanabaena biceps PCC 7429]
gi|443016503|gb|ELS31148.1| protein of unknown function DUF1092 [Pseudanabaena biceps PCC 7429]
Length = 284
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 120/277 (43%), Gaps = 51/277 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP+LD KK+WEL++CD ++ + P+ +NS L + + C
Sbjct: 5 WELDFYSRPLLDANNKKVWELLICDRDRQFEWVRECPSTEVNSEWLAKQLTD-CVATNGQ 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK---G 217
P KIRFFR M II + CK I S+R ++ WL ER ++Y GFQ
Sbjct: 64 TPIKIRFFRPSMTNIIMRGCKLAGITGQASRRVFTMSAWLAERMASIYPNRDGFQAVDPN 123
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQL------------------------------- 246
PL L P +PD L G++W V L
Sbjct: 124 PLPLKVLAAQDPKPVPDALMGEQWISVSLKASDFEEAKEWSMDFSELLDVSHLDPDTIVA 183
Query: 247 ----------PFSAWMNGLEVCSIETDTA----RGSLILSVGISTRYIYANYK--KNPVT 290
+AWM+G++ I+ + R + L R++ AN + K+ +
Sbjct: 184 GIIIISARATALAAWMSGVDPVFIKFERNLLGDRTQMQLEASADARWVLANLQAPKDKLA 243
Query: 291 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
++ +E AK+ G HFLAIQ + E GFW+L
Sbjct: 244 IAQGADFEKAKQKSQGFHFLAIQTNAEEEHFAGFWML 280
>gi|443322105|ref|ZP_21051138.1| Protein of unknown function (DUF1092) [Gloeocapsa sp. PCC 73106]
gi|442788158|gb|ELR97858.1| Protein of unknown function (DUF1092) [Gloeocapsa sp. PCC 73106]
Length = 297
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 135/299 (45%), Gaps = 73/299 (24%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG--------SLSLQYTKYFPNNVINSITLKEAI 150
T WELDF SRPILD KK+WE+++C+ L +Y ++ P+ +NS+ L EAI
Sbjct: 3 TIWELDFYSRPILDENQKKLWEVLICESPQQISTNPDLIYKYAQFCPSTSVNSLWLAEAI 62
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ G IP KIRFFR QM+ +ITKAC+E+ + P+PS+R +L W+ ER + Y
Sbjct: 63 KQAIAESG-QIPSKIRFFRRQMKNMITKACEEVAVIPVPSRRTHTLNHWIVERLKNHY-- 119
Query: 211 HPGFQKGSKPLLALDNPFP----MELPDNLF---GDKWAFVQL----------------- 246
P + +P + LPD + GDKW V L
Sbjct: 120 -PTLDNYDSQAINASVQYPPLNAIALPDAVRGDKGDKWTLVTLPVQDFIEMDQWDIAFGE 178
Query: 247 --------------------------PFSAWMNGLEV--CSIE--TDTAR------GSLI 270
P + W++GLE+ C +E T + R L
Sbjct: 179 AFPLSLYDLDPQLSIPGVIIFSNRAIPLAGWLSGLEIGSCYVEDITPSTREIVRQLSRLR 238
Query: 271 LSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
L G+S +I A+ + SEA + AK +HFLAIQ +S+ G WLL D
Sbjct: 239 LETGLSDSWILADI-TDEQGQSEARGFTKAKNLVQQIHFLAIQSSPESDSFAGLWLLKD 296
>gi|317969607|ref|ZP_07970997.1| hypothetical protein SCB02_08730 [Synechococcus sp. CB0205]
Length = 299
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 129/286 (45%), Gaps = 56/286 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG------SLSLQYTKYFPNNVINSITLKEAI-- 150
+WELD+ SRPIL+ GKK WEL++C + Q+ P + +NS LK A+
Sbjct: 15 ADWELDYYSRPILEEDGKKRWELLICSSPNAENPGRAFQWVLKCPASSVNSQWLKSALEQ 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
+ D G P KIR +RS M+T++ +A ++L ++ +PS+RC +L+ WL+ER TVY
Sbjct: 75 ALEQADSEGFDPPRKIRCWRSSMRTMVQRASEQLGLELVPSRRCYALVEWLQEREATVYP 134
Query: 210 RHPGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQLPFSA------------------ 250
G+ G P P + LP+ GD W++ LP A
Sbjct: 135 EEEGYMAGPLAPPPQPIQPVAVPLPEAARGDSWSWASLPIGALREAMTWDTSFAGLVPLP 194
Query: 251 -------------------------WMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
W++GLE +E + L+L G R++ +
Sbjct: 195 ESLDDELMVSGLRLFSASRSLAIAGWVSGLEPVRLEVCGQQ--LVLEAGQEDRWLLGQLE 252
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+ + A AA+ GG+ FLAIQ D GFW+L DLP
Sbjct: 253 SDEAEAAAAAF-LAARGQVGGVQFLAIQSSPDQPGFDGFWILRDLP 297
>gi|87302524|ref|ZP_01085341.1| hypothetical protein WH5701_11459 [Synechococcus sp. WH 5701]
gi|87282868|gb|EAQ74825.1| hypothetical protein WH5701_11459 [Synechococcus sp. WH 5701]
Length = 299
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 134/287 (46%), Gaps = 60/287 (20%)
Query: 100 EWELDFCSRPILDIRGKKIWELV-----VCDGSL-SLQYTKYFPNNVINSITLKEAI--- 150
+WELDF SRP+LD GKK W+L+ V +GS ++ K P + +NS+ L+ A+
Sbjct: 16 DWELDFFSRPVLDPGGKKRWDLLITATPVSEGSQPRFRWVKNCPASTVNSVWLQGALNEA 75
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
++ D G+ P ++R +R+ M+T++ +A + + ++ IPS+RC +L WL ER VY
Sbjct: 76 LSAAADQGLGAPRRLRCWRATMRTMVQRAAEAIGLEVIPSRRCYALAEWLSERERDVYPA 135
Query: 211 HPGFQKGSKPLLALDNP---FPMELPDNLFGDKWAFVQLPFSA----------------- 250
G+ G PL P P+ LP+ GD W +V LP A
Sbjct: 136 EEGYMAG--PLAPPPQPMRSLPLPLPEAARGDSWDWVSLPLGALREASEWEIGFEGLFPL 193
Query: 251 --------------------------WMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
W+ GLE +E + SL+L G+ R+ A
Sbjct: 194 PADLPDDLMVPGLRLFSRTRSLAIAGWIAGLEPARLEMEGT--SLVLEAGLEDRWRLATL 251
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+ + AA++A GL F+A+Q E SE GFWLL D+P
Sbjct: 252 AEQEASEVAEAF-AAAREAAAGLQFIAVQSEAQSERFDGFWLLRDMP 297
>gi|397628715|gb|EJK69024.1| hypothetical protein THAOC_09759, partial [Thalassiosira oceanica]
Length = 382
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 125/267 (46%), Gaps = 54/267 (20%)
Query: 115 GKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQT 174
GKK+WE+++ D S +L+ + P+N +NS +++ + + + V P IRFFR M
Sbjct: 117 GKKLWEILITDSSGNLRVCRSLPSNKVNSREVRKVVEDVIGESEVK-PGTIRFFRGAMFN 175
Query: 175 IITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPF-----P 229
+I A E+D+ PS+ +L WLEER VY + G+Q L + F
Sbjct: 176 MINIALSEIDVVAKPSRCTFALAQWLEERNRDVYPQMEGYQAAKARLGGVGGTFLDIRTA 235
Query: 230 MELPDNLFGDKWAFVQLP------------------------------------------ 247
++LPD L G+K+AFV LP
Sbjct: 236 VKLPDALRGEKYAFVGLPLAEFIEGGSVNNENIGVGRLCPVDSTLPADSFVQGVVILTSR 295
Query: 248 ---FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKAC 304
++W+ G EV I+ D + L++ I +Y+ A K + EA +E K +
Sbjct: 296 AKALASWLAGTEVGGIKADIRKRELVMETDIDNQYLMA--KLDDDQRREAANFEEGKDSL 353
Query: 305 GGLHFLAIQEELDSEDCVGFWLLLDLP 331
GLHF+++QE+ +++D GFWLL ++P
Sbjct: 354 NGLHFVSVQED-ENDDPAGFWLLREIP 379
>gi|86608615|ref|YP_477377.1| hypothetical protein CYB_1137 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86557157|gb|ABD02114.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 293
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 132/278 (47%), Gaps = 55/278 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF + P+ D + +++WEL+VCD + + +Y N +NS + + + + P
Sbjct: 16 WQMDFNAVPLRDEQNRRVWELLVCDPTGRFRQAQYCSNQEVNSTWVARQLRSYLEAAPQP 75
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P IR FR++M +I+ +AC + I +PS+R +L W+ ER E VY + F +P
Sbjct: 76 -PSAIRVFRARMSSILQRACDAVGIPMLPSRRVYTLKAWMRERAEQVYPQETQFTYSPEP 134
Query: 221 LLALDNPFPMELPDNLFGDKWAFV------------------------------------ 244
+ D P P+ LPD L G++WAFV
Sbjct: 135 PVDPDPPDPIRLPDKLQGERWAFVTLRAEDLREADAWPIEFGELFPVAWDTLTPDTLAPV 194
Query: 245 ---------------QLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPV 289
LP +AWM+G+E + A G L+L G++ Y++A + +
Sbjct: 195 VRSTLIPGLVITSQRALPMAAWMSGMEPAYL--SVADGRLLLEAGLNDCYLFAQLRDETL 252
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
T EAE + ++ GLHFLAIQ +L ++ GFWL+
Sbjct: 253 RT-EAEVFAQRQQQAQGLHFLAIQTDLRAQSFAGFWLM 289
>gi|78185205|ref|YP_377640.1| hypothetical protein Syncc9902_1638 [Synechococcus sp. CC9902]
gi|78169499|gb|ABB26596.1| conserved hypothetical protein [Synechococcus sp. CC9902]
Length = 293
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 130/288 (45%), Gaps = 60/288 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVC------DGSLSLQYTKYFPNNVINSITLKEAIV- 151
++WELDF SRPILD G+K WEL++ DG ++ K P++ +NSI L A+
Sbjct: 9 SDWELDFYSRPILDADGRKRWELLITTTPSSEDGDTPFRFAKVCPSSEVNSIWLNTALAE 68
Query: 152 ----AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
A+ + G P+ ++R +RS M+T++ +A E DI+ I S+R +LL WLE R V
Sbjct: 69 ARESALQEGYGAPV--RLRCWRSSMRTMVQRAATEQDIEVISSRRTFALLDWLEHREREV 126
Query: 208 YTRHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLP------------------- 247
Y + GF P A P+ LP+ + GD W++ LP
Sbjct: 127 YPKEEGFMAGPLAPPPAPVVTPPIPLPEEVQGDAWSWATLPAGLLRDAGDWPMSFSGLLP 186
Query: 248 ------------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYAN 283
+ W+ GLE + + + LIL G R++ ++
Sbjct: 187 VPTNLEDEAQVPGLRLFSRTRSLAMAGWLGGLEPVRLLVEGRQ--LILEAGQDDRWLVSD 244
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
S A E + + GL F+AIQ D + GFW++ D+P
Sbjct: 245 L-DGEAAKSITSALETCQTSVRGLQFIAIQASPDEQAFAGFWMMRDIP 291
>gi|318041062|ref|ZP_07973018.1| hypothetical protein SCB01_05109 [Synechococcus sp. CB0101]
Length = 305
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 132/301 (43%), Gaps = 62/301 (20%)
Query: 90 DEETDPESIT------EWELDFCSRPILDIRGKKIWELVVCD------GSLSLQYTKYFP 137
D+ DP T +WELD+ SRPIL+ GKK WEL++C ++ +
Sbjct: 6 DQVADPTRRTAAPLQLDWELDYYSRPILEPDGKKRWELLICSTPAPGASGPGFRFVQNCS 65
Query: 138 NNVINSITLKEAIVAICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCL 194
+ +NS LK+A+ + G P K+R +R+ M+T++++A ++L ++ IPS+RC
Sbjct: 66 ASSVNSQWLKQALEQAMEQAAAEGYAAPRKLRCWRASMRTMVSRAAEQLSLELIPSRRCY 125
Query: 195 SLLLWLEERYETVYTRHPGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQLPFSA--- 250
+L+ WL+ER TVY G+ G P P + LP+ GD W++ LP A
Sbjct: 126 ALVEWLQERQATVYPAEEGYMAGPLAPAPLPIQPVAVPLPEAARGDSWSWASLPLGALRE 185
Query: 251 ----------------------------------------WMNGLEVCSIETDTARGSLI 270
W+ GLE +E L+
Sbjct: 186 AAEWDVSFAGLVPLDGTGDDDVMVSGLRLFSATRSLAIAGWIAGLEPVRLEVSG--NQLV 243
Query: 271 LSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
L G+ R++ N + + AA++ GG+ FLA+Q GFW+L DL
Sbjct: 244 LEAGLEDRWLLGNLEAEEAEAAAQAF-RAARQQAGGVQFLAVQSSDAQNGFDGFWVLRDL 302
Query: 331 P 331
P
Sbjct: 303 P 303
>gi|427701381|ref|YP_007044603.1| hypothetical protein Cyagr_0042 [Cyanobium gracile PCC 6307]
gi|427344549|gb|AFY27262.1| Protein of unknown function (DUF1092) [Cyanobium gracile PCC 6307]
Length = 296
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 132/286 (46%), Gaps = 58/286 (20%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLSLQ-------YTKYFPNNVINSITLKEAIVA 152
+WELD+ SRPIL+ GKK WEL++C + LQ ++ P +NS L+ AI A
Sbjct: 13 DWELDYYSRPILEADGKKRWELLICS-TAGLQPTPDPFRWSMDCPAASVNSQWLRGAIEA 71
Query: 153 ICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
G P ++R +R M+ ++ +A + L ++ +PS+RC L+ WL ER +VY
Sbjct: 72 ALAAAAEQGYGPPRRLRCWRGSMRAMVQRAAEGLGLELVPSRRCYGLVEWLRERQASVYP 131
Query: 210 RHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAF---------------------VQLP 247
PG+ G P P + LP+ GD+W++ V LP
Sbjct: 132 LEPGYMAGPLAPPPQPIPPVALPLPEAARGDRWSWATLTAATLAEAGGWEIAFPGLVALP 191
Query: 248 ----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+ W++GLE +E G L+L G+ R+I A
Sbjct: 192 SAIDPATPVPGIRLFSRRRALAIAGWLSGLEPTRLEVSA--GQLVLEAGLEDRWILARLP 249
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+ ++ +A+ A++ GGL F+AIQ ++ GFWLL DLP
Sbjct: 250 EEEARLAQ-QAFAEARERAGGLQFIAIQASEEASTLEGFWLLRDLP 294
>gi|33863502|ref|NP_895062.1| hypothetical protein PMT1234 [Prochlorococcus marinus str. MIT
9313]
gi|33640951|emb|CAE21409.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 299
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 130/293 (44%), Gaps = 55/293 (18%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLK 147
TD T+WELDF SRPIL+ GKK WEL++ G+ ++ K P +NS+ L
Sbjct: 10 TDQHPKTDWELDFYSRPILESDGKKRWELLISSSQDPSGTAPFRWVKRCPAGEVNSLWLT 69
Query: 148 EAIVAICDD---LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
+A+ D G P ++R +R M+T++ +A EL I+ IPS+R +LL WL ER
Sbjct: 70 DALREALKDSQEQGWEAPLRLRCWRISMRTMVQRAAAELGIEVIPSRRTYALLDWLAERE 129
Query: 205 ETVYTRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLP---------------- 247
VY G+ G P P+ LP+ + GD W++ LP
Sbjct: 130 RDVYPLEEGYMAGPLAPPPTPIPTPPVPLPEAVRGDAWSWASLPLGLLREAQEWPIGFGG 189
Query: 248 ---------------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYI 280
+ W+ GLE + D + L+L G R++
Sbjct: 190 LLPVGANDNDNIPVPGVRMFSQTRALALAGWLGGLEPVCLAVDGTQ--LMLEAGQDDRWL 247
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+ T + EA ++A GGL F+++Q + + GFW+L DLP P
Sbjct: 248 VTDLDDKTATAVQQSLLEAREQA-GGLQFISVQTSPEEKRFAGFWMLRDLPQP 299
>gi|116072198|ref|ZP_01469465.1| hypothetical protein BL107_10441 [Synechococcus sp. BL107]
gi|116064720|gb|EAU70479.1| hypothetical protein BL107_10441 [Synechococcus sp. BL107]
Length = 293
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 131/288 (45%), Gaps = 60/288 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVC------DGSLSLQYTKYFPNNVINSITLKEAIV- 151
++WELDF SRPIL G+K WEL++ DG ++ K P+ +NS+ L A+
Sbjct: 9 SDWELDFYSRPILGADGRKRWELLITTTPSSEDGDSPFRFAKVCPSTEVNSLWLSSALSE 68
Query: 152 ----AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
A+ G P+ ++R +RS M+T++ +A E DI+ I S+R +LL WLE+R V
Sbjct: 69 AREQALQAGYGAPV--RLRCWRSSMRTMVQRAATEQDIEVISSRRTFALLDWLEQREREV 126
Query: 208 YTRHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLP------------------- 247
Y + GF P A P+ LP+ + GD W++ LP
Sbjct: 127 YPKEEGFMAGPLAPPPAPVQTPPIPLPEEVQGDAWSWATLPAGLLRDADDWPMSFSGLLP 186
Query: 248 ------------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYAN 283
+ W+ GLE + + + LIL G R++ ++
Sbjct: 187 VPTNLEDEAQVPGLRLFSQTRSLAMAGWLGGLEPVRLLVEGRQ--LILEAGQDDRWLVSD 244
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+ S A A E ++ + GL F+AIQ D + GFW++ D+P
Sbjct: 245 L-DGEASKSIASALETSQTSVRGLQFIAIQASPDEQAFAGFWMMRDIP 291
>gi|82799327|gb|ABB92253.1| conserved hypothetical protein [uncultured marine type-A
Synechococcus 5B2]
Length = 293
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 135/286 (47%), Gaps = 58/286 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVV-----CDGS-LSLQYTKYFPNNVINSITLKEAIVA 152
+WELDF SRPIL+ G+K WEL++ D S + ++ K P+ +NS+ L A+
Sbjct: 9 ADWELDFYSRPILEADGRKCWELLITATPAADASEQTFRFAKRCPSGEVNSLWLSTALKE 68
Query: 153 ICD---DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
D + G P ++R +RS M+T++ +A +LD++ I S+R SLL WL++R + VY
Sbjct: 69 ARDRAVEAGWSEPRRLRCWRSSMRTMVQRAAADLDLEMIASRRTYSLLDWLQQREQEVYP 128
Query: 210 RHPGFQKG--SKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------- 247
+ GF G + P + + P + LP+ + GD W++ LP
Sbjct: 129 QEEGFMAGPLAPPPVPIATP-AVPLPEEVQGDAWSWASLPAALLRDACDWPIGFSGLLPL 187
Query: 248 -----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
+ W+ GLE + + + L+L G R++ ++
Sbjct: 188 PVALEDDQAVPGLRLFSNSRALAMAGWLGGLEPVRLMVEGRQ--LVLEAGQDDRWLVSDL 245
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ + +AE +K+ GL F+AIQ + + GFW++ D+
Sbjct: 246 DPSTAASIKAEL-NQSKEHAKGLQFIAIQSSPEEQAFAGFWMMRDI 290
>gi|284928976|ref|YP_003421498.1| hypothetical protein UCYN_04030 [cyanobacterium UCYN-A]
gi|284809435|gb|ADB95140.1| Protein of unknown function (DUF1092) [cyanobacterium UCYN-A]
Length = 293
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 129/285 (45%), Gaps = 58/285 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSL--------SLQYTKYFPNNVINSITLKEAIVA 152
WELDF SRP KK+WE+++C+ + ++++ P++ +NSI L++AI
Sbjct: 5 WELDFYSRPNFFKHNKKLWEVLICETPMYSNKSFNDCFKFSQLCPSSTVNSIWLRQAIEK 64
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEER--------- 203
G P+ IRFFR QMQ +I KACK+ +I+ IPS+R +L W+++R
Sbjct: 65 AMKKAGES-PDLIRFFRFQMQNMIIKACKDAEIEAIPSRRTFALNYWIDKREKQFKLVKN 123
Query: 204 -----YETVYTRHPGFQKGSKPLLALDNPFPMELPDNL--------------FGDKWAFV 244
T+ Q S P DN F +L FG+ +A
Sbjct: 124 RINNTVSTINRTDTDSQMVSLPDTLKDNQFSKYFCVDLKVSDFNHIDEWDIGFGENYAIS 183
Query: 245 -------------------QLPFSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANY 284
LP +AW++G E+ S+ D S L L G++ + + N
Sbjct: 184 PYGLSSHTIIPGLVFFSPRALPIAAWLSGFELVSLRFDRKNSSTLYLETGLNDKSVLINL 243
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
+ EA+ +E K+ G+HFLAIQ D E GFWLL D
Sbjct: 244 -NDIRLIQEAKNFERKKENSKGIHFLAIQPSPDVELFSGFWLLKD 287
>gi|22299529|ref|NP_682776.1| hypothetical protein tlr1986 [Thermosynechococcus elongatus BP-1]
gi|22295712|dbj|BAC09538.1| tlr1986 [Thermosynechococcus elongatus BP-1]
Length = 287
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 126/284 (44%), Gaps = 52/284 (18%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINS----ITLKEAIVAIC 154
T WELDF SRP++D KKIWEL+VCD Q++K N+ L+EA+
Sbjct: 3 TIWELDFYSRPLVDENNKKIWELLVCDRQQQFQFSKTCAGAEANARWLAAALEEAMDQWR 62
Query: 155 DDLGVP---IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
LG+ P+++RFFR M +IIT+ + + +PS+R +L WL +R Y
Sbjct: 63 QQLGLAEGVQPQRVRFFRRAMSSIITRGGEAAGLVMVPSRRTFALYDWLRDRATNFYPTL 122
Query: 212 PGFQK---------------------GSK------PLLALDNPFPMELP----------- 233
P +Q G + PL + ELP
Sbjct: 123 PNYQADLATPPQLPPPAPQPLPPALQGDRWQLSGLPLGEIKTAAEWELPFGEVPPLPFLT 182
Query: 234 ---DNLFGDKWAFVQ--LPFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKN 287
D L + Q LP + W++GLE S+ +T + LIL G S R+I +N
Sbjct: 183 LNDDTLLPGLIIYSQRALPLAGWLSGLEPASLSFEETPQPLLILETGASDRWILIR-GRN 241
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
P E A++ A GLHF+A++E+ E GFWLL P
Sbjct: 242 PQIQKELAAFKDACTQSQGLHFIAVKEQPTQETLQGFWLLQQTP 285
>gi|33239980|ref|NP_874922.1| hypothetical protein Pro0529 [Prochlorococcus marinus subsp.
marinus str. CCMP1375]
gi|33237506|gb|AAP99574.1| Uncharacterized protein [Prochlorococcus marinus subsp. marinus
str. CCMP1375]
Length = 297
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 133/288 (46%), Gaps = 55/288 (19%)
Query: 95 PESITEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLKEA 149
P + +WE+DF SRP+++I GKK WEL++ G+ + ++ K P N +NSI L EA
Sbjct: 10 PLNKADWEVDFYSRPVIEIDGKKRWELLISSTQDFSGAETFRWEKKCPANEVNSIWLSEA 69
Query: 150 IVAICDD---LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYET 206
+ +D G P+++R +R+ M+T+ITKA +++ I+ I S+R SL WL +R +
Sbjct: 70 LKEALEDSSKQGWAFPKRLRCWRTSMKTMITKASEKVGIEVIESRRTFSLHEWLLQRDKD 129
Query: 207 VYTRHPGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQ-------------------- 245
VY G+ P ++D P LP+ L GD W+F
Sbjct: 130 VYPNEEGYISAPIPPNPSIDFTQPEPLPEALRGDAWSFSSLSIEAIRGAREWPMEFNALL 189
Query: 246 -----------------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYA 282
LP SAW++GLE + + L+L G + ++
Sbjct: 190 PIKKSLEGNIEIPGLRMFSKTRALPLSAWLSGLEPVRLLVEN--NQLLLESGQESLWLVT 247
Query: 283 NYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ K+ ++ K G+ F+AIQ + E GFW+L D+
Sbjct: 248 DMSKD-YAEKVKDSLINGKANADGIQFIAIQTSPEEESFTGFWMLKDI 294
>gi|323451508|gb|EGB07385.1| hypothetical protein AURANDRAFT_27892 [Aureococcus anophagefferens]
Length = 345
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 126/283 (44%), Gaps = 58/283 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
+EWELD SRP+L ++GKK+WEL++ D S + P +NS+ +++AI +
Sbjct: 61 SEWELDCFSRPVL-VKGKKLWELLITDASGQWRDVVALPATGVNSVAVRKAIEDVIARAP 119
Query: 159 VPIPEKIRFFRSQMQTIITKACKEL-----DIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
V P IRFFR QM ++T A + ++ PS+ +L W+EER VY G
Sbjct: 120 VK-PTVIRFFRRQMLNMLTIALNGVAANRPTLRVTPSRATHALYDWIEEREADVYPGMEG 178
Query: 214 FQKGSKPLLALDNPFPM---ELPDNLFGDKWAFVQLPFSAWMNG---------------- 254
+ G+ P+ LP+ L G+++AFV LP S ++G
Sbjct: 179 YSPGAGAATRDRMTAPVTASRLPEGLRGEQYAFVTLPLSEVLSGGGITEENVGVGKLINV 238
Query: 255 -----------------------------LEVCSIETDTARGSLILSVGISTRYIYANYK 285
E+ + D A+ L+L V + ++ A
Sbjct: 239 KPAYEVDALLPGIAILTRRSDALAMSLASTELAGVRADAAQRQLVLDVALDESFLVAKLD 298
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQE-ELDSEDCVGFWLL 327
+ EA A+E AK+ GGLHF+ +Q E D + GFWLL
Sbjct: 299 DD--QRVEAAAFEKAKQGLGGLHFVVVQSPEDDGVEPAGFWLL 339
>gi|116075331|ref|ZP_01472591.1| hypothetical protein RS9916_27264 [Synechococcus sp. RS9916]
gi|116067528|gb|EAU73282.1| hypothetical protein RS9916_27264 [Synechococcus sp. RS9916]
Length = 299
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 79/289 (27%), Positives = 128/289 (44%), Gaps = 59/289 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVC-----DGSLSLQYTKYFPNNVINSITLKEAI--- 150
+WELDF SRPIL+ GKK WEL++ G + +Y + P +NS L EA+
Sbjct: 16 ADWELDFYSRPILEPDGKKRWELLISSTPELGGGEAFRYARRCPAGEVNSTWLTEALRDA 75
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ + G P ++R +RS M+T++ +A LD++ +PS+R +L+ W+ ER VY +
Sbjct: 76 MTAAEADGWRAPRRLRSWRSAMRTMVQRAAAALDLEMVPSRRTYALIDWMAERDREVYPK 135
Query: 211 HPGFQKG--SKPLLALDNPFPMELPDNLFGDKWAFVQLP--------------------- 247
G+ G + P +A+ P + LP+ + GD ++ LP
Sbjct: 136 EEGYMAGPLAPPPVAVSTPA-IPLPEAVRGDALSWANLPLGSLAEAKEWPLGFNGLLPIP 194
Query: 248 ----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+ W+ GLE + D + LIL G ++ +
Sbjct: 195 EGLDPAQPIPGLRLFSSTRALALAGWLGGLEPVRLRIDGRQ--LILDAGQDDSWLVTDL- 251
Query: 286 KNPVTTSEA-EAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+P + A +A + GL F+A+Q D GFW+L D P P
Sbjct: 252 -DPASAEAAKQALAETRTTASGLQFIAVQTTPDHPRFEGFWMLRDQPEP 299
>gi|242040489|ref|XP_002467639.1| hypothetical protein SORBIDRAFT_01g031365 [Sorghum bicolor]
gi|242092100|ref|XP_002436540.1| hypothetical protein SORBIDRAFT_10g004402 [Sorghum bicolor]
gi|241914763|gb|EER87907.1| hypothetical protein SORBIDRAFT_10g004402 [Sorghum bicolor]
gi|241921493|gb|EER94637.1| hypothetical protein SORBIDRAFT_01g031365 [Sorghum bicolor]
Length = 136
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 55/91 (60%), Positives = 66/91 (72%), Gaps = 4/91 (4%)
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
P P RF + + +++ EL + S RC+SLLLWLEERYE VY+RHP FQ G++
Sbjct: 49 PHPHGRRFAYATYELLLSPDRIELLL----SGRCVSLLLWLEERYEVVYSRHPEFQAGTR 104
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSA 250
PLLALDNPFP LP+NLFGDKWAFVQLPFS
Sbjct: 105 PLLALDNPFPTTLPENLFGDKWAFVQLPFSG 135
>gi|124022483|ref|YP_001016790.1| hypothetical protein P9303_07741 [Prochlorococcus marinus str. MIT
9303]
gi|123962769|gb|ABM77525.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9303]
Length = 299
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 131/294 (44%), Gaps = 57/294 (19%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLK 147
TD T+WELDF SRPIL+ GKK WEL++ G+ ++ K P +NS+ L
Sbjct: 10 TDQHPKTDWELDFYSRPILESDGKKRWELLISSSQDPSGTAPFRWVKRCPAGEVNSLWLT 69
Query: 148 EAIVAICDD---LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
+A+ D G P ++R +R M+T++ +A EL I+ IPS+R +LL WL ER
Sbjct: 70 DALREALKDSQGQGWEAPLRLRCWRISMRTMVQRAAAELGIEVIPSRRTYALLDWLAERE 129
Query: 205 ETVYTRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLP---------------- 247
VY G+ G P P+ LP+ + GD W++ LP
Sbjct: 130 RDVYPLEEGYMAGPLAPPPTPIPTPPVPLPEAVRGDAWSWASLPLGLLREAQEWPIGFGG 189
Query: 248 ---------------------------FSAWMNGLE-VCSIETDTARGSLILSVGISTRY 279
+ W+ GLE VC + T L+L G R+
Sbjct: 190 LLPVGANDNDNIPVPGVRMFSQTRALALAGWLGGLEPVCLVVDGT---QLMLEAGQDDRW 246
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+ + + + E+ ++A GGL F+++Q + + GFW+L DLP P
Sbjct: 247 LVTDLDEKTAKAVQQSLLESREQA-GGLQFISVQTSPEEKRFAGFWMLRDLPQP 299
>gi|242085770|ref|XP_002443310.1| hypothetical protein SORBIDRAFT_08g017335 [Sorghum bicolor]
gi|241944003|gb|EES17148.1| hypothetical protein SORBIDRAFT_08g017335 [Sorghum bicolor]
Length = 136
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 52/91 (57%), Positives = 66/91 (72%), Gaps = 4/91 (4%)
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
P P RF + + +++ EL + S RC+SLLLWLEERYE VY+RHP FQ G++
Sbjct: 49 PHPHGRRFAYATYELLLSPDRIELLL----SGRCVSLLLWLEERYEVVYSRHPEFQAGTR 104
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSA 250
P+LALDNP+P LP+NLFGDKWA+VQLPFS
Sbjct: 105 PMLALDNPYPTTLPENLFGDKWAYVQLPFSG 135
>gi|242075630|ref|XP_002447751.1| hypothetical protein SORBIDRAFT_06g015032 [Sorghum bicolor]
gi|241938934|gb|EES12079.1| hypothetical protein SORBIDRAFT_06g015032 [Sorghum bicolor]
Length = 159
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 55/92 (59%), Positives = 69/92 (75%), Gaps = 6/92 (6%)
Query: 185 IKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFV 244
I+ + S R +SLLLWLE+RYE VY+RHP FQ G++PLLALDNPFP LP+NLFGDKWAFV
Sbjct: 71 IELLLSGRSVSLLLWLEKRYEVVYSRHPEFQAGTRPLLALDNPFPTTLPENLFGDKWAFV 130
Query: 245 QLPFSAWMNGLEVCSIETDTAR-GSLILSVGI 275
QLPFSA+ C +E+ R G+ + S G+
Sbjct: 131 QLPFSAFW-----CEVESLGRRYGAGLGSAGV 157
>gi|352094718|ref|ZP_08955889.1| protein of unknown function DUF1092 [Synechococcus sp. WH 8016]
gi|351681058|gb|EHA64190.1| protein of unknown function DUF1092 [Synechococcus sp. WH 8016]
Length = 303
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 132/286 (46%), Gaps = 55/286 (19%)
Query: 100 EWELDFCSRPILDIRGKKIWELVV----CDGSLS-LQYTKYFPNNVINSITLKEAI---V 151
+WELDF SRPIL+ GKK WEL++ C+G+ S ++ K P + +NS L A+ +
Sbjct: 21 DWELDFYSRPILEPDGKKRWELLIVSSPCEGTTSSFRFEKRCPASSVNSTWLTSALTEAM 80
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
A G +P K+R +RS M+T++ +A EL ++ +PS+R +L W+ ER + +Y +
Sbjct: 81 AAAQQQGWAVPRKLRSWRSSMRTMVQRAASELGLEMVPSRRTYALFDWIAEREQDLYPKE 140
Query: 212 PGFQKGSKPL---LALDNPFPMELPDNLFGDKWAFVQLPFS------AW----------- 251
G+ G PL + P LP+++ GD W + +LP + W
Sbjct: 141 EGYMAG--PLAPPPVPVSTPPRPLPESVRGDAWNWAELPAASLREATGWPIGFRGLLPVP 198
Query: 252 --------MNGLEVCS----------------IETDTARGSLILSVGISTRYIYANYKKN 287
+ GL + S + + L+L G ++ ++
Sbjct: 199 NTINDDQIIPGLRLFSQTRGLALAGLLGGIEPVRLRVSGTQLLLEAGQDDCWLVSDLSSE 258
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
A +AA+ A GL F+A+Q D+E GFW+L D P
Sbjct: 259 EAVHVSALMTQAAEHA-DGLQFIAVQTSPDAERFEGFWMLRDQAEP 303
>gi|90655491|gb|ABD96331.1| unknown [uncultured marine type-A Synechococcus GOM 3O6]
Length = 293
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 134/287 (46%), Gaps = 60/287 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVV-----CDGS-LSLQYTKYFPNNVINSITLKEAI-- 150
+WELDF SRPIL+ G+K WEL+V D + + +++K P+ +NS+ L A+
Sbjct: 9 ADWELDFYSRPILEADGRKRWELLVTATPAADATEIPFRFSKCCPSGEVNSLWLTAALGE 68
Query: 151 VAICD-DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
C + G P P ++R +RS M+T++ +A ELD++ I S+R +LL WL++R + VY
Sbjct: 69 ARQCALEAGWPAPRRLRCWRSSMRTMVQRAATELDLEMIASRRTYALLEWLQQREQEVYP 128
Query: 210 RHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLP--------------------- 247
+ GF P A P+ LP+ + GD W++ LP
Sbjct: 129 QEEGFMAGPLAPPPAPVATPPVPLPEEVQGDAWSWASLPADLLGDASDWPTSFSGLLPLP 188
Query: 248 ----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+ W+ GLE + + + L+L G R++ ++
Sbjct: 189 AGLDSNQPVPGLRLFSNSRALAMAGWLGGLEPVRLLVEGRQ--LVLEAGQDDRWLVSDLD 246
Query: 286 KNPVTTSEAEAWEAA--KKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+EA A E A K+ GL F+AIQ + + GFW++ D+
Sbjct: 247 S---AAAEAIAGELAQSKERGKGLQFIAIQASPEEQAFAGFWMMRDI 290
>gi|449016446|dbj|BAM79848.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 411
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 92/169 (54%), Gaps = 24/169 (14%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF SRP++ GK++WELVVCD S + + FPNN++NS L A+ + ++ V
Sbjct: 91 WELDFYSRPVVGADGKRLWELVVCDRDGSFVHVEAFPNNMVNSRELARAVKTLIEESSVR 150
Query: 161 IPEKIRFFRSQMQTIITKACKELD-IKPIPSKRCLSLLLWLEERYETVYTRHPGFQ---- 215
P IRFFR+QM+ +I A + + ++ PS+R +L L L R VY R PG++
Sbjct: 151 -PRIIRFFRAQMRNMIQIAMQNISGVETRPSRRTYALFLALAYRERNVYPRLPGYEGKSI 209
Query: 216 --------KGSKPLLA----------LDNPFPMELPDNLFGDKWAFVQL 246
+G++ LA +D LPD L GD++AFV +
Sbjct: 210 GIGNRSGTRGAELSLAESIGNMLKTPVDLKVAARLPDELQGDRFAFVTI 258
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/85 (44%), Positives = 51/85 (60%), Gaps = 2/85 (2%)
Query: 246 LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACG 305
LP +AW +G E+ I D + + L G+ Y++A + P +EA A+ AKKA
Sbjct: 326 LPLAAWFSGTELAYIIADEQQKEIYLECGLDAAYLFARIQ--PSLEAEARAFNEAKKAAR 383
Query: 306 GLHFLAIQEELDSEDCVGFWLLLDL 330
GLHFLAIQE+ D ED GFWLL D+
Sbjct: 384 GLHFLAIQEKPDDEDVCGFWLLRDV 408
>gi|113953228|ref|YP_731197.1| hypothetical protein sync_1994 [Synechococcus sp. CC9311]
gi|113880579|gb|ABI45537.1| Uncharacterized protein [Synechococcus sp. CC9311]
Length = 304
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 128/284 (45%), Gaps = 51/284 (17%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDG-----SLSLQYTKYFPNNVINSITLKEAI---V 151
+WELDF SRPIL+ GKK WEL++ + S ++ K P +NS L A+ +
Sbjct: 22 DWELDFYSRPILEPDGKKRWELLIISSPSEGTTSSFRFEKRCPAGSVNSTWLTSALTEAI 81
Query: 152 AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRH 211
A G P K+R +RS M+T++ +A EL ++ +PS+R +LL W+ ER + +Y
Sbjct: 82 AAAQQQGWSEPRKLRSWRSSMRTMVQRAASELGLEMVPSRRTYALLDWIAEREQDLYPNE 141
Query: 212 PGFQKGS-KPLLALDNPFPMELPDNLFGDKWAFVQLPFSA------W------------- 251
G+ G P AL + P LP+++ GD W + +LP SA W
Sbjct: 142 EGYMAGPLAPPPALISTPPRPLPESVRGDAWNWAELPASALREAAGWPIGFRGLLPVPIT 201
Query: 252 ------MNGLEVCS----------------IETDTARGSLILSVGISTRYIYANYKKNPV 289
+ GL + S + + L+L G ++ ++
Sbjct: 202 IKDDQVIPGLRLFSQTRGLALAGLLGGIEPVRLKVSGTQLLLEAGQDDCWLVSDLSSEEA 261
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
++ + A + GL F+A+Q ++E GFW+L D P
Sbjct: 262 KHV-SDLMKGASEHAEGLQFIAVQTSPEAERFEGFWMLRDQAEP 304
>gi|90655540|gb|ABD96379.1| unknown [uncultured marine type-A Synechococcus GOM 3O12]
Length = 293
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 126/285 (44%), Gaps = 56/285 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIV- 151
+WELDF SRPIL+ G+K WEL++ + +++K P+ +NSI L A+
Sbjct: 9 ADWELDFYSRPILESDGRKRWELLITATPAADARETPFRFSKCCPSGEVNSIWLSSALAE 68
Query: 152 --AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
D G P P ++R +RS M+T++ +A ELD++ I S+R +LL WL++R + VY
Sbjct: 69 ARQCAVDAGWPAPRRLRCWRSSMRTMVQRAATELDLEMIASRRTYALLDWLQQREQEVYP 128
Query: 210 RHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLP--------------------- 247
GF P A P+ LP+ + GD W++ LP
Sbjct: 129 LEEGFMAGPLAPPPAPIATPPVPLPEEVQGDAWSWASLPADLLRDAADWPTSFSGLLPLP 188
Query: 248 ----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+ W+ GLE + + + L+L G R++ ++
Sbjct: 189 KGLDTDQPVPGLRLFSSSRALAMAGWLGGLEPVRLLVEGRQ--LVLEAGQDDRWLVSDLD 246
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ +K+ GL F+AIQ D + GFW++ D+
Sbjct: 247 SAAADAIAGDL-GRSKERGKGLQFIAIQTSPDEQAFAGFWMMRDI 290
>gi|33866273|ref|NP_897832.1| hypothetical protein SYNW1741 [Synechococcus sp. WH 8102]
gi|33639248|emb|CAE08256.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
Length = 293
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 133/287 (46%), Gaps = 60/287 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVV-----CDGS-LSLQYTKYFPNNVINSITLKEAI-- 150
+WELDF SRPIL+ G+K WEL+V D + + +++K P+ +NS+ L A+
Sbjct: 9 ADWELDFYSRPILEADGRKRWELLVTATPAADATEIPFRFSKCCPSGEVNSLWLSAALGE 68
Query: 151 VAICD-DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
C + G P P ++R +RS M+T++ +A ELD++ I S+R +LL WL+ R + VY
Sbjct: 69 ARQCALEAGWPAPRRLRCWRSSMRTMVQRAATELDLEMIASRRTYALLEWLQHREQEVYP 128
Query: 210 RHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLP--------------------- 247
+ GF P A P+ LP+ + GD W++ LP
Sbjct: 129 QEEGFMAGPLAPPPAPVATPPVPLPEEVQGDAWSWASLPADLLGDASDWPTSFSGLLPLP 188
Query: 248 ----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+ W+ GLE + + + L+L G R++ ++
Sbjct: 189 AGLDSNQPVPGLRLFSNSRALAVAGWLGGLEPVRLLVEGRQ--LVLEAGQDDRWLVSDLD 246
Query: 286 KNPVTTSEAEAWEAA--KKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+EA A E A K+ GL F+AIQ + + GFW++ D+
Sbjct: 247 S---AAAEAIAGELAQSKERGKGLQFIAIQTSPEEQAFAGFWMMRDI 290
>gi|87123919|ref|ZP_01079769.1| hypothetical protein RS9917_09926 [Synechococcus sp. RS9917]
gi|86168488|gb|EAQ69745.1| hypothetical protein RS9917_09926 [Synechococcus sp. RS9917]
Length = 304
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 127/289 (43%), Gaps = 59/289 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLKEAI--- 150
+WELDF SRPIL+ GKK WEL++ G +Y + P +NS L A+
Sbjct: 21 ADWELDFYSRPILEADGKKRWELLITGSPDRSGRPPFRYERRCPAGEVNSTWLASALRDA 80
Query: 151 VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+ + G P+++R +RS M+T++ +A EL ++ PS+R +L+ WL +R VY
Sbjct: 81 LDLAQSEGWSPPQRLRCWRSAMRTMVQRAGTELGLEVRPSRRTYALIDWLAQREREVYPT 140
Query: 211 HPGFQKGSKPL---LALDNPFPMELPDNLFGDKWAFVQLP-------------------- 247
GF G PL A + LP+ + GD W++ LP
Sbjct: 141 EEGFMAG--PLAPSPAPTPTPALPLPEAVRGDAWSWASLPLGSLRDAEDWPLGFHDLLPI 198
Query: 248 -----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
+ W+ GLE + + + L+L G ++ +
Sbjct: 199 PNALAADQPVPGLRLFSRSRALALAGWLGGLEPVRLRVEGCQ--LVLDAGQDDAWLVTDL 256
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPP 333
+ ++ E +AA++ GGL F+A+Q ++ GFW+L D P P
Sbjct: 257 EPEAANITQREL-DAAREQIGGLQFIAVQTTPETPRFEGFWMLRDQPEP 304
>gi|434392898|ref|YP_007127845.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
gi|428264739|gb|AFZ30685.1| protein of unknown function DUF1092 [Gloeocapsa sp. PCC 7428]
Length = 279
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 125/276 (45%), Gaps = 55/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL+VCD + ++++ P + +N+ L E + + D +
Sbjct: 7 WQADFYRRPLRDAAGQTLWELLVCDLTRTVEFVALCPQSQVNAHWLVEQLQHVADKM--- 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++IT A ++L I ++R +L WL+ER ++Y + +
Sbjct: 64 -PDTIQVFRPQSLSLITAAGEQLGITVEATRRTDALKQWLQER-SSLYRSMDNYTGEAYD 121
Query: 221 LLALDNPFPMELPDNLFGDKWAF----------------------------VQLPFSA-- 250
LL L+ P P LP+ L+G++W F +QL ++
Sbjct: 122 LLTLEKPPPTPLPEKLWGEQWRFAALSAKDVEEAFQERPIPILNMPPALMPLQLGLASNI 181
Query: 251 ------------------WMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTT 291
W+ S+ A L+L G+ R+I A ++ V+T
Sbjct: 182 AIPGVIIYGGRQSMRLARWLQEANPVSLNYIAGAPDGLVLEAGLVDRWIVATFEDREVST 241
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
S A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 242 S-AQNYEQRKQQSKGLHFLLVQPDNSDITFSGFWLL 276
>gi|260434334|ref|ZP_05788304.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
gi|260412208|gb|EEX05504.1| conserved hypothetical protein [Synechococcus sp. WH 8109]
Length = 294
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 79/287 (27%), Positives = 130/287 (45%), Gaps = 60/287 (20%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIV-- 151
+WELDF SRPIL+ G+K WEL++ + ++ K P+ +NS+ L +A+
Sbjct: 11 DWELDFYSRPILEADGRKRWELLITSTPAATGDTEPFRFAKVCPSGDVNSLWLSQALAEA 70
Query: 152 ---AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
+ G P+ ++R +RS M+T++ +A E D++ IPS+R +LL WL++R VY
Sbjct: 71 KQASASGGWGSPV--RLRCWRSSMRTMVQRAAAEQDLEVIPSRRTFALLDWLQQREREVY 128
Query: 209 TRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFS------------------ 249
GF G P A P LP+ + GD W++ LP S
Sbjct: 129 PEEEGFMAGPLAPPPAPVPTPPAPLPEEVQGDAWSWAALPASLLLEASEWPMSFSGLLPV 188
Query: 250 -------------------------AWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
W+ GLE + + + L+L G R++ ++
Sbjct: 189 PDGIDPEASVPGLRLFSQSRSVAMAGWLGGLEPVRMIVEDRQ--LVLEAGQDDRWLVSDL 246
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+ V +EA +++ GL F+AIQ + + GFW+L D+P
Sbjct: 247 EPG-VAAEISEALATSQQQVRGLQFIAIQSIPEEQTFGGFWMLRDIP 292
>gi|452822989|gb|EME30003.1| hypothetical protein Gasu_25920 [Galdieria sulphuraria]
Length = 366
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 124/279 (44%), Gaps = 50/279 (17%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLG 158
T WELDF SRP+ K+IWEL+V D S L + + PN++INS L++ + + + +
Sbjct: 88 TVWELDFYSRPVYGKDNKRIWELIVVDESFLLCHVESVPNDMINSAELRKRVERLLEQVT 147
Query: 159 VPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGS 218
V P+ ++F R M +I+ A K+L + PS+R L L +R +Y++ PG++ S
Sbjct: 148 VK-PKVVKFSRMPMFNMISLALKDLGFEVKPSRRTYRLYHVLRDREANIYSKMPGYR--S 204
Query: 219 KPLLALDNPFPME-LPDNLFGDKWAFVQLPFS---------------------------- 249
+ L+ + E LPD L G+K+AF +S
Sbjct: 205 ENTLSTSYLYSTERLPDALRGEKFAFCTADYSFLYELQSSDTIPYCDIFNTGDSILLEKE 264
Query: 250 ---------------AWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEA 294
+W G EV I+ L+L GI++ Y A ++ EA
Sbjct: 265 LPGIIVYSERADSLASWTAGAEVSFIKFREEELELVLECGINSHYRLAKIAEDHRLVEEA 324
Query: 295 EAWEAAKKACGGLHFLAIQ---EELDSEDCVGFWLLLDL 330
+ +E K G HF AIQ E + G WLL D
Sbjct: 325 KTFEQMKWHMKGFHFYAIQSLKETSGTSHIKGLWLLNDF 363
>gi|159903073|ref|YP_001550417.1| hypothetical protein P9211_05321 [Prochlorococcus marinus str. MIT
9211]
gi|159888249|gb|ABX08463.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9211]
Length = 295
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 125/286 (43%), Gaps = 59/286 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCD-----GSLSLQYTKYFPNNVINSITLKEAIVAI 153
+WELDF SRP+++ GKK WEL++ G ++ K P N +NSI L +A+
Sbjct: 12 ADWELDFYSRPVIEADGKKRWELLISSTENLSGKEPFRWEKKCPANEVNSIWLSKALKEA 71
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D G P+ +R +R+ M+T+I KA + L ++ S+R SLL WL R + VY
Sbjct: 72 LKDAQSQGWGKPKIVRCWRAPMKTMIKKAAESLGLEVKESRRTYSLLDWLAHREKEVYPL 131
Query: 211 HPGFQKG---SKPLLALDNPFPMELPDNLFGDKWAFVQL--------------------- 246
G+ G P L+ P P LP+ + GD +F L
Sbjct: 132 QSGYLNGPIAPPPARILNQPTP--LPEAIRGDALSFASLEVRSLREAREWPIEFQGLLPI 189
Query: 247 ----------------------PFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
SAW++GLE + + + LIL G R++ +
Sbjct: 190 APSIEENISIPGLRLFSKNRAFALSAWLSGLEPVKLIVE--KNQLILEAGQEDRWLVTDM 247
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ ++ E +++ GL F++IQ + + GFW+L DL
Sbjct: 248 PQASADNAKKEL-SNSRENANGLQFISIQTSPNEQKFSGFWMLRDL 292
>gi|113477160|ref|YP_723221.1| hypothetical protein Tery_3687 [Trichodesmium erythraeum IMS101]
gi|110168208|gb|ABG52748.1| protein of unknown function DUF1092 [Trichodesmium erythraeum
IMS101]
Length = 283
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 123/280 (43%), Gaps = 52/280 (18%)
Query: 96 ESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICD 155
++IT W++D+ RP+ D +G+K+WEL++C + SL++ P + + + L + +
Sbjct: 5 DTITIWQVDYYRRPLQDKQGQKLWELLICTPTRSLEFIAMCPQSEVKASWLVAQLQKMAQ 64
Query: 156 DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQ 215
G +P+ I+ FR Q +I A + L +K P++R +L WL ER + Y +
Sbjct: 65 GQG--LPDVIQVFRPQSLGLIEVAAQMLGLKIEPTRRTTALKEWLLERVQQ-YQDMEAYT 121
Query: 216 KGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS------------------------AW 251
L LD P P+ L +NL+GD+W F LP
Sbjct: 122 GEFYEPLVLDVPPPVPLAENLWGDRWRFASLPAGNIGDIIERPIPVLEAPEFLLPLNLGL 181
Query: 252 MNGLEVCSIETDTARGS------------------------LILSVGISTRYIYANYKKN 287
+ L + + D R S LIL G+ R++ A +
Sbjct: 182 SSTLPIPGVVIDGGRQSMKLARWLETTRPYLLKYISGDPDGLILETGLVDRWVVATFADQ 241
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
V+ + A+A+E K+ GLHFL +Q + GFWLL
Sbjct: 242 EVSGA-AQAYEQRKQQSEGLHFLLVQPDDSGMTYSGFWLL 280
>gi|88807699|ref|ZP_01123211.1| hypothetical protein WH7805_14148 [Synechococcus sp. WH 7805]
gi|88788913|gb|EAR20068.1| hypothetical protein WH7805_14148 [Synechococcus sp. WH 7805]
Length = 304
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 129/294 (43%), Gaps = 57/294 (19%)
Query: 91 EETDPESITEWELDFCSRPILDIRGKKIWELVVCDGS-----LSLQYTKYFPNNVINSI- 144
E++ + +WELDF SRPIL+ GKK WEL++ + ++ K P +NS
Sbjct: 13 EQSSAQKQADWELDFYSRPILEADGKKRWELLITSTPTPTEPVCFRFEKRCPAGDVNSTW 72
Query: 145 ---TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLE 201
L+EA+ A ++ G P+++R +RS M+T++ +A EL ++ IPS+R +LL WLE
Sbjct: 73 LTSALREALTA-ANEQGWLQPKRLRTWRSAMRTMVQRAASELGLEMIPSRRTYALLDWLE 131
Query: 202 ERYETVYTRHPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLP------------- 247
ER +VY GF P A P+ LP+ + GD W + LP
Sbjct: 132 ERERSVYPLDEGFMAGPIAPPPAPIATPPLPLPEAVRGDAWCWAALPLGSLLEAGEWPMG 191
Query: 248 ------------------------------FSAWMNGLEVCSIETDTARGSLILSVGIST 277
+ W+ GLE + + L+L G
Sbjct: 192 FNDLLPIPEGMDPELPVPGLRLFSQTRALALAGWLGGLEPVRLRVSNQQ--LVLDAGQDD 249
Query: 278 RYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
++ ++ + ++ + GL F+++Q DS+ GFW+L D P
Sbjct: 250 SWLVSDLGQMEANQCREALMDSVSRGR-GLQFISVQTTPDSQRFDGFWMLRDRP 302
>gi|300869097|ref|ZP_07113697.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300332913|emb|CBN58893.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 281
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 121/276 (43%), Gaps = 53/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF RP+ D G+K+WEL +CD + ++ + P + NS L E + + G
Sbjct: 4 WQVDFYRRPLKDDAGEKLWELSICDLDRNFTFSTFCPQSQANSGWLTEQLQQVSQ--GKN 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +I A + LD++ ++R +L LEER + Y + + +
Sbjct: 62 LPDLIQVFRPQSLGLIEAAAQVLDVEVEATRRTFALKRLLEERAKQ-YQKMANYTGEAYH 120
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLE------------------------ 256
L L++P P+ LP+NL+GD+W F LP + +
Sbjct: 121 PLMLESPPPVPLPENLWGDRWRFAALPAGDIEDAFKSRPIPILEMPELLLPLNLALASTV 180
Query: 257 -VCSIETDTARGS------------------------LILSVGISTRYIYANYKKNPVTT 291
V + D R S LIL G+ R+I A + ++P
Sbjct: 181 SVPGVIIDGGRQSMRLARWLQAAKPVALNYIPGSPDGLILEAGLVDRWIVATF-EDPDVK 239
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ E ++ ++ GLHFL +Q + GFWLL
Sbjct: 240 AAGEIYQQRQQLSHGLHFLLVQPDDSGMTYTGFWLL 275
>gi|124025296|ref|YP_001014412.1| hypothetical protein NATL1_05851 [Prochlorococcus marinus str.
NATL1A]
gi|123960364|gb|ABM75147.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL1A]
Length = 295
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 128/286 (44%), Gaps = 59/286 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGS-----LSLQYTKYFPNNVINSITLKEAIVAI 153
T+WE+DF SRPI+D GKK WEL++ + + ++ K P + +NSI LK+A
Sbjct: 12 TDWEIDFYSRPIIDENGKKRWELLITSTNNFKDKKTFKWEKICPASSVNSIWLKDAFDEA 71
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D+ G P IR +RS M+T+I +A ++ I+ I S+R SLL WL ER + Y +
Sbjct: 72 IDEAYSQGWDKPSVIRCWRSSMKTMIKRAADQIGIELISSRRTYSLLEWLIERERSFYPQ 131
Query: 211 HPGFQKGSKPLLALDNPFPME---LPDNLFGDKWAFV----------------------- 244
G+ + L NP + LP+ + G+ W+F
Sbjct: 132 QKGYTGVN--LAPPSNPITNQAIPLPEEVRGESWSFASLSLNTLREADEWEIEFSNLIPI 189
Query: 245 --------------------QLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
L +AW+ GLE + + + +IL G + R++ +
Sbjct: 190 KDSINENISIPGIRLFSPKRSLALAAWLGGLEPAKLLIEGTQ--IILEAGQADRWLVTDV 247
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
++ E ++ K GL F+++Q+ + GFW+L D+
Sbjct: 248 EEEAKKVIE-NNFQNTKLYADGLQFISVQKSPEENSLDGFWMLKDI 292
>gi|78212273|ref|YP_381052.1| hypothetical protein Syncc9605_0725 [Synechococcus sp. CC9605]
gi|78196732|gb|ABB34497.1| conserved hypothetical protein [Synechococcus sp. CC9605]
Length = 294
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 127/287 (44%), Gaps = 60/287 (20%)
Query: 100 EWELDFCSRPILDIRGKKIWELVVCDGSLS------LQYTKYFPNNVINSITLKEAIV-- 151
+WELDF SRPIL+ G+K WEL++ + ++ K P+ +NS+ L +A+
Sbjct: 11 DWELDFYSRPILEADGRKRWELLITSTPAASGDAEPFRFAKVCPSGDVNSLWLSQALAEA 70
Query: 152 ---AICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
+ G P+ ++R +RS M+T++ +A E D++ IPS+R +LL WL++R VY
Sbjct: 71 KQASASGGWGSPV--RLRCWRSSMRTMVQRAAAEQDLEVIPSRRTFALLDWLQQREREVY 128
Query: 209 TRHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLPFS------------------ 249
GF G P A P+ LP+ + GD W++ LP S
Sbjct: 129 PEEEGFMAGPLAPPPAPVPTPPVPLPEEVQGDAWSWAALPASLLLEASEWPMSFSGLLPV 188
Query: 250 -------------------------AWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
W+ GLE + + + L+L G R++ ++
Sbjct: 189 PDGIDPEASVPGLRLFSQSRSLAMAGWLGGLEPVRMIVEDRQ--LVLEAGQDDRWLVSDL 246
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+ +++ GL F+AIQ + + GFW+L D+P
Sbjct: 247 EPGIAAEIAEAL-ATSQQQVRGLQFIAIQSSPEEQTFGGFWMLRDIP 292
>gi|148238987|ref|YP_001224374.1| hypothetical protein SynWH7803_0651 [Synechococcus sp. WH 7803]
gi|147847526|emb|CAK23077.1| Conserved hypothetical protein [Synechococcus sp. WH 7803]
Length = 304
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 121/285 (42%), Gaps = 55/285 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSL-----SLQYTKYFPNNVINSITLKEAIVAI 153
+WELDF SRPIL+ GKK WEL++ ++ K P +NS L A+
Sbjct: 21 ADWELDFYSRPILEADGKKRWELLITSTPTPSAPDCFRFEKRCPAGDVNSTWLASALREA 80
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D G P ++R +RS M+T++ +A EL+++ IPS+R +LL WLEER +Y
Sbjct: 81 LDTAQAHGWMSPRRLRTWRSAMRTMVQRAASELELEMIPSRRTYALLDWLEERERDLYPL 140
Query: 211 HPGFQ-KGSKPLLALDNPFPMELPDNLFGDKWAFVQLP---------------------- 247
G+ P A P+ LP+ + GD W + LP
Sbjct: 141 DKGYMAGPLAPPPAPIATPPLPLPEAVRGDAWCWAALPLGSLREASEWPMGFNDLLPIPE 200
Query: 248 ---------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKK 286
+ W+ GLE + + + LIL G ++ ++ +
Sbjct: 201 AMDPELPVPGLRLFSQTRALALAGWLGGLEPVRLRMNAQQ--LILDAGQDDSWLVSDLGQ 258
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
+A E + GL F+++Q DS+ GFW+L D P
Sbjct: 259 TEAVECR-DALEDSVHRSRGLQFISVQATPDSQRFDGFWMLRDQP 302
>gi|72383696|ref|YP_293051.1| hypothetical protein PMN2A_1860 [Prochlorococcus marinus str.
NATL2A]
gi|72003546|gb|AAZ59348.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 295
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 127/286 (44%), Gaps = 59/286 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGS-----LSLQYTKYFPNNVINSITLKEAIVAI 153
T+WE+DF SRPI+D GKK WEL++ + + ++ K P + +NSI LK+A
Sbjct: 12 TDWEIDFYSRPIIDENGKKRWELLITSTNNFKDKKTFKWEKICPASSVNSIWLKDAFDEA 71
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
D+ G P IR +RS M+T+I +A ++ I+ I S+R SLL WL ER + Y +
Sbjct: 72 IDEAYLQGWDKPSVIRCWRSSMKTMIKRAADQIGIELISSRRTYSLLEWLIERERSFYPQ 131
Query: 211 HPGFQKGSKPLLALDNPFPME---LPDNLFGDKWAFV----------------------- 244
G+ + L NP + LP+ + G+ W+F
Sbjct: 132 QKGYTGVN--LAPPSNPITNQAIPLPEEVRGESWSFASLSLNTLREADEWEIEFSNLIPI 189
Query: 245 --------------------QLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANY 284
L +AW+ GLE + + + +IL G + R++ +
Sbjct: 190 KDSINENISIPGIRLFSPKRSLALAAWLGGLEPAKLLIEGTQ--IILEAGQADRWLVTDV 247
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
++ E + K GL F+++Q+ + GFW+L D+
Sbjct: 248 EEEAKKVIE-NNFLNTKLYADGLQFISVQKSPEENSLHGFWMLKDI 292
>gi|119486760|ref|ZP_01620735.1| hypothetical protein L8106_10937 [Lyngbya sp. PCC 8106]
gi|119456053|gb|EAW37186.1| hypothetical protein L8106_10937 [Lyngbya sp. PCC 8106]
Length = 277
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 120/276 (43%), Gaps = 54/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD S +++Y + P + NS L + +
Sbjct: 4 WQADFYRRPLQDTTGQPLWELLICDQSRNIEYLAFCPQSHANSTWLTQQLQQATQ---TE 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I FR Q ++I A L I+ P++R ++L WL++R + Y + G+
Sbjct: 61 KPDLIWVFRPQSLSLIQTAATALGIRVEPNRRTVTLKQWLQQRSQD-YPQLAGYTNEPYK 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNG-------------------------L 255
+ LD P P+ +P+NL+GD W F LP ++G L
Sbjct: 120 PVELDKPPPVPIPENLWGDVWRFATLPAGDIVDGFRDRPIPILEMPDFLYPINLGLPSTL 179
Query: 256 EVCSIETDTARGS------------------------LILSVGISTRYIYANYKKNPVTT 291
V I + R S LIL G+ R++ A ++ VT
Sbjct: 180 PVPGIVINGGRQSMQLSRWLAEKKPVSLHYIPGSPDGLILEAGLVDRWVLATFEDAEVTE 239
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ + K+ GLHFL +Q + GFWLL
Sbjct: 240 A-AKMFTERKQLTKGLHFLLVQPDDSGITYTGFWLL 274
>gi|427718052|ref|YP_007066046.1| hypothetical protein Cal7507_2795 [Calothrix sp. PCC 7507]
gi|427350488|gb|AFY33212.1| protein of unknown function DUF1092 [Calothrix sp. PCC 7507]
Length = 265
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/278 (27%), Positives = 115/278 (41%), Gaps = 72/278 (25%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF D G+ +WEL++CD + S +YT P + NS L I G
Sbjct: 5 WQADFYRSSQRDTAGQVLWELLLCDATRSFEYTATCPQSAANSNWLTSQIELAA---GGK 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
PE I+ FR Q ++I A + L I P++R L++ WL+E+ ++P
Sbjct: 62 FPEVIQVFRPQSLSLIEAAGRNLGINVEPTRRTLAVKQWLKEK------QYP-------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP--------------------------------- 247
LALD P P LP+NL+G++W F L
Sbjct: 108 -LALDKPPPSPLPENLWGEQWRFATLQAGELVDVFAERPIPILHIPEFLQPINLGLASTV 166
Query: 248 ---------------FSAWMNGLEVCSIETDTARG---SLILSVGISTRYIYANYKKNPV 289
+ W+N E +E + G L+L + R+I A + + V
Sbjct: 167 PVPGVVIYGGRQSMRLARWLN--EASPVELNYIAGEPDGLVLEAALVDRWIVATFADSEV 224
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
T + A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 225 TAA-AKLYEQRKQQSLGLHFLLVQPDDSGMTYSGFWLL 261
>gi|428318463|ref|YP_007116345.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
gi|428242143|gb|AFZ07929.1| protein of unknown function DUF1092 [Oscillatoria nigro-viridis PCC
7112]
Length = 279
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 116/276 (42%), Gaps = 53/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D GK +WEL +CD S Q++ NS L +
Sbjct: 4 WQADFYRRPLQDETGKPLWELFICDSEGSFQFSAVCSQGAANSNWLASQLQQQAQTHN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +I A K L +K ++R +L L L++R + Y+ P + +
Sbjct: 62 LPDLIQVFRPQSLGLIEAAGKVLGVKVEATRRTPALKLLLQQRAKE-YSSMPNYTGETYS 120
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFS-------------------------AWMNGL 255
+ALD+P P+ LP+NL+GD W F LP + +
Sbjct: 121 AIALDSPPPVPLPENLWGDGWRFASLPAGDIEEAFQGRPLPILEMPEFLLPLNLGLASTV 180
Query: 256 EVCSIETDTARGS------------------------LILSVGISTRYIYANYKKNPVTT 291
V + D R S LIL G+ R++ A ++ + V
Sbjct: 181 PVPGVVIDGGRQSMRLARWLQDAKPFALNYIAGEPDGLILEAGLVDRWVVATFEDSEVKA 240
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ ++ K+ GLHFL +Q + GFWLL
Sbjct: 241 A-AQIYQQRKQLSKGLHFLLVQPDDSGMTYTGFWLL 275
>gi|90655437|gb|ABD96278.1| unknown [uncultured marine type-A Synechococcus GOM 3M9]
Length = 288
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 125/285 (43%), Gaps = 56/285 (19%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVC------DGSLSLQYTKYFPNNVINSITLKEAIVA 152
+WELDF SRPIL+ G+K WEL++ D ++ K P+ +NS+ L +A+
Sbjct: 4 ADWELDFYSRPILEPDGRKRWELLITSTPTLSDPIAPFRFIKCCPSGEVNSLWLTQALRE 63
Query: 153 ICDDLGV---PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
P+++R +RS M+T++ +A EL ++ IPS+R +LL WL++R VY
Sbjct: 64 AGAAAEDAGWSAPQRLRCWRSSMRTMVQRAAAELSLEVIPSRRTYALLDWLQQRQREVYP 123
Query: 210 RHPGFQKG-SKPLLALDNPFPMELPDNLFGDKWAFVQLP--------------------- 247
GF G P A P+ LP+ + GD W + LP
Sbjct: 124 SLEGFMAGPLAPPPAPVPTPPVPLPEEVQGDAWTWAALPGGLLQEAGEWPMGFSGLIPLP 183
Query: 248 ----------------------FSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYK 285
+ W+ GLE + + + L+L G R++ ++ +
Sbjct: 184 PDLSSEAPVPGLRLFSRSRALAMAGWLGGLEPVRLLVEERQ--LLLEAGQDDRWLVSDLE 241
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
E E+++ GL F+AIQ + + GFW++ D+
Sbjct: 242 SGAADAIETALRESSEH-MHGLQFIAIQSSPEEQSFAGFWMMRDI 285
>gi|119511451|ref|ZP_01630562.1| hypothetical protein N9414_16559 [Nodularia spumigena CCY9414]
gi|119463916|gb|EAW44842.1| hypothetical protein N9414_16559 [Nodularia spumigena CCY9414]
Length = 265
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 121/279 (43%), Gaps = 68/279 (24%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
I W++DF RP+ D G+ +WEL++CD + S +YT P + NS L I +D
Sbjct: 2 IKIWQVDFYRRPVQDKSGQILWELLICDATRSFEYTATCPQSAANSHWLATQIQLADND- 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
+P+ I+ FR Q ++I A LDI P++ L+L WLEE+ ++P
Sbjct: 61 --NLPDTIQVFRPQSLSLIQAAANNLDIDVEPTRYTLALKQWLEEK------QYP----- 107
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFV--------------QLPFSAWMNGLE------- 256
LALD P P LP+NL+G++W F Q+P + L+
Sbjct: 108 ----LALDKPPPTPLPENLWGEEWRFATLSAGELADVFAQRQIPIVSIPEFLKPINLGLA 163
Query: 257 ----VCSIETDTARGSLILS------------------------VGISTRYIYANYKKNP 288
V + R S+ L+ G+ R+I A ++
Sbjct: 164 STVPVPGVIIYGGRKSMYLARWLEQAQPFTLNYIAGEPNGLILEAGLVDRWIVATFEDAE 223
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
V + A+ ++ ++ GLHFL +Q + GFWLL
Sbjct: 224 VEAA-AKVYQQRQQQSQGLHFLLVQPDDSGMTYTGFWLL 261
>gi|194477333|ref|YP_002049512.1| hypothetical protein PCC_0893 [Paulinella chromatophora]
gi|171192340|gb|ACB43302.1| hypothetical protein PCC_0893 [Paulinella chromatophora]
Length = 306
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 71/286 (24%), Positives = 122/286 (42%), Gaps = 60/286 (20%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDG-SLSLQY-TKYF------PNNVINSITLKEAI 150
++WELDF SR +D KK WEL++C S+S+ + YF P+ +NS+ LKEA+
Sbjct: 18 SDWELDFYSRSPIDTNDKKCWELIICSTPSISITGPSAYFRWEMPCPSESVNSLWLKEAL 77
Query: 151 VAICD---DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETV 207
D + G P ++R +RS M+ +I +A + I+ +PS+RC +L+ W+++R +
Sbjct: 78 GQAIDSALEQGFSSPRRLRSWRSSMRIMIQRAVESFGIEFVPSRRCYTLMEWIKDREIQI 137
Query: 208 YTRHPGFQKGSKPLLALDNPF-PMELPDNLFGDKWAFVQLPF------------------ 248
Y+ + + F + LP GD W++ LP
Sbjct: 138 YSSQKNMSTNIGVIPSTRTQFRAIPLPTAAQGDSWSWASLPMNILQEASNWEISFSGLLP 197
Query: 249 ---------------------------SAWMNGLEVCSIETDTARGSLILSVGISTRYIY 281
+ W+ GLE +E L+L G+ R++
Sbjct: 198 LPIFNEKQKEIMIPGVRLLSLSRSLAIAGWIQGLEPVRLE--ICETQLVLEAGLEDRWLL 255
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + EA+ A+ G+ FLA+Q + + G W+L
Sbjct: 256 TDLPIEEALVAN-EAFTKARMNAFGVQFLAVQSDPNQRGFDGLWML 300
>gi|334120429|ref|ZP_08494510.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
gi|333456776|gb|EGK85406.1| protein of unknown function DUF1092 [Microcoleus vaginatus FGP-2]
Length = 279
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 114/276 (41%), Gaps = 53/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D GK +WEL++CD S Q++ NS L +
Sbjct: 4 WQADFYRRPLQDETGKPLWELLICDSEGSFQFSAVCRQGDANSNWLASQLQQQAQTQN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P I+ FR Q +I A K L +K ++R +L L L++R + Y P + +
Sbjct: 62 LPALIQVFRPQSLGLIEAAGKVLGVKVEATRRTGALKLLLQQRAKE-YLSMPNYTGETYS 120
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLE------------------------ 256
+ALD+P P+ LP+NL+GD W F LP +
Sbjct: 121 AIALDSPPPVPLPENLWGDGWRFASLPAGDIEEAFQGRPLPILEMPELLLPLNLGLASTV 180
Query: 257 -VCSIETDTARGS------------------------LILSVGISTRYIYANYKKNPVTT 291
V + D R S LIL G+ R++ A ++ + V
Sbjct: 181 PVPGVVIDGGRQSMRLARWLQDAKPFAVNYIAGEPDGLILEAGLVDRWVVATFEDSEVKA 240
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ ++ K+ GLHFL +Q + GFWLL
Sbjct: 241 A-AQIYQQRKQLSKGLHFLLVQPDDSGMTYTGFWLL 275
>gi|186681562|ref|YP_001864758.1| hypothetical protein Npun_F1089 [Nostoc punctiforme PCC 73102]
gi|186464014|gb|ACC79815.1| protein of unknown function DUF1092 [Nostoc punctiforme PCC 73102]
Length = 264
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/276 (26%), Positives = 116/276 (42%), Gaps = 68/276 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF RP D G+ +WEL++CD + S +Y + NS + + G
Sbjct: 4 WQVDFYRRPSQDASGQILWELLICDATRSFEYEATCLQSAANSNWVAAQLELAA---GEK 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++I A + L I P++ L+L WL+E+ ++P
Sbjct: 61 LPDVIQVFRPQSLSLIEVAGRNLSINVEPTRHTLALKQWLQEK------QYPS------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFS-------------------------AWMNGL 255
ALD P P LP+NL+G++W F L S + +
Sbjct: 108 --ALDKPPPAPLPENLWGEQWRFATLAASDVETRFSDRPIPILHIPEHLKPINLGLASTV 165
Query: 256 EVCSIETDTARGSLILS------------------------VGISTRYIYANYKKNPVTT 291
V + R S+ L+ G+ R+I A + ++P T
Sbjct: 166 PVPGVVIYGGRQSMRLARWLQQARPVALNYISGAPDGLVLEAGLVDRWIVATF-EDPEVT 224
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ ++ KK C GLHFL +Q + GFWLL
Sbjct: 225 TAAQTYQQRKKHCRGLHFLLVQPDDSGMTYSGFWLL 260
>gi|428211452|ref|YP_007084596.1| hypothetical protein Oscil6304_0944 [Oscillatoria acuminata PCC
6304]
gi|427999833|gb|AFY80676.1| Protein of unknown function (DUF1092) [Oscillatoria acuminata PCC
6304]
Length = 277
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 118/276 (42%), Gaps = 54/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ G+ +WEL +CD + + Q+++ + NS L E + + +
Sbjct: 4 WQADFYRRPLQSATGEPLWELCLCDPTGNFQWSRCCSQSEANSTWLAEQLQIVAEGR--- 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+PE I FR Q +++ A ++L +K PS+R +L WL E+ + Y P +
Sbjct: 61 LPEAIAVFRPQSLSLMVAAGEKLGVKIEPSRRTPALKSWLVEKAQE-YRNAPNYTCEPYE 119
Query: 221 LLALDNPFPMELPDNLFGDKWAF------------------------------VQLPFSA 250
L D P P LP+ L+GD+W F + LP +A
Sbjct: 120 PLVSDRPPPGPLPEALWGDRWRFASVSAAYLMEVFAQRAIRIRHIPEELTPVALGLPSTA 179
Query: 251 WMNGL----------------EVCSIETDTARG---SLILSVGISTRYIYANYKKNPVTT 291
+ G+ E + + G LIL G+ R+I A ++ V
Sbjct: 180 VIPGVVLDGGRQSMKIAQWLQEASPVAINYNPGPPNGLILEAGLVDRWIMATFEDTEVAE 239
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + ++ K+A GLHFL IQ + GFWLL
Sbjct: 240 A-GQTFQQRKQATQGLHFLLIQPDDSGMTYSGFWLL 274
>gi|434405323|ref|YP_007148208.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
gi|428259578|gb|AFZ25528.1| Protein of unknown function (DUF1092) [Cylindrospermum stagnale PCC
7417]
Length = 264
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 116/279 (41%), Gaps = 68/279 (24%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W+ DF RP D + +WEL +CD + S ++ P + NS + + +
Sbjct: 1 MTIWQADFYKRPQKDATEQVLWELSICDQTRSFEFAATCPQSQANSTWVATQLQLAANK- 59
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
+P+ I+ FR Q +I A + L I P++R L+L WL+++ + P
Sbjct: 60 --KLPDVIQVFRPQSLNLIAAAGRTLGINVEPNRRTLALKQWLQQK------QFP----- 106
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLP------------------------------ 247
LA++ P P LP+NL+G++W F +LP
Sbjct: 107 ----LAVEKPPPAPLPENLWGEEWRFAKLPAGDIADIFTERPIPILQVPEFLKPINLGLA 162
Query: 248 ------------------FSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNP 288
+ W+ + ++ A LIL G+ R+I A + +
Sbjct: 163 STVSVPGVIIYGGRQSMRLARWLQEADPVALNYMSGAPDGLILEAGLQDRWIVATFDDSE 222
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
VT + A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 223 VTDA-AKVYEQRKQQSRGLHFLLVQPDDSGMTYTGFWLL 260
>gi|440681085|ref|YP_007155880.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
gi|428678204|gb|AFZ56970.1| protein of unknown function DUF1092 [Anabaena cylindrica PCC 7122]
Length = 265
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 114/277 (41%), Gaps = 70/277 (25%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ D G+ +WEL++CD + +Y P + NS L E +
Sbjct: 5 WQADFYRSPLRDAAGQILWELLICDATRKFEYVATCPQSQANSNWLTEQFQTAGAE---K 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRC-LSLLLWLEERYETVYTRHPGFQKGSK 219
+PE I+ FR Q +IT A L IK + + RC L+L WL+E+ ++P
Sbjct: 62 LPEIIQVFRPQSLGLITAAGNNLSIK-VEATRCTLALKQWLQEK------QYP------- 107
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------------- 247
+A+D P P LP+NL+G++W F +P
Sbjct: 108 --IAVDKPPPAPLPENLWGEEWRFATIPAGDIVDEFTERPIPILQIPEFLKPINLGLAST 165
Query: 248 ----------------FSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVT 290
+ W+ S+ A LIL G++ R+I A ++ V
Sbjct: 166 VPVPGVVIYGGRQSMRLARWLQEANPVSLNYIAGAPDGLILEAGLADRWILATFEDEEVA 225
Query: 291 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ + K+ GLHFL IQ + GFWLL
Sbjct: 226 AA-AKVYAQRKQVSKGLHFLLIQPDDSGMTYSGFWLL 261
>gi|37523856|ref|NP_927233.1| hypothetical protein glr4287 [Gloeobacter violaceus PCC 7421]
gi|35214862|dbj|BAC92228.1| glr4287 [Gloeobacter violaceus PCC 7421]
Length = 272
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 67/267 (25%), Positives = 106/267 (39%), Gaps = 43/267 (16%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
WELDF P++ G+ WEL+VC L ++ P + N + L+ + + G P
Sbjct: 4 WELDFYRCPLVGADGQVRWELLVCTAEGGLLRAQFCPADAANVVWLEAQLAELVASRGGP 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P ++R FR+ + AC+ L I S+R +++ ER E++Y + P ++ P
Sbjct: 64 -PLQMRAFRTAAFNLAGPACRRLGIPLRHSRRAIAVQRRRAEREESLYPQMPDYRP-LPP 121
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP--------------------------------- 247
+ P +PD D+W F LP
Sbjct: 122 GVPQQKAVPAPIPDARLPDRWGFSALPGAELGQLRQLPIAYLEVPLLAGIDAPVPGVFLF 181
Query: 248 ------FSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTSEAEAWEAA 300
++W+ E S++ A LIL G+ R+I A + +P +
Sbjct: 182 SRRDRDLASWLAAREPVSLQYTRAEIDGLILEAGLDERWILATF-DDPGMRERGRQFAER 240
Query: 301 KKACGGLHFLAIQEELDSEDCVGFWLL 327
GLHFLA+Q S GFWLL
Sbjct: 241 LAGSRGLHFLAVQPAEGSPQIAGFWLL 267
>gi|428314314|ref|YP_007125291.1| hypothetical protein Mic7113_6296 [Microcoleus sp. PCC 7113]
gi|428255926|gb|AFZ21885.1| Protein of unknown function (DUF1092) [Microcoleus sp. PCC 7113]
Length = 278
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 68/276 (24%), Positives = 117/276 (42%), Gaps = 54/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD + Y + + +N+ L + + G
Sbjct: 4 WQADFYRRPLRDATGQVLWELLICDATRHFTYQAWCAQSEVNANWL---VAQLRQAAGDN 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q +++ A ++L I P++ +L WL++R Y + G+ +
Sbjct: 61 WPDVIQVFRPQSLSLMEAAAQQLGIAVEPTRGTTTLKQWLQQR-ALQYPKQEGYTAEAYN 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP--------------------------------- 247
+A+D P P+ LP+NL+GD+W F +P
Sbjct: 120 PIAIDKPPPLPLPENLWGDRWRFASIPAGNIEEAFGDRPIPILEMPESLLPLNLGLASTV 179
Query: 248 ---------------FSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTT 291
+ W+ + S+ A LIL G+ R++ A ++ V T
Sbjct: 180 AVPGVIIDGGRKSMQLARWLQNVTPVSLNYIAGAPDGLILEAGLVDRWVVATFEDTEVAT 239
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A +E + GLHFL +Q + GFWLL
Sbjct: 240 A-ARMYEQRQSLSQGLHFLLVQPDDSGMTYTGFWLL 274
>gi|427420079|ref|ZP_18910262.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
gi|425762792|gb|EKV03645.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 7375]
Length = 285
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 67/279 (24%), Positives = 114/279 (40%), Gaps = 51/279 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+LDF RP+ + + +WEL+VC ++ Y + P +++ L+ I G
Sbjct: 5 WQLDFYRRPLKNTDNQPLWELLVCTPNMDFSYGETCPQPEADAMWLRHQIKQAIHRAGY- 63
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ ++ FR Q QT+ AC+ELDI +R +L WL +R Y + +
Sbjct: 64 RPKVLQVFRPQTQTLTEVACRELDIPVETQRRLPTLKQWLRQR-NAWYPNLKTYTGEAYS 122
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQL---------------------------------- 246
A++ P+ LPDNL+G+ W F L
Sbjct: 123 PFAIERSTPIPLPDNLWGETWRFAGLSNADLLRFQYEAIPVRSIPKELLPLEIGLSSTVL 182
Query: 247 -------------PFSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTS 292
+ W++ ++ ++ + LIL G+ R++ ++ + V +
Sbjct: 183 IPGVVIDGGQRSMALTQWLDSVQPAFLKYIAGQPDGLILEAGLCDRFVLTTFEDSDVRGA 242
Query: 293 EAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
A A+E K GLHFL I+ + G WLL + P
Sbjct: 243 -ANAFEQRKVTSKGLHFLLIRPDDSGMTYSGLWLLQESP 280
>gi|411116983|ref|ZP_11389470.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
gi|410713086|gb|EKQ70587.1| Protein of unknown function (DUF1092) [Oscillatoriales
cyanobacterium JSC-12]
Length = 285
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 112/279 (40%), Gaps = 53/279 (18%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
++ WE+D RP+ D G +WELVVCD + +T IN+ + I + D
Sbjct: 1 MSVWEVDCYRRPLQDEAGNPLWELVVCDTEGAFTWTALCQQAQINADWVAAQIRDLVRD- 59
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
P+P+ I FR Q ++ C +L I P++ L +L+E T Y +PG+
Sbjct: 60 -RPLPQIIHVFRPQTLHLLEPVCTQLGISIEPTRHTPYLKTYLQE-LATQYPNYPGYTGQ 117
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQL-------PFSAWM------------------ 252
LALD P+ L L G+ W F L F+ M
Sbjct: 118 LYDPLALDQSPPLPLDATLLGNHWQFATLAAGDIADAFTGRMIPILEMPEFLLPLNLGLA 177
Query: 253 NGLEVCSIETDTARGS------------------------LILSVGISTRYIYANYKKNP 288
+ + V + + R S L+L G+ R+I A + ++P
Sbjct: 178 SMVPVPGVVIEAGRRSLRLAQWLKQTRPVALNYIPGSPNGLVLQAGLVDRWIIATF-EDP 236
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A +E K A GLHFL +Q + GFWLL
Sbjct: 237 HVAASATEFEQRKIASRGLHFLLVQPDDSGMTYSGFWLL 275
>gi|75906361|ref|YP_320657.1| hypothetical protein Ava_0136 [Anabaena variabilis ATCC 29413]
gi|75700086|gb|ABA19762.1| Protein of unknown function DUF1092 [Anabaena variabilis ATCC
29413]
Length = 264
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 111/275 (40%), Gaps = 66/275 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P D+ GK +WEL++CD + +YT P + NS L I G
Sbjct: 5 WQADFYRSPQQDLDGKILWELLICDVNRGFEYTATCPQSEANSSWLTSQIQLAA---GEK 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++I A + L I P ++ +L WL+E+ ++
Sbjct: 62 LPDIIQVFRPQSLSLIEAAGRNLGINVEPQRQTPALKQWLQEKQYSI------------- 108
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQL---------------------PFSAWMNGL---- 255
A+D P P LPDNL+GD+W F + P GL
Sbjct: 109 --AIDKPPPTPLPDNLWGDEWRFASIQAGDVVDLFSDRPIPILSLPEPLKPINLGLASTV 166
Query: 256 EVCSIETDTARGSLILSVGIS-TRYIYANYKKNP----------------VTTSEAEAWE 298
+ + R SL L+ I+ TR + NY VT +AE
Sbjct: 167 AIPGVVIYGGRRSLNLARWIAQTRPVALNYIAGAPDGLILEAGLVDRWILVTFEDAEVKA 226
Query: 299 AAK------KACGGLHFLAIQEELDSEDCVGFWLL 327
AAK K GLHFL +Q + GFWLL
Sbjct: 227 AAKVYEQRQKQSRGLHFLLVQPDDSGMTYTGFWLL 261
>gi|427710618|ref|YP_007052995.1| hypothetical protein Nos7107_5360 [Nostoc sp. PCC 7107]
gi|427363123|gb|AFY45845.1| protein of unknown function DUF1092 [Nostoc sp. PCC 7107]
Length = 265
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 115/276 (41%), Gaps = 68/276 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF D G+ +WEL++CD + S +YT P + NS + E I G
Sbjct: 4 WQADFYRSSQQDKSGQVLWELLICDVNRSFEYTAACPQSEANSSWVIEQIQQAA---GEK 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P I+ FR Q ++I A + L I ++R L+L WL+ER+ V
Sbjct: 61 LPNVIQVFRPQSLSLIETAGRNLGIVVEATRRTLALKQWLQERHSAV------------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP-------FS------------------AWMNGL 255
+L+ P P+ LP+NL+G++W L FS + +
Sbjct: 108 --SLEKPAPLPLPENLWGEQWRLATLAAGDLETEFSDRPIPILSMPEFLTPINLGLASTI 165
Query: 256 EVCSIETDTARGSLILSVGIST------------------------RYIYANYKKNPVTT 291
V + R S+ L+ ++T R++ A ++ VTT
Sbjct: 166 PVPGVVIYGGRQSMRLARWLATAKPVALNYIAGAPDGLILEAGLVDRWVLATFEDAEVTT 225
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ +E K+ GLHFL IQ + GFWLL
Sbjct: 226 A-AKIYEQRKQQSRGLHFLLIQPDDSGMTYSGFWLL 260
>gi|428225981|ref|YP_007110078.1| hypothetical protein GEI7407_2551 [Geitlerinema sp. PCC 7407]
gi|427985882|gb|AFY67026.1| protein of unknown function DUF1092 [Geitlerinema sp. PCC 7407]
Length = 283
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 115/279 (41%), Gaps = 52/279 (18%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T WE DF RP+ + G+ +WEL++CD L + P + L + +
Sbjct: 1 MTIWEADFYRRPLRNAAGQPLWELLLCDQQRQLILSAMCPQPDATAAWLTGQLRSHFAA- 59
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
GV PE++R FR Q +++ AC+ L I ++R ++ L R + Y + P +
Sbjct: 60 GVTPPERLRVFRPQSLSLLQVACEPLGIAVEGTRRTPAIKAALLAR-ASAYAQMPEYSSE 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAF----------------------------VQLPFS 249
+ L ++ P LP+ L+GD+W F VQL +
Sbjct: 119 AYQPLYIEKAPPAPLPETLWGDRWRFGAMAAGDLISVFRHRPVPILEMPTELLPVQLGLA 178
Query: 250 A--------------------WMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNP 288
+ W+ + S+ T LIL G+S R++ A +P
Sbjct: 179 STTPIPGVILEGGRRSLQIARWLQAHQPVSLHYRTGDPDGLILEAGLSDRWVIAT-TTDP 237
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A +E ++A GLHFL I+ + + FWLL
Sbjct: 238 DMAAAARTYEERQQASQGLHFLLIEPDDSGQTSTAFWLL 276
>gi|17229810|ref|NP_486358.1| hypothetical protein all2318 [Nostoc sp. PCC 7120]
gi|17131410|dbj|BAB74017.1| all2318 [Nostoc sp. PCC 7120]
Length = 264
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 112/275 (40%), Gaps = 66/275 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P D+ GK +WEL++CD + +YT P + NS L I G
Sbjct: 5 WQADFYRSPRQDLDGKILWELLICDVNRGFEYTATCPQSEANSSWLTTQIQLAA---GEK 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++I A + L I P ++ +L WL+E+ ++
Sbjct: 62 LPDIIQVFRPQSLSLIEAAGRNLGINVEPQRQTPALKQWLQEKQYSI------------- 108
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQL---------------------PFSAWMNGL---- 255
A+D P P LPDNL+GD+W F + P GL
Sbjct: 109 --AIDKPPPTPLPDNLWGDEWRFASIQAGDIVDLFSDRPIPILSLPEPLKPINLGLASTV 166
Query: 256 EVCSIETDTARGSLILSVGIS-TRYIYANYK----------------------KNPVTTS 292
+ + + SL L+ I+ TR + NY ++ T+
Sbjct: 167 AIPGVVIYGGKRSLNLARWIAQTRPVALNYIAGAPDGLILEAGLVDRWILVTFEDAEVTA 226
Query: 293 EAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
A+ +E +K GLHFL +Q + GFWLL
Sbjct: 227 AAKVYEQRQKQSRGLHFLLVQPDDSGMTYTGFWLL 261
>gi|123968120|ref|YP_001008978.1| hypothetical protein A9601_05851 [Prochlorococcus marinus str.
AS9601]
gi|123198230|gb|ABM69871.1| conserved hypothetical protein [Prochlorococcus marinus str.
AS9601]
Length = 301
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 126/290 (43%), Gaps = 64/290 (22%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++C + K P N +NS+ L +A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIICSTRSYKTEDVFLWNKKCPANEVNSVWLTKALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P +RF+RS M++II ++ + + I+ I S+R +LL +E + +Y
Sbjct: 75 AISEAKKQGWEKPSIVRFWRSSMKSIIKRSLEAVSIEAIVSRRTFNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNL------------------------FGD 239
+ G+ +G +LA ++NP P LP+ + FGD
Sbjct: 135 KEKGYVRG---VLAPTFTSKMENP-PTPLPEAVRGDALTISEISIGELKSAENWPMEFGD 190
Query: 240 KWAFVQ-------------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYI 280
+ Q L SAW + LE I+ + LIL +++
Sbjct: 191 IFPIQQNVNDNYLVPGLRLFSKDRSLALSAWFSCLE--PIKLVVNKNQLILEAAEDDKWL 248
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ + + E KK G F++IQ E GFW+L D+
Sbjct: 249 VTDLPEKDANILNTKFLE-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|427731600|ref|YP_007077837.1| hypothetical protein Nos7524_4487 [Nostoc sp. PCC 7524]
gi|427367519|gb|AFY50240.1| Protein of unknown function (DUF1092) [Nostoc sp. PCC 7524]
Length = 268
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 68/276 (24%), Positives = 110/276 (39%), Gaps = 68/276 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P D G+ +WEL++C+ + S +Y + NS L I G
Sbjct: 5 WQADFYRSPQQDAAGQALWELLICNVNRSFEYVATCFQSEANSSWLTAQIQQAA---GEN 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +++ A + L I P +R +L WL+E+ ++P
Sbjct: 62 LPDVIQVFRPQSLSLMEVAGRNLGITVEPQRRTSALKQWLQEK------KYP-------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFV------------------------------------ 244
+A+D P P LPDNL+G++W F
Sbjct: 108 -IAIDKPPPAPLPDNLWGEEWRFATIAAGDLVDLFSDRPIPMLSVPESLQPINLGLASTI 166
Query: 245 ------------QLPFSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTT 291
L + W+ S+ A L+L G++ R+I ++ V
Sbjct: 167 AVPGVIIYGGRRSLRLAQWIQQTRPVSLNYIAGAPDGLVLEAGLADRWIVVTFEDAEVAA 226
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 227 A-AKVYEQRKQQSRGLHFLIVQPDDSGMTYSGFWLL 261
>gi|428306984|ref|YP_007143809.1| hypothetical protein Cri9333_3474 [Crinalium epipsammum PCC 9333]
gi|428248519|gb|AFZ14299.1| protein of unknown function DUF1092 [Crinalium epipsammum PCC 9333]
Length = 277
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 111/276 (40%), Gaps = 56/276 (20%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF RP+ + +G+ WELV+CD + S Y + N + + +
Sbjct: 4 WQVDFYRRPLKNQQGEVWWELVICDLTRSFTYEVQCRQSEANVTWIVSQLQEAAGN-AKH 62
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q +I A ++L+IK ++ +L L+++ E T +
Sbjct: 63 LPDIIQVFRPQSFNLIQLAGQQLNIKVEATRHTYALKELLQDKAEYYSTNGDNYNP---- 118
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP--------------------------------- 247
LALD P P LP+NL G++W F LP
Sbjct: 119 -LALDKPPPTPLPENLLGEQWRFATLPAGDLVEAFAERPIPVLEMPEFLLPINLGLASTV 177
Query: 248 ---------------FSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTT 291
+ W+ + S+ L+L G+ R++ A ++ V
Sbjct: 178 AVPGVIIYGGRQSLRLARWLEEAKPVSLHFIIGEPAGLVLEAGLVDRWVVATFEDQEVVK 237
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
S A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 238 S-AQTYEQRKQQSKGLHFLLVQPDDSGVTYSGFWLL 272
>gi|254421948|ref|ZP_05035666.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
gi|196189437|gb|EDX84401.1| conserved hypothetical protein [Synechococcus sp. PCC 7335]
Length = 300
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/279 (22%), Positives = 109/279 (39%), Gaps = 51/279 (18%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+LDF RP+ D +G +WEL++CD +LS Y ++ + N+ ++ + I D
Sbjct: 16 WQLDFYRRPLKDSQGNPLWELLICDETLSFTYGEFCIQSEANAPWIRHQL-EIASDRAGG 74
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P I FR Q +++ AC+ L +K + +L WL +R Y + K S
Sbjct: 75 WPNDIEIFRPQTVSLVEVACRNLPVKVRSRRDVPTLKRWLLQR-AAWYPTLKSYTKQSYE 133
Query: 221 LLALDNPFPMELPDNLFGDKWAFV------------------------------------ 244
+AL+ P P+ + ++L G+ W F
Sbjct: 134 PIALERPAPVPIAEHLMGEGWQFAAISTDELQRLSYEPIPVQTVPAELMPIRLGLPSTLL 193
Query: 245 -----------QLPFSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTS 292
L + W+ + ++ L+L G+ R+I A ++ V +
Sbjct: 194 IPGVVIDGGRQSLGLAQWLQSVNPVMLQYIAGTPDGLLLEAGLVERWIMATFEDEAVAEA 253
Query: 293 EAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLP 331
A + K A GLH L ++ + G WLL P
Sbjct: 254 -ARTFTERKIAANGLHLLLVRPDDSGLTYTGLWLLQSTP 291
>gi|126695893|ref|YP_001090779.1| hypothetical protein P9301_05551 [Prochlorococcus marinus str. MIT
9301]
gi|126542936|gb|ABO17178.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9301]
Length = 301
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 124/290 (42%), Gaps = 64/290 (22%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++C + K P N +NS+ L A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIICSTRSYKTEDVFLWNKKCPANEVNSVWLTRALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P +RF+RS M++II K+ + I+ I S+R +LL +E + +Y
Sbjct: 75 AISEAKKQGWEKPSIVRFWRSSMKSIIKKSLDAVSIEAIVSRRTYNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNL------------------------FGD 239
+ G+ +G +LA ++NP P LP+ + FGD
Sbjct: 135 KEKGYVRG---VLAPTFTSKMENP-PTPLPEAVRGDALTISEISIGELKSAQNWPMEFGD 190
Query: 240 KWAFVQ-------------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYI 280
+ Q L SAW + LE I+ + LIL +++
Sbjct: 191 IFPIQQDIDDNYLIPGLRLFSKDRSLALSAWFSCLE--PIKLVVNKNQLILEASEDDKWL 248
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ + + E KK G F++IQ E GFW+L D+
Sbjct: 249 VTDLPEKDANILNTKFLE-NKKNSFGYQFISIQSTPFIEKFAGFWILRDI 297
>gi|427734622|ref|YP_007054166.1| hypothetical protein Riv7116_1045 [Rivularia sp. PCC 7116]
gi|427369663|gb|AFY53619.1| Protein of unknown function (DUF1092) [Rivularia sp. PCC 7116]
Length = 262
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 68/279 (24%), Positives = 118/279 (42%), Gaps = 68/279 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF R + G+ +W+L +CD +L L+Y P + NS + I D
Sbjct: 3 WQIDFYRRSQPEKSGQVLWDLSICDSTLELKYEATCPQSEANSSWVVSQIQQAASD---S 59
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ ++ FR Q ++I +A K L IK ++R ++L WL+++ +
Sbjct: 60 LPDVMQVFRPQSLSLIEQAGKILGIKVEATRRTIALKTWLKQKQQ--------------- 104
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP-------FS------------------AWMNGL 255
ALD P P+ L +N++GDKW+F L FS + +
Sbjct: 105 FTALDKPPPVPLSENIWGDKWSFATLRAGDIGDFFSERPIPILETPDLLLPINMGLASTV 164
Query: 256 EVCSIETDTARGSLILS------------------------VGISTRYIYANYKKNPVTT 291
V + R S++L+ G+ R+I A ++ V+
Sbjct: 165 PVPGVVIYGGRKSMLLARWLKENRPVALNYIAGAPDGLVLEAGLVDRWIVATFEDEEVSQ 224
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ A ++ K+ GLHFL +Q + GFWLL ++
Sbjct: 225 A-AALYQQRKQQSQGLHFLLVQPDDSGMTYTGFWLLQEV 262
>gi|434388752|ref|YP_007099363.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
gi|428019742|gb|AFY95836.1| Protein of unknown function (DUF1092) [Chamaesiphon minutus PCC
6605]
Length = 273
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 116/281 (41%), Gaps = 62/281 (22%)
Query: 97 SITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
+I W+ D SRP + RG+ +WELV+C +T P +N+ + I D
Sbjct: 2 TIMLWQADISSRPQQNDRGETLWELVICAADGGWFHTAICPQKQVNAEWIAAQIKLAATD 61
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
+P I+ FR Q +I A ++L I+ ++R ++L L+++ + + +P +Q
Sbjct: 62 ---KLPTAIQVFRPQSLGLIQTAAQKLGIEVEATRRTIALKKLLQQQTQNYH--NPNYQP 116
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------------- 247
LA+++P P +PD L G+KW FV L
Sbjct: 117 -----LAIESPPPQPIPDYLMGEKWQFVTLTAGQLVADFADRPIPIVSMPDYLLPPHWGL 171
Query: 248 -------------------FSAWMNGLEVCSIE--TDTARGSLILSVGISTRYIYANYKK 286
+ W+ E S+ D G L+L VG++ R++ +
Sbjct: 172 GANVAIPGVIIYGATQSMRLARWIADTEPVSLNYLGDDPGG-LVLDVGLADRWVMVTFND 230
Query: 287 NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
V+ + A +EA K+ GLHFL + + G WLL
Sbjct: 231 AEVSQA-ARLYEARKRLVHGLHFLLVTPDDSGITYSGIWLL 270
>gi|33861086|ref|NP_892647.1| hypothetical protein PMM0529 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33639818|emb|CAE18988.1| conserved hypothetical protein [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 301
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 124/287 (43%), Gaps = 58/287 (20%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYF------PNNVINSITLKEAIV 151
I++WELDF SRPI++ GKK WEL++ S S + K F P N +NSI L +A+
Sbjct: 15 ISDWELDFYSRPIIETNGKKRWELIIS-SSKSFKTEKIFLWNKVCPANEVNSIWLTKALN 73
Query: 152 AICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVY 208
+D G P KIRF+R+ M++II K+ + + I+ + S+R L +E +Y
Sbjct: 74 EALNDAEIEGWAKPLKIRFWRASMKSIIKKSIENIGIEALVSRRTYELFDRIEFLEREIY 133
Query: 209 TRHPGFQKGS-------------KPL------------------LALDNPFPMELP---- 233
G+ +G KPL L L +P+E
Sbjct: 134 PLEQGYVRGVLAPTFTSNILNDPKPLPEAVRGDALTISEISIEELKLAKNWPIEFGDIFP 193
Query: 234 -------DNLFGDKWAFVQ---LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYAN 283
DNL F + L +AW + LE ++ + LIL +++ +
Sbjct: 194 IQSSIKNDNLVPGLRLFSKDRSLALAAWFSSLE--PVKLLIKQNQLILEASEDDKWLVTD 251
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
++ + + +KK G F++IQ E GFW+L D+
Sbjct: 252 LQEKDAKVLN-DKFTQSKKDSYGYQFISIQATPFIEKFAGFWILKDV 297
>gi|78778914|ref|YP_397026.1| hypothetical protein PMT9312_0529 [Prochlorococcus marinus str. MIT
9312]
gi|78712413|gb|ABB49590.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9312]
Length = 301
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 127/294 (43%), Gaps = 57/294 (19%)
Query: 91 EETDPE-SITEWELDFCSRPILDIRGKKIWELVVC-----DGSLSLQYTKYFPNNVINSI 144
+ET PE I++WELDF SRPI++ GKK WEL++C + + K P + +NSI
Sbjct: 7 KETSPELKISDWELDFYSRPIIEANGKKRWELIICSTRSYETKDIFLWNKKCPASEVNSI 66
Query: 145 TLKEAIVAICDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLE 201
L +A+ ++ G P +RF+RS M++II K+ + +I+ + S+R +L +E
Sbjct: 67 WLTKALNEALNEARKEGWAKPSIVRFWRSSMKSIIKKSLEATNIEALVSRRTYNLFDRIE 126
Query: 202 ERYETVYTRHPGFQKG--SKPLLALDNPFPMELPDNL----------------------- 236
+ +Y + G+ +G + + P LP+ +
Sbjct: 127 FLEKDIYPKEKGYVRGVLAPTFTSTMESSPTPLPEAVRGDALTISEISVGELKSAQNWPI 186
Query: 237 -FGDKWAFVQ-------------------LPFSAWMNGLEVCSIETDTARGSLILSVGIS 276
FGD + Q L SAW + LE I+ ++ LIL
Sbjct: 187 EFGDIFPIHQPLDNNELIPGLRLFSKERSLALSAWFSSLE--PIKLIISKNQLILEASED 244
Query: 277 TRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+++ + + + E KK G F++IQ E GFW+L D+
Sbjct: 245 DKWLVTDLPEKDANILSTKFLE-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|307151401|ref|YP_003886785.1| hypothetical protein Cyan7822_1516 [Cyanothece sp. PCC 7822]
gi|306981629|gb|ADN13510.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7822]
Length = 278
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 66/279 (23%), Positives = 111/279 (39%), Gaps = 66/279 (23%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF R ++ G+ +WEL++ D + Y + P ++ NS L + +
Sbjct: 4 WQADFYKRQQMNQAGEILWELLITDSLGKIIYERQCPQSMANSDWLLVQLQQATEQFS-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++T ++L + + ++R +L L++R P
Sbjct: 62 -PDVIQVFRPQSLALLTSCAEKLGLTVVATRRTWALKKVLQQRAAAT----------KDP 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP--------------------------------- 247
LD P P LP NL+G++W F +
Sbjct: 111 QDILDKPPPQPLPANLWGEEWRFAHVAAGDLIEFFKDRPIPLLNIPEELLPINLGLASTL 170
Query: 248 ---------------FSAWMNG---LEVCSIETDTAR-GSLILSVGISTRYIYANYKKNP 288
+ W+N + + I T+ + G L+L G+ R+I A + ++P
Sbjct: 171 PIPGMVIYGGRTSMYLARWLNQENPVAINYISTEVGKSGGLVLESGLVNRWILATF-EDP 229
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
AE +E K+ C GLHFL IQ + GFWLL
Sbjct: 230 EVVVAAEKYEQRKQLCRGLHFLTIQPDSSGMTYSGFWLL 268
>gi|157412945|ref|YP_001483811.1| hypothetical protein P9215_06101 [Prochlorococcus marinus str. MIT
9215]
gi|157387520|gb|ABV50225.1| Conserved hypothetical protein [Prochlorococcus marinus str. MIT
9215]
Length = 301
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 125/290 (43%), Gaps = 64/290 (22%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++C + K P N +NS+ L +A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIICSTRSYKTEDVFLWNKKCPANEVNSVWLTKALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P +RF+RS M++II K+ + + I+ I S+R +LL +E + +Y
Sbjct: 75 AISEAKKQGWEKPSIVRFWRSSMKSIIKKSLEAVSIEAIVSRRTYNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNL------------------------FGD 239
+ G+ +G +LA ++N P LP+ + FGD
Sbjct: 135 KEKGYVRG---VLAPAFTSKIENS-PTPLPEAVRGDALTISEISIGELKSAENWPMEFGD 190
Query: 240 KWAFVQ-------------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYI 280
+ Q L SAW + LE I+ LIL +++
Sbjct: 191 IFPIKQDLDDNYLVPGLRLFSKDRSLALSAWFSCLE--PIKLVVNENQLILEASEDDKWL 248
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ K ++ + KK G F++IQ E GFW+L D+
Sbjct: 249 VTDLPKKDANILNSKFLD-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|428206227|ref|YP_007090580.1| hypothetical protein Chro_1184 [Chroococcidiopsis thermalis PCC
7203]
gi|428008148|gb|AFY86711.1| protein of unknown function DUF1092 [Chroococcidiopsis thermalis
PCC 7203]
Length = 327
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 72/329 (21%), Positives = 117/329 (35%), Gaps = 110/329 (33%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSL-----------------QYTKYFPNNVINS 143
W+ DF RP D G+ +WEL++CD + +Y P N+
Sbjct: 8 WQADFYRRPWQDDTGQVLWELLICDAEGGMLFDHATQTRSDHRTGNFRYEAICPQAAANA 67
Query: 144 ITLKEAIV-------------------------------AICDDLGVPIPEKIRFFRSQM 172
L E + ++ + +P+ I+ FR Q
Sbjct: 68 SWLVEQLQLAASNSSEFFSTTPKSISPSPPYQGGLGGSESVTGQTELALPDIIQVFRPQS 127
Query: 173 QTIITKACKELDIKPIPSKRCLSLLLWLEER---YETVYTRHPGFQKGSKPLLALDNPFP 229
++I A ++L I P++R +L WL R Y T +P LA+D P P
Sbjct: 128 LSLIATAGQKLGITVEPTRRTGALKQWLRSRIPQYSTTGAYNP---------LAVDKPPP 178
Query: 230 MELPDNLFGDKWAFVQLP------------------------------------------ 247
+ LP+NL+GD+W F LP
Sbjct: 179 VPLPENLWGDRWRFASLPARDLEAAFKDRPLPILDMPEFLLPLNLGLASTIAVPGIIIYG 238
Query: 248 ------FSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTSEAEAWEAA 300
+ W+ + ++ LIL G++ R++ A + + S A+ +
Sbjct: 239 GRKSMQLARWLQAAQPIALNYVPGELAGLILEAGLADRWVVATFSDSEAIAS-AQTYAQR 297
Query: 301 KKACGGLHFLAIQEELDSEDCVGFWLLLD 329
++ GLHFL +Q + S GFWLL D
Sbjct: 298 QQQSQGLHFLLVQPDDSSVTYTGFWLLRD 326
>gi|298492811|ref|YP_003722988.1| hypothetical protein Aazo_4636 ['Nostoc azollae' 0708]
gi|298234729|gb|ADI65865.1| protein of unknown function DUF1092 ['Nostoc azollae' 0708]
Length = 265
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 116/278 (41%), Gaps = 72/278 (25%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ D G+ +WEL++CD + L+Y P + NS L E +
Sbjct: 5 WQTDFYRSPLRDSAGQVLWELLICDPTRKLEYVATCPQSQANSNWLTEQFQLAGAE---K 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++I+ A L I P++ L+L WL+E+ ++P
Sbjct: 62 LPDIIQVFRPQSLSLISAAASNLGINIEPTRSTLALKQWLQEK------KYP-------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAF-------------------VQLP-------------- 247
+ +D P L +NL+G++W F +Q+P
Sbjct: 108 -ILIDKLPPEPLLENLWGEEWRFANISAGDIVDEFTDRPIPILQIPEFVQPINLGLASTV 166
Query: 248 ---------------FSAWMNGLEVCSIETDTARGS---LILSVGISTRYIYANYKKNPV 289
+ W+ E ++ + G+ LIL G++ R+I A + + V
Sbjct: 167 RIPGVVIYGGRQSMRLAKWLQ--EANAVSLNYIAGTPDGLILDAGLADRWILATFDDDEV 224
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ + K+ GLHFL +Q + GFWLL
Sbjct: 225 AAA-AKVYTQRKQVSKGLHFLLVQPDDSRMTYSGFWLL 261
>gi|443315479|ref|ZP_21044967.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
gi|442784905|gb|ELR94757.1| Protein of unknown function (DUF1092) [Leptolyngbya sp. PCC 6406]
Length = 278
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 64/279 (22%), Positives = 109/279 (39%), Gaps = 52/279 (18%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T WE+DF RP D +G +WEL++CD + Y L+ +
Sbjct: 1 MTRWEVDFYRRPCEDGQGTPLWELLICDRAFDFTYGAMVSQPEATVDWLQGQLKTAIAKA 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G+P P++I FR ++ A L I IP+++ +L WL R Y P +
Sbjct: 61 GIP-PDEICAFRPPAVALLQAAAPPLGIAVIPTRQTPTLKQWLVTRSRW-YPTLPTYSGA 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQL--------------PFSA----WM------- 252
LA+D P P+ +P++L+G++W F L P + W+
Sbjct: 119 PYDPLAVDRPAPVPVPESLWGEQWRFGALSAADFQEELTQEPIPIQSLPLDWLPLQMGLA 178
Query: 253 NGLEVCSIETDTAR------------------------GSLILSVGISTRYIYANYKKNP 288
+ + + + D R LIL G+ R++ + ++P
Sbjct: 179 STIPIPGVIIDGGRRALALAQWLAAQDPVALNPMVGNPAGLILEAGLCDRWVLTTF-EDP 237
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A + + GLHFL ++ + G WLL
Sbjct: 238 QVQAAARTFGERQLQAQGLHFLLVRPDDSGITYTGLWLL 276
>gi|409989581|ref|ZP_11273128.1| hypothetical protein APPUASWS_02193 [Arthrospira platensis str.
Paraca]
gi|291570627|dbj|BAI92899.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409939557|gb|EKN80674.1| hypothetical protein APPUASWS_02193 [Arthrospira platensis str.
Paraca]
Length = 277
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 73/280 (26%), Positives = 110/280 (39%), Gaps = 62/280 (22%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI----TLKEAIVAICDD 156
W+ DF RP+ D RG+ +WEL+VCD P + NS LKE V
Sbjct: 4 WQADFYRRPLEDERGQPLWELLVCDQLGDRLLVATCPQSEANSTWLLNQLKEMFVT---- 59
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
P+ I+ FR ++ K+L + ++R L L L E +Y + G+
Sbjct: 60 ---DQPDIIQVFRPACLSLFEVVGKQLGVTVQATRRTLGLKKLLAEMM-LIYPQMTGYTG 115
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS-------------------------AW 251
+ LA+D P+ LP+NL+GD+W F LP
Sbjct: 116 QNYDPLAIDKLPPLPLPENLWGDRWRFATLPAGDLQEVFGDRPIPILDMPSILLPLNLGL 175
Query: 252 MNGLEVCSIETDTARGS------------------------LILSVGISTRYIYANYKKN 287
+ + + + D R S LIL G+S R++ A + +
Sbjct: 176 ASTVAISGVVIDGGRQSMGLARWLQSVKPVGFNYIPGQPDGLILEAGLSDRWVVATFDDD 235
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
V + A +E K+ GLHFL +Q + GFWLL
Sbjct: 236 DVAQA-ARMFETRKRLAKGLHFLLVQPDDSGVTYTGFWLL 274
>gi|376004228|ref|ZP_09781975.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|423065962|ref|ZP_17054752.1| hypothetical protein SPLC1_S370220 [Arthrospira platensis C1]
gi|375327434|emb|CCE17728.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|406712461|gb|EKD07646.1| hypothetical protein SPLC1_S370220 [Arthrospira platensis C1]
Length = 277
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 71/276 (25%), Positives = 108/276 (39%), Gaps = 54/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD P + NS L + + I D
Sbjct: 4 WQADFYRRPLRDDSGQPLWELLLCDELGDRLLVATCPQSEANSTWLLKQLEEIWD---TD 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR + K+L + ++R L L L E +Y + PG+
Sbjct: 61 QPDLIQVFRPACLNLFEVVGKQLGVTVQGTRRTLGLKKLLAEMM-LIYPQMPGYTGEDYD 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFS-------------------------AWMNGL 255
LA+D P+ LP+NL+G +W F LP + +
Sbjct: 120 PLAIDKLPPLPLPENLWGTRWRFATLPAGDLQEVFGDRPIPILDMPSFLLPLNLGLASTV 179
Query: 256 EVCSIETDTARGS------------------------LILSVGISTRYIYANYKKNPVTT 291
+ + D R S LIL G+S R++ A + + V
Sbjct: 180 AISGVVIDGGRQSMRLARWLQSVKPVGLNYIPGQPDGLILEAGLSDRWVVATFDDDDVAQ 239
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A +E K+ GLHFL IQ + GFWLL
Sbjct: 240 A-ARMFETRKRLAKGLHFLLIQPDDSGVTYTGFWLL 274
>gi|209524029|ref|ZP_03272580.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
gi|209495404|gb|EDZ95708.1| protein of unknown function DUF1092 [Arthrospira maxima CS-328]
Length = 277
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 71/276 (25%), Positives = 108/276 (39%), Gaps = 54/276 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ D G+ +WEL++CD P + NS L + + I D
Sbjct: 4 WQADFYRRPLRDDSGQPLWELLLCDEFGDRLLVATCPQSEANSTWLLKQLEEIWD---TD 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR + K+L + ++R L L L E +Y + PG+
Sbjct: 61 QPDLIQVFRPACLNLFEVVGKQLGVTVQGTRRTLGLKKLLAEMM-LIYPQMPGYTGEDYD 119
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLPFS-------------------------AWMNGL 255
LA+D P+ LP+NL+G +W F LP + +
Sbjct: 120 PLAIDKLPPLPLPENLWGTRWRFATLPAGDLQEVFGDRPIPILDMPSFLLPLNLGLASTV 179
Query: 256 EVCSIETDTARGS------------------------LILSVGISTRYIYANYKKNPVTT 291
+ + D R S LIL G+S R++ A + + V
Sbjct: 180 AISGVVIDGGRQSMRLARWLQSVKPVGLNYIPGQPDGLILEAGLSDRWVVATFDDDDVAQ 239
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A +E K+ GLHFL IQ + GFWLL
Sbjct: 240 A-ARMFETRKRLAKGLHFLLIQPDDSGVTYTGFWLL 274
>gi|218438370|ref|YP_002376699.1| hypothetical protein PCC7424_1387 [Cyanothece sp. PCC 7424]
gi|218171098|gb|ACK69831.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 7424]
Length = 271
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 115/286 (40%), Gaps = 76/286 (26%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF R + D +G+ +WELV+ D ++ + P + NS L +
Sbjct: 4 WQGDFYKRSLFDQQGEMLWELVITDQQGTMIHEAKCPQSQANSDWLIRQLQQATQK---N 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
IP+ I+ FR Q ++T A ++L IK +P++R +L L+ R +
Sbjct: 61 IPDLIQVFRPQSIGLLTSAAEKLGIKVVPTRRTSALKEVLKRRSTNT----------TID 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFV------------------------------QLPFSA 250
+ LD P P LP+NL+G++W F+ LP +
Sbjct: 111 VSTLDRPPPQGLPENLWGEQWGFISLKAGDLIQFFRDRPIPIVDMPEDLLPINLNLPSTV 170
Query: 251 WMNGLEVCSIETDTARGSLILS-----------------VGIST----------RYIYAN 283
++ G+ + R S+ L+ +G+S R+I A
Sbjct: 171 FIPGIVIYG-----GRKSMYLARWLEEQQPVSISYIPTQIGLSGGLVLESGLVDRWILAT 225
Query: 284 YKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
+ ++P A+ +E K GLHFL +Q + GFWLL D
Sbjct: 226 F-EDPEMAQAAQKYEDRKVMSKGLHFLTVQPDDSGITYTGFWLLND 270
>gi|123965828|ref|YP_001010909.1| hypothetical protein P9515_05931 [Prochlorococcus marinus str. MIT
9515]
gi|123200194|gb|ABM71802.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 301
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/290 (24%), Positives = 118/290 (40%), Gaps = 66/290 (22%)
Query: 99 TEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAIVAI 153
++WELDF SRPI++ GKK WEL++ + K P N +NSI L +++
Sbjct: 16 SDWELDFYSRPIIEKNGKKRWELIISSSKTFKTEDIFLWNKICPANEVNSIWLTKSLNEA 75
Query: 154 CDDL---GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTR 210
+D G P KIRF+R+ M++II K+ + + I+ + S+R L +E + VY
Sbjct: 76 LNDAERKGWEKPSKIRFWRASMKSIIKKSIENIGIEALVSRRTYELFDRIEFLEKEVYPL 135
Query: 211 HPGFQKGSKPLLA-------LDNPFPMELPDNLFGDKWAFVQ------------------ 245
G+ +G +LA ++P P LP+ + GD +
Sbjct: 136 ENGYVRG---VLAPTFTSRIANDPTP--LPEAVRGDALTISEISIEELKSAENWPIEFGD 190
Query: 246 -------------------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYI 280
L +AW + LE + + + LIL +++
Sbjct: 191 IFPIKKSLKNENLVPGLRLFSKERSLALAAWFSSLEPVKLHIE--KNQLILEASEDNKWL 248
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ + V + K G F++IQ E GFW+L D+
Sbjct: 249 VTDLSEK-VAKELNNKFTQNKNDSFGYQFISIQSTPFIEKFAGFWILRDI 297
>gi|254526095|ref|ZP_05138147.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9202]
gi|221537519|gb|EEE39972.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9202]
Length = 301
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 122/290 (42%), Gaps = 64/290 (22%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLS-----LQYTKYFPNNVINSITLKEAI-- 150
I++WELDF SRPI++ GKK WEL++ + K P N +NS+ L +A+
Sbjct: 15 ISDWELDFYSRPIIESNGKKRWELIISSTRSYKTEDVFLWNKKCPANEVNSVWLTKALNE 74
Query: 151 -VAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYT 209
++ G P RF+RS M++II K+ + + I+ + S+R +LL +E + +Y
Sbjct: 75 ALSEAKKQGWEKPSIARFWRSSMKSIIKKSLEAVSIEAVVSRRTYNLLDRIEFLEKEIYP 134
Query: 210 RHPGFQKGSKPLLA------LDNPFPMELPDNL------------------------FGD 239
+ G+ +G +LA ++N P LP+ + FGD
Sbjct: 135 KEKGYVRG---VLAPTFTSKMENS-PTPLPEAVRGDALTISEISIGELKSAENWPMEFGD 190
Query: 240 KWAFVQ-------------------LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYI 280
+ Q L +AW + LE I+ LIL +++
Sbjct: 191 IFPIQQDLDDKNLVPGLRLFSKDRSLALAAWFSCLE--PIKLVVNENQLILEASEDDKWL 248
Query: 281 YANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ K + E KK G F++IQ E GFW+L D+
Sbjct: 249 VTDLPKKDANILNTKFLE-NKKNSFGYQFISIQSTPYIEKFAGFWILRDI 297
>gi|359459254|ref|ZP_09247817.1| hypothetical protein ACCM5_11029 [Acaryochloris sp. CCMEE 5410]
Length = 281
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 65/282 (23%), Positives = 112/282 (39%), Gaps = 52/282 (18%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W++DF RP+ + +WEL V D + + P +S L + + +
Sbjct: 1 MTIWQVDFDRRPLKNTEDYPLWELTVYDPQTQMACHRLCPEPNASSEWLMAELQELFTLM 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G P P + + FR + T + + L+I +++ L L+ R + Y + P +
Sbjct: 61 GPP-PTQFQVFRPRSLTFLEDVGRTLNIAVEATRQTPGLKRVLQVRTQ-AYAQLPEYTGQ 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLP------------------------------ 247
S LA++ P +P++L+GD+W FV L
Sbjct: 119 SYDPLAIEPLPPQPMPEHLWGDQWQFVTLAASELESVLLQRPIPLRTVPEMLLPSQLGVA 178
Query: 248 ------------------FSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
+ W+ + SI+ A S LI++ G++ RY+ Y
Sbjct: 179 ADTRIPGVLINGGRRSMQLAQWLQKQQPASIQAMRAELSGLIMAAGLNERYVLVTYDDAD 238
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ S A+ +E K+ GLHFL +Q + G WLL L
Sbjct: 239 I-VSAAQGFEQGKQGSQGLHFLLVQPDDSGVTYTGLWLLSSL 279
>gi|428300978|ref|YP_007139284.1| hypothetical protein Cal6303_4407 [Calothrix sp. PCC 6303]
gi|428237522|gb|AFZ03312.1| protein of unknown function DUF1092 [Calothrix sp. PCC 6303]
Length = 259
Score = 74.3 bits (181), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 62/260 (23%), Positives = 104/260 (40%), Gaps = 70/260 (26%)
Query: 118 IWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIIT 177
IW L +CD + +Y P + NS L ++ +P+KI+ FR Q +++
Sbjct: 19 IWNLSICDANGDFRYKASCPQSEANSTWLTSQFKLAGNE---RLPDKIQVFRPQSLSLVE 75
Query: 178 KACKELDIKPIPSKRCLSLLLWLE-ERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNL 236
A L+I ++R +L LWL+ E+Y T + P PM LP+ L
Sbjct: 76 LAASHLNISVEATRRTDALKLWLQAEKYATTVEKLP----------------PMPLPEKL 119
Query: 237 FGDKWAFVQLP------------------------------------------------F 248
+G+KW F P
Sbjct: 120 WGEKWQFATFPAGGIVDEFSDRLIPILDIPDYLQPINLGIASTTAIPGVIIYGGRQSMQI 179
Query: 249 SAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGL 307
+ W+ ++ S+ A LIL G++ R++ A ++ + VT + A+ +++ ++ GL
Sbjct: 180 ARWLKQVQPVSLNYIAGAPDGLILEAGLADRWVIATFEDSEVTIA-AKNYQSRQQQSHGL 238
Query: 308 HFLAIQEELDSEDCVGFWLL 327
HFL IQ + GFWLL
Sbjct: 239 HFLLIQPDDSGMTYSGFWLL 258
>gi|443312305|ref|ZP_21041923.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
gi|442777543|gb|ELR87818.1| Protein of unknown function (DUF1092) [Synechocystis sp. PCC 7509]
Length = 272
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 104/276 (37%), Gaps = 59/276 (21%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF RP+ + G+ +WEL++CD Y P + NS L E + +
Sbjct: 4 WQADFYRRPLQNEAGEVLWELLICDRDRLFTYEALCPQSQANSKWLIEQLQIAAKNQK-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q +I A + L I ++R +L WL ER ++P
Sbjct: 62 -PDLIQVFRPQSLNLIQLAAENLGIAVEATRRTFALKQWLTER------QYPSNNGEPYN 114
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP--------------------------------- 247
LA+D P L +NL+G++W F L
Sbjct: 115 PLAIDKAPPTPLTENLWGEQWRFASLSAGDIVESFKERLIPIKEMPEFLLPLNLGLASTI 174
Query: 248 ---------------FSAWMNGLEVCSIETDTARGS-LILSVGISTRYIYANYKKNPVTT 291
+ W+ + ++ S L+L G+S R++ + V
Sbjct: 175 TIPGVVIDGGKKSMQLARWLQSIHPVALNYIAGDPSGLVLEAGLSERWVVNTFTDKEVIA 234
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A + ++ GLHFL +Q + GFWLL
Sbjct: 235 A-AVTYTQRQQLTKGLHFLLVQPDNSGMTYSGFWLL 269
>gi|254415147|ref|ZP_05028909.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196177953|gb|EDX72955.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 278
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 119/278 (42%), Gaps = 55/278 (19%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPN-NVINSITLKEAIVAICDDLGV 159
W+ DF RP+ D G+ +WEL++CD + ++ Y + P +V + + V++
Sbjct: 4 WQADFYRRPLQDETGQILWELLICDTTGNVIYQSFCPQPDVTRDWLVSQVQVSVAK---T 60
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
+P+ I+ FR Q + + ++L IK ++R +L L+ER Y +H + +
Sbjct: 61 GLPDAIQVFRPQSFNLFQEVGQQLGIKVEATRRTPALKQRLQER-TLEYPQHENYTGEAY 119
Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNG------------------------- 254
L+LD P P+ LP+NL+GD+W F +P G
Sbjct: 120 NPLSLDKPPPLPLPENLWGDRWRFASIPAGDIEEGFAQRPIPILQMPNELLPLQLGLAST 179
Query: 255 LEVCSIETDTARGS------------------------LILSVGISTRYIYANYKKNPVT 290
+ V + D R S LIL G+ R++ A ++ V
Sbjct: 180 VAVPGVVIDGGRQSMPLARWLQEVQPVALNYIPGAPDGLILEAGLVERWVMATFEDKEVA 239
Query: 291 TSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLL 328
+ A +E K+ GLHFL +Q + GFWLL+
Sbjct: 240 AA-ARLYEQRKQTSQGLHFLLVQPDDSGMTYTGFWLLM 276
>gi|158336954|ref|YP_001518129.1| hypothetical protein AM1_3825 [Acaryochloris marina MBIC11017]
gi|158307195|gb|ABW28812.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 281
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 116/282 (41%), Gaps = 52/282 (18%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W++DF RP+ + +WEL V D + + P ++ L + + +
Sbjct: 1 MTIWQVDFDRRPLKNTEDYPLWELTVYDPQTQMACHRLCPEPNVSPDWLIAELKELFTLM 60
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G P P + + FR + T + + ++LDI +++ L L L+ R + Y + P +
Sbjct: 61 GPP-PTQFQVFRPRSLTFMEEVRQKLDISVEATRQTLGLKRVLQVRTQA-YAQLPEYTGQ 118
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFS---------------------------- 249
S LA++ P +P++L+GD+W FV L S
Sbjct: 119 SYDPLAIEPLPPQPMPEHLWGDQWQFVTLAASELESVLLQRPIPLRTVPEMLLPSQLGLA 178
Query: 250 -------AWMNG-------------LEVCSIETDTARGS-LILSVGISTRYIYANYKKNP 288
+NG + SI+ A S LI++ G++ RY+ Y
Sbjct: 179 ADTRLPGVLINGGRRSMQLAQWLQQQQPASIQAMRAELSGLIMAAGLNERYVLVTYDDAD 238
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
+ + A+ +E K+ GLHFL +Q + G WLL L
Sbjct: 239 IVPA-AQGFEQGKQGSQGLHFLLVQPDDSGVTYTGLWLLSSL 279
>gi|357521231|ref|XP_003630904.1| hypothetical protein MTR_8g104810 [Medicago truncatula]
gi|355524926|gb|AET05380.1| hypothetical protein MTR_8g104810 [Medicago truncatula]
Length = 108
Score = 71.6 bits (174), Expect = 4e-10, Method: Composition-based stats.
Identities = 30/32 (93%), Positives = 31/32 (96%)
Query: 105 FCSRPILDIRGKKIWELVVCDGSLSLQYTKYF 136
FCSRPILD+RGKKIWELVVCD SLSLQYTKYF
Sbjct: 43 FCSRPILDVRGKKIWELVVCDKSLSLQYTKYF 74
>gi|354569034|ref|ZP_08988193.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
gi|353539038|gb|EHC08534.1| protein of unknown function DUF1092 [Fischerella sp. JSC-11]
Length = 264
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 72/150 (48%), Gaps = 20/150 (13%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+ W+ DF G+ +WEL++CD + S Q+ P + +NS V +
Sbjct: 1 MVTWQADFYHHRRQQAAGRVLWELLICDRNRSFQFEASCPQSEVNS---NWVAVQLQLAG 57
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEER-YETVYTRHPGFQK 216
G +P+ I+ FR Q +I +A + L I P++R +L WL+E+ Y TV
Sbjct: 58 GGNLPDVIQVFRPQCLGLIEQAGRSLGINVEPTRRTFALKQWLQEKQYPTV--------- 108
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQL 246
+D P P LP+NL+G++W F L
Sbjct: 109 -------VDKPPPAPLPENLWGEEWRFATL 131
>gi|422302945|ref|ZP_16390303.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389792167|emb|CCI12098.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 265
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 108/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSSPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D+P P +PD G +W F + P
Sbjct: 107 ---------NIDSPPPQPIPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFYPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+T R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A A++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVAPA-ANAYQATKEESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|414078911|ref|YP_006998229.1| hypothetical protein ANA_C13764 [Anabaena sp. 90]
gi|413972327|gb|AFW96416.1| hypothetical protein ANA_C13764 [Anabaena sp. 90]
Length = 265
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 63/278 (22%), Positives = 112/278 (40%), Gaps = 68/278 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ ++ + +WEL+VCD + S ++T P + NS + + + +
Sbjct: 5 WQADFYRIPLQNVEEQILWELLVCDPTRSFEFTASCPQSQANSTWVAQQLQLAGQE---K 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q ++IT A L I ++R L+L WL + ++P
Sbjct: 62 LPDVIQVFRPQSLSLITTAGNNLGIYVEATRRTLALKQWLTAK------QYP-------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQLP--------------------------------- 247
+ +D P+ LP+NL+G++W F +P
Sbjct: 108 -VIVDKLPPLPLPENLWGEEWRFATIPSGDIVDEFTERPIPFLQIPDFLKPINLGLASTV 166
Query: 248 ---------------FSAWMNGLEVCSIE-TDTARGSLILSVGISTRYIYANYKKNPVTT 291
+ W+ S+ A L+L G+ R++ A + VT
Sbjct: 167 PIPGVVIYGGRKSMRLAQWLKESNPVSLNYIGGAPDGLVLEAGLLDRWVLATFTDEEVTA 226
Query: 292 SEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
+ + ++ K+ GLHFL +Q + G WLL D
Sbjct: 227 A-GKLYQERKQLSQGLHFLLVQPDDSGMTYSGLWLLQD 263
>gi|425440676|ref|ZP_18820974.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9717]
gi|389718833|emb|CCH97263.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9717]
Length = 265
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 107/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+ R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLGEINPVFIDHIPTERGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVALA-ANVYQATKQESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425463363|ref|ZP_18842702.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|389833791|emb|CCI21409.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 265
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 107/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDPLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D P P +PD G +W F + P
Sbjct: 107 ---------NIDYPPPQPVPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+T R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A A++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVARA-ANAYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|166364113|ref|YP_001656386.1| hypothetical protein MAE_13720 [Microcystis aeruginosa NIES-843]
gi|166086486|dbj|BAG01194.1| hypothetical protein MAE_13720 [Microcystis aeruginosa NIES-843]
Length = 265
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 107/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDPLGHLIYENSCPQSQANSDWLTQQLQQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D P P +PD G +W F + P
Sbjct: 107 ---------NIDYPPPQPVPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+T R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPSVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A A++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVARA-ANAYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|440753361|ref|ZP_20932564.1| hypothetical protein O53_1739 [Microcystis aeruginosa TAIHU98]
gi|440177854|gb|ELP57127.1| hypothetical protein O53_1739 [Microcystis aeruginosa TAIHU98]
Length = 265
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 107/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----GTVWQLLISDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 107 ---------NIDSPPPQPLPDRFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+T R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVALA-ANVYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425434023|ref|ZP_18814495.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389678222|emb|CCH92899.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 265
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 108/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 107 ---------NIDSPPPQPLPDRFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+T R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTETGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVALA-ANVYQATKEEGQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|390440582|ref|ZP_10228809.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis sp. T1-4]
gi|389836112|emb|CCI32935.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis sp. T1-4]
Length = 265
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 107/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDSLGHLIYENSCPQSQANSDWLTQQLRQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIRLTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+T R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLGEINPVFIDHIPTETGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A ++A K+ G HFL IQ + GFWLL
Sbjct: 218 LTYEDEEVARA-ANVYQATKEESQGWHFLLIQPDDSGRTFTGFWLL 262
>gi|443663863|ref|ZP_21133251.1| hypothetical protein C789_3791 [Microcystis aeruginosa DIANCHI905]
gi|159028218|emb|CAO88028.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331745|gb|ELS46389.1| hypothetical protein C789_3791 [Microcystis aeruginosa DIANCHI905]
Length = 265
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 104/282 (36%), Gaps = 72/282 (25%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W+ DF + +W+L++ D L Y P + NS L + + C
Sbjct: 1 MTIWQADFY-KSSSSPSLSTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQACQ-- 57
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 58 -VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI---------- 106
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLP------------------------------ 247
+D+P P LPD G +W F + P
Sbjct: 107 -----NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLKLGLA 161
Query: 248 ------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIYANYK 285
+ W+ N + + I T+ R G L+L G++ R+I+ Y+
Sbjct: 162 STLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLNERWIFLTYE 221
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 222 DEEVALA-ANVYQATKQESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425458730|ref|ZP_18838218.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9808]
gi|389824876|emb|CCI25820.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9808]
Length = 265
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 107/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+ R G L+L G+S R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLSERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVALA-ANIYQATKEESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425470238|ref|ZP_18849108.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9701]
gi|389884213|emb|CCI35473.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9701]
Length = 265
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 98/262 (37%), Gaps = 71/262 (27%)
Query: 118 IWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIIT 177
+W+L++ D L Y P + NS L + + C V PE I+ FR Q +
Sbjct: 20 VWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLQQACQ---VSSPEIIQVFRPQCANLFL 76
Query: 178 KACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLF 237
A + L IK ++ +L LE R + +D+P P LPD
Sbjct: 77 LAGQNLQIKIELTRHVNALKKQLELRQIPI---------------NIDSPPPQPLPDQFL 121
Query: 238 GDKWAFVQLP------------------------------------------------FS 249
G +W F + P +
Sbjct: 122 GQEWRFARFPAVDLVNFFCDRRIPILSLPEAFYPLKLGLASTLMIPGVVITGGKKSLAIA 181
Query: 250 AWM---NGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACG 305
W+ N + + I T+ R G L+L G++ R+I+ Y+ V + A A++A K+
Sbjct: 182 RWLGEINPVFIDHIPTERGRSGGLVLESGLNERWIFLTYEDEEVARA-ANAYQATKQESQ 240
Query: 306 GLHFLAIQEELDSEDCVGFWLL 327
GLHFL IQ + GFWLL
Sbjct: 241 GLHFLLIQPDDSGRTFTGFWLL 262
>gi|428775356|ref|YP_007167143.1| hypothetical protein PCC7418_0709 [Halothece sp. PCC 7418]
gi|428689635|gb|AFZ42929.1| protein of unknown function DUF1092 [Halothece sp. PCC 7418]
Length = 273
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/281 (22%), Positives = 105/281 (37%), Gaps = 69/281 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W++DF P + + +WELVVCD ++ T + T+ I +
Sbjct: 8 WQVDFYRLPQANASQESVWELVVCD---EVEKTVKTQSCFQAEATVDWLITHLRAIAQGS 64
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
PEKI+ FR + ++ A +L+I +E T + R +G +
Sbjct: 65 FPEKIKVFRPESLQLLQLAGDKLEIS-------------VEGTRHTPFLRQVLRDRGGEE 111
Query: 221 LLALDNPFPMELPDNLFGDKWAFVQL--------------PF------------------ 248
+ +++P P LP+ ++G++W F L PF
Sbjct: 112 RVKVESPPPQPLPEEIWGEQWQFASLNAEEIEYRLPERPIPFREIPPELSPFQLNLGSTT 171
Query: 249 ----------------SAWMNGLEVCSIE----TDTARGSLILSVGISTRYIYANYKKNP 288
+ W E +IE G L+L G+ R++ + ++P
Sbjct: 172 LIPGIIIYGGRQSWQLAQWFAETEPMAIEYIPTAVGESGGLVLEAGLRDRWVIITF-EDP 230
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
AE ++ K+ GLHFL IQ + GFWLL D
Sbjct: 231 EVAKAAEKFQQRKQNSNGLHFLLIQPDNSGMTDTGFWLLAD 271
>gi|22299400|ref|NP_682647.1| hypothetical protein tll1857 [Thermosynechococcus elongatus BP-1]
gi|22295583|dbj|BAC09409.1| tll1857 [Thermosynechococcus elongatus BP-1]
Length = 276
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/278 (22%), Positives = 117/278 (42%), Gaps = 54/278 (19%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
++ W++D RP+ G +WELV+CD YT + P +++S + +
Sbjct: 1 MSRWQVDLYRRPLRTPSGLDLWELVICDPEDHFYYTTFCPEPLVSSAWVATEF----NSC 56
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
G P+PE+++ FR Q ++ AC++L+I P++R +L +L +R + Y +
Sbjct: 57 GQPLPERVQVFRPQSLGLVEGACQQLNIPLEPTRRTAALKHYLCQRAQE-YPSLKTYTGE 115
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAF-----------VQLPFSAWMNGLEV----CSIET 262
+ LA++ P P+ LPD+++G+ W F +Q P +E+ +
Sbjct: 116 AYDPLAIEQPPPLPLPDDIWGESWQFAAIAPPDLQQLMQYPLRILALEMEMLPESLGLAA 175
Query: 263 DT---------ARGSL------------------------ILSVGISTRYIYANYKKNPV 289
DT R SL +L G+ R+++ ++ + +
Sbjct: 176 DTLIPGIILYGGRKSLKLARWFQEQVPYRLEFVPGQPCGVLLHSGLRDRWVFLTFQDSEI 235
Query: 290 TTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + + + GLHFL IQ WLL
Sbjct: 236 AQA-GDVFRDRLQKSQGLHFLLIQPTPRDTTYTALWLL 272
>gi|428204653|ref|YP_007083242.1| hypothetical protein Ple7327_4590 [Pleurocapsa sp. PCC 7327]
gi|427982085|gb|AFY79685.1| Protein of unknown function (DUF1092) [Pleurocapsa sp. PCC 7327]
Length = 271
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/281 (22%), Positives = 112/281 (39%), Gaps = 66/281 (23%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF + GK +WEL++CD + P + N L I +
Sbjct: 4 WQADFYKHDRKNKEGKHLWELLICDPQGHIIQEAKCPQSQANPDWL---ISQLQQANRGN 60
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P++I+ FR Q ++++ A ++L I+ ++R +L L +R + P
Sbjct: 61 LPDRIQVFRLQSLSLLSIAAEKLGIQVEATRRTGALKAELRKRI---------IDENYDP 111
Query: 221 LLALDNPFPMELPDNLFGDKWAF-------------------VQLP-------------- 247
+ L+ P P LP+NL+G+ W F + +P
Sbjct: 112 -VKLEKPPPQALPENLWGESWRFATFRAGDLVDYFSDRPLPILHMPESLLPINLGIASTI 170
Query: 248 ---------------FSAWMNG---LEVCSIETDTAR-GSLILSVGISTRYIYANYKKNP 288
+ W+ + I T+ + G L+L G+ R+I A ++
Sbjct: 171 SVPGVIIYGGRKSMYLAKWLQEAKPFSLSYIPTEIGKSGGLVLESGLVDRWILATFEDEE 230
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
+ + A+ +E K+A GLHFL +Q + GFWLL D
Sbjct: 231 IAQA-AQNYEQRKQASLGLHFLLVQPDDSGMTYTGFWLLKD 270
>gi|425451962|ref|ZP_18831781.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 7941]
gi|389766454|emb|CCI07907.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 7941]
Length = 265
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/286 (24%), Positives = 107/286 (37%), Gaps = 80/286 (27%)
Query: 98 ITEWELDF----CSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAI 153
+T W+ DF S P+ + +W+L++ D L Y P + NS L + +
Sbjct: 1 MTIWQADFYKSSSSSPL-----ETVWQLLIFDPLGHLIYENSCPQSQANSDWLTQQLEQA 55
Query: 154 CDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG 213
C V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 56 CQ---VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI------ 106
Query: 214 FQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLP-------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 107 ---------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLK 157
Query: 248 ----------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIY 281
+ W+ N + + I T+ R G L+L G++ R+I+
Sbjct: 158 LGLASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLNERWIF 217
Query: 282 ANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
Y+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 218 LTYEDEEVALA-ANIYQATKEESQGLHFLLIQPDDSGRTFTGFWLL 262
>gi|425444579|ref|ZP_18824626.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9443]
gi|389735645|emb|CCI00880.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9443]
Length = 267
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/283 (23%), Positives = 103/283 (36%), Gaps = 72/283 (25%)
Query: 98 ITEWELDFCSRPILDIRG-KKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
+T W+ DF +W+L++ D L Y P + NS L + + C
Sbjct: 1 MTIWQADFYKSSSSSSPSLGTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQACQ- 59
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
V PE I+ FR Q + A + L IK ++ +L LE R +
Sbjct: 60 --VSPPEIIQVFRPQCANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI--------- 108
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 109 ------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLKLGL 162
Query: 248 -------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIYANY 284
+ W+ N + + I T+ R G L+L G++ R+I+ Y
Sbjct: 163 ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLNERWIFLTY 222
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ V + A ++A K+ GLHFL IQ + GFWLL
Sbjct: 223 EDEEVALA-ANVYQATKQESQGLHFLLIQPDDSGRTFTGFWLL 264
>gi|425455145|ref|ZP_18834870.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9807]
gi|389804026|emb|CCI17121.1| Similar to tr|Q8YUM2|Q8YUM2 [Microcystis aeruginosa PCC 9807]
Length = 267
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 65/283 (22%), Positives = 102/283 (36%), Gaps = 72/283 (25%)
Query: 98 ITEWELDFCSRPILDIRG-KKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDD 156
+T W+ DF +W+L++ D L Y P + NS L + + C
Sbjct: 1 MTIWQADFYKSSSSSSPSLGTVWQLLISDSLGHLIYENSCPQSQANSDWLTQQLQQACQ- 59
Query: 157 LGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQK 216
V PE I+ FR + + A + L IK ++ +L LE R +
Sbjct: 60 --VSPPEIIQVFRPECANLFLLAGQNLQIKIELTRHVNALKKQLELRQIPI--------- 108
Query: 217 GSKPLLALDNPFPMELPDNLFGDKWAFVQLP----------------------------- 247
+D+P P LPD G +W F + P
Sbjct: 109 ------NIDSPPPQPLPDQFLGQEWRFARFPAVDLVNFFGDRRIPILSLPEAFSPLKLGL 162
Query: 248 -------------------FSAWM---NGLEVCSIETDTAR-GSLILSVGISTRYIYANY 284
+ W+ N + + I T+ R G L+L G++ R+I+ Y
Sbjct: 163 ASTLMIPGVVITGGKKSLAIARWLEEINPVFIDHIPTERGRSGGLVLESGLNERWIFLTY 222
Query: 285 KKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ V + A ++A K+ G HFL IQ + GFWLL
Sbjct: 223 EDEEVALA-ANIYQATKQESQGWHFLLIQPDDSGRTFTGFWLL 264
>gi|427711582|ref|YP_007060206.1| hypothetical protein Syn6312_0434 [Synechococcus sp. PCC 6312]
gi|427375711|gb|AFY59663.1| Protein of unknown function (DUF1092) [Synechococcus sp. PCC 6312]
Length = 281
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 68/285 (23%), Positives = 108/285 (37%), Gaps = 64/285 (22%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCD--GSLSLQYTKYFPNNVINSITLKEAIVAICD 155
+T W++DF +RP+ + +G+ +WEL++ D G + Q ++ + + + IC
Sbjct: 1 MTLWQVDFSARPLTNPQGQTLWELLIVDPLGQILHQAQCSQAQARLDWLIRQ---LEICI 57
Query: 156 DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLS---LLLWLEERYETV--YTR 210
PE+I+ FR Q ++ A EL++ P++ + LL E Y T YT
Sbjct: 58 QRTGSCPERIQLFRPQCLSLFEVAANELNLMVEPTRHTPALKRLLAAQAEHYPTAANYTG 117
Query: 211 HPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFV-----QLPFSAWMNGLEVCSIETDTA 265
P +PL P P+ LPD L+G+ W F +L + + S+ D
Sbjct: 118 EP-----YQPLHITSLP-PVPLPDYLWGEGWQFTGLMAEELETHLITQPIPILSLRMDLL 171
Query: 266 RGSLIL--SVGISTRYIYANYKK------------------------------------- 286
L L SV I IY +
Sbjct: 172 PSQLGLAASVVIPGIIIYGGRRSMALARWCQEQNPAEVEFIAGQPDGLIMSAGLWERWVL 231
Query: 287 ----NPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+P A+ + + A GLHFL IQ + G WLL
Sbjct: 232 VTFDDPQVKQSAQGFMTRRAAAQGLHFLMIQPDESGVTYTGLWLL 276
>gi|218248800|ref|YP_002374171.1| hypothetical protein PCC8801_4079 [Cyanothece sp. PCC 8801]
gi|218169278|gb|ACK68015.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8801]
Length = 273
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 106/279 (37%), Gaps = 66/279 (23%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ + W+L++CD + + NS L + I
Sbjct: 4 WQADFYKNPLDHEKPNPQWQLIICDDQGQIICQENCQQKEANSNWLISQLKPIFQQNN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++T A KEL +K ++R L L+++ K
Sbjct: 62 -PDFIQVFRPQSLNLLTLAVKELGVKIQATRRTPELKAILKQQAA----------KTGAN 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFV------------------------------------ 244
L LD P P LP NL+G+KW FV
Sbjct: 111 SLKLDQPPPQPLPQNLWGEKWRFVSFRGGDMIEFFSDRPIPIRDIPEALFPINLGIASTV 170
Query: 245 ------------QLPFSAWMNGLE-VC--SIETDTAR-GSLILSVGISTRYIYANYKKNP 288
+ + W+ ++ VC I T+ G LIL G+ R+I A + ++P
Sbjct: 171 NIPGIIIYGGKTSMYLARWLADIKPVCLNYIPTEMGHSGGLILEAGLVDRWILATF-EDP 229
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 230 EMAQAAQQYETQKQTSKGLHFLVVQPDDSEITYSGFWLL 268
>gi|257061859|ref|YP_003139747.1| hypothetical protein Cyan8802_4118 [Cyanothece sp. PCC 8802]
gi|256592025|gb|ACV02912.1| protein of unknown function DUF1092 [Cyanothece sp. PCC 8802]
Length = 273
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 69/279 (24%), Positives = 106/279 (37%), Gaps = 66/279 (23%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF P+ + W+L++CD + + NS L + I
Sbjct: 4 WQADFYKNPLDHEKPNPQWQLIICDDQGQIICQENCRQKEANSNWLISQLKPIFQQNN-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++T A KEL +K ++R L L+++ K
Sbjct: 62 -PDFIQVFRPQSLNLLTLAVKELGVKIQATRRTPQLKAILKQQAA----------KTGAN 110
Query: 221 LLALDNPFPMELPDNLFGDKWAFV------------------------------------ 244
L LD P P LP NL+G+KW FV
Sbjct: 111 SLKLDQPPPQPLPQNLWGEKWRFVSFRGGDMIEFFSDRPIPIRDIPEALFPINLGIASTV 170
Query: 245 ------------QLPFSAWMNGLE-VC--SIETDTAR-GSLILSVGISTRYIYANYKKNP 288
+ + W+ ++ VC I T+ G LIL G+ R+I A + ++P
Sbjct: 171 NIPGIIIYGGKTSMYLARWLADIKPVCLNYIPTEMGHSGGLILEAGLVDRWILATF-EDP 229
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
A+ +E K+ GLHFL +Q + GFWLL
Sbjct: 230 EMAQAAQQYETQKQTSKGLHFLVVQPDDSEITYSGFWLL 268
>gi|434399732|ref|YP_007133736.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
gi|428270829|gb|AFZ36770.1| protein of unknown function DUF1092 [Stanieria cyanosphaera PCC
7437]
Length = 269
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 66/279 (23%), Positives = 107/279 (38%), Gaps = 68/279 (24%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF L+ +W+L++CD Q T + N + I I G
Sbjct: 4 WQADFYKFS-LNQNNSWLWKLLICDLE---QNTVFEQNCQQEDASANWLIHQINQAAGDK 59
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
+P+ I+ FR Q + T A ++L IK + + R +L +Y T +P
Sbjct: 60 LPDVIQIFRPQALGLFTVAAQQLGIK-VEATRRTKILKQQLNKYITD-ANYP-------- 109
Query: 221 LLALDNPFPMELPDNLFGDKWAF------------------------------------V 244
LA+D P P LP++L+G++W F +
Sbjct: 110 -LAIDRPPPQPLPESLWGEQWNFATITADSLSNLISDRPIPILDTPTFLLPINLGIASTI 168
Query: 245 QLP------------FSAWMNG---LEVCSIETDTAR-GSLILSVGISTRYIYANYKKNP 288
LP + W+ + I+T+ + G LIL G+ R+I ++
Sbjct: 169 NLPGVVIYAGKQSLKLARWLAAEKPFSLNYIDTEAGKSGGLILESGLVDRWIMTTFEDEK 228
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
V + + +E K+ GLHFL IQ + G WLL
Sbjct: 229 VAQA-GKIYEQRKQLSKGLHFLLIQPDDSGMTYTGLWLL 266
>gi|443324165|ref|ZP_21053109.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
gi|442796049|gb|ELS05375.1| Protein of unknown function (DUF1092) [Xenococcus sp. PCC 7305]
Length = 271
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/282 (21%), Positives = 111/282 (39%), Gaps = 64/282 (22%)
Query: 98 ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDL 157
+T W+ DF P ++ + W+LV+C L + +N+ L + +
Sbjct: 1 MTIWQSDFYHYPKIEPQ----WQLVICSSDGKLIHETNCSAAQVNAKWLTKQLQQAAQG- 55
Query: 158 GVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKG 217
+P KI+ FR Q+ + A +EL I+ ++R +L L+ Y + ++
Sbjct: 56 --KLPTKIQVFRPQIVGLFEIATQELGIELETTRRTNALKEKLQ-NYSPINSKDKSKNNN 112
Query: 218 SKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMN------------------------ 253
S ++ P P +P++L+G+ W F+ + + +N
Sbjct: 113 S---FDVEKPPPQGVPEDLWGENWNFISMSANDLINFTGDRPIPIKFAPEFLNPIKLGIA 169
Query: 254 ------GLEVCS---------------------IETDTAR-GSLILSVGISTRYIYANYK 285
G+ V I T+ + G L+L G+ R+I+A ++
Sbjct: 170 SDALIPGIVVYGGRKSMVLARWLDQQKPVALNYIPTEIGKSGGLVLESGLVDRWIFATFE 229
Query: 286 KNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + A ++E K+ GLHFL IQ + G WLL
Sbjct: 230 SEAIAQA-ARSYEQRKQDSKGLHFLLIQPDDSGMTNTGIWLL 270
>gi|357488599|ref|XP_003614587.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355515922|gb|AES97545.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 133
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 31/43 (72%), Gaps = 1/43 (2%)
Query: 198 LWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
LWL+E YETVY HPGFQ GSKPL DN F M+L + + G+K
Sbjct: 92 LWLDEHYETVYI-HPGFQIGSKPLFPFDNLFDMKLQNIIHGEK 133
>gi|254413499|ref|ZP_05027269.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
gi|196179606|gb|EDX74600.1| conserved hypothetical protein [Coleofasciculus chthonoplastes PCC
7420]
Length = 153
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Query: 248 FSAWMNGLEVCSIETDTARG-SLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGG 306
+AWM+GLE+ ++ L+L G S + AN + T +EA+ +E+AK+
Sbjct: 70 LAAWMSGLELAFVKFQGGVTPRLLLETGASDSWALANLT-DAQTLAEAQGFESAKENAQS 128
Query: 307 LHFLAIQEELDSEDCVGFWLLLDL 330
+HFLA+Q SE GFWLL +L
Sbjct: 129 IHFLAVQSTPTSETFAGFWLLQEL 152
>gi|428780442|ref|YP_007172228.1| hypothetical protein Dacsa_2255 [Dactylococcopsis salina PCC 8305]
gi|428694721|gb|AFZ50871.1| Protein of unknown function (DUF1092) [Dactylococcopsis salina PCC
8305]
Length = 273
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 59/283 (20%), Positives = 106/283 (37%), Gaps = 67/283 (23%)
Query: 96 ESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICD 155
+S + W++DF P +G+ WELV+CD S T+ L E++ +
Sbjct: 3 QSQSSWQVDFYRLPQPTTKGESQWELVICDQSTKEVKTRSCLQKEATVDWLVESLQGLAT 62
Query: 156 DLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQ 215
+ +P K+R FR + ++ A + L + ++ L L +R
Sbjct: 63 E---ELPLKMRVFRPESLQLLQLAGERLGVIVEGTRHTYLLKQVLRDR------------ 107
Query: 216 KGSKPLLALDNPFPMELPDNLFGDKWAFVQL--------------PFSAWMNGLE--VCS 259
G + + +++P P LP+ ++G++W F +L PF L +
Sbjct: 108 -GGEERIKVESPPPQPLPEFIWGEQWQFARLNADEIEYRMPERPIPFCEMPTELTPFQLN 166
Query: 260 IETDTARGSLILSVGISTRYIYANYKKN--------PVTTSEA----------------- 294
+ + T +I+ G +R + + + P T E+
Sbjct: 167 LGSTTLVPGIIIYGGRQSRQLAQWFMEAQPMAVNYMPTTVGESGGLVLEAGLRDRWVIIT 226
Query: 295 ----------EAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
E +E K+ GLHFL +Q + GFWLL
Sbjct: 227 FEDTEVATAGEKYEQRKQESNGLHFLLLQPDDSGMTDTGFWLL 269
>gi|254432298|ref|ZP_05046001.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
gi|197626751|gb|EDY39310.1| conserved hypothetical protein [Cyanobium sp. PCC 7001]
Length = 97
Score = 48.9 bits (115), Expect = 0.003, Method: Composition-based stats.
Identities = 27/86 (31%), Positives = 43/86 (50%), Gaps = 3/86 (3%)
Query: 246 LPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACG 305
L + W+ GLE +E L+L G+ R++ A + P + +A+ A+ G
Sbjct: 13 LALAGWLAGLEPVRLEM--VDRQLVLEAGLEDRWLLATLPE-PEADAARQAFAEARLRAG 69
Query: 306 GLHFLAIQEELDSEDCVGFWLLLDLP 331
GL F+A+Q + GFW+L DLP
Sbjct: 70 GLQFIAVQARESDQRFEGFWMLRDLP 95
>gi|67924121|ref|ZP_00517567.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
gi|67854046|gb|EAM49359.1| Protein of unknown function DUF1092 [Crocosphaera watsonii WH 8501]
Length = 269
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 57/279 (20%), Positives = 102/279 (36%), Gaps = 70/279 (25%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF W L++CD + S+ + + NS L + ++
Sbjct: 4 WQADFYKHLSQTNENNTTWNLIICDQNSSIIHEASCQQSEANSNWLIAELESLVKQYS-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ ++ FR Q ++ K L I ++R L L++++ +
Sbjct: 62 -PDVVKVFRPQCLSLFQLLGKALGIYIEATRRTSQLKQILKDKFPSS------------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAF------------------------------------V 244
+ L+ P +P+NL+GDKW +
Sbjct: 108 -VKLEQSPPQAVPENLWGDKWRLATFKAGDFLDYFRDRPIPIKDLPEELNPIDLGIASDI 166
Query: 245 QLP------------FSAWMNGLEVCS---IETDTAR-GSLILSVGISTRYIYANYKKNP 288
++P + W+ + S I TD + G LIL G+ R++ ++ +
Sbjct: 167 KIPGLVIYGGRQSMYLARWLADNQPVSLNYIPTDVEKSGGLILESGLVDRWVLLTFEDSE 226
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ S A+ +E K+ GLHFL IQ + G WLL
Sbjct: 227 MAQS-AQKYEQQKEDSQGLHFLLIQPDDSGMTETGIWLL 264
>gi|416393935|ref|ZP_11686049.1| hypothetical protein CWATWH0003_2850 [Crocosphaera watsonii WH
0003]
gi|357263417|gb|EHJ12432.1| hypothetical protein CWATWH0003_2850 [Crocosphaera watsonii WH
0003]
Length = 269
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 57/279 (20%), Positives = 102/279 (36%), Gaps = 70/279 (25%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF W L++CD + S+ + + NS L + ++
Sbjct: 4 WQADFYKHLSQTNENNTTWNLIICDQNSSIIHEASCQQSEANSNWLIAELESLVKQYS-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ ++ FR Q ++ K L I ++R L L++++ +
Sbjct: 62 -PDVVKVFRPQCLSLFQLLGKALGIYIEATRRTPQLKQILKDKFPSS------------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAF------------------------------------V 244
+ L+ P +P+NL+GDKW +
Sbjct: 108 -VKLEQSPPQAVPENLWGDKWRLATFKAGDFLDYFSDRPIPIKDLPEELNPIDLGIASDI 166
Query: 245 QLP------------FSAWMNGLEVCS---IETDTAR-GSLILSVGISTRYIYANYKKNP 288
++P + W+ + S I TD + G LIL G+ R++ ++ +
Sbjct: 167 KIPGLVIYGGRQSMYLARWLADNQPVSLNYIPTDVEKSGGLILESGLVDRWVLLTFEDSE 226
Query: 289 VTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ S A+ +E K+ GLHFL IQ + G WLL
Sbjct: 227 MAQS-AQKYEQQKEDSQGLHFLLIQPDDSGMTETGIWLL 264
>gi|172039290|ref|YP_001805791.1| hypothetical protein cce_4377 [Cyanothece sp. ATCC 51142]
gi|171700744|gb|ACB53725.1| DUF1092-containing protein [Cyanothece sp. ATCC 51142]
Length = 275
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 59/288 (20%), Positives = 102/288 (35%), Gaps = 72/288 (25%)
Query: 93 TDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVA 152
T +S+ W+ DF + W L+VCD + + + S L +
Sbjct: 2 TVSKSMIIWQADFYKHLSQEHENNTKWNLIVCDQQGVIIHQASCQQSEATSNWLISELEP 61
Query: 153 ICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP 212
+ P+ I+ FR Q ++ K L+IK ++R L L+E+Y
Sbjct: 62 LVKQYS---PDIIKVFRPQCLSLFALVGKRLEIKIEGTRRTPQLKQILQEKYPNS----- 113
Query: 213 GFQKGSKPLLALDNPFPMELPDNLFGDKWAF----------------------------- 243
+ L+ P +P++L+GDKW F
Sbjct: 114 ---------VKLEQSPPQAIPESLWGDKWHFATFKAGDFFDYFSDRPIPMKELPEALNPI 164
Query: 244 -------VQLP------------FSAWMNGLEVCSI-----ETDTARGSLILSVGISTRY 279
V +P + W+ + S+ E + + G LIL G+ R+
Sbjct: 165 HLGIASDVNIPGVVIYGGRQSMYLARWLADNQPVSLNYIPTEVNKSGG-LILESGLVDRW 223
Query: 280 IYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ + +N + A+ +E K+ GLHF +Q + G WLL
Sbjct: 224 VLLTF-ENAEMSQSAQQYEKQKERTQGLHFFLLQPDDSGMTQTGIWLL 270
>gi|354552442|ref|ZP_08971750.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
gi|353555764|gb|EHC25152.1| protein of unknown function DUF1092 [Cyanothece sp. ATCC 51472]
Length = 269
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 57/280 (20%), Positives = 98/280 (35%), Gaps = 72/280 (25%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVP 160
W+ DF + W L+VCD + + + S L + +
Sbjct: 4 WQADFYKHLSQEHENNTKWNLIVCDQQGVIIHQASCQQSEATSNWLISELEPLVKQYS-- 61
Query: 161 IPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKP 220
P+ I+ FR Q ++ K L+IK ++R L L+E+Y
Sbjct: 62 -PDIIKVFRPQCLSLFALVGKRLEIKIEGTRRTPQLKQILQEKYPNS------------- 107
Query: 221 LLALDNPFPMELPDNLFGDKWAF------------------------------------V 244
+ L+ P +P++L+GDKW F V
Sbjct: 108 -VKLEQSPPQAIPESLWGDKWHFATFKAGDFFDYFSDRPIPMKELPEALNPIHLGIASDV 166
Query: 245 QLP------------FSAWMNGLEVCSI-----ETDTARGSLILSVGISTRYIYANYKKN 287
+P + W+ + S+ E + + G LIL G+ R++ + +N
Sbjct: 167 NIPGVVIYGGRQSMYLARWLADNQPVSLNYIPTEVNKSGG-LILESGLVDRWVLLTF-EN 224
Query: 288 PVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLL 327
+ A+ +E K+ GLHF +Q + G WLL
Sbjct: 225 AEMSQSAQQYEKQKERTQGLHFFLLQPDDSGMTQTGIWLL 264
>gi|126658961|ref|ZP_01730103.1| hypothetical protein CY0110_26702 [Cyanothece sp. CCY0110]
gi|126619759|gb|EAZ90486.1| hypothetical protein CY0110_26702 [Cyanothece sp. CCY0110]
Length = 270
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 104/266 (39%), Gaps = 43/266 (16%)
Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS-LQYTKYFPNNVINSITLKEAIVAICDDLGV 159
W+ DF + + W L++C+ + Y + NS L + +
Sbjct: 4 WQADFYKHLSQENKQNTTWNLIICNEQKGEIVYQSSCQQSEANSSWLIGQLEPFIKEYS- 62
Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY--------------- 204
P+ I+ FR Q ++ ++L +K ++R L L+E+Y
Sbjct: 63 --PDIIKVFRPQCLSLFQLVEEKLGVKIEGTRRTPQLKQILKEKYPNSIKLEQAPPQPIP 120
Query: 205 ETVYT---RHPGFQKGSKPLLALDNPFPM-----EL-PDNL-------------FGDKWA 242
E+++ R F+ G D P P+ EL P NL +G + +
Sbjct: 121 ESLWGDKWRFAAFKAGDFFDYFSDRPIPIKDLSEELNPINLGIASDINIPGVVIYGGRQS 180
Query: 243 FVQLPFSAWMNGLEVCSIETDTAR-GSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAK 301
+ A + + I TD + G LIL G+ R+I ++ + + S A+ +E K
Sbjct: 181 MYLARWFAENQPVSLNYIPTDINQSGGLILESGLVDRWILLTFEDSEMAES-AQQYEQQK 239
Query: 302 KACGGLHFLAIQEELDSEDCVGFWLL 327
+ GLHFL IQ + G WLL
Sbjct: 240 EESQGLHFLLIQPDDSGMTQTGIWLL 265
>gi|282901430|ref|ZP_06309355.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
gi|281193709|gb|EFA68681.1| protein of unknown function DUF1092 [Cylindrospermopsis raciborskii
CS-505]
Length = 155
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 58/150 (38%), Gaps = 48/150 (32%)
Query: 229 PMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIE----TDT---ARGSLILSVGISTRYIY 281
PM LP++L+G++W FV + + SI TD+ A+ L ++V I IY
Sbjct: 6 PMPLPESLWGEQWCFVSVSAGDILEEFGSRSIPFKKITDSFVPAKLGLAVTVSIPGVIIY 65
Query: 282 ANYK----------KNPVT-------------------------------TSEAEAWEAA 300
+ NPV+ T+ + ++
Sbjct: 66 GGKQSLRLARWLNENNPVSLNYIPGAPDGLILQSSSTNPWIVATFTDIDVTAAGKVYQQR 125
Query: 301 KKACGGLHFLAIQEELDSEDCVGFWLLLDL 330
KK GG+HFL +Q + GFWLL D+
Sbjct: 126 KKVSGGVHFLLVQPDHSGITFTGFWLLKDI 155
>gi|206580337|ref|YP_002240790.1| magnesium-transporting ATPase MgtA [Klebsiella pneumoniae 342]
gi|206569395|gb|ACI11171.1| magnesium-translocating P-type ATPase [Klebsiella pneumoniae 342]
Length = 902
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 180 CKELDIKPIP---SKRCLSLLLWLEERYETVYTRHP-GFQKG---SKPLLALDNPFPMEL 232
+ L PIP KRCL + ++ + HP G +G +K L DN P +
Sbjct: 30 ARNLTSMPIPDSLGKRCLDVAAMDDQEIWRAFDSHPEGLNEGEVAAKILKHGDNQIPAQK 89
Query: 233 PDNLFGDKWAFVQLPFSAWMNGLEVCSIETDT--ARGSLILSVGISTRYIYANYKKNPVT 290
P + W + PF+ + L + S T+ A G + L VGIST N+ + +
Sbjct: 90 PSPWWVHLWTCYRNPFNLLLTVLGIVSYSTEDLFAAGVIALMVGIST---LLNFIQEARS 146
Query: 291 TSEAEAWEA 299
T A+A +A
Sbjct: 147 TKAADALKA 155
>gi|290512186|ref|ZP_06551553.1| magnesium-translocating P-type ATPase [Klebsiella sp. 1_1_55]
gi|289775181|gb|EFD83182.1| magnesium-translocating P-type ATPase [Klebsiella sp. 1_1_55]
Length = 902
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 57/129 (44%), Gaps = 12/129 (9%)
Query: 180 CKELDIKPIP---SKRCLSLLLWLEERYETVYTRHP-GFQKG---SKPLLALDNPFPMEL 232
+ L PIP KRCL + ++ + HP G +G +K L DN P +
Sbjct: 30 ARNLTSVPIPDSLGKRCLDVAAMDDQEIWRAFDSHPEGLNEGEVAAKILKHGDNQIPAQK 89
Query: 233 PDNLFGDKWAFVQLPFSAWMNGLEVCSIETDT--ARGSLILSVGISTRYIYANYKKNPVT 290
P + W + PF+ + L + S T+ A G + L VGIST N+ + +
Sbjct: 90 PSPWWVHLWTCYRNPFNLLLTVLGIVSYSTEDLFAAGVIALMVGIST---LLNFIQEARS 146
Query: 291 TSEAEAWEA 299
T A+A +A
Sbjct: 147 TKAADALKA 155
>gi|336248428|ref|YP_004592138.1| magnesium-transporting ATPase MgtA [Enterobacter aerogenes KCTC
2190]
gi|334734484|gb|AEG96859.1| magnesium-transporting ATPase MgtA [Enterobacter aerogenes KCTC
2190]
Length = 902
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 55/129 (42%), Gaps = 12/129 (9%)
Query: 180 CKELDIKPIP---SKRCLSLLLWLEERYETVYTRHP----GFQKGSKPLLALDNPFPMEL 232
+ L PIP KRCL + E+ + HP + +K L DN P +
Sbjct: 30 ARNLASAPIPDSLGKRCLDVAAMDEQEIWRAFDSHPEGLNDVEVAAKIALHGDNQIPAQK 89
Query: 233 PDNLFGDKWAFVQLPFSAWMNGLEVCSIETDT--ARGSLILSVGISTRYIYANYKKNPVT 290
P + W + PF+ + L + S T+ A G + L VGIST N+ + +
Sbjct: 90 PSPWWVHLWTCYRNPFNLLLTVLGIVSYSTEDLFAAGVIALMVGIST---LLNFIQEARS 146
Query: 291 TSEAEAWEA 299
T A+A +A
Sbjct: 147 TKAADALKA 155
>gi|444353494|ref|YP_007389638.1| Magnesium transporting ATPase, P-type 1 (EC 3.6.3.2) [Enterobacter
aerogenes EA1509E]
gi|443904324|emb|CCG32098.1| Magnesium transporting ATPase, P-type 1 (EC 3.6.3.2) [Enterobacter
aerogenes EA1509E]
Length = 902
Score = 37.7 bits (86), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 37/129 (28%), Positives = 56/129 (43%), Gaps = 12/129 (9%)
Query: 180 CKELDIKPIP---SKRCLSLLLWLEERYETVYTRHP-GF---QKGSKPLLALDNPFPMEL 232
+ L PIP KRCL + E+ + HP G + +K L DN P +
Sbjct: 30 ARNLASAPIPDSLGKRCLDVAAMDEQEIWRAFDSHPEGLNDSEVAAKIALHGDNQIPAQK 89
Query: 233 PDNLFGDKWAFVQLPFSAWMNGLEVCSIETDT--ARGSLILSVGISTRYIYANYKKNPVT 290
P + W + PF+ + L + S T+ A G + L VGIST N+ + +
Sbjct: 90 PSPWWIHLWTCYRNPFNLLLTVLGIVSYSTEDLFAAGVIALMVGIST---LLNFIQEARS 146
Query: 291 TSEAEAWEA 299
T A+A +A
Sbjct: 147 TKAADALKA 155
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.134 0.408
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,774,839,255
Number of Sequences: 23463169
Number of extensions: 248755420
Number of successful extensions: 714316
Number of sequences better than 100.0: 261
Number of HSP's better than 100.0 without gapping: 196
Number of HSP's successfully gapped in prelim test: 65
Number of HSP's that attempted gapping in prelim test: 713400
Number of HSP's gapped (non-prelim): 488
length of query: 335
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 192
effective length of database: 9,003,962,200
effective search space: 1728760742400
effective search space used: 1728760742400
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)