BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 021557
(311 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359484753|ref|XP_002264815.2| PREDICTED: uncharacterized protein LOC100256872 [Vitis vinifera]
Length = 597
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 257/304 (84%), Positives = 278/304 (91%)
Query: 8 KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
K + + SN SSSQ+NW+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLK
Sbjct: 294 KSIRASERLSNLASSSQENWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLK 353
Query: 68 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAG 127
VEINIPELNPEMDVYRIGTLMELVRV+ALSFADDGK VKVCVQGSMGEGALAGMPLQLAG
Sbjct: 354 VEINIPELNPEMDVYRIGTLMELVRVLALSFADDGKHVKVCVQGSMGEGALAGMPLQLAG 413
Query: 128 TRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDA 187
TRKILEFMDWGDYGAKGTF+ IGSIG+ EV+EQDD+FILVAPQNAVGNCIIDD+KAMTDA
Sbjct: 414 TRKILEFMDWGDYGAKGTFINIGSIGSKEVEEQDDLFILVAPQNAVGNCIIDDLKAMTDA 473
Query: 188 AGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAL 247
AG RPVIL+NPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGA+
Sbjct: 474 AGNRPVILVNPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAI 533
Query: 248 RMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFL 307
RM+YP YELYKRVD G EKY+ILST + +P+ D+INDAFLGKPR+ KKA+GFWGFL
Sbjct: 534 RMSYPYRYELYKRVDESFGKEKYIILSTSSARPNIDEINDAFLGKPRSEAKKATGFWGFL 593
Query: 308 SSVF 311
S +
Sbjct: 594 SGIL 597
>gi|255565097|ref|XP_002523541.1| adenylate kinase, putative [Ricinus communis]
gi|223537248|gb|EEF38880.1| adenylate kinase, putative [Ricinus communis]
Length = 597
Score = 547 bits (1409), Expect = e-153, Method: Compositional matrix adjust.
Identities = 256/302 (84%), Positives = 277/302 (91%)
Query: 10 VHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVE 69
+ + +S NQ S SQD W+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGR RLKVE
Sbjct: 296 IPKSESSYNQASVSQDKWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRMRLKVE 355
Query: 70 INIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTR 129
I IPELNPEMDVYRIGTLMELVRV+ALSFADDGKRVKVCVQGSMG+GALAGMPLQL+GTR
Sbjct: 356 IAIPELNPEMDVYRIGTLMELVRVLALSFADDGKRVKVCVQGSMGQGALAGMPLQLSGTR 415
Query: 130 KILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG 189
KILEFMDWGDYGA GTF+ IGSIGA EVDE DDMFILVAPQNAVGNCIIDD++AMTDAAG
Sbjct: 416 KILEFMDWGDYGALGTFIRIGSIGAKEVDEDDDMFILVAPQNAVGNCIIDDLRAMTDAAG 475
Query: 190 TRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM 249
+RPVIL+NPRLKDLPGSSGIMQTMGR +RLEYAASFENCY FRLLYY GTQYPIMGALRM
Sbjct: 476 SRPVILVNPRLKDLPGSSGIMQTMGRAERLEYAASFENCYSFRLLYYGGTQYPIMGALRM 535
Query: 250 TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSS 309
+YP YELYKRVD+P+ GEKY ILSTF+EKP+ DDINDAFLG+ RN++KKASGFWGFLS
Sbjct: 536 SYPYHYELYKRVDLPARGEKYEILSTFSEKPTGDDINDAFLGRSRNQDKKASGFWGFLSG 595
Query: 310 VF 311
V
Sbjct: 596 VL 597
>gi|296084582|emb|CBI25603.3| unnamed protein product [Vitis vinifera]
Length = 652
Score = 543 bits (1399), Expect = e-152, Method: Compositional matrix adjust.
Identities = 254/293 (86%), Positives = 273/293 (93%)
Query: 19 QVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 78
Q S SQ+NW+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLKVEINIPELNPE
Sbjct: 360 QASLSQENWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 419
Query: 79 MDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 138
MDVYRIGTLMELVRV+ALSFADDGK VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG
Sbjct: 420 MDVYRIGTLMELVRVLALSFADDGKHVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 479
Query: 139 DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINP 198
DYGAKGTF+ IGSIG+ EV+EQDD+FILVAPQNAVGNCIIDD+KAMTDAAG RPVIL+NP
Sbjct: 480 DYGAKGTFINIGSIGSKEVEEQDDLFILVAPQNAVGNCIIDDLKAMTDAAGNRPVILVNP 539
Query: 199 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELY 258
RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGA+RM+YP YELY
Sbjct: 540 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAIRMSYPYRYELY 599
Query: 259 KRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
KRVD G EKY+ILST + +P+ D+INDAFLGKPR+ KKA+GFWGFLS +
Sbjct: 600 KRVDESFGKEKYIILSTSSARPNIDEINDAFLGKPRSEAKKATGFWGFLSGIL 652
>gi|449493659|ref|XP_004159398.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101226743 [Cucumis sativus]
Length = 609
Score = 523 bits (1346), Expect = e-146, Method: Compositional matrix adjust.
Identities = 245/300 (81%), Positives = 269/300 (89%)
Query: 12 ERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEIN 71
E K NQ S S+D+W+GIPTRLNNIPHSREIR YFYDDVLQAT+RAV GRTRLKVE N
Sbjct: 310 ENKGTVNQPSESRDSWRGIPTRLNNIPHSREIRKYFYDDVLQATKRAVQAGRTRLKVENN 369
Query: 72 IPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 131
IPELNPEMDVYRIGTLMELVR IALSFADDG+RVKVC+QGSMGEGAL+GMPLQLAGTR+I
Sbjct: 370 IPELNPEMDVYRIGTLMELVRTIALSFADDGRRVKVCIQGSMGEGALSGMPLQLAGTRRI 429
Query: 132 LEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
LE+MDWG+YGA GTFV+IGSIGA EVD++DDMFILVAPQNAVGNCIIDDMKAMTDAAG R
Sbjct: 430 LEYMDWGEYGALGTFVKIGSIGAKEVDDEDDMFILVAPQNAVGNCIIDDMKAMTDAAGDR 489
Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
PVIL+NPRLKDLPGSSGIMQTMGR+KRLEYAASFE CYFFRLLYYAGTQYPIMGALRM+Y
Sbjct: 490 PVILVNPRLKDLPGSSGIMQTMGREKRLEYAASFEICYFFRLLYYAGTQYPIMGALRMSY 549
Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
P GYELY+RVD PSG EKY+ LS + ++PS+DDINDAF G R K +SG WGFLS +
Sbjct: 550 PYGYELYRRVDEPSGKEKYIELSKYPKRPSTDDINDAFQGNKRKEAKSSSGIWGFLSGIL 609
>gi|356532999|ref|XP_003535056.1| PREDICTED: uncharacterized protein LOC100796331 [Glycine max]
Length = 1016
Score = 519 bits (1337), Expect = e-145, Method: Compositional matrix adjust.
Identities = 241/291 (82%), Positives = 265/291 (91%)
Query: 21 SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 80
S+ DNW+GIPTRLNNIPHSREIR YFYDDVLQATQRA+NDG+TR+KV+INIPELNPEMD
Sbjct: 726 SAEADNWRGIPTRLNNIPHSREIRKYFYDDVLQATQRAINDGKTRVKVDINIPELNPEMD 785
Query: 81 VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 140
VYRIGTLMELVR +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY
Sbjct: 786 VYRIGTLMELVRTLALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 845
Query: 141 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
GAKGTF+ IGSIGA EV+E+DDMFILVAPQNAVGNCIIDD++AMT+AA RPVILIN RL
Sbjct: 846 GAKGTFINIGSIGAAEVEEEDDMFILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARL 905
Query: 201 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 260
KDLPGSSGIMQTMGRD+RL+YAASFE+CYFFRLLYYAGTQYPIMGA+RMTYP YELYKR
Sbjct: 906 KDLPGSSGIMQTMGRDQRLKYAASFESCYFFRLLYYAGTQYPIMGAIRMTYPYQYELYKR 965
Query: 261 VDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
VD G EKY ILSTF E+P++D+INDAF GKP +K SG WGFL+ +
Sbjct: 966 VDESPGKEKYAILSTFPERPTTDEINDAFEGKPSKDPRKTSGIWGFLNGIL 1016
>gi|449443265|ref|XP_004139400.1| PREDICTED: uncharacterized protein LOC101203230 [Cucumis sativus]
Length = 577
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 241/308 (78%), Positives = 272/308 (88%)
Query: 4 TTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGR 63
++ + + + K+ + +S D+W+GIPTRLNNIPHSREIR YFYDDVLQAT+RAV GR
Sbjct: 270 SSLLSQIQKEKAMKSGNASFSDSWRGIPTRLNNIPHSREIRKYFYDDVLQATKRAVQAGR 329
Query: 64 TRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPL 123
TRLKVE NIPELNPEMDVYRIGTLMELVR IALSFADDG+RVKVC+QGSMGEGAL+GMPL
Sbjct: 330 TRLKVENNIPELNPEMDVYRIGTLMELVRTIALSFADDGRRVKVCIQGSMGEGALSGMPL 389
Query: 124 QLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKA 183
QLAGTR+ILE+MDWG+YGA GTFV+IGSIGA EVD++DDMFILVAPQNAVGNCIIDDMKA
Sbjct: 390 QLAGTRRILEYMDWGEYGALGTFVKIGSIGAKEVDDEDDMFILVAPQNAVGNCIIDDMKA 449
Query: 184 MTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPI 243
MTDAAG RPVIL+NPRLKDLPGSSGIMQTMGR+KRLEYAASFE CYFFRLLYYAGTQYPI
Sbjct: 450 MTDAAGDRPVILVNPRLKDLPGSSGIMQTMGREKRLEYAASFEICYFFRLLYYAGTQYPI 509
Query: 244 MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGF 303
MGALRM+YP GYELY+RVD PSG EKY+ LS + ++PS+DDINDAF G R K +SG
Sbjct: 510 MGALRMSYPYGYELYRRVDEPSGKEKYIELSKYPKRPSTDDINDAFQGNKRKEAKSSSGI 569
Query: 304 WGFLSSVF 311
WGFLS +
Sbjct: 570 WGFLSGIL 577
>gi|356556590|ref|XP_003546607.1| PREDICTED: uncharacterized protein LOC100818830 [Glycine max]
Length = 591
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 240/288 (83%), Positives = 264/288 (91%)
Query: 24 QDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYR 83
QD W+GIPTRLNNIPHSREIR YFYDDVLQATQRA++DG+TR+KV+INIPELNPEMDVYR
Sbjct: 304 QDKWRGIPTRLNNIPHSREIRKYFYDDVLQATQRAISDGKTRVKVDINIPELNPEMDVYR 363
Query: 84 IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 143
IGTLMELVR +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK
Sbjct: 364 IGTLMELVRALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 423
Query: 144 GTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDL 203
GTF+ IGSIGA EV+EQDDMFILVAPQNAVGNCIIDD++AMT+AA RPVILIN RLKDL
Sbjct: 424 GTFINIGSIGAAEVEEQDDMFILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARLKDL 483
Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
PGSSGIMQT+GRD+RL+YAASFE+CYFFRLLYYAGTQYPIMGA+RMTYP Y+LYKRVD
Sbjct: 484 PGSSGIMQTIGRDQRLKYAASFESCYFFRLLYYAGTQYPIMGAIRMTYPYQYDLYKRVDE 543
Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
G EKYVILSTF E+P++D+INDAF GKP +K SG WGFLS +
Sbjct: 544 SPGKEKYVILSTFPERPTTDEINDAFQGKPSKDPRKTSGIWGFLSGIL 591
>gi|224144020|ref|XP_002325159.1| predicted protein [Populus trichocarpa]
gi|222866593|gb|EEF03724.1| predicted protein [Populus trichocarpa]
Length = 520
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 241/273 (88%), Positives = 261/273 (95%)
Query: 22 SSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDV 81
S QD W+GIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRL+V+INIPELNPEMDV
Sbjct: 248 SLQDKWRGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLRVDINIPELNPEMDV 307
Query: 82 YRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYG 141
YRIGTLMELVRVIALSF+DDGK VKVCVQGSMG+GALAGMPLQLAGTR ILEFMDWGDYG
Sbjct: 308 YRIGTLMELVRVIALSFSDDGKHVKVCVQGSMGKGALAGMPLQLAGTRNILEFMDWGDYG 367
Query: 142 AKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLK 201
AKGTF++IGSIG EV E+DDMFILVAPQNAVGNCII+D+KAMT AAG+RPVILINP+LK
Sbjct: 368 AKGTFIKIGSIGEQEVSEEDDMFILVAPQNAVGNCIIEDLKAMTVAAGSRPVILINPKLK 427
Query: 202 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 261
DLPGSSGIMQTMGRDKRLEYAASFE+CYFFRLLYYAGTQYPIMGALRM+YP YELYKRV
Sbjct: 428 DLPGSSGIMQTMGRDKRLEYAASFESCYFFRLLYYAGTQYPIMGALRMSYPYRYELYKRV 487
Query: 262 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPR 294
D PSG EKY+ILSTF+EKPS+DD+NDAF+G+PR
Sbjct: 488 DQPSGTEKYMILSTFSEKPSTDDVNDAFMGRPR 520
>gi|22327339|ref|NP_198367.2| adenylate kinase family protein [Arabidopsis thaliana]
gi|17979434|gb|AAL49859.1| putative adenylate kinase [Arabidopsis thaliana]
gi|22136756|gb|AAM91697.1| putative adenylate kinase [Arabidopsis thaliana]
gi|110740593|dbj|BAE98401.1| adenylate kinase -like protein [Arabidopsis thaliana]
gi|332006553|gb|AED93936.1| adenylate kinase family protein [Arabidopsis thaliana]
Length = 588
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 227/300 (75%), Positives = 265/300 (88%)
Query: 12 ERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEIN 71
+R ++++ S QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+IN
Sbjct: 288 KRMIKTDKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDIN 347
Query: 72 IPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 131
IPELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI
Sbjct: 348 IPELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 407
Query: 132 LEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
LE+MDWGD GTFV++G+IG EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG R
Sbjct: 408 LEYMDWGDDETLGTFVKLGAIGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKR 467
Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
PV+LINPRLKDLP SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+Y
Sbjct: 468 PVVLINPRLKDLPASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSY 527
Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
P YELYKRV+ +G EKYV+L+T+ E+P+ + I+DAF GK R++ KKASG WGFLSSVF
Sbjct: 528 PYRYELYKRVNEENGKEKYVLLATYAERPTPEQIDDAFSGKSRDQSKKASGIWGFLSSVF 587
>gi|297801048|ref|XP_002868408.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314244|gb|EFH44667.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
Length = 595
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 227/293 (77%), Positives = 261/293 (89%)
Query: 19 QVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 78
+ S QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+INIPELNPE
Sbjct: 302 KASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPE 361
Query: 79 MDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 138
MDVYRIGTLMELVRV+ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWG
Sbjct: 362 MDVYRIGTLMELVRVLALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWG 421
Query: 139 DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINP 198
D GTFV++G+IG EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINP
Sbjct: 422 DDETLGTFVKLGAIGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINP 481
Query: 199 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELY 258
RLKDLP SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP YELY
Sbjct: 482 RLKDLPASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELY 541
Query: 259 KRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
KRV+ +G EKYV+L+T++E+P+ + I+DAF GK R++ KK SG WGFLSSVF
Sbjct: 542 KRVNEENGKEKYVLLATYSERPTPEQIDDAFSGKSRDQSKKPSGIWGFLSSVF 594
>gi|357145496|ref|XP_003573662.1| PREDICTED: uncharacterized protein LOC100845101 [Brachypodium
distachyon]
Length = 596
Score = 486 bits (1250), Expect = e-135, Method: Compositional matrix adjust.
Identities = 219/292 (75%), Positives = 259/292 (88%)
Query: 20 VSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEM 79
++S++ W+GIPTRLNNIPHSREIR YFYDDVLQAT+RA+ D +TRL+++INIPELNPEM
Sbjct: 305 TTTSKNEWRGIPTRLNNIPHSREIRKYFYDDVLQATKRAIEDKKTRLQIDINIPELNPEM 364
Query: 80 DVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
DVYRIGTLMELVR I+L+FADDGKRVKV VQGSMG+GA AG+PLQLAGTRKILEFMDWGD
Sbjct: 365 DVYRIGTLMELVREISLAFADDGKRVKVSVQGSMGQGAFAGIPLQLAGTRKILEFMDWGD 424
Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
YGAKG F+ IG++GA+EVD++DDMFIL+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPR
Sbjct: 425 YGAKGAFINIGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPR 484
Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYK 259
LKD+P SSG+MQTMGRD RL+YAASFE CY FRLL+YAGT YPIMGALRM YPN YE+Y+
Sbjct: 485 LKDMPASSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGTFYPIMGALRMGYPNKYEIYR 544
Query: 260 RVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
RVD P+G EKYV+L+ +KP++DDI +AF G + +E SGFWGFLS +
Sbjct: 545 RVDEPNGKEKYVLLAELMDKPTADDITNAFKGPKKEKESAPSGFWGFLSGIL 596
>gi|297608344|ref|NP_001061454.2| Os08g0288200 [Oryza sativa Japonica Group]
gi|255678327|dbj|BAF23368.2| Os08g0288200, partial [Oryza sativa Japonica Group]
Length = 493
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 218/295 (73%), Positives = 258/295 (87%)
Query: 17 SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
S ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 199 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 258
Query: 77 PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 259 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 318
Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
WG+YGAKGTF+ G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 319 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 378
Query: 197 NPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYE 256
NPRLKD+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE
Sbjct: 379 NPRLKDMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYE 438
Query: 257 LYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
+Y+RVD P+G E+YV+L F EKP+ D+I +AF + EK ASGFWGFLS +
Sbjct: 439 IYRRVDEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 493
>gi|79329010|ref|NP_001031966.1| adenylate kinase family protein [Arabidopsis thaliana]
gi|332006554|gb|AED93937.1| adenylate kinase family protein [Arabidopsis thaliana]
Length = 580
Score = 478 bits (1230), Expect = e-132, Method: Compositional matrix adjust.
Identities = 220/293 (75%), Positives = 258/293 (88%)
Query: 12 ERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEIN 71
+R ++++ S QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+IN
Sbjct: 288 KRMIKTDKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDIN 347
Query: 72 IPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 131
IPELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI
Sbjct: 348 IPELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 407
Query: 132 LEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
LE+MDWGD GTFV++G+IG EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG R
Sbjct: 408 LEYMDWGDDETLGTFVKLGAIGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKR 467
Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
PV+LINPRLKDLP SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+Y
Sbjct: 468 PVVLINPRLKDLPASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSY 527
Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 304
P YELYKRV+ +G EKYV+L+T+ E+P+ + I+DAF GK R++ KKASG W
Sbjct: 528 PYRYELYKRVNEENGKEKYVLLATYAERPTPEQIDDAFSGKSRDQSKKASGIW 580
>gi|326499868|dbj|BAJ90769.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326502206|dbj|BAJ95166.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 596
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 214/292 (73%), Positives = 257/292 (88%), Gaps = 1/292 (0%)
Query: 20 VSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEM 79
++S+ W+GIPTRLNNIPHSREIR YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+
Sbjct: 306 TTASKKEWRGIPTRLNNIPHSREIRKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPEL 365
Query: 80 DVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
DVYRIGTLMELVR ++L+FADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGD
Sbjct: 366 DVYRIGTLMELVRDLSLTFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGD 425
Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
YGA G F+ IG++GA+EVD++DDMF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPR
Sbjct: 426 YGAMGAFINIGAVGASEVDKEDDMFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPR 485
Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYK 259
LKD+P SSG+MQTMGRD RL+YAASFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+
Sbjct: 486 LKDMPASSGVMQTMGRDLRLQYAASFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYR 545
Query: 260 RVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
RVD P+ GEKY +L+ FT P++DDI +AF+G + +E SGFWGFLS +
Sbjct: 546 RVDEPN-GEKYDLLAEFTANPTADDITNAFVGTKKKKENAPSGFWGFLSGIL 596
>gi|38175634|dbj|BAD01341.1| putative Adenylate kinase, chloroplast [Oryza sativa Japonica
Group]
Length = 467
Score = 477 bits (1227), Expect = e-132, Method: Compositional matrix adjust.
Identities = 218/295 (73%), Positives = 258/295 (87%)
Query: 17 SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
S ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 173 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 232
Query: 77 PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 233 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 292
Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
WG+YGAKGTF+ G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 293 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 352
Query: 197 NPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYE 256
NPRLKD+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE
Sbjct: 353 NPRLKDMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYE 412
Query: 257 LYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
+Y+RVD P+G E+YV+L F EKP+ D+I +AF + EK ASGFWGFLS +
Sbjct: 413 IYRRVDEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 467
>gi|293333213|ref|NP_001168735.1| uncharacterized protein LOC100382527 [Zea mays]
gi|223972711|gb|ACN30543.1| unknown [Zea mays]
Length = 599
Score = 477 bits (1227), Expect = e-132, Method: Compositional matrix adjust.
Identities = 216/291 (74%), Positives = 258/291 (88%), Gaps = 1/291 (0%)
Query: 21 SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 80
+ S++ W+GIPTRLNNIPHSREIR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMD
Sbjct: 310 TGSKNEWRGIPTRLNNIPHSREIREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMD 369
Query: 81 VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 140
VYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDY
Sbjct: 370 VYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDY 429
Query: 141 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
GAKGTF+ IG++GA +VD++DDMF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRL
Sbjct: 430 GAKGTFINIGAVGARDVDKEDDMFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRL 489
Query: 201 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 260
KD+P SSG+MQTMGRD RL+YAASFE CY FRLLYYAG+ YPIMGALRM YPN YE+Y+R
Sbjct: 490 KDMPASSGVMQTMGRDIRLKYAASFETCYSFRLLYYAGSFYPIMGALRMAYPNKYEIYRR 549
Query: 261 VDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
VD +G E Y++++ FT KP+ D+I AF G+ R EK +SGFWGFLS +F
Sbjct: 550 VDTLNGKENYILIAEFTNKPTPDEITSAFKGQKRE-EKASSGFWGFLSGLF 599
>gi|326509295|dbj|BAJ91564.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 512
Score = 476 bits (1224), Expect = e-132, Method: Compositional matrix adjust.
Identities = 214/292 (73%), Positives = 257/292 (88%), Gaps = 1/292 (0%)
Query: 20 VSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEM 79
++S+ W+GIPTRLNNIPHSREIR YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+
Sbjct: 222 TTASKKEWRGIPTRLNNIPHSREIRKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPEL 281
Query: 80 DVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
DVYRIGTLMELVR ++L+FADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGD
Sbjct: 282 DVYRIGTLMELVRDLSLTFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGD 341
Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
YGA G F+ IG++GA+EVD++DDMF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPR
Sbjct: 342 YGAMGAFINIGAVGASEVDKEDDMFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPR 401
Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYK 259
LKD+P SSG+MQTMGRD RL+YAASFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+
Sbjct: 402 LKDMPASSGVMQTMGRDLRLQYAASFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYR 461
Query: 260 RVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
RVD P+ GEKY +L+ FT P++DDI +AF+G + +E SGFWGFLS +
Sbjct: 462 RVDEPN-GEKYDLLAEFTANPTADDITNAFVGTKKKKENAPSGFWGFLSGIL 512
>gi|10257489|dbj|BAB11193.1| unnamed protein product [Arabidopsis thaliana]
Length = 337
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 213/330 (64%), Positives = 249/330 (75%), Gaps = 48/330 (14%)
Query: 13 RKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINI 72
++ +++ S QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+INI
Sbjct: 8 KQVLADKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDINI 67
Query: 73 PELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 132
PELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL
Sbjct: 68 PELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 127
Query: 133 EFMDWGDYGAKGTFVEIGSI---------------------------------------- 152
E+MDWGD GTFV++G+I
Sbjct: 128 EYMDWGDDETLGTFVKLGAIVTRESLNLQKLHHQQNIVVSIIKLVMHNKVGYIELHIILL 187
Query: 153 --------GANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 204
G EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP
Sbjct: 188 FDYVFAMTGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLP 247
Query: 205 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 264
SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+
Sbjct: 248 ASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEE 307
Query: 265 SGGEKYVILSTFTEKPSSDDINDAFLGKPR 294
+G EKYV+L+T+ E+P+ + I+DAF GK R
Sbjct: 308 NGKEKYVLLATYAERPTPEQIDDAFSGKSR 337
>gi|302769091|ref|XP_002967965.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
gi|300164703|gb|EFJ31312.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
Length = 575
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 177/284 (62%), Positives = 223/284 (78%), Gaps = 1/284 (0%)
Query: 24 QDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYR 83
QD+W+G+PT+LN+IPHSREIR YFY +V + + AV DG +LKVEI IPELNPEMDVYR
Sbjct: 291 QDSWRGMPTKLNSIPHSREIREYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYR 350
Query: 84 IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 143
GTL+ELVR +A +FA+DGKRVKVCVQG MGEG GMPLQL+G+R++LE MDWGDYGAK
Sbjct: 351 AGTLLELVRELAFTFANDGKRVKVCVQGPMGEGIFRGMPLQLSGSRRMLEVMDWGDYGAK 410
Query: 144 GTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDL 203
GTFV IG+I A++V E DDMFIL+APQNAVGNCII+D++AM AAG RPVIL+NP+LKD+
Sbjct: 411 GTFVNIGAIAASDVFETDDMFILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDV 470
Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
PG+ G+MQ +GR++R+E++ SF CY FRLLY AGT YPIMGALRM YP YE++KR+D
Sbjct: 471 PGAGGVMQVIGREQRMEFSDSFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDT 530
Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFL 307
E Y+ +TF +P+ ++ D F G+ + K S W F
Sbjct: 531 GYSSETYIQEATFAGRPTPTELQDVFAGRSKEASKTKS-IWSFF 573
>gi|302761066|ref|XP_002963955.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
gi|300167684|gb|EFJ34288.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
Length = 575
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 177/284 (62%), Positives = 222/284 (78%), Gaps = 1/284 (0%)
Query: 24 QDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYR 83
QD+W+G+PT+LN+IPHSREIR YFY +V + + AV DG +LKVEI IPELNPEMDVYR
Sbjct: 291 QDSWRGMPTKLNSIPHSREIREYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYR 350
Query: 84 IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 143
GTL+ELVR +A +FA+DGKRVKVCVQG MGEG GMPLQL+G+R +LE MDWGDYGAK
Sbjct: 351 AGTLLELVRELAFTFANDGKRVKVCVQGPMGEGIFRGMPLQLSGSRMMLEVMDWGDYGAK 410
Query: 144 GTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDL 203
GTFV IG+I A++V E DDMFIL+APQNAVGNCII+D++AM AAG RPVIL+NP+LKD+
Sbjct: 411 GTFVNIGAIAASDVSETDDMFILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDV 470
Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
PG+ G+MQ +GR++R+E++ SF CY FRLLY AGT YPIMGALRM YP YE++KR+D
Sbjct: 471 PGAGGVMQVIGREQRMEFSDSFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDT 530
Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFL 307
E Y+ +TF +P+ ++ D F G+ + K S W F
Sbjct: 531 GYSSETYIQEATFPGRPTPTELQDVFAGRSKEASKTKS-IWSFF 573
>gi|168018045|ref|XP_001761557.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687241|gb|EDQ73625.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 264
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 162/264 (61%), Positives = 211/264 (79%), Gaps = 1/264 (0%)
Query: 27 WKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGT 86
W+G+PT+LN PHSREIR YFY D QA + A++DG+ +LKV + IPELNPEMDVYRIGT
Sbjct: 1 WRGVPTKLNTCPHSREIRQYFYKDACQAAKNAIDDGKHKLKVLLTIPELNPEMDVYRIGT 60
Query: 87 LMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 146
L+EL+R +A +FA+DGKRV+VCVQG MGEG +GMPL L+GTR +LE MDWG+Y K TF
Sbjct: 61 LLELIRELAFTFANDGKRVRVCVQGPMGEGIFSGMPLALSGTRPMLERMDWGEYIEKDTF 120
Query: 147 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 206
+ G++GA EV + DDMFIL+APQNAVGNCIIDD+KAM +AAG RPVI++NP+LKD+P S
Sbjct: 121 IRFGAVGAKEVYDDDDMFILMAPQNAVGNCIIDDLKAMVEAAGDRPVIILNPKLKDIPAS 180
Query: 207 SGIMQTM-GRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPS 265
GIMQ + GR +R+E+A SF CY FRL+Y +GTQYPI+GA RM YP YE+Y+R D+
Sbjct: 181 GGIMQQVGGRQERMEFAESFFTCYNFRLMYTSGTQYPIVGAFRMAYPGPYEIYRRFDLDR 240
Query: 266 GGEKYVILSTFTEKPSSDDINDAF 289
G E Y ++++F +P+ +I+DA+
Sbjct: 241 GVEVYKLIASFDHEPTRSEISDAY 264
>gi|218200847|gb|EEC83274.1| hypothetical protein OsI_28627 [Oryza sativa Indica Group]
Length = 681
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 152/215 (70%), Positives = 184/215 (85%), Gaps = 6/215 (2%)
Query: 17 SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
S ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 315 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 374
Query: 77 PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 375 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 434
Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
WG+YGAKGTF+ G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 435 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 494
Query: 197 NPRLKDLP------GSSGIMQTMGRDKRLEYAASF 225
NPRLK GS G ++ G+++ ++YAA
Sbjct: 495 NPRLKGCGVGVRGWGSRGGVKVNGKNRVVKYAADL 529
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 73/110 (66%), Positives = 88/110 (80%)
Query: 202 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 261
D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 572 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 631
Query: 262 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
D P+G E+YV+L F EKP+ D+I +AF + EK ASGFWGFLS +
Sbjct: 632 DEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 681
>gi|222640270|gb|EEE68402.1| hypothetical protein OsJ_26756 [Oryza sativa Japonica Group]
Length = 680
Score = 318 bits (816), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 152/215 (70%), Positives = 183/215 (85%), Gaps = 6/215 (2%)
Query: 17 SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
S ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 314 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 373
Query: 77 PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 374 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 433
Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
WG+YGAKGTF+ G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 434 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 493
Query: 197 NPRLKDLP------GSSGIMQTMGRDKRLEYAASF 225
NPRLK GS G + G+++ ++YAA
Sbjct: 494 NPRLKGCGVGVRGWGSRGGGKVNGKNRVVKYAADL 528
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 73/110 (66%), Positives = 88/110 (80%)
Query: 202 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 261
D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 571 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 630
Query: 262 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
D P+G E+YV+L F EKP+ D+I +AF + EK ASGFWGFLS +
Sbjct: 631 DEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 680
>gi|384250870|gb|EIE24349.1| hypothetical protein COCSUDRAFT_65858 [Coccomyxa subellipsoidea
C-169]
Length = 291
Score = 243 bits (620), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 126/278 (45%), Positives = 176/278 (63%), Gaps = 10/278 (3%)
Query: 30 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
IP RLNN+P SRE R +FY+DV +A ++AV L V++ +PE NPEMDVYRIGT++E
Sbjct: 18 IPQRLNNVPASRETRQFFYNDVCRAVEKAVAAEERLLSVKLTVPETNPEMDVYRIGTVLE 77
Query: 90 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
LVR + A DGKRVKVCVQ ++G+G G+PL L+G R+I+E MDWG+ FV
Sbjct: 78 LVRDLVTDLAADGKRVKVCVQQALGQGVFQGLPLSLSGVRRIMENMDWGEV---FEFVSF 134
Query: 150 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSS 207
G++G++ +D+ +++VAPQN VG+ I+ + M AA + +IL NP LKDLP +S
Sbjct: 135 GNVGSDHLDD-SAYYVIVAPQNVVGSTIMTKLLEMATAAVEQKKTLILFNPLLKDLPSAS 193
Query: 208 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNGYELYKRVDVPSG 266
G+M GR +R+E+A SF Y FRLLY + YPI GALR Y +E++KR D+
Sbjct: 194 GVMGVRGRRERMEFADSFIPAYHFRLLYSSSAMMYPIRGALRHVYGGPWEVFKREDMGKR 253
Query: 267 GEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 304
E+Y + +F +P + F +P KK SG W
Sbjct: 254 TEEYRFVGSFDSQPDGSLLTPLFQEQP---PKKNSGAW 288
>gi|413922034|gb|AFW61966.1| hypothetical protein ZEAMMB73_400224, partial [Zea mays]
Length = 446
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 105/133 (78%), Positives = 124/133 (93%)
Query: 21 SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 80
+ S++ W+GIPTRLNNIPHSREIR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMD
Sbjct: 310 TGSKNEWRGIPTRLNNIPHSREIREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMD 369
Query: 81 VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 140
VYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDY
Sbjct: 370 VYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDY 429
Query: 141 GAKGTFVEIGSIG 153
GAKGTF+ IG++G
Sbjct: 430 GAKGTFINIGAVG 442
>gi|303283278|ref|XP_003060930.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457281|gb|EEH54580.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 354
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 113/233 (48%), Positives = 155/233 (66%), Gaps = 5/233 (2%)
Query: 30 IPTRLNNIPHSREIRNYFYDDVLQATQRAVN--DGRTRLKVEINIPELNPEMDVYRIGTL 87
IP LNN PH RE+R +FYDD + + RAV + TR+K ++ PELN DVYR+GTL
Sbjct: 85 IPIMLNNTPHPREVRRWFYDDCVTSVTRAVTGEEPMTRVKAKVEFPELNVNGDVYRVGTL 144
Query: 88 MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 147
+ELVR +A++ A DGKRV+VCVQGSMGEG G+PL L+G +IL+ MDWGD F+
Sbjct: 145 LELVRELAMTLAKDGKRVRVCVQGSMGEGVFQGLPLSLSGVARILDMMDWGD---ADEFI 201
Query: 148 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 207
G+I + +DD FIL+APQN VG ++ + AM +AAG RP+I+INP+L D+ +
Sbjct: 202 SRGAIDGDVPQPEDDFFILIAPQNIVGFAVLPRIIAMEEAAGDRPMIMINPKLDDIQSAG 261
Query: 208 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 260
+M GR +R EY +++ Y FRLLY +PI GA+R T+ +ELYK+
Sbjct: 262 NVMSIRGRGERREYVSTWREVYHFRLLYRKPYFFPIYGAMRYTHDGKWELYKK 314
>gi|307109260|gb|EFN57498.1| hypothetical protein CHLNCDRAFT_21254 [Chlorella variabilis]
Length = 396
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 171/314 (54%), Gaps = 55/314 (17%)
Query: 30 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
IPTRLN IPH R R +FY + A +RA++ G TR+ IPELN E DVYR+GTL+E
Sbjct: 76 IPTRLNTIPHERTTRQHFYREATTAIKRAIDAGETRVLARCTIPELNTEFDVYRVGTLLE 135
Query: 90 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
LVR +A S A DGKRVKVCVQ ++G+G GMPL L G +I++ MDWG+ KG
Sbjct: 136 LVREVATSLAADGKRVKVCVQQALGQGVFQGMPLSLNGVLRIMKQMDWGESAEKGVSSAR 195
Query: 150 GSI----------------------------------------------------GANEV 157
G + GA EV
Sbjct: 196 GGVVGCLFCVHTGRSHEEGMCACSAWLVHSSPCPVLTSLTCHPATAADRILTGNLGAAEV 255
Query: 158 DEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSSGIMQTMGR 215
E D F+L+APQN VG+ I+ + M AA +P++LINP+L D+ ++G+M GR
Sbjct: 256 -EDADAFVLIAPQNIVGHSIMPLLSEMVAAAEEAGKPMVLINPKLTDIQSAAGVMSVRGR 314
Query: 216 DKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 275
RL++ A+F Y FRLLY + +PIMGALR + +E+++RV++ G E+Y ++ +
Sbjct: 315 QDRLDFTATFGTAYHFRLLYKGVSMHPIMGALRHEHDGPWEVFRRVELGPGVEEYQLIGS 374
Query: 276 FTEKPSSDDINDAF 289
F +P+ I DAF
Sbjct: 375 FDREPTPPLITDAF 388
>gi|255078468|ref|XP_002502814.1| predicted protein [Micromonas sp. RCC299]
gi|226518080|gb|ACO64072.1| predicted protein [Micromonas sp. RCC299]
Length = 331
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 120/267 (44%), Positives = 164/267 (61%), Gaps = 8/267 (2%)
Query: 30 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDG--RTRLKVEINIPELNPEMDVYRIGTL 87
IP LNN PH RE+R +FYDD ++ AV +TR+K I PELN DVYRIGTL
Sbjct: 68 IPKMLNNTPHPREVRKWFYDDCRESVVAAVTGANPKTRVKAFIEFPELNVNGDVYRIGTL 127
Query: 88 MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 147
+ELVR +A++ A DGKRV+VCVQGSMG+G G+PL L+G +IL+ MDWGD F+
Sbjct: 128 LELVREVAMALAADGKRVRVCVQGSMGQGVFQGLPLSLSGVARILDMMDWGD---ADEFI 184
Query: 148 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 207
GSIG + D FIL++PQN VG ++ ++ M AAG RP+I+INP+L D+ +
Sbjct: 185 ARGSIGGDVPRPDDAYFILISPQNIVGYSVLPYLQEMEKAAGDRPIIMINPKLGDIQSAG 244
Query: 208 GIMQTMGRDKRLEYA-ASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSG 266
+M GR +R EY ++E Y FRLLY +PI GALR +ELYK+
Sbjct: 245 NVMSIRGRGERREYVEKTWEEVYHFRLLYKKPFFFPIYGALRYGTGGMWELYKKFGKME- 303
Query: 267 GEKYVILSTFTE-KPSSDDINDAFLGK 292
E++ +L T+ + +P + +I L K
Sbjct: 304 QEEWKLLRTYDDGEPGAAEITKRILAK 330
>gi|412991462|emb|CCO16307.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 117/278 (42%), Positives = 169/278 (60%), Gaps = 17/278 (6%)
Query: 29 GIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGR-TRLKVEINIPELNPEMDVYRIGTL 87
+PT LNNIPH+REIR +FYDD ++ RA+ D R R+KV PE+N E DV+R GTL
Sbjct: 77 ALPTLLNNIPHTREIRRFFYDDATESVLRAIKDDRNVRMKVFTEFPEMNVEGDVFRAGTL 136
Query: 88 MELVRVIALSFA----DDG----KRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
+E+VR +A A DG +RVK+CVQGSMG G G+PL L+G ++++ MDW +
Sbjct: 137 LEMVRTMATELARMERKDGSGGKQRVKLCVQGSMGTGVFQGLPLALSGISRLIDLMDWEE 196
Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
A+ + G+IG + + +D+FIL+ PQN VG I+ M+ M + AG RP+IL+NP+
Sbjct: 197 DVAER--ISNGAIGKDHAKDDEDVFILICPQNIVGYSILPYMEEMMEVAGNRPMILLNPK 254
Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNG----- 254
L D+ + +M GR R+E+A S+E Y FRLLY +PI GALR +Y +
Sbjct: 255 LGDIQSAGNVMSIRGRQGRMEFANSWEEIYHFRLLYRKPFFFPIYGALRKSYSSAEEDDT 314
Query: 255 YELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 292
+ELYKR E Y ++ + ++P+ D + GK
Sbjct: 315 WELYKRFG-KMDDEYYQLMEVYKKEPNPDQMTKVIWGK 351
>gi|145349122|ref|XP_001418989.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579219|gb|ABO97282.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 290
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/263 (42%), Positives = 158/263 (60%), Gaps = 4/263 (1%)
Query: 30 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
+PT LNN+PH+R R +FY+D ++ AV R+K++ PELN DVYR+GT +E
Sbjct: 31 VPTLLNNVPHARATRRWFYEDATKSIVAAVTARERRIKIKSEFPELNVNGDVYRVGTTLE 90
Query: 90 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
LVR A + A DG RVK+CVQ SMG+G +PL L+G K+L MDW + G +G
Sbjct: 91 LVRETAAALARDGLRVKICVQRSMGQGVFQALPLSLSGVSKLLGMMDW-EPGTEGMIDPS 149
Query: 150 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGI 209
G IGA + E D FILV PQN VG ++ ++ M G +P+I+INPRL D+ + +
Sbjct: 150 GGIGAAD-GEGADCFILVQPQNIVGYSVLPYIEEMEAVVGDKPMIMINPRLDDVQSAGNV 208
Query: 210 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGGE 268
M GR +R++ A+++ Y FRLLY +PI GALR Y N +ELYKR E
Sbjct: 209 MSVRGRKERMDAVAAWKEVYHFRLLYRKPYFFPIYGALRFAYYENEWELYKRTGARE-LE 267
Query: 269 KYVILSTFTEKPSSDDINDAFLG 291
+YV+L T+ +P+ D++ G
Sbjct: 268 QYVLLGTYPSEPTPDEMTKKIWG 290
>gi|428174173|gb|EKX43070.1| hypothetical protein GUITHDRAFT_110795 [Guillardia theta CCMP2712]
Length = 584
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 119/273 (43%), Positives = 159/273 (58%), Gaps = 26/273 (9%)
Query: 34 LNNIPHSREIRNYFYDDVLQATQRAVNDGR------TRLKVEINIPELNPEMDVYRIGTL 87
LN IP + EIR +FY DV A Q A+ D R RLK+E PELNPEMD +R+GT+
Sbjct: 322 LNEIPCNTEIREWFYRDVCDAVQVALQDSRFLESCAGRLKIECVPPELNPEMDSFRVGTM 381
Query: 88 MELVRVIALSFADD-GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 146
+E+VR IA+ G+RVKVC+QGS+GEG GMPL L G R+++E MDW G
Sbjct: 382 LEMVREIAIRVGVVMGRRVKVCIQGSLGEGIFTGMPLMLNGMRRVMESMDW--QARPGEM 439
Query: 147 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPG 205
E I ++FI++APQ+ VG + + + M +AAG R VILINPRLKD P
Sbjct: 440 YEGLLI---------NVFIIIAPQSMVGASVYEPLSQMVEAAGPKRTVILINPRLKDRPS 490
Query: 206 SSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNG--YELYKRVD 262
S +M GR +R+ +AASF + FRLLY T +PI GALRM N + L+KR
Sbjct: 491 SGNVMSVQGRSQRIAFAASFREIFHFRLLYTGTTFMFPIQGALRMNMANSPFWTLFKRET 550
Query: 263 VPS----GGEKYVILSTFTEKPSSDDINDAFLG 291
+ GE Y ++ F E+P ++ I + G
Sbjct: 551 TTTPDERTGESYQPIAVFAEEPKTEAITNMMNG 583
>gi|219124197|ref|XP_002182396.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406357|gb|EEC46297.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 406
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 166/284 (58%), Gaps = 28/284 (9%)
Query: 34 LNNIPHSREIRNYFYDDVLQATQRAV---NDGRT--RLKVEINIPELNPEMDVYRIGTLM 88
LN +PH+R +R+YFY+ A AV ++G+ R+ + PE+NP MD YRIGT++
Sbjct: 121 LNQVPHNRYVRDYFYEQAALAVHDAVVLCSEGKCINRMLITSQFPEMNPSMDSYRIGTIL 180
Query: 89 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 142
E+VR I + A++ RV++CVQGSMG G GMP QL G KI++ MDW + G
Sbjct: 181 EMVRTIGIKLAEENLRVRICVQGSMGVGIFTGMPKQLNGVSKIIQMMDWQSGEGELNEGM 240
Query: 143 KGTFVEIGSIGA------------NEVDEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 188
G ++ G++G N V QDD+FIL+APQ+ VG + I+ ++ M +AA
Sbjct: 241 VGDYIRFGAVGPEHVLNEEKDKDDNVVQYQDDVFILIAPQSMVGTDSSIMPLLQGMVEAA 300
Query: 189 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 247
G RPVIL+NP L D ++G GR +R+++A SF+ Y F+ +Y +GT Y PI+GA+
Sbjct: 301 GNRPVILMNPDLTDKVSAAGQQSVRGRQQRIDFAESFQTVYHFQNIYISGTSYFPILGAI 360
Query: 248 RMTYPNGYEL-YKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 289
+P L ++R D G GE YV + P ++I DAF
Sbjct: 361 TKLHPKEPWLAHQRRDYADGEGEIYVPVLAGEVIPKGEEILDAF 404
>gi|299473139|emb|CBN78715.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 442
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 107/269 (39%), Positives = 162/269 (60%), Gaps = 9/269 (3%)
Query: 26 NWKGIPTR--LNNIPHSREIRNYFYDDVLQATQRAVNDGRT--RLKVEINIPELNPEMDV 81
++ G+P LNN+PHS+ RN+ Y +A AV D R+K+ +PELNP MD
Sbjct: 167 SYGGLPVNMLLNNVPHSKMARNFIYRGAAEALVLAVEDPTVPRRMKLVCTVPELNPAMDT 226
Query: 82 YRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYG 141
YR+GT++ELVR +A S AD G ++ +Q SMG+G +PL L+G I++ MDW D G
Sbjct: 227 YRVGTVLELVRELAYSMADVGLNTRILIQPSMGKGVFKSLPLALSGVMSIMQGMDWED-G 285
Query: 142 AKGTFVEIGSIG-ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
G+ V G +G ++V +DD+ ++++PQ+ G I + + A RPV++INP L
Sbjct: 286 LVGSHVNFGQLGNKDDVSAEDDVILVISPQSIAGYSIHPLLVDTVEHANGRPVVIINPNL 345
Query: 201 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQ-YPIMGALRMTYPNG-YELY 258
KD P S G+MQ GR +R+ +A SF++ Y FRLLY + +PI GA+ P+ Y ++
Sbjct: 346 KDRPSSGGVMQVRGRGERIAFAESFKDIYNFRLLYGSSVAFFPIKGAVMKAGPSEPYVVF 405
Query: 259 KRVDV-PSGGEKYVILSTFTEKPSSDDIN 286
R + S GE+YV L TF + P +++I
Sbjct: 406 DRRNTGKSKGEEYVPLGTFDDDPDAENIT 434
>gi|397643725|gb|EJK76041.1| hypothetical protein THAOC_02216 [Thalassiosira oceanica]
Length = 1862
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 115/284 (40%), Positives = 159/284 (55%), Gaps = 28/284 (9%)
Query: 34 LNNIPHSREIRNYFYDDVLQATQRAV---NDG--RTRLKVEINIPELNPEMDVYRIGTLM 88
LN +PH+R +R YFY+ A AV + G R+K+++ PE+NP+MD YRIGTL+
Sbjct: 1576 LNQVPHNRYVRQYFYEMAADAALEAVVLCSAGFISNRMKLQVMFPEMNPQMDSYRIGTLL 1635
Query: 89 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 142
EL R IA+ FA+ RV+VCVQGS+G+G +P QL G IL+ MDW + G
Sbjct: 1636 ELTRTIAIRFAEQNLRVRVCVQGSIGQGIFTAIPKQLNGVATILKRMDWQSGEGEENEGM 1695
Query: 143 KGTFVEIGSIGANEV------------DEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 188
G FV G IG V +QDD+FIL+ PQN G + II + MT+A
Sbjct: 1696 VGDFVNFGHIGKEHVVNFKEGEGGEADTQQDDVFILICPQNMQGIESSIIPYLLEMTEAV 1755
Query: 189 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 247
G RP+ILINP L D S G GR R+++A SF+ + F+ +YY+GT Y PI+GA
Sbjct: 1756 GDRPMILINPDLVDKASSQGQQSVRGRQDRIDFANSFKTIFQFQNIYYSGTSYFPILGAS 1815
Query: 248 RMTYPNG-YELYKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 289
+ P + ++R D+ G GE YV + P S+ + F
Sbjct: 1816 TLLGPEEMWVSHQRRDLKGGKGEVYVPVYASESMPDSEVLMSTF 1859
>gi|308806305|ref|XP_003080464.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
gi|116058924|emb|CAL54631.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
Length = 284
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 106/264 (40%), Positives = 153/264 (57%), Gaps = 16/264 (6%)
Query: 30 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
IPT LNN+PH+RE R +FYDD +T AV G R+K++ P + P
Sbjct: 35 IPTLLNNVPHARETRRWFYDDACASTLAAVGAGERRMKIKTEFPGVEP------------ 82
Query: 90 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
L R +A + A DGKRVK+CVQ SMG+G + L L+G K+L MD+ + A+
Sbjct: 83 LTRAMAQTLARDGKRVKICVQRSMGQGVFQALTLSLSGVSKLLGMMDFEEDVAE-MIDPN 141
Query: 150 GSIGANE-VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSG 208
G IGA + ++ D +ILV PQN VG ++ + AM +A G +P+I+INP+L D+ +
Sbjct: 142 GGIGAEDGAEDGADAYILVQPQNIVGYSVLPYIAAMEEAVGDKPMIMINPKLDDIQSAGN 201
Query: 209 IMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGG 267
+M GR +R++ A++ Y FRLLY +PI GALR T Y N +ELYKRV +
Sbjct: 202 VMSIRGRQERMDAVAAWREIYHFRLLYRKPYFFPIYGALRYTYYENAWELYKRVGMRE-E 260
Query: 268 EKYVILSTFTEKPSSDDINDAFLG 291
E+YV+ ST+ +P D++ G
Sbjct: 261 EEYVLRSTYDSEPDPDEMTKQIWG 284
>gi|223997276|ref|XP_002288311.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
gi|220975419|gb|EED93747.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
Length = 231
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 141/230 (61%), Gaps = 16/230 (6%)
Query: 34 LNNIPHSREIRNYFYDDVLQATQRA---VNDGR--TRLKVEINIPELNPEMDVYRIGTLM 88
LN +PH+R +R YFYD QA A ++G+ R+K+ + PE+NP MD YRIGTL+
Sbjct: 1 LNQVPHNRFVRKYFYDMASQAALDAAVLCSEGKLTNRMKMTVMFPEMNPSMDSYRIGTLL 60
Query: 89 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDW------GDYGA 142
EL R IA+ A+ RV+VCVQGSMG G G+P QL G +L+ MDW + G
Sbjct: 61 ELARSIAIKLAEQNLRVRVCVQGSMGVGIFTGIPKQLNGVSTLLQRMDWESEVGEENEGM 120
Query: 143 KGTFVEIGSIGA--NEVDEQDDMFILVAPQNAVG-NC-IIDDMKAMTDAAGTRPVILINP 198
G +V G +G N +QDD+F+++ PQN +G C II + M +AAG RPVI++NP
Sbjct: 121 VGDYVNFGFVGKEHNSAVQQDDVFLIICPQNMLGLECSIIQPLSEMVEAAGDRPVIILNP 180
Query: 199 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 247
L D S G GR +R+++A SFE + F+ +Y +GT Y PI+G++
Sbjct: 181 DLTDKISSQGQQSIRGRQQRIDFAESFETIFQFQNIYTSGTSYFPILGSV 230
>gi|62320102|dbj|BAD94280.1| adenylate kinase -like protein [Arabidopsis thaliana]
Length = 95
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 80/95 (84%)
Query: 210 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEK 269
MQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EK
Sbjct: 1 MQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEK 60
Query: 270 YVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 304
YV+L+T+ E+P+ + I+DAF GK R++ KKASG W
Sbjct: 61 YVLLATYAERPTPEQIDDAFSGKSRDQSKKASGIW 95
>gi|147768155|emb|CAN78322.1| hypothetical protein VITISV_001685 [Vitis vinifera]
Length = 228
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/61 (77%), Positives = 51/61 (83%)
Query: 8 KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
K + + SN SSSQ+ W+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLK
Sbjct: 167 KSIRASERLSNLASSSQEXWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLK 226
Query: 68 V 68
V
Sbjct: 227 V 227
>gi|425442993|ref|ZP_18823225.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715818|emb|CCH99873.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 247
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL FA
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFAS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + ++ E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS+ ND+ P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDS--SNPANKAKKSGLF 235
>gi|166367819|ref|YP_001660092.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|425464571|ref|ZP_18843881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|166090192|dbj|BAG04900.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|389833386|emb|CCI22146.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 247
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGDRRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + ++ E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS+ ND+ P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDS--SNPANKAKKSGLF 235
>gi|422304988|ref|ZP_16392325.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389789763|emb|CCI14274.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 247
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 123/271 (45%), Gaps = 52/271 (19%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL FA
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFA-- 50
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 154
S+ +G +G+ + T +L DWG+ T ++G +G+
Sbjct: 51 ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPIE 95
Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
N++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S +
Sbjct: 96 NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149
Query: 215 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 272
R R + ++ E+CY+FR L A YP + + + +GYEL GE
Sbjct: 150 RQLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA--- 206
Query: 273 LSTFTEKPSSDDINDAFLGKPRNREKKASGF 303
L K SS++ D P N+ KK+ F
Sbjct: 207 LENLILKASSNNPKDG--SNPANKAKKSGLF 235
>gi|425435402|ref|ZP_18815857.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389680066|emb|CCH91215.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 247
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 72/271 (26%), Positives = 121/271 (44%), Gaps = 52/271 (19%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFT-- 50
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 154
S+ +G +G+ + T +L DWG+ T ++G +G+
Sbjct: 51 ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPVE 95
Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
N++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S +
Sbjct: 96 NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149
Query: 215 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 272
R R + + E+CY+FR L A YP M + + +GYEL GE
Sbjct: 150 RQLRERFLSILESCYYFRPLESAIVYRSYPSMWQVWLEKEDGYELISEQSTKPMGEA--- 206
Query: 273 LSTFTEKPSSDDINDAFLGKPRNREKKASGF 303
L K SS+ ND P N+ KK+ F
Sbjct: 207 LENLILKASSNTPNDG--SNPANKAKKSGLF 235
>gi|427731149|ref|YP_007077386.1| hypothetical protein Nos7524_4017 [Nostoc sp. PCC 7524]
gi|427367068|gb|AFY49789.1| protein of unknown function (DUF1995) [Nostoc sp. PCC 7524]
Length = 246
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 116/252 (46%), Gaps = 52/252 (20%)
Query: 42 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
E+ N D + QA T+ A+ DG TRL+VE+ PEL M + F
Sbjct: 3 ELPNTLEDAIAQAREATKAALADGYTRLQVELLFPELK----------FMPVAEQFLPLF 52
Query: 99 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 156
++ R+K+ G ALA R+ DW D + + G I + ++
Sbjct: 53 SEYESRLKIFF-ADAGASALA---------RR-----DWADVPFQILDIGTGRIASIQSK 97
Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 216
V +D++F+ VAP + + ++ + + G RP++++NPRL+D PG+ GI R
Sbjct: 98 VQPEDEIFLFVAPTSVE----VPQVEKICEIIGDRPLVMLNPRLED-PGTVGIGYA-ARQ 151
Query: 217 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
R + + E+CY+ R P+ A+ YP +EL+ V+ +K+ L
Sbjct: 152 TRQRFISKIESCYYLR---------PVDDETAVFRCYPGLWELW--VENSGTWQKFAELP 200
Query: 275 TFTEKPSSDDIN 286
+KPS D+I+
Sbjct: 201 ---KKPSGDEID 209
>gi|354566056|ref|ZP_08985229.1| protein of unknown function DUF1995-containing protein [Fischerella
sp. JSC-11]
gi|353546564|gb|EHC16012.1| protein of unknown function DUF1995-containing protein [Fischerella
sp. JSC-11]
Length = 250
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 109/246 (44%), Gaps = 46/246 (18%)
Query: 59 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
+ DG TRL+VE+ PEL LME+ + F ++KV + G G
Sbjct: 23 IADGYTRLQVELLFPELK----------LMEVAQQFLPMFEQYNDKLKVFFADAGGAG-- 70
Query: 119 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNC 176
L DW D K T + G + E V +D++F+ +AP G+
Sbjct: 71 -------------LARRDWTDVRFKITDIGTGRAASIESKVQPEDEIFLFIAP----GSP 113
Query: 177 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
+ ++ + G RP++++NP L+D G GI R R+ + ++ E+CY+ R
Sbjct: 114 EVPQLEKICAVIGDRPLVMLNPFLED-AGKIGIGYA-ARQTRIHFLSTIESCYYLR---- 167
Query: 237 AGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNR 296
+ YP + R YP +E+ V + GE Y ++ +KPS DD+ D L K +
Sbjct: 168 --SVYPDVALFR-CYPGLWEIL----VENNGE-YQKIAELPQKPSGDDL-DIILAKGQAA 218
Query: 297 EKKASG 302
A G
Sbjct: 219 TAIAEG 224
>gi|425449243|ref|ZP_18829085.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|389764162|emb|CCI09454.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 247
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 121/270 (44%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D++F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEVFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|434398468|ref|YP_007132472.1| protein of unknown function DUF1995-containing protein [Stanieria
cyanosphaera PCC 7437]
gi|428269565|gb|AFZ35506.1| protein of unknown function DUF1995-containing protein [Stanieria
cyanosphaera PCC 7437]
Length = 247
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 114/260 (43%), Gaps = 59/260 (22%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+AT A+ DG R++VE+ IPE+ + V L A FA+ G +KV S
Sbjct: 17 EATLAALADGYNRVQVELVIPEITLQAQV--------LALEFATIFAEYGSGLKVMFPDS 68
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 165
G ALA DWG D G++ T VE +V EQD +FI
Sbjct: 69 -GAAALAK--------------RDWGEVPFQVTDLGSRYTAVE------TQVSEQDQIFI 107
Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
+VAP ++ ++ + + A RPV+L+ P+L+D+ GI T R R + S
Sbjct: 108 VVAP----AAIEVEKVEKLCNVAEDRPVVLLIPQLEDI-AFVGIGYT-ARQMRERFIKSL 161
Query: 226 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 285
+CY+ R L A A+ +P+ ++++ + E+Y + +P +D+
Sbjct: 162 YSCYYLRPLEEA--------AILRCHPSSWQIWLEKE-----EEYQLAMELAHQPMGEDL 208
Query: 286 NDAFLG----KPRNREKKAS 301
+ P N+ + AS
Sbjct: 209 ERLLMKLTNPNPENKNEVAS 228
>gi|425459538|ref|ZP_18839024.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389822715|emb|CCI29585.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 247
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 118/270 (43%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS+ ND P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDG--SNPANKAKKSGLF 235
>gi|425446838|ref|ZP_18826837.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389732775|emb|CCI03345.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 247
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 119/270 (44%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G RL+VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRLQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS++ D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPKDS--SNPANKAKKSGLF 235
>gi|428781510|ref|YP_007173296.1| hypothetical protein Dacsa_3447 [Dactylococcopsis salina PCC 8305]
gi|428695789|gb|AFZ51939.1| protein of unknown function (DUF1995) [Dactylococcopsis salina PCC
8305]
Length = 253
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 116/235 (49%), Gaps = 41/235 (17%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD-GKRVKVCVQG 111
Q+T A+ G TRL+VE+ +PE+ + + + + FADD G +KV
Sbjct: 17 QSTVAALEAGLTRLQVELKVPEIALKAE--------SIAKEFTDLFADDYGSGLKVLFPD 68
Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQN 171
S G ALA +EF + D G++ T +E +V E+D +F++V+P +
Sbjct: 69 S-GAAALARR------NWSDVEF-NVNDLGSRNTPIE------KKVAEEDQIFLVVSP-S 113
Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
AV + ++ + + AG RPVIL+ P+L+D+ + GI R R + ++ E+C++
Sbjct: 114 AVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLSTLESCFYL 168
Query: 232 RLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+ L A AL YP+G++L+ + G +Y EKP DD++
Sbjct: 169 QPLDEA--------ALLKRYPSGWQLW----IEKGENQYEFFCEEVEKPVGDDLD 211
>gi|440753103|ref|ZP_20932306.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
gi|440177596|gb|ELP56869.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
Length = 247
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|22299764|ref|NP_683011.1| hypothetical protein tlr2221 [Thermosynechococcus elongatus BP-1]
gi|22295948|dbj|BAC09773.1| tlr2221 [Thermosynechococcus elongatus BP-1]
Length = 232
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 59/255 (23%)
Query: 62 GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 121
G+ RL++EI +P+LNP M L FA+ G +KV G ALA
Sbjct: 27 GKNRLQIEILLPDLNP----------MGLAAGYLPLFAELGSDLKVFF-ADAGSAALA-- 73
Query: 122 PLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIID 179
R+ +WG+ Y +G + I A+ D ++VAP +
Sbjct: 74 -------RR-----EWGETAYSVRGVNERLTPIEAS-----DRAIVIVAPTPVE----VT 112
Query: 180 DMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT 239
++ M AG RP IL+NPRL+D+ GI GR R + + E CY+ R L A T
Sbjct: 113 AIEQMCLTAGDRPFILLNPRLQDV-AVVGIGYA-GRQLRERFLNTLEPCYYLRPL--AET 168
Query: 240 QYPIMGALRMTYPNGYELYK-RVDVPSGGEKYVILSTFTEKPSSDDINDAF--LGKPRNR 296
L YP +++++ R P+ L+ F ++P+S+DI A LGKPR
Sbjct: 169 VI-----LWRCYPQAWQIWQYRETAPT------CLAEFEQRPTSEDIERALSALGKPR-- 215
Query: 297 EKKASGFWGFLSSVF 311
A F LS++
Sbjct: 216 ---AQSFLSRLSALL 227
>gi|425472740|ref|ZP_18851581.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881081|emb|CCI38316.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 247
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 118/270 (43%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D++F++V P + I+ ++ + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEVFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS+ ND P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDG--SNPANKAKKSGLF 235
>gi|443658098|ref|ZP_21132025.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
gi|159027702|emb|CAO89569.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333038|gb|ELS47616.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
Length = 247
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ ++D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKDEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|282899475|ref|ZP_06307441.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281195632|gb|EFA70563.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 245
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 107/247 (43%), Gaps = 50/247 (20%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
A + A++DG+TR++VE+ PEL M + F + R+KV +
Sbjct: 21 AVRSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESRLKVFFADAG 70
Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 171
DWGD K + G ++E + +D++F+ +AP +
Sbjct: 71 AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115
Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
+ ++ + G RP +++NPRL+D S + R+ R + ++ E+CY+
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED--SSVVGIGYAARETRRRFISTIESCYYL 169
Query: 232 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
R PI AL +YP +E++ D GE Y ++ +KPS D+I+
Sbjct: 170 R---------PIDEQSALMRSYPGNWEIWLETD----GE-YQKIAELPQKPSGDEIDSIL 215
Query: 290 L-GKPRN 295
+ G+P
Sbjct: 216 IKGQPET 222
>gi|428311732|ref|YP_007122709.1| hypothetical protein Mic7113_3579 [Microcoleus sp. PCC 7113]
gi|428253344|gb|AFZ19303.1| protein of unknown function (DUF1995) [Microcoleus sp. PCC 7113]
Length = 249
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 115/236 (48%), Gaps = 45/236 (19%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+ATQ A+NDGRTRL+VE+ PE+ + I + I L F + G +KV +
Sbjct: 17 EATQSALNDGRTRLQVELVFPEIA--LQAQSIAQ-----QFIPL-FEEYGSGLKVLFPDT 68
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQN 171
G ALA R+ DWG+ K + V S N++ +D F+LVAP +
Sbjct: 69 -GAAALA---------RR-----DWGEVPFKISDVGTSRSPITNKIQAEDKAFLLVAP-S 112
Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
AV + ++ + + AG RP +L+NP+L+D+ S + R R ++ E CY+
Sbjct: 113 AVE---VAQVETLCNLAGDRPCVLLNPQLEDI--SIVGIGMAARKLRERLLSTIEPCYYL 167
Query: 232 RLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
R PI A+ +YP ++++ +D ++Y +++ ++P+ D +
Sbjct: 168 R---------PIDPAAILRSYPGLWQVWLEID-----DEYQLIAEEPQRPAGDALE 209
>gi|425455386|ref|ZP_18835106.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389803734|emb|CCI17368.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 247
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 119/270 (44%), Gaps = 50/270 (18%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLAIESGARRVQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------LFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|443475522|ref|ZP_21065469.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena biceps PCC 7429]
gi|443019641|gb|ELS33702.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena biceps PCC 7429]
Length = 236
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 113/268 (42%), Gaps = 60/268 (22%)
Query: 49 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 108
+ + AT +A+ G TR+ ++ PEL M + A SF
Sbjct: 19 EQAIAATHQAIASGATRVLADLRFPELKA----------MPIAYEFAASFNQQYGEAWQA 68
Query: 109 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 164
+ G ALA RK W D + + G NE + E+D F
Sbjct: 69 IFSDAGAAALA--------KRK------WVD-------LNVSVRGVNEGRRAIREEDRAF 107
Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
+L+ P + ++ ++ + AG RP +++NPRL++ G+ RD+ + ++
Sbjct: 108 LLIEP----TSIEVEQVEKLVQLAGDRPFVMLNPRLENSEVGLGLAARQMRDR---FLST 160
Query: 225 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 284
FE Y+ + L +GAL YP ++++ + + G +K L+ ++PSSDD
Sbjct: 161 FETAYYIKPLE--------LGALWRCYPQTWQVWNQTE--EGMQK---LAEVEQRPSSDD 207
Query: 285 INDAFLGKPRNREKKASG-FWGFLSSVF 311
I+ F R + K SG F G L F
Sbjct: 208 IDRLF----RQKTGKQSGSFLGRLQQFF 231
>gi|186682862|ref|YP_001866058.1| hypothetical protein Npun_R2561 [Nostoc punctiforme PCC 73102]
gi|186465314|gb|ACC81115.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 244
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 111/269 (41%), Gaps = 53/269 (19%)
Query: 42 EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
E+ N D + Q A Q A+ DG TR++VE PEL M + F
Sbjct: 3 ELPNTLEDAIAQSRTAVQAALADGCTRIQVEFLFPELK----------FMPVAEQFLPLF 52
Query: 99 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 156
+ R+K+ + R+ +W D + + G + +
Sbjct: 53 TEYDSRLKIFFADAGAAAL----------ARR-----NWTDAPFQILDIGTGRTASLQTK 97
Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 216
+ +D++F+ +AP + + ++ + G RPV+ +NPRL+D G+ GI R
Sbjct: 98 IQPEDEIFLFIAPTSVE----VPQVEKLCQEIGDRPVVFLNPRLED-SGTVGIGYA-ARQ 151
Query: 217 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
RL + + E+CY+ R PI AL YP +E++ D GE Y ++
Sbjct: 152 TRLRFTNTIESCYYLR---------PIDEQSALSRCYPGQWEVWLETD----GE-YQRIA 197
Query: 275 TFTEKPSSDDINDAFL-GKPRNREKKASG 302
KPS DD++ L G+P+ A
Sbjct: 198 ELPTKPSGDDLDQILLKGQPQTTTDAAPA 226
>gi|427418343|ref|ZP_18908526.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
gi|425761056|gb|EKV01909.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
Length = 237
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 54/269 (20%)
Query: 34 LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV 93
++++P S E + + +T+ A+ DG RL+VE+ IPEL + R TL
Sbjct: 1 MSDLPSSLE---ETIEQAIASTKAAIEDGIIRLQVEMVIPELKQQPIAERFLTL------ 51
Query: 94 IALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGS 151
F D G + KV + G ALA R+ DW D+ +G I
Sbjct: 52 ----FQDLGLQFKVYFPDA-GAAALA---------RR-----DWDNPDFTIRG----INE 88
Query: 152 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 211
I A + D+ F++V P +AV + D++ + + A RPV+L+NP+L+D+ S +
Sbjct: 89 IQAA-ITPDDEAFLIVNP-SAVE---VQDVEKLCNEAQDRPVVLLNPQLEDV--SIVGIG 141
Query: 212 TMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYV 271
R R + ++ +CY++R + + A+ +P G+++++ V G Y
Sbjct: 142 YAARQLRERFLSTLTSCYYYRPMTDS--------AVLRRHPQGWQVWQDV-----GNDYE 188
Query: 272 ILSTFTEKPSSDDINDAFLGKPRNREKKA 300
+ S E+PSS+ + G K A
Sbjct: 189 LKSELPERPSSEALEKILYGSEDTEGKAA 217
>gi|428225033|ref|YP_007109130.1| hypothetical protein GEI7407_1587 [Geitlerinema sp. PCC 7407]
gi|427984934|gb|AFY66078.1| protein of unknown function DUF1995 [Geitlerinema sp. PCC 7407]
Length = 244
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 124/252 (49%), Gaps = 53/252 (21%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF-ADDGKRVKVCVQGS 112
ATQ A++DG +RL+VE+ PE++ + + + + A F A G R K+ +
Sbjct: 18 ATQAALDDGHSRLQVELVFPEIDLQA--------ISIAELFAADFEARYGDRFKILFPDA 69
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGAN-EVDEQDDMFILVAP 169
GA A LA R WG+ Y +G I + A + DEQ F+++AP
Sbjct: 70 ---GAAA-----LAKHR-------WGEKPYEIRG----ISELKAQIQPDEQ--AFLMIAP 108
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
+ + ++ + A RPV+++NPRL+D+ + GI GR R + ++ +CY
Sbjct: 109 ----SSVEVGPVEKFCEEASDRPVVMVNPRLEDV-ATIGIGYA-GRQLRERFLSTLLSCY 162
Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
+ R + G GALR +YP +E++ ++ SG EK ++ ++KP D + D
Sbjct: 163 YLR--PFEG------GALRRSYPGPWEVW--LETESGYEK---VAEESQKPVGDAL-DQI 208
Query: 290 LGKPRNREKKAS 301
+G+ + E + S
Sbjct: 209 IGRVQGAETEGS 220
>gi|16329818|ref|NP_440546.1| hypothetical protein slr1702 [Synechocystis sp. PCC 6803]
gi|383321560|ref|YP_005382413.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324730|ref|YP_005385583.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490614|ref|YP_005408290.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435880|ref|YP_005650604.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|451813978|ref|YP_007450430.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
gi|1652303|dbj|BAA17226.1| slr1702 [Synechocystis sp. PCC 6803]
gi|339272912|dbj|BAK49399.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|359270879|dbj|BAL28398.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274050|dbj|BAL31568.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277220|dbj|BAL34737.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957706|dbj|BAM50946.1| hypothetical protein BEST7613_2015 [Synechocystis sp. PCC 6803]
gi|451779947|gb|AGF50916.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
Length = 251
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 120/244 (49%), Gaps = 55/244 (22%)
Query: 50 DVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCV 109
+ + ATQRA++DG R++VE+ +PE+ + +E ++ FAD G +++
Sbjct: 15 EAIAATQRALDDGYRRIQVELAVPEIALQAQ----AIALEFSQL----FADQGLGLRIIF 66
Query: 110 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------NEVDEQDDM 163
+ G ALA DWG+ +IG +G+ ++ E D +
Sbjct: 67 PDT-GAAALAK--------------RDWGEVP-----FQIGDLGSRYTPIGRKIAESDQV 106
Query: 164 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 223
F++V+P +AV + ++ + + AG RPV+L+ P+L+D+ S + R R + +
Sbjct: 107 FLVVSP-SAVE---VQSVEKLCELAGDRPVVLLIPQLEDV--SIVGIGYAARQLRQRFLS 160
Query: 224 SFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 282
+ + Y+FR P+ GA+ + ++P+ +++ + GE+Y +L+ EKP+
Sbjct: 161 TLFSAYYFR---------PLDGAVVLRSHPSRWQV-----LLEKGEEYELLTELPEKPAG 206
Query: 283 DDIN 286
+ +
Sbjct: 207 EALE 210
>gi|282896506|ref|ZP_06304526.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198612|gb|EFA73493.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 245
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/248 (24%), Positives = 108/248 (43%), Gaps = 52/248 (20%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
A + A++DG+TR++VE+ PEL M + F + +KV +
Sbjct: 21 AAKSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESGLKVFFADAG 70
Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 171
DWGD K + G ++E + +D++F+ +AP +
Sbjct: 71 AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115
Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGI-MQTMGRDKRLEYAASFENCYF 230
+ ++ + G RP +++NPRL+D SS + + R+ R + ++ E+CY+
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED---SSVVGIGYAARETRRRFISTIESCYY 168
Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
R PI AL +YP +E++ D GE Y ++ +KPS D+I+
Sbjct: 169 LR---------PIDEQSALMRSYPGNWEIWLETD----GE-YRKIAELPQKPSGDEIDSI 214
Query: 289 FL-GKPRN 295
+ G+P
Sbjct: 215 LIKGQPET 222
>gi|300865956|ref|ZP_07110692.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336022|emb|CBN55850.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 248
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 121/273 (44%), Gaps = 60/273 (21%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 109
+A + A+ND +TRL++E+ PE+ + I EL + + F D G
Sbjct: 17 EAVKAALNDAQTRLQIELVFPEIALQAQSITQQFITDFEELANGLKVFFPDTGS------ 70
Query: 110 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG---ANEVDEQDDMFIL 166
ALA R+ DWG K T ++GS A +++ +D +F+L
Sbjct: 71 ------AALA---------RR-----DWGTVPFKIT--DLGSSRSPVAEKIEPEDQIFLL 108
Query: 167 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 226
V NA + + ++ + +AA +RPVIL+NPRL+D + GI GR R + + +
Sbjct: 109 V---NA-SSIEVAQVEQLCNAADSRPVILLNPRLEDA-ATIGIGYA-GRQLRDRFLNTLQ 162
Query: 227 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+CY+ R L A AL YP ++++ + GE Y ++S +KP D++
Sbjct: 163 SCYYIRPLPTA--------ALFRCYPQSWQVWLE---ETEGE-YKLISETAQKPVGDELE 210
Query: 287 DAFL--------GKPRNREKKASGFWGFLSSVF 311
P++ K G L S+F
Sbjct: 211 RIIAPTVQNGDPSAPQSAPVKKKGMLAELQSLF 243
>gi|434407903|ref|YP_007150788.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
gi|428262158|gb|AFZ28108.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
Length = 244
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 111/240 (46%), Gaps = 53/240 (22%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+ATQ A+ DG R++++ PEL ++ + FA+ R+KV
Sbjct: 17 EATQAALADGCKRIQIDFLFPELK----------ILPVAEQFLPVFAEYDSRLKVFF-AD 65
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGS----IGANEVDEQDDMFILVA 168
G ALA DW D A + V+IG+ +++ +D++F+ +A
Sbjct: 66 AGGAALARR--------------DWAD--AAFSIVDIGTGRTASLQSKIQPEDEIFLFIA 109
Query: 169 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 228
P + + ++ + + G RPV+ +NPRL+D G GI T R+ R + ++ E+C
Sbjct: 110 PTSVE----VPQLEKLCEIIGDRPVVFLNPRLED-SGVVGIGYT-ARETRRRFISTIESC 163
Query: 229 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
Y+ R P+ A+ +YP +E++ + ++Y ++ ++PS D+++
Sbjct: 164 YYLR---------PVDDETAVFRSYPGQWEVWVETN-----DEYQRIAELPKRPSGDELD 209
>gi|428775196|ref|YP_007166983.1| hypothetical protein PCC7418_0540 [Halothece sp. PCC 7418]
gi|428689475|gb|AFZ42769.1| protein of unknown function DUF1995 [Halothece sp. PCC 7418]
Length = 253
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 109/243 (44%), Gaps = 57/243 (23%)
Query: 53 QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 110
Q+T A+ G TRL+VE+ IPE L E L+E D G +KV
Sbjct: 17 QSTLTALEAGLTRLQVELRIPEIALQGEKIAKEFADLLE---------DDYGSGLKVLFP 67
Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDM 163
+ G ALA R+ +W D G++ T +E +V E D M
Sbjct: 68 DT-GAAALA---------RR-----NWADVSFQVNDLGSRNTPIE------KKVSEDDQM 106
Query: 164 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 223
F+LV+P +AV + ++ + + AG RPVIL+ P+L+D+ + GI R R + +
Sbjct: 107 FLLVSP-SAVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLS 160
Query: 224 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
+ E+CY+ + L A AL YP+ ++L+ + G Y EKP D
Sbjct: 161 TLESCYYLQPLEEA--------ALLKRYPSSWQLW----IEKGENNYEFFCEEPEKPVGD 208
Query: 284 DIN 286
++
Sbjct: 209 TLD 211
>gi|254422515|ref|ZP_05036233.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
gi|196190004|gb|EDX84968.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
Length = 248
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 65/237 (27%), Positives = 115/237 (48%), Gaps = 58/237 (24%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD--DGKRVKVCVQG 111
ATQ A++ G + L++E+ IPEL + IA F + + ++ KV
Sbjct: 17 ATQAALDAGISLLQIEMAIPELKHQP--------------IAQQFVELFNDRQFKVLFPD 62
Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAP 169
+ G ALA R+ DW D+ +G +G + A VD +DD++++V P
Sbjct: 63 A-GAAALA---------RR-----DWNNPDFVIRG----LGEL-AQTVDPEDDLYLIVNP 102
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
+AV +D ++A+ + A +PV+L+NP+L+D+ GI R R + + E CY
Sbjct: 103 -SAVE---VDKVEALCNEALDQPVVLLNPQLEDV-AVVGIGYA-ARQLRDRFLSQIETCY 156
Query: 230 FFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 285
+ R PI G + YP +++++ + G ++Y + + +PSS+DI
Sbjct: 157 YVR---------PIDQGVVYRAYPGPWQIWREI----GPDEYEHVQDLSNRPSSEDI 200
>gi|428317816|ref|YP_007115698.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
gi|428241496|gb|AFZ07282.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
Length = 248
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 108/240 (45%), Gaps = 55/240 (22%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
AT A++DG+T L+VE+ PE+ + + L E F + VKV +
Sbjct: 18 ATAAALSDGKTLLQVELVFPEIALQAQSITLQFLPE--------FEEIYPGVKVFFPDT- 68
Query: 114 GEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFIL 166
G ALA R+ DWG D G+ T VE +++ +D +F+L
Sbjct: 69 GAAALA---------RR-----DWGETPFKVTDLGSSRTPVE------DKIAPEDQLFLL 108
Query: 167 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 226
+ P + ++ + AA RPVIL+NPRL+D+ + GI GR R + E
Sbjct: 109 INP----AAVEVAQVEKIYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLNKIE 162
Query: 227 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+CY+ R L A AL YP ++++ + ++Y ++S +KP DD+
Sbjct: 163 SCYYIRPLDTA--------ALFRCYPQPWQVWLETN-----DEYELISETAQKPVGDDLE 209
>gi|427715914|ref|YP_007063908.1| hypothetical protein Cal7507_0584 [Calothrix sp. PCC 7507]
gi|427348350|gb|AFY31074.1| protein of unknown function DUF1995-containing protein [Calothrix
sp. PCC 7507]
Length = 243
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 108/248 (43%), Gaps = 55/248 (22%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
ATQ A+ DG TRL+VE PEL P M + AD R+KV
Sbjct: 18 ATQAALADGYTRLQVEFLFPELKP----------MPVAEQYLPLLADYESRLKVFF-ADT 66
Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE----QDDMFILVAP 169
G ALA R+ D+ T +IG+ A V + ++++F+ +AP
Sbjct: 67 GAAALA---------RR--------DWDVPFTISDIGTGRATSVSDKIQPEEEIFLFIAP 109
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
+ I ++ + G RP IL+NPRL+D GI R R + + E CY
Sbjct: 110 ----SSVEISQLEKLFAEIGDRPAILLNPRLEDA-AIVGIGYA-ARQIRERFINTIETCY 163
Query: 230 FFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
+ R P+ A+ YP +E++ V + GE Y ++ ++PS D+I+
Sbjct: 164 YLR---------PVDDQTAVFRCYPGLWEVW----VETNGE-YQKIAELPKRPSGDEIDL 209
Query: 288 AFL-GKPR 294
L G+P+
Sbjct: 210 ILLKGQPQ 217
>gi|411117915|ref|ZP_11390296.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711639|gb|EKQ69145.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
Length = 241
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/238 (26%), Positives = 107/238 (44%), Gaps = 53/238 (22%)
Query: 59 VNDGRTRLKVEINIPELNPEMDVYR-IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGA 117
+ G +RL+VEI IPEL P Y+ + E + + F D G ++
Sbjct: 23 LEAGYSRLQVEIAIPELKPMQPAYQYLSVFEEYGSHLKVFFTDAGA-------AALARRE 75
Query: 118 LAGMP-----LQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNA 172
AG P + + GTR+ T VE V +D +F+ VAP +A
Sbjct: 76 WAGSPYNIQSIDIQGTRQT-------------TPVE------QLVLSEDRVFVFVAP-SA 115
Query: 173 VGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFR 232
V +++ + +AAG RPVIL NPR++D+ S + R R + + E CY+ +
Sbjct: 116 VEVGVVEQ---IANAAGDRPVILFNPRMEDV--SVVGIGYAARKLRERFLNTIEPCYYLK 170
Query: 233 LLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
P+ G AL YP+ ++++ E Y +++ T+KP+ + +++ F
Sbjct: 171 ---------PLEGSALIRCYPSLWQVWAETS-----EGYTLIAEETQKPTLERLDEIF 214
>gi|220907967|ref|YP_002483278.1| hypothetical protein Cyan7425_2561 [Cyanothece sp. PCC 7425]
gi|219864578|gb|ACL44917.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length = 233
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 101/233 (43%), Gaps = 54/233 (23%)
Query: 62 GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 121
G +RL+V++ IPEL P M + + F+D G +KV +
Sbjct: 26 GYSRLQVDLLIPELKP----------MPVAQQYLSLFSDYGAGLKVFFADAGAAALARRD 75
Query: 122 PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCI 177
WGD V G NE + +D F+LVAP
Sbjct: 76 ---------------WGD-------VPFTIRGVNELLNPIQPEDRAFVLVAPTPVE---- 109
Query: 178 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 237
+ ++AM + AG RP IL+N +L+D+ + GI GR R + A+FE CY+ R L +
Sbjct: 110 VMQVEAMANQAGDRPFILLNAKLEDI-ATIGIGYA-GRQLRQRFLATFEPCYYLRPLDW- 166
Query: 238 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
GA+ YP+ ++++ + ++Y +++ E+P+ D ++ L
Sbjct: 167 -------GAVLRIYPSPWQVW----LEQAEDQYQLIAEEAERPAGDRLDQILL 208
>gi|119510288|ref|ZP_01629424.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
gi|119465032|gb|EAW45933.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
Length = 244
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 112/247 (45%), Gaps = 50/247 (20%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
ATQ A+ DG TRL+V+ PEL LM + F + R+K+ +
Sbjct: 18 ATQAALADGYTRLQVDFLFPELK----------LMPVAEQFLSLFTEYDSRLKIFFPDA- 66
Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQN 171
G ALA DW K + G + + +++ +D++F+ +AP +
Sbjct: 67 GGAALAN--------------RDWAGTPFKILDIGTGRVASIQSKIQPEDEIFLFIAPTS 112
Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
+ ++ + + G RP +++NPRL+D G GI T R R + ++ E+CY+
Sbjct: 113 VE----VPQVEKLCENIGDRPFVMLNPRLED-SGVVGIGYT-ARQTRQRFISTLESCYYL 166
Query: 232 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
R P+ A+ YP +E++ ++ GE Y ++ ++P+ D+++
Sbjct: 167 R---------PVDDTTAVFRCYPGLWEVWVEIN----GE-YQKVAELPKRPTGDELDVIV 212
Query: 290 L-GKPRN 295
+ G+P+
Sbjct: 213 MQGQPQT 219
>gi|443309167|ref|ZP_21038918.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
gi|442780785|gb|ELR90927.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
Length = 243
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 100/239 (41%), Gaps = 44/239 (18%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
ATQ A+ DG RL+VEI +PELNP +A F D + +
Sbjct: 18 ATQNAIADGLNRLQVEILLPELNP--------------MPVAERFLSDEEGISHNFNKIF 63
Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAPQNA 172
A A A R+ DWG+ + + + E + +D+ + +AP
Sbjct: 64 FPDAGAA-----ALARR-----DWGEVPFELLDISTKRVSVEEQIQPEDEAILCIAPTAQ 113
Query: 173 VGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFR 232
+ ++ + +A G RPV+L+NPRL+D+ S + GR R + + ++CY+ R
Sbjct: 114 E----VLQIEKLCNAMGDRPVVLLNPRLEDV--SIVGIGYAGRQLRDRFLKNIQSCYYLR 167
Query: 233 LLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 291
L G L YP ++++ + Y ++ KP DD++ F G
Sbjct: 168 PLEE--------GTLFRCYPGLWQVWCETT-----DGYELIKELPNKPVGDDLDLIFAG 213
>gi|434395506|ref|YP_007130453.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
gi|428267347|gb|AFZ33293.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
Length = 243
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 109/230 (47%), Gaps = 50/230 (21%)
Query: 61 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 120
DG + L++EI IPELNP M + + G ++KV + G ALA
Sbjct: 25 DGYSLLQIEILIPELNP----------MPVAEQFLPALDSFGNQLKVFFPDA-GAAALA- 72
Query: 121 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ----DDMFILVAPQNAVGNC 176
R+ DWG K ++IG+ G VD+Q D+ F+LVAP +AV
Sbjct: 73 --------RR-----DWGSVPFK--ILDIGT-GRVPVDQQVLPEDEAFLLVAP-SAVE-- 113
Query: 177 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
+ ++ + A G RP IL+NPRL+D+ S + GR R + + E+CY R L
Sbjct: 114 -VAQVEKLHQAVGERPFILLNPRLEDV--SIVGIGYAGRQLRARFLNTIESCYHLRPLDG 170
Query: 237 AGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
A A+ YP+ +++++ + +Y +++ +KP D+++
Sbjct: 171 A--------AVFRCYPSPWQVWQE----NKDGEYQLIAEQPKKPMGDEVD 208
>gi|56752173|ref|YP_172874.1| hypothetical protein syc2164_c [Synechococcus elongatus PCC 6301]
gi|81300739|ref|YP_400947.1| hypothetical protein Synpcc7942_1930 [Synechococcus elongatus PCC
7942]
gi|56687132|dbj|BAD80354.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169620|gb|ABB57960.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 247
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 111/242 (45%), Gaps = 53/242 (21%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+A QRA+ G TR++V++ IPEL + ++ +F D G +++V
Sbjct: 17 EAAQRALAAGLTRIQVDLAIPELKIQPIAWQFLA----------AFLDRGAKLRVFFPDP 66
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 170
G ALA R+ DWG+ + +G + + A V ++++FI + P
Sbjct: 67 -GAAALA---------RR-----DWGELPFSVRG----MEELKAA-VQPEEELFIFIEP- 105
Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
+ + ++ + G RPVIL+NPRL+D+ + GI R R + +++ Y+
Sbjct: 106 ---SSVEVQRLEQLCQEIGDRPVILLNPRLEDV-ATIGIGYA-ARQLRERFLNQWQSAYY 160
Query: 231 FRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
P+ G A+ YP +++++ D Y +L + ++P+ +D++ F
Sbjct: 161 LS---------PLEGAAIFQAYPQRWQVWQETDT-----GYELLQEYDQRPNGEDLDRLF 206
Query: 290 LG 291
G
Sbjct: 207 AG 208
>gi|334118025|ref|ZP_08492115.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
gi|333460010|gb|EGK88620.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
Length = 248
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 109/239 (45%), Gaps = 53/239 (22%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV---IALSFADDGKRVKVCVQ 110
AT A++DG+T L+VE+ PE+ + L EL + + + F D
Sbjct: 18 ATAAALSDGKTLLQVELVFPEIALQAQSITEQFLPELEEIYPGVKVFFPD---------- 67
Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILV 167
G ALA R+ DWG+ K T ++GS + +++ +D +F+L+
Sbjct: 68 --AGAAALA---------RR-----DWGETPFKVT--DLGSSRSPVEDKIAPEDQLFLLI 109
Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
P + ++ + AA RPVIL+NPRL+D+ + GI GR R + + E+
Sbjct: 110 NP----AAVEVAQVERLYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLSKIES 163
Query: 228 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
CY+ R L A AL YP ++++ + +Y ++S +KP DD+
Sbjct: 164 CYYVRPLDAA--------ALFRCYPQSWQVWLERN-----NQYELISETAQKPVGDDLE 209
>gi|452823755|gb|EME30763.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 1138
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 11 HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
+ R+ +NQ SQ++ G+P + ++ IP+S ++ L A A G RL
Sbjct: 50 NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104
Query: 68 VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 125
++I+ + + ++ + L EL+RVI + + + + G A A
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155
Query: 126 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 185
RK DW +K +F+ GS D D I++ P +AV D++AM
Sbjct: 156 ---RK-----DWN--ISKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196
Query: 186 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
+ RP+ILINP+L D+ G++G + R R ++ ++FE+ YF R+ +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254
Query: 237 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
QYP ++ + N E + P Y +L TF KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305
>gi|452823754|gb|EME30762.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 1152
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 11 HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
+ R+ +NQ SQ++ G+P + ++ IP+S ++ L A A G RL
Sbjct: 50 NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104
Query: 68 VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 125
++I+ + + ++ + L EL+RVI + + + + G A A
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155
Query: 126 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 185
RK DW +K +F+ GS D D I++ P +AV D++AM
Sbjct: 156 ---RK-----DWN--ISKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196
Query: 186 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
+ RP+ILINP+L D+ G++G + R R ++ ++FE+ YF R+ +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254
Query: 237 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
QYP ++ + N E + P Y +L TF KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305
>gi|440681570|ref|YP_007156365.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
gi|428678689|gb|AFZ57455.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
Length = 244
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 101/238 (42%), Gaps = 49/238 (20%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+A + A+ DG TR++V+ PEL M + FA+ R+K+ +
Sbjct: 17 EAVKSALADGMTRIQVDFLFPELK----------FMPIAEQFIPLFAEYESRLKIFFADA 66
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQ 170
W D K + G + E + +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WADSPFKVEDIGTGRTASLETKIQPEDEIFLFIAPT 111
Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
+ + ++ + + GTRP IL+NPRL+D G GI R+ R + ++ E+CY+
Sbjct: 112 SVE----VPQLEKLCEIIGTRPFILLNPRLED-SGVVGIGYA-ARETRRRFISTIESCYY 165
Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
R P+ AL YP +E++ + ++Y ++ ++PS D+I+
Sbjct: 166 LR---------PVDDESALFRCYPGDWEVWLETN-----DEYQKIAELPKRPSGDEID 209
>gi|428300086|ref|YP_007138392.1| hypothetical protein Cal6303_3487 [Calothrix sp. PCC 6303]
gi|428236630|gb|AFZ02420.1| protein of unknown function DUF1995-containing protein [Calothrix
sp. PCC 6303]
Length = 250
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 68/259 (26%), Positives = 112/259 (43%), Gaps = 54/259 (20%)
Query: 42 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
++ N D + QA T A+ DG TRL+VE+ PEL P M + +F
Sbjct: 3 QLPNSLEDAIFQAREATLAAIADGCTRLQVELLFPELKP----------MPVAEEFIPTF 52
Query: 99 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-- 156
A G ++KV + DW D K V G A +
Sbjct: 53 ASYGDKLKVFFADAGAAALAR---------------RDWKDVPFKIWDVGTGRASAAQPK 97
Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGIMQTMGR 215
+ +D+MF+ +AP + + +++ + G RP +++NP+L+D G+ GI R
Sbjct: 98 IQAEDEMFLFIAPTSVE----VAELERLCGEIGEQRPFVMLNPKLED-SGTVGIGYA-AR 151
Query: 216 DKRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
+ R+ + ++ E+CY+ R P+ AL YP +E++ V GE Y +
Sbjct: 152 NIRMRFISTIESCYYLR---------PVDDETALFRCYPGMWEVW----VDKDGE-YKRI 197
Query: 274 STFTEKPSSDDINDAFLGK 292
+ KPS D++ D L K
Sbjct: 198 AELANKPSGDEL-DLILAK 215
>gi|428306245|ref|YP_007143070.1| hypothetical protein Cri9333_2705 [Crinalium epipsammum PCC 9333]
gi|428247780|gb|AFZ13560.1| protein of unknown function DUF1995-containing protein [Crinalium
epipsammum PCC 9333]
Length = 248
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 113/242 (46%), Gaps = 51/242 (21%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+AT+ A+ DG TRL VE+ PEL M + + G +KV +
Sbjct: 17 EATKTAIADGHTRLMVELVFPELK----------TMPVASQFIPELEEYGSHLKVFFTDA 66
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
G ALA R+ DWG+ K ++GS + N+++E+D +F+L+ P
Sbjct: 67 -GAAALA---------RR-----DWGEVPFK--ISDLGSSRSSVQNKIEEEDQIFLLIEP 109
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
+AV I ++ + +AAG RPVIL+ PRL+D GI YAA
Sbjct: 110 -SAVE---IAQVEQLCNAAGDRPVILLVPRLEDA-AVVGIG----------YAARQLRDR 154
Query: 230 FFRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
F + LY + P+ G AL ++P+ ++++ + + Y +++ T+KP + ++
Sbjct: 155 FIKTLYSSYYIRPLEGAALFRSHPSPWQVWLETN-----DDYNLIAEETQKPVGETLDQI 209
Query: 289 FL 290
L
Sbjct: 210 IL 211
>gi|427705929|ref|YP_007048306.1| hypothetical protein Nos7107_0483 [Nostoc sp. PCC 7107]
gi|427358434|gb|AFY41156.1| protein of unknown function DUF1995-containing protein [Nostoc sp.
PCC 7107]
Length = 244
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 113/240 (47%), Gaps = 53/240 (22%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+AT+ A+ DG TR++V++ PEL M + FA+ R+KV
Sbjct: 17 EATKAALADGYTRVQVDLLFPELKQ----------MPVAEQFLPLFAEYESRLKVFF-AD 65
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA----NEVDEQDDMFILVA 168
G ALA R+ DW D A ++IG+ A +++ +D++F+ V+
Sbjct: 66 AGGAALA---------RR-----DWVD--AAFQILDIGTGRAASIQSKIKPEDEIFLFVS 109
Query: 169 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 228
P +AV I ++ + + G RP++++NPRL+D G+ GI R R + + E+C
Sbjct: 110 P-SAVE---IPQLEKVCEIIGDRPLVMLNPRLED-SGTVGIGYA-ARQIRERFLNTIESC 163
Query: 229 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
Y+ R P+ A+ YP +E+ + GE + ++ +KPS DDI+
Sbjct: 164 YYLR---------PVDENTAVFRCYPGQWEVLVQ-----KGETWEKIAELPKKPSGDDID 209
>gi|443320726|ref|ZP_21049809.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
gi|442789566|gb|ELR99216.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
Length = 241
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 109/236 (46%), Gaps = 59/236 (25%)
Query: 54 ATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 111
A A+NDG RL+VE+ IPE L + TL F + G +KV +
Sbjct: 17 AVTNALNDGYRRLQVELVIPEIALQAQALALEFTTL----------FTEYGSGLKVFLPD 66
Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 165
+ G ALA R+ DWG+ T + +G N V++ D +F+
Sbjct: 67 T-GAAALA---------RR-----DWGE-----TPFAVTDLGTNRSPIETRVNDTDQIFL 106
Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
+V+P +AV + ++ + + AG RPV+L+ P+L+D+ S + R R + ++
Sbjct: 107 VVSP-SAVE---VTQVEKLCNLAGDRPVVLLIPQLEDV--SIVGIGLAARQLRERFLSTL 160
Query: 226 ENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKP 280
E+CY+F+ P+ GA + YP+ +++++ G Y +L EKP
Sbjct: 161 ESCYYFK---------PLEGAAVLKNYPSPWQVWRE-----SGNDYQLLCEQGEKP 202
>gi|428208370|ref|YP_007092723.1| hypothetical protein Chro_3395 [Chroococcidiopsis thermalis PCC
7203]
gi|428010291|gb|AFY88854.1| protein of unknown function DUF1995 [Chroococcidiopsis thermalis
PCC 7203]
Length = 272
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 111/234 (47%), Gaps = 51/234 (21%)
Query: 61 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 120
DG TRL+VE+ IPEL P M + F G ++KV + G ALA
Sbjct: 53 DGYTRLQVELAIPELKP----------MSVAEQFLAEFTQFGSQLKVFFPDA-GAAALA- 100
Query: 121 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNCII 178
R+ DWG+ K V+IG+ E ++ +D+ + ++P +AV +
Sbjct: 101 --------RR-----DWGEVPFK--IVDIGTRSPLEEKIEPEDEALLFISP-SAVE---V 141
Query: 179 DDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAG 238
+ ++ + + A T P +++NPRL+D+ GI GR R + + E+CY+ R
Sbjct: 142 ERVEKLCELA-TCPTVMLNPRLEDV-AIVGIGYA-GRQLRTRFLNNIESCYYLR------ 192
Query: 239 TQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 291
PI ++ +YP +++++ ++ E++ +++ KP D+I+ G
Sbjct: 193 ---PIENISVFRSYPGEWQIWREIE-----EEFQLITEQPTKPMGDEIDAILYG 238
>gi|390438390|ref|ZP_10226864.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389838196|emb|CCI30988.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 247
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 104/235 (44%), Gaps = 45/235 (19%)
Query: 45 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
N + +LQA TQ A+ G ++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARCVQVELVIPEIALQ------------AQALALDFTS- 51
Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGE 268
R + ++ E+CY+FR L A YP + + + +GYEL GE
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGE 205
>gi|307150883|ref|YP_003886267.1| hypothetical protein Cyan7822_0982 [Cyanothece sp. PCC 7822]
gi|306981111|gb|ADN12992.1| Domain of unknown function DUF1995 [Cyanothece sp. PCC 7822]
Length = 250
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 98/214 (45%), Gaps = 44/214 (20%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+ATQ A+ G RL+VE+ IPE+ E G +E + F G +KV +
Sbjct: 17 EATQIALEAGLGRLQVELVIPEIALEAQ----GLALEFTSL----FEQYGSGLKVLFPDT 68
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 165
G ALA R+ DWG D G++ VE+ +V +D+ F+
Sbjct: 69 -GAAALA---------RR-----DWGETPFKISDLGSRFIPVEM------KVTPEDEAFL 107
Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
+V P + I + + + A RPV+L+ P+L+D+ GI T R R + ++
Sbjct: 108 VVCP----ASIEIQSVAKLCEIAEQRPVLLLIPQLEDV-SIVGIGYT-ARQLRDNFLSTL 161
Query: 226 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYEL 257
E+CY+FR L A YP + + + GYEL
Sbjct: 162 ESCYYFRPLEGAAVVRSYPGLWQVWLEKEQGYEL 195
>gi|414076889|ref|YP_006996207.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
gi|413970305|gb|AFW94394.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
Length = 244
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 107/253 (42%), Gaps = 49/253 (19%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+A + A+ DG TR++VE+ PEL M + F + R+KV +
Sbjct: 17 EAVKSALADGMTRIQVELLFPELK----------FMTVAEQFLPLFTEYESRLKVFFADA 66
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQ 170
W D + + + G + E + +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WTDAQFQISDIGTGRAASLEAKIQPEDEIFLFIAPT 111
Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
+ + ++ + + G RPVI++ PRL+D GI T R+ R + ++ E+CY+
Sbjct: 112 SVE----VPQLEKLCELIGERPVIMLTPRLED-SSVVGIGYT-ARETRRRFISTIESCYY 165
Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
R P+ AL YP +E++ + + GE Y ++ +KPS D+++
Sbjct: 166 IR---------PVDDESALFRCYPGLWEVW----LETAGE-YQKVAELPKKPSGDELDAI 211
Query: 289 FLGKPRNREKKAS 301
+G R A+
Sbjct: 212 LMGGQRENTTDAT 224
>gi|86604746|ref|YP_473509.1| hypothetical protein CYA_0013 [Synechococcus sp. JA-3-3Ab]
gi|86553288|gb|ABC98246.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 246
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/283 (22%), Positives = 117/283 (41%), Gaps = 59/283 (20%)
Query: 33 RLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVR 92
+L PH + +QAT+ A+ G RL +EI P++ PE+ +ELV
Sbjct: 6 QLTQFPHLPGDLAQACEQAIQATRLALQAGYRRLLIEILAPDIKPEVLAR---PFLELVP 62
Query: 93 VIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSI 152
AL D G ALA ++G+ +++ S+
Sbjct: 63 PPALVLFSDA-----------GGAALAQR-----------------EWGSLPDGIQLQSL 94
Query: 153 GA--NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-------TRPVILINPRLKDL 203
A EQ +F++ A + + + + D +G +PV+L+NP+L+D
Sbjct: 95 TARTQPTPEQSVLFVMPAVYSLDQVERVCQLVSGGDPSGLAAKGRDPKPVVLLNPQLQD- 153
Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
++ + GR R + ++FE Y+ R L + GAL YP+ + ++++ +
Sbjct: 154 -AAAVGVGLAGRRLRQRFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEE- 203
Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGF 306
Y +L TF +PS +++ + F K+ +W +
Sbjct: 204 ---PGLYSVLKTFRARPSGEEVAELF-----RASKRGDPWWSW 238
>gi|113475888|ref|YP_721949.1| hypothetical protein Tery_2247 [Trichodesmium erythraeum IMS101]
gi|110166936|gb|ABG51476.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 253
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 118/247 (47%), Gaps = 54/247 (21%)
Query: 52 LQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 111
++AT+ A+ DG TR+++EI +P++ + L + + + ++KV
Sbjct: 16 MEATKAALQDGYTRVQIEIVVPDIELQAQ--------SLAKQFIPALLETSTKLKVFFPD 67
Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 165
S G ALA R+ DW D TF +I +G + +V+ +D F+
Sbjct: 68 S-GAAALA---------RR-----DWQD----ATF-KIEDLGTSRSPVDKKVEPEDQCFL 107
Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
L+AP +A+ + + +++ AG RPVI++ P+L+D+ GI R R + +
Sbjct: 108 LIAP-SAIE---VAQTEKLSNLAGDRPVIMLIPKLEDV-SIVGIGYA-ARQLRERFIKTI 161
Query: 226 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 285
E+CY+ R L A AL YP+ ++++ + G+ Y +++ EKP D++
Sbjct: 162 ESCYYIRSLGGA--------ALYRCYPSPWQVW----LEENGQ-YKLIAEQPEKPVGDEV 208
Query: 286 NDAFLGK 292
D L K
Sbjct: 209 -DMILAK 214
>gi|119484707|ref|ZP_01619189.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
gi|119457525|gb|EAW38649.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
Length = 249
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 111/247 (44%), Gaps = 56/247 (22%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 109
QATQ A++DG ++VE+ PE+ + I + E V+ + F D
Sbjct: 17 QATQAALDDGYKLVQVELVFPEIELQAQAIAQQFIPAIEESGTVLKVFFPD--------- 67
Query: 110 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFIL 166
G ALA R+ DWG+ K ++GS + + V + D F++
Sbjct: 68 ---AGAAALA---------RR-----DWGEIPFK--ISDLGSSRSPIDSRVKDDDGRFLV 108
Query: 167 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 226
V+P ++ ++ ++ AG R IL+NPRL+D+ GI R R + ++ E
Sbjct: 109 VSPTPVE----VEQVEKLSQLAGDRVTILLNPRLEDI-AIIGIGYA-ARALRDRFISTIE 162
Query: 227 NCYFFRLLYYAGTQYPIMG--ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 284
+CY+ R P+ G AL YP+ +E+++ +D GE Y +L+ KP D
Sbjct: 163 SCYYLR---------PLEGDAALYRCYPSLWEVWQEID----GE-YTLLAQEQTKPVGDQ 208
Query: 285 INDAFLG 291
++ G
Sbjct: 209 LDQILAG 215
>gi|443327973|ref|ZP_21056578.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
gi|442792382|gb|ELS01864.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
Length = 248
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 68/266 (25%), Positives = 112/266 (42%), Gaps = 51/266 (19%)
Query: 45 NYFYDDVLQATQRA-------VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALS 97
N F + +A ++A + DG TRL+VE+ IPE+ + + +AL
Sbjct: 2 NEFPQSIAEAVEQAKKTVIDAIADGYTRLQVELVIPEIPLQ------------AQSLALE 49
Query: 98 FADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIG 150
FA+ S G G P A +L DWG D G + T +E
Sbjct: 50 FAN-------LFLESHGSGVKVMFPDTGAA---MLAKRDWGEIAFQVTDMGGRFTSIE-- 97
Query: 151 SIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIM 210
++ E D++FI+V P + ++ ++ + + G RPVIL+ P+L+D+ GI
Sbjct: 98 ----QQIAETDEIFIMVCPSSVE----VERVETLCNLVGDRPVILLIPQLEDV-AVVGI- 147
Query: 211 QTMGRDKRLEYAASFENCYFFRLLYYAGTQ--YPIMGALRMTYPNGYELYKRVDV-PSGG 267
R R + ++ +CY+ R L A +P + + GYEL + P G
Sbjct: 148 GLAARRLRDRFISTLYSCYYIRPLEGASILRIHPSSWQIWLEKEEGYELAMELSSKPMGE 207
Query: 268 EKYVILSTFTEKPSSDDINDAFLGKP 293
E +L T N++ + KP
Sbjct: 208 ELDRLLIQLTTPEGEAGKNNSTIKKP 233
>gi|332705285|ref|ZP_08425366.1| protein of unknown function, DUF1995 [Moorea producens 3L]
gi|332356028|gb|EGJ35487.1| protein of unknown function, DUF1995 [Moorea producens 3L]
Length = 251
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 109/233 (46%), Gaps = 43/233 (18%)
Query: 59 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
++DG+TR++VE+ PE+ + E + + F G +K+ + G AL
Sbjct: 23 LDDGQTRVQVELVFPEIALQAQ----SIAQEFIPI----FEQYGPGLKIFFPDT-GASAL 73
Query: 119 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQNAVGNCI 177
A DWG+ K T V S +++ D +F+LV P +AV
Sbjct: 74 A--------------LRDWGEVPFKITDVGTSRSPAETRIEDDDQVFLLVGP-SAVE--- 115
Query: 178 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 237
+ ++ + + AG RP +++NP+L+D+ S + R R + + E+CY+ R +
Sbjct: 116 VAQVEKICNLAGDRPCVILNPQLEDV--SIVGIGYAARQLRDRFLKTLESCYYLR--PFP 171
Query: 238 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
G GAL YP+ ++++ +D ++Y +++ KP+++ ++ L
Sbjct: 172 G------GALWRCYPSMWQVWLEID-----DEYQLVTEEPSKPTAEALDQIIL 213
>gi|298489954|ref|YP_003720131.1| hypothetical protein Aazo_0482 ['Nostoc azollae' 0708]
gi|298231872|gb|ADI63008.1| Domain of unknown function DUF1995 ['Nostoc azollae' 0708]
Length = 244
Score = 58.2 bits (139), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 102/246 (41%), Gaps = 50/246 (20%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+A + A+ DG TR++V+ PEL M + FA+ RVKV +
Sbjct: 17 EAVKSALADGVTRIQVDFLFPELK----------FMPVAEQFVPLFAEYESRVKVFFADA 66
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 170
W + K + G + ++ +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WQNVPFKVEDIGTGRAASLQTKIQPEDEIFLFIAPT 111
Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
+ ++ M + TRP++L+NPRL+D G GI R+ R + ++ E+CY+
Sbjct: 112 PVE----VPQLEKMCEIIDTRPIVLLNPRLED-SGVVGIGYA-ARETRRRFISTIESCYY 165
Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
R P+ AL YP +E++ +Y ++ ++PS D+I+
Sbjct: 166 LR---------PVDDESALFRCYPGQWEVWL-----ESNNEYEKIAELPKRPSGDEIDMI 211
Query: 289 FL-GKP 293
+ G+P
Sbjct: 212 LMKGQP 217
>gi|67922775|ref|ZP_00516276.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|416392960|ref|ZP_11685949.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
gi|67855391|gb|EAM50649.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|357263546|gb|EHJ12537.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
Length = 246
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 104/244 (42%), Gaps = 46/244 (18%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+A + A+ G R+ +EI IPE+ + + +AL F +
Sbjct: 17 EAAKIALESGVGRICLEIVIPEIAIQ------------AQALALEFT------------T 52
Query: 113 MGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMF 164
+ EG +G+ + T L DWG D G++ T +E ++ E D++F
Sbjct: 53 LFEGQQSGLKVIFPDTGAAALARRDWGETPFRVTDLGSRATGIE------RKISEADEIF 106
Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
+LV P + ++ ++ + + AG RPVIL+ P+L+D+ S + R R + ++
Sbjct: 107 LLVCPSSVE----VETVEKLCELAGDRPVILLIPQLEDV--SIVGIGYAARQLRDRFIST 160
Query: 225 FENCYFFRLL--YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 282
E+ Y+FR L YP + + +GYEL GE IL K
Sbjct: 161 LESVYYFRPLDDVVVLRSYPSPWLVFLEKEDGYELIAEQGQKPMGEALDILVNNALKGEE 220
Query: 283 DDIN 286
DD N
Sbjct: 221 DDSN 224
>gi|218247522|ref|YP_002372893.1| hypothetical protein PCC8801_2736 [Cyanothece sp. PCC 8801]
gi|257061142|ref|YP_003139030.1| hypothetical protein Cyan8802_3366 [Cyanothece sp. PCC 8802]
gi|218168000|gb|ACK66737.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256591308|gb|ACV02195.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 245
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 63/232 (27%), Positives = 102/232 (43%), Gaps = 45/232 (19%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+AT+ A+ R++VE+ +PE+ + G +E + DD +KV +
Sbjct: 17 EATKLALEANLGRIQVELVVPEIALQAQ----GLALEFTTLF-----DDLSSLKVIFPDT 67
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 165
G ALA R+ DWG D G++ T VE+ +V EQD F+
Sbjct: 68 -GAAALA---------RR-----DWGETPFKVTDLGSRATSVEM------KVSEQDRAFL 106
Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
LV P + ++ ++ + + AG RPV+L+ P+L+D+ GI R R + ++
Sbjct: 107 LVCP----SSVEVESVEKLCNLAGDRPVVLLIPQLEDV-SVVGIGYA-ARQLRERFISTL 160
Query: 226 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 275
+ Y+FR L A YP + + GYEL GE I+ T
Sbjct: 161 NSAYYFRPLEGAAILRSYPSPWNVYLETETGYELIASEPQKPLGEALEIILT 212
>gi|254410487|ref|ZP_05024266.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196182693|gb|EDX77678.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 245
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 112/242 (46%), Gaps = 37/242 (15%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+ATQ A+NDG TRL+VE+ PE+ + I + F G +KV +
Sbjct: 17 EATQAALNDGYTRLQVELVFPEIA--LQAQSIAWQFTPI------FEQYGSGLKVLFPDA 68
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
G ALA R+ DWG+ K T +IG+ + + + +D+ F++V+P
Sbjct: 69 -GAAALA---------RR-----DWGETPFKVT--DIGTSRSPVESRIQPEDEAFLIVSP 111
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
+AV + ++ + + AG RP +L+ P+L+DL GI R R + ++ +CY
Sbjct: 112 -SAVE---VSQVEKLCNLAGDRPCVLLTPQLEDL-KVVGIGYA-ARQLRERFLSTLTSCY 165
Query: 230 FFRLLYYAGTQ--YPIMGALRMTYPNGYELY-KRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+ + L A YP + + + N Y+L + P G +I++ T S+D
Sbjct: 166 YVQPLDGAALLRVYPGLWQIWLEKENAYQLIAEEPQKPVGDALELIIAKVTGSGDSEDTP 225
Query: 287 DA 288
A
Sbjct: 226 AA 227
>gi|434386365|ref|YP_007096976.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
gi|428017355|gb|AFY93449.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
Length = 240
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 102/233 (43%), Gaps = 55/233 (23%)
Query: 61 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 120
DG R+ VEI IPEL M L R SF+D G +KV + G ALA
Sbjct: 25 DGHKRITVEIVIPELKS----------MGLARQFLESFSDRGNTLKVLFTDT-GAAALAK 73
Query: 121 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG------ANEVDEQDDMFILVAPQNAVG 174
DW +G +I +G +++ +D+++I + P +AV
Sbjct: 74 R--------------DW-----QGVNFKIEDLGNSRSPITDKIQAEDELYIAIDP-SAVE 113
Query: 175 NCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLL 234
++ ++ + +AAG RPVIL P+L+D GI R R + + Y+ + L
Sbjct: 114 ---VEQVEKLCNAAGDRPVILFLPKLED-AAIVGIGYA-ARQLRDRFLTTLTCAYYIKPL 168
Query: 235 YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
AL YP +++++ D + Y++L+ +KP D++++
Sbjct: 169 E--------ASALYRCYPAQWQVWQEQD-----DDYILLAECPQKPVGDELDE 208
>gi|359457624|ref|ZP_09246187.1| hypothetical protein ACCM5_02793 [Acaryochloris sp. CCMEE 5410]
Length = 234
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/270 (23%), Positives = 115/270 (42%), Gaps = 64/270 (23%)
Query: 52 LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 107
++AT+ A+ D +T L+VEI IPEL P + + ++ L E V+V F+D G
Sbjct: 15 VEATKAAIADHKTLLQVEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67
Query: 108 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 167
++ + +L G +++ + V ++D F+ +
Sbjct: 68 ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102
Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
AP + ++ + AG R IL+NP+L+D+ GI GR R + + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV-SIVGIGMA-GRTLRERFLNNIES 156
Query: 228 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 285
CY F L GA+ +YP+ ++++ + GE ++ IL+T +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEENRPSGERI 203
Query: 286 NDAFLGKPRNREKKASG-------FWGFLS 308
+ E + G FW L+
Sbjct: 204 SQLLASLTATEEIRKPGLLDNMQQFWKALT 233
>gi|158334629|ref|YP_001515801.1| hypothetical protein AM1_1459 [Acaryochloris marina MBIC11017]
gi|158304870|gb|ABW26487.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 234
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 114/270 (42%), Gaps = 64/270 (23%)
Query: 52 LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 107
++AT+ A+ D +T L++EI IPEL P + + ++ L E V+V F+D G
Sbjct: 15 VEATKSAIADHKTLLQIEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67
Query: 108 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 167
++ + +L G +++ + V ++D F+ +
Sbjct: 68 ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102
Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
AP + ++ + AG R IL+NP+L+D+ S + GR R + + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV--SIVGIGMAGRTLRERFLNNIES 156
Query: 228 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 285
CY F L GA+ +YP+ ++++ + GE ++ IL+T +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEESRPSGERI 203
Query: 286 NDAFLGKPRNREKKASG-------FWGFLS 308
+ + G FW L+
Sbjct: 204 GQLLASSTTTEDNRKPGLLDNMQQFWKALT 233
>gi|428183504|gb|EKX52362.1| hypothetical protein GUITHDRAFT_157134 [Guillardia theta CCMP2712]
Length = 325
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 71/156 (45%), Gaps = 13/156 (8%)
Query: 141 GAKGTFVE---IGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILIN 197
G KG+ +E I +EV E DDMFI++ + D++ + AAG RPVIL N
Sbjct: 123 GFKGSLIERVWINKDFVSEVQEDDDMFIIIGASAQE----LPDVRELCKAAGDRPVILFN 178
Query: 198 PRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY----AGTQYPI--MGALRMTY 251
+L+ L G G+ + ++ YF Y AG + I GAL TY
Sbjct: 179 LKLQVLRGDFGLPFFPSKSLHNDWLCEALPAYFMLPKSYTKTIAGPPFLINYSGALFRTY 238
Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
P +++ V GG +Y + ++P+ D+ +
Sbjct: 239 PGKWQMLLEVPDEDGGGRYQRVRMLDKRPALSDVRE 274
>gi|86607776|ref|YP_476538.1| hypothetical protein CYB_0277 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86556318|gb|ABD01275.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 233
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 108/250 (43%), Gaps = 54/250 (21%)
Query: 49 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 108
+ + AT+ A+ G RL +EI P+L PE+ +ELV+ AL D
Sbjct: 10 EQAIGATRLAMQAGYRRLLIEILAPDLKPEVLAR---PFLELVQPPALVLFSDA------ 60
Query: 109 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVA 168
G ALA ++G+ +E+ S+ + + +LV
Sbjct: 61 -----GGAALAQR-----------------EWGSLPEGIELQSLTSRTQPTPEQSILLVM 98
Query: 169 PQNAVGNCIIDDMKAMT--DAAG-------TRPVILINPRLKDLPGSSGIMQTMGRDKRL 219
P + + +A++ D +G +PV+L+NP+L+D ++ + GR R
Sbjct: 99 PAVYSLDQVERVCQAVSGGDPSGLASKGRDPKPVVLLNPQLQD--AATVGVGLAGRRLRQ 156
Query: 220 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 279
+ ++FE Y+ R L + GAL YP+ + ++++ + Y +L TF +
Sbjct: 157 RFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEE----PGLYSVLKTFIAQ 204
Query: 280 PSSDDINDAF 289
P+ +++ + F
Sbjct: 205 PTGEEVAELF 214
>gi|428203362|ref|YP_007081951.1| hypothetical protein Ple7327_3166 [Pleurocapsa sp. PCC 7327]
gi|427980794|gb|AFY78394.1| protein of unknown function (DUF1995) [Pleurocapsa sp. PCC 7327]
Length = 249
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 114/243 (46%), Gaps = 59/243 (24%)
Query: 53 QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 110
+AT+ A+ G R++VE+ +PE L + +L+E D G ++V
Sbjct: 17 EATKVALEAGCERIQVELVVPEIALQAQSLALEFTSLLE----------DYGSGLRVLFP 66
Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMF 164
+ G ALA R+ DWG+ T ++ +G+ ++ E+D F
Sbjct: 67 DT-GAAALA---------RR-----DWGE-----TVFQVSDLGSRFIPVDMKISEEDTAF 106
Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
++V+P N + + + + AG RPV+L+ P+L+D+ S + R R + ++
Sbjct: 107 LVVSPSAIEVNSV----EKLCNLAGDRPVVLLIPQLEDV--SVVGIGLAARQLRERFLST 160
Query: 225 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
E+ Y+FR P+ GA+ + +YP+ ++++ G E Y +++ +KP+ +
Sbjct: 161 LESSYYFR---------PLEGAVVLRSYPSLWQVWL-----GGEEDYQLIAEEPQKPTGE 206
Query: 284 DIN 286
++
Sbjct: 207 RLD 209
>gi|443314784|ref|ZP_21044317.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
gi|442785626|gb|ELR95433.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
Length = 246
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 111/240 (46%), Gaps = 52/240 (21%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
QAT+ A+ G +R+ V+I IPEL +I + E + F + G + KV +
Sbjct: 19 QATRAAIEAGVSRIVVDIAIPEL-------KIQPVAEQFYPL---FEEMGLQFKVYFPDA 68
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 170
G ALA DWG D+ +G G + A+ D+ F+++ P
Sbjct: 69 -GAAALAK--------------RDWGNPDFSIRGINEMKGRLEAD-----DEAFLIIEP- 107
Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
+ ++D+++ + A R V+++NP+L+D+ + GI T GR R + ++ E Y+
Sbjct: 108 ---SSVEVNDVESFCNEATGRFVVMLNPKLEDI-ATIGIGYT-GRQLRERFLSTLETIYY 162
Query: 231 FRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
+ P+ GA + YP ++++ + + Y +L+ F +KPS + + F
Sbjct: 163 LQ---------PLEGATILRAYPGLWQVWGE----TTDDGYELLADFPQKPSGEALEKLF 209
>gi|304570679|ref|YP_001806112.2| hypothetical protein cce_4698 [Cyanothece sp. ATCC 51142]
gi|354552131|ref|ZP_08971439.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
gi|353555453|gb|EHC24841.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
Length = 246
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 107/235 (45%), Gaps = 56/235 (23%)
Query: 106 KVCVQGSMGEGALAGMPLQL----------AGTRKI--------LEFMDWG-------DY 140
++C++ + E AL PL L +G + I L DWG D
Sbjct: 29 RICLEIVIPEIALQAQPLALEFTTLFKGQESGLKVIFPDTGAAALARRDWGETPFQVTDL 88
Query: 141 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
G++ T ++ ++ E D+ F+LV+P + ++ ++ + + AG RPVIL+ P+L
Sbjct: 89 GSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLAGHRPVILLIPQL 138
Query: 201 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYK 259
+D+ S + R R + ++ E+ Y+FR P+ G + + +YP+ + +Y
Sbjct: 139 EDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVVLRSYPSPWLVYL 187
Query: 260 RVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGKPRNREKKASGFWGFLSSV 310
+ E Y +++ +KP + +N+A G+ N ++ G +SV
Sbjct: 188 EKE-----EGYELIAEQGQKPMGEALDILVNNALKGEEENSDQPQPKKSGIFASV 237
>gi|17230608|ref|NP_487156.1| hypothetical protein all3116 [Nostoc sp. PCC 7120]
gi|17132211|dbj|BAB74815.1| all3116 [Nostoc sp. PCC 7120]
Length = 245
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 50/248 (20%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+AT+ A+ DG R++VE+ PEL M + + R+KV +
Sbjct: 17 EATKSALADGCNRIQVELLFPELK----------FMPVAEQFLPCLVEYESRLKVFFADA 66
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 170
W D + + G I + +++ +D++F+ ++P
Sbjct: 67 GAAALARRD---------------WKDAAFQILDIGTGRIASIQSKIQPEDEIFLFISPT 111
Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
+ + ++ + + G RP I +NPRL+D G+ GI T R R + ++CY+
Sbjct: 112 SVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQTRERFLNIIQSCYY 165
Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
R PI AL +YP +E++ D + ++ +KPS D+I+
Sbjct: 166 LR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIANLPKKPSGDEIDLI 211
Query: 289 FL-GKPRN 295
+ G+P+
Sbjct: 212 LMKGQPQT 219
>gi|428772277|ref|YP_007164065.1| hypothetical protein Cyast_0436 [Cyanobacterium stanieri PCC 7202]
gi|428686556|gb|AFZ46416.1| protein of unknown function DUF1995 [Cyanobacterium stanieri PCC
7202]
Length = 244
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 118/245 (48%), Gaps = 52/245 (21%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
AT A+ G RL++E+ IPE+ + EL A F + G +KV +
Sbjct: 18 ATLTALESGFNRLQIELIIPEIALKAQ--------ELAFAFAQLFGEYGAGLKVFFPDT- 68
Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV--EIGSIGA---NEVDEQDDMFILVA 168
G ALA R+ +WG+ TFV ++GS + N V + D+ +++V+
Sbjct: 69 GAAALA---------RR-----EWGE----TTFVVTDLGSSRSPIENRVSDTDEFYLVVS 110
Query: 169 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 228
P +AV + ++ + + AG RPVILI P+L+D+ S + R R + ++ E+
Sbjct: 111 P-SAVE---VAQVEKLCNLAGDRPVILIIPQLEDI--SVVGIGYAARQLRERFLSTLESS 164
Query: 229 YFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
Y++R P+ A+ M +P +++++ ++ K+ +++ +KP + ++
Sbjct: 165 YYYR---------PLEDAVVMRVFPGLWQVWRELE----DNKFELITEVPKKPLGEVLDR 211
Query: 288 AFLGK 292
+G+
Sbjct: 212 ILMGQ 216
>gi|218439257|ref|YP_002377586.1| hypothetical protein PCC7424_2296 [Cyanothece sp. PCC 7424]
gi|218171985|gb|ACK70718.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 253
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 101/228 (44%), Gaps = 47/228 (20%)
Query: 42 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
E+ N ++QA T+ A+ +G R++VE+ IPE+ + IAL F
Sbjct: 3 EVPNSLEATIIQAKEATRLALEEGLGRIQVELVIPEIALS------------AQTIALEF 50
Query: 99 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGS 151
+ S GEG P A L DWG D G++ T VE+
Sbjct: 51 --------TSLLDSYGEGLKVLFPDSGAAA---LARRDWGETPFKISDLGSRFTPVEM-- 97
Query: 152 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 211
++ +D+ F++V P + + + + D AG RPV+++ P+L+D+ GI
Sbjct: 98 ----KITPEDEAFLVVCP----SSIEVQSVAKLCDLAGDRPVVILIPQLEDV-SIVGIGY 148
Query: 212 TMGRDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYEL 257
T R R + ++ ++ Y+ R L A YP + + + +GYEL
Sbjct: 149 T-ARQLRDNFLSTLQSSYYIRPLDGAVVLRAYPELWQVWLEKEDGYEL 195
>gi|126656438|ref|ZP_01727699.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
gi|126622124|gb|EAZ92831.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
Length = 246
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/271 (22%), Positives = 121/271 (44%), Gaps = 63/271 (23%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
+A + A+ G R+ +EI IPE+ + + +AL F +
Sbjct: 17 EAAKLALEAGVERICLEIVIPEIALQ------------AQPLALEFT------------T 52
Query: 113 MGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMF 164
+ EG +G+ + T L DWG D G++ T +E ++ E D+ F
Sbjct: 53 LFEGQESGLKVIFPDTGAAALARRDWGETPFQVTDLGSRATSIE------KKISEADEAF 106
Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
+LV+P + ++ ++ + + AG R VIL+ P+L+D+ S + R R + ++
Sbjct: 107 LLVSP----SSVEVEIVEKLCNLAGHRSVILLIPQLEDV--SIVGIGYAARQLRERFIST 160
Query: 225 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
E+ Y+FR P+ G + + +YP+ + +Y + E Y +++ +KP +
Sbjct: 161 LESVYYFR---------PLDGVVVLRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGE 206
Query: 284 D----INDAFLGKPRNREKKASGFWGFLSSV 310
+N+A G+ N ++ + G +SV
Sbjct: 207 ALDTLVNNALKGEEDNSDQPQAKKSGIFASV 237
>gi|171701065|gb|ACB54046.1| unknown [Cyanothece sp. ATCC 51142]
Length = 184
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 90/187 (48%), Gaps = 38/187 (20%)
Query: 136 DWG-------DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA 188
DWG D G++ T ++ ++ E D+ F+LV+P + ++ ++ + + A
Sbjct: 15 DWGETPFQVTDLGSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLA 64
Query: 189 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALR 248
G RPVIL+ P+L+D+ S + R R + ++ E+ Y+FR P+ G +
Sbjct: 65 GHRPVILLIPQLEDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVV 113
Query: 249 M-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGKPRNREKKASGF 303
+ +YP+ + +Y + E Y +++ +KP + +N+A G+ N ++
Sbjct: 114 LRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGEALDILVNNALKGEEENSDQPQPKK 168
Query: 304 WGFLSSV 310
G +SV
Sbjct: 169 SGIFASV 175
>gi|75911048|ref|YP_325344.1| hypothetical protein Ava_4852 [Anabaena variabilis ATCC 29413]
gi|75704773|gb|ABA24449.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 245
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/252 (22%), Positives = 102/252 (40%), Gaps = 52/252 (20%)
Query: 42 EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
E+ N D + Q AT+ A+ DG R++VE+ PEL M +
Sbjct: 3 ELPNSLEDAIAQSREATKAALADGCHRIQVELLFPELK----------FMPVAEQFLPCL 52
Query: 99 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 156
+ R+KV + W D + + G + ++
Sbjct: 53 VEYESRLKVFFADAGAAALARRD---------------WKDAPFQILDIGTGRTASIQSK 97
Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 216
+ +D++F+ ++P + + ++ + + G RP I +NPRL+D G+ GI T R
Sbjct: 98 IQPEDEIFLFISPTSVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQ 151
Query: 217 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
R + ++CY+ R PI AL +YP +E++ D + ++
Sbjct: 152 TRERFLNIIQSCYYLR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIA 197
Query: 275 TFTEKPSSDDIN 286
+KPS D+I+
Sbjct: 198 NLPKKPSGDEID 209
>gi|427713531|ref|YP_007062155.1| hypothetical protein Syn6312_2513 [Synechococcus sp. PCC 6312]
gi|427377660|gb|AFY61612.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 6312]
Length = 232
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 65/261 (24%), Positives = 105/261 (40%), Gaps = 58/261 (22%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
ATQ A++ G R+++EI +P+L +M L F D G +KV +
Sbjct: 17 ATQVALDAGINRIQIEILLPDLQ----------VMPLAWSYLELFQDLGVHLKVFFADAG 66
Query: 114 GEGALAGM---PL-QLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAP 169
P+ + G ++LE V + D F+ + P
Sbjct: 67 AAALARRDWQNPIFSVRGVNELLE----------------------PVQDSDQAFVCITP 104
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
+ ++ M +AAG RP IL+NP+L+D+ GI GR R + + E Y
Sbjct: 105 SPVE----VSYIEQMCNAAGNRPFILLNPKLQDV-AIVGIGYA-GRQLRERFLNTLETAY 158
Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
+ R L AL YP +E+++ V+ GE Y L KPS + I+
Sbjct: 159 YIRPL-------DDTTALLRAYPGDWEIWRDVN----GE-YQRLGATPAKPSGEQIDKVL 206
Query: 290 LGKPRNREKKASGFWGFLSSV 310
G+ + + G GF +S+
Sbjct: 207 YGE----QAETPGGGGFFASM 223
>gi|428769639|ref|YP_007161429.1| hypothetical protein Cyan10605_1269 [Cyanobacterium aponinum PCC
10605]
gi|428683918|gb|AFZ53385.1| protein of unknown function DUF1995-containing protein
[Cyanobacterium aponinum PCC 10605]
Length = 251
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/246 (21%), Positives = 115/246 (46%), Gaps = 53/246 (21%)
Query: 54 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
A + A+ G TR+ +++ IPE+ + L + A F++ G +K+ +
Sbjct: 20 ALKVALESGCTRITIDLVIPEIALKAQY--------LAQEFADFFSEYGAGLKLFFPDT- 70
Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFILV 167
G ALA R+ DWG+ T ++ IG++ ++ + D +F+++
Sbjct: 71 GAAALA---------RR-----DWGE-----TEFQVTDIGSSRSPIERKISDTDQVFLVI 111
Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
+P N + + + + AG RPVIL+ P+L+D+ S + R R + + E+
Sbjct: 112 SPSAVEVNLV----EKLCNIAGDRPVILLIPQLEDV--SIVGIGYAARQLRERFISILES 165
Query: 228 CYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
Y+FR P+ A+ + +YP ++++++ + E Y +++ +KP ++++
Sbjct: 166 AYYFR---------PLESAVVLRSYPQLWQVWQQNE---ENEDYELIAEVPQKPLGENLD 213
Query: 287 DAFLGK 292
G+
Sbjct: 214 RILQGQ 219
>gi|427724296|ref|YP_007071573.1| hypothetical protein Lepto7376_2460 [Leptolyngbya sp. PCC 7376]
gi|427356016|gb|AFY38739.1| protein of unknown function DUF1995-containing protein
[Leptolyngbya sp. PCC 7376]
Length = 251
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 116/255 (45%), Gaps = 58/255 (22%)
Query: 34 LNNIPHSREIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMEL 90
++N+P S F D + QA T ++ G R+ VE+ PE
Sbjct: 1 MSNLPKS------FPDTLEQAKSATLTSLEAGCGRVLVELCFPE---------------- 38
Query: 91 VRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEI 149
+ + A S A D +V V GS G+ + T L DWG+ + + +I
Sbjct: 39 IALKAQSLAWDFAQVFVKEYGS-------GLKVLFPDTGAAALAKRDWGE--VEFSVTDI 89
Query: 150 GSIG---ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 206
GS +V++ D +F++V P + + ++ + + AG RPVI++ P+L+D+ S
Sbjct: 90 GSSRNPVQYKVNDADQIFLVVCPSSVE----VAQVEKLCELAGDRPVIMLIPQLEDV--S 143
Query: 207 SGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPS 265
+ R R + ++ E+ Y+ R P GA+ ++P+G+E+Y +
Sbjct: 144 IVGIGYAARQLRERFISTLESAYYIR---------PYDGAMVWRSFPSGWEVYLEKE--- 191
Query: 266 GGEKYVILSTFTEKP 280
GE Y +++T T+KP
Sbjct: 192 EGE-YELIATETQKP 205
>gi|409992140|ref|ZP_11275348.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
gi|409936997|gb|EKN78453.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
Length = 262
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
QA A+ DG ++VE+ PE+ E+ I + + AL D +V
Sbjct: 30 QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFF---PD 80
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
G ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 81 AGAAALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
++ ++ + AG R V+L+NPRL+D+ GI T R R + + E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYT-ARQLRERFLNTIESCY 178
Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+ + L A AL YP +E++ +D GE Y ++ + KP D +
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 222
>gi|356535083|ref|XP_003536078.1| PREDICTED: uncharacterized protein LOC100803954 [Glycine max]
Length = 344
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 58/265 (21%), Positives = 108/265 (40%), Gaps = 42/265 (15%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK--RVKVCVQ 110
+A V+ G R VE+ IP+L D G L EL R+ + + K RVK
Sbjct: 107 EAAATFVSSGGMRAIVELLIPQLQFLDDEGAQGELWELSRIFLDTLIEKTKSQRVKAIFP 166
Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAP 169
AG +L+ Y K S+ + V+ D++ +++ P
Sbjct: 167 D--------------AGAAALLK------YRWKDALFRFASLSDRKPVESDDEIVVMIVP 206
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
+ + +T+ P+I+ NPRL + G+ + + +R + ++F Y
Sbjct: 207 DYQMLEYVERIASNLTNVPEPTPLIMWNPRL--ISEDVGVGFNVRKLRRF-FLSTFTTVY 263
Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
F R + + GA+ YP ++++ D ++Y++ F +P +DDI F
Sbjct: 264 FMRPMPF--------GAIFRCYPGLWKVFS--DDKERPDRYLLAKEFESRPDADDIELLF 313
Query: 290 LGKPRNREK------KASGFWGFLS 308
+ + E+ KA+G + +S
Sbjct: 314 ADEDQKSEQGQGLLDKAAGIFSSIS 338
>gi|428218666|ref|YP_007103131.1| hypothetical protein Pse7367_2442 [Pseudanabaena sp. PCC 7367]
gi|427990448|gb|AFY70703.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena sp. PCC 7367]
Length = 261
Score = 46.6 bits (109), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 58/129 (44%), Gaps = 23/129 (17%)
Query: 136 DWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
DW D + E+ G NE + E D F+L+AP + +D ++ + AG R
Sbjct: 76 DWPDISDQ----EVSMRGVNEGRAAIREDDQAFLLIAP----SSVEVDQVEKLLALAGDR 127
Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
P I+ NPRL++ G+ R R + +F CY+ + L G L Y
Sbjct: 128 PFIMFNPRLENSEVGIGLAT---RKMRERFLNTFTVCYYMQPLD--------AGLLWRCY 176
Query: 252 PNGYELYKR 260
P +++Y++
Sbjct: 177 PGLWQVYRQ 185
>gi|428214953|ref|YP_007088097.1| hypothetical protein Oscil6304_4664 [Oscillatoria acuminata PCC
6304]
gi|428003334|gb|AFY84177.1| protein of unknown function (DUF1995) [Oscillatoria acuminata PCC
6304]
Length = 247
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 103/231 (44%), Gaps = 52/231 (22%)
Query: 59 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
++DG ++VEI IPEL V I E + + F D KV + G AL
Sbjct: 23 LDDGYRLMQVEIGIPELK----VQPIAE--EFLPIFEDLFGD---HFKVYFPDA-GAAAL 72
Query: 119 AGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNC 176
A R+ DWG+ Y +G S ++ +D F+ V P +AV
Sbjct: 73 A---------RR-----DWGERPYIIRGL-----SEPKGQMQPEDQGFLFVEP-SAVE-- 110
Query: 177 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
++ ++ M + AG RP +++ P+L+++ GI GR R + ++ E+CY+ +
Sbjct: 111 -VNTLEKMCEQAGDRPTVILMPKLENV-AIIGI-GLAGRQLRERFLSTIESCYYIQ---- 163
Query: 237 AGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
P G A+ YP+ ++++ G+ Y ++S KP DD+
Sbjct: 164 -----PSQGYAVFRYYPSPWQVWLET-----GDTYQLISETATKPVGDDLE 204
>gi|291567271|dbj|BAI89543.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 249
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
QA A+ DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 17 QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 71 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
++ ++ + AG R V+L+NPRL+D+ GI R R + + E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNTIESCY 165
Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+ + L A AL YP +E++ +D GE Y ++ + KP D +
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 209
>gi|384249997|gb|EIE23477.1| hypothetical protein COCSUDRAFT_65935 [Coccomyxa subellipsoidea
C-169]
Length = 335
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 21/145 (14%)
Query: 148 EIGSIGA-NEVDEQDDMFILVA--PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 204
EI +I + V E DD+ I+ A PQ +++ K + + PVIL NPRL
Sbjct: 174 EIAAISERSAVAEGDDLVIIAAVDPQG------LNEAKRVAAGSSFTPVILFNPRLAS-- 225
Query: 205 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 264
G G+ + R R E+ ++F+ Y R + GT + +P ++++K D
Sbjct: 226 GDVGLGLNV-RRMRNEFLSTFQITYSLRPVNETGTVF-------RRFPGTWKVFKE-DAS 276
Query: 265 SGGEKYVILSTFTEKPSSDDINDAF 289
S G +Y + + F ++P+ DD++ F
Sbjct: 277 SPG-RYDLAAEFRDQPTGDDLDQIF 300
>gi|170076650|ref|YP_001733288.1| hypothetical protein SYNPCC7002_A0014 [Synechococcus sp. PCC 7002]
gi|169884319|gb|ACA98032.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 251
Score = 45.1 bits (105), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 40/160 (25%), Positives = 79/160 (49%), Gaps = 25/160 (15%)
Query: 136 DWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRP 192
DWG+ K +IG+ + N++ + D +FI+V P +AV + ++ + + AG RP
Sbjct: 78 DWGEIPFK--VADIGTSRSPIENKIGDDDQIFIIVCP-SAVE---VGQVEKLCNLAGDRP 131
Query: 193 VILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TY 251
VI++ P+L+D+ S + R R + ++ E Y+ R P A+ +Y
Sbjct: 132 VIMLIPQLEDV--SIVGIGYAARQLRERFISTLETAYYIR---------PYESAMVWRSY 180
Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 291
P+ +E+Y + ++Y ++++ T KP + + L
Sbjct: 181 PSAWEVYLEKE----EDQYELIASETTKPLGEYLERLLLA 216
>gi|428185262|gb|EKX54115.1| hypothetical protein GUITHDRAFT_100361 [Guillardia theta CCMP2712]
Length = 302
Score = 44.3 bits (103), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 94/242 (38%), Gaps = 53/242 (21%)
Query: 12 ERKSRSNQVS--SSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVE 69
E + R N++ ++ K P +NN+ D V ++ A++DG RL+V+
Sbjct: 5 EHQPRRNEIKFDAAARYRKAFPLGVNNV----------LDQVRESLNLALDDGLFRLRVQ 54
Query: 70 INIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEG-ALAGMPLQLAGT 128
+NI + + + + G + ELV +I + G R + M A +P L
Sbjct: 55 LNIHQFDQRLQSFSDGKMPELVAMIGELLVERGMRPSIMFNTIMDASIARTVLPDHLREK 114
Query: 129 RKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA 188
V I +G E E+ D +LVAP N GN + ++A+
Sbjct: 115 ------------------VSINVLGLGEFGEEYDAAVLVAPSNE-GNENLQRIEAVEKLI 155
Query: 189 GT-----------------RPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
+ RP+ILINP L+ + S T + + F+ YF
Sbjct: 156 YSGPPRGLVSRSARAKYMQRPIILINPVLEAIHSLS----TPHSKVPPMFMSDFDVVYFL 211
Query: 232 RL 233
+L
Sbjct: 212 QL 213
>gi|427739188|ref|YP_007058732.1| hypothetical protein Riv7116_5820 [Rivularia sp. PCC 7116]
gi|427374229|gb|AFY58185.1| protein of unknown function (DUF1995) [Rivularia sp. PCC 7116]
Length = 245
Score = 44.3 bits (103), Expect = 0.076, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 98/233 (42%), Gaps = 54/233 (23%)
Query: 61 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALA- 119
DG TRL+VE+ PEL + + +F + G ++K+ G ALA
Sbjct: 25 DGYTRLQVELLFPELKA----------LPAAQQFLPAFDEYGDKLKIFF-ADAGSAALAR 73
Query: 120 ----GMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGN 175
+P QL +D G G I S +V +D++F+ V P +
Sbjct: 74 RDWNEIPFQL---------LDIG----TGRMTSIES----KVQPEDEIFLFVQPSSVE-- 114
Query: 176 CIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGI-MQTMGRDKRLEYAASFENCYFFRL 233
+ ++ + G RP ++ PRL+D SS + + R R + + E+CY+ R
Sbjct: 115 --VPQLEKLCGIIGEQRPFVMFAPRLED---SSIVGIGYAARQTRQRFINTIESCYYLRP 169
Query: 234 LYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
++ A+ YP +E++ V G+ Y ++ KPS D+++
Sbjct: 170 IFEE-------AAVFRCYPGLWEVW----VEKNGD-YEKVAELPNKPSGDELD 210
>gi|428220338|ref|YP_007104508.1| hypothetical protein Syn7502_00205 [Synechococcus sp. PCC 7502]
gi|427993678|gb|AFY72373.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 7502]
Length = 233
Score = 43.9 bits (102), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 52/243 (21%), Positives = 96/243 (39%), Gaps = 53/243 (21%)
Query: 49 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 108
+ + AT A+ G R+++++ PEL M + A +F +
Sbjct: 13 EQAIAATMSAIAAGCKRIQIDMKFPEL----------AQMPIALAFAEAFTKQYGQAWQA 62
Query: 109 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 164
+ G ALA DW + E+ G NE + F
Sbjct: 63 LFADAGTAALAKR--------------DWAN---SDILREVSMRGVNEGRAAARPESQAF 105
Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
+L++P + +D ++ + G RP +++NP+L++ G+ RD+ L
Sbjct: 106 LLISPTSVE----VDGIEKLLTLIGDRPFVMLNPKLENSEVGLGLAARRMRDRFLS---- 157
Query: 225 FENCYFFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
F ++YY PI GA+ YP +++++ D +G E ++ E+P D
Sbjct: 158 -----IFEIVYYI---QPIEEGAVWRCYPQLWQVWQ--DKTTGME---LIYEGEERPVGD 204
Query: 284 DIN 286
D+N
Sbjct: 205 DLN 207
>gi|317969886|ref|ZP_07971276.1| hypothetical protein SCB02_10139 [Synechococcus sp. CB0205]
Length = 232
Score = 43.9 bits (102), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 66/132 (50%), Gaps = 21/132 (15%)
Query: 161 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 220
D + +LVAP A +D++A+ A V+L+N +L+D G T+ R++R
Sbjct: 102 DGLLVLVAPTPA----DYEDVEAVC-AQHRGSVVLLNGKLEDAAVGIG---TVARERRKG 153
Query: 221 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 280
+ A++++ Y L G GALR YP ++LY+R P G Y + F KP
Sbjct: 154 FLATWQSAY---ALIPTGE-----GALRRAYPLDWQLYRR--DPDG---YRFVKDFELKP 200
Query: 281 SSDDINDAFLGK 292
++ + DA G+
Sbjct: 201 DAEQMADALEGE 212
>gi|357441663|ref|XP_003591109.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
gi|355480157|gb|AES61360.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
Length = 327
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 58/261 (22%), Positives = 106/261 (40%), Gaps = 35/261 (13%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
QAT V+ G R VE+ IP+L D L EL R+ + +Q +
Sbjct: 90 QATASFVSSGGMRAIVELLIPQLEFLDDEGAQLELWELSRLF----------LDTLIQET 139
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNA 172
+ A P AG +L++ W D A +F + VD D+M +++ P +
Sbjct: 140 NCQKVKAVFP--DAGAAALLKYR-WKD--ALFSFASLS--DRKLVDSGDEMVVMIVPDHQ 192
Query: 173 VGNCIIDDMKAMTDAAGTRPVILINPRL--KDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
+ + ++D RP+I+ NPRL KD+ + R R + ++F Y
Sbjct: 193 MLEYVEKIASTLSDDP-PRPLIMWNPRLYSKDVGVGYNV-----RQLRQYFLSTFTTVYH 246
Query: 231 FRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
R + + GA+ YP ++++ D +Y++ + +P ++DI F
Sbjct: 247 MRPMPF--------GAVFRCYPEMWKVFS--DDKDRPNRYLLAKEYANRPDTEDIEMLFG 296
Query: 291 GKPRNREKKASGFWGFLSSVF 311
+ ++ + + VF
Sbjct: 297 NNEEQKSEQGQSLFDKAAGVF 317
>gi|225450962|ref|XP_002284712.1| PREDICTED: uncharacterized protein LOC100264831 [Vitis vinifera]
gi|296088326|emb|CBI36771.3| unnamed protein product [Vitis vinifera]
Length = 353
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 112/267 (41%), Gaps = 48/267 (17%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDG--KRVKVCVQ 110
QA+ V+ G R VE+ IP+L D L EL R+ + ++ +RVK
Sbjct: 115 QASASFVSSGGMRAIVELLIPQLQFLDDEGAQAELWELSRIFIDTLIEETGCQRVKAIFP 174
Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 170
AG +L++ WGD A F + V+ +D++ ++V P
Sbjct: 175 D--------------AGAAALLKYR-WGD--AAFGFASLSD--RKPVEAEDELVVMVVPD 215
Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG-----RDKRLEYAASF 225
+ + ++D RP+I+ NPRL I + +G R R + ++F
Sbjct: 216 YQMLAYVERIASNLSDDP-PRPLIMWNPRL--------ISEDVGVGFNVRKLRRSFLSTF 266
Query: 226 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYEL-YKRVDVPSGGEKYVILSTFTEKPSSDD 284
Y R L GA+ YP +++ Y D P+ +Y++ +P +++
Sbjct: 267 TVVYSMRPL--------PAGAVFRCYPGSWKVFYDDKDRPN---RYLLAKEQISRPDAEE 315
Query: 285 INDAFLGKPRNREKKASGFWGFLSSVF 311
+ F G +++ +K F+G +S+F
Sbjct: 316 LEIIF-GDVQDKSEKGPSFFGQAASIF 341
>gi|209522945|ref|ZP_03271502.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376001796|ref|ZP_09779650.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|209496532|gb|EDZ96830.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329707|emb|CCE15403.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 249
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
QA A++DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 17 QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 71 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
++ ++ + AG R V+L+NPRL+D+ GI R R + E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 165
Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+ + L A AL YP +E++ ++ GE Y ++ + KP D +
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 209
>gi|423062349|ref|ZP_17051139.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
gi|406716257|gb|EKD11408.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
Length = 262
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 53 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
QA A++DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 30 QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 83
Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 84 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124
Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
++ ++ + AG R V+L+NPRL+D+ GI R R + E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 178
Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
+ + L A AL YP +E++ ++ GE Y ++ + KP D +
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 222
>gi|299116127|emb|CBN76036.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 284
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 58/268 (21%), Positives = 110/268 (41%), Gaps = 48/268 (17%)
Query: 36 NIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPEL-NPEMDVYRIGTLMELVRVI 94
+P S +++ + +A +A++DG+TR + +++P + E+D + G
Sbjct: 48 ELPMSNDVQ---VEQATRAVWQALSDGKTRQILRLSLPLIGKTEIDDWPGG--------- 95
Query: 95 ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA 154
D +R K C G M E L G P G + + +D D +G +
Sbjct: 96 ------DRQRYKAC--GPMVESVLRGNPAAKEGASIVEQILDESD--------AVGLMQL 139
Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
+ +DD +++ P + + + AG+R V L+NP + G+ Q G
Sbjct: 140 QCKEAKDDASVMIFPTTDT----LSLQQQVAAGAGSRLVALVNPAWRS-SSDFGLFQA-G 193
Query: 215 RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
K E + FE Y + L G ++ YP + L+ + G K +L
Sbjct: 194 EAK--ETLSEFEVTYSLQSLVVYGYTLRLL----YCYPGPWRLFL---IGEGSGKRELLG 244
Query: 275 TFTEKPSSDDINDAFL----GKPRNREK 298
++ + P ++D ++ + GKP E+
Sbjct: 245 SWDQSPPTNDQVESAVFDKKGKPNASER 272
>gi|422294314|gb|EKU21614.1| hypothetical protein NGA_0177102 [Nannochloropsis gaditana CCMP526]
Length = 375
Score = 41.2 bits (95), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 53/116 (45%), Gaps = 10/116 (8%)
Query: 178 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 237
+ D++A+ A T+P+I N L+ G G+ +D + ++ + Y R YA
Sbjct: 221 LPDLEALHQADPTKPLIFFNLNLETHRGDLGLPAFPSKDLHYRFLSNIKPVYLLRTRQYA 280
Query: 238 GT--QYPIM----GALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
T + P + GAL TYP GY+ + +G +Y + T E+P+ D
Sbjct: 281 QTLSRPPFILNYQGALFRTYPGGYQCM----LDTGNGRYRRVETSRERPALSGFKD 332
>gi|297812961|ref|XP_002874364.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
gi|297320201|gb|EFH50623.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
Length = 340
Score = 40.4 bits (93), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 99/249 (39%), Gaps = 43/249 (17%)
Query: 59 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
VN G TR VE+ IP+L D L +L RV + ++ + E
Sbjct: 110 VNSGGTRAIVELLIPQLQFLDDEGAQAELWDLSRVF----------LDTLIKETGCERVK 159
Query: 119 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCII 178
A P AG +L++ W D TF V+++D++ ++V P + +
Sbjct: 160 AVFP--DAGAAALLKYR-WKD----ATFGFASLSDRKPVEKEDEIIVMVVPDYQMLEYVE 212
Query: 179 DDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG-----RDKRLEYAASFENCYFFRL 233
K + D RP+I+ NPRL I + +G R R + +SF Y R
Sbjct: 213 KIAKGLADDP-PRPLIMWNPRL--------ISEEVGVGFNVRKLRRYFLSSFTTVYSMRP 263
Query: 234 LYYAGTQYPIMGALRMTYPNGYEL-YKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 292
L GA+ YP +++ Y D P +Y++ +P ++D+ + G
Sbjct: 264 L--------AAGAVFRCYPGKWKVFYDNKDRPG---RYLLAKELIGRPDAEDLEIIYGGV 312
Query: 293 PRNREKKAS 301
E+ S
Sbjct: 313 EEKSEEGPS 321
>gi|52079804|ref|YP_078595.1| hypothetical protein BL03758 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|319646386|ref|ZP_08000616.1| hypothetical protein HMPREF1012_01652 [Bacillus sp. BT1B_CT2]
gi|404488678|ref|YP_006712784.1| hypothetical protein BLi01406 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|52003015|gb|AAU22957.1| hypothetical protein BL03758 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|52347676|gb|AAU40310.1| putative membrane protein [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|317392136|gb|EFV72933.1| hypothetical protein HMPREF1012_01652 [Bacillus sp. BT1B_CT2]
Length = 249
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 5/50 (10%)
Query: 93 VIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGA 142
++ALSF V VC+ + GAL+ +PL LAGT + ++M+W GA
Sbjct: 175 IMALSFL-----VIVCLSFRLKAGALSILPLLLAGTHFLFQYMNWDSAGA 219
>gi|123966201|ref|YP_001011282.1| hypothetical protein P9515_09681 [Prochlorococcus marinus str. MIT
9515]
gi|123200567|gb|ABM72175.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 215
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 66/140 (47%), Gaps = 32/140 (22%)
Query: 153 GANE---VDEQDDMFILVAPQN---AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 206
G NE +D+ D + I + PQ + + ++ K + + +NP+ +D +
Sbjct: 97 GFNESIYIDQDDSIVISILPQPYDFGLFEPMCENFKGLHYS--------LNPKFED--AN 146
Query: 207 SGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIM-GALRMTYPNGYELYKRVDVPS 265
GI + R++R + +++N YF + PI GAL +YPN + L+K ++
Sbjct: 147 IGIGSVI-RERRKNFVKTWQNIYFLQ---------PINNGALMHSYPNNWLLFKEIN--- 193
Query: 266 GGEKYVILSTFTEKPSSDDI 285
KY+ F EKP ++ I
Sbjct: 194 --NKYLFKKEFEEKPDNETI 211
>gi|324998739|ref|ZP_08119851.1| UDP-N-acetylmuramate--L-alanine ligase [Pseudonocardia sp. P1]
Length = 478
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 55/130 (42%), Gaps = 5/130 (3%)
Query: 68 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM-PLQLA 126
V I E NPE+ R L + R +AL+ +G+R+ V V G+ G+ + M + L
Sbjct: 78 VSTAIRETNPELAAARDQGLQVVHRAVALAALTEGRRL-VAVAGTHGKTSTTSMVTVALQ 136
Query: 127 GTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNA-VGNCIIDDMKA 183
T F GD G+ GS + E DE D F AP A V N D +
Sbjct: 137 HTGADPSFAIGGDLSVSGSGAHHGSGDVFVVEADESDGSFTAFAPSVAVVTNVEPDHLDH 196
Query: 184 MTDAAGTRPV 193
D A R V
Sbjct: 197 HGDEATYRAV 206
>gi|284991684|ref|YP_003410238.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
43160]
gi|284064929|gb|ADB75867.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
43160]
Length = 489
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 48/111 (43%), Gaps = 8/111 (7%)
Query: 68 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM---PLQ 124
V I E NPE+ R L L R +AL+ GKR V V G+ G+ + M +Q
Sbjct: 84 VSTAIREDNPELAAARERGLRVLPRAVALAAVMAGKR-SVAVAGTHGKTSTTSMLTVAVQ 142
Query: 125 LAGTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNAV 173
G F GD G+ G + E DE D F+L+AP A+
Sbjct: 143 ACGVDP--SFAIGGDLNESGSNAHAGGGDVFVAEADESDRSFLLLAPHGAI 191
>gi|255087973|ref|XP_002505909.1| predicted protein [Micromonas sp. RCC299]
gi|226521180|gb|ACO67167.1| predicted protein [Micromonas sp. RCC299]
Length = 299
Score = 39.3 bits (90), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 47/101 (46%), Gaps = 8/101 (7%)
Query: 196 INPRLKDLPGSSGIMQTMG----RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
IN +++ S ++ +G R K E+ ASFE Y+ G ++ +Y
Sbjct: 169 INNQIRTNDDGSNLISDLGIGPWRKKNEEFLASFELTYWISEQRIQGETVRLL----KSY 224
Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 292
P+ ++L+ D+ SG + + TF EKPS ++ + +
Sbjct: 225 PHPWQLFVLTDMASGSAEPECIKTFDEKPSYKELESLLMSR 265
>gi|356576779|ref|XP_003556507.1| PREDICTED: uncharacterized protein LOC100782973 isoform 1 [Glycine
max]
Length = 340
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 112/270 (41%), Gaps = 45/270 (16%)
Query: 51 VLQATQRA---VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK--RV 105
V Q+++ A V+ G R VE+ IP+L D L EL R+ + + K RV
Sbjct: 98 VRQSSEAAAIFVSSGGIRAIVELLIPQLQFLDDEGAQVELWELSRIFLDTLIEKTKFQRV 157
Query: 106 KVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMF 164
K AG +L+ Y K S+ + V+ D++
Sbjct: 158 KAIFPD--------------AGAAALLK------YRWKDALFRFASLSDRKPVESDDEVV 197
Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
+++ P + + +++ P+I+ NPRL + G+ + + +R+ + ++
Sbjct: 198 VMIVPDYQMLEYVERIASNLSNIPEPTPLIMWNPRL--ISEDVGVGFNVRKLRRV-FLST 254
Query: 225 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 284
F YF R + + GA+ YP ++++ D ++Y++ F +P ++D
Sbjct: 255 FTTVYFMRPMPF--------GAIFRCYPGLWKVFS--DDKERPDRYLLAKEFEIRPDAED 304
Query: 285 INDAFLGKPRNREK------KASGFWGFLS 308
I F + + E+ KA+G + +S
Sbjct: 305 IELLFADEEQKSEQGQGLLDKAAGIFSSIS 334
>gi|300788096|ref|YP_003768387.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
U32]
gi|384151526|ref|YP_005534342.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
S699]
gi|399539979|ref|YP_006552641.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
S699]
gi|299797610|gb|ADJ47985.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
U32]
gi|340529680|gb|AEK44885.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
S699]
gi|398320749|gb|AFO79696.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
S699]
Length = 465
Score = 38.1 bits (87), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 4/109 (3%)
Query: 68 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM-PLQLA 126
V I E NPE+ R + L R AL+ +G RV C+ G+ G+ + M + L
Sbjct: 77 VSTAIKETNPELAAARAAGIPVLHRAQALAGLMEGHRVA-CIAGTHGKTSTTSMLTVALQ 135
Query: 127 GTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQNAV 173
R F GD G G G E DE D F+ +P AV
Sbjct: 136 HCRLDPSFAIGGDLNESGANAHHGEGGVFVAEADESDGSFLTYSPSVAV 184
>gi|383830926|ref|ZP_09986015.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
xinjiangensis XJ-54]
gi|383463579|gb|EID55669.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
xinjiangensis XJ-54]
Length = 474
Score = 38.1 bits (87), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 53/126 (42%), Gaps = 9/126 (7%)
Query: 56 QRAVN-----DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 110
QRA N D + + V I E NPE+ V R + L R AL+ +G RV C+
Sbjct: 66 QRAENLDVFDDAPSAVVVSTAIKETNPELVVARERGVPVLHRAEALAALMEGHRVA-CIA 124
Query: 111 GSMGEGALAGM-PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG--ANEVDEQDDMFILV 167
G+ G+ + M + L R F GD G G G E DE D F+
Sbjct: 125 GTHGKTSTTSMLTVALQHCRLDPSFAIGGDLNESGANAHHGEGGIFVAEADESDGSFLSY 184
Query: 168 APQNAV 173
+P AV
Sbjct: 185 SPSVAV 190
>gi|418461004|ref|ZP_13032086.1| UDP-N-acetylmuramate--L-alanine ligase [Saccharomonospora azurea
SZMC 14600]
gi|359738959|gb|EHK87837.1| UDP-N-acetylmuramate--L-alanine ligase [Saccharomonospora azurea
SZMC 14600]
Length = 450
Score = 37.4 bits (85), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 9/130 (6%)
Query: 52 LQATQRAVN-----DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVK 106
++ QRA N G + + V I E NPE+ R + L R AL+ +G RV
Sbjct: 62 IEVGQRAENLDVFDGGPSAVIVSTAIKETNPELAAARERDVPVLHRAEALAALMEGHRVA 121
Query: 107 VCVQGSMGEGALAGM-PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG--ANEVDEQDDM 163
C+ G+ G+ + M + L R F GD G G G E DE D
Sbjct: 122 -CIAGTHGKTSTTSMLTVALQHCRLDPSFAIGGDLNESGANAHHGEGGIFVAEADESDGS 180
Query: 164 FILVAPQNAV 173
F+ P AV
Sbjct: 181 FLSYTPSVAV 190
>gi|72382164|ref|YP_291519.1| hypothetical protein PMN2A_0324 [Prochlorococcus marinus str.
NATL2A]
gi|72002014|gb|AAZ57816.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 206
Score = 37.4 bits (85), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 21/131 (16%)
Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
N D + + ++ +PQ I+ + + A + VI+IN +L+D G ++G
Sbjct: 93 NSTDLSNSILLVCSPQPYD----IEMFEQVCSHAKSN-VIMINGKLEDPIVGIG---SVG 144
Query: 215 RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
R+ R +A ++ YF + L MGAL YPN +EL+K Y +
Sbjct: 145 REMRKRFAKKWKVLYFVQPLS--------MGALLKRYPNDWELFKL-----NNNGYTFVK 191
Query: 275 TFTEKPSSDDI 285
+F +P + I
Sbjct: 192 SFINRPDDETI 202
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.137 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,883,154,002
Number of Sequences: 23463169
Number of extensions: 212357855
Number of successful extensions: 432797
Number of sequences better than 100.0: 143
Number of HSP's better than 100.0 without gapping: 38
Number of HSP's successfully gapped in prelim test: 105
Number of HSP's that attempted gapping in prelim test: 432609
Number of HSP's gapped (non-prelim): 163
length of query: 311
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 169
effective length of database: 9,027,425,369
effective search space: 1525634887361
effective search space used: 1525634887361
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)