BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 021557
         (311 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|359484753|ref|XP_002264815.2| PREDICTED: uncharacterized protein LOC100256872 [Vitis vinifera]
          Length = 597

 Score =  548 bits (1412), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 257/304 (84%), Positives = 278/304 (91%)

Query: 8   KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
           K +   +  SN  SSSQ+NW+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLK
Sbjct: 294 KSIRASERLSNLASSSQENWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLK 353

Query: 68  VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAG 127
           VEINIPELNPEMDVYRIGTLMELVRV+ALSFADDGK VKVCVQGSMGEGALAGMPLQLAG
Sbjct: 354 VEINIPELNPEMDVYRIGTLMELVRVLALSFADDGKHVKVCVQGSMGEGALAGMPLQLAG 413

Query: 128 TRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDA 187
           TRKILEFMDWGDYGAKGTF+ IGSIG+ EV+EQDD+FILVAPQNAVGNCIIDD+KAMTDA
Sbjct: 414 TRKILEFMDWGDYGAKGTFINIGSIGSKEVEEQDDLFILVAPQNAVGNCIIDDLKAMTDA 473

Query: 188 AGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAL 247
           AG RPVIL+NPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGA+
Sbjct: 474 AGNRPVILVNPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAI 533

Query: 248 RMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFL 307
           RM+YP  YELYKRVD   G EKY+ILST + +P+ D+INDAFLGKPR+  KKA+GFWGFL
Sbjct: 534 RMSYPYRYELYKRVDESFGKEKYIILSTSSARPNIDEINDAFLGKPRSEAKKATGFWGFL 593

Query: 308 SSVF 311
           S + 
Sbjct: 594 SGIL 597


>gi|255565097|ref|XP_002523541.1| adenylate kinase, putative [Ricinus communis]
 gi|223537248|gb|EEF38880.1| adenylate kinase, putative [Ricinus communis]
          Length = 597

 Score =  547 bits (1409), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 256/302 (84%), Positives = 277/302 (91%)

Query: 10  VHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVE 69
           + + +S  NQ S SQD W+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGR RLKVE
Sbjct: 296 IPKSESSYNQASVSQDKWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRMRLKVE 355

Query: 70  INIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTR 129
           I IPELNPEMDVYRIGTLMELVRV+ALSFADDGKRVKVCVQGSMG+GALAGMPLQL+GTR
Sbjct: 356 IAIPELNPEMDVYRIGTLMELVRVLALSFADDGKRVKVCVQGSMGQGALAGMPLQLSGTR 415

Query: 130 KILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG 189
           KILEFMDWGDYGA GTF+ IGSIGA EVDE DDMFILVAPQNAVGNCIIDD++AMTDAAG
Sbjct: 416 KILEFMDWGDYGALGTFIRIGSIGAKEVDEDDDMFILVAPQNAVGNCIIDDLRAMTDAAG 475

Query: 190 TRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM 249
           +RPVIL+NPRLKDLPGSSGIMQTMGR +RLEYAASFENCY FRLLYY GTQYPIMGALRM
Sbjct: 476 SRPVILVNPRLKDLPGSSGIMQTMGRAERLEYAASFENCYSFRLLYYGGTQYPIMGALRM 535

Query: 250 TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSS 309
           +YP  YELYKRVD+P+ GEKY ILSTF+EKP+ DDINDAFLG+ RN++KKASGFWGFLS 
Sbjct: 536 SYPYHYELYKRVDLPARGEKYEILSTFSEKPTGDDINDAFLGRSRNQDKKASGFWGFLSG 595

Query: 310 VF 311
           V 
Sbjct: 596 VL 597


>gi|296084582|emb|CBI25603.3| unnamed protein product [Vitis vinifera]
          Length = 652

 Score =  543 bits (1399), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 254/293 (86%), Positives = 273/293 (93%)

Query: 19  QVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 78
           Q S SQ+NW+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLKVEINIPELNPE
Sbjct: 360 QASLSQENWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 419

Query: 79  MDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 138
           MDVYRIGTLMELVRV+ALSFADDGK VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG
Sbjct: 420 MDVYRIGTLMELVRVLALSFADDGKHVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 479

Query: 139 DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINP 198
           DYGAKGTF+ IGSIG+ EV+EQDD+FILVAPQNAVGNCIIDD+KAMTDAAG RPVIL+NP
Sbjct: 480 DYGAKGTFINIGSIGSKEVEEQDDLFILVAPQNAVGNCIIDDLKAMTDAAGNRPVILVNP 539

Query: 199 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELY 258
           RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGA+RM+YP  YELY
Sbjct: 540 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAIRMSYPYRYELY 599

Query: 259 KRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           KRVD   G EKY+ILST + +P+ D+INDAFLGKPR+  KKA+GFWGFLS + 
Sbjct: 600 KRVDESFGKEKYIILSTSSARPNIDEINDAFLGKPRSEAKKATGFWGFLSGIL 652


>gi|449493659|ref|XP_004159398.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101226743 [Cucumis sativus]
          Length = 609

 Score =  523 bits (1346), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 245/300 (81%), Positives = 269/300 (89%)

Query: 12  ERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEIN 71
           E K   NQ S S+D+W+GIPTRLNNIPHSREIR YFYDDVLQAT+RAV  GRTRLKVE N
Sbjct: 310 ENKGTVNQPSESRDSWRGIPTRLNNIPHSREIRKYFYDDVLQATKRAVQAGRTRLKVENN 369

Query: 72  IPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 131
           IPELNPEMDVYRIGTLMELVR IALSFADDG+RVKVC+QGSMGEGAL+GMPLQLAGTR+I
Sbjct: 370 IPELNPEMDVYRIGTLMELVRTIALSFADDGRRVKVCIQGSMGEGALSGMPLQLAGTRRI 429

Query: 132 LEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
           LE+MDWG+YGA GTFV+IGSIGA EVD++DDMFILVAPQNAVGNCIIDDMKAMTDAAG R
Sbjct: 430 LEYMDWGEYGALGTFVKIGSIGAKEVDDEDDMFILVAPQNAVGNCIIDDMKAMTDAAGDR 489

Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
           PVIL+NPRLKDLPGSSGIMQTMGR+KRLEYAASFE CYFFRLLYYAGTQYPIMGALRM+Y
Sbjct: 490 PVILVNPRLKDLPGSSGIMQTMGREKRLEYAASFEICYFFRLLYYAGTQYPIMGALRMSY 549

Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           P GYELY+RVD PSG EKY+ LS + ++PS+DDINDAF G  R   K +SG WGFLS + 
Sbjct: 550 PYGYELYRRVDEPSGKEKYIELSKYPKRPSTDDINDAFQGNKRKEAKSSSGIWGFLSGIL 609


>gi|356532999|ref|XP_003535056.1| PREDICTED: uncharacterized protein LOC100796331 [Glycine max]
          Length = 1016

 Score =  519 bits (1337), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 241/291 (82%), Positives = 265/291 (91%)

Query: 21   SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 80
            S+  DNW+GIPTRLNNIPHSREIR YFYDDVLQATQRA+NDG+TR+KV+INIPELNPEMD
Sbjct: 726  SAEADNWRGIPTRLNNIPHSREIRKYFYDDVLQATQRAINDGKTRVKVDINIPELNPEMD 785

Query: 81   VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 140
            VYRIGTLMELVR +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY
Sbjct: 786  VYRIGTLMELVRTLALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 845

Query: 141  GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
            GAKGTF+ IGSIGA EV+E+DDMFILVAPQNAVGNCIIDD++AMT+AA  RPVILIN RL
Sbjct: 846  GAKGTFINIGSIGAAEVEEEDDMFILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARL 905

Query: 201  KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 260
            KDLPGSSGIMQTMGRD+RL+YAASFE+CYFFRLLYYAGTQYPIMGA+RMTYP  YELYKR
Sbjct: 906  KDLPGSSGIMQTMGRDQRLKYAASFESCYFFRLLYYAGTQYPIMGAIRMTYPYQYELYKR 965

Query: 261  VDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
            VD   G EKY ILSTF E+P++D+INDAF GKP    +K SG WGFL+ + 
Sbjct: 966  VDESPGKEKYAILSTFPERPTTDEINDAFEGKPSKDPRKTSGIWGFLNGIL 1016


>gi|449443265|ref|XP_004139400.1| PREDICTED: uncharacterized protein LOC101203230 [Cucumis sativus]
          Length = 577

 Score =  517 bits (1332), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 241/308 (78%), Positives = 272/308 (88%)

Query: 4   TTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGR 63
           ++ +  + + K+  +  +S  D+W+GIPTRLNNIPHSREIR YFYDDVLQAT+RAV  GR
Sbjct: 270 SSLLSQIQKEKAMKSGNASFSDSWRGIPTRLNNIPHSREIRKYFYDDVLQATKRAVQAGR 329

Query: 64  TRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPL 123
           TRLKVE NIPELNPEMDVYRIGTLMELVR IALSFADDG+RVKVC+QGSMGEGAL+GMPL
Sbjct: 330 TRLKVENNIPELNPEMDVYRIGTLMELVRTIALSFADDGRRVKVCIQGSMGEGALSGMPL 389

Query: 124 QLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKA 183
           QLAGTR+ILE+MDWG+YGA GTFV+IGSIGA EVD++DDMFILVAPQNAVGNCIIDDMKA
Sbjct: 390 QLAGTRRILEYMDWGEYGALGTFVKIGSIGAKEVDDEDDMFILVAPQNAVGNCIIDDMKA 449

Query: 184 MTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPI 243
           MTDAAG RPVIL+NPRLKDLPGSSGIMQTMGR+KRLEYAASFE CYFFRLLYYAGTQYPI
Sbjct: 450 MTDAAGDRPVILVNPRLKDLPGSSGIMQTMGREKRLEYAASFEICYFFRLLYYAGTQYPI 509

Query: 244 MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGF 303
           MGALRM+YP GYELY+RVD PSG EKY+ LS + ++PS+DDINDAF G  R   K +SG 
Sbjct: 510 MGALRMSYPYGYELYRRVDEPSGKEKYIELSKYPKRPSTDDINDAFQGNKRKEAKSSSGI 569

Query: 304 WGFLSSVF 311
           WGFLS + 
Sbjct: 570 WGFLSGIL 577


>gi|356556590|ref|XP_003546607.1| PREDICTED: uncharacterized protein LOC100818830 [Glycine max]
          Length = 591

 Score =  517 bits (1331), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 240/288 (83%), Positives = 264/288 (91%)

Query: 24  QDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYR 83
           QD W+GIPTRLNNIPHSREIR YFYDDVLQATQRA++DG+TR+KV+INIPELNPEMDVYR
Sbjct: 304 QDKWRGIPTRLNNIPHSREIRKYFYDDVLQATQRAISDGKTRVKVDINIPELNPEMDVYR 363

Query: 84  IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 143
           IGTLMELVR +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK
Sbjct: 364 IGTLMELVRALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 423

Query: 144 GTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDL 203
           GTF+ IGSIGA EV+EQDDMFILVAPQNAVGNCIIDD++AMT+AA  RPVILIN RLKDL
Sbjct: 424 GTFINIGSIGAAEVEEQDDMFILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARLKDL 483

Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
           PGSSGIMQT+GRD+RL+YAASFE+CYFFRLLYYAGTQYPIMGA+RMTYP  Y+LYKRVD 
Sbjct: 484 PGSSGIMQTIGRDQRLKYAASFESCYFFRLLYYAGTQYPIMGAIRMTYPYQYDLYKRVDE 543

Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
             G EKYVILSTF E+P++D+INDAF GKP    +K SG WGFLS + 
Sbjct: 544 SPGKEKYVILSTFPERPTTDEINDAFQGKPSKDPRKTSGIWGFLSGIL 591


>gi|224144020|ref|XP_002325159.1| predicted protein [Populus trichocarpa]
 gi|222866593|gb|EEF03724.1| predicted protein [Populus trichocarpa]
          Length = 520

 Score =  517 bits (1331), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 241/273 (88%), Positives = 261/273 (95%)

Query: 22  SSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDV 81
           S QD W+GIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRL+V+INIPELNPEMDV
Sbjct: 248 SLQDKWRGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLRVDINIPELNPEMDV 307

Query: 82  YRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYG 141
           YRIGTLMELVRVIALSF+DDGK VKVCVQGSMG+GALAGMPLQLAGTR ILEFMDWGDYG
Sbjct: 308 YRIGTLMELVRVIALSFSDDGKHVKVCVQGSMGKGALAGMPLQLAGTRNILEFMDWGDYG 367

Query: 142 AKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLK 201
           AKGTF++IGSIG  EV E+DDMFILVAPQNAVGNCII+D+KAMT AAG+RPVILINP+LK
Sbjct: 368 AKGTFIKIGSIGEQEVSEEDDMFILVAPQNAVGNCIIEDLKAMTVAAGSRPVILINPKLK 427

Query: 202 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 261
           DLPGSSGIMQTMGRDKRLEYAASFE+CYFFRLLYYAGTQYPIMGALRM+YP  YELYKRV
Sbjct: 428 DLPGSSGIMQTMGRDKRLEYAASFESCYFFRLLYYAGTQYPIMGALRMSYPYRYELYKRV 487

Query: 262 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPR 294
           D PSG EKY+ILSTF+EKPS+DD+NDAF+G+PR
Sbjct: 488 DQPSGTEKYMILSTFSEKPSTDDVNDAFMGRPR 520


>gi|22327339|ref|NP_198367.2| adenylate kinase family protein [Arabidopsis thaliana]
 gi|17979434|gb|AAL49859.1| putative adenylate kinase [Arabidopsis thaliana]
 gi|22136756|gb|AAM91697.1| putative adenylate kinase [Arabidopsis thaliana]
 gi|110740593|dbj|BAE98401.1| adenylate kinase -like protein [Arabidopsis thaliana]
 gi|332006553|gb|AED93936.1| adenylate kinase family protein [Arabidopsis thaliana]
          Length = 588

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 227/300 (75%), Positives = 265/300 (88%)

Query: 12  ERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEIN 71
           +R  ++++ S  QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+IN
Sbjct: 288 KRMIKTDKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDIN 347

Query: 72  IPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 131
           IPELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI
Sbjct: 348 IPELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 407

Query: 132 LEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
           LE+MDWGD    GTFV++G+IG  EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG R
Sbjct: 408 LEYMDWGDDETLGTFVKLGAIGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKR 467

Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
           PV+LINPRLKDLP SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+Y
Sbjct: 468 PVVLINPRLKDLPASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSY 527

Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           P  YELYKRV+  +G EKYV+L+T+ E+P+ + I+DAF GK R++ KKASG WGFLSSVF
Sbjct: 528 PYRYELYKRVNEENGKEKYVLLATYAERPTPEQIDDAFSGKSRDQSKKASGIWGFLSSVF 587


>gi|297801048|ref|XP_002868408.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314244|gb|EFH44667.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 595

 Score =  490 bits (1261), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 227/293 (77%), Positives = 261/293 (89%)

Query: 19  QVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 78
           + S  QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+INIPELNPE
Sbjct: 302 KASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPE 361

Query: 79  MDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 138
           MDVYRIGTLMELVRV+ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWG
Sbjct: 362 MDVYRIGTLMELVRVLALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWG 421

Query: 139 DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINP 198
           D    GTFV++G+IG  EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINP
Sbjct: 422 DDETLGTFVKLGAIGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINP 481

Query: 199 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELY 258
           RLKDLP SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP  YELY
Sbjct: 482 RLKDLPASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELY 541

Query: 259 KRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           KRV+  +G EKYV+L+T++E+P+ + I+DAF GK R++ KK SG WGFLSSVF
Sbjct: 542 KRVNEENGKEKYVLLATYSERPTPEQIDDAFSGKSRDQSKKPSGIWGFLSSVF 594


>gi|357145496|ref|XP_003573662.1| PREDICTED: uncharacterized protein LOC100845101 [Brachypodium
           distachyon]
          Length = 596

 Score =  486 bits (1250), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 219/292 (75%), Positives = 259/292 (88%)

Query: 20  VSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEM 79
            ++S++ W+GIPTRLNNIPHSREIR YFYDDVLQAT+RA+ D +TRL+++INIPELNPEM
Sbjct: 305 TTTSKNEWRGIPTRLNNIPHSREIRKYFYDDVLQATKRAIEDKKTRLQIDINIPELNPEM 364

Query: 80  DVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
           DVYRIGTLMELVR I+L+FADDGKRVKV VQGSMG+GA AG+PLQLAGTRKILEFMDWGD
Sbjct: 365 DVYRIGTLMELVREISLAFADDGKRVKVSVQGSMGQGAFAGIPLQLAGTRKILEFMDWGD 424

Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
           YGAKG F+ IG++GA+EVD++DDMFIL+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPR
Sbjct: 425 YGAKGAFINIGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPR 484

Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYK 259
           LKD+P SSG+MQTMGRD RL+YAASFE CY FRLL+YAGT YPIMGALRM YPN YE+Y+
Sbjct: 485 LKDMPASSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGTFYPIMGALRMGYPNKYEIYR 544

Query: 260 RVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           RVD P+G EKYV+L+   +KP++DDI +AF G  + +E   SGFWGFLS + 
Sbjct: 545 RVDEPNGKEKYVLLAELMDKPTADDITNAFKGPKKEKESAPSGFWGFLSGIL 596


>gi|297608344|ref|NP_001061454.2| Os08g0288200 [Oryza sativa Japonica Group]
 gi|255678327|dbj|BAF23368.2| Os08g0288200, partial [Oryza sativa Japonica Group]
          Length = 493

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 218/295 (73%), Positives = 258/295 (87%)

Query: 17  SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
           S   ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 199 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 258

Query: 77  PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
           PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 259 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 318

Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
           WG+YGAKGTF+  G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 319 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 378

Query: 197 NPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYE 256
           NPRLKD+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE
Sbjct: 379 NPRLKDMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYE 438

Query: 257 LYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           +Y+RVD P+G E+YV+L  F EKP+ D+I +AF  +    EK ASGFWGFLS + 
Sbjct: 439 IYRRVDEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 493


>gi|79329010|ref|NP_001031966.1| adenylate kinase family protein [Arabidopsis thaliana]
 gi|332006554|gb|AED93937.1| adenylate kinase family protein [Arabidopsis thaliana]
          Length = 580

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 220/293 (75%), Positives = 258/293 (88%)

Query: 12  ERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEIN 71
           +R  ++++ S  QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+IN
Sbjct: 288 KRMIKTDKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDIN 347

Query: 72  IPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 131
           IPELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI
Sbjct: 348 IPELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKI 407

Query: 132 LEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
           LE+MDWGD    GTFV++G+IG  EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG R
Sbjct: 408 LEYMDWGDDETLGTFVKLGAIGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKR 467

Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
           PV+LINPRLKDLP SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+Y
Sbjct: 468 PVVLINPRLKDLPASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSY 527

Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 304
           P  YELYKRV+  +G EKYV+L+T+ E+P+ + I+DAF GK R++ KKASG W
Sbjct: 528 PYRYELYKRVNEENGKEKYVLLATYAERPTPEQIDDAFSGKSRDQSKKASGIW 580


>gi|326499868|dbj|BAJ90769.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326502206|dbj|BAJ95166.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 596

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 214/292 (73%), Positives = 257/292 (88%), Gaps = 1/292 (0%)

Query: 20  VSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEM 79
            ++S+  W+GIPTRLNNIPHSREIR YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+
Sbjct: 306 TTASKKEWRGIPTRLNNIPHSREIRKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPEL 365

Query: 80  DVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
           DVYRIGTLMELVR ++L+FADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGD
Sbjct: 366 DVYRIGTLMELVRDLSLTFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGD 425

Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
           YGA G F+ IG++GA+EVD++DDMF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPR
Sbjct: 426 YGAMGAFINIGAVGASEVDKEDDMFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPR 485

Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYK 259
           LKD+P SSG+MQTMGRD RL+YAASFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+
Sbjct: 486 LKDMPASSGVMQTMGRDLRLQYAASFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYR 545

Query: 260 RVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           RVD P+ GEKY +L+ FT  P++DDI +AF+G  + +E   SGFWGFLS + 
Sbjct: 546 RVDEPN-GEKYDLLAEFTANPTADDITNAFVGTKKKKENAPSGFWGFLSGIL 596


>gi|38175634|dbj|BAD01341.1| putative Adenylate kinase, chloroplast [Oryza sativa Japonica
           Group]
          Length = 467

 Score =  477 bits (1227), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 218/295 (73%), Positives = 258/295 (87%)

Query: 17  SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
           S   ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 173 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 232

Query: 77  PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
           PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 233 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 292

Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
           WG+YGAKGTF+  G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 293 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 352

Query: 197 NPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYE 256
           NPRLKD+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE
Sbjct: 353 NPRLKDMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYE 412

Query: 257 LYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           +Y+RVD P+G E+YV+L  F EKP+ D+I +AF  +    EK ASGFWGFLS + 
Sbjct: 413 IYRRVDEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 467


>gi|293333213|ref|NP_001168735.1| uncharacterized protein LOC100382527 [Zea mays]
 gi|223972711|gb|ACN30543.1| unknown [Zea mays]
          Length = 599

 Score =  477 bits (1227), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 216/291 (74%), Positives = 258/291 (88%), Gaps = 1/291 (0%)

Query: 21  SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 80
           + S++ W+GIPTRLNNIPHSREIR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMD
Sbjct: 310 TGSKNEWRGIPTRLNNIPHSREIREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMD 369

Query: 81  VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 140
           VYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDY
Sbjct: 370 VYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDY 429

Query: 141 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
           GAKGTF+ IG++GA +VD++DDMF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRL
Sbjct: 430 GAKGTFINIGAVGARDVDKEDDMFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRL 489

Query: 201 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 260
           KD+P SSG+MQTMGRD RL+YAASFE CY FRLLYYAG+ YPIMGALRM YPN YE+Y+R
Sbjct: 490 KDMPASSGVMQTMGRDIRLKYAASFETCYSFRLLYYAGSFYPIMGALRMAYPNKYEIYRR 549

Query: 261 VDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           VD  +G E Y++++ FT KP+ D+I  AF G+ R  EK +SGFWGFLS +F
Sbjct: 550 VDTLNGKENYILIAEFTNKPTPDEITSAFKGQKRE-EKASSGFWGFLSGLF 599


>gi|326509295|dbj|BAJ91564.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 512

 Score =  476 bits (1224), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 214/292 (73%), Positives = 257/292 (88%), Gaps = 1/292 (0%)

Query: 20  VSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEM 79
            ++S+  W+GIPTRLNNIPHSREIR YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+
Sbjct: 222 TTASKKEWRGIPTRLNNIPHSREIRKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPEL 281

Query: 80  DVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
           DVYRIGTLMELVR ++L+FADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGD
Sbjct: 282 DVYRIGTLMELVRDLSLTFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGD 341

Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
           YGA G F+ IG++GA+EVD++DDMF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPR
Sbjct: 342 YGAMGAFINIGAVGASEVDKEDDMFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPR 401

Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYK 259
           LKD+P SSG+MQTMGRD RL+YAASFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+
Sbjct: 402 LKDMPASSGVMQTMGRDLRLQYAASFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYR 461

Query: 260 RVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           RVD P+ GEKY +L+ FT  P++DDI +AF+G  + +E   SGFWGFLS + 
Sbjct: 462 RVDEPN-GEKYDLLAEFTANPTADDITNAFVGTKKKKENAPSGFWGFLSGIL 512


>gi|10257489|dbj|BAB11193.1| unnamed protein product [Arabidopsis thaliana]
          Length = 337

 Score =  439 bits (1128), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 213/330 (64%), Positives = 249/330 (75%), Gaps = 48/330 (14%)

Query: 13  RKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINI 72
           ++  +++ S  QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+INI
Sbjct: 8   KQVLADKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDINI 67

Query: 73  PELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 132
           PELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL
Sbjct: 68  PELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 127

Query: 133 EFMDWGDYGAKGTFVEIGSI---------------------------------------- 152
           E+MDWGD    GTFV++G+I                                        
Sbjct: 128 EYMDWGDDETLGTFVKLGAIVTRESLNLQKLHHQQNIVVSIIKLVMHNKVGYIELHIILL 187

Query: 153 --------GANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 204
                   G  EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP
Sbjct: 188 FDYVFAMTGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLP 247

Query: 205 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 264
            SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP  YELYKRV+  
Sbjct: 248 ASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEE 307

Query: 265 SGGEKYVILSTFTEKPSSDDINDAFLGKPR 294
           +G EKYV+L+T+ E+P+ + I+DAF GK R
Sbjct: 308 NGKEKYVLLATYAERPTPEQIDDAFSGKSR 337


>gi|302769091|ref|XP_002967965.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
 gi|300164703|gb|EFJ31312.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
          Length = 575

 Score =  389 bits (1000), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 177/284 (62%), Positives = 223/284 (78%), Gaps = 1/284 (0%)

Query: 24  QDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYR 83
           QD+W+G+PT+LN+IPHSREIR YFY +V +  + AV DG  +LKVEI IPELNPEMDVYR
Sbjct: 291 QDSWRGMPTKLNSIPHSREIREYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYR 350

Query: 84  IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 143
            GTL+ELVR +A +FA+DGKRVKVCVQG MGEG   GMPLQL+G+R++LE MDWGDYGAK
Sbjct: 351 AGTLLELVRELAFTFANDGKRVKVCVQGPMGEGIFRGMPLQLSGSRRMLEVMDWGDYGAK 410

Query: 144 GTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDL 203
           GTFV IG+I A++V E DDMFIL+APQNAVGNCII+D++AM  AAG RPVIL+NP+LKD+
Sbjct: 411 GTFVNIGAIAASDVFETDDMFILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDV 470

Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
           PG+ G+MQ +GR++R+E++ SF  CY FRLLY AGT YPIMGALRM YP  YE++KR+D 
Sbjct: 471 PGAGGVMQVIGREQRMEFSDSFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDT 530

Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFL 307
               E Y+  +TF  +P+  ++ D F G+ +   K  S  W F 
Sbjct: 531 GYSSETYIQEATFAGRPTPTELQDVFAGRSKEASKTKS-IWSFF 573


>gi|302761066|ref|XP_002963955.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
 gi|300167684|gb|EFJ34288.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
          Length = 575

 Score =  389 bits (1000), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 177/284 (62%), Positives = 222/284 (78%), Gaps = 1/284 (0%)

Query: 24  QDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYR 83
           QD+W+G+PT+LN+IPHSREIR YFY +V +  + AV DG  +LKVEI IPELNPEMDVYR
Sbjct: 291 QDSWRGMPTKLNSIPHSREIREYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYR 350

Query: 84  IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAK 143
            GTL+ELVR +A +FA+DGKRVKVCVQG MGEG   GMPLQL+G+R +LE MDWGDYGAK
Sbjct: 351 AGTLLELVRELAFTFANDGKRVKVCVQGPMGEGIFRGMPLQLSGSRMMLEVMDWGDYGAK 410

Query: 144 GTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDL 203
           GTFV IG+I A++V E DDMFIL+APQNAVGNCII+D++AM  AAG RPVIL+NP+LKD+
Sbjct: 411 GTFVNIGAIAASDVSETDDMFILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDV 470

Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
           PG+ G+MQ +GR++R+E++ SF  CY FRLLY AGT YPIMGALRM YP  YE++KR+D 
Sbjct: 471 PGAGGVMQVIGREQRMEFSDSFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDT 530

Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFL 307
               E Y+  +TF  +P+  ++ D F G+ +   K  S  W F 
Sbjct: 531 GYSSETYIQEATFPGRPTPTELQDVFAGRSKEASKTKS-IWSFF 573


>gi|168018045|ref|XP_001761557.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687241|gb|EDQ73625.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 264

 Score =  361 bits (926), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 162/264 (61%), Positives = 211/264 (79%), Gaps = 1/264 (0%)

Query: 27  WKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGT 86
           W+G+PT+LN  PHSREIR YFY D  QA + A++DG+ +LKV + IPELNPEMDVYRIGT
Sbjct: 1   WRGVPTKLNTCPHSREIRQYFYKDACQAAKNAIDDGKHKLKVLLTIPELNPEMDVYRIGT 60

Query: 87  LMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 146
           L+EL+R +A +FA+DGKRV+VCVQG MGEG  +GMPL L+GTR +LE MDWG+Y  K TF
Sbjct: 61  LLELIRELAFTFANDGKRVRVCVQGPMGEGIFSGMPLALSGTRPMLERMDWGEYIEKDTF 120

Query: 147 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 206
           +  G++GA EV + DDMFIL+APQNAVGNCIIDD+KAM +AAG RPVI++NP+LKD+P S
Sbjct: 121 IRFGAVGAKEVYDDDDMFILMAPQNAVGNCIIDDLKAMVEAAGDRPVIILNPKLKDIPAS 180

Query: 207 SGIMQTM-GRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPS 265
            GIMQ + GR +R+E+A SF  CY FRL+Y +GTQYPI+GA RM YP  YE+Y+R D+  
Sbjct: 181 GGIMQQVGGRQERMEFAESFFTCYNFRLMYTSGTQYPIVGAFRMAYPGPYEIYRRFDLDR 240

Query: 266 GGEKYVILSTFTEKPSSDDINDAF 289
           G E Y ++++F  +P+  +I+DA+
Sbjct: 241 GVEVYKLIASFDHEPTRSEISDAY 264


>gi|218200847|gb|EEC83274.1| hypothetical protein OsI_28627 [Oryza sativa Indica Group]
          Length = 681

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 152/215 (70%), Positives = 184/215 (85%), Gaps = 6/215 (2%)

Query: 17  SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
           S   ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 315 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 374

Query: 77  PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
           PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 375 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 434

Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
           WG+YGAKGTF+  G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 435 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 494

Query: 197 NPRLKDLP------GSSGIMQTMGRDKRLEYAASF 225
           NPRLK         GS G ++  G+++ ++YAA  
Sbjct: 495 NPRLKGCGVGVRGWGSRGGVKVNGKNRVVKYAADL 529



 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 73/110 (66%), Positives = 88/110 (80%)

Query: 202 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 261
           D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 572 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 631

Query: 262 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           D P+G E+YV+L  F EKP+ D+I +AF  +    EK ASGFWGFLS + 
Sbjct: 632 DEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 681


>gi|222640270|gb|EEE68402.1| hypothetical protein OsJ_26756 [Oryza sativa Japonica Group]
          Length = 680

 Score =  318 bits (816), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 152/215 (70%), Positives = 183/215 (85%), Gaps = 6/215 (2%)

Query: 17  SNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELN 76
           S   ++S++ ++GIPTRLNNIPHSREIR YFY+DVL AT+ AV D +TRL+++INIPELN
Sbjct: 314 SKDTTASKNEFRGIPTRLNNIPHSREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELN 373

Query: 77  PEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMD 136
           PEMDVYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MD
Sbjct: 374 PEMDVYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMD 433

Query: 137 WGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILI 196
           WG+YGAKGTF+  G++GA+EVD++DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+
Sbjct: 434 WGEYGAKGTFINFGAVGASEVDKEDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILV 493

Query: 197 NPRLKDLP------GSSGIMQTMGRDKRLEYAASF 225
           NPRLK         GS G  +  G+++ ++YAA  
Sbjct: 494 NPRLKGCGVGVRGWGSRGGGKVNGKNRVVKYAADL 528



 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 73/110 (66%), Positives = 88/110 (80%)

Query: 202 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 261
           D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 571 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 630

Query: 262 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 311
           D P+G E+YV+L  F EKP+ D+I +AF  +    EK ASGFWGFLS + 
Sbjct: 631 DEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 680


>gi|384250870|gb|EIE24349.1| hypothetical protein COCSUDRAFT_65858 [Coccomyxa subellipsoidea
           C-169]
          Length = 291

 Score =  243 bits (620), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 126/278 (45%), Positives = 176/278 (63%), Gaps = 10/278 (3%)

Query: 30  IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
           IP RLNN+P SRE R +FY+DV +A ++AV      L V++ +PE NPEMDVYRIGT++E
Sbjct: 18  IPQRLNNVPASRETRQFFYNDVCRAVEKAVAAEERLLSVKLTVPETNPEMDVYRIGTVLE 77

Query: 90  LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
           LVR +    A DGKRVKVCVQ ++G+G   G+PL L+G R+I+E MDWG+      FV  
Sbjct: 78  LVRDLVTDLAADGKRVKVCVQQALGQGVFQGLPLSLSGVRRIMENMDWGEV---FEFVSF 134

Query: 150 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSS 207
           G++G++ +D+    +++VAPQN VG+ I+  +  M  AA    + +IL NP LKDLP +S
Sbjct: 135 GNVGSDHLDD-SAYYVIVAPQNVVGSTIMTKLLEMATAAVEQKKTLILFNPLLKDLPSAS 193

Query: 208 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNGYELYKRVDVPSG 266
           G+M   GR +R+E+A SF   Y FRLLY +    YPI GALR  Y   +E++KR D+   
Sbjct: 194 GVMGVRGRRERMEFADSFIPAYHFRLLYSSSAMMYPIRGALRHVYGGPWEVFKREDMGKR 253

Query: 267 GEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 304
            E+Y  + +F  +P    +   F  +P    KK SG W
Sbjct: 254 TEEYRFVGSFDSQPDGSLLTPLFQEQP---PKKNSGAW 288


>gi|413922034|gb|AFW61966.1| hypothetical protein ZEAMMB73_400224, partial [Zea mays]
          Length = 446

 Score =  236 bits (603), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 105/133 (78%), Positives = 124/133 (93%)

Query: 21  SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 80
           + S++ W+GIPTRLNNIPHSREIR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMD
Sbjct: 310 TGSKNEWRGIPTRLNNIPHSREIREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMD 369

Query: 81  VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 140
           VYRIGTLMELVR ++LSFADDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDY
Sbjct: 370 VYRIGTLMELVRELSLSFADDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDY 429

Query: 141 GAKGTFVEIGSIG 153
           GAKGTF+ IG++G
Sbjct: 430 GAKGTFINIGAVG 442


>gi|303283278|ref|XP_003060930.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457281|gb|EEH54580.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 354

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 113/233 (48%), Positives = 155/233 (66%), Gaps = 5/233 (2%)

Query: 30  IPTRLNNIPHSREIRNYFYDDVLQATQRAVN--DGRTRLKVEINIPELNPEMDVYRIGTL 87
           IP  LNN PH RE+R +FYDD + +  RAV   +  TR+K ++  PELN   DVYR+GTL
Sbjct: 85  IPIMLNNTPHPREVRRWFYDDCVTSVTRAVTGEEPMTRVKAKVEFPELNVNGDVYRVGTL 144

Query: 88  MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 147
           +ELVR +A++ A DGKRV+VCVQGSMGEG   G+PL L+G  +IL+ MDWGD      F+
Sbjct: 145 LELVRELAMTLAKDGKRVRVCVQGSMGEGVFQGLPLSLSGVARILDMMDWGD---ADEFI 201

Query: 148 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 207
             G+I  +    +DD FIL+APQN VG  ++  + AM +AAG RP+I+INP+L D+  + 
Sbjct: 202 SRGAIDGDVPQPEDDFFILIAPQNIVGFAVLPRIIAMEEAAGDRPMIMINPKLDDIQSAG 261

Query: 208 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 260
            +M   GR +R EY +++   Y FRLLY     +PI GA+R T+   +ELYK+
Sbjct: 262 NVMSIRGRGERREYVSTWREVYHFRLLYRKPYFFPIYGAMRYTHDGKWELYKK 314


>gi|307109260|gb|EFN57498.1| hypothetical protein CHLNCDRAFT_21254 [Chlorella variabilis]
          Length = 396

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 123/314 (39%), Positives = 171/314 (54%), Gaps = 55/314 (17%)

Query: 30  IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
           IPTRLN IPH R  R +FY +   A +RA++ G TR+     IPELN E DVYR+GTL+E
Sbjct: 76  IPTRLNTIPHERTTRQHFYREATTAIKRAIDAGETRVLARCTIPELNTEFDVYRVGTLLE 135

Query: 90  LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
           LVR +A S A DGKRVKVCVQ ++G+G   GMPL L G  +I++ MDWG+   KG     
Sbjct: 136 LVREVATSLAADGKRVKVCVQQALGQGVFQGMPLSLNGVLRIMKQMDWGESAEKGVSSAR 195

Query: 150 GSI----------------------------------------------------GANEV 157
           G +                                                    GA EV
Sbjct: 196 GGVVGCLFCVHTGRSHEEGMCACSAWLVHSSPCPVLTSLTCHPATAADRILTGNLGAAEV 255

Query: 158 DEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSSGIMQTMGR 215
            E  D F+L+APQN VG+ I+  +  M  AA    +P++LINP+L D+  ++G+M   GR
Sbjct: 256 -EDADAFVLIAPQNIVGHSIMPLLSEMVAAAEEAGKPMVLINPKLTDIQSAAGVMSVRGR 314

Query: 216 DKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 275
             RL++ A+F   Y FRLLY   + +PIMGALR  +   +E+++RV++  G E+Y ++ +
Sbjct: 315 QDRLDFTATFGTAYHFRLLYKGVSMHPIMGALRHEHDGPWEVFRRVELGPGVEEYQLIGS 374

Query: 276 FTEKPSSDDINDAF 289
           F  +P+   I DAF
Sbjct: 375 FDREPTPPLITDAF 388


>gi|255078468|ref|XP_002502814.1| predicted protein [Micromonas sp. RCC299]
 gi|226518080|gb|ACO64072.1| predicted protein [Micromonas sp. RCC299]
          Length = 331

 Score =  222 bits (566), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 120/267 (44%), Positives = 164/267 (61%), Gaps = 8/267 (2%)

Query: 30  IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDG--RTRLKVEINIPELNPEMDVYRIGTL 87
           IP  LNN PH RE+R +FYDD  ++   AV     +TR+K  I  PELN   DVYRIGTL
Sbjct: 68  IPKMLNNTPHPREVRKWFYDDCRESVVAAVTGANPKTRVKAFIEFPELNVNGDVYRIGTL 127

Query: 88  MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 147
           +ELVR +A++ A DGKRV+VCVQGSMG+G   G+PL L+G  +IL+ MDWGD      F+
Sbjct: 128 LELVREVAMALAADGKRVRVCVQGSMGQGVFQGLPLSLSGVARILDMMDWGD---ADEFI 184

Query: 148 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 207
             GSIG +     D  FIL++PQN VG  ++  ++ M  AAG RP+I+INP+L D+  + 
Sbjct: 185 ARGSIGGDVPRPDDAYFILISPQNIVGYSVLPYLQEMEKAAGDRPIIMINPKLGDIQSAG 244

Query: 208 GIMQTMGRDKRLEYA-ASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSG 266
            +M   GR +R EY   ++E  Y FRLLY     +PI GALR      +ELYK+      
Sbjct: 245 NVMSIRGRGERREYVEKTWEEVYHFRLLYKKPFFFPIYGALRYGTGGMWELYKKFGKME- 303

Query: 267 GEKYVILSTFTE-KPSSDDINDAFLGK 292
            E++ +L T+ + +P + +I    L K
Sbjct: 304 QEEWKLLRTYDDGEPGAAEITKRILAK 330


>gi|412991462|emb|CCO16307.1| predicted protein [Bathycoccus prasinos]
          Length = 352

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 117/278 (42%), Positives = 169/278 (60%), Gaps = 17/278 (6%)

Query: 29  GIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGR-TRLKVEINIPELNPEMDVYRIGTL 87
            +PT LNNIPH+REIR +FYDD  ++  RA+ D R  R+KV    PE+N E DV+R GTL
Sbjct: 77  ALPTLLNNIPHTREIRRFFYDDATESVLRAIKDDRNVRMKVFTEFPEMNVEGDVFRAGTL 136

Query: 88  MELVRVIALSFA----DDG----KRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 139
           +E+VR +A   A     DG    +RVK+CVQGSMG G   G+PL L+G  ++++ MDW +
Sbjct: 137 LEMVRTMATELARMERKDGSGGKQRVKLCVQGSMGTGVFQGLPLALSGISRLIDLMDWEE 196

Query: 140 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 199
             A+   +  G+IG +   + +D+FIL+ PQN VG  I+  M+ M + AG RP+IL+NP+
Sbjct: 197 DVAER--ISNGAIGKDHAKDDEDVFILICPQNIVGYSILPYMEEMMEVAGNRPMILLNPK 254

Query: 200 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNG----- 254
           L D+  +  +M   GR  R+E+A S+E  Y FRLLY     +PI GALR +Y +      
Sbjct: 255 LGDIQSAGNVMSIRGRQGRMEFANSWEEIYHFRLLYRKPFFFPIYGALRKSYSSAEEDDT 314

Query: 255 YELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 292
           +ELYKR       E Y ++  + ++P+ D +     GK
Sbjct: 315 WELYKRFG-KMDDEYYQLMEVYKKEPNPDQMTKVIWGK 351


>gi|145349122|ref|XP_001418989.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579219|gb|ABO97282.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 290

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 112/263 (42%), Positives = 158/263 (60%), Gaps = 4/263 (1%)

Query: 30  IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
           +PT LNN+PH+R  R +FY+D  ++   AV     R+K++   PELN   DVYR+GT +E
Sbjct: 31  VPTLLNNVPHARATRRWFYEDATKSIVAAVTARERRIKIKSEFPELNVNGDVYRVGTTLE 90

Query: 90  LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
           LVR  A + A DG RVK+CVQ SMG+G    +PL L+G  K+L  MDW + G +G     
Sbjct: 91  LVRETAAALARDGLRVKICVQRSMGQGVFQALPLSLSGVSKLLGMMDW-EPGTEGMIDPS 149

Query: 150 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGI 209
           G IGA +  E  D FILV PQN VG  ++  ++ M    G +P+I+INPRL D+  +  +
Sbjct: 150 GGIGAAD-GEGADCFILVQPQNIVGYSVLPYIEEMEAVVGDKPMIMINPRLDDVQSAGNV 208

Query: 210 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGGE 268
           M   GR +R++  A+++  Y FRLLY     +PI GALR   Y N +ELYKR       E
Sbjct: 209 MSVRGRKERMDAVAAWKEVYHFRLLYRKPYFFPIYGALRFAYYENEWELYKRTGARE-LE 267

Query: 269 KYVILSTFTEKPSSDDINDAFLG 291
           +YV+L T+  +P+ D++     G
Sbjct: 268 QYVLLGTYPSEPTPDEMTKKIWG 290


>gi|428174173|gb|EKX43070.1| hypothetical protein GUITHDRAFT_110795 [Guillardia theta CCMP2712]
          Length = 584

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 119/273 (43%), Positives = 159/273 (58%), Gaps = 26/273 (9%)

Query: 34  LNNIPHSREIRNYFYDDVLQATQRAVNDGR------TRLKVEINIPELNPEMDVYRIGTL 87
           LN IP + EIR +FY DV  A Q A+ D R       RLK+E   PELNPEMD +R+GT+
Sbjct: 322 LNEIPCNTEIREWFYRDVCDAVQVALQDSRFLESCAGRLKIECVPPELNPEMDSFRVGTM 381

Query: 88  MELVRVIALSFADD-GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 146
           +E+VR IA+      G+RVKVC+QGS+GEG   GMPL L G R+++E MDW      G  
Sbjct: 382 LEMVREIAIRVGVVMGRRVKVCIQGSLGEGIFTGMPLMLNGMRRVMESMDW--QARPGEM 439

Query: 147 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPG 205
            E   I         ++FI++APQ+ VG  + + +  M +AAG  R VILINPRLKD P 
Sbjct: 440 YEGLLI---------NVFIIIAPQSMVGASVYEPLSQMVEAAGPKRTVILINPRLKDRPS 490

Query: 206 SSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNG--YELYKRVD 262
           S  +M   GR +R+ +AASF   + FRLLY   T  +PI GALRM   N   + L+KR  
Sbjct: 491 SGNVMSVQGRSQRIAFAASFREIFHFRLLYTGTTFMFPIQGALRMNMANSPFWTLFKRET 550

Query: 263 VPS----GGEKYVILSTFTEKPSSDDINDAFLG 291
             +     GE Y  ++ F E+P ++ I +   G
Sbjct: 551 TTTPDERTGESYQPIAVFAEEPKTEAITNMMNG 583


>gi|219124197|ref|XP_002182396.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406357|gb|EEC46297.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 406

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 166/284 (58%), Gaps = 28/284 (9%)

Query: 34  LNNIPHSREIRNYFYDDVLQATQRAV---NDGRT--RLKVEINIPELNPEMDVYRIGTLM 88
           LN +PH+R +R+YFY+    A   AV   ++G+   R+ +    PE+NP MD YRIGT++
Sbjct: 121 LNQVPHNRYVRDYFYEQAALAVHDAVVLCSEGKCINRMLITSQFPEMNPSMDSYRIGTIL 180

Query: 89  ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 142
           E+VR I +  A++  RV++CVQGSMG G   GMP QL G  KI++ MDW       + G 
Sbjct: 181 EMVRTIGIKLAEENLRVRICVQGSMGVGIFTGMPKQLNGVSKIIQMMDWQSGEGELNEGM 240

Query: 143 KGTFVEIGSIGA------------NEVDEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 188
            G ++  G++G             N V  QDD+FIL+APQ+ VG  + I+  ++ M +AA
Sbjct: 241 VGDYIRFGAVGPEHVLNEEKDKDDNVVQYQDDVFILIAPQSMVGTDSSIMPLLQGMVEAA 300

Query: 189 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 247
           G RPVIL+NP L D   ++G     GR +R+++A SF+  Y F+ +Y +GT Y PI+GA+
Sbjct: 301 GNRPVILMNPDLTDKVSAAGQQSVRGRQQRIDFAESFQTVYHFQNIYISGTSYFPILGAI 360

Query: 248 RMTYPNGYEL-YKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 289
              +P    L ++R D   G GE YV +      P  ++I DAF
Sbjct: 361 TKLHPKEPWLAHQRRDYADGEGEIYVPVLAGEVIPKGEEILDAF 404


>gi|299473139|emb|CBN78715.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 442

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 107/269 (39%), Positives = 162/269 (60%), Gaps = 9/269 (3%)

Query: 26  NWKGIPTR--LNNIPHSREIRNYFYDDVLQATQRAVNDGRT--RLKVEINIPELNPEMDV 81
           ++ G+P    LNN+PHS+  RN+ Y    +A   AV D     R+K+   +PELNP MD 
Sbjct: 167 SYGGLPVNMLLNNVPHSKMARNFIYRGAAEALVLAVEDPTVPRRMKLVCTVPELNPAMDT 226

Query: 82  YRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYG 141
           YR+GT++ELVR +A S AD G   ++ +Q SMG+G    +PL L+G   I++ MDW D G
Sbjct: 227 YRVGTVLELVRELAYSMADVGLNTRILIQPSMGKGVFKSLPLALSGVMSIMQGMDWED-G 285

Query: 142 AKGTFVEIGSIG-ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
             G+ V  G +G  ++V  +DD+ ++++PQ+  G  I   +    + A  RPV++INP L
Sbjct: 286 LVGSHVNFGQLGNKDDVSAEDDVILVISPQSIAGYSIHPLLVDTVEHANGRPVVIINPNL 345

Query: 201 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQ-YPIMGALRMTYPNG-YELY 258
           KD P S G+MQ  GR +R+ +A SF++ Y FRLLY +    +PI GA+    P+  Y ++
Sbjct: 346 KDRPSSGGVMQVRGRGERIAFAESFKDIYNFRLLYGSSVAFFPIKGAVMKAGPSEPYVVF 405

Query: 259 KRVDV-PSGGEKYVILSTFTEKPSSDDIN 286
            R +   S GE+YV L TF + P +++I 
Sbjct: 406 DRRNTGKSKGEEYVPLGTFDDDPDAENIT 434


>gi|397643725|gb|EJK76041.1| hypothetical protein THAOC_02216 [Thalassiosira oceanica]
          Length = 1862

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 115/284 (40%), Positives = 159/284 (55%), Gaps = 28/284 (9%)

Query: 34   LNNIPHSREIRNYFYDDVLQATQRAV---NDG--RTRLKVEINIPELNPEMDVYRIGTLM 88
            LN +PH+R +R YFY+    A   AV   + G    R+K+++  PE+NP+MD YRIGTL+
Sbjct: 1576 LNQVPHNRYVRQYFYEMAADAALEAVVLCSAGFISNRMKLQVMFPEMNPQMDSYRIGTLL 1635

Query: 89   ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 142
            EL R IA+ FA+   RV+VCVQGS+G+G    +P QL G   IL+ MDW       + G 
Sbjct: 1636 ELTRTIAIRFAEQNLRVRVCVQGSIGQGIFTAIPKQLNGVATILKRMDWQSGEGEENEGM 1695

Query: 143  KGTFVEIGSIGANEV------------DEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 188
             G FV  G IG   V             +QDD+FIL+ PQN  G  + II  +  MT+A 
Sbjct: 1696 VGDFVNFGHIGKEHVVNFKEGEGGEADTQQDDVFILICPQNMQGIESSIIPYLLEMTEAV 1755

Query: 189  GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 247
            G RP+ILINP L D   S G     GR  R+++A SF+  + F+ +YY+GT Y PI+GA 
Sbjct: 1756 GDRPMILINPDLVDKASSQGQQSVRGRQDRIDFANSFKTIFQFQNIYYSGTSYFPILGAS 1815

Query: 248  RMTYPNG-YELYKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 289
             +  P   +  ++R D+  G GE YV +      P S+ +   F
Sbjct: 1816 TLLGPEEMWVSHQRRDLKGGKGEVYVPVYASESMPDSEVLMSTF 1859


>gi|308806305|ref|XP_003080464.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
 gi|116058924|emb|CAL54631.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
          Length = 284

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 106/264 (40%), Positives = 153/264 (57%), Gaps = 16/264 (6%)

Query: 30  IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 89
           IPT LNN+PH+RE R +FYDD   +T  AV  G  R+K++   P + P            
Sbjct: 35  IPTLLNNVPHARETRRWFYDDACASTLAAVGAGERRMKIKTEFPGVEP------------ 82

Query: 90  LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 149
           L R +A + A DGKRVK+CVQ SMG+G    + L L+G  K+L  MD+ +  A+      
Sbjct: 83  LTRAMAQTLARDGKRVKICVQRSMGQGVFQALTLSLSGVSKLLGMMDFEEDVAE-MIDPN 141

Query: 150 GSIGANE-VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSG 208
           G IGA +  ++  D +ILV PQN VG  ++  + AM +A G +P+I+INP+L D+  +  
Sbjct: 142 GGIGAEDGAEDGADAYILVQPQNIVGYSVLPYIAAMEEAVGDKPMIMINPKLDDIQSAGN 201

Query: 209 IMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGG 267
           +M   GR +R++  A++   Y FRLLY     +PI GALR T Y N +ELYKRV +    
Sbjct: 202 VMSIRGRQERMDAVAAWREIYHFRLLYRKPYFFPIYGALRYTYYENAWELYKRVGMRE-E 260

Query: 268 EKYVILSTFTEKPSSDDINDAFLG 291
           E+YV+ ST+  +P  D++     G
Sbjct: 261 EEYVLRSTYDSEPDPDEMTKQIWG 284


>gi|223997276|ref|XP_002288311.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
 gi|220975419|gb|EED93747.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
          Length = 231

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/230 (43%), Positives = 141/230 (61%), Gaps = 16/230 (6%)

Query: 34  LNNIPHSREIRNYFYDDVLQATQRA---VNDGR--TRLKVEINIPELNPEMDVYRIGTLM 88
           LN +PH+R +R YFYD   QA   A    ++G+   R+K+ +  PE+NP MD YRIGTL+
Sbjct: 1   LNQVPHNRFVRKYFYDMASQAALDAAVLCSEGKLTNRMKMTVMFPEMNPSMDSYRIGTLL 60

Query: 89  ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDW------GDYGA 142
           EL R IA+  A+   RV+VCVQGSMG G   G+P QL G   +L+ MDW       + G 
Sbjct: 61  ELARSIAIKLAEQNLRVRVCVQGSMGVGIFTGIPKQLNGVSTLLQRMDWESEVGEENEGM 120

Query: 143 KGTFVEIGSIGA--NEVDEQDDMFILVAPQNAVG-NC-IIDDMKAMTDAAGTRPVILINP 198
            G +V  G +G   N   +QDD+F+++ PQN +G  C II  +  M +AAG RPVI++NP
Sbjct: 121 VGDYVNFGFVGKEHNSAVQQDDVFLIICPQNMLGLECSIIQPLSEMVEAAGDRPVIILNP 180

Query: 199 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 247
            L D   S G     GR +R+++A SFE  + F+ +Y +GT Y PI+G++
Sbjct: 181 DLTDKISSQGQQSIRGRQQRIDFAESFETIFQFQNIYTSGTSYFPILGSV 230


>gi|62320102|dbj|BAD94280.1| adenylate kinase -like protein [Arabidopsis thaliana]
          Length = 95

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 80/95 (84%)

Query: 210 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEK 269
           MQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP  YELYKRV+  +G EK
Sbjct: 1   MQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEK 60

Query: 270 YVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 304
           YV+L+T+ E+P+ + I+DAF GK R++ KKASG W
Sbjct: 61  YVLLATYAERPTPEQIDDAFSGKSRDQSKKASGIW 95


>gi|147768155|emb|CAN78322.1| hypothetical protein VITISV_001685 [Vitis vinifera]
          Length = 228

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 47/61 (77%), Positives = 51/61 (83%)

Query: 8   KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
           K +   +  SN  SSSQ+ W+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLK
Sbjct: 167 KSIRASERLSNLASSSQEXWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLK 226

Query: 68  V 68
           V
Sbjct: 227 V 227


>gi|425442993|ref|ZP_18823225.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389715818|emb|CCH99873.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 247

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 74/270 (27%), Positives = 120/270 (44%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL FA  
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFAS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D+ F++V P +      I+ ++ +   A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + ++ E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS+  ND+    P N+ KK+  F
Sbjct: 208 ENLILKASSNTPNDS--SNPANKAKKSGLF 235


>gi|166367819|ref|YP_001660092.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
 gi|425464571|ref|ZP_18843881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|166090192|dbj|BAG04900.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
 gi|389833386|emb|CCI22146.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 247

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 120/270 (44%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGDRRIQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D+ F++V P +      I+ ++ + + A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + ++ E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS+  ND+    P N+ KK+  F
Sbjct: 208 ENLILKASSNTPNDS--SNPANKAKKSGLF 235


>gi|422304988|ref|ZP_16392325.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389789763|emb|CCI14274.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 247

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 71/271 (26%), Positives = 123/271 (45%), Gaps = 52/271 (19%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL FA  
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFA-- 50

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 154
                     S+ +G  +G+ +    T   +L   DWG+     T  ++G +G+      
Sbjct: 51  ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPIE 95

Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
           N++  +D+ F++V P +      I+ ++ + + A  RPV+L+ P+L+D+  S   +    
Sbjct: 96  NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149

Query: 215 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 272
           R  R  + ++ E+CY+FR L  A     YP +  + +   +GYEL         GE    
Sbjct: 150 RQLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA--- 206

Query: 273 LSTFTEKPSSDDINDAFLGKPRNREKKASGF 303
           L     K SS++  D     P N+ KK+  F
Sbjct: 207 LENLILKASSNNPKDG--SNPANKAKKSGLF 235


>gi|425435402|ref|ZP_18815857.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389680066|emb|CCH91215.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
          Length = 247

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 72/271 (26%), Positives = 121/271 (44%), Gaps = 52/271 (19%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFT-- 50

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 154
                     S+ +G  +G+ +    T   +L   DWG+     T  ++G +G+      
Sbjct: 51  ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPVE 95

Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
           N++  +D+ F++V P +      I+ ++ + + A  RPV+L+ P+L+D+  S   +    
Sbjct: 96  NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149

Query: 215 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 272
           R  R  + +  E+CY+FR L  A     YP M  + +   +GYEL         GE    
Sbjct: 150 RQLRERFLSILESCYYFRPLESAIVYRSYPSMWQVWLEKEDGYELISEQSTKPMGEA--- 206

Query: 273 LSTFTEKPSSDDINDAFLGKPRNREKKASGF 303
           L     K SS+  ND     P N+ KK+  F
Sbjct: 207 LENLILKASSNTPNDG--SNPANKAKKSGLF 235


>gi|427731149|ref|YP_007077386.1| hypothetical protein Nos7524_4017 [Nostoc sp. PCC 7524]
 gi|427367068|gb|AFY49789.1| protein of unknown function (DUF1995) [Nostoc sp. PCC 7524]
          Length = 246

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 69/252 (27%), Positives = 116/252 (46%), Gaps = 52/252 (20%)

Query: 42  EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
           E+ N   D + QA   T+ A+ DG TRL+VE+  PEL            M +       F
Sbjct: 3   ELPNTLEDAIAQAREATKAALADGYTRLQVELLFPELK----------FMPVAEQFLPLF 52

Query: 99  ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 156
           ++   R+K+      G  ALA         R+     DW D   +   +  G I +  ++
Sbjct: 53  SEYESRLKIFF-ADAGASALA---------RR-----DWADVPFQILDIGTGRIASIQSK 97

Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 216
           V  +D++F+ VAP +      +  ++ + +  G RP++++NPRL+D PG+ GI     R 
Sbjct: 98  VQPEDEIFLFVAPTSVE----VPQVEKICEIIGDRPLVMLNPRLED-PGTVGIGYA-ARQ 151

Query: 217 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
            R  + +  E+CY+ R         P+    A+   YP  +EL+  V+     +K+  L 
Sbjct: 152 TRQRFISKIESCYYLR---------PVDDETAVFRCYPGLWELW--VENSGTWQKFAELP 200

Query: 275 TFTEKPSSDDIN 286
              +KPS D+I+
Sbjct: 201 ---KKPSGDEID 209


>gi|354566056|ref|ZP_08985229.1| protein of unknown function DUF1995-containing protein [Fischerella
           sp. JSC-11]
 gi|353546564|gb|EHC16012.1| protein of unknown function DUF1995-containing protein [Fischerella
           sp. JSC-11]
          Length = 250

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 109/246 (44%), Gaps = 46/246 (18%)

Query: 59  VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
           + DG TRL+VE+  PEL           LME+ +     F     ++KV    + G G  
Sbjct: 23  IADGYTRLQVELLFPELK----------LMEVAQQFLPMFEQYNDKLKVFFADAGGAG-- 70

Query: 119 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNC 176
                        L   DW D   K T +  G   + E  V  +D++F+ +AP    G+ 
Sbjct: 71  -------------LARRDWTDVRFKITDIGTGRAASIESKVQPEDEIFLFIAP----GSP 113

Query: 177 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
            +  ++ +    G RP++++NP L+D  G  GI     R  R+ + ++ E+CY+ R    
Sbjct: 114 EVPQLEKICAVIGDRPLVMLNPFLED-AGKIGIGYA-ARQTRIHFLSTIESCYYLR---- 167

Query: 237 AGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNR 296
             + YP +   R  YP  +E+     V + GE Y  ++   +KPS DD+ D  L K +  
Sbjct: 168 --SVYPDVALFR-CYPGLWEIL----VENNGE-YQKIAELPQKPSGDDL-DIILAKGQAA 218

Query: 297 EKKASG 302
              A G
Sbjct: 219 TAIAEG 224


>gi|425449243|ref|ZP_18829085.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|389764162|emb|CCI09454.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
          Length = 247

 Score = 71.2 bits (173), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 121/270 (44%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D++F++V P +      I+ ++ + + A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEVFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + +  E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS++ +D+    P N+ KK+  F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235


>gi|434398468|ref|YP_007132472.1| protein of unknown function DUF1995-containing protein [Stanieria
           cyanosphaera PCC 7437]
 gi|428269565|gb|AFZ35506.1| protein of unknown function DUF1995-containing protein [Stanieria
           cyanosphaera PCC 7437]
          Length = 247

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 70/260 (26%), Positives = 114/260 (43%), Gaps = 59/260 (22%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +AT  A+ DG  R++VE+ IPE+  +  V        L    A  FA+ G  +KV    S
Sbjct: 17  EATLAALADGYNRVQVELVIPEITLQAQV--------LALEFATIFAEYGSGLKVMFPDS 68

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 165
            G  ALA                DWG       D G++ T VE       +V EQD +FI
Sbjct: 69  -GAAALAK--------------RDWGEVPFQVTDLGSRYTAVE------TQVSEQDQIFI 107

Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
           +VAP        ++ ++ + + A  RPV+L+ P+L+D+    GI  T  R  R  +  S 
Sbjct: 108 VVAP----AAIEVEKVEKLCNVAEDRPVVLLIPQLEDI-AFVGIGYT-ARQMRERFIKSL 161

Query: 226 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 285
            +CY+ R L  A        A+   +P+ ++++   +     E+Y +      +P  +D+
Sbjct: 162 YSCYYLRPLEEA--------AILRCHPSSWQIWLEKE-----EEYQLAMELAHQPMGEDL 208

Query: 286 NDAFLG----KPRNREKKAS 301
               +      P N+ + AS
Sbjct: 209 ERLLMKLTNPNPENKNEVAS 228


>gi|425459538|ref|ZP_18839024.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|389822715|emb|CCI29585.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
          Length = 247

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 118/270 (43%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D+ F++V P +      I+ ++ + + A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + +  E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS+  ND     P N+ KK+  F
Sbjct: 208 ENLILKASSNTPNDG--SNPANKAKKSGLF 235


>gi|425446838|ref|ZP_18826837.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389732775|emb|CCI03345.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
          Length = 247

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 119/270 (44%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  RL+VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARRLQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D+ F++V P +      I+ ++ + + A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + +  E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS++  D+    P N+ KK+  F
Sbjct: 208 ENLILKASSNNPKDS--SNPANKAKKSGLF 235


>gi|428781510|ref|YP_007173296.1| hypothetical protein Dacsa_3447 [Dactylococcopsis salina PCC 8305]
 gi|428695789|gb|AFZ51939.1| protein of unknown function (DUF1995) [Dactylococcopsis salina PCC
           8305]
          Length = 253

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 68/235 (28%), Positives = 116/235 (49%), Gaps = 41/235 (17%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD-GKRVKVCVQG 111
           Q+T  A+  G TRL+VE+ +PE+  + +         + +     FADD G  +KV    
Sbjct: 17  QSTVAALEAGLTRLQVELKVPEIALKAE--------SIAKEFTDLFADDYGSGLKVLFPD 68

Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQN 171
           S G  ALA            +EF +  D G++ T +E       +V E+D +F++V+P +
Sbjct: 69  S-GAAALARR------NWSDVEF-NVNDLGSRNTPIE------KKVAEEDQIFLVVSP-S 113

Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
           AV    +  ++ + + AG RPVIL+ P+L+D+  + GI     R  R  + ++ E+C++ 
Sbjct: 114 AVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLSTLESCFYL 168

Query: 232 RLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           + L  A        AL   YP+G++L+    +  G  +Y       EKP  DD++
Sbjct: 169 QPLDEA--------ALLKRYPSGWQLW----IEKGENQYEFFCEEVEKPVGDDLD 211


>gi|440753103|ref|ZP_20932306.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
 gi|440177596|gb|ELP56869.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
          Length = 247

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D+ F++V P +      I+ ++ + + A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + +  E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS++ +D+    P N+ KK+  F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235


>gi|22299764|ref|NP_683011.1| hypothetical protein tlr2221 [Thermosynechococcus elongatus BP-1]
 gi|22295948|dbj|BAC09773.1| tlr2221 [Thermosynechococcus elongatus BP-1]
          Length = 232

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 59/255 (23%)

Query: 62  GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 121
           G+ RL++EI +P+LNP          M L       FA+ G  +KV      G  ALA  
Sbjct: 27  GKNRLQIEILLPDLNP----------MGLAAGYLPLFAELGSDLKVFF-ADAGSAALA-- 73

Query: 122 PLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIID 179
                  R+     +WG+  Y  +G    +  I A+     D   ++VAP        + 
Sbjct: 74  -------RR-----EWGETAYSVRGVNERLTPIEAS-----DRAIVIVAPTPVE----VT 112

Query: 180 DMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT 239
            ++ M   AG RP IL+NPRL+D+    GI    GR  R  +  + E CY+ R L  A T
Sbjct: 113 AIEQMCLTAGDRPFILLNPRLQDV-AVVGIGYA-GRQLRERFLNTLEPCYYLRPL--AET 168

Query: 240 QYPIMGALRMTYPNGYELYK-RVDVPSGGEKYVILSTFTEKPSSDDINDAF--LGKPRNR 296
                  L   YP  +++++ R   P+       L+ F ++P+S+DI  A   LGKPR  
Sbjct: 169 VI-----LWRCYPQAWQIWQYRETAPT------CLAEFEQRPTSEDIERALSALGKPR-- 215

Query: 297 EKKASGFWGFLSSVF 311
              A  F   LS++ 
Sbjct: 216 ---AQSFLSRLSALL 227


>gi|425472740|ref|ZP_18851581.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389881081|emb|CCI38316.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 247

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 118/270 (43%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D++F++V P +      I+ ++ +   A  RPV+L+ P+L+D+  S   +    R
Sbjct: 97  KIKPEDEVFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + +  E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS+  ND     P N+ KK+  F
Sbjct: 208 ENLILKASSNTPNDG--SNPANKAKKSGLF 235


>gi|443658098|ref|ZP_21132025.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
 gi|159027702|emb|CAO89569.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443333038|gb|ELS47616.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
          Length = 247

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++ ++D+ F++V P +      I+ ++ +   A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKDEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + +  E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS++ +D+    P N+ KK+  F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235


>gi|282899475|ref|ZP_06307441.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
 gi|281195632|gb|EFA70563.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
           CS-505]
          Length = 245

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 107/247 (43%), Gaps = 50/247 (20%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           A + A++DG+TR++VE+  PEL            M +       F +   R+KV    + 
Sbjct: 21  AVRSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESRLKVFFADAG 70

Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 171
                                 DWGD   K   +  G   ++E  +  +D++F+ +AP +
Sbjct: 71  AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115

Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
                 +  ++ +    G RP +++NPRL+D   S   +    R+ R  + ++ E+CY+ 
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED--SSVVGIGYAARETRRRFISTIESCYYL 169

Query: 232 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
           R         PI    AL  +YP  +E++   D    GE Y  ++   +KPS D+I+   
Sbjct: 170 R---------PIDEQSALMRSYPGNWEIWLETD----GE-YQKIAELPQKPSGDEIDSIL 215

Query: 290 L-GKPRN 295
           + G+P  
Sbjct: 216 IKGQPET 222


>gi|428311732|ref|YP_007122709.1| hypothetical protein Mic7113_3579 [Microcoleus sp. PCC 7113]
 gi|428253344|gb|AFZ19303.1| protein of unknown function (DUF1995) [Microcoleus sp. PCC 7113]
          Length = 249

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 115/236 (48%), Gaps = 45/236 (19%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +ATQ A+NDGRTRL+VE+  PE+   +    I       + I L F + G  +KV    +
Sbjct: 17  EATQSALNDGRTRLQVELVFPEIA--LQAQSIAQ-----QFIPL-FEEYGSGLKVLFPDT 68

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQN 171
            G  ALA         R+     DWG+   K + V    S   N++  +D  F+LVAP +
Sbjct: 69  -GAAALA---------RR-----DWGEVPFKISDVGTSRSPITNKIQAEDKAFLLVAP-S 112

Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
           AV    +  ++ + + AG RP +L+NP+L+D+  S   +    R  R    ++ E CY+ 
Sbjct: 113 AVE---VAQVETLCNLAGDRPCVLLNPQLEDI--SIVGIGMAARKLRERLLSTIEPCYYL 167

Query: 232 RLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           R         PI   A+  +YP  ++++  +D     ++Y +++   ++P+ D + 
Sbjct: 168 R---------PIDPAAILRSYPGLWQVWLEID-----DEYQLIAEEPQRPAGDALE 209


>gi|425455386|ref|ZP_18835106.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389803734|emb|CCI17368.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 247

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 119/270 (44%), Gaps = 50/270 (18%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G  R++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLAIESGARRVQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------LFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D+ F++V P +      I+ ++ +   A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
             R  + +  E+CY+FR L  A     YP +  + +   +GYEL         GE    L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207

Query: 274 STFTEKPSSDDINDAFLGKPRNREKKASGF 303
                K SS++ +D+    P N+ KK+  F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235


>gi|443475522|ref|ZP_21065469.1| protein of unknown function DUF1995-containing protein
           [Pseudanabaena biceps PCC 7429]
 gi|443019641|gb|ELS33702.1| protein of unknown function DUF1995-containing protein
           [Pseudanabaena biceps PCC 7429]
          Length = 236

 Score = 68.2 bits (165), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 113/268 (42%), Gaps = 60/268 (22%)

Query: 49  DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 108
           +  + AT +A+  G TR+  ++  PEL            M +    A SF          
Sbjct: 19  EQAIAATHQAIASGATRVLADLRFPELKA----------MPIAYEFAASFNQQYGEAWQA 68

Query: 109 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 164
           +    G  ALA         RK      W D       + +   G NE    + E+D  F
Sbjct: 69  IFSDAGAAALA--------KRK------WVD-------LNVSVRGVNEGRRAIREEDRAF 107

Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
           +L+ P     +  ++ ++ +   AG RP +++NPRL++     G+     RD+   + ++
Sbjct: 108 LLIEP----TSIEVEQVEKLVQLAGDRPFVMLNPRLENSEVGLGLAARQMRDR---FLST 160

Query: 225 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 284
           FE  Y+ + L         +GAL   YP  ++++ + +   G +K   L+   ++PSSDD
Sbjct: 161 FETAYYIKPLE--------LGALWRCYPQTWQVWNQTE--EGMQK---LAEVEQRPSSDD 207

Query: 285 INDAFLGKPRNREKKASG-FWGFLSSVF 311
           I+  F    R +  K SG F G L   F
Sbjct: 208 IDRLF----RQKTGKQSGSFLGRLQQFF 231


>gi|186682862|ref|YP_001866058.1| hypothetical protein Npun_R2561 [Nostoc punctiforme PCC 73102]
 gi|186465314|gb|ACC81115.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
          Length = 244

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/269 (24%), Positives = 111/269 (41%), Gaps = 53/269 (19%)

Query: 42  EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
           E+ N   D + Q   A Q A+ DG TR++VE   PEL            M +       F
Sbjct: 3   ELPNTLEDAIAQSRTAVQAALADGCTRIQVEFLFPELK----------FMPVAEQFLPLF 52

Query: 99  ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 156
            +   R+K+    +                R+     +W D   +   +  G   +   +
Sbjct: 53  TEYDSRLKIFFADAGAAAL----------ARR-----NWTDAPFQILDIGTGRTASLQTK 97

Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 216
           +  +D++F+ +AP +      +  ++ +    G RPV+ +NPRL+D  G+ GI     R 
Sbjct: 98  IQPEDEIFLFIAPTSVE----VPQVEKLCQEIGDRPVVFLNPRLED-SGTVGIGYA-ARQ 151

Query: 217 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
            RL +  + E+CY+ R         PI    AL   YP  +E++   D    GE Y  ++
Sbjct: 152 TRLRFTNTIESCYYLR---------PIDEQSALSRCYPGQWEVWLETD----GE-YQRIA 197

Query: 275 TFTEKPSSDDINDAFL-GKPRNREKKASG 302
               KPS DD++   L G+P+     A  
Sbjct: 198 ELPTKPSGDDLDQILLKGQPQTTTDAAPA 226


>gi|427418343|ref|ZP_18908526.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
 gi|425761056|gb|EKV01909.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
          Length = 237

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 71/269 (26%), Positives = 123/269 (45%), Gaps = 54/269 (20%)

Query: 34  LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV 93
           ++++P S E      +  + +T+ A+ DG  RL+VE+ IPEL  +    R  TL      
Sbjct: 1   MSDLPSSLE---ETIEQAIASTKAAIEDGIIRLQVEMVIPELKQQPIAERFLTL------ 51

Query: 94  IALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGS 151
               F D G + KV    + G  ALA         R+     DW   D+  +G    I  
Sbjct: 52  ----FQDLGLQFKVYFPDA-GAAALA---------RR-----DWDNPDFTIRG----INE 88

Query: 152 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 211
           I A  +   D+ F++V P +AV    + D++ + + A  RPV+L+NP+L+D+  S   + 
Sbjct: 89  IQAA-ITPDDEAFLIVNP-SAVE---VQDVEKLCNEAQDRPVVLLNPQLEDV--SIVGIG 141

Query: 212 TMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYV 271
              R  R  + ++  +CY++R +  +        A+   +P G+++++ V     G  Y 
Sbjct: 142 YAARQLRERFLSTLTSCYYYRPMTDS--------AVLRRHPQGWQVWQDV-----GNDYE 188

Query: 272 ILSTFTEKPSSDDINDAFLGKPRNREKKA 300
           + S   E+PSS+ +     G      K A
Sbjct: 189 LKSELPERPSSEALEKILYGSEDTEGKAA 217


>gi|428225033|ref|YP_007109130.1| hypothetical protein GEI7407_1587 [Geitlerinema sp. PCC 7407]
 gi|427984934|gb|AFY66078.1| protein of unknown function DUF1995 [Geitlerinema sp. PCC 7407]
          Length = 244

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 124/252 (49%), Gaps = 53/252 (21%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF-ADDGKRVKVCVQGS 112
           ATQ A++DG +RL+VE+  PE++ +         + +  + A  F A  G R K+    +
Sbjct: 18  ATQAALDDGHSRLQVELVFPEIDLQA--------ISIAELFAADFEARYGDRFKILFPDA 69

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGAN-EVDEQDDMFILVAP 169
              GA A     LA  R       WG+  Y  +G    I  + A  + DEQ   F+++AP
Sbjct: 70  ---GAAA-----LAKHR-------WGEKPYEIRG----ISELKAQIQPDEQ--AFLMIAP 108

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
                +  +  ++   + A  RPV+++NPRL+D+  + GI    GR  R  + ++  +CY
Sbjct: 109 ----SSVEVGPVEKFCEEASDRPVVMVNPRLEDV-ATIGIGYA-GRQLRERFLSTLLSCY 162

Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
           + R   + G      GALR +YP  +E++  ++  SG EK   ++  ++KP  D + D  
Sbjct: 163 YLR--PFEG------GALRRSYPGPWEVW--LETESGYEK---VAEESQKPVGDAL-DQI 208

Query: 290 LGKPRNREKKAS 301
           +G+ +  E + S
Sbjct: 209 IGRVQGAETEGS 220


>gi|16329818|ref|NP_440546.1| hypothetical protein slr1702 [Synechocystis sp. PCC 6803]
 gi|383321560|ref|YP_005382413.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383324730|ref|YP_005385583.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383490614|ref|YP_005408290.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384435880|ref|YP_005650604.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
 gi|451813978|ref|YP_007450430.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
 gi|1652303|dbj|BAA17226.1| slr1702 [Synechocystis sp. PCC 6803]
 gi|339272912|dbj|BAK49399.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
 gi|359270879|dbj|BAL28398.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359274050|dbj|BAL31568.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359277220|dbj|BAL34737.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407957706|dbj|BAM50946.1| hypothetical protein BEST7613_2015 [Synechocystis sp. PCC 6803]
 gi|451779947|gb|AGF50916.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
          Length = 251

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 120/244 (49%), Gaps = 55/244 (22%)

Query: 50  DVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCV 109
           + + ATQRA++DG  R++VE+ +PE+  +         +E  ++    FAD G  +++  
Sbjct: 15  EAIAATQRALDDGYRRIQVELAVPEIALQAQ----AIALEFSQL----FADQGLGLRIIF 66

Query: 110 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------NEVDEQDDM 163
             + G  ALA                DWG+        +IG +G+       ++ E D +
Sbjct: 67  PDT-GAAALAK--------------RDWGEVP-----FQIGDLGSRYTPIGRKIAESDQV 106

Query: 164 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 223
           F++V+P +AV    +  ++ + + AG RPV+L+ P+L+D+  S   +    R  R  + +
Sbjct: 107 FLVVSP-SAVE---VQSVEKLCELAGDRPVVLLIPQLEDV--SIVGIGYAARQLRQRFLS 160

Query: 224 SFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 282
           +  + Y+FR         P+ GA+ + ++P+ +++     +   GE+Y +L+   EKP+ 
Sbjct: 161 TLFSAYYFR---------PLDGAVVLRSHPSRWQV-----LLEKGEEYELLTELPEKPAG 206

Query: 283 DDIN 286
           + + 
Sbjct: 207 EALE 210


>gi|282896506|ref|ZP_06304526.1| conserved hypothetical protein [Raphidiopsis brookii D9]
 gi|281198612|gb|EFA73493.1| conserved hypothetical protein [Raphidiopsis brookii D9]
          Length = 245

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 60/248 (24%), Positives = 108/248 (43%), Gaps = 52/248 (20%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           A + A++DG+TR++VE+  PEL            M +       F +    +KV    + 
Sbjct: 21  AAKSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESGLKVFFADAG 70

Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 171
                                 DWGD   K   +  G   ++E  +  +D++F+ +AP +
Sbjct: 71  AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115

Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGI-MQTMGRDKRLEYAASFENCYF 230
                 +  ++ +    G RP +++NPRL+D   SS + +    R+ R  + ++ E+CY+
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED---SSVVGIGYAARETRRRFISTIESCYY 168

Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
            R         PI    AL  +YP  +E++   D    GE Y  ++   +KPS D+I+  
Sbjct: 169 LR---------PIDEQSALMRSYPGNWEIWLETD----GE-YRKIAELPQKPSGDEIDSI 214

Query: 289 FL-GKPRN 295
            + G+P  
Sbjct: 215 LIKGQPET 222


>gi|300865956|ref|ZP_07110692.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300336022|emb|CBN55850.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 248

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 73/273 (26%), Positives = 121/273 (44%), Gaps = 60/273 (21%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 109
           +A + A+ND +TRL++E+  PE+  +        I    EL   + + F D G       
Sbjct: 17  EAVKAALNDAQTRLQIELVFPEIALQAQSITQQFITDFEELANGLKVFFPDTGS------ 70

Query: 110 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG---ANEVDEQDDMFIL 166
                  ALA         R+     DWG    K T  ++GS     A +++ +D +F+L
Sbjct: 71  ------AALA---------RR-----DWGTVPFKIT--DLGSSRSPVAEKIEPEDQIFLL 108

Query: 167 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 226
           V   NA  +  +  ++ + +AA +RPVIL+NPRL+D   + GI    GR  R  +  + +
Sbjct: 109 V---NA-SSIEVAQVEQLCNAADSRPVILLNPRLEDA-ATIGIGYA-GRQLRDRFLNTLQ 162

Query: 227 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           +CY+ R L  A        AL   YP  ++++      + GE Y ++S   +KP  D++ 
Sbjct: 163 SCYYIRPLPTA--------ALFRCYPQSWQVWLE---ETEGE-YKLISETAQKPVGDELE 210

Query: 287 DAFL--------GKPRNREKKASGFWGFLSSVF 311
                         P++   K  G    L S+F
Sbjct: 211 RIIAPTVQNGDPSAPQSAPVKKKGMLAELQSLF 243


>gi|434407903|ref|YP_007150788.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
           7417]
 gi|428262158|gb|AFZ28108.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
           7417]
          Length = 244

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 58/240 (24%), Positives = 111/240 (46%), Gaps = 53/240 (22%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +ATQ A+ DG  R++++   PEL           ++ +       FA+   R+KV     
Sbjct: 17  EATQAALADGCKRIQIDFLFPELK----------ILPVAEQFLPVFAEYDSRLKVFF-AD 65

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGS----IGANEVDEQDDMFILVA 168
            G  ALA                DW D  A  + V+IG+       +++  +D++F+ +A
Sbjct: 66  AGGAALARR--------------DWAD--AAFSIVDIGTGRTASLQSKIQPEDEIFLFIA 109

Query: 169 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 228
           P +      +  ++ + +  G RPV+ +NPRL+D  G  GI  T  R+ R  + ++ E+C
Sbjct: 110 PTSVE----VPQLEKLCEIIGDRPVVFLNPRLED-SGVVGIGYT-ARETRRRFISTIESC 163

Query: 229 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           Y+ R         P+    A+  +YP  +E++   +     ++Y  ++   ++PS D+++
Sbjct: 164 YYLR---------PVDDETAVFRSYPGQWEVWVETN-----DEYQRIAELPKRPSGDELD 209


>gi|428775196|ref|YP_007166983.1| hypothetical protein PCC7418_0540 [Halothece sp. PCC 7418]
 gi|428689475|gb|AFZ42769.1| protein of unknown function DUF1995 [Halothece sp. PCC 7418]
          Length = 253

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 109/243 (44%), Gaps = 57/243 (23%)

Query: 53  QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 110
           Q+T  A+  G TRL+VE+ IPE  L  E        L+E          D G  +KV   
Sbjct: 17  QSTLTALEAGLTRLQVELRIPEIALQGEKIAKEFADLLE---------DDYGSGLKVLFP 67

Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDM 163
            + G  ALA         R+     +W        D G++ T +E       +V E D M
Sbjct: 68  DT-GAAALA---------RR-----NWADVSFQVNDLGSRNTPIE------KKVSEDDQM 106

Query: 164 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 223
           F+LV+P +AV    +  ++ + + AG RPVIL+ P+L+D+  + GI     R  R  + +
Sbjct: 107 FLLVSP-SAVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLS 160

Query: 224 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
           + E+CY+ + L  A        AL   YP+ ++L+    +  G   Y       EKP  D
Sbjct: 161 TLESCYYLQPLEEA--------ALLKRYPSSWQLW----IEKGENNYEFFCEEPEKPVGD 208

Query: 284 DIN 286
            ++
Sbjct: 209 TLD 211


>gi|254422515|ref|ZP_05036233.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
 gi|196190004|gb|EDX84968.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
          Length = 248

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 65/237 (27%), Positives = 115/237 (48%), Gaps = 58/237 (24%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD--DGKRVKVCVQG 111
           ATQ A++ G + L++E+ IPEL  +               IA  F +  + ++ KV    
Sbjct: 17  ATQAALDAGISLLQIEMAIPELKHQP--------------IAQQFVELFNDRQFKVLFPD 62

Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAP 169
           + G  ALA         R+     DW   D+  +G    +G + A  VD +DD++++V P
Sbjct: 63  A-GAAALA---------RR-----DWNNPDFVIRG----LGEL-AQTVDPEDDLYLIVNP 102

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
            +AV    +D ++A+ + A  +PV+L+NP+L+D+    GI     R  R  + +  E CY
Sbjct: 103 -SAVE---VDKVEALCNEALDQPVVLLNPQLEDV-AVVGIGYA-ARQLRDRFLSQIETCY 156

Query: 230 FFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 285
           + R         PI  G +   YP  +++++ +    G ++Y  +   + +PSS+DI
Sbjct: 157 YVR---------PIDQGVVYRAYPGPWQIWREI----GPDEYEHVQDLSNRPSSEDI 200


>gi|428317816|ref|YP_007115698.1| protein of unknown function DUF1995-containing protein
           [Oscillatoria nigro-viridis PCC 7112]
 gi|428241496|gb|AFZ07282.1| protein of unknown function DUF1995-containing protein
           [Oscillatoria nigro-viridis PCC 7112]
          Length = 248

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 67/240 (27%), Positives = 108/240 (45%), Gaps = 55/240 (22%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           AT  A++DG+T L+VE+  PE+  +     +  L E        F +    VKV    + 
Sbjct: 18  ATAAALSDGKTLLQVELVFPEIALQAQSITLQFLPE--------FEEIYPGVKVFFPDT- 68

Query: 114 GEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFIL 166
           G  ALA         R+     DWG       D G+  T VE      +++  +D +F+L
Sbjct: 69  GAAALA---------RR-----DWGETPFKVTDLGSSRTPVE------DKIAPEDQLFLL 108

Query: 167 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 226
           + P        +  ++ +  AA  RPVIL+NPRL+D+  + GI    GR  R  +    E
Sbjct: 109 INP----AAVEVAQVEKIYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLNKIE 162

Query: 227 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           +CY+ R L  A        AL   YP  ++++   +     ++Y ++S   +KP  DD+ 
Sbjct: 163 SCYYIRPLDTA--------ALFRCYPQPWQVWLETN-----DEYELISETAQKPVGDDLE 209


>gi|427715914|ref|YP_007063908.1| hypothetical protein Cal7507_0584 [Calothrix sp. PCC 7507]
 gi|427348350|gb|AFY31074.1| protein of unknown function DUF1995-containing protein [Calothrix
           sp. PCC 7507]
          Length = 243

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 108/248 (43%), Gaps = 55/248 (22%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           ATQ A+ DG TRL+VE   PEL P          M +        AD   R+KV      
Sbjct: 18  ATQAALADGYTRLQVEFLFPELKP----------MPVAEQYLPLLADYESRLKVFF-ADT 66

Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE----QDDMFILVAP 169
           G  ALA         R+        D+    T  +IG+  A  V +    ++++F+ +AP
Sbjct: 67  GAAALA---------RR--------DWDVPFTISDIGTGRATSVSDKIQPEEEIFLFIAP 109

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
                +  I  ++ +    G RP IL+NPRL+D     GI     R  R  +  + E CY
Sbjct: 110 ----SSVEISQLEKLFAEIGDRPAILLNPRLEDA-AIVGIGYA-ARQIRERFINTIETCY 163

Query: 230 FFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
           + R         P+    A+   YP  +E++    V + GE Y  ++   ++PS D+I+ 
Sbjct: 164 YLR---------PVDDQTAVFRCYPGLWEVW----VETNGE-YQKIAELPKRPSGDEIDL 209

Query: 288 AFL-GKPR 294
             L G+P+
Sbjct: 210 ILLKGQPQ 217


>gi|411117915|ref|ZP_11390296.1| protein of unknown function (DUF1995) [Oscillatoriales
           cyanobacterium JSC-12]
 gi|410711639|gb|EKQ69145.1| protein of unknown function (DUF1995) [Oscillatoriales
           cyanobacterium JSC-12]
          Length = 241

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 63/238 (26%), Positives = 107/238 (44%), Gaps = 53/238 (22%)

Query: 59  VNDGRTRLKVEINIPELNPEMDVYR-IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGA 117
           +  G +RL+VEI IPEL P    Y+ +    E    + + F D G         ++    
Sbjct: 23  LEAGYSRLQVEIAIPELKPMQPAYQYLSVFEEYGSHLKVFFTDAGA-------AALARRE 75

Query: 118 LAGMP-----LQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNA 172
            AG P     + + GTR+              T VE        V  +D +F+ VAP +A
Sbjct: 76  WAGSPYNIQSIDIQGTRQT-------------TPVE------QLVLSEDRVFVFVAP-SA 115

Query: 173 VGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFR 232
           V   +++    + +AAG RPVIL NPR++D+  S   +    R  R  +  + E CY+ +
Sbjct: 116 VEVGVVEQ---IANAAGDRPVILFNPRMEDV--SVVGIGYAARKLRERFLNTIEPCYYLK 170

Query: 233 LLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
                    P+ G AL   YP+ ++++         E Y +++  T+KP+ + +++ F
Sbjct: 171 ---------PLEGSALIRCYPSLWQVWAETS-----EGYTLIAEETQKPTLERLDEIF 214


>gi|220907967|ref|YP_002483278.1| hypothetical protein Cyan7425_2561 [Cyanothece sp. PCC 7425]
 gi|219864578|gb|ACL44917.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
          Length = 233

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/233 (25%), Positives = 101/233 (43%), Gaps = 54/233 (23%)

Query: 62  GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 121
           G +RL+V++ IPEL P          M + +     F+D G  +KV    +         
Sbjct: 26  GYSRLQVDLLIPELKP----------MPVAQQYLSLFSDYGAGLKVFFADAGAAALARRD 75

Query: 122 PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCI 177
                          WGD       V     G NE    +  +D  F+LVAP        
Sbjct: 76  ---------------WGD-------VPFTIRGVNELLNPIQPEDRAFVLVAPTPVE---- 109

Query: 178 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 237
           +  ++AM + AG RP IL+N +L+D+  + GI    GR  R  + A+FE CY+ R L + 
Sbjct: 110 VMQVEAMANQAGDRPFILLNAKLEDI-ATIGIGYA-GRQLRQRFLATFEPCYYLRPLDW- 166

Query: 238 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
                  GA+   YP+ ++++    +    ++Y +++   E+P+ D ++   L
Sbjct: 167 -------GAVLRIYPSPWQVW----LEQAEDQYQLIAEEAERPAGDRLDQILL 208


>gi|119510288|ref|ZP_01629424.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
 gi|119465032|gb|EAW45933.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
          Length = 244

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 112/247 (45%), Gaps = 50/247 (20%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           ATQ A+ DG TRL+V+   PEL           LM +       F +   R+K+    + 
Sbjct: 18  ATQAALADGYTRLQVDFLFPELK----------LMPVAEQFLSLFTEYDSRLKIFFPDA- 66

Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQN 171
           G  ALA                DW     K   +  G + +  +++  +D++F+ +AP +
Sbjct: 67  GGAALAN--------------RDWAGTPFKILDIGTGRVASIQSKIQPEDEIFLFIAPTS 112

Query: 172 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
                 +  ++ + +  G RP +++NPRL+D  G  GI  T  R  R  + ++ E+CY+ 
Sbjct: 113 VE----VPQVEKLCENIGDRPFVMLNPRLED-SGVVGIGYT-ARQTRQRFISTLESCYYL 166

Query: 232 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
           R         P+    A+   YP  +E++  ++    GE Y  ++   ++P+ D+++   
Sbjct: 167 R---------PVDDTTAVFRCYPGLWEVWVEIN----GE-YQKVAELPKRPTGDELDVIV 212

Query: 290 L-GKPRN 295
           + G+P+ 
Sbjct: 213 MQGQPQT 219


>gi|443309167|ref|ZP_21038918.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
 gi|442780785|gb|ELR90927.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
          Length = 243

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 60/239 (25%), Positives = 100/239 (41%), Gaps = 44/239 (18%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           ATQ A+ DG  RL+VEI +PELNP                +A  F  D + +        
Sbjct: 18  ATQNAIADGLNRLQVEILLPELNP--------------MPVAERFLSDEEGISHNFNKIF 63

Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAPQNA 172
              A A      A  R+     DWG+   +   +    +   E +  +D+  + +AP   
Sbjct: 64  FPDAGAA-----ALARR-----DWGEVPFELLDISTKRVSVEEQIQPEDEAILCIAPTAQ 113

Query: 173 VGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFR 232
                +  ++ + +A G RPV+L+NPRL+D+  S   +   GR  R  +  + ++CY+ R
Sbjct: 114 E----VLQIEKLCNAMGDRPVVLLNPRLEDV--SIVGIGYAGRQLRDRFLKNIQSCYYLR 167

Query: 233 LLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 291
            L          G L   YP  ++++         + Y ++     KP  DD++  F G
Sbjct: 168 PLEE--------GTLFRCYPGLWQVWCETT-----DGYELIKELPNKPVGDDLDLIFAG 213


>gi|434395506|ref|YP_007130453.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
           sp. PCC 7428]
 gi|428267347|gb|AFZ33293.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
           sp. PCC 7428]
          Length = 243

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 109/230 (47%), Gaps = 50/230 (21%)

Query: 61  DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 120
           DG + L++EI IPELNP          M +      +    G ++KV    + G  ALA 
Sbjct: 25  DGYSLLQIEILIPELNP----------MPVAEQFLPALDSFGNQLKVFFPDA-GAAALA- 72

Query: 121 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ----DDMFILVAPQNAVGNC 176
                   R+     DWG    K   ++IG+ G   VD+Q    D+ F+LVAP +AV   
Sbjct: 73  --------RR-----DWGSVPFK--ILDIGT-GRVPVDQQVLPEDEAFLLVAP-SAVE-- 113

Query: 177 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
            +  ++ +  A G RP IL+NPRL+D+  S   +   GR  R  +  + E+CY  R L  
Sbjct: 114 -VAQVEKLHQAVGERPFILLNPRLEDV--SIVGIGYAGRQLRARFLNTIESCYHLRPLDG 170

Query: 237 AGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           A        A+   YP+ +++++     +   +Y +++   +KP  D+++
Sbjct: 171 A--------AVFRCYPSPWQVWQE----NKDGEYQLIAEQPKKPMGDEVD 208


>gi|56752173|ref|YP_172874.1| hypothetical protein syc2164_c [Synechococcus elongatus PCC 6301]
 gi|81300739|ref|YP_400947.1| hypothetical protein Synpcc7942_1930 [Synechococcus elongatus PCC
           7942]
 gi|56687132|dbj|BAD80354.1| hypothetical protein [Synechococcus elongatus PCC 6301]
 gi|81169620|gb|ABB57960.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
          Length = 247

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 111/242 (45%), Gaps = 53/242 (21%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +A QRA+  G TR++V++ IPEL  +   ++             +F D G +++V     
Sbjct: 17  EAAQRALAAGLTRIQVDLAIPELKIQPIAWQFLA----------AFLDRGAKLRVFFPDP 66

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 170
            G  ALA         R+     DWG+  +  +G    +  + A  V  ++++FI + P 
Sbjct: 67  -GAAALA---------RR-----DWGELPFSVRG----MEELKAA-VQPEEELFIFIEP- 105

Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
               +  +  ++ +    G RPVIL+NPRL+D+  + GI     R  R  +   +++ Y+
Sbjct: 106 ---SSVEVQRLEQLCQEIGDRPVILLNPRLEDV-ATIGIGYA-ARQLRERFLNQWQSAYY 160

Query: 231 FRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
                      P+ G A+   YP  +++++  D       Y +L  + ++P+ +D++  F
Sbjct: 161 LS---------PLEGAAIFQAYPQRWQVWQETDT-----GYELLQEYDQRPNGEDLDRLF 206

Query: 290 LG 291
            G
Sbjct: 207 AG 208


>gi|334118025|ref|ZP_08492115.1| Domain of unknown function DUF1995-containing protein [Microcoleus
           vaginatus FGP-2]
 gi|333460010|gb|EGK88620.1| Domain of unknown function DUF1995-containing protein [Microcoleus
           vaginatus FGP-2]
          Length = 248

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/239 (27%), Positives = 109/239 (45%), Gaps = 53/239 (22%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV---IALSFADDGKRVKVCVQ 110
           AT  A++DG+T L+VE+  PE+  +        L EL  +   + + F D          
Sbjct: 18  ATAAALSDGKTLLQVELVFPEIALQAQSITEQFLPELEEIYPGVKVFFPD---------- 67

Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILV 167
              G  ALA         R+     DWG+   K T  ++GS  +   +++  +D +F+L+
Sbjct: 68  --AGAAALA---------RR-----DWGETPFKVT--DLGSSRSPVEDKIAPEDQLFLLI 109

Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
            P        +  ++ +  AA  RPVIL+NPRL+D+  + GI    GR  R  + +  E+
Sbjct: 110 NP----AAVEVAQVERLYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLSKIES 163

Query: 228 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           CY+ R L  A        AL   YP  ++++   +      +Y ++S   +KP  DD+ 
Sbjct: 164 CYYVRPLDAA--------ALFRCYPQSWQVWLERN-----NQYELISETAQKPVGDDLE 209


>gi|452823755|gb|EME30763.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
          Length = 1138

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)

Query: 11  HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
           + R+  +NQ   SQ++  G+P +   ++ IP+S        ++ L A   A   G  RL 
Sbjct: 50  NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104

Query: 68  VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 125
           ++I+  + +     ++  +  L EL+RVI    + + +     +    G  A A      
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155

Query: 126 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 185
              RK     DW    +K +F+  GS      D   D  I++ P +AV      D++AM 
Sbjct: 156 ---RK-----DWN--ISKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196

Query: 186 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
                     +   RP+ILINP+L D+ G++G +    R  R ++ ++FE+ YF R+  +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254

Query: 237 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
                QYP   ++ +   N  E     + P     Y +L TF  KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305


>gi|452823754|gb|EME30762.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
          Length = 1152

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)

Query: 11  HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 67
           + R+  +NQ   SQ++  G+P +   ++ IP+S        ++ L A   A   G  RL 
Sbjct: 50  NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104

Query: 68  VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 125
           ++I+  + +     ++  +  L EL+RVI    + + +     +    G  A A      
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155

Query: 126 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 185
              RK     DW    +K +F+  GS      D   D  I++ P +AV      D++AM 
Sbjct: 156 ---RK-----DWN--ISKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196

Query: 186 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
                     +   RP+ILINP+L D+ G++G +    R  R ++ ++FE+ YF R+  +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254

Query: 237 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
                QYP   ++ +   N  E     + P     Y +L TF  KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305


>gi|440681570|ref|YP_007156365.1| protein of unknown function DUF1995-containing protein [Anabaena
           cylindrica PCC 7122]
 gi|428678689|gb|AFZ57455.1| protein of unknown function DUF1995-containing protein [Anabaena
           cylindrica PCC 7122]
          Length = 244

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/238 (23%), Positives = 101/238 (42%), Gaps = 49/238 (20%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +A + A+ DG TR++V+   PEL            M +       FA+   R+K+    +
Sbjct: 17  EAVKSALADGMTRIQVDFLFPELK----------FMPIAEQFIPLFAEYESRLKIFFADA 66

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQ 170
                                   W D   K   +  G   + E  +  +D++F+ +AP 
Sbjct: 67  GAAALARRD---------------WADSPFKVEDIGTGRTASLETKIQPEDEIFLFIAPT 111

Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
           +      +  ++ + +  GTRP IL+NPRL+D  G  GI     R+ R  + ++ E+CY+
Sbjct: 112 SVE----VPQLEKLCEIIGTRPFILLNPRLED-SGVVGIGYA-ARETRRRFISTIESCYY 165

Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
            R         P+    AL   YP  +E++   +     ++Y  ++   ++PS D+I+
Sbjct: 166 LR---------PVDDESALFRCYPGDWEVWLETN-----DEYQKIAELPKRPSGDEID 209


>gi|428300086|ref|YP_007138392.1| hypothetical protein Cal6303_3487 [Calothrix sp. PCC 6303]
 gi|428236630|gb|AFZ02420.1| protein of unknown function DUF1995-containing protein [Calothrix
           sp. PCC 6303]
          Length = 250

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 68/259 (26%), Positives = 112/259 (43%), Gaps = 54/259 (20%)

Query: 42  EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
           ++ N   D + QA   T  A+ DG TRL+VE+  PEL P          M +      +F
Sbjct: 3   QLPNSLEDAIFQAREATLAAIADGCTRLQVELLFPELKP----------MPVAEEFIPTF 52

Query: 99  ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-- 156
           A  G ++KV    +                       DW D   K   V  G   A +  
Sbjct: 53  ASYGDKLKVFFADAGAAALAR---------------RDWKDVPFKIWDVGTGRASAAQPK 97

Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGIMQTMGR 215
           +  +D+MF+ +AP +      + +++ +    G  RP +++NP+L+D  G+ GI     R
Sbjct: 98  IQAEDEMFLFIAPTSVE----VAELERLCGEIGEQRPFVMLNPKLED-SGTVGIGYA-AR 151

Query: 216 DKRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVIL 273
           + R+ + ++ E+CY+ R         P+    AL   YP  +E++    V   GE Y  +
Sbjct: 152 NIRMRFISTIESCYYLR---------PVDDETALFRCYPGMWEVW----VDKDGE-YKRI 197

Query: 274 STFTEKPSSDDINDAFLGK 292
           +    KPS D++ D  L K
Sbjct: 198 AELANKPSGDEL-DLILAK 215


>gi|428306245|ref|YP_007143070.1| hypothetical protein Cri9333_2705 [Crinalium epipsammum PCC 9333]
 gi|428247780|gb|AFZ13560.1| protein of unknown function DUF1995-containing protein [Crinalium
           epipsammum PCC 9333]
          Length = 248

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 113/242 (46%), Gaps = 51/242 (21%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +AT+ A+ DG TRL VE+  PEL            M +         + G  +KV    +
Sbjct: 17  EATKTAIADGHTRLMVELVFPELK----------TMPVASQFIPELEEYGSHLKVFFTDA 66

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
            G  ALA         R+     DWG+   K    ++GS  +   N+++E+D +F+L+ P
Sbjct: 67  -GAAALA---------RR-----DWGEVPFK--ISDLGSSRSSVQNKIEEEDQIFLLIEP 109

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
            +AV    I  ++ + +AAG RPVIL+ PRL+D     GI           YAA      
Sbjct: 110 -SAVE---IAQVEQLCNAAGDRPVILLVPRLEDA-AVVGIG----------YAARQLRDR 154

Query: 230 FFRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
           F + LY +    P+ G AL  ++P+ ++++   +     + Y +++  T+KP  + ++  
Sbjct: 155 FIKTLYSSYYIRPLEGAALFRSHPSPWQVWLETN-----DDYNLIAEETQKPVGETLDQI 209

Query: 289 FL 290
            L
Sbjct: 210 IL 211


>gi|427705929|ref|YP_007048306.1| hypothetical protein Nos7107_0483 [Nostoc sp. PCC 7107]
 gi|427358434|gb|AFY41156.1| protein of unknown function DUF1995-containing protein [Nostoc sp.
           PCC 7107]
          Length = 244

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 66/240 (27%), Positives = 113/240 (47%), Gaps = 53/240 (22%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +AT+ A+ DG TR++V++  PEL            M +       FA+   R+KV     
Sbjct: 17  EATKAALADGYTRVQVDLLFPELKQ----------MPVAEQFLPLFAEYESRLKVFF-AD 65

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA----NEVDEQDDMFILVA 168
            G  ALA         R+     DW D  A    ++IG+  A    +++  +D++F+ V+
Sbjct: 66  AGGAALA---------RR-----DWVD--AAFQILDIGTGRAASIQSKIKPEDEIFLFVS 109

Query: 169 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 228
           P +AV    I  ++ + +  G RP++++NPRL+D  G+ GI     R  R  +  + E+C
Sbjct: 110 P-SAVE---IPQLEKVCEIIGDRPLVMLNPRLED-SGTVGIGYA-ARQIRERFLNTIESC 163

Query: 229 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           Y+ R         P+    A+   YP  +E+  +      GE +  ++   +KPS DDI+
Sbjct: 164 YYLR---------PVDENTAVFRCYPGQWEVLVQ-----KGETWEKIAELPKKPSGDDID 209


>gi|443320726|ref|ZP_21049809.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
 gi|442789566|gb|ELR99216.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
          Length = 241

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 64/236 (27%), Positives = 109/236 (46%), Gaps = 59/236 (25%)

Query: 54  ATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 111
           A   A+NDG  RL+VE+ IPE  L  +       TL          F + G  +KV +  
Sbjct: 17  AVTNALNDGYRRLQVELVIPEIALQAQALALEFTTL----------FTEYGSGLKVFLPD 66

Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 165
           + G  ALA         R+     DWG+     T   +  +G N       V++ D +F+
Sbjct: 67  T-GAAALA---------RR-----DWGE-----TPFAVTDLGTNRSPIETRVNDTDQIFL 106

Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
           +V+P +AV    +  ++ + + AG RPV+L+ P+L+D+  S   +    R  R  + ++ 
Sbjct: 107 VVSP-SAVE---VTQVEKLCNLAGDRPVVLLIPQLEDV--SIVGIGLAARQLRERFLSTL 160

Query: 226 ENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKP 280
           E+CY+F+         P+ GA  +  YP+ +++++       G  Y +L    EKP
Sbjct: 161 ESCYYFK---------PLEGAAVLKNYPSPWQVWRE-----SGNDYQLLCEQGEKP 202


>gi|428208370|ref|YP_007092723.1| hypothetical protein Chro_3395 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428010291|gb|AFY88854.1| protein of unknown function DUF1995 [Chroococcidiopsis thermalis
           PCC 7203]
          Length = 272

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 61/234 (26%), Positives = 111/234 (47%), Gaps = 51/234 (21%)

Query: 61  DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 120
           DG TRL+VE+ IPEL P          M +       F   G ++KV    + G  ALA 
Sbjct: 53  DGYTRLQVELAIPELKP----------MSVAEQFLAEFTQFGSQLKVFFPDA-GAAALA- 100

Query: 121 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNCII 178
                   R+     DWG+   K   V+IG+    E  ++ +D+  + ++P +AV    +
Sbjct: 101 --------RR-----DWGEVPFK--IVDIGTRSPLEEKIEPEDEALLFISP-SAVE---V 141

Query: 179 DDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAG 238
           + ++ + + A T P +++NPRL+D+    GI    GR  R  +  + E+CY+ R      
Sbjct: 142 ERVEKLCELA-TCPTVMLNPRLEDV-AIVGIGYA-GRQLRTRFLNNIESCYYLR------ 192

Query: 239 TQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 291
              PI   ++  +YP  +++++ ++     E++ +++    KP  D+I+    G
Sbjct: 193 ---PIENISVFRSYPGEWQIWREIE-----EEFQLITEQPTKPMGDEIDAILYG 238


>gi|390438390|ref|ZP_10226864.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|389838196|emb|CCI30988.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 247

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 104/235 (44%), Gaps = 45/235 (19%)

Query: 45  NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 101
           N   + +LQA   TQ A+  G   ++VE+ IPE+  +             + +AL F   
Sbjct: 5   NSLEETILQAKAATQLALESGARCVQVELVIPEIALQ------------AQALALDFTS- 51

Query: 102 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 155
                  +  S G G     P   A    +L   DWG+     T  ++G +G+      N
Sbjct: 52  -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96

Query: 156 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 215
           ++  +D+ F++V P +      I+ ++ + + A  RPV+L+ P+L+D+    GI     R
Sbjct: 97  KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150

Query: 216 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGE 268
             R  + ++ E+CY+FR L  A     YP +  + +   +GYEL         GE
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGE 205


>gi|307150883|ref|YP_003886267.1| hypothetical protein Cyan7822_0982 [Cyanothece sp. PCC 7822]
 gi|306981111|gb|ADN12992.1| Domain of unknown function DUF1995 [Cyanothece sp. PCC 7822]
          Length = 250

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 63/214 (29%), Positives = 98/214 (45%), Gaps = 44/214 (20%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +ATQ A+  G  RL+VE+ IPE+  E      G  +E   +    F   G  +KV    +
Sbjct: 17  EATQIALEAGLGRLQVELVIPEIALEAQ----GLALEFTSL----FEQYGSGLKVLFPDT 68

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 165
            G  ALA         R+     DWG       D G++   VE+      +V  +D+ F+
Sbjct: 69  -GAAALA---------RR-----DWGETPFKISDLGSRFIPVEM------KVTPEDEAFL 107

Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
           +V P     +  I  +  + + A  RPV+L+ P+L+D+    GI  T  R  R  + ++ 
Sbjct: 108 VVCP----ASIEIQSVAKLCEIAEQRPVLLLIPQLEDV-SIVGIGYT-ARQLRDNFLSTL 161

Query: 226 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYEL 257
           E+CY+FR L  A     YP +  + +    GYEL
Sbjct: 162 ESCYYFRPLEGAAVVRSYPGLWQVWLEKEQGYEL 195


>gi|414076889|ref|YP_006996207.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
 gi|413970305|gb|AFW94394.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
          Length = 244

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 58/253 (22%), Positives = 107/253 (42%), Gaps = 49/253 (19%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +A + A+ DG TR++VE+  PEL            M +       F +   R+KV    +
Sbjct: 17  EAVKSALADGMTRIQVELLFPELK----------FMTVAEQFLPLFTEYESRLKVFFADA 66

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQ 170
                                   W D   + + +  G   + E  +  +D++F+ +AP 
Sbjct: 67  GAAALARRD---------------WTDAQFQISDIGTGRAASLEAKIQPEDEIFLFIAPT 111

Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
           +      +  ++ + +  G RPVI++ PRL+D     GI  T  R+ R  + ++ E+CY+
Sbjct: 112 SVE----VPQLEKLCELIGERPVIMLTPRLED-SSVVGIGYT-ARETRRRFISTIESCYY 165

Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
            R         P+    AL   YP  +E++    + + GE Y  ++   +KPS D+++  
Sbjct: 166 IR---------PVDDESALFRCYPGLWEVW----LETAGE-YQKVAELPKKPSGDELDAI 211

Query: 289 FLGKPRNREKKAS 301
            +G  R     A+
Sbjct: 212 LMGGQRENTTDAT 224


>gi|86604746|ref|YP_473509.1| hypothetical protein CYA_0013 [Synechococcus sp. JA-3-3Ab]
 gi|86553288|gb|ABC98246.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
          Length = 246

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 63/283 (22%), Positives = 117/283 (41%), Gaps = 59/283 (20%)

Query: 33  RLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVR 92
           +L   PH         +  +QAT+ A+  G  RL +EI  P++ PE+        +ELV 
Sbjct: 6   QLTQFPHLPGDLAQACEQAIQATRLALQAGYRRLLIEILAPDIKPEVLAR---PFLELVP 62

Query: 93  VIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSI 152
             AL    D            G  ALA                   ++G+    +++ S+
Sbjct: 63  PPALVLFSDA-----------GGAALAQR-----------------EWGSLPDGIQLQSL 94

Query: 153 GA--NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-------TRPVILINPRLKDL 203
            A      EQ  +F++ A  +      +  + +  D +G        +PV+L+NP+L+D 
Sbjct: 95  TARTQPTPEQSVLFVMPAVYSLDQVERVCQLVSGGDPSGLAAKGRDPKPVVLLNPQLQD- 153

Query: 204 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 263
             ++  +   GR  R  + ++FE  Y+ R L        + GAL   YP+ + ++++ + 
Sbjct: 154 -AAAVGVGLAGRRLRQRFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEE- 203

Query: 264 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGF 306
                 Y +L TF  +PS +++ + F        K+   +W +
Sbjct: 204 ---PGLYSVLKTFRARPSGEEVAELF-----RASKRGDPWWSW 238


>gi|113475888|ref|YP_721949.1| hypothetical protein Tery_2247 [Trichodesmium erythraeum IMS101]
 gi|110166936|gb|ABG51476.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
          Length = 253

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 65/247 (26%), Positives = 118/247 (47%), Gaps = 54/247 (21%)

Query: 52  LQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 111
           ++AT+ A+ DG TR+++EI +P++  +           L +    +  +   ++KV    
Sbjct: 16  MEATKAALQDGYTRVQIEIVVPDIELQAQ--------SLAKQFIPALLETSTKLKVFFPD 67

Query: 112 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 165
           S G  ALA         R+     DW D     TF +I  +G +      +V+ +D  F+
Sbjct: 68  S-GAAALA---------RR-----DWQD----ATF-KIEDLGTSRSPVDKKVEPEDQCFL 107

Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
           L+AP +A+    +   + +++ AG RPVI++ P+L+D+    GI     R  R  +  + 
Sbjct: 108 LIAP-SAIE---VAQTEKLSNLAGDRPVIMLIPKLEDV-SIVGIGYA-ARQLRERFIKTI 161

Query: 226 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 285
           E+CY+ R L  A        AL   YP+ ++++    +   G+ Y +++   EKP  D++
Sbjct: 162 ESCYYIRSLGGA--------ALYRCYPSPWQVW----LEENGQ-YKLIAEQPEKPVGDEV 208

Query: 286 NDAFLGK 292
            D  L K
Sbjct: 209 -DMILAK 214


>gi|119484707|ref|ZP_01619189.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
 gi|119457525|gb|EAW38649.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
          Length = 249

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 66/247 (26%), Positives = 111/247 (44%), Gaps = 56/247 (22%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 109
           QATQ A++DG   ++VE+  PE+  +        I  + E   V+ + F D         
Sbjct: 17  QATQAALDDGYKLVQVELVFPEIELQAQAIAQQFIPAIEESGTVLKVFFPD--------- 67

Query: 110 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFIL 166
               G  ALA         R+     DWG+   K    ++GS  +   + V + D  F++
Sbjct: 68  ---AGAAALA---------RR-----DWGEIPFK--ISDLGSSRSPIDSRVKDDDGRFLV 108

Query: 167 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 226
           V+P        ++ ++ ++  AG R  IL+NPRL+D+    GI     R  R  + ++ E
Sbjct: 109 VSPTPVE----VEQVEKLSQLAGDRVTILLNPRLEDI-AIIGIGYA-ARALRDRFISTIE 162

Query: 227 NCYFFRLLYYAGTQYPIMG--ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 284
           +CY+ R         P+ G  AL   YP+ +E+++ +D    GE Y +L+    KP  D 
Sbjct: 163 SCYYLR---------PLEGDAALYRCYPSLWEVWQEID----GE-YTLLAQEQTKPVGDQ 208

Query: 285 INDAFLG 291
           ++    G
Sbjct: 209 LDQILAG 215


>gi|443327973|ref|ZP_21056578.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
 gi|442792382|gb|ELS01864.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
          Length = 248

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 68/266 (25%), Positives = 112/266 (42%), Gaps = 51/266 (19%)

Query: 45  NYFYDDVLQATQRA-------VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALS 97
           N F   + +A ++A       + DG TRL+VE+ IPE+  +             + +AL 
Sbjct: 2   NEFPQSIAEAVEQAKKTVIDAIADGYTRLQVELVIPEIPLQ------------AQSLALE 49

Query: 98  FADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIG 150
           FA+           S G G     P   A    +L   DWG       D G + T +E  
Sbjct: 50  FAN-------LFLESHGSGVKVMFPDTGAA---MLAKRDWGEIAFQVTDMGGRFTSIE-- 97

Query: 151 SIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIM 210
                ++ E D++FI+V P +      ++ ++ + +  G RPVIL+ P+L+D+    GI 
Sbjct: 98  ----QQIAETDEIFIMVCPSSVE----VERVETLCNLVGDRPVILLIPQLEDV-AVVGI- 147

Query: 211 QTMGRDKRLEYAASFENCYFFRLLYYAGTQ--YPIMGALRMTYPNGYELYKRVDV-PSGG 267
               R  R  + ++  +CY+ R L  A     +P    + +    GYEL   +   P G 
Sbjct: 148 GLAARRLRDRFISTLYSCYYIRPLEGASILRIHPSSWQIWLEKEEGYELAMELSSKPMGE 207

Query: 268 EKYVILSTFTEKPSSDDINDAFLGKP 293
           E   +L   T        N++ + KP
Sbjct: 208 ELDRLLIQLTTPEGEAGKNNSTIKKP 233


>gi|332705285|ref|ZP_08425366.1| protein of unknown function, DUF1995 [Moorea producens 3L]
 gi|332356028|gb|EGJ35487.1| protein of unknown function, DUF1995 [Moorea producens 3L]
          Length = 251

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 56/233 (24%), Positives = 109/233 (46%), Gaps = 43/233 (18%)

Query: 59  VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
           ++DG+TR++VE+  PE+  +          E + +    F   G  +K+    + G  AL
Sbjct: 23  LDDGQTRVQVELVFPEIALQAQ----SIAQEFIPI----FEQYGPGLKIFFPDT-GASAL 73

Query: 119 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQNAVGNCI 177
           A                DWG+   K T V    S     +++ D +F+LV P +AV    
Sbjct: 74  A--------------LRDWGEVPFKITDVGTSRSPAETRIEDDDQVFLLVGP-SAVE--- 115

Query: 178 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 237
           +  ++ + + AG RP +++NP+L+D+  S   +    R  R  +  + E+CY+ R   + 
Sbjct: 116 VAQVEKICNLAGDRPCVILNPQLEDV--SIVGIGYAARQLRDRFLKTLESCYYLR--PFP 171

Query: 238 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
           G      GAL   YP+ ++++  +D     ++Y +++    KP+++ ++   L
Sbjct: 172 G------GALWRCYPSMWQVWLEID-----DEYQLVTEEPSKPTAEALDQIIL 213


>gi|298489954|ref|YP_003720131.1| hypothetical protein Aazo_0482 ['Nostoc azollae' 0708]
 gi|298231872|gb|ADI63008.1| Domain of unknown function DUF1995 ['Nostoc azollae' 0708]
          Length = 244

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 56/246 (22%), Positives = 102/246 (41%), Gaps = 50/246 (20%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +A + A+ DG TR++V+   PEL            M +       FA+   RVKV    +
Sbjct: 17  EAVKSALADGVTRIQVDFLFPELK----------FMPVAEQFVPLFAEYESRVKVFFADA 66

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 170
                                   W +   K   +  G   +   ++  +D++F+ +AP 
Sbjct: 67  GAAALARRD---------------WQNVPFKVEDIGTGRAASLQTKIQPEDEIFLFIAPT 111

Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
                  +  ++ M +   TRP++L+NPRL+D  G  GI     R+ R  + ++ E+CY+
Sbjct: 112 PVE----VPQLEKMCEIIDTRPIVLLNPRLED-SGVVGIGYA-ARETRRRFISTIESCYY 165

Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
            R         P+    AL   YP  +E++          +Y  ++   ++PS D+I+  
Sbjct: 166 LR---------PVDDESALFRCYPGQWEVWL-----ESNNEYEKIAELPKRPSGDEIDMI 211

Query: 289 FL-GKP 293
            + G+P
Sbjct: 212 LMKGQP 217


>gi|67922775|ref|ZP_00516276.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
 gi|416392960|ref|ZP_11685949.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
           0003]
 gi|67855391|gb|EAM50649.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
 gi|357263546|gb|EHJ12537.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
           0003]
          Length = 246

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 104/244 (42%), Gaps = 46/244 (18%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +A + A+  G  R+ +EI IPE+  +             + +AL F             +
Sbjct: 17  EAAKIALESGVGRICLEIVIPEIAIQ------------AQALALEFT------------T 52

Query: 113 MGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMF 164
           + EG  +G+ +    T    L   DWG       D G++ T +E       ++ E D++F
Sbjct: 53  LFEGQQSGLKVIFPDTGAAALARRDWGETPFRVTDLGSRATGIE------RKISEADEIF 106

Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
           +LV P +      ++ ++ + + AG RPVIL+ P+L+D+  S   +    R  R  + ++
Sbjct: 107 LLVCPSSVE----VETVEKLCELAGDRPVILLIPQLEDV--SIVGIGYAARQLRDRFIST 160

Query: 225 FENCYFFRLL--YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 282
            E+ Y+FR L        YP    + +   +GYEL         GE   IL     K   
Sbjct: 161 LESVYYFRPLDDVVVLRSYPSPWLVFLEKEDGYELIAEQGQKPMGEALDILVNNALKGEE 220

Query: 283 DDIN 286
           DD N
Sbjct: 221 DDSN 224


>gi|218247522|ref|YP_002372893.1| hypothetical protein PCC8801_2736 [Cyanothece sp. PCC 8801]
 gi|257061142|ref|YP_003139030.1| hypothetical protein Cyan8802_3366 [Cyanothece sp. PCC 8802]
 gi|218168000|gb|ACK66737.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
 gi|256591308|gb|ACV02195.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
          Length = 245

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 63/232 (27%), Positives = 102/232 (43%), Gaps = 45/232 (19%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +AT+ A+     R++VE+ +PE+  +      G  +E   +      DD   +KV    +
Sbjct: 17  EATKLALEANLGRIQVELVVPEIALQAQ----GLALEFTTLF-----DDLSSLKVIFPDT 67

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 165
            G  ALA         R+     DWG       D G++ T VE+      +V EQD  F+
Sbjct: 68  -GAAALA---------RR-----DWGETPFKVTDLGSRATSVEM------KVSEQDRAFL 106

Query: 166 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 225
           LV P     +  ++ ++ + + AG RPV+L+ P+L+D+    GI     R  R  + ++ 
Sbjct: 107 LVCP----SSVEVESVEKLCNLAGDRPVVLLIPQLEDV-SVVGIGYA-ARQLRERFISTL 160

Query: 226 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 275
            + Y+FR L  A     YP    + +    GYEL         GE   I+ T
Sbjct: 161 NSAYYFRPLEGAAILRSYPSPWNVYLETETGYELIASEPQKPLGEALEIILT 212


>gi|254410487|ref|ZP_05024266.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
           PCC 7420]
 gi|196182693|gb|EDX77678.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
           PCC 7420]
          Length = 245

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 112/242 (46%), Gaps = 37/242 (15%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +ATQ A+NDG TRL+VE+  PE+   +    I      +      F   G  +KV    +
Sbjct: 17  EATQAALNDGYTRLQVELVFPEIA--LQAQSIAWQFTPI------FEQYGSGLKVLFPDA 68

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
            G  ALA         R+     DWG+   K T  +IG+  +   + +  +D+ F++V+P
Sbjct: 69  -GAAALA---------RR-----DWGETPFKVT--DIGTSRSPVESRIQPEDEAFLIVSP 111

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
            +AV    +  ++ + + AG RP +L+ P+L+DL    GI     R  R  + ++  +CY
Sbjct: 112 -SAVE---VSQVEKLCNLAGDRPCVLLTPQLEDL-KVVGIGYA-ARQLRERFLSTLTSCY 165

Query: 230 FFRLLYYAGTQ--YPIMGALRMTYPNGYELY-KRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           + + L  A     YP +  + +   N Y+L  +    P G    +I++  T    S+D  
Sbjct: 166 YVQPLDGAALLRVYPGLWQIWLEKENAYQLIAEEPQKPVGDALELIIAKVTGSGDSEDTP 225

Query: 287 DA 288
            A
Sbjct: 226 AA 227


>gi|434386365|ref|YP_007096976.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
           6605]
 gi|428017355|gb|AFY93449.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
           6605]
          Length = 240

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 60/233 (25%), Positives = 102/233 (43%), Gaps = 55/233 (23%)

Query: 61  DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 120
           DG  R+ VEI IPEL            M L R    SF+D G  +KV    + G  ALA 
Sbjct: 25  DGHKRITVEIVIPELKS----------MGLARQFLESFSDRGNTLKVLFTDT-GAAALAK 73

Query: 121 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG------ANEVDEQDDMFILVAPQNAVG 174
                          DW     +G   +I  +G       +++  +D+++I + P +AV 
Sbjct: 74  R--------------DW-----QGVNFKIEDLGNSRSPITDKIQAEDELYIAIDP-SAVE 113

Query: 175 NCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLL 234
              ++ ++ + +AAG RPVIL  P+L+D     GI     R  R  +  +    Y+ + L
Sbjct: 114 ---VEQVEKLCNAAGDRPVILFLPKLED-AAIVGIGYA-ARQLRDRFLTTLTCAYYIKPL 168

Query: 235 YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
                      AL   YP  +++++  D     + Y++L+   +KP  D++++
Sbjct: 169 E--------ASALYRCYPAQWQVWQEQD-----DDYILLAECPQKPVGDELDE 208


>gi|359457624|ref|ZP_09246187.1| hypothetical protein ACCM5_02793 [Acaryochloris sp. CCMEE 5410]
          Length = 234

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 64/270 (23%), Positives = 115/270 (42%), Gaps = 64/270 (23%)

Query: 52  LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 107
           ++AT+ A+ D +T L+VEI IPEL P    +  + ++  L E V+V    F+D G     
Sbjct: 15  VEATKAAIADHKTLLQVEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67

Query: 108 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 167
               ++       +  +L G  +++                      + V  ++D F+ +
Sbjct: 68  ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102

Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
           AP        +  ++ +   AG R  IL+NP+L+D+    GI    GR  R  +  + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV-SIVGIGMA-GRTLRERFLNNIES 156

Query: 228 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 285
           CY F  L          GA+  +YP+ ++++      + GE  ++ IL+T   +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEENRPSGERI 203

Query: 286 NDAFLGKPRNREKKASG-------FWGFLS 308
           +          E +  G       FW  L+
Sbjct: 204 SQLLASLTATEEIRKPGLLDNMQQFWKALT 233


>gi|158334629|ref|YP_001515801.1| hypothetical protein AM1_1459 [Acaryochloris marina MBIC11017]
 gi|158304870|gb|ABW26487.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 234

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 114/270 (42%), Gaps = 64/270 (23%)

Query: 52  LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 107
           ++AT+ A+ D +T L++EI IPEL P    +  + ++  L E V+V    F+D G     
Sbjct: 15  VEATKSAIADHKTLLQIEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67

Query: 108 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 167
               ++       +  +L G  +++                      + V  ++D F+ +
Sbjct: 68  ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102

Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
           AP        +  ++ +   AG R  IL+NP+L+D+  S   +   GR  R  +  + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV--SIVGIGMAGRTLRERFLNNIES 156

Query: 228 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 285
           CY F  L          GA+  +YP+ ++++      + GE  ++ IL+T   +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEESRPSGERI 203

Query: 286 NDAFLGKPRNREKKASG-------FWGFLS 308
                      + +  G       FW  L+
Sbjct: 204 GQLLASSTTTEDNRKPGLLDNMQQFWKALT 233


>gi|428183504|gb|EKX52362.1| hypothetical protein GUITHDRAFT_157134 [Guillardia theta CCMP2712]
          Length = 325

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 71/156 (45%), Gaps = 13/156 (8%)

Query: 141 GAKGTFVE---IGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILIN 197
           G KG+ +E   I     +EV E DDMFI++          + D++ +  AAG RPVIL N
Sbjct: 123 GFKGSLIERVWINKDFVSEVQEDDDMFIIIGASAQE----LPDVRELCKAAGDRPVILFN 178

Query: 198 PRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY----AGTQYPI--MGALRMTY 251
            +L+ L G  G+     +    ++       YF     Y    AG  + I   GAL  TY
Sbjct: 179 LKLQVLRGDFGLPFFPSKSLHNDWLCEALPAYFMLPKSYTKTIAGPPFLINYSGALFRTY 238

Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
           P  +++   V    GG +Y  +    ++P+  D+ +
Sbjct: 239 PGKWQMLLEVPDEDGGGRYQRVRMLDKRPALSDVRE 274


>gi|86607776|ref|YP_476538.1| hypothetical protein CYB_0277 [Synechococcus sp. JA-2-3B'a(2-13)]
 gi|86556318|gb|ABD01275.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
          Length = 233

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/250 (22%), Positives = 108/250 (43%), Gaps = 54/250 (21%)

Query: 49  DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 108
           +  + AT+ A+  G  RL +EI  P+L PE+        +ELV+  AL    D       
Sbjct: 10  EQAIGATRLAMQAGYRRLLIEILAPDLKPEVLAR---PFLELVQPPALVLFSDA------ 60

Query: 109 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVA 168
                G  ALA                   ++G+    +E+ S+ +      +   +LV 
Sbjct: 61  -----GGAALAQR-----------------EWGSLPEGIELQSLTSRTQPTPEQSILLVM 98

Query: 169 PQNAVGNCIIDDMKAMT--DAAG-------TRPVILINPRLKDLPGSSGIMQTMGRDKRL 219
           P     + +    +A++  D +G        +PV+L+NP+L+D   ++  +   GR  R 
Sbjct: 99  PAVYSLDQVERVCQAVSGGDPSGLASKGRDPKPVVLLNPQLQD--AATVGVGLAGRRLRQ 156

Query: 220 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 279
            + ++FE  Y+ R L        + GAL   YP+ + ++++ +       Y +L TF  +
Sbjct: 157 RFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEE----PGLYSVLKTFIAQ 204

Query: 280 PSSDDINDAF 289
           P+ +++ + F
Sbjct: 205 PTGEEVAELF 214


>gi|428203362|ref|YP_007081951.1| hypothetical protein Ple7327_3166 [Pleurocapsa sp. PCC 7327]
 gi|427980794|gb|AFY78394.1| protein of unknown function (DUF1995) [Pleurocapsa sp. PCC 7327]
          Length = 249

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/243 (23%), Positives = 114/243 (46%), Gaps = 59/243 (24%)

Query: 53  QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 110
           +AT+ A+  G  R++VE+ +PE  L  +       +L+E          D G  ++V   
Sbjct: 17  EATKVALEAGCERIQVELVVPEIALQAQSLALEFTSLLE----------DYGSGLRVLFP 66

Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMF 164
            + G  ALA         R+     DWG+     T  ++  +G+       ++ E+D  F
Sbjct: 67  DT-GAAALA---------RR-----DWGE-----TVFQVSDLGSRFIPVDMKISEEDTAF 106

Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
           ++V+P     N +    + + + AG RPV+L+ P+L+D+  S   +    R  R  + ++
Sbjct: 107 LVVSPSAIEVNSV----EKLCNLAGDRPVVLLIPQLEDV--SVVGIGLAARQLRERFLST 160

Query: 225 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
            E+ Y+FR         P+ GA+ + +YP+ ++++       G E Y +++   +KP+ +
Sbjct: 161 LESSYYFR---------PLEGAVVLRSYPSLWQVWL-----GGEEDYQLIAEEPQKPTGE 206

Query: 284 DIN 286
            ++
Sbjct: 207 RLD 209


>gi|443314784|ref|ZP_21044317.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
 gi|442785626|gb|ELR95433.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
          Length = 246

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 111/240 (46%), Gaps = 52/240 (21%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           QAT+ A+  G +R+ V+I IPEL       +I  + E    +   F + G + KV    +
Sbjct: 19  QATRAAIEAGVSRIVVDIAIPEL-------KIQPVAEQFYPL---FEEMGLQFKVYFPDA 68

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 170
            G  ALA                DWG  D+  +G     G + A+     D+ F+++ P 
Sbjct: 69  -GAAALAK--------------RDWGNPDFSIRGINEMKGRLEAD-----DEAFLIIEP- 107

Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
               +  ++D+++  + A  R V+++NP+L+D+  + GI  T GR  R  + ++ E  Y+
Sbjct: 108 ---SSVEVNDVESFCNEATGRFVVMLNPKLEDI-ATIGIGYT-GRQLRERFLSTLETIYY 162

Query: 231 FRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
            +         P+ GA  +  YP  ++++      +  + Y +L+ F +KPS + +   F
Sbjct: 163 LQ---------PLEGATILRAYPGLWQVWGE----TTDDGYELLADFPQKPSGEALEKLF 209


>gi|304570679|ref|YP_001806112.2| hypothetical protein cce_4698 [Cyanothece sp. ATCC 51142]
 gi|354552131|ref|ZP_08971439.1| protein of unknown function DUF1995-containing protein [Cyanothece
           sp. ATCC 51472]
 gi|353555453|gb|EHC24841.1| protein of unknown function DUF1995-containing protein [Cyanothece
           sp. ATCC 51472]
          Length = 246

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 55/235 (23%), Positives = 107/235 (45%), Gaps = 56/235 (23%)

Query: 106 KVCVQGSMGEGALAGMPLQL----------AGTRKI--------LEFMDWG-------DY 140
           ++C++  + E AL   PL L          +G + I        L   DWG       D 
Sbjct: 29  RICLEIVIPEIALQAQPLALEFTTLFKGQESGLKVIFPDTGAAALARRDWGETPFQVTDL 88

Query: 141 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 200
           G++ T ++       ++ E D+ F+LV+P     +  ++ ++ + + AG RPVIL+ P+L
Sbjct: 89  GSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLAGHRPVILLIPQL 138

Query: 201 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYK 259
           +D+  S   +    R  R  + ++ E+ Y+FR         P+ G + + +YP+ + +Y 
Sbjct: 139 EDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVVLRSYPSPWLVYL 187

Query: 260 RVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGKPRNREKKASGFWGFLSSV 310
             +     E Y +++   +KP  +     +N+A  G+  N ++      G  +SV
Sbjct: 188 EKE-----EGYELIAEQGQKPMGEALDILVNNALKGEEENSDQPQPKKSGIFASV 237


>gi|17230608|ref|NP_487156.1| hypothetical protein all3116 [Nostoc sp. PCC 7120]
 gi|17132211|dbj|BAB74815.1| all3116 [Nostoc sp. PCC 7120]
          Length = 245

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 50/248 (20%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +AT+ A+ DG  R++VE+  PEL            M +         +   R+KV    +
Sbjct: 17  EATKSALADGCNRIQVELLFPELK----------FMPVAEQFLPCLVEYESRLKVFFADA 66

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 170
                                   W D   +   +  G I +  +++  +D++F+ ++P 
Sbjct: 67  GAAALARRD---------------WKDAAFQILDIGTGRIASIQSKIQPEDEIFLFISPT 111

Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
           +      +  ++ + +  G RP I +NPRL+D  G+ GI  T  R  R  +    ++CY+
Sbjct: 112 SVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQTRERFLNIIQSCYY 165

Query: 231 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 288
            R         PI    AL  +YP  +E++   D       +  ++   +KPS D+I+  
Sbjct: 166 LR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIANLPKKPSGDEIDLI 211

Query: 289 FL-GKPRN 295
            + G+P+ 
Sbjct: 212 LMKGQPQT 219


>gi|428772277|ref|YP_007164065.1| hypothetical protein Cyast_0436 [Cyanobacterium stanieri PCC 7202]
 gi|428686556|gb|AFZ46416.1| protein of unknown function DUF1995 [Cyanobacterium stanieri PCC
           7202]
          Length = 244

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 118/245 (48%), Gaps = 52/245 (21%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           AT  A+  G  RL++E+ IPE+  +          EL    A  F + G  +KV    + 
Sbjct: 18  ATLTALESGFNRLQIELIIPEIALKAQ--------ELAFAFAQLFGEYGAGLKVFFPDT- 68

Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV--EIGSIGA---NEVDEQDDMFILVA 168
           G  ALA         R+     +WG+     TFV  ++GS  +   N V + D+ +++V+
Sbjct: 69  GAAALA---------RR-----EWGE----TTFVVTDLGSSRSPIENRVSDTDEFYLVVS 110

Query: 169 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 228
           P +AV    +  ++ + + AG RPVILI P+L+D+  S   +    R  R  + ++ E+ 
Sbjct: 111 P-SAVE---VAQVEKLCNLAGDRPVILIIPQLEDI--SVVGIGYAARQLRERFLSTLESS 164

Query: 229 YFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
           Y++R         P+  A+ M  +P  +++++ ++      K+ +++   +KP  + ++ 
Sbjct: 165 YYYR---------PLEDAVVMRVFPGLWQVWRELE----DNKFELITEVPKKPLGEVLDR 211

Query: 288 AFLGK 292
             +G+
Sbjct: 212 ILMGQ 216


>gi|218439257|ref|YP_002377586.1| hypothetical protein PCC7424_2296 [Cyanothece sp. PCC 7424]
 gi|218171985|gb|ACK70718.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
          Length = 253

 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 60/228 (26%), Positives = 101/228 (44%), Gaps = 47/228 (20%)

Query: 42  EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
           E+ N     ++QA   T+ A+ +G  R++VE+ IPE+                + IAL F
Sbjct: 3   EVPNSLEATIIQAKEATRLALEEGLGRIQVELVIPEIALS------------AQTIALEF 50

Query: 99  ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGS 151
                     +  S GEG     P   A     L   DWG       D G++ T VE+  
Sbjct: 51  --------TSLLDSYGEGLKVLFPDSGAAA---LARRDWGETPFKISDLGSRFTPVEM-- 97

Query: 152 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 211
               ++  +D+ F++V P     +  +  +  + D AG RPV+++ P+L+D+    GI  
Sbjct: 98  ----KITPEDEAFLVVCP----SSIEVQSVAKLCDLAGDRPVVILIPQLEDV-SIVGIGY 148

Query: 212 TMGRDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYEL 257
           T  R  R  + ++ ++ Y+ R L  A     YP +  + +   +GYEL
Sbjct: 149 T-ARQLRDNFLSTLQSSYYIRPLDGAVVLRAYPELWQVWLEKEDGYEL 195


>gi|126656438|ref|ZP_01727699.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
 gi|126622124|gb|EAZ92831.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
          Length = 246

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/271 (22%), Positives = 121/271 (44%), Gaps = 63/271 (23%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           +A + A+  G  R+ +EI IPE+  +             + +AL F             +
Sbjct: 17  EAAKLALEAGVERICLEIVIPEIALQ------------AQPLALEFT------------T 52

Query: 113 MGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMF 164
           + EG  +G+ +    T    L   DWG       D G++ T +E       ++ E D+ F
Sbjct: 53  LFEGQESGLKVIFPDTGAAALARRDWGETPFQVTDLGSRATSIE------KKISEADEAF 106

Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
           +LV+P     +  ++ ++ + + AG R VIL+ P+L+D+  S   +    R  R  + ++
Sbjct: 107 LLVSP----SSVEVEIVEKLCNLAGHRSVILLIPQLEDV--SIVGIGYAARQLRERFIST 160

Query: 225 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
            E+ Y+FR         P+ G + + +YP+ + +Y   +     E Y +++   +KP  +
Sbjct: 161 LESVYYFR---------PLDGVVVLRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGE 206

Query: 284 D----INDAFLGKPRNREKKASGFWGFLSSV 310
                +N+A  G+  N ++  +   G  +SV
Sbjct: 207 ALDTLVNNALKGEEDNSDQPQAKKSGIFASV 237


>gi|171701065|gb|ACB54046.1| unknown [Cyanothece sp. ATCC 51142]
          Length = 184

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 45/187 (24%), Positives = 90/187 (48%), Gaps = 38/187 (20%)

Query: 136 DWG-------DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA 188
           DWG       D G++ T ++       ++ E D+ F+LV+P     +  ++ ++ + + A
Sbjct: 15  DWGETPFQVTDLGSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLA 64

Query: 189 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALR 248
           G RPVIL+ P+L+D+  S   +    R  R  + ++ E+ Y+FR         P+ G + 
Sbjct: 65  GHRPVILLIPQLEDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVV 113

Query: 249 M-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGKPRNREKKASGF 303
           + +YP+ + +Y   +     E Y +++   +KP  +     +N+A  G+  N ++     
Sbjct: 114 LRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGEALDILVNNALKGEEENSDQPQPKK 168

Query: 304 WGFLSSV 310
            G  +SV
Sbjct: 169 SGIFASV 175


>gi|75911048|ref|YP_325344.1| hypothetical protein Ava_4852 [Anabaena variabilis ATCC 29413]
 gi|75704773|gb|ABA24449.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
          Length = 245

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 56/252 (22%), Positives = 102/252 (40%), Gaps = 52/252 (20%)

Query: 42  EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 98
           E+ N   D + Q   AT+ A+ DG  R++VE+  PEL            M +        
Sbjct: 3   ELPNSLEDAIAQSREATKAALADGCHRIQVELLFPELK----------FMPVAEQFLPCL 52

Query: 99  ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 156
            +   R+KV    +                        W D   +   +  G   +  ++
Sbjct: 53  VEYESRLKVFFADAGAAALARRD---------------WKDAPFQILDIGTGRTASIQSK 97

Query: 157 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 216
           +  +D++F+ ++P +      +  ++ + +  G RP I +NPRL+D  G+ GI  T  R 
Sbjct: 98  IQPEDEIFLFISPTSVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQ 151

Query: 217 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
            R  +    ++CY+ R         PI    AL  +YP  +E++   D       +  ++
Sbjct: 152 TRERFLNIIQSCYYLR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIA 197

Query: 275 TFTEKPSSDDIN 286
              +KPS D+I+
Sbjct: 198 NLPKKPSGDEID 209


>gi|427713531|ref|YP_007062155.1| hypothetical protein Syn6312_2513 [Synechococcus sp. PCC 6312]
 gi|427377660|gb|AFY61612.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 6312]
          Length = 232

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 65/261 (24%), Positives = 105/261 (40%), Gaps = 58/261 (22%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           ATQ A++ G  R+++EI +P+L           +M L       F D G  +KV    + 
Sbjct: 17  ATQVALDAGINRIQIEILLPDLQ----------VMPLAWSYLELFQDLGVHLKVFFADAG 66

Query: 114 GEGALAGM---PL-QLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAP 169
                      P+  + G  ++LE                       V + D  F+ + P
Sbjct: 67  AAALARRDWQNPIFSVRGVNELLE----------------------PVQDSDQAFVCITP 104

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
                   +  ++ M +AAG RP IL+NP+L+D+    GI    GR  R  +  + E  Y
Sbjct: 105 SPVE----VSYIEQMCNAAGNRPFILLNPKLQDV-AIVGIGYA-GRQLRERFLNTLETAY 158

Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
           + R L           AL   YP  +E+++ V+    GE Y  L     KPS + I+   
Sbjct: 159 YIRPL-------DDTTALLRAYPGDWEIWRDVN----GE-YQRLGATPAKPSGEQIDKVL 206

Query: 290 LGKPRNREKKASGFWGFLSSV 310
            G+    + +  G  GF +S+
Sbjct: 207 YGE----QAETPGGGGFFASM 223


>gi|428769639|ref|YP_007161429.1| hypothetical protein Cyan10605_1269 [Cyanobacterium aponinum PCC
           10605]
 gi|428683918|gb|AFZ53385.1| protein of unknown function DUF1995-containing protein
           [Cyanobacterium aponinum PCC 10605]
          Length = 251

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/246 (21%), Positives = 115/246 (46%), Gaps = 53/246 (21%)

Query: 54  ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 113
           A + A+  G TR+ +++ IPE+  +           L +  A  F++ G  +K+    + 
Sbjct: 20  ALKVALESGCTRITIDLVIPEIALKAQY--------LAQEFADFFSEYGAGLKLFFPDT- 70

Query: 114 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFILV 167
           G  ALA         R+     DWG+     T  ++  IG++      ++ + D +F+++
Sbjct: 71  GAAALA---------RR-----DWGE-----TEFQVTDIGSSRSPIERKISDTDQVFLVI 111

Query: 168 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 227
           +P     N +    + + + AG RPVIL+ P+L+D+  S   +    R  R  + +  E+
Sbjct: 112 SPSAVEVNLV----EKLCNIAGDRPVILLIPQLEDV--SIVGIGYAARQLRERFISILES 165

Query: 228 CYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
            Y+FR         P+  A+ + +YP  ++++++ +     E Y +++   +KP  ++++
Sbjct: 166 AYYFR---------PLESAVVLRSYPQLWQVWQQNE---ENEDYELIAEVPQKPLGENLD 213

Query: 287 DAFLGK 292
               G+
Sbjct: 214 RILQGQ 219


>gi|427724296|ref|YP_007071573.1| hypothetical protein Lepto7376_2460 [Leptolyngbya sp. PCC 7376]
 gi|427356016|gb|AFY38739.1| protein of unknown function DUF1995-containing protein
           [Leptolyngbya sp. PCC 7376]
          Length = 251

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 65/255 (25%), Positives = 116/255 (45%), Gaps = 58/255 (22%)

Query: 34  LNNIPHSREIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMEL 90
           ++N+P S      F D + QA   T  ++  G  R+ VE+  PE                
Sbjct: 1   MSNLPKS------FPDTLEQAKSATLTSLEAGCGRVLVELCFPE---------------- 38

Query: 91  VRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEI 149
           + + A S A D  +V V   GS       G+ +    T    L   DWG+   + +  +I
Sbjct: 39  IALKAQSLAWDFAQVFVKEYGS-------GLKVLFPDTGAAALAKRDWGE--VEFSVTDI 89

Query: 150 GSIG---ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 206
           GS       +V++ D +F++V P +      +  ++ + + AG RPVI++ P+L+D+  S
Sbjct: 90  GSSRNPVQYKVNDADQIFLVVCPSSVE----VAQVEKLCELAGDRPVIMLIPQLEDV--S 143

Query: 207 SGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPS 265
              +    R  R  + ++ E+ Y+ R         P  GA+   ++P+G+E+Y   +   
Sbjct: 144 IVGIGYAARQLRERFISTLESAYYIR---------PYDGAMVWRSFPSGWEVYLEKE--- 191

Query: 266 GGEKYVILSTFTEKP 280
            GE Y +++T T+KP
Sbjct: 192 EGE-YELIATETQKP 205


>gi|409992140|ref|ZP_11275348.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
           Paraca]
 gi|409936997|gb|EKN78453.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
           Paraca]
          Length = 262

 Score = 47.0 bits (110), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 47/237 (19%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           QA   A+ DG   ++VE+  PE+  E+    I +      + AL   D   +V       
Sbjct: 30  QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFF---PD 80

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
            G  ALA         R+     DWG+   + T  +IG+  +     +   D  F++V+P
Sbjct: 81  AGAAALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
                   ++ ++ +   AG R V+L+NPRL+D+    GI  T  R  R  +  + E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYT-ARQLRERFLNTIESCY 178

Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           + + L  A        AL   YP  +E++  +D    GE Y  ++  + KP  D + 
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 222


>gi|356535083|ref|XP_003536078.1| PREDICTED: uncharacterized protein LOC100803954 [Glycine max]
          Length = 344

 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 58/265 (21%), Positives = 108/265 (40%), Gaps = 42/265 (15%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK--RVKVCVQ 110
           +A    V+ G  R  VE+ IP+L    D    G L EL R+   +  +  K  RVK    
Sbjct: 107 EAAATFVSSGGMRAIVELLIPQLQFLDDEGAQGELWELSRIFLDTLIEKTKSQRVKAIFP 166

Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAP 169
                          AG   +L+      Y  K       S+   + V+  D++ +++ P
Sbjct: 167 D--------------AGAAALLK------YRWKDALFRFASLSDRKPVESDDEIVVMIVP 206

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
              +   +      +T+     P+I+ NPRL  +    G+   + + +R  + ++F   Y
Sbjct: 207 DYQMLEYVERIASNLTNVPEPTPLIMWNPRL--ISEDVGVGFNVRKLRRF-FLSTFTTVY 263

Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 289
           F R + +        GA+   YP  ++++   D     ++Y++   F  +P +DDI   F
Sbjct: 264 FMRPMPF--------GAIFRCYPGLWKVFS--DDKERPDRYLLAKEFESRPDADDIELLF 313

Query: 290 LGKPRNREK------KASGFWGFLS 308
             + +  E+      KA+G +  +S
Sbjct: 314 ADEDQKSEQGQGLLDKAAGIFSSIS 338


>gi|428218666|ref|YP_007103131.1| hypothetical protein Pse7367_2442 [Pseudanabaena sp. PCC 7367]
 gi|427990448|gb|AFY70703.1| protein of unknown function DUF1995-containing protein
           [Pseudanabaena sp. PCC 7367]
          Length = 261

 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 58/129 (44%), Gaps = 23/129 (17%)

Query: 136 DWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 191
           DW D   +    E+   G NE    + E D  F+L+AP     +  +D ++ +   AG R
Sbjct: 76  DWPDISDQ----EVSMRGVNEGRAAIREDDQAFLLIAP----SSVEVDQVEKLLALAGDR 127

Query: 192 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
           P I+ NPRL++     G+     R  R  +  +F  CY+ + L          G L   Y
Sbjct: 128 PFIMFNPRLENSEVGIGLAT---RKMRERFLNTFTVCYYMQPLD--------AGLLWRCY 176

Query: 252 PNGYELYKR 260
           P  +++Y++
Sbjct: 177 PGLWQVYRQ 185


>gi|428214953|ref|YP_007088097.1| hypothetical protein Oscil6304_4664 [Oscillatoria acuminata PCC
           6304]
 gi|428003334|gb|AFY84177.1| protein of unknown function (DUF1995) [Oscillatoria acuminata PCC
           6304]
          Length = 247

 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 60/231 (25%), Positives = 103/231 (44%), Gaps = 52/231 (22%)

Query: 59  VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
           ++DG   ++VEI IPEL     V  I    E + +    F D     KV    + G  AL
Sbjct: 23  LDDGYRLMQVEIGIPELK----VQPIAE--EFLPIFEDLFGD---HFKVYFPDA-GAAAL 72

Query: 119 AGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNC 176
           A         R+     DWG+  Y  +G      S    ++  +D  F+ V P +AV   
Sbjct: 73  A---------RR-----DWGERPYIIRGL-----SEPKGQMQPEDQGFLFVEP-SAVE-- 110

Query: 177 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 236
            ++ ++ M + AG RP +++ P+L+++    GI    GR  R  + ++ E+CY+ +    
Sbjct: 111 -VNTLEKMCEQAGDRPTVILMPKLENV-AIIGI-GLAGRQLRERFLSTIESCYYIQ---- 163

Query: 237 AGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
                P  G A+   YP+ ++++        G+ Y ++S    KP  DD+ 
Sbjct: 164 -----PSQGYAVFRYYPSPWQVWLET-----GDTYQLISETATKPVGDDLE 204


>gi|291567271|dbj|BAI89543.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 249

 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 47/237 (19%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           QA   A+ DG   ++VE+  PE+  E+    I +      + AL   D   +V     GS
Sbjct: 17  QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
               ALA         R+     DWG+   + T  +IG+  +     +   D  F++V+P
Sbjct: 71  ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
                   ++ ++ +   AG R V+L+NPRL+D+    GI     R  R  +  + E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNTIESCY 165

Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           + + L  A        AL   YP  +E++  +D    GE Y  ++  + KP  D + 
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 209


>gi|384249997|gb|EIE23477.1| hypothetical protein COCSUDRAFT_65935 [Coccomyxa subellipsoidea
           C-169]
          Length = 335

 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 21/145 (14%)

Query: 148 EIGSIGA-NEVDEQDDMFILVA--PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 204
           EI +I   + V E DD+ I+ A  PQ       +++ K +   +   PVIL NPRL    
Sbjct: 174 EIAAISERSAVAEGDDLVIIAAVDPQG------LNEAKRVAAGSSFTPVILFNPRLAS-- 225

Query: 205 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 264
           G  G+   + R  R E+ ++F+  Y  R +   GT +         +P  ++++K  D  
Sbjct: 226 GDVGLGLNV-RRMRNEFLSTFQITYSLRPVNETGTVF-------RRFPGTWKVFKE-DAS 276

Query: 265 SGGEKYVILSTFTEKPSSDDINDAF 289
           S G +Y + + F ++P+ DD++  F
Sbjct: 277 SPG-RYDLAAEFRDQPTGDDLDQIF 300


>gi|170076650|ref|YP_001733288.1| hypothetical protein SYNPCC7002_A0014 [Synechococcus sp. PCC 7002]
 gi|169884319|gb|ACA98032.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
          Length = 251

 Score = 45.1 bits (105), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 40/160 (25%), Positives = 79/160 (49%), Gaps = 25/160 (15%)

Query: 136 DWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRP 192
           DWG+   K    +IG+  +   N++ + D +FI+V P +AV    +  ++ + + AG RP
Sbjct: 78  DWGEIPFK--VADIGTSRSPIENKIGDDDQIFIIVCP-SAVE---VGQVEKLCNLAGDRP 131

Query: 193 VILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TY 251
           VI++ P+L+D+  S   +    R  R  + ++ E  Y+ R         P   A+   +Y
Sbjct: 132 VIMLIPQLEDV--SIVGIGYAARQLRERFISTLETAYYIR---------PYESAMVWRSY 180

Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 291
           P+ +E+Y   +     ++Y ++++ T KP  + +    L 
Sbjct: 181 PSAWEVYLEKE----EDQYELIASETTKPLGEYLERLLLA 216


>gi|428185262|gb|EKX54115.1| hypothetical protein GUITHDRAFT_100361 [Guillardia theta CCMP2712]
          Length = 302

 Score = 44.3 bits (103), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 56/242 (23%), Positives = 94/242 (38%), Gaps = 53/242 (21%)

Query: 12  ERKSRSNQVS--SSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVE 69
           E + R N++   ++    K  P  +NN+           D V ++   A++DG  RL+V+
Sbjct: 5   EHQPRRNEIKFDAAARYRKAFPLGVNNV----------LDQVRESLNLALDDGLFRLRVQ 54

Query: 70  INIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEG-ALAGMPLQLAGT 128
           +NI + +  +  +  G + ELV +I     + G R  +     M    A   +P  L   
Sbjct: 55  LNIHQFDQRLQSFSDGKMPELVAMIGELLVERGMRPSIMFNTIMDASIARTVLPDHLREK 114

Query: 129 RKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA 188
                             V I  +G  E  E+ D  +LVAP N  GN  +  ++A+    
Sbjct: 115 ------------------VSINVLGLGEFGEEYDAAVLVAPSNE-GNENLQRIEAVEKLI 155

Query: 189 GT-----------------RPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 231
            +                 RP+ILINP L+ +   S    T        + + F+  YF 
Sbjct: 156 YSGPPRGLVSRSARAKYMQRPIILINPVLEAIHSLS----TPHSKVPPMFMSDFDVVYFL 211

Query: 232 RL 233
           +L
Sbjct: 212 QL 213


>gi|427739188|ref|YP_007058732.1| hypothetical protein Riv7116_5820 [Rivularia sp. PCC 7116]
 gi|427374229|gb|AFY58185.1| protein of unknown function (DUF1995) [Rivularia sp. PCC 7116]
          Length = 245

 Score = 44.3 bits (103), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 56/233 (24%), Positives = 98/233 (42%), Gaps = 54/233 (23%)

Query: 61  DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALA- 119
           DG TRL+VE+  PEL            +   +    +F + G ++K+      G  ALA 
Sbjct: 25  DGYTRLQVELLFPELKA----------LPAAQQFLPAFDEYGDKLKIFF-ADAGSAALAR 73

Query: 120 ----GMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGN 175
                +P QL         +D G     G    I S    +V  +D++F+ V P +    
Sbjct: 74  RDWNEIPFQL---------LDIG----TGRMTSIES----KVQPEDEIFLFVQPSSVE-- 114

Query: 176 CIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGI-MQTMGRDKRLEYAASFENCYFFRL 233
             +  ++ +    G  RP ++  PRL+D   SS + +    R  R  +  + E+CY+ R 
Sbjct: 115 --VPQLEKLCGIIGEQRPFVMFAPRLED---SSIVGIGYAARQTRQRFINTIESCYYLRP 169

Query: 234 LYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           ++          A+   YP  +E++    V   G+ Y  ++    KPS D+++
Sbjct: 170 IFEE-------AAVFRCYPGLWEVW----VEKNGD-YEKVAELPNKPSGDELD 210


>gi|428220338|ref|YP_007104508.1| hypothetical protein Syn7502_00205 [Synechococcus sp. PCC 7502]
 gi|427993678|gb|AFY72373.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 7502]
          Length = 233

 Score = 43.9 bits (102), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 52/243 (21%), Positives = 96/243 (39%), Gaps = 53/243 (21%)

Query: 49  DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 108
           +  + AT  A+  G  R+++++  PEL            M +    A +F     +    
Sbjct: 13  EQAIAATMSAIAAGCKRIQIDMKFPEL----------AQMPIALAFAEAFTKQYGQAWQA 62

Query: 109 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 164
           +    G  ALA                DW +        E+   G NE       +   F
Sbjct: 63  LFADAGTAALAKR--------------DWAN---SDILREVSMRGVNEGRAAARPESQAF 105

Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
           +L++P +      +D ++ +    G RP +++NP+L++     G+     RD+ L     
Sbjct: 106 LLISPTSVE----VDGIEKLLTLIGDRPFVMLNPKLENSEVGLGLAARRMRDRFLS---- 157

Query: 225 FENCYFFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 283
                 F ++YY     PI  GA+   YP  +++++  D  +G E   ++    E+P  D
Sbjct: 158 -----IFEIVYYI---QPIEEGAVWRCYPQLWQVWQ--DKTTGME---LIYEGEERPVGD 204

Query: 284 DIN 286
           D+N
Sbjct: 205 DLN 207


>gi|317969886|ref|ZP_07971276.1| hypothetical protein SCB02_10139 [Synechococcus sp. CB0205]
          Length = 232

 Score = 43.9 bits (102), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 66/132 (50%), Gaps = 21/132 (15%)

Query: 161 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 220
           D + +LVAP  A      +D++A+  A     V+L+N +L+D     G   T+ R++R  
Sbjct: 102 DGLLVLVAPTPA----DYEDVEAVC-AQHRGSVVLLNGKLEDAAVGIG---TVARERRKG 153

Query: 221 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 280
           + A++++ Y    L   G      GALR  YP  ++LY+R   P G   Y  +  F  KP
Sbjct: 154 FLATWQSAY---ALIPTGE-----GALRRAYPLDWQLYRR--DPDG---YRFVKDFELKP 200

Query: 281 SSDDINDAFLGK 292
            ++ + DA  G+
Sbjct: 201 DAEQMADALEGE 212


>gi|357441663|ref|XP_003591109.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
 gi|355480157|gb|AES61360.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
          Length = 327

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 58/261 (22%), Positives = 106/261 (40%), Gaps = 35/261 (13%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           QAT   V+ G  R  VE+ IP+L    D      L EL R+           +   +Q +
Sbjct: 90  QATASFVSSGGMRAIVELLIPQLEFLDDEGAQLELWELSRLF----------LDTLIQET 139

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNA 172
             +   A  P   AG   +L++  W D  A  +F  +       VD  D+M +++ P + 
Sbjct: 140 NCQKVKAVFP--DAGAAALLKYR-WKD--ALFSFASLS--DRKLVDSGDEMVVMIVPDHQ 192

Query: 173 VGNCIIDDMKAMTDAAGTRPVILINPRL--KDLPGSSGIMQTMGRDKRLEYAASFENCYF 230
           +   +      ++D    RP+I+ NPRL  KD+     +     R  R  + ++F   Y 
Sbjct: 193 MLEYVEKIASTLSDDP-PRPLIMWNPRLYSKDVGVGYNV-----RQLRQYFLSTFTTVYH 246

Query: 231 FRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 290
            R + +        GA+   YP  ++++   D      +Y++   +  +P ++DI   F 
Sbjct: 247 MRPMPF--------GAVFRCYPEMWKVFS--DDKDRPNRYLLAKEYANRPDTEDIEMLFG 296

Query: 291 GKPRNREKKASGFWGFLSSVF 311
                + ++    +   + VF
Sbjct: 297 NNEEQKSEQGQSLFDKAAGVF 317


>gi|225450962|ref|XP_002284712.1| PREDICTED: uncharacterized protein LOC100264831 [Vitis vinifera]
 gi|296088326|emb|CBI36771.3| unnamed protein product [Vitis vinifera]
          Length = 353

 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 112/267 (41%), Gaps = 48/267 (17%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDG--KRVKVCVQ 110
           QA+   V+ G  R  VE+ IP+L    D      L EL R+   +  ++   +RVK    
Sbjct: 115 QASASFVSSGGMRAIVELLIPQLQFLDDEGAQAELWELSRIFIDTLIEETGCQRVKAIFP 174

Query: 111 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 170
                          AG   +L++  WGD  A   F  +       V+ +D++ ++V P 
Sbjct: 175 D--------------AGAAALLKYR-WGD--AAFGFASLSD--RKPVEAEDELVVMVVPD 215

Query: 171 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG-----RDKRLEYAASF 225
             +   +      ++D    RP+I+ NPRL        I + +G     R  R  + ++F
Sbjct: 216 YQMLAYVERIASNLSDDP-PRPLIMWNPRL--------ISEDVGVGFNVRKLRRSFLSTF 266

Query: 226 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYEL-YKRVDVPSGGEKYVILSTFTEKPSSDD 284
              Y  R L          GA+   YP  +++ Y   D P+   +Y++      +P +++
Sbjct: 267 TVVYSMRPL--------PAGAVFRCYPGSWKVFYDDKDRPN---RYLLAKEQISRPDAEE 315

Query: 285 INDAFLGKPRNREKKASGFWGFLSSVF 311
           +   F G  +++ +K   F+G  +S+F
Sbjct: 316 LEIIF-GDVQDKSEKGPSFFGQAASIF 341


>gi|209522945|ref|ZP_03271502.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|376001796|ref|ZP_09779650.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|209496532|gb|EDZ96830.1| conserved hypothetical protein [Arthrospira maxima CS-328]
 gi|375329707|emb|CCE15403.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 249

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 61/237 (25%), Positives = 103/237 (43%), Gaps = 47/237 (19%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           QA   A++DG   ++VE+  PE+  E+    I +      + AL   D   +V     GS
Sbjct: 17  QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
               ALA         R+     DWG+   + T  +IG+  +     +   D  F++V+P
Sbjct: 71  ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
                   ++ ++ +   AG R V+L+NPRL+D+    GI     R  R  +    E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 165

Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           + + L  A        AL   YP  +E++  ++    GE Y  ++  + KP  D + 
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 209


>gi|423062349|ref|ZP_17051139.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
 gi|406716257|gb|EKD11408.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
          Length = 262

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 61/237 (25%), Positives = 103/237 (43%), Gaps = 47/237 (19%)

Query: 53  QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 112
           QA   A++DG   ++VE+  PE+  E+    I +      + AL   D   +V     GS
Sbjct: 30  QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 83

Query: 113 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 169
               ALA         R+     DWG+   + T  +IG+  +     +   D  F++V+P
Sbjct: 84  ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124

Query: 170 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 229
                   ++ ++ +   AG R V+L+NPRL+D+    GI     R  R  +    E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 178

Query: 230 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 286
           + + L  A        AL   YP  +E++  ++    GE Y  ++  + KP  D + 
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 222


>gi|299116127|emb|CBN76036.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 284

 Score = 42.7 bits (99), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 58/268 (21%), Positives = 110/268 (41%), Gaps = 48/268 (17%)

Query: 36  NIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPEL-NPEMDVYRIGTLMELVRVI 94
            +P S +++    +   +A  +A++DG+TR  + +++P +   E+D +  G         
Sbjct: 48  ELPMSNDVQ---VEQATRAVWQALSDGKTRQILRLSLPLIGKTEIDDWPGG--------- 95

Query: 95  ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA 154
                 D +R K C  G M E  L G P    G   + + +D  D         +G +  
Sbjct: 96  ------DRQRYKAC--GPMVESVLRGNPAAKEGASIVEQILDESD--------AVGLMQL 139

Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
              + +DD  +++ P        +   + +   AG+R V L+NP  +      G+ Q  G
Sbjct: 140 QCKEAKDDASVMIFPTTDT----LSLQQQVAAGAGSRLVALVNPAWRS-SSDFGLFQA-G 193

Query: 215 RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
             K  E  + FE  Y  + L   G    ++      YP  + L+    +  G  K  +L 
Sbjct: 194 EAK--ETLSEFEVTYSLQSLVVYGYTLRLL----YCYPGPWRLFL---IGEGSGKRELLG 244

Query: 275 TFTEKPSSDDINDAFL----GKPRNREK 298
           ++ + P ++D  ++ +    GKP   E+
Sbjct: 245 SWDQSPPTNDQVESAVFDKKGKPNASER 272


>gi|422294314|gb|EKU21614.1| hypothetical protein NGA_0177102 [Nannochloropsis gaditana CCMP526]
          Length = 375

 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 31/116 (26%), Positives = 53/116 (45%), Gaps = 10/116 (8%)

Query: 178 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 237
           + D++A+  A  T+P+I  N  L+   G  G+     +D    + ++ +  Y  R   YA
Sbjct: 221 LPDLEALHQADPTKPLIFFNLNLETHRGDLGLPAFPSKDLHYRFLSNIKPVYLLRTRQYA 280

Query: 238 GT--QYPIM----GALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 287
            T  + P +    GAL  TYP GY+      + +G  +Y  + T  E+P+     D
Sbjct: 281 QTLSRPPFILNYQGALFRTYPGGYQCM----LDTGNGRYRRVETSRERPALSGFKD 332


>gi|297812961|ref|XP_002874364.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320201|gb|EFH50623.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 340

 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 60/249 (24%), Positives = 99/249 (39%), Gaps = 43/249 (17%)

Query: 59  VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 118
           VN G TR  VE+ IP+L    D      L +L RV           +   ++ +  E   
Sbjct: 110 VNSGGTRAIVELLIPQLQFLDDEGAQAELWDLSRVF----------LDTLIKETGCERVK 159

Query: 119 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCII 178
           A  P   AG   +L++  W D     TF          V+++D++ ++V P   +   + 
Sbjct: 160 AVFP--DAGAAALLKYR-WKD----ATFGFASLSDRKPVEKEDEIIVMVVPDYQMLEYVE 212

Query: 179 DDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG-----RDKRLEYAASFENCYFFRL 233
              K + D    RP+I+ NPRL        I + +G     R  R  + +SF   Y  R 
Sbjct: 213 KIAKGLADDP-PRPLIMWNPRL--------ISEEVGVGFNVRKLRRYFLSSFTTVYSMRP 263

Query: 234 LYYAGTQYPIMGALRMTYPNGYEL-YKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 292
           L          GA+   YP  +++ Y   D P    +Y++      +P ++D+   + G 
Sbjct: 264 L--------AAGAVFRCYPGKWKVFYDNKDRPG---RYLLAKELIGRPDAEDLEIIYGGV 312

Query: 293 PRNREKKAS 301
               E+  S
Sbjct: 313 EEKSEEGPS 321


>gi|52079804|ref|YP_078595.1| hypothetical protein BL03758 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|319646386|ref|ZP_08000616.1| hypothetical protein HMPREF1012_01652 [Bacillus sp. BT1B_CT2]
 gi|404488678|ref|YP_006712784.1| hypothetical protein BLi01406 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|52003015|gb|AAU22957.1| hypothetical protein BL03758 [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|52347676|gb|AAU40310.1| putative membrane protein [Bacillus licheniformis DSM 13 = ATCC
           14580]
 gi|317392136|gb|EFV72933.1| hypothetical protein HMPREF1012_01652 [Bacillus sp. BT1B_CT2]
          Length = 249

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 5/50 (10%)

Query: 93  VIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGA 142
           ++ALSF      V VC+   +  GAL+ +PL LAGT  + ++M+W   GA
Sbjct: 175 IMALSFL-----VIVCLSFRLKAGALSILPLLLAGTHFLFQYMNWDSAGA 219


>gi|123966201|ref|YP_001011282.1| hypothetical protein P9515_09681 [Prochlorococcus marinus str. MIT
           9515]
 gi|123200567|gb|ABM72175.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9515]
          Length = 215

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 66/140 (47%), Gaps = 32/140 (22%)

Query: 153 GANE---VDEQDDMFILVAPQN---AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 206
           G NE   +D+ D + I + PQ     +   + ++ K +  +        +NP+ +D   +
Sbjct: 97  GFNESIYIDQDDSIVISILPQPYDFGLFEPMCENFKGLHYS--------LNPKFED--AN 146

Query: 207 SGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIM-GALRMTYPNGYELYKRVDVPS 265
            GI   + R++R  +  +++N YF +         PI  GAL  +YPN + L+K ++   
Sbjct: 147 IGIGSVI-RERRKNFVKTWQNIYFLQ---------PINNGALMHSYPNNWLLFKEIN--- 193

Query: 266 GGEKYVILSTFTEKPSSDDI 285
              KY+    F EKP ++ I
Sbjct: 194 --NKYLFKKEFEEKPDNETI 211


>gi|324998739|ref|ZP_08119851.1| UDP-N-acetylmuramate--L-alanine ligase [Pseudonocardia sp. P1]
          Length = 478

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 55/130 (42%), Gaps = 5/130 (3%)

Query: 68  VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM-PLQLA 126
           V   I E NPE+   R   L  + R +AL+   +G+R+ V V G+ G+ +   M  + L 
Sbjct: 78  VSTAIRETNPELAAARDQGLQVVHRAVALAALTEGRRL-VAVAGTHGKTSTTSMVTVALQ 136

Query: 127 GTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNA-VGNCIIDDMKA 183
            T     F   GD    G+    GS  +   E DE D  F   AP  A V N   D +  
Sbjct: 137 HTGADPSFAIGGDLSVSGSGAHHGSGDVFVVEADESDGSFTAFAPSVAVVTNVEPDHLDH 196

Query: 184 MTDAAGTRPV 193
             D A  R V
Sbjct: 197 HGDEATYRAV 206


>gi|284991684|ref|YP_003410238.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
           43160]
 gi|284064929|gb|ADB75867.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
           43160]
          Length = 489

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 48/111 (43%), Gaps = 8/111 (7%)

Query: 68  VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM---PLQ 124
           V   I E NPE+   R   L  L R +AL+    GKR  V V G+ G+ +   M    +Q
Sbjct: 84  VSTAIREDNPELAAARERGLRVLPRAVALAAVMAGKR-SVAVAGTHGKTSTTSMLTVAVQ 142

Query: 125 LAGTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNAV 173
             G      F   GD    G+    G   +   E DE D  F+L+AP  A+
Sbjct: 143 ACGVDP--SFAIGGDLNESGSNAHAGGGDVFVAEADESDRSFLLLAPHGAI 191


>gi|255087973|ref|XP_002505909.1| predicted protein [Micromonas sp. RCC299]
 gi|226521180|gb|ACO67167.1| predicted protein [Micromonas sp. RCC299]
          Length = 299

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 47/101 (46%), Gaps = 8/101 (7%)

Query: 196 INPRLKDLPGSSGIMQTMG----RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 251
           IN +++     S ++  +G    R K  E+ ASFE  Y+       G    ++     +Y
Sbjct: 169 INNQIRTNDDGSNLISDLGIGPWRKKNEEFLASFELTYWISEQRIQGETVRLL----KSY 224

Query: 252 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 292
           P+ ++L+   D+ SG  +   + TF EKPS  ++    + +
Sbjct: 225 PHPWQLFVLTDMASGSAEPECIKTFDEKPSYKELESLLMSR 265


>gi|356576779|ref|XP_003556507.1| PREDICTED: uncharacterized protein LOC100782973 isoform 1 [Glycine
           max]
          Length = 340

 Score = 38.5 bits (88), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 57/270 (21%), Positives = 112/270 (41%), Gaps = 45/270 (16%)

Query: 51  VLQATQRA---VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK--RV 105
           V Q+++ A   V+ G  R  VE+ IP+L    D      L EL R+   +  +  K  RV
Sbjct: 98  VRQSSEAAAIFVSSGGIRAIVELLIPQLQFLDDEGAQVELWELSRIFLDTLIEKTKFQRV 157

Query: 106 KVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMF 164
           K                   AG   +L+      Y  K       S+   + V+  D++ 
Sbjct: 158 KAIFPD--------------AGAAALLK------YRWKDALFRFASLSDRKPVESDDEVV 197

Query: 165 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 224
           +++ P   +   +      +++     P+I+ NPRL  +    G+   + + +R+ + ++
Sbjct: 198 VMIVPDYQMLEYVERIASNLSNIPEPTPLIMWNPRL--ISEDVGVGFNVRKLRRV-FLST 254

Query: 225 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 284
           F   YF R + +        GA+   YP  ++++   D     ++Y++   F  +P ++D
Sbjct: 255 FTTVYFMRPMPF--------GAIFRCYPGLWKVFS--DDKERPDRYLLAKEFEIRPDAED 304

Query: 285 INDAFLGKPRNREK------KASGFWGFLS 308
           I   F  + +  E+      KA+G +  +S
Sbjct: 305 IELLFADEEQKSEQGQGLLDKAAGIFSSIS 334


>gi|300788096|ref|YP_003768387.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
           U32]
 gi|384151526|ref|YP_005534342.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
           S699]
 gi|399539979|ref|YP_006552641.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
           S699]
 gi|299797610|gb|ADJ47985.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
           U32]
 gi|340529680|gb|AEK44885.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
           S699]
 gi|398320749|gb|AFO79696.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
           S699]
          Length = 465

 Score = 38.1 bits (87), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 4/109 (3%)

Query: 68  VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM-PLQLA 126
           V   I E NPE+   R   +  L R  AL+   +G RV  C+ G+ G+ +   M  + L 
Sbjct: 77  VSTAIKETNPELAAARAAGIPVLHRAQALAGLMEGHRVA-CIAGTHGKTSTTSMLTVALQ 135

Query: 127 GTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQNAV 173
             R    F   GD    G     G  G    E DE D  F+  +P  AV
Sbjct: 136 HCRLDPSFAIGGDLNESGANAHHGEGGVFVAEADESDGSFLTYSPSVAV 184


>gi|383830926|ref|ZP_09986015.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
           xinjiangensis XJ-54]
 gi|383463579|gb|EID55669.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
           xinjiangensis XJ-54]
          Length = 474

 Score = 38.1 bits (87), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 53/126 (42%), Gaps = 9/126 (7%)

Query: 56  QRAVN-----DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 110
           QRA N     D  + + V   I E NPE+ V R   +  L R  AL+   +G RV  C+ 
Sbjct: 66  QRAENLDVFDDAPSAVVVSTAIKETNPELVVARERGVPVLHRAEALAALMEGHRVA-CIA 124

Query: 111 GSMGEGALAGM-PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG--ANEVDEQDDMFILV 167
           G+ G+ +   M  + L   R    F   GD    G     G  G    E DE D  F+  
Sbjct: 125 GTHGKTSTTSMLTVALQHCRLDPSFAIGGDLNESGANAHHGEGGIFVAEADESDGSFLSY 184

Query: 168 APQNAV 173
           +P  AV
Sbjct: 185 SPSVAV 190


>gi|418461004|ref|ZP_13032086.1| UDP-N-acetylmuramate--L-alanine ligase [Saccharomonospora azurea
           SZMC 14600]
 gi|359738959|gb|EHK87837.1| UDP-N-acetylmuramate--L-alanine ligase [Saccharomonospora azurea
           SZMC 14600]
          Length = 450

 Score = 37.4 bits (85), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 9/130 (6%)

Query: 52  LQATQRAVN-----DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVK 106
           ++  QRA N      G + + V   I E NPE+   R   +  L R  AL+   +G RV 
Sbjct: 62  IEVGQRAENLDVFDGGPSAVIVSTAIKETNPELAAARERDVPVLHRAEALAALMEGHRVA 121

Query: 107 VCVQGSMGEGALAGM-PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG--ANEVDEQDDM 163
            C+ G+ G+ +   M  + L   R    F   GD    G     G  G    E DE D  
Sbjct: 122 -CIAGTHGKTSTTSMLTVALQHCRLDPSFAIGGDLNESGANAHHGEGGIFVAEADESDGS 180

Query: 164 FILVAPQNAV 173
           F+   P  AV
Sbjct: 181 FLSYTPSVAV 190


>gi|72382164|ref|YP_291519.1| hypothetical protein PMN2A_0324 [Prochlorococcus marinus str.
           NATL2A]
 gi|72002014|gb|AAZ57816.1| conserved hypothetical protein [Prochlorococcus marinus str.
           NATL2A]
          Length = 206

 Score = 37.4 bits (85), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 21/131 (16%)

Query: 155 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 214
           N  D  + + ++ +PQ       I+  + +   A +  VI+IN +L+D     G   ++G
Sbjct: 93  NSTDLSNSILLVCSPQPYD----IEMFEQVCSHAKSN-VIMINGKLEDPIVGIG---SVG 144

Query: 215 RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILS 274
           R+ R  +A  ++  YF + L         MGAL   YPN +EL+K          Y  + 
Sbjct: 145 REMRKRFAKKWKVLYFVQPLS--------MGALLKRYPNDWELFKL-----NNNGYTFVK 191

Query: 275 TFTEKPSSDDI 285
           +F  +P  + I
Sbjct: 192 SFINRPDDETI 202


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.137    0.403 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,883,154,002
Number of Sequences: 23463169
Number of extensions: 212357855
Number of successful extensions: 432797
Number of sequences better than 100.0: 143
Number of HSP's better than 100.0 without gapping: 38
Number of HSP's successfully gapped in prelim test: 105
Number of HSP's that attempted gapping in prelim test: 432609
Number of HSP's gapped (non-prelim): 163
length of query: 311
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 169
effective length of database: 9,027,425,369
effective search space: 1525634887361
effective search space used: 1525634887361
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)