BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017996
(362 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255565097|ref|XP_002523541.1| adenylate kinase, putative [Ricinus communis]
gi|223537248|gb|EEF38880.1| adenylate kinase, putative [Ricinus communis]
Length = 597
Score = 546 bits (1406), Expect = e-153, Method: Compositional matrix adjust.
Identities = 265/362 (73%), Positives = 293/362 (80%), Gaps = 34/362 (9%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +I SLLSQ +++K + + + + +S NQ S SQD W+GIPTRLNNIPHSREI
Sbjct: 270 VVFEEISSLLSQVQKDKEEPVKSKNLIPKSESSYNQASVSQDKWRGIPTRLNNIPHSREI 329
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFYDDVLQATQRAVNDGR RLKVEI IPELNPEMDVYRIGTLMELVRV+ALSFADDGK
Sbjct: 330 RTYFYDDVLQATQRAVNDGRMRLKVEIAIPELNPEMDVYRIGTLMELVRVLALSFADDGK 389
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMG+GALAGMPLQL+GTRKILEFMDWGDYGA GTF+ IGSIGA EVDE DDM
Sbjct: 390 RVKVCVQGSMGQGALAGMPLQLSGTRKILEFMDWGDYGALGTFIRIGSIGAKEVDEDDDM 449
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCIIDD++AMTDAAG+RPVIL+NPRLKDLPGSSGIMQTMGR +RLEYAA
Sbjct: 450 FILVAPQNAVGNCIIDDLRAMTDAAGSRPVILVNPRLKDLPGSSGIMQTMGRAERLEYAA 509
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFENCY FRLLYY GTQYPIMGALRM+YP YELYKRVD+P+ GEKY ILSTF+EKP+ D
Sbjct: 510 SFENCYSFRLLYYGGTQYPIMGALRMSYPYHYELYKRVDLPARGEKYEILSTFSEKPTGD 569
Query: 301 DINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLSS 360
DINDAFLG+ RN++KKASGFWGFLS
Sbjct: 570 DINDAFLGR----------------------------------SRNQDKKASGFWGFLSG 595
Query: 361 VF 362
V
Sbjct: 596 VL 597
>gi|359484753|ref|XP_002264815.2| PREDICTED: uncharacterized protein LOC100256872 [Vitis vinifera]
Length = 597
Score = 542 bits (1397), Expect = e-152, Method: Compositional matrix adjust.
Identities = 265/361 (73%), Positives = 291/361 (80%), Gaps = 34/361 (9%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
VF I+SLLS+ +++K K + + SN SSSQ+NW+GIPTRLNNIPHSREIR
Sbjct: 271 VFKPIDSLLSKMQQDKENMMKLEKSIRASERLSNLASSSQENWRGIPTRLNNIPHSREIR 330
Query: 62 NYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKR 121
YFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV+ALSFADDGK
Sbjct: 331 TYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVLALSFADDGKH 390
Query: 122 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMF 181
VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF+ IGSIG+ EV+EQDD+F
Sbjct: 391 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFINIGSIGSKEVEEQDDLF 450
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
ILVAPQNAVGNCIIDD+KAMTDAAG RPVIL+NPRLKDLPGSSGIMQTMGRDKRLEYAAS
Sbjct: 451 ILVAPQNAVGNCIIDDLKAMTDAAGNRPVILVNPRLKDLPGSSGIMQTMGRDKRLEYAAS 510
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FENCYFFRLLYYAGTQYPIMGA+RM+YP YELYKRVD G EKY+ILST + +P+ D+
Sbjct: 511 FENCYFFRLLYYAGTQYPIMGAIRMSYPYRYELYKRVDESFGKEKYIILSTSSARPNIDE 570
Query: 302 INDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLSSV 361
INDAFLGKP R+ KKA+GFWGFLS +
Sbjct: 571 INDAFLGKP----------------------------------RSEAKKATGFWGFLSGI 596
Query: 362 F 362
Sbjct: 597 L 597
>gi|296084582|emb|CBI25603.3| unnamed protein product [Vitis vinifera]
Length = 652
Score = 526 bits (1355), Expect = e-147, Method: Compositional matrix adjust.
Identities = 254/327 (77%), Positives = 273/327 (83%), Gaps = 34/327 (10%)
Query: 36 QVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 95
Q S SQ+NW+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLKVEINIPELNPE
Sbjct: 360 QASLSQENWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 419
Query: 96 MDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 155
MDVYRIGTLMELVRV+ALSFADDGK VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG
Sbjct: 420 MDVYRIGTLMELVRVLALSFADDGKHVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 479
Query: 156 DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINP 215
DYGAKGTF+ IGSIG+ EV+EQDD+FILVAPQNAVGNCIIDD+KAMTDAAG RPVIL+NP
Sbjct: 480 DYGAKGTFINIGSIGSKEVEEQDDLFILVAPQNAVGNCIIDDLKAMTDAAGNRPVILVNP 539
Query: 216 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELY 275
RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGA+RM+YP YELY
Sbjct: 540 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAIRMSYPYRYELY 599
Query: 276 KRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHST 335
KRVD G EKY+ILST + +P+ D+INDAFLGKP
Sbjct: 600 KRVDESFGKEKYIILSTSSARPNIDEINDAFLGKP------------------------- 634
Query: 336 DSRVLLCTCRNREKKASGFWGFLSSVF 362
R+ KKA+GFWGFLS +
Sbjct: 635 ---------RSEAKKATGFWGFLSGIL 652
>gi|224144020|ref|XP_002325159.1| predicted protein [Populus trichocarpa]
gi|222866593|gb|EEF03724.1| predicted protein [Populus trichocarpa]
Length = 520
Score = 526 bits (1354), Expect = e-147, Method: Compositional matrix adjust.
Identities = 251/311 (80%), Positives = 279/311 (89%), Gaps = 13/311 (4%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +I+SLLSQ +R+++K + S QD W+GIPTRLNNIPHSREI
Sbjct: 223 VVFKEIDSLLSQVQRDEVKLVKSG-------------ISLQDKWRGIPTRLNNIPHSREI 269
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
RNYFYDDVLQATQRAVNDGRTRL+V+INIPELNPEMDVYRIGTLMELVRVIALSF+DDGK
Sbjct: 270 RNYFYDDVLQATQRAVNDGRTRLRVDINIPELNPEMDVYRIGTLMELVRVIALSFSDDGK 329
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
VKVCVQGSMG+GALAGMPLQLAGTR ILEFMDWGDYGAKGTF++IGSIG EV E+DDM
Sbjct: 330 HVKVCVQGSMGKGALAGMPLQLAGTRNILEFMDWGDYGAKGTFIKIGSIGEQEVSEEDDM 389
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCII+D+KAMT AAG+RPVILINP+LKDLPGSSGIMQTMGRDKRLEYAA
Sbjct: 390 FILVAPQNAVGNCIIEDLKAMTVAAGSRPVILINPKLKDLPGSSGIMQTMGRDKRLEYAA 449
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFE+CYFFRLLYYAGTQYPIMGALRM+YP YELYKRVD PSG EKY+ILSTF+EKPS+D
Sbjct: 450 SFESCYFFRLLYYAGTQYPIMGALRMSYPYRYELYKRVDQPSGTEKYMILSTFSEKPSTD 509
Query: 301 DINDAFLGKPR 311
D+NDAF+G+PR
Sbjct: 510 DVNDAFMGRPR 520
>gi|449493659|ref|XP_004159398.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101226743 [Cucumis sativus]
Length = 609
Score = 513 bits (1321), Expect = e-143, Method: Compositional matrix adjust.
Identities = 252/368 (68%), Positives = 287/368 (77%), Gaps = 41/368 (11%)
Query: 2 VFADIESLLSQAKREKMKKTTAVK-----P--VHERKSRSNQVSSSQDNWKGIPTRLNNI 54
+F ++ SLLSQ +++++ ++ P + E K NQ S S+D+W+GIPTRLNNI
Sbjct: 276 IFEELSSLLSQIQKKRLXSQVMLRFLWNSPQFLPENKGTVNQPSESRDSWRGIPTRLNNI 335
Query: 55 PHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALS 114
PHSREIR YFYDDVLQAT+RAV GRTRLKVE NIPELNPEMDVYRIGTLMELVR IALS
Sbjct: 336 PHSREIRKYFYDDVLQATKRAVQAGRTRLKVENNIPELNPEMDVYRIGTLMELVRTIALS 395
Query: 115 FADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEV 174
FADDG+RVKVC+QGSMGEGAL+GMPLQLAGTR+ILE+MDWG+YGA GTFV+IGSIGA EV
Sbjct: 396 FADDGRRVKVCIQGSMGEGALSGMPLQLAGTRRILEYMDWGEYGALGTFVKIGSIGAKEV 455
Query: 175 DEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDK 234
D++DDMFILVAPQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKDLPGSSGIMQTMGR+K
Sbjct: 456 DDEDDMFILVAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDLPGSSGIMQTMGREK 515
Query: 235 RLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFT 294
RLEYAASFE CYFFRLLYYAGTQYPIMGALRM+YP GYELY+RVD PSG EKY+ LS +
Sbjct: 516 RLEYAASFEICYFFRLLYYAGTQYPIMGALRMSYPYGYELYRRVDEPSGKEKYIELSKYP 575
Query: 295 EKPSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGF 354
++PS+DDINDAF G R K +SG
Sbjct: 576 KRPSTDDINDAFQGNK----------------------------------RKEAKSSSGI 601
Query: 355 WGFLSSVF 362
WGFLS +
Sbjct: 602 WGFLSGIL 609
>gi|356556590|ref|XP_003546607.1| PREDICTED: uncharacterized protein LOC100818830 [Glycine max]
Length = 591
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 254/361 (70%), Positives = 284/361 (78%), Gaps = 47/361 (13%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
VF +IESLLSQ +++K+K + + QD W+GIPTRLNNIPHSREIR
Sbjct: 278 VFKEIESLLSQLQQDKVKIIESGGMI-------------QDKWRGIPTRLNNIPHSREIR 324
Query: 62 NYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKR 121
YFYDDVLQATQRA++DG+TR+KV+INIPELNPEMDVYRIGTLMELVR +ALSFADDGKR
Sbjct: 325 KYFYDDVLQATQRAISDGKTRVKVDINIPELNPEMDVYRIGTLMELVRALALSFADDGKR 384
Query: 122 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMF 181
VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF+ IGSIGA EV+EQDDMF
Sbjct: 385 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFINIGSIGAAEVEEQDDMF 444
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
ILVAPQNAVGNCIIDD++AMT+AA RPVILIN RLKDLPGSSGIMQT+GRD+RL+YAAS
Sbjct: 445 ILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARLKDLPGSSGIMQTIGRDQRLKYAAS 504
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FE+CYFFRLLYYAGTQYPIMGA+RMTYP Y+LYKRVD G EKYVILSTF E+P++D+
Sbjct: 505 FESCYFFRLLYYAGTQYPIMGAIRMTYPYQYDLYKRVDESPGKEKYVILSTFPERPTTDE 564
Query: 302 INDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLSSV 361
INDAF GKP S D R K SG WGFLS +
Sbjct: 565 INDAFQGKP-----------------------SKDPR-----------KTSGIWGFLSGI 590
Query: 362 F 362
Sbjct: 591 L 591
>gi|449443265|ref|XP_004139400.1| PREDICTED: uncharacterized protein LOC101203230 [Cucumis sativus]
Length = 577
Score = 506 bits (1304), Expect = e-141, Method: Compositional matrix adjust.
Identities = 249/361 (68%), Positives = 279/361 (77%), Gaps = 48/361 (13%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
+F ++ SLLSQ ++EK K+ +S D+W+GIPTRLNNIPHSREIR
Sbjct: 265 IFEELSSLLSQIQKEKAMKSGN--------------ASFSDSWRGIPTRLNNIPHSREIR 310
Query: 62 NYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKR 121
YFYDDVLQAT+RAV GRTRLKVE NIPELNPEMDVYRIGTLMELVR IALSFADDG+R
Sbjct: 311 KYFYDDVLQATKRAVQAGRTRLKVENNIPELNPEMDVYRIGTLMELVRTIALSFADDGRR 370
Query: 122 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMF 181
VKVC+QGSMGEGAL+GMPLQLAGTR+ILE+MDWG+YGA GTFV+IGSIGA EVD++DDMF
Sbjct: 371 VKVCIQGSMGEGALSGMPLQLAGTRRILEYMDWGEYGALGTFVKIGSIGAKEVDDEDDMF 430
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
ILVAPQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKDLPGSSGIMQTMGR+KRLEYAAS
Sbjct: 431 ILVAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDLPGSSGIMQTMGREKRLEYAAS 490
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FE CYFFRLLYYAGTQYPIMGALRM+YP GYELY+RVD PSG EKY+ LS + ++PS+DD
Sbjct: 491 FEICYFFRLLYYAGTQYPIMGALRMSYPYGYELYRRVDEPSGKEKYIELSKYPKRPSTDD 550
Query: 302 INDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLSSV 361
INDAF G R K +SG WGFLS +
Sbjct: 551 INDAFQGNK----------------------------------RKEAKSSSGIWGFLSGI 576
Query: 362 F 362
Sbjct: 577 L 577
>gi|356532999|ref|XP_003535056.1| PREDICTED: uncharacterized protein LOC100796331 [Glycine max]
Length = 1016
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 244/325 (75%), Positives = 267/325 (82%), Gaps = 34/325 (10%)
Query: 38 SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 97
S+ DNW+GIPTRLNNIPHSREIR YFYDDVLQATQRA+NDG+TR+KV+INIPELNPEMD
Sbjct: 726 SAEADNWRGIPTRLNNIPHSREIRKYFYDDVLQATQRAINDGKTRVKVDINIPELNPEMD 785
Query: 98 VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 157
VYRIGTLMELVR +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY
Sbjct: 786 VYRIGTLMELVRTLALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 845
Query: 158 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 217
GAKGTF+ IGSIGA EV+E+DDMFILVAPQNAVGNCIIDD++AMT+AA RPVILIN RL
Sbjct: 846 GAKGTFINIGSIGAAEVEEEDDMFILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARL 905
Query: 218 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 277
KDLPGSSGIMQTMGRD+RL+YAASFE+CYFFRLLYYAGTQYPIMGA+RMTYP YELYKR
Sbjct: 906 KDLPGSSGIMQTMGRDQRLKYAASFESCYFFRLLYYAGTQYPIMGAIRMTYPYQYELYKR 965
Query: 278 VDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDS 337
VD G EKY ILSTF E+P++D+INDAF GKP S D
Sbjct: 966 VDESPGKEKYAILSTFPERPTTDEINDAFEGKP-----------------------SKDP 1002
Query: 338 RVLLCTCRNREKKASGFWGFLSSVF 362
R K SG WGFL+ +
Sbjct: 1003 R-----------KTSGIWGFLNGIL 1016
>gi|22327339|ref|NP_198367.2| adenylate kinase family protein [Arabidopsis thaliana]
gi|17979434|gb|AAL49859.1| putative adenylate kinase [Arabidopsis thaliana]
gi|22136756|gb|AAM91697.1| putative adenylate kinase [Arabidopsis thaliana]
gi|110740593|dbj|BAE98401.1| adenylate kinase -like protein [Arabidopsis thaliana]
gi|332006553|gb|AED93936.1| adenylate kinase family protein [Arabidopsis thaliana]
Length = 588
Score = 483 bits (1243), Expect = e-134, Method: Compositional matrix adjust.
Identities = 236/362 (65%), Positives = 276/362 (76%), Gaps = 47/362 (12%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF + ++LLSQ + ++M KT PV QD W+GIPTRLNNIPHSR+I
Sbjct: 273 VVFEETQTLLSQIQLKRMIKTDKASPV-------------QDKWRGIPTRLNNIPHSRDI 319
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY+DVLQAT R++ DG TRL+V+INIPELNPEMDVYRIGTLMELV+ +ALSFADDGK
Sbjct: 320 RAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPEMDVYRIGTLMELVQALALSFADDGK 379
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWGD GTFV++G+IG EVDE+DDM
Sbjct: 380 RVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWGDDETLGTFVKLGAIGGKEVDEEDDM 439
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP SSGIMQTMGR++RLEYA
Sbjct: 440 FILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLPASSGIMQTMGREQRLEYAL 499
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
+F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EKYV+L+T+ E+P+ +
Sbjct: 500 TFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEKYVLLATYAERPTPE 559
Query: 301 DINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLSS 360
I+DAF GK R++ KKASG WGFLSS
Sbjct: 560 QIDDAFSGK----------------------------------SRDQSKKASGIWGFLSS 585
Query: 361 VF 362
VF
Sbjct: 586 VF 587
>gi|297801048|ref|XP_002868408.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314244|gb|EFH44667.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
Length = 595
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 239/365 (65%), Positives = 281/365 (76%), Gaps = 42/365 (11%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQV---SSSQDNWKGIPTRLNNIPHS 57
+VF + ++LLSQ + ++M KT +R S + V S QD W+GIPTRLNNIPHS
Sbjct: 269 VVFEETQTLLSQIQLKRMIKTG-----KKRDSGTPLVYKASPVQDKWRGIPTRLNNIPHS 323
Query: 58 REIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD 117
R+IR YFY+DVLQAT R++ DG TRL+V+INIPELNPEMDVYRIGTLMELVRV+ALSFAD
Sbjct: 324 RDIRAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPEMDVYRIGTLMELVRVLALSFAD 383
Query: 118 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ 177
DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWGD GTFV++G+IG EVDE+
Sbjct: 384 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWGDDETLGTFVKLGAIGGKEVDEE 443
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 237
DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP SSGIMQTMGR++RLE
Sbjct: 444 DDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLPASSGIMQTMGREQRLE 503
Query: 238 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
YA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EKYV+L+T++E+P
Sbjct: 504 YALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEKYVLLATYSERP 563
Query: 298 SSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGF 357
+ + I+DAF GK R++ KK SG WGF
Sbjct: 564 TPEQIDDAFSGK----------------------------------SRDQSKKPSGIWGF 589
Query: 358 LSSVF 362
LSSVF
Sbjct: 590 LSSVF 594
>gi|357145496|ref|XP_003573662.1| PREDICTED: uncharacterized protein LOC100845101 [Brachypodium
distachyon]
Length = 596
Score = 478 bits (1229), Expect = e-132, Method: Compositional matrix adjust.
Identities = 232/366 (63%), Positives = 279/366 (76%), Gaps = 44/366 (12%)
Query: 1 MVFADIESLL----SQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF +IESLL + A +K+ KT + S ++S++ W+GIPTRLNNIPH
Sbjct: 271 VVFQEIESLLQKICTNASEDKLAKTNG------KSSDFVGTTTSKNEWRGIPTRLNNIPH 324
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFYDDVLQAT+RA+ D +TRL+++INIPELNPEMDVYRIGTLMELVR I+L+FA
Sbjct: 325 SREIRKYFYDDVLQATKRAIEDKKTRLQIDINIPELNPEMDVYRIGTLMELVREISLAFA 384
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE 176
DDGKRVKV VQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGAKG F+ IG++GA+EVD+
Sbjct: 385 DDGKRVKVSVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAKGAFINIGAVGASEVDK 444
Query: 177 QDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRL 236
+DDMFIL+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL
Sbjct: 445 EDDMFILIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDMRL 504
Query: 237 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 296
+YAASFE CY FRLL+YAGT YPIMGALRM YPN YE+Y+RVD P+G EKYV+L+ +K
Sbjct: 505 KYAASFETCYSFRLLFYAGTFYPIMGALRMGYPNKYEIYRRVDEPNGKEKYVLLAELMDK 564
Query: 297 PSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWG 356
P++DDI +AF G P+ + +E SGFWG
Sbjct: 565 PTADDITNAFKG-PK---------------------------------KEKESAPSGFWG 590
Query: 357 FLSSVF 362
FLS +
Sbjct: 591 FLSGIL 596
>gi|79329010|ref|NP_001031966.1| adenylate kinase family protein [Arabidopsis thaliana]
gi|332006554|gb|AED93937.1| adenylate kinase family protein [Arabidopsis thaliana]
Length = 580
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 223/311 (71%), Positives = 261/311 (83%), Gaps = 13/311 (4%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF + ++LLSQ + ++M KT PV QD W+GIPTRLNNIPHSR+I
Sbjct: 273 VVFEETQTLLSQIQLKRMIKTDKASPV-------------QDKWRGIPTRLNNIPHSRDI 319
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY+DVLQAT R++ DG TRL+V+INIPELNPEMDVYRIGTLMELV+ +ALSFADDGK
Sbjct: 320 RAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPEMDVYRIGTLMELVQALALSFADDGK 379
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWGD GTFV++G+IG EVDE+DDM
Sbjct: 380 RVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWGDDETLGTFVKLGAIGGKEVDEEDDM 439
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP SSGIMQTMGR++RLEYA
Sbjct: 440 FILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLPASSGIMQTMGREQRLEYAL 499
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
+F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EKYV+L+T+ E+P+ +
Sbjct: 500 TFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEKYVLLATYAERPTPE 559
Query: 301 DINDAFLGKPR 311
I+DAF GK R
Sbjct: 560 QIDDAFSGKSR 570
>gi|326499868|dbj|BAJ90769.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326502206|dbj|BAJ95166.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 596
Score = 469 bits (1208), Expect = e-130, Method: Compositional matrix adjust.
Identities = 223/362 (61%), Positives = 275/362 (75%), Gaps = 40/362 (11%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +IES+L +K+ T+ + + + ++S+ W+GIPTRLNNIPHSREI
Sbjct: 275 VVFHEIESML-----QKICTNTSKNKLAKTNGKPPDTTASKKEWRGIPTRLNNIPHSREI 329
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+DVYRIGTLMELVR ++L+FADDGK
Sbjct: 330 RKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPELDVYRIGTLMELVRDLSLTFADDGK 389
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGA G F+ IG++GA+EVD++DDM
Sbjct: 390 RVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAMGAFINIGAVGASEVDKEDDM 449
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL+YAA
Sbjct: 450 FVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDLRLQYAA 509
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+RVD P+ GEKY +L+ FT P++D
Sbjct: 510 SFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYRRVDEPN-GEKYDLLAEFTANPTAD 568
Query: 301 DINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLSS 360
DI +AF+G T + +E SGFWGFLS
Sbjct: 569 DITNAFVG----------------------------------TKKKKENAPSGFWGFLSG 594
Query: 361 VF 362
+
Sbjct: 595 IL 596
>gi|326509295|dbj|BAJ91564.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 512
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 223/362 (61%), Positives = 275/362 (75%), Gaps = 40/362 (11%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +IES+L +K+ T+ + + + ++S+ W+GIPTRLNNIPHSREI
Sbjct: 191 VVFHEIESML-----QKICTNTSKNKLAKTNGKPPDTTASKKEWRGIPTRLNNIPHSREI 245
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+DVYRIGTLMELVR ++L+FADDGK
Sbjct: 246 RKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPELDVYRIGTLMELVRDLSLTFADDGK 305
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGA G F+ IG++GA+EVD++DDM
Sbjct: 306 RVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAMGAFINIGAVGASEVDKEDDM 365
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL+YAA
Sbjct: 366 FVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDLRLQYAA 425
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+RVD P+ GEKY +L+ FT P++D
Sbjct: 426 SFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYRRVDEPN-GEKYDLLAEFTANPTAD 484
Query: 301 DINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLSS 360
DI +AF+G T + +E SGFWGFLS
Sbjct: 485 DITNAFVG----------------------------------TKKKKENAPSGFWGFLSG 510
Query: 361 VF 362
+
Sbjct: 511 IL 512
>gi|297608344|ref|NP_001061454.2| Os08g0288200 [Oryza sativa Japonica Group]
gi|255678327|dbj|BAF23368.2| Os08g0288200, partial [Oryza sativa Japonica Group]
Length = 493
Score = 466 bits (1200), Expect = e-129, Method: Compositional matrix adjust.
Identities = 231/366 (63%), Positives = 276/366 (75%), Gaps = 44/366 (12%)
Query: 1 MVFADIESLLSQ----AKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF +I+SLL + A + KT KP S ++S++ ++GIPTRLNNIPH
Sbjct: 168 VVFNEIDSLLQKICENASFNMLAKTNG-KP-----QDSKDTTASKNEFRGIPTRLNNIPH 221
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFA
Sbjct: 222 SREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFA 281
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE 176
DDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD+
Sbjct: 282 DDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDK 341
Query: 177 QDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRL 236
+DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKD+PGSSG+MQTMGRD RL
Sbjct: 342 EDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDMPGSSGVMQTMGRDMRL 401
Query: 237 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 296
+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RVD P+G E+YV+L F EK
Sbjct: 402 KYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRVDEPNGQERYVLLEEFVEK 461
Query: 297 PSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWG 356
P+ D+I +AF +PR EK ASGFWG
Sbjct: 462 PTPDEITNAF--RPRK--------------------------------NENEKSASGFWG 487
Query: 357 FLSSVF 362
FLS +
Sbjct: 488 FLSGIL 493
>gi|38175634|dbj|BAD01341.1| putative Adenylate kinase, chloroplast [Oryza sativa Japonica
Group]
Length = 467
Score = 466 bits (1199), Expect = e-129, Method: Compositional matrix adjust.
Identities = 231/366 (63%), Positives = 276/366 (75%), Gaps = 44/366 (12%)
Query: 1 MVFADIESLLSQ----AKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF +I+SLL + A + KT KP S ++S++ ++GIPTRLNNIPH
Sbjct: 142 VVFNEIDSLLQKICENASFNMLAKTNG-KP-----QDSKDTTASKNEFRGIPTRLNNIPH 195
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFA
Sbjct: 196 SREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFA 255
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE 176
DDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD+
Sbjct: 256 DDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDK 315
Query: 177 QDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRL 236
+DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKD+PGSSG+MQTMGRD RL
Sbjct: 316 EDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDMPGSSGVMQTMGRDMRL 375
Query: 237 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 296
+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RVD P+G E+YV+L F EK
Sbjct: 376 KYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRVDEPNGQERYVLLEEFVEK 435
Query: 297 PSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWG 356
P+ D+I +AF +PR EK ASGFWG
Sbjct: 436 PTPDEITNAF--RPRK--------------------------------NENEKSASGFWG 461
Query: 357 FLSSVF 362
FLS +
Sbjct: 462 FLSGIL 467
>gi|293333213|ref|NP_001168735.1| uncharacterized protein LOC100382527 [Zea mays]
gi|223972711|gb|ACN30543.1| unknown [Zea mays]
Length = 599
Score = 466 bits (1198), Expect = e-129, Method: Compositional matrix adjust.
Identities = 225/363 (61%), Positives = 273/363 (75%), Gaps = 40/363 (11%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQV-SSSQDNWKGIPTRLNNIPHSRE 59
+VF I+ LL + + T+A K R + + S++ W+GIPTRLNNIPHSRE
Sbjct: 276 VVFQAIDFLLQKI----CENTSASKLTKTNGKRPDSADTGSKNEWRGIPTRLNNIPHSRE 331
Query: 60 IRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDG 119
IR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFADDG
Sbjct: 332 IREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFADDG 391
Query: 120 KRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDD 179
KRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGAKGTF+ IG++GA +VD++DD
Sbjct: 392 KRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAKGTFINIGAVGARDVDKEDD 451
Query: 180 MFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYA 239
MF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL+YA
Sbjct: 452 MFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDIRLKYA 511
Query: 240 ASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 299
ASFE CY FRLLYYAG+ YPIMGALRM YPN YE+Y+RVD +G E Y++++ FT KP+
Sbjct: 512 ASFETCYSFRLLYYAGSFYPIMGALRMAYPNKYEIYRRVDTLNGKENYILIAEFTNKPTP 571
Query: 300 DDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSRVLLCTCRNREKKASGFWGFLS 359
D+I AF G+ R EK +SGFWGFLS
Sbjct: 572 DEITSAFKGQKR-----------------------------------EEKASSGFWGFLS 596
Query: 360 SVF 362
+F
Sbjct: 597 GLF 599
>gi|10257489|dbj|BAB11193.1| unnamed protein product [Arabidopsis thaliana]
Length = 337
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 213/330 (64%), Positives = 249/330 (75%), Gaps = 48/330 (14%)
Query: 30 RKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINI 89
++ +++ S QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+INI
Sbjct: 8 KQVLADKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDINI 67
Query: 90 PELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 149
PELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL
Sbjct: 68 PELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 127
Query: 150 EFMDWGDYGAKGTFVEIGSI---------------------------------------- 169
E+MDWGD GTFV++G+I
Sbjct: 128 EYMDWGDDETLGTFVKLGAIVTRESLNLQKLHHQQNIVVSIIKLVMHNKVGYIELHIILL 187
Query: 170 --------GANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 221
G EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP
Sbjct: 188 FDYVFAMTGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLP 247
Query: 222 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 281
SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+
Sbjct: 248 ASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEE 307
Query: 282 SGGEKYVILSTFTEKPSSDDINDAFLGKPR 311
+G EKYV+L+T+ E+P+ + I+DAF GK R
Sbjct: 308 NGKEKYVLLATYAERPTPEQIDDAFSGKSR 337
>gi|302761066|ref|XP_002963955.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
gi|300167684|gb|EFJ34288.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
Length = 575
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/322 (57%), Positives = 234/322 (72%), Gaps = 18/322 (5%)
Query: 2 VFADIESLLSQAKREKMKKTTAV-KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
VF DI S L +++K T V KP QD+W+G+PT+LN+IPHSREI
Sbjct: 268 VFGDISSFLD----DRIKAATKVSKP-------------KQDSWRGMPTKLNSIPHSREI 310
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY +V + + AV DG +LKVEI IPELNPEMDVYR GTL+ELVR +A +FA+DGK
Sbjct: 311 REYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYRAGTLLELVRELAFTFANDGK 370
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQG MGEG GMPLQL+G+R +LE MDWGDYGAKGTFV IG+I A++V E DDM
Sbjct: 371 RVKVCVQGPMGEGIFRGMPLQLSGSRMMLEVMDWGDYGAKGTFVNIGAIAASDVSETDDM 430
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FIL+APQNAVGNCII+D++AM AAG RPVIL+NP+LKD+PG+ G+MQ +GR++R+E++
Sbjct: 431 FILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDVPGAGGVMQVIGREQRMEFSD 490
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SF CY FRLLY AGT YPIMGALRM YP YE++KR+D E Y+ +TF +P+
Sbjct: 491 SFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDTGYSSETYIQEATFPGRPTPT 550
Query: 301 DINDAFLGKPRYVDYMDRILDY 322
++ D F G+ + I +
Sbjct: 551 ELQDVFAGRSKEASKTKSIWSF 572
>gi|302769091|ref|XP_002967965.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
gi|300164703|gb|EFJ31312.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
Length = 575
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/322 (57%), Positives = 234/322 (72%), Gaps = 18/322 (5%)
Query: 2 VFADIESLLSQAKREKMKKTTAV-KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
VF DI S L + +K T V KP QD+W+G+PT+LN+IPHSREI
Sbjct: 268 VFGDISSFLD----DHIKAATKVSKP-------------KQDSWRGMPTKLNSIPHSREI 310
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY +V + + AV DG +LKVEI IPELNPEMDVYR GTL+ELVR +A +FA+DGK
Sbjct: 311 REYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYRAGTLLELVRELAFTFANDGK 370
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQG MGEG GMPLQL+G+R++LE MDWGDYGAKGTFV IG+I A++V E DDM
Sbjct: 371 RVKVCVQGPMGEGIFRGMPLQLSGSRRMLEVMDWGDYGAKGTFVNIGAIAASDVFETDDM 430
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FIL+APQNAVGNCII+D++AM AAG RPVIL+NP+LKD+PG+ G+MQ +GR++R+E++
Sbjct: 431 FILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDVPGAGGVMQVIGREQRMEFSD 490
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SF CY FRLLY AGT YPIMGALRM YP YE++KR+D E Y+ +TF +P+
Sbjct: 491 SFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDTGYSSETYIQEATFAGRPTPT 550
Query: 301 DINDAFLGKPRYVDYMDRILDY 322
++ D F G+ + I +
Sbjct: 551 ELQDVFAGRSKEASKTKSIWSF 572
>gi|168018045|ref|XP_001761557.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687241|gb|EDQ73625.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 264
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 162/264 (61%), Positives = 211/264 (79%), Gaps = 1/264 (0%)
Query: 44 WKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGT 103
W+G+PT+LN PHSREIR YFY D QA + A++DG+ +LKV + IPELNPEMDVYRIGT
Sbjct: 1 WRGVPTKLNTCPHSREIRQYFYKDACQAAKNAIDDGKHKLKVLLTIPELNPEMDVYRIGT 60
Query: 104 LMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 163
L+EL+R +A +FA+DGKRV+VCVQG MGEG +GMPL L+GTR +LE MDWG+Y K TF
Sbjct: 61 LLELIRELAFTFANDGKRVRVCVQGPMGEGIFSGMPLALSGTRPMLERMDWGEYIEKDTF 120
Query: 164 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 223
+ G++GA EV + DDMFIL+APQNAVGNCIIDD+KAM +AAG RPVI++NP+LKD+P S
Sbjct: 121 IRFGAVGAKEVYDDDDMFILMAPQNAVGNCIIDDLKAMVEAAGDRPVIILNPKLKDIPAS 180
Query: 224 SGIMQTM-GRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPS 282
GIMQ + GR +R+E+A SF CY FRL+Y +GTQYPI+GA RM YP YE+Y+R D+
Sbjct: 181 GGIMQQVGGRQERMEFAESFFTCYNFRLMYTSGTQYPIVGAFRMAYPGPYEIYRRFDLDR 240
Query: 283 GGEKYVILSTFTEKPSSDDINDAF 306
G E Y ++++F +P+ +I+DA+
Sbjct: 241 GVEVYKLIASFDHEPTRSEISDAY 264
>gi|218200847|gb|EEC83274.1| hypothetical protein OsI_28627 [Oryza sativa Indica Group]
Length = 681
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 163/252 (64%), Positives = 200/252 (79%), Gaps = 16/252 (6%)
Query: 1 MVFADIESLLSQ----AKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF +I+SLL + A + KT KP S ++S++ ++GIPTRLNNIPH
Sbjct: 284 VVFNEIDSLLQKICENASSNMLAKTNG-KP-----QDSKDTTASKNEFRGIPTRLNNIPH 337
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFA
Sbjct: 338 SREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFA 397
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE 176
DDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD+
Sbjct: 398 DDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDK 457
Query: 177 QDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP------GSSGIMQTM 230
+DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLK GS G ++
Sbjct: 458 EDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKGCGVGVRGWGSRGGVKVN 517
Query: 231 GRDKRLEYAASF 242
G+++ ++YAA
Sbjct: 518 GKNRVVKYAADL 529
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/144 (52%), Positives = 90/144 (62%), Gaps = 34/144 (23%)
Query: 219 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 278
D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 572 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 631
Query: 279 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSR 338
D P+G E+YV+L F EKP+ D+I +AF +PR
Sbjct: 632 DEPNGQERYVLLEEFVEKPTPDEITNAF--RPRK-------------------------- 663
Query: 339 VLLCTCRNREKKASGFWGFLSSVF 362
EK ASGFWGFLS +
Sbjct: 664 ------NENEKSASGFWGFLSGIL 681
>gi|222640270|gb|EEE68402.1| hypothetical protein OsJ_26756 [Oryza sativa Japonica Group]
Length = 680
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 162/251 (64%), Positives = 197/251 (78%), Gaps = 14/251 (5%)
Query: 1 MVFADIESLLSQAKRE---KMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHS 57
+VF +I+SLL + M T KP S ++S++ ++GIPTRLNNIPHS
Sbjct: 283 VVFNEIDSLLQKICENASFNMLAKTNGKP-----QDSKDTTASKNEFRGIPTRLNNIPHS 337
Query: 58 REIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD 117
REIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFAD
Sbjct: 338 REIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFAD 397
Query: 118 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ 177
DGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD++
Sbjct: 398 DGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDKE 457
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP------GSSGIMQTMG 231
DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLK GS G + G
Sbjct: 458 DDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKGCGVGVRGWGSRGGGKVNG 517
Query: 232 RDKRLEYAASF 242
+++ ++YAA
Sbjct: 518 KNRVVKYAADL 528
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 75/144 (52%), Positives = 90/144 (62%), Gaps = 34/144 (23%)
Query: 219 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 278
D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 571 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 630
Query: 279 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRYVDYMDRILDYMDRILYHLLFHSTDSR 338
D P+G E+YV+L F EKP+ D+I +AF +PR
Sbjct: 631 DEPNGQERYVLLEEFVEKPTPDEITNAF--RPRK-------------------------- 662
Query: 339 VLLCTCRNREKKASGFWGFLSSVF 362
EK ASGFWGFLS +
Sbjct: 663 ------NENEKSASGFWGFLSGIL 680
>gi|384250870|gb|EIE24349.1| hypothetical protein COCSUDRAFT_65858 [Coccomyxa subellipsoidea
C-169]
Length = 291
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 121/267 (45%), Positives = 171/267 (64%), Gaps = 7/267 (2%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
IP RLNN+P SRE R +FY+DV +A ++AV L V++ +PE NPEMDVYRIGT++E
Sbjct: 18 IPQRLNNVPASRETRQFFYNDVCRAVEKAVAAEERLLSVKLTVPETNPEMDVYRIGTVLE 77
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 166
LVR + A DGKRVKVCVQ ++G+G G+PL L+G R+I+E MDWG+ FV
Sbjct: 78 LVRDLVTDLAADGKRVKVCVQQALGQGVFQGLPLSLSGVRRIMENMDWGEV---FEFVSF 134
Query: 167 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSS 224
G++G++ +D+ +++VAPQN VG+ I+ + M AA + +IL NP LKDLP +S
Sbjct: 135 GNVGSDHLDD-SAYYVIVAPQNVVGSTIMTKLLEMATAAVEQKKTLILFNPLLKDLPSAS 193
Query: 225 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNGYELYKRVDVPSG 283
G+M GR +R+E+A SF Y FRLLY + YPI GALR Y +E++KR D+
Sbjct: 194 GVMGVRGRRERMEFADSFIPAYHFRLLYSSSAMMYPIRGALRHVYGGPWEVFKREDMGKR 253
Query: 284 GEKYVILSTFTEKPSSDDINDAFLGKP 310
E+Y + +F +P + F +P
Sbjct: 254 TEEYRFVGSFDSQPDGSLLTPLFQEQP 280
>gi|413922034|gb|AFW61966.1| hypothetical protein ZEAMMB73_400224, partial [Zea mays]
Length = 446
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 114/174 (65%), Positives = 139/174 (79%), Gaps = 11/174 (6%)
Query: 1 MVFADIESLLSQ----AKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF I+ LL + K+ KT +P + + S++ W+GIPTRLNNIPH
Sbjct: 276 VVFQAIDFLLQKICENTSASKLTKTNGKRP-------DSADTGSKNEWRGIPTRLNNIPH 328
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFA
Sbjct: 329 SREIREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFA 388
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG 170
DDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGAKGTF+ IG++G
Sbjct: 389 DDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAKGTFINIGAVG 442
>gi|303283278|ref|XP_003060930.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457281|gb|EEH54580.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 354
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 113/233 (48%), Positives = 155/233 (66%), Gaps = 5/233 (2%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVN--DGRTRLKVEINIPELNPEMDVYRIGTL 104
IP LNN PH RE+R +FYDD + + RAV + TR+K ++ PELN DVYR+GTL
Sbjct: 85 IPIMLNNTPHPREVRRWFYDDCVTSVTRAVTGEEPMTRVKAKVEFPELNVNGDVYRVGTL 144
Query: 105 MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 164
+ELVR +A++ A DGKRV+VCVQGSMGEG G+PL L+G +IL+ MDWGD F+
Sbjct: 145 LELVRELAMTLAKDGKRVRVCVQGSMGEGVFQGLPLSLSGVARILDMMDWGD---ADEFI 201
Query: 165 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 224
G+I + +DD FIL+APQN VG ++ + AM +AAG RP+I+INP+L D+ +
Sbjct: 202 SRGAIDGDVPQPEDDFFILIAPQNIVGFAVLPRIIAMEEAAGDRPMIMINPKLDDIQSAG 261
Query: 225 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 277
+M GR +R EY +++ Y FRLLY +PI GA+R T+ +ELYK+
Sbjct: 262 NVMSIRGRGERREYVSTWREVYHFRLLYRKPYFFPIYGAMRYTHDGKWELYKK 314
>gi|307109260|gb|EFN57498.1| hypothetical protein CHLNCDRAFT_21254 [Chlorella variabilis]
Length = 396
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 171/314 (54%), Gaps = 55/314 (17%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
IPTRLN IPH R R +FY + A +RA++ G TR+ IPELN E DVYR+GTL+E
Sbjct: 76 IPTRLNTIPHERTTRQHFYREATTAIKRAIDAGETRVLARCTIPELNTEFDVYRVGTLLE 135
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 166
LVR +A S A DGKRVKVCVQ ++G+G GMPL L G +I++ MDWG+ KG
Sbjct: 136 LVREVATSLAADGKRVKVCVQQALGQGVFQGMPLSLNGVLRIMKQMDWGESAEKGVSSAR 195
Query: 167 GSI----------------------------------------------------GANEV 174
G + GA EV
Sbjct: 196 GGVVGCLFCVHTGRSHEEGMCACSAWLVHSSPCPVLTSLTCHPATAADRILTGNLGAAEV 255
Query: 175 DEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSSGIMQTMGR 232
E D F+L+APQN VG+ I+ + M AA +P++LINP+L D+ ++G+M GR
Sbjct: 256 -EDADAFVLIAPQNIVGHSIMPLLSEMVAAAEEAGKPMVLINPKLTDIQSAAGVMSVRGR 314
Query: 233 DKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 292
RL++ A+F Y FRLLY + +PIMGALR + +E+++RV++ G E+Y ++ +
Sbjct: 315 QDRLDFTATFGTAYHFRLLYKGVSMHPIMGALRHEHDGPWEVFRRVELGPGVEEYQLIGS 374
Query: 293 FTEKPSSDDINDAF 306
F +P+ I DAF
Sbjct: 375 FDREPTPPLITDAF 388
>gi|255078468|ref|XP_002502814.1| predicted protein [Micromonas sp. RCC299]
gi|226518080|gb|ACO64072.1| predicted protein [Micromonas sp. RCC299]
Length = 331
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 120/267 (44%), Positives = 164/267 (61%), Gaps = 8/267 (2%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDG--RTRLKVEINIPELNPEMDVYRIGTL 104
IP LNN PH RE+R +FYDD ++ AV +TR+K I PELN DVYRIGTL
Sbjct: 68 IPKMLNNTPHPREVRKWFYDDCRESVVAAVTGANPKTRVKAFIEFPELNVNGDVYRIGTL 127
Query: 105 MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 164
+ELVR +A++ A DGKRV+VCVQGSMG+G G+PL L+G +IL+ MDWGD F+
Sbjct: 128 LELVREVAMALAADGKRVRVCVQGSMGQGVFQGLPLSLSGVARILDMMDWGD---ADEFI 184
Query: 165 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 224
GSIG + D FIL++PQN VG ++ ++ M AAG RP+I+INP+L D+ +
Sbjct: 185 ARGSIGGDVPRPDDAYFILISPQNIVGYSVLPYLQEMEKAAGDRPIIMINPKLGDIQSAG 244
Query: 225 GIMQTMGRDKRLEYA-ASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSG 283
+M GR +R EY ++E Y FRLLY +PI GALR +ELYK+
Sbjct: 245 NVMSIRGRGERREYVEKTWEEVYHFRLLYKKPFFFPIYGALRYGTGGMWELYKKFGKME- 303
Query: 284 GEKYVILSTFTE-KPSSDDINDAFLGK 309
E++ +L T+ + +P + +I L K
Sbjct: 304 QEEWKLLRTYDDGEPGAAEITKRILAK 330
>gi|412991462|emb|CCO16307.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 117/278 (42%), Positives = 169/278 (60%), Gaps = 17/278 (6%)
Query: 46 GIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRT-RLKVEINIPELNPEMDVYRIGTL 104
+PT LNNIPH+REIR +FYDD ++ RA+ D R R+KV PE+N E DV+R GTL
Sbjct: 77 ALPTLLNNIPHTREIRRFFYDDATESVLRAIKDDRNVRMKVFTEFPEMNVEGDVFRAGTL 136
Query: 105 MELVRVIALSFA----DDG----KRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 156
+E+VR +A A DG +RVK+CVQGSMG G G+PL L+G ++++ MDW +
Sbjct: 137 LEMVRTMATELARMERKDGSGGKQRVKLCVQGSMGTGVFQGLPLALSGISRLIDLMDWEE 196
Query: 157 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 216
A+ + G+IG + + +D+FIL+ PQN VG I+ M+ M + AG RP+IL+NP+
Sbjct: 197 DVAER--ISNGAIGKDHAKDDEDVFILICPQNIVGYSILPYMEEMMEVAGNRPMILLNPK 254
Query: 217 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNG----- 271
L D+ + +M GR R+E+A S+E Y FRLLY +PI GALR +Y +
Sbjct: 255 LGDIQSAGNVMSIRGRQGRMEFANSWEEIYHFRLLYRKPFFFPIYGALRKSYSSAEEDDT 314
Query: 272 YELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 309
+ELYKR E Y ++ + ++P+ D + GK
Sbjct: 315 WELYKRFG-KMDDEYYQLMEVYKKEPNPDQMTKVIWGK 351
>gi|145349122|ref|XP_001418989.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579219|gb|ABO97282.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 290
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/263 (42%), Positives = 158/263 (60%), Gaps = 4/263 (1%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
+PT LNN+PH+R R +FY+D ++ AV R+K++ PELN DVYR+GT +E
Sbjct: 31 VPTLLNNVPHARATRRWFYEDATKSIVAAVTARERRIKIKSEFPELNVNGDVYRVGTTLE 90
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 166
LVR A + A DG RVK+CVQ SMG+G +PL L+G K+L MDW + G +G
Sbjct: 91 LVRETAAALARDGLRVKICVQRSMGQGVFQALPLSLSGVSKLLGMMDW-EPGTEGMIDPS 149
Query: 167 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGI 226
G IGA + E D FILV PQN VG ++ ++ M G +P+I+INPRL D+ + +
Sbjct: 150 GGIGAAD-GEGADCFILVQPQNIVGYSVLPYIEEMEAVVGDKPMIMINPRLDDVQSAGNV 208
Query: 227 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGGE 285
M GR +R++ A+++ Y FRLLY +PI GALR Y N +ELYKR E
Sbjct: 209 MSVRGRKERMDAVAAWKEVYHFRLLYRKPYFFPIYGALRFAYYENEWELYKRTGARE-LE 267
Query: 286 KYVILSTFTEKPSSDDINDAFLG 308
+YV+L T+ +P+ D++ G
Sbjct: 268 QYVLLGTYPSEPTPDEMTKKIWG 290
>gi|428174173|gb|EKX43070.1| hypothetical protein GUITHDRAFT_110795 [Guillardia theta CCMP2712]
Length = 584
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/273 (43%), Positives = 159/273 (58%), Gaps = 26/273 (9%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAVNDGR------TRLKVEINIPELNPEMDVYRIGTL 104
LN IP + EIR +FY DV A Q A+ D R RLK+E PELNPEMD +R+GT+
Sbjct: 322 LNEIPCNTEIREWFYRDVCDAVQVALQDSRFLESCAGRLKIECVPPELNPEMDSFRVGTM 381
Query: 105 MELVRVIALSFADD-GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 163
+E+VR IA+ G+RVKVC+QGS+GEG GMPL L G R+++E MDW G
Sbjct: 382 LEMVREIAIRVGVVMGRRVKVCIQGSLGEGIFTGMPLMLNGMRRVMESMDW--QARPGEM 439
Query: 164 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPG 222
E I ++FI++APQ+ VG + + + M +AAG R VILINPRLKD P
Sbjct: 440 YEGLLI---------NVFIIIAPQSMVGASVYEPLSQMVEAAGPKRTVILINPRLKDRPS 490
Query: 223 SSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNG--YELYKRVD 279
S +M GR +R+ +AASF + FRLLY T +PI GALRM N + L+KR
Sbjct: 491 SGNVMSVQGRSQRIAFAASFREIFHFRLLYTGTTFMFPIQGALRMNMANSPFWTLFKRET 550
Query: 280 VPS----GGEKYVILSTFTEKPSSDDINDAFLG 308
+ GE Y ++ F E+P ++ I + G
Sbjct: 551 TTTPDERTGESYQPIAVFAEEPKTEAITNMMNG 583
>gi|219124197|ref|XP_002182396.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406357|gb|EEC46297.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 406
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 166/284 (58%), Gaps = 28/284 (9%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAV---NDGR--TRLKVEINIPELNPEMDVYRIGTLM 105
LN +PH+R +R+YFY+ A AV ++G+ R+ + PE+NP MD YRIGT++
Sbjct: 121 LNQVPHNRYVRDYFYEQAALAVHDAVVLCSEGKCINRMLITSQFPEMNPSMDSYRIGTIL 180
Query: 106 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 159
E+VR I + A++ RV++CVQGSMG G GMP QL G KI++ MDW + G
Sbjct: 181 EMVRTIGIKLAEENLRVRICVQGSMGVGIFTGMPKQLNGVSKIIQMMDWQSGEGELNEGM 240
Query: 160 KGTFVEIGSIGA------------NEVDEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 205
G ++ G++G N V QDD+FIL+APQ+ VG + I+ ++ M +AA
Sbjct: 241 VGDYIRFGAVGPEHVLNEEKDKDDNVVQYQDDVFILIAPQSMVGTDSSIMPLLQGMVEAA 300
Query: 206 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 264
G RPVIL+NP L D ++G GR +R+++A SF+ Y F+ +Y +GT Y PI+GA+
Sbjct: 301 GNRPVILMNPDLTDKVSAAGQQSVRGRQQRIDFAESFQTVYHFQNIYISGTSYFPILGAI 360
Query: 265 RMTYPNGYEL-YKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 306
+P L ++R D G GE YV + P ++I DAF
Sbjct: 361 TKLHPKEPWLAHQRRDYADGEGEIYVPVLAGEVIPKGEEILDAF 404
>gi|299473139|emb|CBN78715.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 442
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 107/269 (39%), Positives = 162/269 (60%), Gaps = 9/269 (3%)
Query: 43 NWKGIPTR--LNNIPHSREIRNYFYDDVLQATQRAVNDGRT--RLKVEINIPELNPEMDV 98
++ G+P LNN+PHS+ RN+ Y +A AV D R+K+ +PELNP MD
Sbjct: 167 SYGGLPVNMLLNNVPHSKMARNFIYRGAAEALVLAVEDPTVPRRMKLVCTVPELNPAMDT 226
Query: 99 YRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYG 158
YR+GT++ELVR +A S AD G ++ +Q SMG+G +PL L+G I++ MDW D G
Sbjct: 227 YRVGTVLELVRELAYSMADVGLNTRILIQPSMGKGVFKSLPLALSGVMSIMQGMDWED-G 285
Query: 159 AKGTFVEIGSIG-ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 217
G+ V G +G ++V +DD+ ++++PQ+ G I + + A RPV++INP L
Sbjct: 286 LVGSHVNFGQLGNKDDVSAEDDVILVISPQSIAGYSIHPLLVDTVEHANGRPVVIINPNL 345
Query: 218 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQ-YPIMGALRMTYPN-GYELY 275
KD P S G+MQ GR +R+ +A SF++ Y FRLLY + +PI GA+ P+ Y ++
Sbjct: 346 KDRPSSGGVMQVRGRGERIAFAESFKDIYNFRLLYGSSVAFFPIKGAVMKAGPSEPYVVF 405
Query: 276 KRVDV-PSGGEKYVILSTFTEKPSSDDIN 303
R + S GE+YV L TF + P +++I
Sbjct: 406 DRRNTGKSKGEEYVPLGTFDDDPDAENIT 434
>gi|397643725|gb|EJK76041.1| hypothetical protein THAOC_02216 [Thalassiosira oceanica]
Length = 1862
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 115/284 (40%), Positives = 159/284 (55%), Gaps = 28/284 (9%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAV---NDG--RTRLKVEINIPELNPEMDVYRIGTLM 105
LN +PH+R +R YFY+ A AV + G R+K+++ PE+NP+MD YRIGTL+
Sbjct: 1576 LNQVPHNRYVRQYFYEMAADAALEAVVLCSAGFISNRMKLQVMFPEMNPQMDSYRIGTLL 1635
Query: 106 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 159
EL R IA+ FA+ RV+VCVQGS+G+G +P QL G IL+ MDW + G
Sbjct: 1636 ELTRTIAIRFAEQNLRVRVCVQGSIGQGIFTAIPKQLNGVATILKRMDWQSGEGEENEGM 1695
Query: 160 KGTFVEIGSIGANEV------------DEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 205
G FV G IG V +QDD+FIL+ PQN G + II + MT+A
Sbjct: 1696 VGDFVNFGHIGKEHVVNFKEGEGGEADTQQDDVFILICPQNMQGIESSIIPYLLEMTEAV 1755
Query: 206 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 264
G RP+ILINP L D S G GR R+++A SF+ + F+ +YY+GT Y PI+GA
Sbjct: 1756 GDRPMILINPDLVDKASSQGQQSVRGRQDRIDFANSFKTIFQFQNIYYSGTSYFPILGAS 1815
Query: 265 RMTYPNG-YELYKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 306
+ P + ++R D+ G GE YV + P S+ + F
Sbjct: 1816 TLLGPEEMWVSHQRRDLKGGKGEVYVPVYASESMPDSEVLMSTF 1859
>gi|308806305|ref|XP_003080464.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
gi|116058924|emb|CAL54631.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
Length = 284
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 106/264 (40%), Positives = 153/264 (57%), Gaps = 16/264 (6%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
IPT LNN+PH+RE R +FYDD +T AV G R+K++ P + P
Sbjct: 35 IPTLLNNVPHARETRRWFYDDACASTLAAVGAGERRMKIKTEFPGVEP------------ 82
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 166
L R +A + A DGKRVK+CVQ SMG+G + L L+G K+L MD+ + A+
Sbjct: 83 LTRAMAQTLARDGKRVKICVQRSMGQGVFQALTLSLSGVSKLLGMMDFEEDVAE-MIDPN 141
Query: 167 GSIGANE-VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSG 225
G IGA + ++ D +ILV PQN VG ++ + AM +A G +P+I+INP+L D+ +
Sbjct: 142 GGIGAEDGAEDGADAYILVQPQNIVGYSVLPYIAAMEEAVGDKPMIMINPKLDDIQSAGN 201
Query: 226 IMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGG 284
+M GR +R++ A++ Y FRLLY +PI GALR T Y N +ELYKRV +
Sbjct: 202 VMSIRGRQERMDAVAAWREIYHFRLLYRKPYFFPIYGALRYTYYENAWELYKRVGMRE-E 260
Query: 285 EKYVILSTFTEKPSSDDINDAFLG 308
E+YV+ ST+ +P D++ G
Sbjct: 261 EEYVLRSTYDSEPDPDEMTKQIWG 284
>gi|223997276|ref|XP_002288311.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
gi|220975419|gb|EED93747.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
Length = 231
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 141/230 (61%), Gaps = 16/230 (6%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAV---NDGR--TRLKVEINIPELNPEMDVYRIGTLM 105
LN +PH+R +R YFYD QA A ++G+ R+K+ + PE+NP MD YRIGTL+
Sbjct: 1 LNQVPHNRFVRKYFYDMASQAALDAAVLCSEGKLTNRMKMTVMFPEMNPSMDSYRIGTLL 60
Query: 106 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDW------GDYGA 159
EL R IA+ A+ RV+VCVQGSMG G G+P QL G +L+ MDW + G
Sbjct: 61 ELARSIAIKLAEQNLRVRVCVQGSMGVGIFTGIPKQLNGVSTLLQRMDWESEVGEENEGM 120
Query: 160 KGTFVEIGSIGA--NEVDEQDDMFILVAPQNAVG-NC-IIDDMKAMTDAAGTRPVILINP 215
G +V G +G N +QDD+F+++ PQN +G C II + M +AAG RPVI++NP
Sbjct: 121 VGDYVNFGFVGKEHNSAVQQDDVFLIICPQNMLGLECSIIQPLSEMVEAAGDRPVIILNP 180
Query: 216 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 264
L D S G GR +R+++A SFE + F+ +Y +GT Y PI+G++
Sbjct: 181 DLTDKISSQGQQSIRGRQQRIDFAESFETIFQFQNIYTSGTSYFPILGSV 230
>gi|62320102|dbj|BAD94280.1| adenylate kinase -like protein [Arabidopsis thaliana]
Length = 95
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 58/85 (68%), Positives = 72/85 (84%)
Query: 227 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEK 286
MQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EK
Sbjct: 1 MQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEK 60
Query: 287 YVILSTFTEKPSSDDINDAFLGKPR 311
YV+L+T+ E+P+ + I+DAF GK R
Sbjct: 61 YVLLATYAERPTPEQIDDAFSGKSR 85
>gi|147768155|emb|CAN78322.1| hypothetical protein VITISV_001685 [Vitis vinifera]
Length = 228
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 64/84 (76%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
VF I+SLLS+ +++K K + + SN SSSQ+ W+GIPTRLNNIPHSREIR
Sbjct: 144 VFKAIDSLLSKXQQDKENMMKLEKSIRASERLSNLASSSQEXWRGIPTRLNNIPHSREIR 203
Query: 62 NYFYDDVLQATQRAVNDGRTRLKV 85
YFYDDVLQATQRAVNDGRTRLKV
Sbjct: 204 TYFYDDVLQATQRAVNDGRTRLKV 227
>gi|22299764|ref|NP_683011.1| hypothetical protein tlr2221 [Thermosynechococcus elongatus BP-1]
gi|22295948|dbj|BAC09773.1| tlr2221 [Thermosynechococcus elongatus BP-1]
Length = 232
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 109/246 (44%), Gaps = 54/246 (21%)
Query: 79 GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 138
G+ RL++EI +P+LNP M L FA+ G +KV G ALA
Sbjct: 27 GKNRLQIEILLPDLNP----------MGLAAGYLPLFAELGSDLKVFF-ADAGSAALA-- 73
Query: 139 PLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIID 196
R+ +WG+ Y +G + I A+ D ++VAP +
Sbjct: 74 -------RR-----EWGETAYSVRGVNERLTPIEAS-----DRAIVIVAPTPVE----VT 112
Query: 197 DMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT 256
++ M AG RP IL+NPRL+D+ GI GR R + + E CY+ R L
Sbjct: 113 AIEQMCLTAGDRPFILLNPRLQDV-AVVGIGYA-GRQLRERFLNTLEPCYYLRPLAETVI 170
Query: 257 QYPIMGALRMTYPNGYELYK-RVDVPSGGEKYVILSTFTEKPSSDDINDAF--LGKPRYV 313
L YP +++++ R P+ L+ F ++P+S+DI A LGKPR
Sbjct: 171 -------LWRCYPQAWQIWQYRETAPT------CLAEFEQRPTSEDIERALSALGKPRAQ 217
Query: 314 DYMDRI 319
++ R+
Sbjct: 218 SFLSRL 223
>gi|427731149|ref|YP_007077386.1| hypothetical protein Nos7524_4017 [Nostoc sp. PCC 7524]
gi|427367068|gb|AFY49789.1| protein of unknown function (DUF1995) [Nostoc sp. PCC 7524]
Length = 246
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 116/252 (46%), Gaps = 52/252 (20%)
Query: 59 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N D + QA T+ A+ DG TRL+VE+ PEL M + F
Sbjct: 3 ELPNTLEDAIAQAREATKAALADGYTRLQVELLFPELK----------FMPVAEQFLPLF 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 173
++ R+K+ G ALA R+ DW D + + G I + ++
Sbjct: 53 SEYESRLKIFF-ADAGASALA---------RR-----DWADVPFQILDIGTGRIASIQSK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 233
V +D++F+ VAP + + ++ + + G RP++++NPRL+D PG+ GI R
Sbjct: 98 VQPEDEIFLFVAPTSVE----VPQVEKICEIIGDRPLVMLNPRLED-PGTVGIGYA-ARQ 151
Query: 234 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
R + + E+CY+ R P+ A+ YP +EL+ V+ +K+ L
Sbjct: 152 TRQRFISKIESCYYLR---------PVDDETAVFRCYPGLWELW--VENSGTWQKFAELP 200
Query: 292 TFTEKPSSDDIN 303
+KPS D+I+
Sbjct: 201 ---KKPSGDEID 209
>gi|428781510|ref|YP_007173296.1| hypothetical protein Dacsa_3447 [Dactylococcopsis salina PCC 8305]
gi|428695789|gb|AFZ51939.1| protein of unknown function (DUF1995) [Dactylococcopsis salina PCC
8305]
Length = 253
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 116/235 (49%), Gaps = 41/235 (17%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD-GKRVKVCVQG 128
Q+T A+ G TRL+VE+ +PE+ + + + + FADD G +KV
Sbjct: 17 QSTVAALEAGLTRLQVELKVPEIALKAE--------SIAKEFTDLFADDYGSGLKVLFPD 68
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQN 188
S G ALA +EF + D G++ T +E +V E+D +F++V+P +
Sbjct: 69 S-GAAALARR------NWSDVEF-NVNDLGSRNTPIE------KKVAEEDQIFLVVSP-S 113
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
AV + ++ + + AG RPVIL+ P+L+D+ + GI R R + ++ E+C++
Sbjct: 114 AVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLSTLESCFYL 168
Query: 249 RLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ L A AL YP+G++L+ + G +Y EKP DD++
Sbjct: 169 QPLDEA--------ALLKRYPSGWQLW----IEKGENQYEFFCEEVEKPVGDDLD 211
>gi|354566056|ref|ZP_08985229.1| protein of unknown function DUF1995-containing protein [Fischerella
sp. JSC-11]
gi|353546564|gb|EHC16012.1| protein of unknown function DUF1995-containing protein [Fischerella
sp. JSC-11]
Length = 250
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 104/230 (45%), Gaps = 45/230 (19%)
Query: 76 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 135
+ DG TRL+VE+ PEL LME+ + F ++KV + G G
Sbjct: 23 IADGYTRLQVELLFPELK----------LMEVAQQFLPMFEQYNDKLKVFFADAGGAG-- 70
Query: 136 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNC 193
L DW D K T + G + E V +D++F+ +AP G+
Sbjct: 71 -------------LARRDWTDVRFKITDIGTGRAASIESKVQPEDEIFLFIAP----GSP 113
Query: 194 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ ++ + G RP++++NP L+D G GI R R+ + ++ E+CY+ R
Sbjct: 114 EVPQLEKICAVIGDRPLVMLNPFLEDA-GKIGIGYA-ARQTRIHFLSTIESCYYLR---- 167
Query: 254 AGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ YP + R YP +E+ V + GE Y ++ +KPS DD++
Sbjct: 168 --SVYPDVALFR-CYPGLWEIL----VENNGE-YQKIAELPQKPSGDDLD 209
>gi|428311732|ref|YP_007122709.1| hypothetical protein Mic7113_3579 [Microcoleus sp. PCC 7113]
gi|428253344|gb|AFZ19303.1| protein of unknown function (DUF1995) [Microcoleus sp. PCC 7113]
Length = 249
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 115/235 (48%), Gaps = 43/235 (18%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+NDGRTRL+VE+ PE+ + I + I L F + G +KV +
Sbjct: 17 EATQSALNDGRTRLQVELVFPEIA--LQAQSIAQ-----QFIPL-FEEYGSGLKVLFPDT 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQN 188
G ALA R+ DWG+ K + V S N++ +D F+LVAP +
Sbjct: 69 -GAAALA---------RR-----DWGEVPFKISDVGTSRSPITNKIQAEDKAFLLVAP-S 112
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
AV + ++ + + AG RP +L+NP+L+D+ S + R R ++ E CY+
Sbjct: 113 AVE---VAQVETLCNLAGDRPCVLLNPQLEDI--SIVGIGMAARKLRERLLSTIEPCYYL 167
Query: 249 RLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
R + A A+ +YP ++++ +D ++Y +++ ++P+ D +
Sbjct: 168 RPIDPA--------AILRSYPGLWQVWLEID-----DEYQLIAEEPQRPAGDALE 209
>gi|282899475|ref|ZP_06307441.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281195632|gb|EFA70563.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 245
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 107/245 (43%), Gaps = 50/245 (20%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
A + A++DG+TR++VE+ PEL M + F + R+KV +
Sbjct: 21 AVRSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESRLKVFFADAG 70
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 188
DWGD K + G ++E + +D++F+ +AP +
Sbjct: 71 AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
+ ++ + G RP +++NPRL+D S + R+ R + ++ E+CY+
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED--SSVVGIGYAARETRRRFISTIESCYYL 169
Query: 249 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
R PI AL +YP +E++ D GE Y ++ +KPS D+I+
Sbjct: 170 R---------PIDEQSALMRSYPGNWEIWLETD----GE-YQKIAELPQKPSGDEIDSIL 215
Query: 307 L-GKP 310
+ G+P
Sbjct: 216 IKGQP 220
>gi|434398468|ref|YP_007132472.1| protein of unknown function DUF1995-containing protein [Stanieria
cyanosphaera PCC 7437]
gi|428269565|gb|AFZ35506.1| protein of unknown function DUF1995-containing protein [Stanieria
cyanosphaera PCC 7437]
Length = 247
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 107/241 (44%), Gaps = 55/241 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT A+ DG R++VE+ IPE+ + V L A FA+ G +KV S
Sbjct: 17 EATLAALADGYNRVQVELVIPEITLQAQV--------LALEFATIFAEYGSGLKVMFPDS 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 182
G ALA DWG D G++ T VE +V EQD +FI
Sbjct: 69 -GAAALAK--------------RDWGEVPFQVTDLGSRYTAVE------TQVSEQDQIFI 107
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
+VAP ++ ++ + + A RPV+L+ P+L+D+ GI T R R + S
Sbjct: 108 VVAP----AAIEVEKVEKLCNVAEDRPVVLLIPQLEDI-AFVGIGYT-ARQMRERFIKSL 161
Query: 243 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 302
+CY+ R L A A+ +P+ ++++ + E+Y + +P +D+
Sbjct: 162 YSCYYLRPLEEA--------AILRCHPSSWQIWLEKE-----EEYQLAMELAHQPMGEDL 208
Query: 303 N 303
Sbjct: 209 E 209
>gi|425442993|ref|ZP_18823225.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715818|emb|CCH99873.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 247
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 113/255 (44%), Gaps = 48/255 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL FA
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFAS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + ++ E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDA 305
K SS+ ND+
Sbjct: 208 ENLILKASSNTPNDS 222
>gi|166367819|ref|YP_001660092.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|425464571|ref|ZP_18843881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|166090192|dbj|BAG04900.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|389833386|emb|CCI22146.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 247
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 113/255 (44%), Gaps = 48/255 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGDRRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + ++ E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDA 305
K SS+ ND+
Sbjct: 208 ENLILKASSNTPNDS 222
>gi|186682862|ref|YP_001866058.1| hypothetical protein Npun_R2561 [Nostoc punctiforme PCC 73102]
gi|186465314|gb|ACC81115.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 244
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/261 (25%), Positives = 110/261 (42%), Gaps = 53/261 (20%)
Query: 59 EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N D + Q A Q A+ DG TR++VE PEL M + F
Sbjct: 3 ELPNTLEDAIAQSRTAVQAALADGCTRIQVEFLFPELK----------FMPVAEQFLPLF 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 173
+ R+K+ + R+ +W D + + G + +
Sbjct: 53 TEYDSRLKIFFADAGAAAL----------ARR-----NWTDAPFQILDIGTGRTASLQTK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 233
+ +D++F+ +AP + + ++ + G RPV+ +NPRL+D G+ GI R
Sbjct: 98 IQPEDEIFLFIAPTSVE----VPQVEKLCQEIGDRPVVFLNPRLED-SGTVGIGYA-ARQ 151
Query: 234 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
RL + + E+CY+ R PI AL YP +E++ D GE Y ++
Sbjct: 152 TRLRFTNTIESCYYLR---------PIDEQSALSRCYPGQWEVWLETD----GE-YQRIA 197
Query: 292 TFTEKPSSDDINDAFL-GKPR 311
KPS DD++ L G+P+
Sbjct: 198 ELPTKPSGDDLDQILLKGQPQ 218
>gi|422304988|ref|ZP_16392325.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389789763|emb|CCI14274.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 247
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 116/255 (45%), Gaps = 50/255 (19%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL FA
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFA-- 50
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 171
S+ +G +G+ + T +L DWG+ T ++G +G+
Sbjct: 51 ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPIE 95
Query: 172 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 231
N++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S +
Sbjct: 96 NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149
Query: 232 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 289
R R + ++ E+CY+FR L A YP + + + +GYEL GE
Sbjct: 150 RQLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA--- 206
Query: 290 LSTFTEKPSSDDIND 304
L K SS++ D
Sbjct: 207 LENLILKASSNNPKD 221
>gi|425435402|ref|ZP_18815857.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389680066|emb|CCH91215.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 247
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 67/255 (26%), Positives = 114/255 (44%), Gaps = 50/255 (19%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFT-- 50
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 171
S+ +G +G+ + T +L DWG+ T ++G +G+
Sbjct: 51 ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPVE 95
Query: 172 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 231
N++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S +
Sbjct: 96 NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149
Query: 232 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 289
R R + + E+CY+FR L A YP M + + +GYEL GE
Sbjct: 150 RQLRERFLSILESCYYFRPLESAIVYRSYPSMWQVWLEKEDGYELISEQSTKPMGEA--- 206
Query: 290 LSTFTEKPSSDDIND 304
L K SS+ ND
Sbjct: 207 LENLILKASSNTPND 221
>gi|427418343|ref|ZP_18908526.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
gi|425761056|gb|EKV01909.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
Length = 237
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 121/260 (46%), Gaps = 54/260 (20%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV 110
++++P S E + + +T+ A+ DG RL+VE+ IPEL + R TL
Sbjct: 1 MSDLPSSLE---ETIEQAIASTKAAIEDGIIRLQVEMVIPELKQQPIAERFLTL------ 51
Query: 111 IALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGS 168
F D G + KV + G ALA R+ DW D+ +G I
Sbjct: 52 ----FQDLGLQFKVYFPDA-GAAALA---------RR-----DWDNPDFTIRG----INE 88
Query: 169 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 228
I A + D+ F++V P +AV + D++ + + A RPV+L+NP+L+D+ S +
Sbjct: 89 IQAA-ITPDDEAFLIVNP-SAVE---VQDVEKLCNEAQDRPVVLLNPQLEDV--SIVGIG 141
Query: 229 TMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYV 288
R R + ++ +CY++R + + A+ +P G+++++ V G Y
Sbjct: 142 YAARQLRERFLSTLTSCYYYRPMTDS--------AVLRRHPQGWQVWQDV-----GNDYE 188
Query: 289 ILSTFTEKPSSDDINDAFLG 308
+ S E+PSS+ + G
Sbjct: 189 LKSELPERPSSEALEKILYG 208
>gi|282896506|ref|ZP_06304526.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198612|gb|EFA73493.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 245
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 106/245 (43%), Gaps = 50/245 (20%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
A + A++DG+TR++VE+ PEL M + F + +KV +
Sbjct: 21 AAKSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESGLKVFFADAG 70
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 188
DWGD K + G ++E + +D++F+ +AP +
Sbjct: 71 AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
+ ++ + G RP +++NPRL+D S + R+ R + ++ E+CY+
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED--SSVVGIGYAARETRRRFISTIESCYYL 169
Query: 249 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
R PI AL +YP +E++ D GE Y ++ +KPS D+I+
Sbjct: 170 R---------PIDEQSALMRSYPGNWEIWLETD----GE-YRKIAELPQKPSGDEIDSIL 215
Query: 307 L-GKP 310
+ G+P
Sbjct: 216 IKGQP 220
>gi|425449243|ref|ZP_18829085.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|389764162|emb|CCI09454.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 247
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 114/255 (44%), Gaps = 48/255 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D++F++V P + I+ ++ + + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEVFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDA 305
K SS++ +D+
Sbjct: 208 ENLILKASSNNPDDS 222
>gi|425446838|ref|ZP_18826837.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389732775|emb|CCI03345.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 247
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 68/255 (26%), Positives = 112/255 (43%), Gaps = 48/255 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G RL+VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRLQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDA 305
K SS++ D+
Sbjct: 208 ENLILKASSNNPKDS 222
>gi|425459538|ref|ZP_18839024.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389822715|emb|CCI29585.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 247
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 111/254 (43%), Gaps = 48/254 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDIND 304
K SS+ ND
Sbjct: 208 ENLILKASSNTPND 221
>gi|443475522|ref|ZP_21065469.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena biceps PCC 7429]
gi|443019641|gb|ELS33702.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena biceps PCC 7429]
Length = 236
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 60/248 (24%), Positives = 105/248 (42%), Gaps = 55/248 (22%)
Query: 66 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 125
+ + AT +A+ G TR+ ++ PEL M + A SF
Sbjct: 19 EQAIAATHQAIASGATRVLADLRFPELKA----------MPIAYEFAASFNQQYGEAWQA 68
Query: 126 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 181
+ G ALA RK W D + + G NE + E+D F
Sbjct: 69 IFSDAGAAALA--------KRK------WVD-------LNVSVRGVNEGRRAIREEDRAF 107
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+L+ P + ++ ++ + AG RP +++NPRL++ G+ RD+ + ++
Sbjct: 108 LLIEP----TSIEVEQVEKLVQLAGDRPFVMLNPRLENSEVGLGLAARQMRDR---FLST 160
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FE Y+ + L +GAL YP ++++ + + G +K L+ ++PSSDD
Sbjct: 161 FETAYYIKPLE--------LGALWRCYPQTWQVWNQTE--EGMQK---LAEVEQRPSSDD 207
Query: 302 INDAFLGK 309
I+ F K
Sbjct: 208 IDRLFRQK 215
>gi|16329818|ref|NP_440546.1| hypothetical protein slr1702 [Synechocystis sp. PCC 6803]
gi|383321560|ref|YP_005382413.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324730|ref|YP_005385583.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490614|ref|YP_005408290.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435880|ref|YP_005650604.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|451813978|ref|YP_007450430.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
gi|1652303|dbj|BAA17226.1| slr1702 [Synechocystis sp. PCC 6803]
gi|339272912|dbj|BAK49399.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|359270879|dbj|BAL28398.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274050|dbj|BAL31568.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277220|dbj|BAL34737.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957706|dbj|BAM50946.1| hypothetical protein BEST7613_2015 [Synechocystis sp. PCC 6803]
gi|451779947|gb|AGF50916.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
Length = 251
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 119/244 (48%), Gaps = 55/244 (22%)
Query: 67 DVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCV 126
+ + ATQRA++DG R++VE+ +PE+ + +E ++ FAD G +++
Sbjct: 15 EAIAATQRALDDGYRRIQVELAVPEIALQAQ----AIALEFSQL----FADQGLGLRIIF 66
Query: 127 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------NEVDEQDDM 180
+ G ALA DWG+ +IG +G+ ++ E D +
Sbjct: 67 PDT-GAAALAK--------------RDWGEVP-----FQIGDLGSRYTPIGRKIAESDQV 106
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F++V+P +AV + ++ + + AG RPV+L+ P+L+D+ S + R R + +
Sbjct: 107 FLVVSP-SAVE---VQSVEKLCELAGDRPVVLLIPQLEDV--SIVGIGYAARQLRQRFLS 160
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 299
+ + Y+FR P+ GA+ + ++P+ +++ GE+Y +L+ EKP+
Sbjct: 161 TLFSAYYFR---------PLDGAVVLRSHPSRWQVLL-----EKGEEYELLTELPEKPAG 206
Query: 300 DDIN 303
+ +
Sbjct: 207 EALE 210
>gi|434407903|ref|YP_007150788.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
gi|428262158|gb|AFZ28108.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
Length = 244
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 113/240 (47%), Gaps = 53/240 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+ DG R++++ PEL ++ + FA+ R+KV
Sbjct: 17 EATQAALADGCKRIQIDFLFPELK----------ILPVAEQFLPVFAEYDSRLKVFF-AD 65
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA----NEVDEQDDMFILVA 185
G ALA R+ DW D A + V+IG+ +++ +D++F+ +A
Sbjct: 66 AGGAALA---------RR-----DWAD--AAFSIVDIGTGRTASLQSKIQPEDEIFLFIA 109
Query: 186 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 245
P + + ++ + + G RPV+ +NPRL+D G GI T R+ R + ++ E+C
Sbjct: 110 PTSVE----VPQLEKLCEIIGDRPVVFLNPRLED-SGVVGIGYT-ARETRRRFISTIESC 163
Query: 246 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y+ R P+ A+ +YP +E++ + ++Y ++ ++PS D+++
Sbjct: 164 YYLR---------PVDDETAVFRSYPGQWEVWVETN-----DEYQRIAELPKRPSGDELD 209
>gi|440753103|ref|ZP_20932306.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
gi|440177596|gb|ELP56869.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
Length = 247
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 113/255 (44%), Gaps = 48/255 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDA 305
K SS++ +D+
Sbjct: 208 ENLILKASSNNPDDS 222
>gi|428775196|ref|YP_007166983.1| hypothetical protein PCC7418_0540 [Halothece sp. PCC 7418]
gi|428689475|gb|AFZ42769.1| protein of unknown function DUF1995 [Halothece sp. PCC 7418]
Length = 253
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 109/243 (44%), Gaps = 57/243 (23%)
Query: 70 QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 127
Q+T A+ G TRL+VE+ IPE L E L+E D G +KV
Sbjct: 17 QSTLTALEAGLTRLQVELRIPEIALQGEKIAKEFADLLE---------DDYGSGLKVLFP 67
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDM 180
+ G ALA R+ +W D G++ T +E +V E D M
Sbjct: 68 DT-GAAALA---------RR-----NWADVSFQVNDLGSRNTPIE------KKVSEDDQM 106
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F+LV+P +AV + ++ + + AG RPVIL+ P+L+D+ + GI R R + +
Sbjct: 107 FLLVSP-SAVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLS 160
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
+ E+CY+ + L A AL YP+ ++L+ + G Y EKP D
Sbjct: 161 TLESCYYLQPLEEA--------ALLKRYPSSWQLW----IEKGENNYEFFCEEPEKPVGD 208
Query: 301 DIN 303
++
Sbjct: 209 TLD 211
>gi|425472740|ref|ZP_18851581.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881081|emb|CCI38316.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 247
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 67/254 (26%), Positives = 111/254 (43%), Gaps = 48/254 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D++F++V P + I+ ++ + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEVFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDIND 304
K SS+ ND
Sbjct: 208 ENLILKASSNTPND 221
>gi|428317816|ref|YP_007115698.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
gi|428241496|gb|AFZ07282.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
Length = 248
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 108/240 (45%), Gaps = 55/240 (22%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
AT A++DG+T L+VE+ PE+ + + L E F + VKV +
Sbjct: 18 ATAAALSDGKTLLQVELVFPEIALQAQSITLQFLPE--------FEEIYPGVKVFFPDT- 68
Query: 131 GEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFIL 183
G ALA R+ DWG D G+ T VE +++ +D +F+L
Sbjct: 69 GAAALA---------RR-----DWGETPFKVTDLGSSRTPVE------DKIAPEDQLFLL 108
Query: 184 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 243
+ P + ++ + AA RPVIL+NPRL+D+ + GI GR R + E
Sbjct: 109 INP----AAVEVAQVEKIYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLNKIE 162
Query: 244 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+CY+ R L A AL YP ++++ + ++Y ++S +KP DD+
Sbjct: 163 SCYYIRPLDTA--------ALFRCYPQPWQVWLETN-----DEYELISETAQKPVGDDLE 209
>gi|300865956|ref|ZP_07110692.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336022|emb|CBN55850.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 248
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 112/240 (46%), Gaps = 52/240 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 126
+A + A+ND +TRL++E+ PE+ + I EL + + F D G
Sbjct: 17 EAVKAALNDAQTRLQIELVFPEIALQAQSITQQFITDFEELANGLKVFFPDTGS------ 70
Query: 127 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG---ANEVDEQDDMFIL 183
ALA R+ DWG K T ++GS A +++ +D +F+L
Sbjct: 71 ------AALA---------RR-----DWGTVPFKIT--DLGSSRSPVAEKIEPEDQIFLL 108
Query: 184 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 243
V NA + + ++ + +AA +RPVIL+NPRL+D + GI GR R + + +
Sbjct: 109 V---NA-SSIEVAQVEQLCNAADSRPVILLNPRLEDA-ATIGIGYA-GRQLRDRFLNTLQ 162
Query: 244 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+CY+ R L A AL YP ++++ + GE Y ++S +KP D++
Sbjct: 163 SCYYIRPLPTA--------ALFRCYPQSWQVWLE---ETEGE-YKLISETAQKPVGDELE 210
>gi|443658098|ref|ZP_21132025.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
gi|159027702|emb|CAO89569.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333038|gb|ELS47616.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
Length = 247
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 113/255 (44%), Gaps = 48/255 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ ++D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKDEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDA 305
K SS++ +D+
Sbjct: 208 ENLILKASSNNPDDS 222
>gi|254422515|ref|ZP_05036233.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
gi|196190004|gb|EDX84968.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
Length = 248
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/237 (27%), Positives = 115/237 (48%), Gaps = 58/237 (24%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD--DGKRVKVCVQG 128
ATQ A++ G + L++E+ IPEL + IA F + + ++ KV
Sbjct: 17 ATQAALDAGISLLQIEMAIPELKHQP--------------IAQQFVELFNDRQFKVLFPD 62
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAP 186
+ G ALA R+ DW D+ +G +G + A VD +DD++++V P
Sbjct: 63 A-GAAALA---------RR-----DWNNPDFVIRG----LGEL-AQTVDPEDDLYLIVNP 102
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+AV +D ++A+ + A +PV+L+NP+L+D+ GI R R + + E CY
Sbjct: 103 -SAVE---VDKVEALCNEALDQPVVLLNPQLEDV-AVVGIGYA-ARQLRDRFLSQIETCY 156
Query: 247 FFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 302
+ R PI G + YP +++++ + G ++Y + + +PSS+DI
Sbjct: 157 YVR---------PIDQGVVYRAYPGPWQIWREI----GPDEYEHVQDLSNRPSSEDI 200
>gi|428225033|ref|YP_007109130.1| hypothetical protein GEI7407_1587 [Geitlerinema sp. PCC 7407]
gi|427984934|gb|AFY66078.1| protein of unknown function DUF1995 [Geitlerinema sp. PCC 7407]
Length = 244
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 120/243 (49%), Gaps = 53/243 (21%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF-ADDGKRVKVCVQGS 129
ATQ A++DG +RL+VE+ PE++ + + + + A F A G R K+ +
Sbjct: 18 ATQAALDDGHSRLQVELVFPEIDLQA--------ISIAELFAADFEARYGDRFKILFPDA 69
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGAN-EVDEQDDMFILVAP 186
GA A LA R WG+ Y +G I + A + DEQ F+++AP
Sbjct: 70 ---GAAA-----LAKHR-------WGEKPYEIRG----ISELKAQIQPDEQ--AFLMIAP 108
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+ + ++ + A RPV+++NPRL+D+ + GI GR R + ++ +CY
Sbjct: 109 ----SSVEVGPVEKFCEEASDRPVVMVNPRLEDV-ATIGIGYA-GRQLRERFLSTLLSCY 162
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
+ R + G GALR +YP +E++ ++ SG EK ++ ++KP D + D
Sbjct: 163 YLR--PFEG------GALRRSYPGPWEVW--LETESGYEK---VAEESQKPVGDAL-DQI 208
Query: 307 LGK 309
+G+
Sbjct: 209 IGR 211
>gi|411117915|ref|ZP_11390296.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711639|gb|EKQ69145.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
Length = 241
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 106/235 (45%), Gaps = 53/235 (22%)
Query: 79 GRTRLKVEINIPELNPEMDVYR-IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
G +RL+VEI IPEL P Y+ + E + + F D G ++ AG
Sbjct: 26 GYSRLQVEIAIPELKPMQPAYQYLSVFEEYGSHLKVFFTDAGA-------AALARREWAG 78
Query: 138 MP-----LQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGN 192
P + + GTR+ T VE V +D +F+ VAP +AV
Sbjct: 79 SPYNIQSIDIQGTRQT-------------TPVE------QLVLSEDRVFVFVAP-SAVEV 118
Query: 193 CIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLY 252
+++ + +AAG RPVIL NPR++D+ S + R R + + E CY+ +
Sbjct: 119 GVVEQ---IANAAGDRPVILFNPRMEDV--SVVGIGYAARKLRERFLNTIEPCYYLK--- 170
Query: 253 YAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
P+ G AL YP+ ++++ E Y +++ T+KP+ + +++ F
Sbjct: 171 ------PLEGSALIRCYPSLWQVWAETS-----EGYTLIAEETQKPTLERLDEIF 214
>gi|427715914|ref|YP_007063908.1| hypothetical protein Cal7507_0584 [Calothrix sp. PCC 7507]
gi|427348350|gb|AFY31074.1| protein of unknown function DUF1995-containing protein [Calothrix
sp. PCC 7507]
Length = 243
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 108/248 (43%), Gaps = 55/248 (22%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
ATQ A+ DG TRL+VE PEL P M + AD R+KV
Sbjct: 18 ATQAALADGYTRLQVEFLFPELKP----------MPVAEQYLPLLADYESRLKVFF-ADT 66
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE----QDDMFILVAP 186
G ALA R+ D+ T +IG+ A V + ++++F+ +AP
Sbjct: 67 GAAALA---------RR--------DWDVPFTISDIGTGRATSVSDKIQPEEEIFLFIAP 109
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+ I ++ + G RP IL+NPRL+D GI R R + + E CY
Sbjct: 110 ----SSVEISQLEKLFAEIGDRPAILLNPRLEDA-AIVGIGYA-ARQIRERFINTIETCY 163
Query: 247 FFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
+ R P+ A+ YP +E++ V + GE Y ++ ++PS D+I+
Sbjct: 164 YLR---------PVDDQTAVFRCYPGLWEVW----VETNGE-YQKIAELPKRPSGDEIDL 209
Query: 305 AFL-GKPR 311
L G+P+
Sbjct: 210 ILLKGQPQ 217
>gi|254410487|ref|ZP_05024266.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196182693|gb|EDX77678.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 245
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 120/262 (45%), Gaps = 39/262 (14%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+NDG TRL+VE+ PE+ + I + F G +KV +
Sbjct: 17 EATQAALNDGYTRLQVELVFPEIA--LQAQSIAWQFTPI------FEQYGSGLKVLFPDA 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG---SIGANEVDEQDDMFILVAP 186
G ALA R+ DWG+ K T +IG S + + +D+ F++V+P
Sbjct: 69 -GAAALA---------RR-----DWGETPFKVT--DIGTSRSPVESRIQPEDEAFLIVSP 111
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+AV + ++ + + AG RP +L+ P+L+DL GI R R + ++ +CY
Sbjct: 112 -SAVE---VSQVEKLCNLAGDRPCVLLTPQLEDL-KVVGIGYA-ARQLRERFLSTLTSCY 165
Query: 247 FFRLLYYAGTQ--YPIMGALRMTYPNGYELY-KRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A YP + + + N Y+L + P G +I++ T S+D
Sbjct: 166 YVQPLDGAALLRVYPGLWQIWLEKENAYQLIAEEPQKPVGDALELIIAKVTGSGDSEDTP 225
Query: 304 DAFLGKPRYVDYMDRILDYMDR 325
A KP ++ + LD + R
Sbjct: 226 AA--KKPGFLSTLQSFLDALSR 245
>gi|220907967|ref|YP_002483278.1| hypothetical protein Cyan7425_2561 [Cyanothece sp. PCC 7425]
gi|219864578|gb|ACL44917.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length = 233
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 102/233 (43%), Gaps = 54/233 (23%)
Query: 79 GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 138
G +RL+V++ IPEL P M + + F+D G +KV +
Sbjct: 26 GYSRLQVDLLIPELKP----------MPVAQQYLSLFSDYGAGLKVFFADAGAAALAR-- 73
Query: 139 PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCI 194
DWGD V G NE + +D F+LVAP
Sbjct: 74 -------------RDWGD-------VPFTIRGVNELLNPIQPEDRAFVLVAPTPVE---- 109
Query: 195 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 254
+ ++AM + AG RP IL+N +L+D+ + GI GR R + A+FE CY+ R L +
Sbjct: 110 VMQVEAMANQAGDRPFILLNAKLEDI-ATIGIGYA-GRQLRQRFLATFEPCYYLRPLDW- 166
Query: 255 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
GA+ YP+ ++++ + ++Y +++ E+P+ D ++ L
Sbjct: 167 -------GAVLRIYPSPWQVW----LEQAEDQYQLIAEEAERPAGDRLDQILL 208
>gi|434395506|ref|YP_007130453.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
gi|428267347|gb|AFZ33293.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
Length = 243
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 116/264 (43%), Gaps = 61/264 (23%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
DG + L++EI IPELNP M + + G ++KV + G ALA
Sbjct: 25 DGYSLLQIEILIPELNP----------MPVAEQFLPALDSFGNQLKVFFPDA-GAAALA- 72
Query: 138 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ----DDMFILVAPQNAVGNC 193
R+ DWG K ++IG+ G VD+Q D+ F+LVAP +AV
Sbjct: 73 --------RR-----DWGSVPFK--ILDIGT-GRVPVDQQVLPEDEAFLLVAP-SAVE-- 113
Query: 194 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ ++ + A G RP IL+NPRL+D+ S + GR R + + E+CY R L
Sbjct: 114 -VAQVEKLHQAVGERPFILLNPRLEDV--SIVGIGYAGRQLRARFLNTIESCYHLRPLDG 170
Query: 254 AGTQYPIMGALRMTYPNGYELYK------------RVDVPSGGEKYVILSTFTEKPSSDD 301
A A+ YP+ +++++ + P G E +ILS + + SD
Sbjct: 171 A--------AVFRCYPSPWQVWQENKDGEYQLIAEQPKKPMGDEVDLILSGNSPQQVSDS 222
Query: 302 INDAFLGKPRYVDYMDRILDYMDR 325
+ KP M R L + R
Sbjct: 223 VQAK---KPGIFASMQRFLRTLSR 243
>gi|334118025|ref|ZP_08492115.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
gi|333460010|gb|EGK88620.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
Length = 248
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 109/239 (45%), Gaps = 53/239 (22%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV---IALSFADDGKRVKVCVQ 127
AT A++DG+T L+VE+ PE+ + L EL + + + F D
Sbjct: 18 ATAAALSDGKTLLQVELVFPEIALQAQSITEQFLPELEEIYPGVKVFFPD---------- 67
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILV 184
G ALA R+ DWG+ K T ++GS + +++ +D +F+L+
Sbjct: 68 --AGAAALA---------RR-----DWGETPFKVT--DLGSSRSPVEDKIAPEDQLFLLI 109
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
P + ++ + AA RPVIL+NPRL+D+ + GI GR R + + E+
Sbjct: 110 NP----AAVEVAQVERLYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLSKIES 163
Query: 245 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
CY+ R L A AL YP ++++ + +Y ++S +KP DD+
Sbjct: 164 CYYVRPLDAA--------ALFRCYPQSWQVWLERN-----NQYELISETAQKPVGDDLE 209
>gi|119510288|ref|ZP_01629424.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
gi|119465032|gb|EAW45933.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
Length = 244
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 112/246 (45%), Gaps = 50/246 (20%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
ATQ A+ DG TRL+V+ PEL LM + F + R+K+ +
Sbjct: 18 ATQAALADGYTRLQVDFLFPELK----------LMPVAEQFLSLFTEYDSRLKIFFPDA- 66
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQN 188
G ALA DW K + G + + +++ +D++F+ +AP +
Sbjct: 67 GGAALAN--------------RDWAGTPFKILDIGTGRVASIQSKIQPEDEIFLFIAPTS 112
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
+ ++ + + G RP +++NPRL+D G GI T R R + ++ E+CY+
Sbjct: 113 VE----VPQVEKLCENIGDRPFVMLNPRLED-SGVVGIGYT-ARQTRQRFISTLESCYYL 166
Query: 249 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
R P+ A+ YP +E++ ++ GE Y ++ ++P+ D+++
Sbjct: 167 R---------PVDDTTAVFRCYPGLWEVWVEIN----GE-YQKVAELPKRPTGDELDVIV 212
Query: 307 L-GKPR 311
+ G+P+
Sbjct: 213 MQGQPQ 218
>gi|443309167|ref|ZP_21038918.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
gi|442780785|gb|ELR90927.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
Length = 243
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 100/239 (41%), Gaps = 44/239 (18%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
ATQ A+ DG RL+VEI +PELNP +A F D + +
Sbjct: 18 ATQNAIADGLNRLQVEILLPELNP--------------MPVAERFLSDEEGISHNFNKIF 63
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAPQNA 189
A A A R+ DWG+ + + + E + +D+ + +AP
Sbjct: 64 FPDAGAA-----ALARR-----DWGEVPFELLDISTKRVSVEEQIQPEDEAILCIAPTAQ 113
Query: 190 VGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFR 249
+ ++ + +A G RPV+L+NPRL+D+ S + GR R + + ++CY+ R
Sbjct: 114 E----VLQIEKLCNAMGDRPVVLLNPRLEDV--SIVGIGYAGRQLRDRFLKNIQSCYYLR 167
Query: 250 LLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 308
L G L YP ++++ + Y ++ KP DD++ F G
Sbjct: 168 PLEE--------GTLFRCYPGLWQVWCETT-----DGYELIKELPNKPVGDDLDLIFAG 213
>gi|56752173|ref|YP_172874.1| hypothetical protein syc2164_c [Synechococcus elongatus PCC 6301]
gi|81300739|ref|YP_400947.1| hypothetical protein Synpcc7942_1930 [Synechococcus elongatus PCC
7942]
gi|56687132|dbj|BAD80354.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169620|gb|ABB57960.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 247
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 110/242 (45%), Gaps = 53/242 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A QRA+ G TR++V++ IPEL + + +F D G +++V
Sbjct: 17 EAAQRALAAGLTRIQVDLAIPELK----------IQPIAWQFLAAFLDRGAKLRVFFPDP 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 187
G ALA R+ DWG+ + +G E+ + V ++++FI + P
Sbjct: 67 -GAAALA---------RR-----DWGELPFSVRG-MEELKAA----VQPEEELFIFIEP- 105
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++ + G RPVIL+NPRL+D+ + GI R R + +++ Y+
Sbjct: 106 ---SSVEVQRLEQLCQEIGDRPVILLNPRLEDV-ATIGIGYA-ARQLRERFLNQWQSAYY 160
Query: 248 FRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
P+ G A+ YP +++++ D Y +L + ++P+ +D++ F
Sbjct: 161 LS---------PLEGAAIFQAYPQRWQVWQETDT-----GYELLQEYDQRPNGEDLDRLF 206
Query: 307 LG 308
G
Sbjct: 207 AG 208
>gi|425455386|ref|ZP_18835106.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389803734|emb|CCI17368.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 247
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 112/255 (43%), Gaps = 48/255 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLAIESGARRVQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------LFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDA 305
K SS++ +D+
Sbjct: 208 ENLILKASSNNPDDS 222
>gi|452823755|gb|EME30763.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 1138
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 28 HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 84
+ R+ +NQ SQ++ G+P + ++ IP+S ++ L A A G RL
Sbjct: 50 NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104
Query: 85 VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 142
++I+ + + ++ + L EL+RVI + + + + G A A
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155
Query: 143 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 202
RK DW +K +F+ GS D D I++ P +AV D++AM
Sbjct: 156 ---RK-----DWNI--SKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196
Query: 203 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ RP+ILINP+L D+ G++G + R R ++ ++FE+ YF R+ +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254
Query: 254 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
QYP ++ + N E + P Y +L TF KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305
>gi|452823754|gb|EME30762.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 1152
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 28 HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 84
+ R+ +NQ SQ++ G+P + ++ IP+S ++ L A A G RL
Sbjct: 50 NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104
Query: 85 VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 142
++I+ + + ++ + L EL+RVI + + + + G A A
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155
Query: 143 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 202
RK DW +K +F+ GS D D I++ P +AV D++AM
Sbjct: 156 ---RK-----DWNI--SKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196
Query: 203 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ RP+ILINP+L D+ G++G + R R ++ ++FE+ YF R+ +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254
Query: 254 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
QYP ++ + N E + P Y +L TF KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305
>gi|440681570|ref|YP_007156365.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
gi|428678689|gb|AFZ57455.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
Length = 244
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 101/238 (42%), Gaps = 49/238 (20%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ DG TR++V+ PEL M + FA+ R+K+ +
Sbjct: 17 EAVKSALADGMTRIQVDFLFPELK----------FMPIAEQFIPLFAEYESRLKIFFADA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQ 187
W D K + G + E + +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WADSPFKVEDIGTGRTASLETKIQPEDEIFLFIAPT 111
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++ + + GTRP IL+NPRL+D G GI R+ R + ++ E+CY+
Sbjct: 112 SVE----VPQLEKLCEIIGTRPFILLNPRLED-SGVVGIGYA-ARETRRRFISTIESCYY 165
Query: 248 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
R P+ AL YP +E++ + ++Y ++ ++PS D+I+
Sbjct: 166 LR---------PVDDESALFRCYPGDWEVWLETN-----DEYQKIAELPKRPSGDEID 209
>gi|428306245|ref|YP_007143070.1| hypothetical protein Cri9333_2705 [Crinalium epipsammum PCC 9333]
gi|428247780|gb|AFZ13560.1| protein of unknown function DUF1995-containing protein [Crinalium
epipsammum PCC 9333]
Length = 248
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 114/242 (47%), Gaps = 51/242 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ DG TRL VE+ PEL M + + G +KV +
Sbjct: 17 EATKTAIADGHTRLMVELVFPELK----------TMPVASQFIPELEEYGSHLKVFFTDA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 186
G ALA R+ DWG+ K ++GS + N+++E+D +F+L+ P
Sbjct: 67 -GAAALA---------RR-----DWGEVPFK--ISDLGSSRSSVQNKIEEEDQIFLLIEP 109
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+AV I ++ + +AAG RPVIL+ PRL+D +G + YAA
Sbjct: 110 -SAVE---IAQVEQLCNAAGDRPVILLVPRLED-------AAVVG----IGYAARQLRDR 154
Query: 247 FFRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 305
F + LY + P+ G AL ++P+ ++++ + + Y +++ T+KP + ++
Sbjct: 155 FIKTLYSSYYIRPLEGAALFRSHPSPWQVWLETN-----DDYNLIAEETQKPVGETLDQI 209
Query: 306 FL 307
L
Sbjct: 210 IL 211
>gi|427705929|ref|YP_007048306.1| hypothetical protein Nos7107_0483 [Nostoc sp. PCC 7107]
gi|427358434|gb|AFY41156.1| protein of unknown function DUF1995-containing protein [Nostoc sp.
PCC 7107]
Length = 244
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 113/240 (47%), Gaps = 53/240 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ DG TR++V++ PEL M + FA+ R+KV
Sbjct: 17 EATKAALADGYTRVQVDLLFPELKQ----------MPVAEQFLPLFAEYESRLKVFF-AD 65
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA----NEVDEQDDMFILVA 185
G ALA R+ DW D A ++IG+ A +++ +D++F+ V+
Sbjct: 66 AGGAALA---------RR-----DWVD--AAFQILDIGTGRAASIQSKIKPEDEIFLFVS 109
Query: 186 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 245
P +AV I ++ + + G RP++++NPRL+D G+ GI R R + + E+C
Sbjct: 110 P-SAVE---IPQLEKVCEIIGDRPLVMLNPRLED-SGTVGIGYA-ARQIRERFLNTIESC 163
Query: 246 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y+ R P+ A+ YP +E+ + GE + ++ +KPS DDI+
Sbjct: 164 YYLR---------PVDENTAVFRCYPGQWEVLVQ-----KGETWEKIAELPKKPSGDDID 209
>gi|428300086|ref|YP_007138392.1| hypothetical protein Cal6303_3487 [Calothrix sp. PCC 6303]
gi|428236630|gb|AFZ02420.1| protein of unknown function DUF1995-containing protein [Calothrix
sp. PCC 6303]
Length = 250
Score = 61.6 bits (148), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 110/253 (43%), Gaps = 53/253 (20%)
Query: 59 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
++ N D + QA T A+ DG TRL+VE+ PEL P M + +F
Sbjct: 3 QLPNSLEDAIFQAREATLAAIADGCTRLQVELLFPELKP----------MPVAEEFIPTF 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-- 173
A G ++KV + DW D K V G A +
Sbjct: 53 ASYGDKLKVFFADAGAAALAR---------------RDWKDVPFKIWDVGTGRASAAQPK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGIMQTMGR 232
+ +D+MF+ +AP + + +++ + G RP +++NP+L+D G+ GI R
Sbjct: 98 IQAEDEMFLFIAPTSVE----VAELERLCGEIGEQRPFVMLNPKLED-SGTVGIGYA-AR 151
Query: 233 DKRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
+ R+ + ++ E+CY+ R P+ AL YP +E++ V GE Y +
Sbjct: 152 NIRMRFISTIESCYYLR---------PVDDETALFRCYPGMWEVW----VDKDGE-YKRI 197
Query: 291 STFTEKPSSDDIN 303
+ KPS D+++
Sbjct: 198 AELANKPSGDELD 210
>gi|443320726|ref|ZP_21049809.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
gi|442789566|gb|ELR99216.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
Length = 241
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 109/236 (46%), Gaps = 59/236 (25%)
Query: 71 ATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 128
A A+NDG RL+VE+ IPE L + TL F + G +KV +
Sbjct: 17 AVTNALNDGYRRLQVELVIPEIALQAQALALEFTTL----------FTEYGSGLKVFLPD 66
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 182
+ G ALA R+ DWG+ T + +G N V++ D +F+
Sbjct: 67 T-GAAALA---------RR-----DWGE-----TPFAVTDLGTNRSPIETRVNDTDQIFL 106
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
+V+P +AV + ++ + + AG RPV+L+ P+L+D+ S + R R + ++
Sbjct: 107 VVSP-SAVE---VTQVEKLCNLAGDRPVVLLIPQLEDV--SIVGIGLAARQLRERFLSTL 160
Query: 243 ENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
E+CY+F+ P+ GA + YP+ +++++ G Y +L EKP
Sbjct: 161 ESCYYFK---------PLEGAAVLKNYPSPWQVWRE-----SGNDYQLLCEQGEKP 202
>gi|428208370|ref|YP_007092723.1| hypothetical protein Chro_3395 [Chroococcidiopsis thermalis PCC
7203]
gi|428010291|gb|AFY88854.1| protein of unknown function DUF1995 [Chroococcidiopsis thermalis
PCC 7203]
Length = 272
Score = 61.2 bits (147), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 111/234 (47%), Gaps = 51/234 (21%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
DG TRL+VE+ IPEL P M + F G ++KV + G ALA
Sbjct: 53 DGYTRLQVELAIPELKP----------MSVAEQFLAEFTQFGSQLKVFFPDA-GAAALA- 100
Query: 138 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNCII 195
R+ DWG+ K V+IG+ E ++ +D+ + ++P +AV +
Sbjct: 101 --------RR-----DWGEVPFK--IVDIGTRSPLEEKIEPEDEALLFISP-SAVE---V 141
Query: 196 DDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAG 255
+ ++ + + A T P +++NPRL+D+ GI GR R + + E+CY+ R
Sbjct: 142 ERVEKLCELA-TCPTVMLNPRLEDV-AIVGIGYA-GRQLRTRFLNNIESCYYLR------ 192
Query: 256 TQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 308
PI ++ +YP +++++ ++ E++ +++ KP D+I+ G
Sbjct: 193 ---PIENISVFRSYPGEWQIWREIE-----EEFQLITEQPTKPMGDEIDAILYG 238
>gi|390438390|ref|ZP_10226864.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389838196|emb|CCI30988.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 247
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 104/235 (44%), Gaps = 45/235 (19%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G ++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARCVQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGE 285
R + ++ E+CY+FR L A YP + + + +GYEL GE
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGE 205
>gi|307150883|ref|YP_003886267.1| hypothetical protein Cyan7822_0982 [Cyanothece sp. PCC 7822]
gi|306981111|gb|ADN12992.1| Domain of unknown function DUF1995 [Cyanothece sp. PCC 7822]
Length = 250
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 120/269 (44%), Gaps = 48/269 (17%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+ G RL+VE+ IPE+ E G +E + F G +KV +
Sbjct: 17 EATQIALEAGLGRLQVELVIPEIALEAQ----GLALEFTSL----FEQYGSGLKVLFPDT 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 182
G ALA R+ DWG D G++ VE+ +V +D+ F+
Sbjct: 69 -GAAALA---------RR-----DWGETPFKISDLGSRFIPVEM------KVTPEDEAFL 107
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
+V P + I + + + A RPV+L+ P+L+D+ GI T R R + ++
Sbjct: 108 VVCP----ASIEIQSVAKLCEIAEQRPVLLLIPQLEDV-SIVGIGYT-ARQLRDNFLSTL 161
Query: 243 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELY-KRVDVPSGGEKYVILSTFTEK--P 297
E+CY+FR L A YP + + + GYEL + P G +I++ T + P
Sbjct: 162 ESCYYFRPLEGAAVVRSYPGLWQVWLEKEQGYELIAEESQKPMGEALELIIARATAEIDP 221
Query: 298 SSDDI-NDAFLGKPRYVDYMDRILDYMDR 325
+S + N A K + M R L + +
Sbjct: 222 ASPQMANVAQPKKSGLLANMQRFLKALSQ 250
>gi|443327973|ref|ZP_21056578.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
gi|442792382|gb|ELS01864.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
Length = 248
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 118/281 (41%), Gaps = 51/281 (18%)
Query: 62 NYFYDDVLQATQRA-------VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALS 114
N F + +A ++A + DG TRL+VE+ IPE+ + + +AL
Sbjct: 2 NEFPQSIAEAVEQAKKTVIDAIADGYTRLQVELVIPEIPLQ------------AQSLALE 49
Query: 115 FADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIG 167
FA+ S G G P A +L DWG D G + T +E
Sbjct: 50 FAN-------LFLESHGSGVKVMFPDTGAA---MLAKRDWGEIAFQVTDMGGRFTSIE-- 97
Query: 168 SIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIM 227
++ E D++FI+V P + ++ ++ + + G RPVIL+ P+L+D+ GI
Sbjct: 98 ----QQIAETDEIFIMVCPSSVE----VERVETLCNLVGDRPVILLIPQLEDV-AVVGI- 147
Query: 228 QTMGRDKRLEYAASFENCYFFRLLYYAGTQ--YPIMGALRMTYPNGYELYKRVDV-PSGG 284
R R + ++ +CY+ R L A +P + + GYEL + P G
Sbjct: 148 GLAARRLRDRFISTLYSCYYIRPLEGASILRIHPSSWQIWLEKEEGYELAMELSSKPMGE 207
Query: 285 EKYVILSTFTEKPSSDDINDAFLGKPRYVDYMDRILDYMDR 325
E +L T N++ + KP + + + L + +
Sbjct: 208 ELDRLLIQLTTPEGEAGKNNSTIKKPGLLGNLQKFLKALSQ 248
>gi|86604746|ref|YP_473509.1| hypothetical protein CYA_0013 [Synechococcus sp. JA-3-3Ab]
gi|86553288|gb|ABC98246.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 246
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/266 (22%), Positives = 112/266 (42%), Gaps = 54/266 (20%)
Query: 50 RLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVR 109
+L PH + +QAT+ A+ G RL +EI P++ PE+ +ELV
Sbjct: 6 QLTQFPHLPGDLAQACEQAIQATRLALQAGYRRLLIEILAPDIKPEVLAR---PFLELVP 62
Query: 110 VIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSI 169
AL D G ALA ++G+ +++ S+
Sbjct: 63 PPALVLFSDA-----------GGAALAQR-----------------EWGSLPDGIQLQSL 94
Query: 170 GA--NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-------TRPVILINPRLKDL 220
A EQ +F++ A + + + + D +G +PV+L+NP+L+D
Sbjct: 95 TARTQPTPEQSVLFVMPAVYSLDQVERVCQLVSGGDPSGLAAKGRDPKPVVLLNPQLQD- 153
Query: 221 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 280
++ + GR R + ++FE Y+ R L + GAL YP+ + ++++ +
Sbjct: 154 -AAAVGVGLAGRRLRQRFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEEP 204
Query: 281 PSGGEKYVILSTFTEKPSSDDINDAF 306
Y +L TF +PS +++ + F
Sbjct: 205 ----GLYSVLKTFRARPSGEEVAELF 226
>gi|113475888|ref|YP_721949.1| hypothetical protein Tery_2247 [Trichodesmium erythraeum IMS101]
gi|110166936|gb|ABG51476.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 253
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/247 (25%), Positives = 118/247 (47%), Gaps = 54/247 (21%)
Query: 69 LQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 128
++AT+ A+ DG TR+++EI +P++ + L + + + ++KV
Sbjct: 16 MEATKAALQDGYTRVQIEIVVPDIELQA--------QSLAKQFIPALLETSTKLKVFFPD 67
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 182
S G ALA R+ DW D TF +I +G + +V+ +D F+
Sbjct: 68 S-GAAALA---------RR-----DWQD----ATF-KIEDLGTSRSPVDKKVEPEDQCFL 107
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
L+AP +A+ + + +++ AG RPVI++ P+L+D+ S + R R + +
Sbjct: 108 LIAP-SAIE---VAQTEKLSNLAGDRPVIMLIPKLEDV--SIVGIGYAARQLRERFIKTI 161
Query: 243 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 302
E+CY+ R L A AL YP+ ++++ + G+ Y +++ EKP D++
Sbjct: 162 ESCYYIRSLGGA--------ALYRCYPSPWQVW----LEENGQ-YKLIAEQPEKPVGDEV 208
Query: 303 NDAFLGK 309
D L K
Sbjct: 209 -DMILAK 214
>gi|414076889|ref|YP_006996207.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
gi|413970305|gb|AFW94394.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
Length = 244
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 111/260 (42%), Gaps = 52/260 (20%)
Query: 59 EIRNYFYDDVLQATQ---RAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ D + Q+T+ A+ DG TR++VE+ PEL M + F
Sbjct: 3 ELPKTLEDAIAQSTEAVKSALADGMTRIQVELLFPELK----------FMTVAEQFLPLF 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-- 173
+ R+KV + DW D + + + G + E
Sbjct: 53 TEYESRLKVFFADAGAAALAR---------------RDWTDAQFQISDIGTGRAASLEAK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 233
+ +D++F+ +AP + + ++ + + G RPVI++ PRL+D GI T R+
Sbjct: 98 IQPEDEIFLFIAPTSVE----VPQLEKLCELIGERPVIMLTPRLED-SSVVGIGYT-ARE 151
Query: 234 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
R + ++ E+CY+ R P+ AL YP +E++ + + GE Y ++
Sbjct: 152 TRRRFISTIESCYYIR---------PVDDESALFRCYPGLWEVW----LETAGE-YQKVA 197
Query: 292 TFTEKPSSDDINDAFLGKPR 311
+KPS D+++ +G R
Sbjct: 198 ELPKKPSGDELDAILMGGQR 217
>gi|119484707|ref|ZP_01619189.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
gi|119457525|gb|EAW38649.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
Length = 249
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 111/247 (44%), Gaps = 56/247 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 126
QATQ A++DG ++VE+ PE+ + I + E V+ + F D
Sbjct: 17 QATQAALDDGYKLVQVELVFPEIELQAQAIAQQFIPAIEESGTVLKVFFPD--------- 67
Query: 127 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFIL 183
G ALA R+ DWG+ K ++GS + + V + D F++
Sbjct: 68 ---AGAAALA---------RR-----DWGEIPFK--ISDLGSSRSPIDSRVKDDDGRFLV 108
Query: 184 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 243
V+P ++ ++ ++ AG R IL+NPRL+D+ GI R R + ++ E
Sbjct: 109 VSPTPVE----VEQVEKLSQLAGDRVTILLNPRLEDI-AIIGIGYA-ARALRDRFISTIE 162
Query: 244 NCYFFRLLYYAGTQYPIMG--ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
+CY+ R P+ G AL YP+ +E+++ +D GE Y +L+ KP D
Sbjct: 163 SCYYLR---------PLEGDAALYRCYPSLWEVWQEID----GE-YTLLAQEQTKPVGDQ 208
Query: 302 INDAFLG 308
++ G
Sbjct: 209 LDQILAG 215
>gi|332705285|ref|ZP_08425366.1| protein of unknown function, DUF1995 [Moorea producens 3L]
gi|332356028|gb|EGJ35487.1| protein of unknown function, DUF1995 [Moorea producens 3L]
Length = 251
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 109/233 (46%), Gaps = 43/233 (18%)
Query: 76 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 135
++DG+TR++VE+ PE+ + E + + F G +K+ + G AL
Sbjct: 23 LDDGQTRVQVELVFPEIALQAQ----SIAQEFIPI----FEQYGPGLKIFFPDT-GASAL 73
Query: 136 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQNAVGNCI 194
A DWG+ K T V S +++ D +F+LV P +AV
Sbjct: 74 A--------------LRDWGEVPFKITDVGTSRSPAETRIEDDDQVFLLVGP-SAVE--- 115
Query: 195 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 254
+ ++ + + AG RP +++NP+L+D+ S + R R + + E+CY+ R +
Sbjct: 116 VAQVEKICNLAGDRPCVILNPQLEDV--SIVGIGYAARQLRDRFLKTLESCYYLR--PFP 171
Query: 255 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
G GAL YP+ ++++ +D ++Y +++ KP+++ ++ L
Sbjct: 172 G------GALWRCYPSMWQVWLEID-----DEYQLVTEEPSKPTAEALDQIIL 213
>gi|298489954|ref|YP_003720131.1| hypothetical protein Aazo_0482 ['Nostoc azollae' 0708]
gi|298231872|gb|ADI63008.1| Domain of unknown function DUF1995 ['Nostoc azollae' 0708]
Length = 244
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 54/248 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ DG TR++V+ PEL M + FA+ RVKV +
Sbjct: 17 EAVKSALADGVTRIQVDFLFPELK----------FMPVAEQFVPLFAEYESRVKVFFADA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 187
W + K + G + ++ +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WQNVPFKVEDIGTGRAASLQTKIQPEDEIFLFIAPT 111
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQT--MGRDKRLEYAASFENC 245
+ ++ M + TRP++L+NPRL+D SG++ R+ R + ++ E+C
Sbjct: 112 PVE----VPQLEKMCEIIDTRPIVLLNPRLED----SGVVGIGYAARETRRRFISTIESC 163
Query: 246 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y+ R P+ AL YP +E++ +Y ++ ++PS D+I+
Sbjct: 164 YYLR---------PVDDESALFRCYPGQWEVWL-----ESNNEYEKIAELPKRPSGDEID 209
Query: 304 DAFL-GKP 310
+ G+P
Sbjct: 210 MILMKGQP 217
>gi|67922775|ref|ZP_00516276.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|416392960|ref|ZP_11685949.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
gi|67855391|gb|EAM50649.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|357263546|gb|EHJ12537.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
Length = 246
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 66/258 (25%), Positives = 108/258 (41%), Gaps = 49/258 (18%)
Query: 59 EIRNYFYDDVLQATQRA---VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N V QA + A + G R+ +EI IPE+ + + +AL F
Sbjct: 3 EVPNSLEQAVSQAKEAAKIALESGVGRICLEIVIPEIAIQ------------AQALALEF 50
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIG 167
++ EG +G+ + T L DWG D G++ T +E
Sbjct: 51 T------------TLFEGQQSGLKVIFPDTGAAALARRDWGETPFRVTDLGSRATGIE-- 96
Query: 168 SIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIM 227
++ E D++F+LV P + ++ ++ + + AG RPVIL+ P+L+D+ S +
Sbjct: 97 ----RKISEADEIFLLVCPSSVE----VETVEKLCELAGDRPVILLIPQLEDV--SIVGI 146
Query: 228 QTMGRDKRLEYAASFENCYFFRLL--YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE 285
R R + ++ E+ Y+FR L YP + + +GYEL GE
Sbjct: 147 GYAARQLRDRFISTLESVYYFRPLDDVVVLRSYPSPWLVFLEKEDGYELIAEQGQKPMGE 206
Query: 286 KYVILSTFTEKPSSDDIN 303
IL K DD N
Sbjct: 207 ALDILVNNALKGEEDDSN 224
>gi|218247522|ref|YP_002372893.1| hypothetical protein PCC8801_2736 [Cyanothece sp. PCC 8801]
gi|257061142|ref|YP_003139030.1| hypothetical protein Cyan8802_3366 [Cyanothece sp. PCC 8802]
gi|218168000|gb|ACK66737.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256591308|gb|ACV02195.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 245
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/232 (26%), Positives = 102/232 (43%), Gaps = 45/232 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ R++VE+ +PE+ + G +E + DD +KV +
Sbjct: 17 EATKLALEANLGRIQVELVVPEIALQAQ----GLALEFTTLF-----DDLSSLKVIFPDT 67
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 182
G ALA R+ DWG D G++ T VE+ +V EQD F+
Sbjct: 68 -GAAALA---------RR-----DWGETPFKVTDLGSRATSVEM------KVSEQDRAFL 106
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
LV P + ++ ++ + + AG RPV+L+ P+L+D+ S + R R + ++
Sbjct: 107 LVCP----SSVEVESVEKLCNLAGDRPVVLLIPQLEDV--SVVGIGYAARQLRERFISTL 160
Query: 243 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 292
+ Y+FR L A YP + + GYEL GE I+ T
Sbjct: 161 NSAYYFRPLEGAAILRSYPSPWNVYLETETGYELIASEPQKPLGEALEIILT 212
>gi|434386365|ref|YP_007096976.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
gi|428017355|gb|AFY93449.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
Length = 240
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 103/233 (44%), Gaps = 55/233 (23%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
DG R+ VEI IPEL M L R SF+D G +KV + G ALA
Sbjct: 25 DGHKRITVEIVIPELKS----------MGLARQFLESFSDRGNTLKVLFTDT-GAAALAK 73
Query: 138 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG------ANEVDEQDDMFILVAPQNAVG 191
DW +G +I +G +++ +D+++I + P +AV
Sbjct: 74 R--------------DW-----QGVNFKIEDLGNSRSPITDKIQAEDELYIAIDP-SAVE 113
Query: 192 NCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLL 251
++ ++ + +AAG RPVIL P+L+D GI R R + + Y+ + L
Sbjct: 114 ---VEQVEKLCNAAGDRPVILFLPKLEDA-AIVGIGYA-ARQLRDRFLTTLTCAYYIKPL 168
Query: 252 YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
+ AL YP +++++ D + Y++L+ +KP D++++
Sbjct: 169 EAS--------ALYRCYPAQWQVWQEQD-----DDYILLAECPQKPVGDELDE 208
>gi|428183504|gb|EKX52362.1| hypothetical protein GUITHDRAFT_157134 [Guillardia theta CCMP2712]
Length = 325
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 77/169 (45%), Gaps = 14/169 (8%)
Query: 158 GAKGTFVE---IGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILIN 214
G KG+ +E I +EV E DDMFI++ + D++ + AAG RPVIL N
Sbjct: 123 GFKGSLIERVWINKDFVSEVQEDDDMFIIIG----ASAQELPDVRELCKAAGDRPVILFN 178
Query: 215 PRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY----AGTQYPI--MGALRMTY 268
+L+ L G G+ + ++ YF Y AG + I GAL TY
Sbjct: 179 LKLQVLRGDFGLPFFPSKSLHNDWLCEALPAYFMLPKSYTKTIAGPPFLINYSGALFRTY 238
Query: 269 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRYVDYMD 317
P +++ V GG +Y + ++P+ D+ + L K +D +D
Sbjct: 239 PGKWQMLLEVPDEDGGGRYQRVRMLDKRPALSDVREE-LAKELQLDGLD 286
>gi|86607776|ref|YP_476538.1| hypothetical protein CYB_0277 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86556318|gb|ABD01275.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 233
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 108/250 (43%), Gaps = 54/250 (21%)
Query: 66 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 125
+ + AT+ A+ G RL +EI P+L PE+ +ELV+ AL D
Sbjct: 10 EQAIGATRLAMQAGYRRLLIEILAPDLKPEVLAR---PFLELVQPPALVLFSDA------ 60
Query: 126 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVA 185
G ALA ++G+ +E+ S+ + + +LV
Sbjct: 61 -----GGAALAQR-----------------EWGSLPEGIELQSLTSRTQPTPEQSILLVM 98
Query: 186 PQNAVGNCIIDDMKAMT--DAAG-------TRPVILINPRLKDLPGSSGIMQTMGRDKRL 236
P + + +A++ D +G +PV+L+NP+L+D ++ + GR R
Sbjct: 99 PAVYSLDQVERVCQAVSGGDPSGLASKGRDPKPVVLLNPQLQD--AATVGVGLAGRRLRQ 156
Query: 237 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 296
+ ++FE Y+ R L + GAL YP+ + ++++ + Y +L TF +
Sbjct: 157 RFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEE----PGLYSVLKTFIAQ 204
Query: 297 PSSDDINDAF 306
P+ +++ + F
Sbjct: 205 PTGEEVAELF 214
>gi|359457624|ref|ZP_09246187.1| hypothetical protein ACCM5_02793 [Acaryochloris sp. CCMEE 5410]
Length = 234
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/264 (23%), Positives = 115/264 (43%), Gaps = 65/264 (24%)
Query: 69 LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 124
++AT+ A+ D +T L+VEI IPEL P + + ++ L E V+V F+D G
Sbjct: 15 VEATKAAIADHKTLLQVEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67
Query: 125 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 184
++ + +L G +++ + V ++D F+ +
Sbjct: 68 ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
AP + ++ + AG R IL+NP+L+D+ S + GR R + + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV--SIVGIGMAGRTLRERFLNNIES 156
Query: 245 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 302
CY F L GA+ +YP+ ++++ + GE ++ IL+T +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEENRPSGERI 203
Query: 303 NDAF--------LGKPRYVDYMDR 318
+ + KP +D M +
Sbjct: 204 SQLLASLTATEEIRKPGLLDNMQQ 227
>gi|158334629|ref|YP_001515801.1| hypothetical protein AM1_1459 [Acaryochloris marina MBIC11017]
gi|158304870|gb|ABW26487.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 234
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 107/240 (44%), Gaps = 57/240 (23%)
Query: 69 LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 124
++AT+ A+ D +T L++EI IPEL P + + ++ L E V+V F+D G
Sbjct: 15 VEATKSAIADHKTLLQIEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67
Query: 125 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 184
++ + +L G +++ + V ++D F+ +
Sbjct: 68 ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
AP + ++ + AG R IL+NP+L+D+ S + GR R + + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV--SIVGIGMAGRTLRERFLNNIES 156
Query: 245 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 302
CY F L GA+ +YP+ ++++ + GE ++ IL+T +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEESRPSGERI 203
>gi|428203362|ref|YP_007081951.1| hypothetical protein Ple7327_3166 [Pleurocapsa sp. PCC 7327]
gi|427980794|gb|AFY78394.1| protein of unknown function (DUF1995) [Pleurocapsa sp. PCC 7327]
Length = 249
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 115/243 (47%), Gaps = 59/243 (24%)
Query: 70 QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 127
+AT+ A+ G R++VE+ +PE L + +L+E D G ++V
Sbjct: 17 EATKVALEAGCERIQVELVVPEIALQAQSLALEFTSLLE----------DYGSGLRVLFP 66
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMF 181
+ G ALA R+ DWG+ T ++ +G+ ++ E+D F
Sbjct: 67 DT-GAAALA---------RR-----DWGE-----TVFQVSDLGSRFIPVDMKISEEDTAF 106
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
++V+P ++ ++ + + AG RPV+L+ P+L+D+ S + R R + ++
Sbjct: 107 LVVSP----SAIEVNSVEKLCNLAGDRPVVLLIPQLEDV--SVVGIGLAARQLRERFLST 160
Query: 242 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
E+ Y+FR P+ GA+ + +YP+ ++++ G E Y +++ +KP+ +
Sbjct: 161 LESSYYFR---------PLEGAVVLRSYPSLWQVWL-----GGEEDYQLIAEEPQKPTGE 206
Query: 301 DIN 303
++
Sbjct: 207 RLD 209
>gi|443314784|ref|ZP_21044317.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
gi|442785626|gb|ELR95433.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
Length = 246
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 111/240 (46%), Gaps = 52/240 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QAT+ A+ G +R+ V+I IPEL +I + E + F + G + KV +
Sbjct: 19 QATRAAIEAGVSRIVVDIAIPEL-------KIQPVAEQFYPL---FEEMGLQFKVYFPDA 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 187
G ALA DWG D+ +G G + A+ D+ F+++ P
Sbjct: 69 -GAAALAK--------------RDWGNPDFSIRGINEMKGRLEAD-----DEAFLIIEP- 107
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ ++D+++ + A R V+++NP+L+D+ + GI T GR R + ++ E Y+
Sbjct: 108 ---SSVEVNDVESFCNEATGRFVVMLNPKLEDI-ATIGIGYT-GRQLRERFLSTLETIYY 162
Query: 248 FRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
+ P+ GA + YP ++++ + + Y +L+ F +KPS + + F
Sbjct: 163 LQ---------PLEGATILRAYPGLWQVWGE----TTDDGYELLADFPQKPSGEALEKLF 209
>gi|218439257|ref|YP_002377586.1| hypothetical protein PCC7424_2296 [Cyanothece sp. PCC 7424]
gi|218171985|gb|ACK70718.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 253
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/286 (24%), Positives = 122/286 (42%), Gaps = 54/286 (18%)
Query: 59 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N ++QA T+ A+ +G R++VE+ IPE+ + IAL F
Sbjct: 3 EVPNSLEATIIQAKEATRLALEEGLGRIQVELVIPEIALS------------AQTIALEF 50
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGS 168
+ S GEG P A L DWG D G++ T VE+
Sbjct: 51 --------TSLLDSYGEGLKVLFPDSGAAA---LARRDWGETPFKISDLGSRFTPVEM-- 97
Query: 169 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 228
++ +D+ F++V P + + + + D AG RPV+++ P+L+D+ GI
Sbjct: 98 ----KITPEDEAFLVVCP----SSIEVQSVAKLCDLAGDRPVVILIPQLEDV-SIVGIGY 148
Query: 229 TMGRDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEK 286
T R R + ++ ++ Y+ R L A YP + + + +GYEL + GE
Sbjct: 149 T-ARQLRDNFLSTLQSSYYIRPLDGAVVLRAYPELWQVWLEKEDGYELIAQESQKPMGEA 207
Query: 287 YVIL------STFTEKPSSDDINDAF-LGKPRYVDYMDRILDYMDR 325
++ E P + +I+ + KP + M R L +++
Sbjct: 208 LELMIARATGQIAPENPENKEIDKSPKPKKPGLLANMQRFLRALNQ 253
>gi|17230608|ref|NP_487156.1| hypothetical protein all3116 [Nostoc sp. PCC 7120]
gi|17132211|dbj|BAB74815.1| all3116 [Nostoc sp. PCC 7120]
Length = 245
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/247 (22%), Positives = 103/247 (41%), Gaps = 50/247 (20%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ DG R++VE+ PEL M + + R+KV +
Sbjct: 17 EATKSALADGCNRIQVELLFPELK----------FMPVAEQFLPCLVEYESRLKVFFADA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 187
W D + + G I + +++ +D++F+ ++P
Sbjct: 67 GAAALARRD---------------WKDAAFQILDIGTGRIASIQSKIQPEDEIFLFISPT 111
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++ + + G RP I +NPRL+D G+ GI T R R + ++CY+
Sbjct: 112 SVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQTRERFLNIIQSCYY 165
Query: 248 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 305
R PI AL +YP +E++ D + ++ +KPS D+I+
Sbjct: 166 LR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIANLPKKPSGDEIDLI 211
Query: 306 FL-GKPR 311
+ G+P+
Sbjct: 212 LMKGQPQ 218
>gi|428772277|ref|YP_007164065.1| hypothetical protein Cyast_0436 [Cyanobacterium stanieri PCC 7202]
gi|428686556|gb|AFZ46416.1| protein of unknown function DUF1995 [Cyanobacterium stanieri PCC
7202]
Length = 244
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 118/245 (48%), Gaps = 52/245 (21%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
AT A+ G RL++E+ IPE+ + EL A F + G +KV +
Sbjct: 18 ATLTALESGFNRLQIELIIPEIALKAQ--------ELAFAFAQLFGEYGAGLKVFFPDT- 68
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV--EIGSIGA---NEVDEQDDMFILVA 185
G ALA R+ +WG+ TFV ++GS + N V + D+ +++V+
Sbjct: 69 GAAALA---------RR-----EWGE----TTFVVTDLGSSRSPIENRVSDTDEFYLVVS 110
Query: 186 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 245
P +AV + ++ + + AG RPVILI P+L+D+ S + R R + ++ E+
Sbjct: 111 P-SAVE---VAQVEKLCNLAGDRPVILIIPQLEDI--SVVGIGYAARQLRERFLSTLESS 164
Query: 246 YFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
Y++R P+ A+ M +P +++++ ++ K+ +++ +KP + ++
Sbjct: 165 YYYR---------PLEDAVVMRVFPGLWQVWRELE----DNKFELITEVPKKPLGEVLDR 211
Query: 305 AFLGK 309
+G+
Sbjct: 212 ILMGQ 216
>gi|75911048|ref|YP_325344.1| hypothetical protein Ava_4852 [Anabaena variabilis ATCC 29413]
gi|75704773|gb|ABA24449.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 245
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/252 (22%), Positives = 102/252 (40%), Gaps = 52/252 (20%)
Query: 59 EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N D + Q AT+ A+ DG R++VE+ PEL M +
Sbjct: 3 ELPNSLEDAIAQSREATKAALADGCHRIQVELLFPELK----------FMPVAEQFLPCL 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 173
+ R+KV + W D + + G + ++
Sbjct: 53 VEYESRLKVFFADAGAAALARRD---------------WKDAPFQILDIGTGRTASIQSK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 233
+ +D++F+ ++P + + ++ + + G RP I +NPRL+D G+ GI T R
Sbjct: 98 IQPEDEIFLFISPTSVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQ 151
Query: 234 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
R + ++CY+ R PI AL +YP +E++ D + ++
Sbjct: 152 TRERFLNIIQSCYYLR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIA 197
Query: 292 TFTEKPSSDDIN 303
+KPS D+I+
Sbjct: 198 NLPKKPSGDEID 209
>gi|427713531|ref|YP_007062155.1| hypothetical protein Syn6312_2513 [Synechococcus sp. PCC 6312]
gi|427377660|gb|AFY61612.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 6312]
Length = 232
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 65/144 (45%), Gaps = 22/144 (15%)
Query: 170 GANE----VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSG 225
G NE V + D F+ + P + ++ M +AAG RP IL+NP+L+D+ G
Sbjct: 84 GVNELLEPVQDSDQAFVCITPSPVE----VSYIEQMCNAAGNRPFILLNPKLQDV-AIVG 138
Query: 226 IMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE 285
I GR R + + E Y+ R L AL YP +E+++ V+ GE
Sbjct: 139 IGYA-GRQLRERFLNTLETAYYIRPL-------DDTTALLRAYPGDWEIWRDVN----GE 186
Query: 286 KYVILSTFTEKPSSDDINDAFLGK 309
Y L KPS + I+ G+
Sbjct: 187 -YQRLGATPAKPSGEQIDKVLYGE 209
>gi|304570679|ref|YP_001806112.2| hypothetical protein cce_4698 [Cyanothece sp. ATCC 51142]
gi|354552131|ref|ZP_08971439.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
gi|353555453|gb|EHC24841.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
Length = 246
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 100/217 (46%), Gaps = 56/217 (25%)
Query: 123 KVCVQGSMGEGALAGMPLQL----------AGTRKI--------LEFMDWG-------DY 157
++C++ + E AL PL L +G + I L DWG D
Sbjct: 29 RICLEIVIPEIALQAQPLALEFTTLFKGQESGLKVIFPDTGAAALARRDWGETPFQVTDL 88
Query: 158 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 217
G++ T ++ ++ E D+ F+LV+P + ++ ++ + + AG RPVIL+ P+L
Sbjct: 89 GSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLAGHRPVILLIPQL 138
Query: 218 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYK 276
+D+ S + R R + ++ E+ Y+FR P+ G + + +YP+ + +Y
Sbjct: 139 EDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVVLRSYPSPWLVYL 187
Query: 277 RVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGK 309
+ E Y +++ +KP + +N+A G+
Sbjct: 188 EKE-----EGYELIAEQGQKPMGEALDILVNNALKGE 219
>gi|126656438|ref|ZP_01727699.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
gi|126622124|gb|EAZ92831.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
Length = 246
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 113/253 (44%), Gaps = 63/253 (24%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ G R+ +EI IPE+ + + +AL F +
Sbjct: 17 EAAKLALEAGVERICLEIVIPEIALQ------------AQPLALEFT------------T 52
Query: 130 MGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMF 181
+ EG +G+ + T L DWG D G++ T +E ++ E D+ F
Sbjct: 53 LFEGQESGLKVIFPDTGAAALARRDWGETPFQVTDLGSRATSIE------KKISEADEAF 106
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+LV+P + ++ ++ + + AG R VIL+ P+L+D+ S + R R + ++
Sbjct: 107 LLVSP----SSVEVEIVEKLCNLAGHRSVILLIPQLEDV--SIVGIGYAARQLRERFIST 160
Query: 242 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
E+ Y+FR P+ G + + +YP+ + +Y + E Y +++ +KP +
Sbjct: 161 LESVYYFR---------PLDGVVVLRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGE 206
Query: 301 D----INDAFLGK 309
+N+A G+
Sbjct: 207 ALDTLVNNALKGE 219
>gi|428769639|ref|YP_007161429.1| hypothetical protein Cyan10605_1269 [Cyanobacterium aponinum PCC
10605]
gi|428683918|gb|AFZ53385.1| protein of unknown function DUF1995-containing protein
[Cyanobacterium aponinum PCC 10605]
Length = 251
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/246 (21%), Positives = 115/246 (46%), Gaps = 53/246 (21%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
A + A+ G TR+ +++ IPE+ + L + A F++ G +K+ +
Sbjct: 20 ALKVALESGCTRITIDLVIPEIALKAQY--------LAQEFADFFSEYGAGLKLFFPDT- 70
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFILV 184
G ALA R+ DWG+ T ++ IG++ ++ + D +F+++
Sbjct: 71 GAAALA---------RR-----DWGE-----TEFQVTDIGSSRSPIERKISDTDQVFLVI 111
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
+P N + + + + AG RPVIL+ P+L+D+ S + R R + + E+
Sbjct: 112 SPSAVEVNLV----EKLCNIAGDRPVILLIPQLEDV--SIVGIGYAARQLRERFISILES 165
Query: 245 CYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y+FR P+ A+ + +YP ++++++ + E Y +++ +KP ++++
Sbjct: 166 AYYFR---------PLESAVVLRSYPQLWQVWQQNE---ENEDYELIAEVPQKPLGENLD 213
Query: 304 DAFLGK 309
G+
Sbjct: 214 RILQGQ 219
>gi|171701065|gb|ACB54046.1| unknown [Cyanothece sp. ATCC 51142]
Length = 184
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 41/169 (24%), Positives = 83/169 (49%), Gaps = 38/169 (22%)
Query: 153 DWG-------DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA 205
DWG D G++ T ++ ++ E D+ F+LV+P + ++ ++ + + A
Sbjct: 15 DWGETPFQVTDLGSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLA 64
Query: 206 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALR 265
G RPVIL+ P+L+D+ S + R R + ++ E+ Y+FR P+ G +
Sbjct: 65 GHRPVILLIPQLEDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVV 113
Query: 266 M-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGK 309
+ +YP+ + +Y + E Y +++ +KP + +N+A G+
Sbjct: 114 LRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGEALDILVNNALKGE 157
>gi|427724296|ref|YP_007071573.1| hypothetical protein Lepto7376_2460 [Leptolyngbya sp. PCC 7376]
gi|427356016|gb|AFY38739.1| protein of unknown function DUF1995-containing protein
[Leptolyngbya sp. PCC 7376]
Length = 251
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 65/255 (25%), Positives = 116/255 (45%), Gaps = 58/255 (22%)
Query: 51 LNNIPHSREIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMEL 107
++N+P S F D + QA T ++ G R+ VE+ PE
Sbjct: 1 MSNLPKS------FPDTLEQAKSATLTSLEAGCGRVLVELCFPE---------------- 38
Query: 108 VRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEI 166
+ + A S A D +V V GS G+ + T L DWG+ + + +I
Sbjct: 39 IALKAQSLAWDFAQVFVKEYGS-------GLKVLFPDTGAAALAKRDWGE--VEFSVTDI 89
Query: 167 GSIG---ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 223
GS +V++ D +F++V P + + ++ + + AG RPVI++ P+L+D+ S
Sbjct: 90 GSSRNPVQYKVNDADQIFLVVCPSSVE----VAQVEKLCELAGDRPVIMLIPQLEDV--S 143
Query: 224 SGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPS 282
+ R R + ++ E+ Y+ R P GA+ ++P+G+E+Y +
Sbjct: 144 IVGIGYAARQLRERFISTLESAYYIR---------PYDGAMVWRSFPSGWEVYLEKE--- 191
Query: 283 GGEKYVILSTFTEKP 297
GE Y +++T T+KP
Sbjct: 192 EGE-YELIATETQKP 205
>gi|428218666|ref|YP_007103131.1| hypothetical protein Pse7367_2442 [Pseudanabaena sp. PCC 7367]
gi|427990448|gb|AFY70703.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena sp. PCC 7367]
Length = 261
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 81/196 (41%), Gaps = 40/196 (20%)
Query: 153 DWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 208
DW D + E+ G NE + E D F+L+AP + +D ++ + AG R
Sbjct: 76 DWPDISDQ----EVSMRGVNEGRAAIREDDQAFLLIAP----SSVEVDQVEKLLALAGDR 127
Query: 209 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 268
P I+ NPRL++ G+ R R + +F CY+ + L AG + L Y
Sbjct: 128 PFIMFNPRLENSEVGIGLAT---RKMRERFLNTFTVCYYMQPL-DAGLLWRCYPGLWQVY 183
Query: 269 PNGYE------------------LYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL--- 307
G E + D GG + +++ KPS D++ FL
Sbjct: 184 RQGNEVDSDKQSEQSEQSEQSQSVTTEPDDNQGG--WQLINEDDHKPSFDELERIFLRAT 241
Query: 308 GKPRYVDYMDRILDYM 323
GK R ++DR+ ++
Sbjct: 242 GK-RQTTFLDRMQQFL 256
>gi|409992140|ref|ZP_11275348.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
gi|409936997|gb|EKN78453.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
Length = 262
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 102/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A+ DG ++VE+ PE+ E+ I + + AL D +V
Sbjct: 30 QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFP---D 80
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG---SIGANEVDEQDDMFILVAP 186
G ALA R+ DWG+ + T +IG S + D F++V+P
Sbjct: 81 AGAAALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI T R R + + E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYT-ARQLRERFLNTIESCY 178
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ +D GE Y ++ + KP D +
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 222
>gi|384249997|gb|EIE23477.1| hypothetical protein COCSUDRAFT_65935 [Coccomyxa subellipsoidea
C-169]
Length = 335
Score = 46.2 bits (108), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 21/145 (14%)
Query: 165 EIGSIGA-NEVDEQDDMFILVA--PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 221
EI +I + V E DD+ I+ A PQ +++ K + + PVIL NPRL
Sbjct: 174 EIAAISERSAVAEGDDLVIIAAVDPQG------LNEAKRVAAGSSFTPVILFNPRLAS-- 225
Query: 222 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 281
G G+ + R R E+ ++F+ Y R + GT + +P ++++K D
Sbjct: 226 GDVGLGLNV-RRMRNEFLSTFQITYSLRPVNETGTVF-------RRFPGTWKVFKE-DAS 276
Query: 282 SGGEKYVILSTFTEKPSSDDINDAF 306
S G +Y + + F ++P+ DD++ F
Sbjct: 277 SPG-RYDLAAEFRDQPTGDDLDQIF 300
>gi|428214953|ref|YP_007088097.1| hypothetical protein Oscil6304_4664 [Oscillatoria acuminata PCC
6304]
gi|428003334|gb|AFY84177.1| protein of unknown function (DUF1995) [Oscillatoria acuminata PCC
6304]
Length = 247
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 103/231 (44%), Gaps = 52/231 (22%)
Query: 76 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 135
++DG ++VEI IPEL V I E + + F D KV + G AL
Sbjct: 23 LDDGYRLMQVEIGIPELK----VQPIAE--EFLPIFEDLFGD---HFKVYFPDA-GAAAL 72
Query: 136 AGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNC 193
A R+ DWG+ Y +G S ++ +D F+ V P +AV
Sbjct: 73 A---------RR-----DWGERPYIIRGL-----SEPKGQMQPEDQGFLFVEP-SAVE-- 110
Query: 194 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
++ ++ M + AG RP +++ P+L+++ GI GR R + ++ E+CY+ +
Sbjct: 111 -VNTLEKMCEQAGDRPTVILMPKLENV-AIIGI-GLAGRQLRERFLSTIESCYYIQ---- 163
Query: 254 AGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
P G A+ YP+ ++++ G+ Y ++S KP DD+
Sbjct: 164 -----PSQGYAVFRYYPSPWQVWLET-----GDTYQLISETATKPVGDDLE 204
>gi|291567271|dbj|BAI89543.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 249
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 102/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A+ DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 17 QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG---SIGANEVDEQDDMFILVAP 186
ALA R+ DWG+ + T +IG S + D F++V+P
Sbjct: 71 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI R R + + E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNTIESCY 165
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ +D GE Y ++ + KP D +
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 209
>gi|428220338|ref|YP_007104508.1| hypothetical protein Syn7502_00205 [Synechococcus sp. PCC 7502]
gi|427993678|gb|AFY72373.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 7502]
Length = 233
Score = 45.1 bits (105), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 56/269 (20%), Positives = 106/269 (39%), Gaps = 57/269 (21%)
Query: 66 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 125
+ + AT A+ G R+++++ PEL M + A +F +
Sbjct: 13 EQAIAATMSAIAAGCKRIQIDMKFPEL----------AQMPIALAFAEAFTKQYGQAWQA 62
Query: 126 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 181
+ G ALA DW + E+ G NE + F
Sbjct: 63 LFADAGTAALAKR--------------DWAN---SDILREVSMRGVNEGRAAARPESQAF 105
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+L++P + +D ++ + G RP +++NP+L++ G+ RD+ L
Sbjct: 106 LLISPTSVE----VDGIEKLLTLIGDRPFVMLNPKLENSEVGLGLAARRMRDRFLS---- 157
Query: 242 FENCYFFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
F ++YY PI GA+ YP +++++ D +G E ++ E+P D
Sbjct: 158 -----IFEIVYYI---QPIEEGAVWRCYPQLWQVWQ--DKTTGME---LIYEGEERPVGD 204
Query: 301 DIN----DAFLGKPRYVDYMDRILDYMDR 325
D+N A KP + + + ++ + R
Sbjct: 205 DLNRIMGKATGKKPSLITQLQQFINALAR 233
>gi|317969886|ref|ZP_07971276.1| hypothetical protein SCB02_10139 [Synechococcus sp. CB0205]
Length = 232
Score = 44.7 bits (104), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 76/148 (51%), Gaps = 25/148 (16%)
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 237
D + +LVAP A +D++A+ A V+L+N +L+D + GI T+ R++R
Sbjct: 102 DGLLVLVAPTPA----DYEDVEAVC-AQHRGSVVLLNGKLED--AAVGI-GTVARERRKG 153
Query: 238 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
+ A++++ Y L G GALR YP ++LY+R P G Y + F KP
Sbjct: 154 FLATWQSAY---ALIPTGE-----GALRRAYPLDWQLYRR--DPDG---YRFVKDFELKP 200
Query: 298 SSDDINDAFLGK----PRYVDYMDRILD 321
++ + DA G+ R + +DR+++
Sbjct: 201 DAEQMADALEGEGGGVARGLKAVDRLIE 228
>gi|356535083|ref|XP_003536078.1| PREDICTED: uncharacterized protein LOC100803954 [Glycine max]
Length = 344
Score = 44.3 bits (103), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 53/240 (22%), Positives = 97/240 (40%), Gaps = 36/240 (15%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK--RVKVCVQ 127
+A V+ G R VE+ IP+L D G L EL R+ + + K RVK
Sbjct: 107 EAAATFVSSGGMRAIVELLIPQLQFLDDEGAQGELWELSRIFLDTLIEKTKSQRVKAIFP 166
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAP 186
AG +L+ Y K S+ + V+ D++ +++ P
Sbjct: 167 D--------------AGAAALLK------YRWKDALFRFASLSDRKPVESDDEIVVMIVP 206
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+ + +T+ P+I+ NPRL + G+ + + +R + ++F Y
Sbjct: 207 DYQMLEYVERIASNLTNVPEPTPLIMWNPRL--ISEDVGVGFNVRKLRRF-FLSTFTTVY 263
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
F R + + GA+ YP ++++ D ++Y++ F +P +DDI F
Sbjct: 264 FMRPMPF--------GAIFRCYPGLWKVFS--DDKERPDRYLLAKEFESRPDADDIELLF 313
>gi|170076650|ref|YP_001733288.1| hypothetical protein SYNPCC7002_A0014 [Synechococcus sp. PCC 7002]
gi|169884319|gb|ACA98032.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 251
Score = 44.3 bits (103), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 76/149 (51%), Gaps = 25/149 (16%)
Query: 153 DWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRP 209
DWG+ K +IG+ + N++ + D +FI+V P +AV + ++ + + AG RP
Sbjct: 78 DWGEIPFK--VADIGTSRSPIENKIGDDDQIFIIVCP-SAVE---VGQVEKLCNLAGDRP 131
Query: 210 VILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TY 268
VI++ P+L+D+ S + R R + ++ E Y+ R P A+ +Y
Sbjct: 132 VIMLIPQLEDV--SIVGIGYAARQLRERFISTLETAYYIR---------PYESAMVWRSY 180
Query: 269 PNGYELYKRVDVPSGGEKYVILSTFTEKP 297
P+ +E+Y + ++Y ++++ T KP
Sbjct: 181 PSAWEVYLEKE----EDQYELIASETTKP 205
>gi|427739188|ref|YP_007058732.1| hypothetical protein Riv7116_5820 [Rivularia sp. PCC 7116]
gi|427374229|gb|AFY58185.1| protein of unknown function (DUF1995) [Rivularia sp. PCC 7116]
Length = 245
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 96/232 (41%), Gaps = 52/232 (22%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALA- 136
DG TRL+VE+ PEL + + +F + G ++K+ G ALA
Sbjct: 25 DGYTRLQVELLFPELK----------ALPAAQQFLPAFDEYGDKLKIFF-ADAGSAALAR 73
Query: 137 ----GMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGN 192
+P QL +D G G I S +V +D++F+ V P +
Sbjct: 74 RDWNEIPFQL---------LDIG----TGRMTSIES----KVQPEDEIFLFVQPSSVE-- 114
Query: 193 CIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLL 251
+ ++ + G RP ++ PRL+D S + R R + + E+CY+ R +
Sbjct: 115 --VPQLEKLCGIIGEQRPFVMFAPRLED--SSIVGIGYAARQTRQRFINTIESCYYLRPI 170
Query: 252 YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ A+ YP +E++ V G+ Y ++ KPS D+++
Sbjct: 171 FEE-------AAVFRCYPGLWEVW----VEKNGD-YEKVAELPNKPSGDELD 210
>gi|428185262|gb|EKX54115.1| hypothetical protein GUITHDRAFT_100361 [Guillardia theta CCMP2712]
Length = 302
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 100/241 (41%), Gaps = 51/241 (21%)
Query: 29 ERKSRSNQVS--SSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVE 86
E + R N++ ++ K P +NN+ D V ++ A++DG RL+V+
Sbjct: 5 EHQPRRNEIKFDAAARYRKAFPLGVNNV----------LDQVRESLNLALDDGLFRLRVQ 54
Query: 87 INIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTR 146
+NI + + + + G + ELV +I + G R + M + ++A R
Sbjct: 55 LNIHQFDQRLQSFSDGKMPELVAMIGELLVERGMRPSIMFNTIM-DASIA---------R 104
Query: 147 KILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG 206
+L D+ + V I +G E E+ D +LVAP N GN + ++A+
Sbjct: 105 TVLP-----DHLREK--VSINVLGLGEFGEEYDAAVLVAPSNE-GNENLQRIEAVEKLIY 156
Query: 207 T-----------------RPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFR 249
+ RP+ILINP L+ + S T + + F+ YF +
Sbjct: 157 SGPPRGLVSRSARAKYMQRPIILINPVLEAIHSLS----TPHSKVPPMFMSDFDVVYFLQ 212
Query: 250 L 250
L
Sbjct: 213 L 213
>gi|255087973|ref|XP_002505909.1| predicted protein [Micromonas sp. RCC299]
gi|226521180|gb|ACO67167.1| predicted protein [Micromonas sp. RCC299]
Length = 299
Score = 42.7 bits (99), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 32/127 (25%), Positives = 57/127 (44%), Gaps = 12/127 (9%)
Query: 213 INPRLKDLPGSSGIMQTMG----RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 268
IN +++ S ++ +G R K E+ ASFE Y+ G ++ +Y
Sbjct: 169 INNQIRTNDDGSNLISDLGIGPWRKKNEEFLASFELTYWISEQRIQGETVRLL----KSY 224
Query: 269 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRYVDYMDRILDYMDRILY 328
P+ ++L+ D+ SG + + TF EKPS ++ + + V M I DR+
Sbjct: 225 PHPWQLFVLTDMASGSAEPECIKTFDEKPSYKELESLLMSREGSVAAMSII----DRVKR 280
Query: 329 HLLFHST 335
F++T
Sbjct: 281 EAQFNAT 287
>gi|209522945|ref|ZP_03271502.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376001796|ref|ZP_09779650.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|209496532|gb|EDZ96830.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329707|emb|CCE15403.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 249
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 102/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A++DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 17 QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG---SIGANEVDEQDDMFILVAP 186
ALA R+ DWG+ + T +IG S + D F++V+P
Sbjct: 71 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI R R + E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 165
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ ++ GE Y ++ + KP D +
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 209
>gi|423062349|ref|ZP_17051139.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
gi|406716257|gb|EKD11408.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
Length = 262
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 102/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A++DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 30 QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 83
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG---SIGANEVDEQDDMFILVAP 186
ALA R+ DWG+ + T +IG S + D F++V+P
Sbjct: 84 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI R R + E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 178
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ ++ GE Y ++ + KP D +
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 222
>gi|357441663|ref|XP_003591109.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
gi|355480157|gb|AES61360.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
Length = 327
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 99/239 (41%), Gaps = 35/239 (14%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QAT V+ G R VE+ IP+L D L EL R+ + +Q +
Sbjct: 90 QATASFVSSGGMRAIVELLIPQLEFLDDEGAQLELWELSRLF----------LDTLIQET 139
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNA 189
+ A P AG +L++ W D A +F + VD D+M +++ P +
Sbjct: 140 NCQKVKAVFP--DAGAAALLKYR-WKD--ALFSFASLSD--RKLVDSGDEMVVMIVPDHQ 192
Query: 190 VGNCIIDDMKAMTDAAGTRPVILINPRL--KDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++D RP+I+ NPRL KD+ + R R + ++F Y
Sbjct: 193 MLEYVEKIASTLSDDP-PRPLIMWNPRLYSKDVGVGYNV-----RQLRQYFLSTFTTVYH 246
Query: 248 FRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
R + + GA+ YP ++++ D +Y++ + +P ++DI F
Sbjct: 247 MRPMPF--------GAVFRCYPEMWKVFS--DDKDRPNRYLLAKEYANRPDTEDIEMLF 295
>gi|422294314|gb|EKU21614.1| hypothetical protein NGA_0177102 [Nannochloropsis gaditana CCMP526]
Length = 375
Score = 40.8 bits (94), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 53/116 (45%), Gaps = 10/116 (8%)
Query: 195 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 254
+ D++A+ A T+P+I N L+ G G+ +D + ++ + Y R YA
Sbjct: 221 LPDLEALHQADPTKPLIFFNLNLETHRGDLGLPAFPSKDLHYRFLSNIKPVYLLRTRQYA 280
Query: 255 GT--QYPIM----GALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
T + P + GAL TYP GY+ + +G +Y + T E+P+ D
Sbjct: 281 QTLSRPPFILNYQGALFRTYPGGYQCM----LDTGNGRYRRVETSRERPALSGFKD 332
>gi|299116127|emb|CBN76036.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 284
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 57/263 (21%), Positives = 108/263 (41%), Gaps = 48/263 (18%)
Query: 53 NIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPEL-NPEMDVYRIGTLMELVRVI 111
+P S +++ + +A +A++DG+TR + +++P + E+D + G
Sbjct: 48 ELPMSNDVQ---VEQATRAVWQALSDGKTRQILRLSLPLIGKTEIDDWPGG--------- 95
Query: 112 ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA 171
D +R K C G M E L G P G + + +D D +G +
Sbjct: 96 ------DRQRYKAC--GPMVESVLRGNPAAKEGASIVEQILDESDA--------VGLMQL 139
Query: 172 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 231
+ +DD +++ P + + + AG+R V L+NP + G+ Q G
Sbjct: 140 QCKEAKDDASVMIFPTTDT----LSLQQQVAAGAGSRLVALVNPAWRS-SSDFGLFQA-G 193
Query: 232 RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
K E + FE Y + L G ++ YP + L+ + G K +L
Sbjct: 194 EAK--ETLSEFEVTYSLQSLVVYGYTLRLL----YCYPGPWRLFL---IGEGSGKRELLG 244
Query: 292 TFTEKPSSDDINDAFL----GKP 310
++ + P ++D ++ + GKP
Sbjct: 245 SWDQSPPTNDQVESAVFDKKGKP 267
>gi|123966201|ref|YP_001011282.1| hypothetical protein P9515_09681 [Prochlorococcus marinus str. MIT
9515]
gi|123200567|gb|ABM72175.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 215
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 66/140 (47%), Gaps = 32/140 (22%)
Query: 170 GANE---VDEQDDMFILVAPQN---AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 223
G NE +D+ D + I + PQ + + ++ K + + +NP+ +D +
Sbjct: 97 GFNESIYIDQDDSIVISILPQPYDFGLFEPMCENFKGLHYS--------LNPKFED--AN 146
Query: 224 SGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIM-GALRMTYPNGYELYKRVDVPS 282
GI + R++R + +++N YF + PI GAL +YPN + L+K ++
Sbjct: 147 IGIGSVI-RERRKNFVKTWQNIYFLQ---------PINNGALMHSYPNNWLLFKEIN--- 193
Query: 283 GGEKYVILSTFTEKPSSDDI 302
KY+ F EKP ++ I
Sbjct: 194 --NKYLFKKEFEEKPDNETI 211
>gi|297812961|ref|XP_002874364.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
gi|297320201|gb|EFH50623.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
Length = 340
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 98/245 (40%), Gaps = 43/245 (17%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A VN G TR VE+ IP+L D L +L RV + ++ +
Sbjct: 104 EAGAAFVNSGGTRAIVELLIPQLQFLDDEGAQAELWDLSRVF----------LDTLIKET 153
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNA 189
E A P AG +L++ W D TF V+++D++ ++V P
Sbjct: 154 GCERVKAVFP--DAGAAALLKYR-WKD----ATFGFASLSDRKPVEKEDEIIVMVVPDYQ 206
Query: 190 VGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG-----RDKRLEYAASFEN 244
+ + K + D RP+I+ NPRL I + +G R R + +SF
Sbjct: 207 MLEYVEKIAKGLADDP-PRPLIMWNPRL--------ISEEVGVGFNVRKLRRYFLSSFTT 257
Query: 245 CYFFRLLYYAGTQYPIMGALRMTYPNGYEL-YKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y R L GA+ YP +++ Y D P +Y++ +P ++D+
Sbjct: 258 VYSMRPL--------AAGAVFRCYPGKWKVFYDNKDRPG---RYLLAKELIGRPDAEDLE 306
Query: 304 DAFLG 308
+ G
Sbjct: 307 IIYGG 311
>gi|449544416|gb|EMD35389.1| hypothetical protein CERSUDRAFT_21069, partial [Ceriporiopsis
subvermispora B]
Length = 383
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 47/205 (22%), Positives = 73/205 (35%), Gaps = 68/205 (33%)
Query: 157 YGAKGTFVEIGSIGANEV------------DEQDDMFILVAPQNAVGNCIID-------- 196
Y + V I + GA + D QD M +V P N +G ++
Sbjct: 29 YEKRNDVVAIAATGAGKTLSFWMPLLMALEDNQDKMIFVVTPLNILGKQNVELLNAAGIR 88
Query: 197 ----DMKAMTD------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
D K++TD AG V++INP + +M G ++L SF +
Sbjct: 89 GIAIDAKSVTDENFKNIEAGMFQVVVINPEI--------LMTDSGHCEKLWKKPSFVS-- 138
Query: 247 FFRLLYYA-------------GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTF 293
RLLY+ QY +G+LR P Y + S
Sbjct: 139 --RLLYFVFDEGHCMSEWSSFREQYKYLGSLRFLIPEVIPFY-------------VASAT 183
Query: 294 TEKPSSDDINDAFLGKPRYVDYMDR 318
P DD+ + +PR+ D++ R
Sbjct: 184 LPTPILDDVTEVLHLRPRHTDHIFR 208
>gi|284991684|ref|YP_003410238.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
43160]
gi|284064929|gb|ADB75867.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
43160]
Length = 489
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 48/111 (43%), Gaps = 8/111 (7%)
Query: 85 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM---PLQ 141
V I E NPE+ R L L R +AL+ GKR V V G+ G+ + M +Q
Sbjct: 84 VSTAIREDNPELAAARERGLRVLPRAVALAAVMAGKR-SVAVAGTHGKTSTTSMLTVAVQ 142
Query: 142 LAGTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNAV 190
G F GD G+ G + E DE D F+L+AP A+
Sbjct: 143 ACGVDP--SFAIGGDLNESGSNAHAGGGDVFVAEADESDRSFLLLAPHGAI 191
>gi|324998739|ref|ZP_08119851.1| UDP-N-acetylmuramate--L-alanine ligase [Pseudonocardia sp. P1]
Length = 478
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 48/109 (44%), Gaps = 4/109 (3%)
Query: 85 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM-PLQLA 143
V I E NPE+ R L + R +AL+ +G+R+ V V G+ G+ + M + L
Sbjct: 78 VSTAIRETNPELAAARDQGLQVVHRAVALAALTEGRRL-VAVAGTHGKTSTTSMVTVALQ 136
Query: 144 GTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNAV 190
T F GD G+ GS + E DE D F AP AV
Sbjct: 137 HTGADPSFAIGGDLSVSGSGAHHGSGDVFVVEADESDGSFTAFAPSVAV 185
>gi|383830926|ref|ZP_09986015.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
xinjiangensis XJ-54]
gi|383463579|gb|EID55669.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
xinjiangensis XJ-54]
Length = 474
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 53/126 (42%), Gaps = 9/126 (7%)
Query: 73 QRAVN-----DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 127
QRA N D + + V I E NPE+ V R + L R AL+ +G RV C+
Sbjct: 66 QRAENLDVFDDAPSAVVVSTAIKETNPELVVARERGVPVLHRAEALAALMEGHRVA-CIA 124
Query: 128 GSMGEGALAGM-PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG--ANEVDEQDDMFILV 184
G+ G+ + M + L R F GD G G G E DE D F+
Sbjct: 125 GTHGKTSTTSMLTVALQHCRLDPSFAIGGDLNESGANAHHGEGGIFVAEADESDGSFLSY 184
Query: 185 APQNAV 190
+P AV
Sbjct: 185 SPSVAV 190
>gi|72382164|ref|YP_291519.1| hypothetical protein PMN2A_0324 [Prochlorococcus marinus str.
NATL2A]
gi|72002014|gb|AAZ57816.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 206
Score = 38.1 bits (87), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 44/93 (47%), Gaps = 16/93 (17%)
Query: 210 VILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYP 269
VI+IN +L+D G ++GR+ R +A ++ YF + L MGAL YP
Sbjct: 126 VIMINGKLEDPIVGIG---SVGREMRKRFAKKWKVLYFVQPLS--------MGALLKRYP 174
Query: 270 NGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 302
N +EL+K Y + +F +P + I
Sbjct: 175 NDWELFKL-----NNNGYTFVKSFINRPDDETI 202
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.138 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,575,888,686
Number of Sequences: 23463169
Number of extensions: 242592068
Number of successful extensions: 520628
Number of sequences better than 100.0: 136
Number of HSP's better than 100.0 without gapping: 38
Number of HSP's successfully gapped in prelim test: 98
Number of HSP's that attempted gapping in prelim test: 520414
Number of HSP's gapped (non-prelim): 176
length of query: 362
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 218
effective length of database: 8,980,499,031
effective search space: 1957748788758
effective search space used: 1957748788758
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)