BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 020260
(328 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255565097|ref|XP_002523541.1| adenylate kinase, putative [Ricinus communis]
gi|223537248|gb|EEF38880.1| adenylate kinase, putative [Ricinus communis]
Length = 597
Score = 564 bits (1453), Expect = e-158, Method: Compositional matrix adjust.
Identities = 265/328 (80%), Positives = 293/328 (89%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +I SLLSQ +++K + + + + +S NQ S SQD W+GIPTRLNNIPHSREI
Sbjct: 270 VVFEEISSLLSQVQKDKEEPVKSKNLIPKSESSYNQASVSQDKWRGIPTRLNNIPHSREI 329
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFYDDVLQATQRAVNDGR RLKVEI IPELNPEMDVYRIGTLMELVRV+ALSFADDGK
Sbjct: 330 RTYFYDDVLQATQRAVNDGRMRLKVEIAIPELNPEMDVYRIGTLMELVRVLALSFADDGK 389
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMG+GALAGMPLQL+GTRKILEFMDWGDYGA GTF+ IGSIGA EVDE DDM
Sbjct: 390 RVKVCVQGSMGQGALAGMPLQLSGTRKILEFMDWGDYGALGTFIRIGSIGAKEVDEDDDM 449
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCIIDD++AMTDAAG+RPVIL+NPRLKDLPGSSGIMQTMGR +RLEYAA
Sbjct: 450 FILVAPQNAVGNCIIDDLRAMTDAAGSRPVILVNPRLKDLPGSSGIMQTMGRAERLEYAA 509
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFENCY FRLLYY GTQYPIMGALRM+YP YELYKRVD+P+ GEKY ILSTF+EKP+ D
Sbjct: 510 SFENCYSFRLLYYGGTQYPIMGALRMSYPYHYELYKRVDLPARGEKYEILSTFSEKPTGD 569
Query: 301 DINDAFLGKPRNREKKASGFWGFLSSVF 328
DINDAFLG+ RN++KKASGFWGFLS V
Sbjct: 570 DINDAFLGRSRNQDKKASGFWGFLSGVL 597
>gi|359484753|ref|XP_002264815.2| PREDICTED: uncharacterized protein LOC100256872 [Vitis vinifera]
Length = 597
Score = 560 bits (1443), Expect = e-157, Method: Compositional matrix adjust.
Identities = 265/327 (81%), Positives = 291/327 (88%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
VF I+SLLS+ +++K K + + SN SSSQ+NW+GIPTRLNNIPHSREIR
Sbjct: 271 VFKPIDSLLSKMQQDKENMMKLEKSIRASERLSNLASSSQENWRGIPTRLNNIPHSREIR 330
Query: 62 NYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKR 121
YFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV+ALSFADDGK
Sbjct: 331 TYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVLALSFADDGKH 390
Query: 122 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMF 181
VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF+ IGSIG+ EV+EQDD+F
Sbjct: 391 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFINIGSIGSKEVEEQDDLF 450
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
ILVAPQNAVGNCIIDD+KAMTDAAG RPVIL+NPRLKDLPGSSGIMQTMGRDKRLEYAAS
Sbjct: 451 ILVAPQNAVGNCIIDDLKAMTDAAGNRPVILVNPRLKDLPGSSGIMQTMGRDKRLEYAAS 510
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FENCYFFRLLYYAGTQYPIMGA+RM+YP YELYKRVD G EKY+ILST + +P+ D+
Sbjct: 511 FENCYFFRLLYYAGTQYPIMGAIRMSYPYRYELYKRVDESFGKEKYIILSTSSARPNIDE 570
Query: 302 INDAFLGKPRNREKKASGFWGFLSSVF 328
INDAFLGKPR+ KKA+GFWGFLS +
Sbjct: 571 INDAFLGKPRSEAKKATGFWGFLSGIL 597
>gi|296084582|emb|CBI25603.3| unnamed protein product [Vitis vinifera]
Length = 652
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 254/293 (86%), Positives = 273/293 (93%)
Query: 36 QVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 95
Q S SQ+NW+GIPTRLNNIPHSREIR YFYDDVLQATQRAVNDGRTRLKVEINIPELNPE
Sbjct: 360 QASLSQENWRGIPTRLNNIPHSREIRTYFYDDVLQATQRAVNDGRTRLKVEINIPELNPE 419
Query: 96 MDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 155
MDVYRIGTLMELVRV+ALSFADDGK VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG
Sbjct: 420 MDVYRIGTLMELVRVLALSFADDGKHVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG 479
Query: 156 DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINP 215
DYGAKGTF+ IGSIG+ EV+EQDD+FILVAPQNAVGNCIIDD+KAMTDAAG RPVIL+NP
Sbjct: 480 DYGAKGTFINIGSIGSKEVEEQDDLFILVAPQNAVGNCIIDDLKAMTDAAGNRPVILVNP 539
Query: 216 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELY 275
RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGA+RM+YP YELY
Sbjct: 540 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGAIRMSYPYRYELY 599
Query: 276 KRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
KRVD G EKY+ILST + +P+ D+INDAFLGKPR+ KKA+GFWGFLS +
Sbjct: 600 KRVDESFGKEKYIILSTSSARPNIDEINDAFLGKPRSEAKKATGFWGFLSGIL 652
>gi|449493659|ref|XP_004159398.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
LOC101226743 [Cucumis sativus]
Length = 609
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 252/334 (75%), Positives = 287/334 (85%), Gaps = 7/334 (2%)
Query: 2 VFADIESLLSQAKREKMKKTTAVK-----P--VHERKSRSNQVSSSQDNWKGIPTRLNNI 54
+F ++ SLLSQ +++++ ++ P + E K NQ S S+D+W+GIPTRLNNI
Sbjct: 276 IFEELSSLLSQIQKKRLXSQVMLRFLWNSPQFLPENKGTVNQPSESRDSWRGIPTRLNNI 335
Query: 55 PHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALS 114
PHSREIR YFYDDVLQAT+RAV GRTRLKVE NIPELNPEMDVYRIGTLMELVR IALS
Sbjct: 336 PHSREIRKYFYDDVLQATKRAVQAGRTRLKVENNIPELNPEMDVYRIGTLMELVRTIALS 395
Query: 115 FADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEV 174
FADDG+RVKVC+QGSMGEGAL+GMPLQLAGTR+ILE+MDWG+YGA GTFV+IGSIGA EV
Sbjct: 396 FADDGRRVKVCIQGSMGEGALSGMPLQLAGTRRILEYMDWGEYGALGTFVKIGSIGAKEV 455
Query: 175 DEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDK 234
D++DDMFILVAPQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKDLPGSSGIMQTMGR+K
Sbjct: 456 DDEDDMFILVAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDLPGSSGIMQTMGREK 515
Query: 235 RLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFT 294
RLEYAASFE CYFFRLLYYAGTQYPIMGALRM+YP GYELY+RVD PSG EKY+ LS +
Sbjct: 516 RLEYAASFEICYFFRLLYYAGTQYPIMGALRMSYPYGYELYRRVDEPSGKEKYIELSKYP 575
Query: 295 EKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
++PS+DDINDAF G R K +SG WGFLS +
Sbjct: 576 KRPSTDDINDAFQGNKRKEAKSSSGIWGFLSGIL 609
>gi|356556590|ref|XP_003546607.1| PREDICTED: uncharacterized protein LOC100818830 [Glycine max]
Length = 591
Score = 527 bits (1358), Expect = e-147, Method: Compositional matrix adjust.
Identities = 251/327 (76%), Positives = 282/327 (86%), Gaps = 13/327 (3%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
VF +IESLLSQ +++K+K + + QD W+GIPTRLNNIPHSREIR
Sbjct: 278 VFKEIESLLSQLQQDKVKIIESGGMI-------------QDKWRGIPTRLNNIPHSREIR 324
Query: 62 NYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKR 121
YFYDDVLQATQRA++DG+TR+KV+INIPELNPEMDVYRIGTLMELVR +ALSFADDGKR
Sbjct: 325 KYFYDDVLQATQRAISDGKTRVKVDINIPELNPEMDVYRIGTLMELVRALALSFADDGKR 384
Query: 122 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMF 181
VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF+ IGSIGA EV+EQDDMF
Sbjct: 385 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFINIGSIGAAEVEEQDDMF 444
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
ILVAPQNAVGNCIIDD++AMT+AA RPVILIN RLKDLPGSSGIMQT+GRD+RL+YAAS
Sbjct: 445 ILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARLKDLPGSSGIMQTIGRDQRLKYAAS 504
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FE+CYFFRLLYYAGTQYPIMGA+RMTYP Y+LYKRVD G EKYVILSTF E+P++D+
Sbjct: 505 FESCYFFRLLYYAGTQYPIMGAIRMTYPYQYDLYKRVDESPGKEKYVILSTFPERPTTDE 564
Query: 302 INDAFLGKPRNREKKASGFWGFLSSVF 328
INDAF GKP +K SG WGFLS +
Sbjct: 565 INDAFQGKPSKDPRKTSGIWGFLSGIL 591
>gi|224144020|ref|XP_002325159.1| predicted protein [Populus trichocarpa]
gi|222866593|gb|EEF03724.1| predicted protein [Populus trichocarpa]
Length = 520
Score = 526 bits (1356), Expect = e-147, Method: Compositional matrix adjust.
Identities = 251/311 (80%), Positives = 279/311 (89%), Gaps = 13/311 (4%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +I+SLLSQ +R+++K + S QD W+GIPTRLNNIPHSREI
Sbjct: 223 VVFKEIDSLLSQVQRDEVKLVKSG-------------ISLQDKWRGIPTRLNNIPHSREI 269
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
RNYFYDDVLQATQRAVNDGRTRL+V+INIPELNPEMDVYRIGTLMELVRVIALSF+DDGK
Sbjct: 270 RNYFYDDVLQATQRAVNDGRTRLRVDINIPELNPEMDVYRIGTLMELVRVIALSFSDDGK 329
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
VKVCVQGSMG+GALAGMPLQLAGTR ILEFMDWGDYGAKGTF++IGSIG EV E+DDM
Sbjct: 330 HVKVCVQGSMGKGALAGMPLQLAGTRNILEFMDWGDYGAKGTFIKIGSIGEQEVSEEDDM 389
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCII+D+KAMT AAG+RPVILINP+LKDLPGSSGIMQTMGRDKRLEYAA
Sbjct: 390 FILVAPQNAVGNCIIEDLKAMTVAAGSRPVILINPKLKDLPGSSGIMQTMGRDKRLEYAA 449
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFE+CYFFRLLYYAGTQYPIMGALRM+YP YELYKRVD PSG EKY+ILSTF+EKPS+D
Sbjct: 450 SFESCYFFRLLYYAGTQYPIMGALRMSYPYRYELYKRVDQPSGTEKYMILSTFSEKPSTD 509
Query: 301 DINDAFLGKPR 311
D+NDAF+G+PR
Sbjct: 510 DVNDAFMGRPR 520
>gi|449443265|ref|XP_004139400.1| PREDICTED: uncharacterized protein LOC101203230 [Cucumis sativus]
Length = 577
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 249/327 (76%), Positives = 279/327 (85%), Gaps = 14/327 (4%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
+F ++ SLLSQ ++EK K+ +S D+W+GIPTRLNNIPHSREIR
Sbjct: 265 IFEELSSLLSQIQKEKAMKSGN--------------ASFSDSWRGIPTRLNNIPHSREIR 310
Query: 62 NYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKR 121
YFYDDVLQAT+RAV GRTRLKVE NIPELNPEMDVYRIGTLMELVR IALSFADDG+R
Sbjct: 311 KYFYDDVLQATKRAVQAGRTRLKVENNIPELNPEMDVYRIGTLMELVRTIALSFADDGRR 370
Query: 122 VKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMF 181
VKVC+QGSMGEGAL+GMPLQLAGTR+ILE+MDWG+YGA GTFV+IGSIGA EVD++DDMF
Sbjct: 371 VKVCIQGSMGEGALSGMPLQLAGTRRILEYMDWGEYGALGTFVKIGSIGAKEVDDEDDMF 430
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
ILVAPQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKDLPGSSGIMQTMGR+KRLEYAAS
Sbjct: 431 ILVAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDLPGSSGIMQTMGREKRLEYAAS 490
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FE CYFFRLLYYAGTQYPIMGALRM+YP GYELY+RVD PSG EKY+ LS + ++PS+DD
Sbjct: 491 FEICYFFRLLYYAGTQYPIMGALRMSYPYGYELYRRVDEPSGKEKYIELSKYPKRPSTDD 550
Query: 302 INDAFLGKPRNREKKASGFWGFLSSVF 328
INDAF G R K +SG WGFLS +
Sbjct: 551 INDAFQGNKRKEAKSSSGIWGFLSGIL 577
>gi|356532999|ref|XP_003535056.1| PREDICTED: uncharacterized protein LOC100796331 [Glycine max]
Length = 1016
Score = 520 bits (1339), Expect = e-145, Method: Compositional matrix adjust.
Identities = 241/291 (82%), Positives = 265/291 (91%)
Query: 38 SSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMD 97
S+ DNW+GIPTRLNNIPHSREIR YFYDDVLQATQRA+NDG+TR+KV+INIPELNPEMD
Sbjct: 726 SAEADNWRGIPTRLNNIPHSREIRKYFYDDVLQATQRAINDGKTRVKVDINIPELNPEMD 785
Query: 98 VYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 157
VYRIGTLMELVR +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY
Sbjct: 786 VYRIGTLMELVRTLALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDY 845
Query: 158 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 217
GAKGTF+ IGSIGA EV+E+DDMFILVAPQNAVGNCIIDD++AMT+AA RPVILIN RL
Sbjct: 846 GAKGTFINIGSIGAAEVEEEDDMFILVAPQNAVGNCIIDDLRAMTNAAEHRPVILINARL 905
Query: 218 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 277
KDLPGSSGIMQTMGRD+RL+YAASFE+CYFFRLLYYAGTQYPIMGA+RMTYP YELYKR
Sbjct: 906 KDLPGSSGIMQTMGRDQRLKYAASFESCYFFRLLYYAGTQYPIMGAIRMTYPYQYELYKR 965
Query: 278 VDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
VD G EKY ILSTF E+P++D+INDAF GKP +K SG WGFL+ +
Sbjct: 966 VDESPGKEKYAILSTFPERPTTDEINDAFEGKPSKDPRKTSGIWGFLNGIL 1016
>gi|22327339|ref|NP_198367.2| adenylate kinase family protein [Arabidopsis thaliana]
gi|17979434|gb|AAL49859.1| putative adenylate kinase [Arabidopsis thaliana]
gi|22136756|gb|AAM91697.1| putative adenylate kinase [Arabidopsis thaliana]
gi|110740593|dbj|BAE98401.1| adenylate kinase -like protein [Arabidopsis thaliana]
gi|332006553|gb|AED93936.1| adenylate kinase family protein [Arabidopsis thaliana]
Length = 588
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 236/328 (71%), Positives = 276/328 (84%), Gaps = 13/328 (3%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF + ++LLSQ + ++M KT PV QD W+GIPTRLNNIPHSR+I
Sbjct: 273 VVFEETQTLLSQIQLKRMIKTDKASPV-------------QDKWRGIPTRLNNIPHSRDI 319
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY+DVLQAT R++ DG TRL+V+INIPELNPEMDVYRIGTLMELV+ +ALSFADDGK
Sbjct: 320 RAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPEMDVYRIGTLMELVQALALSFADDGK 379
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWGD GTFV++G+IG EVDE+DDM
Sbjct: 380 RVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWGDDETLGTFVKLGAIGGKEVDEEDDM 439
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP SSGIMQTMGR++RLEYA
Sbjct: 440 FILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLPASSGIMQTMGREQRLEYAL 499
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
+F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EKYV+L+T+ E+P+ +
Sbjct: 500 TFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEKYVLLATYAERPTPE 559
Query: 301 DINDAFLGKPRNREKKASGFWGFLSSVF 328
I+DAF GK R++ KKASG WGFLSSVF
Sbjct: 560 QIDDAFSGKSRDQSKKASGIWGFLSSVF 587
>gi|297801048|ref|XP_002868408.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314244|gb|EFH44667.1| adenylate kinase family protein [Arabidopsis lyrata subsp. lyrata]
Length = 595
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 239/331 (72%), Positives = 281/331 (84%), Gaps = 8/331 (2%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQV---SSSQDNWKGIPTRLNNIPHS 57
+VF + ++LLSQ + ++M KT +R S + V S QD W+GIPTRLNNIPHS
Sbjct: 269 VVFEETQTLLSQIQLKRMIKTG-----KKRDSGTPLVYKASPVQDKWRGIPTRLNNIPHS 323
Query: 58 REIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD 117
R+IR YFY+DVLQAT R++ DG TRL+V+INIPELNPEMDVYRIGTLMELVRV+ALSFAD
Sbjct: 324 RDIRAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPEMDVYRIGTLMELVRVLALSFAD 383
Query: 118 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ 177
DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWGD GTFV++G+IG EVDE+
Sbjct: 384 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWGDDETLGTFVKLGAIGGKEVDEE 443
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 237
DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP SSGIMQTMGR++RLE
Sbjct: 444 DDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLPASSGIMQTMGREQRLE 503
Query: 238 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
YA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EKYV+L+T++E+P
Sbjct: 504 YALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEKYVLLATYSERP 563
Query: 298 SSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
+ + I+DAF GK R++ KK SG WGFLSSVF
Sbjct: 564 TPEQIDDAFSGKSRDQSKKPSGIWGFLSSVF 594
>gi|357145496|ref|XP_003573662.1| PREDICTED: uncharacterized protein LOC100845101 [Brachypodium
distachyon]
Length = 596
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 231/332 (69%), Positives = 277/332 (83%), Gaps = 10/332 (3%)
Query: 1 MVFADIESLL----SQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF +IESLL + A +K+ KT + S ++S++ W+GIPTRLNNIPH
Sbjct: 271 VVFQEIESLLQKICTNASEDKLAKTNG------KSSDFVGTTTSKNEWRGIPTRLNNIPH 324
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFYDDVLQAT+RA+ D +TRL+++INIPELNPEMDVYRIGTLMELVR I+L+FA
Sbjct: 325 SREIRKYFYDDVLQATKRAIEDKKTRLQIDINIPELNPEMDVYRIGTLMELVREISLAFA 384
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE 176
DDGKRVKV VQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGAKG F+ IG++GA+EVD+
Sbjct: 385 DDGKRVKVSVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAKGAFINIGAVGASEVDK 444
Query: 177 QDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRL 236
+DDMFIL+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL
Sbjct: 445 EDDMFILIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDMRL 504
Query: 237 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 296
+YAASFE CY FRLL+YAGT YPIMGALRM YPN YE+Y+RVD P+G EKYV+L+ +K
Sbjct: 505 KYAASFETCYSFRLLFYAGTFYPIMGALRMGYPNKYEIYRRVDEPNGKEKYVLLAELMDK 564
Query: 297 PSSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
P++DDI +AF G + +E SGFWGFLS +
Sbjct: 565 PTADDITNAFKGPKKEKESAPSGFWGFLSGIL 596
>gi|79329010|ref|NP_001031966.1| adenylate kinase family protein [Arabidopsis thaliana]
gi|332006554|gb|AED93937.1| adenylate kinase family protein [Arabidopsis thaliana]
Length = 580
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 229/321 (71%), Positives = 269/321 (83%), Gaps = 13/321 (4%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF + ++LLSQ + ++M KT PV QD W+GIPTRLNNIPHSR+I
Sbjct: 273 VVFEETQTLLSQIQLKRMIKTDKASPV-------------QDKWRGIPTRLNNIPHSRDI 319
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY+DVLQAT R++ DG TRL+V+INIPELNPEMDVYRIGTLMELV+ +ALSFADDGK
Sbjct: 320 RAYFYEDVLQATIRSIKDGNTRLRVDINIPELNPEMDVYRIGTLMELVQALALSFADDGK 379
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMGEGALAGMPLQLAGTRKILE+MDWGD GTFV++G+IG EVDE+DDM
Sbjct: 380 RVKVCVQGSMGEGALAGMPLQLAGTRKILEYMDWGDDETLGTFVKLGAIGGKEVDEEDDM 439
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP SSGIMQTMGR++RLEYA
Sbjct: 440 FILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLPASSGIMQTMGREQRLEYAL 499
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
+F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EKYV+L+T+ E+P+ +
Sbjct: 500 TFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEKYVLLATYAERPTPE 559
Query: 301 DINDAFLGKPRNREKKASGFW 321
I+DAF GK R++ KKASG W
Sbjct: 560 QIDDAFSGKSRDQSKKASGIW 580
>gi|326499868|dbj|BAJ90769.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326502206|dbj|BAJ95166.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 596
Score = 486 bits (1252), Expect = e-135, Method: Compositional matrix adjust.
Identities = 222/328 (67%), Positives = 274/328 (83%), Gaps = 6/328 (1%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +IES+L +K+ T+ + + + ++S+ W+GIPTRLNNIPHSREI
Sbjct: 275 VVFHEIESML-----QKICTNTSKNKLAKTNGKPPDTTASKKEWRGIPTRLNNIPHSREI 329
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+DVYRIGTLMELVR ++L+FADDGK
Sbjct: 330 RKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPELDVYRIGTLMELVRDLSLTFADDGK 389
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGA G F+ IG++GA+EVD++DDM
Sbjct: 390 RVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAMGAFINIGAVGASEVDKEDDM 449
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL+YAA
Sbjct: 450 FVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDLRLQYAA 509
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+RVD P+ GEKY +L+ FT P++D
Sbjct: 510 SFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYRRVDEPN-GEKYDLLAEFTANPTAD 568
Query: 301 DINDAFLGKPRNREKKASGFWGFLSSVF 328
DI +AF+G + +E SGFWGFLS +
Sbjct: 569 DITNAFVGTKKKKENAPSGFWGFLSGIL 596
>gi|326509295|dbj|BAJ91564.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 512
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 222/328 (67%), Positives = 274/328 (83%), Gaps = 6/328 (1%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
+VF +IES+L +K+ T+ + + + ++S+ W+GIPTRLNNIPHSREI
Sbjct: 191 VVFHEIESML-----QKICTNTSKNKLAKTNGKPPDTTASKKEWRGIPTRLNNIPHSREI 245
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFYDDV+QAT+RA+ D +TRL+++INIPELNPE+DVYRIGTLMELVR ++L+FADDGK
Sbjct: 246 RKYFYDDVIQATKRAIEDKKTRLQIDINIPELNPELDVYRIGTLMELVRDLSLTFADDGK 305
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGA G F+ IG++GA+EVD++DDM
Sbjct: 306 RVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAMGAFINIGAVGASEVDKEDDM 365
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL+YAA
Sbjct: 366 FVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDLRLQYAA 425
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SFE CY FRLL+Y GT YPIMGALRM YPN YE+Y+RVD P+ GEKY +L+ FT P++D
Sbjct: 426 SFETCYSFRLLFYVGTFYPIMGALRMAYPNKYEIYRRVDEPN-GEKYDLLAEFTANPTAD 484
Query: 301 DINDAFLGKPRNREKKASGFWGFLSSVF 328
DI +AF+G + +E SGFWGFLS +
Sbjct: 485 DITNAFVGTKKKKENAPSGFWGFLSGIL 512
>gi|297608344|ref|NP_001061454.2| Os08g0288200 [Oryza sativa Japonica Group]
gi|255678327|dbj|BAF23368.2| Os08g0288200, partial [Oryza sativa Japonica Group]
Length = 493
Score = 482 bits (1241), Expect = e-134, Method: Compositional matrix adjust.
Identities = 228/331 (68%), Positives = 272/331 (82%), Gaps = 8/331 (2%)
Query: 1 MVFADIESLLSQAKRE---KMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHS 57
+VF +I+SLL + M T KP S ++S++ ++GIPTRLNNIPHS
Sbjct: 168 VVFNEIDSLLQKICENASFNMLAKTNGKP-----QDSKDTTASKNEFRGIPTRLNNIPHS 222
Query: 58 REIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD 117
REIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFAD
Sbjct: 223 REIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFAD 282
Query: 118 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ 177
DGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD++
Sbjct: 283 DGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDKE 342
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 237
DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKD+PGSSG+MQTMGRD RL+
Sbjct: 343 DDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDMPGSSGVMQTMGRDMRLK 402
Query: 238 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RVD P+G E+YV+L F EKP
Sbjct: 403 YAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRVDEPNGQERYVLLEEFVEKP 462
Query: 298 SSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
+ D+I +AF + EK ASGFWGFLS +
Sbjct: 463 TPDEITNAFRPRKNENEKSASGFWGFLSGIL 493
>gi|38175634|dbj|BAD01341.1| putative Adenylate kinase, chloroplast [Oryza sativa Japonica
Group]
Length = 467
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/331 (68%), Positives = 272/331 (82%), Gaps = 8/331 (2%)
Query: 1 MVFADIESLLSQAKRE---KMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHS 57
+VF +I+SLL + M T KP S ++S++ ++GIPTRLNNIPHS
Sbjct: 142 VVFNEIDSLLQKICENASFNMLAKTNGKP-----QDSKDTTASKNEFRGIPTRLNNIPHS 196
Query: 58 REIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD 117
REIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFAD
Sbjct: 197 REIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFAD 256
Query: 118 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ 177
DGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD++
Sbjct: 257 DGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDKE 316
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 237
DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLKD+PGSSG+MQTMGRD RL+
Sbjct: 317 DDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKDMPGSSGVMQTMGRDMRLK 376
Query: 238 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RVD P+G E+YV+L F EKP
Sbjct: 377 YAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRVDEPNGQERYVLLEEFVEKP 436
Query: 298 SSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
+ D+I +AF + EK ASGFWGFLS +
Sbjct: 437 TPDEITNAFRPRKNENEKSASGFWGFLSGIL 467
>gi|293333213|ref|NP_001168735.1| uncharacterized protein LOC100382527 [Zea mays]
gi|223972711|gb|ACN30543.1| unknown [Zea mays]
Length = 599
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 225/329 (68%), Positives = 273/329 (82%), Gaps = 6/329 (1%)
Query: 1 MVFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQV-SSSQDNWKGIPTRLNNIPHSRE 59
+VF I+ LL + + T+A K R + + S++ W+GIPTRLNNIPHSRE
Sbjct: 276 VVFQAIDFLLQKI----CENTSASKLTKTNGKRPDSADTGSKNEWRGIPTRLNNIPHSRE 331
Query: 60 IRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDG 119
IR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFADDG
Sbjct: 332 IREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFADDG 391
Query: 120 KRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDD 179
KRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGAKGTF+ IG++GA +VD++DD
Sbjct: 392 KRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAKGTFINIGAVGARDVDKEDD 451
Query: 180 MFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYA 239
MF+L+APQNAVGNCIIDDM+AMTDAAG RPVIL+NPRLKD+P SSG+MQTMGRD RL+YA
Sbjct: 452 MFVLIAPQNAVGNCIIDDMRAMTDAAGDRPVILVNPRLKDMPASSGVMQTMGRDIRLKYA 511
Query: 240 ASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 299
ASFE CY FRLLYYAG+ YPIMGALRM YPN YE+Y+RVD +G E Y++++ FT KP+
Sbjct: 512 ASFETCYSFRLLYYAGSFYPIMGALRMAYPNKYEIYRRVDTLNGKENYILIAEFTNKPTP 571
Query: 300 DDINDAFLGKPRNREKKASGFWGFLSSVF 328
D+I AF G+ R EK +SGFWGFLS +F
Sbjct: 572 DEITSAFKGQKRE-EKASSGFWGFLSGLF 599
>gi|10257489|dbj|BAB11193.1| unnamed protein product [Arabidopsis thaliana]
Length = 337
Score = 439 bits (1130), Expect = e-121, Method: Compositional matrix adjust.
Identities = 213/330 (64%), Positives = 249/330 (75%), Gaps = 48/330 (14%)
Query: 30 RKSRSNQVSSSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINI 89
++ +++ S QD W+GIPTRLNNIPHSR+IR YFY+DVLQAT R++ DG TRL+V+INI
Sbjct: 8 KQVLADKASPVQDKWRGIPTRLNNIPHSRDIRAYFYEDVLQATIRSIKDGNTRLRVDINI 67
Query: 90 PELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 149
PELNPEMDVYRIGTLMELV+ +ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL
Sbjct: 68 PELNPEMDVYRIGTLMELVQALALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKIL 127
Query: 150 EFMDWGDYGAKGTFVEIGSI---------------------------------------- 169
E+MDWGD GTFV++G+I
Sbjct: 128 EYMDWGDDETLGTFVKLGAIVTRESLNLQKLHHQQNIVVSIIKLVMHNKVGYIELHIILL 187
Query: 170 --------GANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 221
G EVDE+DDMFILVAPQNAVGNCIIDD++AMT AAG RPV+LINPRLKDLP
Sbjct: 188 FDYVFAMTGGKEVDEEDDMFILVAPQNAVGNCIIDDLQAMTTAAGKRPVVLINPRLKDLP 247
Query: 222 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 281
SSGIMQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+
Sbjct: 248 ASSGIMQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEE 307
Query: 282 SGGEKYVILSTFTEKPSSDDINDAFLGKPR 311
+G EKYV+L+T+ E+P+ + I+DAF GK R
Sbjct: 308 NGKEKYVLLATYAERPTPEQIDDAFSGKSR 337
>gi|302761066|ref|XP_002963955.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
gi|300167684|gb|EFJ34288.1| hypothetical protein SELMODRAFT_270406 [Selaginella moellendorffii]
Length = 575
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/324 (58%), Positives = 236/324 (72%), Gaps = 19/324 (5%)
Query: 2 VFADIESLLSQAKREKMKKTTAV-KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
VF DI S L +++K T V KP QD+W+G+PT+LN+IPHSREI
Sbjct: 268 VFGDISSFLD----DRIKAATKVSKP-------------KQDSWRGMPTKLNSIPHSREI 310
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY +V + + AV DG +LKVEI IPELNPEMDVYR GTL+ELVR +A +FA+DGK
Sbjct: 311 REYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYRAGTLLELVRELAFTFANDGK 370
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQG MGEG GMPLQL+G+R +LE MDWGDYGAKGTFV IG+I A++V E DDM
Sbjct: 371 RVKVCVQGPMGEGIFRGMPLQLSGSRMMLEVMDWGDYGAKGTFVNIGAIAASDVSETDDM 430
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FIL+APQNAVGNCII+D++AM AAG RPVIL+NP+LKD+PG+ G+MQ +GR++R+E++
Sbjct: 431 FILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDVPGAGGVMQVIGREQRMEFSD 490
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SF CY FRLLY AGT YPIMGALRM YP YE++KR+D E Y+ +TF +P+
Sbjct: 491 SFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDTGYSSETYIQEATFPGRPTPT 550
Query: 301 DINDAFLGKPRNREKKASGFWGFL 324
++ D F G+ + K S W F
Sbjct: 551 ELQDVFAGRSKEASKTKS-IWSFF 573
>gi|302769091|ref|XP_002967965.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
gi|300164703|gb|EFJ31312.1| hypothetical protein SELMODRAFT_169443 [Selaginella moellendorffii]
Length = 575
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 188/324 (58%), Positives = 236/324 (72%), Gaps = 19/324 (5%)
Query: 2 VFADIESLLSQAKREKMKKTTAV-KPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREI 60
VF DI S L + +K T V KP QD+W+G+PT+LN+IPHSREI
Sbjct: 268 VFGDISSFLD----DHIKAATKVSKP-------------KQDSWRGMPTKLNSIPHSREI 310
Query: 61 RNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK 120
R YFY +V + + AV DG +LKVEI IPELNPEMDVYR GTL+ELVR +A +FA+DGK
Sbjct: 311 REYFYTEVCEGVRNAVEDGLHKLKVEITIPELNPEMDVYRAGTLLELVRELAFTFANDGK 370
Query: 121 RVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDM 180
RVKVCVQG MGEG GMPLQL+G+R++LE MDWGDYGAKGTFV IG+I A++V E DDM
Sbjct: 371 RVKVCVQGPMGEGIFRGMPLQLSGSRRMLEVMDWGDYGAKGTFVNIGAIAASDVFETDDM 430
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
FIL+APQNAVGNCII+D++AM AAG RPVIL+NP+LKD+PG+ G+MQ +GR++R+E++
Sbjct: 431 FILMAPQNAVGNCIIEDLQAMVQAAGDRPVILVNPQLKDVPGAGGVMQVIGREQRMEFSD 490
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
SF CY FRLLY AGT YPIMGALRM YP YE++KR+D E Y+ +TF +P+
Sbjct: 491 SFFVCYSFRLLYRAGTFYPIMGALRMKYPGPYEVHKRIDTGYSSETYIQEATFAGRPTPT 550
Query: 301 DINDAFLGKPRNREKKASGFWGFL 324
++ D F G+ + K S W F
Sbjct: 551 ELQDVFAGRSKEASKTKS-IWSFF 573
>gi|168018045|ref|XP_001761557.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687241|gb|EDQ73625.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 264
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 162/264 (61%), Positives = 211/264 (79%), Gaps = 1/264 (0%)
Query: 44 WKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGT 103
W+G+PT+LN PHSREIR YFY D QA + A++DG+ +LKV + IPELNPEMDVYRIGT
Sbjct: 1 WRGVPTKLNTCPHSREIRQYFYKDACQAAKNAIDDGKHKLKVLLTIPELNPEMDVYRIGT 60
Query: 104 LMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 163
L+EL+R +A +FA+DGKRV+VCVQG MGEG +GMPL L+GTR +LE MDWG+Y K TF
Sbjct: 61 LLELIRELAFTFANDGKRVRVCVQGPMGEGIFSGMPLALSGTRPMLERMDWGEYIEKDTF 120
Query: 164 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 223
+ G++GA EV + DDMFIL+APQNAVGNCIIDD+KAM +AAG RPVI++NP+LKD+P S
Sbjct: 121 IRFGAVGAKEVYDDDDMFILMAPQNAVGNCIIDDLKAMVEAAGDRPVIILNPKLKDIPAS 180
Query: 224 SGIMQTM-GRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPS 282
GIMQ + GR +R+E+A SF CY FRL+Y +GTQYPI+GA RM YP YE+Y+R D+
Sbjct: 181 GGIMQQVGGRQERMEFAESFFTCYNFRLMYTSGTQYPIVGAFRMAYPGPYEIYRRFDLDR 240
Query: 283 GGEKYVILSTFTEKPSSDDINDAF 306
G E Y ++++F +P+ +I+DA+
Sbjct: 241 GVEVYKLIASFDHEPTRSEISDAY 264
>gi|218200847|gb|EEC83274.1| hypothetical protein OsI_28627 [Oryza sativa Indica Group]
Length = 681
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/252 (64%), Positives = 200/252 (79%), Gaps = 16/252 (6%)
Query: 1 MVFADIESLLSQ----AKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF +I+SLL + A + KT KP S ++S++ ++GIPTRLNNIPH
Sbjct: 284 VVFNEIDSLLQKICENASSNMLAKTNG-KP-----QDSKDTTASKNEFRGIPTRLNNIPH 337
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFA
Sbjct: 338 SREIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFA 397
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE 176
DDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD+
Sbjct: 398 DDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDK 457
Query: 177 QDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP------GSSGIMQTM 230
+DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLK GS G ++
Sbjct: 458 EDDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKGCGVGVRGWGSRGGVKVN 517
Query: 231 GRDKRLEYAASF 242
G+++ ++YAA
Sbjct: 518 GKNRVVKYAADL 529
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 73/110 (66%), Positives = 88/110 (80%)
Query: 219 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 278
D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 572 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 631
Query: 279 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
D P+G E+YV+L F EKP+ D+I +AF + EK ASGFWGFLS +
Sbjct: 632 DEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 681
>gi|222640270|gb|EEE68402.1| hypothetical protein OsJ_26756 [Oryza sativa Japonica Group]
Length = 680
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 162/251 (64%), Positives = 197/251 (78%), Gaps = 14/251 (5%)
Query: 1 MVFADIESLLSQAKRE---KMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHS 57
+VF +I+SLL + M T KP S ++S++ ++GIPTRLNNIPHS
Sbjct: 283 VVFNEIDSLLQKICENASFNMLAKTNGKP-----QDSKDTTASKNEFRGIPTRLNNIPHS 337
Query: 58 REIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD 117
REIR YFY+DVL AT+ AV D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFAD
Sbjct: 338 REIRKYFYNDVLVATRHAVEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFAD 397
Query: 118 DGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ 177
DGKRVKVCVQGSMG+GA AG+PLQLAGTRKILE MDWG+YGAKGTF+ G++GA+EVD++
Sbjct: 398 DGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEIMDWGEYGAKGTFINFGAVGASEVDKE 457
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP------GSSGIMQTMG 231
DDMFIL+APQNAVGNCIIDDMKAMTDAAG RPVIL+NPRLK GS G + G
Sbjct: 458 DDMFILIAPQNAVGNCIIDDMKAMTDAAGDRPVILVNPRLKGCGVGVRGWGSRGGGKVNG 517
Query: 232 RDKRLEYAASF 242
+++ ++YAA
Sbjct: 518 KNRVVKYAADL 528
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 73/110 (66%), Positives = 88/110 (80%)
Query: 219 DLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRV 278
D+PGSSG+MQTMGRD RL+YAASFE CY FRLL+YAG+ YPIMGALRM YPN YE+Y+RV
Sbjct: 571 DMPGSSGVMQTMGRDMRLKYAASFETCYSFRLLFYAGSFYPIMGALRMAYPNKYEIYRRV 630
Query: 279 DVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGFLSSVF 328
D P+G E+YV+L F EKP+ D+I +AF + EK ASGFWGFLS +
Sbjct: 631 DEPNGQERYVLLEEFVEKPTPDEITNAFRPRKNENEKSASGFWGFLSGIL 680
>gi|384250870|gb|EIE24349.1| hypothetical protein COCSUDRAFT_65858 [Coccomyxa subellipsoidea
C-169]
Length = 291
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 126/278 (45%), Positives = 176/278 (63%), Gaps = 10/278 (3%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
IP RLNN+P SRE R +FY+DV +A ++AV L V++ +PE NPEMDVYRIGT++E
Sbjct: 18 IPQRLNNVPASRETRQFFYNDVCRAVEKAVAAEERLLSVKLTVPETNPEMDVYRIGTVLE 77
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 166
LVR + A DGKRVKVCVQ ++G+G G+PL L+G R+I+E MDWG+ FV
Sbjct: 78 LVRDLVTDLAADGKRVKVCVQQALGQGVFQGLPLSLSGVRRIMENMDWGEV---FEFVSF 134
Query: 167 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSS 224
G++G++ +D+ +++VAPQN VG+ I+ + M AA + +IL NP LKDLP +S
Sbjct: 135 GNVGSDHLDD-SAYYVIVAPQNVVGSTIMTKLLEMATAAVEQKKTLILFNPLLKDLPSAS 193
Query: 225 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNGYELYKRVDVPSG 283
G+M GR +R+E+A SF Y FRLLY + YPI GALR Y +E++KR D+
Sbjct: 194 GVMGVRGRRERMEFADSFIPAYHFRLLYSSSAMMYPIRGALRHVYGGPWEVFKREDMGKR 253
Query: 284 GEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 321
E+Y + +F +P + F +P KK SG W
Sbjct: 254 TEEYRFVGSFDSQPDGSLLTPLFQEQP---PKKNSGAW 288
>gi|413922034|gb|AFW61966.1| hypothetical protein ZEAMMB73_400224, partial [Zea mays]
Length = 446
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 114/174 (65%), Positives = 139/174 (79%), Gaps = 11/174 (6%)
Query: 1 MVFADIESLLSQ----AKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPH 56
+VF I+ LL + K+ KT +P + + S++ W+GIPTRLNNIPH
Sbjct: 276 VVFQAIDFLLQKICENTSASKLTKTNGKRP-------DSADTGSKNEWRGIPTRLNNIPH 328
Query: 57 SREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFA 116
SREIR YFY+DVL+AT+ A+ D +TRL+++INIPELNPEMDVYRIGTLMELVR ++LSFA
Sbjct: 329 SREIREYFYNDVLKATKLAIEDKKTRLQIDINIPELNPEMDVYRIGTLMELVRELSLSFA 388
Query: 117 DDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG 170
DDGKRVKVCVQGSMG+GA AG+PLQLAGTRKILEFMDWGDYGAKGTF+ IG++G
Sbjct: 389 DDGKRVKVCVQGSMGQGAFAGIPLQLAGTRKILEFMDWGDYGAKGTFINIGAVG 442
>gi|303283278|ref|XP_003060930.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457281|gb|EEH54580.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 354
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 113/233 (48%), Positives = 155/233 (66%), Gaps = 5/233 (2%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVN--DGRTRLKVEINIPELNPEMDVYRIGTL 104
IP LNN PH RE+R +FYDD + + RAV + TR+K ++ PELN DVYR+GTL
Sbjct: 85 IPIMLNNTPHPREVRRWFYDDCVTSVTRAVTGEEPMTRVKAKVEFPELNVNGDVYRVGTL 144
Query: 105 MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 164
+ELVR +A++ A DGKRV+VCVQGSMGEG G+PL L+G +IL+ MDWGD F+
Sbjct: 145 LELVRELAMTLAKDGKRVRVCVQGSMGEGVFQGLPLSLSGVARILDMMDWGD---ADEFI 201
Query: 165 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 224
G+I + +DD FIL+APQN VG ++ + AM +AAG RP+I+INP+L D+ +
Sbjct: 202 SRGAIDGDVPQPEDDFFILIAPQNIVGFAVLPRIIAMEEAAGDRPMIMINPKLDDIQSAG 261
Query: 225 GIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKR 277
+M GR +R EY +++ Y FRLLY +PI GA+R T+ +ELYK+
Sbjct: 262 NVMSIRGRGERREYVSTWREVYHFRLLYRKPYFFPIYGAMRYTHDGKWELYKK 314
>gi|307109260|gb|EFN57498.1| hypothetical protein CHLNCDRAFT_21254 [Chlorella variabilis]
Length = 396
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 124/314 (39%), Positives = 173/314 (55%), Gaps = 55/314 (17%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
IPTRLN IPH R R +FY + A +RA++ G TR+ IPELN E DVYR+GTL+E
Sbjct: 76 IPTRLNTIPHERTTRQHFYREATTAIKRAIDAGETRVLARCTIPELNTEFDVYRVGTLLE 135
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGT---- 162
LVR +A S A DGKRVKVCVQ ++G+G GMPL L G +I++ MDWG+ KG
Sbjct: 136 LVREVATSLAADGKRVKVCVQQALGQGVFQGMPLSLNGVLRIMKQMDWGESAEKGVSSAR 195
Query: 163 --------FVEIG----------------------------------------SIGANEV 174
V G ++GA EV
Sbjct: 196 GGVVGCLFCVHTGRSHEEGMCACSAWLVHSSPCPVLTSLTCHPATAADRILTGNLGAAEV 255
Query: 175 DEQDDMFILVAPQNAVGNCIIDDMKAMTDAA--GTRPVILINPRLKDLPGSSGIMQTMGR 232
E D F+L+APQN VG+ I+ + M AA +P++LINP+L D+ ++G+M GR
Sbjct: 256 -EDADAFVLIAPQNIVGHSIMPLLSEMVAAAEEAGKPMVLINPKLTDIQSAAGVMSVRGR 314
Query: 233 DKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 292
RL++ A+F Y FRLLY + +PIMGALR + +E+++RV++ G E+Y ++ +
Sbjct: 315 QDRLDFTATFGTAYHFRLLYKGVSMHPIMGALRHEHDGPWEVFRRVELGPGVEEYQLIGS 374
Query: 293 FTEKPSSDDINDAF 306
F +P+ I DAF
Sbjct: 375 FDREPTPPLITDAF 388
>gi|255078468|ref|XP_002502814.1| predicted protein [Micromonas sp. RCC299]
gi|226518080|gb|ACO64072.1| predicted protein [Micromonas sp. RCC299]
Length = 331
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 120/267 (44%), Positives = 164/267 (61%), Gaps = 8/267 (2%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDG--RTRLKVEINIPELNPEMDVYRIGTL 104
IP LNN PH RE+R +FYDD ++ AV +TR+K I PELN DVYRIGTL
Sbjct: 68 IPKMLNNTPHPREVRKWFYDDCRESVVAAVTGANPKTRVKAFIEFPELNVNGDVYRIGTL 127
Query: 105 MELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV 164
+ELVR +A++ A DGKRV+VCVQGSMG+G G+PL L+G +IL+ MDWGD F+
Sbjct: 128 LELVREVAMALAADGKRVRVCVQGSMGQGVFQGLPLSLSGVARILDMMDWGD---ADEFI 184
Query: 165 EIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSS 224
GSIG + D FIL++PQN VG ++ ++ M AAG RP+I+INP+L D+ +
Sbjct: 185 ARGSIGGDVPRPDDAYFILISPQNIVGYSVLPYLQEMEKAAGDRPIIMINPKLGDIQSAG 244
Query: 225 GIMQTMGRDKRLEYA-ASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSG 283
+M GR +R EY ++E Y FRLLY +PI GALR +ELYK+
Sbjct: 245 NVMSIRGRGERREYVEKTWEEVYHFRLLYKKPFFFPIYGALRYGTGGMWELYKKFGKME- 303
Query: 284 GEKYVILSTFTE-KPSSDDINDAFLGK 309
E++ +L T+ + +P + +I L K
Sbjct: 304 QEEWKLLRTYDDGEPGAAEITKRILAK 330
>gi|412991462|emb|CCO16307.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 117/278 (42%), Positives = 169/278 (60%), Gaps = 17/278 (6%)
Query: 46 GIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGR-TRLKVEINIPELNPEMDVYRIGTL 104
+PT LNNIPH+REIR +FYDD ++ RA+ D R R+KV PE+N E DV+R GTL
Sbjct: 77 ALPTLLNNIPHTREIRRFFYDDATESVLRAIKDDRNVRMKVFTEFPEMNVEGDVFRAGTL 136
Query: 105 MELVRVIALSFA----DDG----KRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGD 156
+E+VR +A A DG +RVK+CVQGSMG G G+PL L+G ++++ MDW +
Sbjct: 137 LEMVRTMATELARMERKDGSGGKQRVKLCVQGSMGTGVFQGLPLALSGISRLIDLMDWEE 196
Query: 157 YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPR 216
A+ + G+IG + + +D+FIL+ PQN VG I+ M+ M + AG RP+IL+NP+
Sbjct: 197 DVAER--ISNGAIGKDHAKDDEDVFILICPQNIVGYSILPYMEEMMEVAGNRPMILLNPK 254
Query: 217 LKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNG----- 271
L D+ + +M GR R+E+A S+E Y FRLLY +PI GALR +Y +
Sbjct: 255 LGDIQSAGNVMSIRGRQGRMEFANSWEEIYHFRLLYRKPFFFPIYGALRKSYSSAEEDDT 314
Query: 272 YELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 309
+ELYKR E Y ++ + ++P+ D + GK
Sbjct: 315 WELYKRFG-KMDDEYYQLMEVYKKEPNPDQMTKVIWGK 351
>gi|145349122|ref|XP_001418989.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579219|gb|ABO97282.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 290
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 112/263 (42%), Positives = 158/263 (60%), Gaps = 4/263 (1%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
+PT LNN+PH+R R +FY+D ++ AV R+K++ PELN DVYR+GT +E
Sbjct: 31 VPTLLNNVPHARATRRWFYEDATKSIVAAVTARERRIKIKSEFPELNVNGDVYRVGTTLE 90
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 166
LVR A + A DG RVK+CVQ SMG+G +PL L+G K+L MDW + G +G
Sbjct: 91 LVRETAAALARDGLRVKICVQRSMGQGVFQALPLSLSGVSKLLGMMDW-EPGTEGMIDPS 149
Query: 167 GSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGI 226
G IGA + E D FILV PQN VG ++ ++ M G +P+I+INPRL D+ + +
Sbjct: 150 GGIGAAD-GEGADCFILVQPQNIVGYSVLPYIEEMEAVVGDKPMIMINPRLDDVQSAGNV 208
Query: 227 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGGE 285
M GR +R++ A+++ Y FRLLY +PI GALR Y N +ELYKR E
Sbjct: 209 MSVRGRKERMDAVAAWKEVYHFRLLYRKPYFFPIYGALRFAYYENEWELYKRTGARE-LE 267
Query: 286 KYVILSTFTEKPSSDDINDAFLG 308
+YV+L T+ +P+ D++ G
Sbjct: 268 QYVLLGTYPSEPTPDEMTKKIWG 290
>gi|428174173|gb|EKX43070.1| hypothetical protein GUITHDRAFT_110795 [Guillardia theta CCMP2712]
Length = 584
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 119/273 (43%), Positives = 159/273 (58%), Gaps = 26/273 (9%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAVNDGR------TRLKVEINIPELNPEMDVYRIGTL 104
LN IP + EIR +FY DV A Q A+ D R RLK+E PELNPEMD +R+GT+
Sbjct: 322 LNEIPCNTEIREWFYRDVCDAVQVALQDSRFLESCAGRLKIECVPPELNPEMDSFRVGTM 381
Query: 105 MELVRVIALSFADD-GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTF 163
+E+VR IA+ G+RVKVC+QGS+GEG GMPL L G R+++E MDW G
Sbjct: 382 LEMVREIAIRVGVVMGRRVKVCIQGSLGEGIFTGMPLMLNGMRRVMESMDW--QARPGEM 439
Query: 164 VEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPG 222
E I ++FI++APQ+ VG + + + M +AAG R VILINPRLKD P
Sbjct: 440 YEGLLI---------NVFIIIAPQSMVGASVYEPLSQMVEAAGPKRTVILINPRLKDRPS 490
Query: 223 SSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT-QYPIMGALRMTYPNG--YELYKRVD 279
S +M GR +R+ +AASF + FRLLY T +PI GALRM N + L+KR
Sbjct: 491 SGNVMSVQGRSQRIAFAASFREIFHFRLLYTGTTFMFPIQGALRMNMANSPFWTLFKRET 550
Query: 280 VPS----GGEKYVILSTFTEKPSSDDINDAFLG 308
+ GE Y ++ F E+P ++ I + G
Sbjct: 551 TTTPDERTGESYQPIAVFAEEPKTEAITNMMNG 583
>gi|219124197|ref|XP_002182396.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406357|gb|EEC46297.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 406
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 113/284 (39%), Positives = 166/284 (58%), Gaps = 28/284 (9%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAV---NDGRT--RLKVEINIPELNPEMDVYRIGTLM 105
LN +PH+R +R+YFY+ A AV ++G+ R+ + PE+NP MD YRIGT++
Sbjct: 121 LNQVPHNRYVRDYFYEQAALAVHDAVVLCSEGKCINRMLITSQFPEMNPSMDSYRIGTIL 180
Query: 106 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 159
E+VR I + A++ RV++CVQGSMG G GMP QL G KI++ MDW + G
Sbjct: 181 EMVRTIGIKLAEENLRVRICVQGSMGVGIFTGMPKQLNGVSKIIQMMDWQSGEGELNEGM 240
Query: 160 KGTFVEIGSIGA------------NEVDEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 205
G ++ G++G N V QDD+FIL+APQ+ VG + I+ ++ M +AA
Sbjct: 241 VGDYIRFGAVGPEHVLNEEKDKDDNVVQYQDDVFILIAPQSMVGTDSSIMPLLQGMVEAA 300
Query: 206 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 264
G RPVIL+NP L D ++G GR +R+++A SF+ Y F+ +Y +GT Y PI+GA+
Sbjct: 301 GNRPVILMNPDLTDKVSAAGQQSVRGRQQRIDFAESFQTVYHFQNIYISGTSYFPILGAI 360
Query: 265 RMTYPNGYEL-YKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 306
+P L ++R D G GE YV + P ++I DAF
Sbjct: 361 TKLHPKEPWLAHQRRDYADGEGEIYVPVLAGEVIPKGEEILDAF 404
>gi|299473139|emb|CBN78715.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 442
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 107/269 (39%), Positives = 162/269 (60%), Gaps = 9/269 (3%)
Query: 43 NWKGIPTR--LNNIPHSREIRNYFYDDVLQATQRAVNDGRT--RLKVEINIPELNPEMDV 98
++ G+P LNN+PHS+ RN+ Y +A AV D R+K+ +PELNP MD
Sbjct: 167 SYGGLPVNMLLNNVPHSKMARNFIYRGAAEALVLAVEDPTVPRRMKLVCTVPELNPAMDT 226
Query: 99 YRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYG 158
YR+GT++ELVR +A S AD G ++ +Q SMG+G +PL L+G I++ MDW D G
Sbjct: 227 YRVGTVLELVRELAYSMADVGLNTRILIQPSMGKGVFKSLPLALSGVMSIMQGMDWED-G 285
Query: 159 AKGTFVEIGSIG-ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 217
G+ V G +G ++V +DD+ ++++PQ+ G I + + A RPV++INP L
Sbjct: 286 LVGSHVNFGQLGNKDDVSAEDDVILVISPQSIAGYSIHPLLVDTVEHANGRPVVIINPNL 345
Query: 218 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQ-YPIMGALRMTYPN-GYELY 275
KD P S G+MQ GR +R+ +A SF++ Y FRLLY + +PI GA+ P+ Y ++
Sbjct: 346 KDRPSSGGVMQVRGRGERIAFAESFKDIYNFRLLYGSSVAFFPIKGAVMKAGPSEPYVVF 405
Query: 276 KRVDV-PSGGEKYVILSTFTEKPSSDDIN 303
R + S GE+YV L TF + P +++I
Sbjct: 406 DRRNTGKSKGEEYVPLGTFDDDPDAENIT 434
>gi|397643725|gb|EJK76041.1| hypothetical protein THAOC_02216 [Thalassiosira oceanica]
Length = 1862
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 115/284 (40%), Positives = 159/284 (55%), Gaps = 28/284 (9%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAV---NDG--RTRLKVEINIPELNPEMDVYRIGTLM 105
LN +PH+R +R YFY+ A AV + G R+K+++ PE+NP+MD YRIGTL+
Sbjct: 1576 LNQVPHNRYVRQYFYEMAADAALEAVVLCSAGFISNRMKLQVMFPEMNPQMDSYRIGTLL 1635
Query: 106 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG------DYGA 159
EL R IA+ FA+ RV+VCVQGS+G+G +P QL G IL+ MDW + G
Sbjct: 1636 ELTRTIAIRFAEQNLRVRVCVQGSIGQGIFTAIPKQLNGVATILKRMDWQSGEGEENEGM 1695
Query: 160 KGTFVEIGSIGANEV------------DEQDDMFILVAPQNAVG--NCIIDDMKAMTDAA 205
G FV G IG V +QDD+FIL+ PQN G + II + MT+A
Sbjct: 1696 VGDFVNFGHIGKEHVVNFKEGEGGEADTQQDDVFILICPQNMQGIESSIIPYLLEMTEAV 1755
Query: 206 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 264
G RP+ILINP L D S G GR R+++A SF+ + F+ +YY+GT Y PI+GA
Sbjct: 1756 GDRPMILINPDLVDKASSQGQQSVRGRQDRIDFANSFKTIFQFQNIYYSGTSYFPILGAS 1815
Query: 265 RMTYPNG-YELYKRVDVPSG-GEKYVILSTFTEKPSSDDINDAF 306
+ P + ++R D+ G GE YV + P S+ + F
Sbjct: 1816 TLLGPEEMWVSHQRRDLKGGKGEVYVPVYASESMPDSEVLMSTF 1859
>gi|308806305|ref|XP_003080464.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
gi|116058924|emb|CAL54631.1| putative Adenylate kinase, chloroplast (ISS) [Ostreococcus tauri]
Length = 284
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 106/264 (40%), Positives = 153/264 (57%), Gaps = 16/264 (6%)
Query: 47 IPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
IPT LNN+PH+RE R +FYDD +T AV G R+K++ P + P
Sbjct: 35 IPTLLNNVPHARETRRWFYDDACASTLAAVGAGERRMKIKTEFPGVEP------------ 82
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEI 166
L R +A + A DGKRVK+CVQ SMG+G + L L+G K+L MD+ + A+
Sbjct: 83 LTRAMAQTLARDGKRVKICVQRSMGQGVFQALTLSLSGVSKLLGMMDFEEDVAE-MIDPN 141
Query: 167 GSIGANE-VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSG 225
G IGA + ++ D +ILV PQN VG ++ + AM +A G +P+I+INP+L D+ +
Sbjct: 142 GGIGAEDGAEDGADAYILVQPQNIVGYSVLPYIAAMEEAVGDKPMIMINPKLDDIQSAGN 201
Query: 226 IMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMT-YPNGYELYKRVDVPSGG 284
+M GR +R++ A++ Y FRLLY +PI GALR T Y N +ELYKRV +
Sbjct: 202 VMSIRGRQERMDAVAAWREIYHFRLLYRKPYFFPIYGALRYTYYENAWELYKRVGMRE-E 260
Query: 285 EKYVILSTFTEKPSSDDINDAFLG 308
E+YV+ ST+ +P D++ G
Sbjct: 261 EEYVLRSTYDSEPDPDEMTKQIWG 284
>gi|223997276|ref|XP_002288311.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
gi|220975419|gb|EED93747.1| adenylate-kinase [Thalassiosira pseudonana CCMP1335]
Length = 231
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 141/230 (61%), Gaps = 16/230 (6%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRA---VNDGR--TRLKVEINIPELNPEMDVYRIGTLM 105
LN +PH+R +R YFYD QA A ++G+ R+K+ + PE+NP MD YRIGTL+
Sbjct: 1 LNQVPHNRFVRKYFYDMASQAALDAAVLCSEGKLTNRMKMTVMFPEMNPSMDSYRIGTLL 60
Query: 106 ELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDW------GDYGA 159
EL R IA+ A+ RV+VCVQGSMG G G+P QL G +L+ MDW + G
Sbjct: 61 ELARSIAIKLAEQNLRVRVCVQGSMGVGIFTGIPKQLNGVSTLLQRMDWESEVGEENEGM 120
Query: 160 KGTFVEIGSIGA--NEVDEQDDMFILVAPQNAVG-NC-IIDDMKAMTDAAGTRPVILINP 215
G +V G +G N +QDD+F+++ PQN +G C II + M +AAG RPVI++NP
Sbjct: 121 VGDYVNFGFVGKEHNSAVQQDDVFLIICPQNMLGLECSIIQPLSEMVEAAGDRPVIILNP 180
Query: 216 RLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQY-PIMGAL 264
L D S G GR +R+++A SFE + F+ +Y +GT Y PI+G++
Sbjct: 181 DLTDKISSQGQQSIRGRQQRIDFAESFETIFQFQNIYTSGTSYFPILGSV 230
>gi|62320102|dbj|BAD94280.1| adenylate kinase -like protein [Arabidopsis thaliana]
Length = 95
Score = 143 bits (361), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 80/95 (84%)
Query: 227 MQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEK 286
MQTMGR++RLEYA +F+NCY FRLLYY GTQYPIMGALRM+YP YELYKRV+ +G EK
Sbjct: 1 MQTMGREQRLEYALTFDNCYVFRLLYYLGTQYPIMGALRMSYPYRYELYKRVNEENGKEK 60
Query: 287 YVILSTFTEKPSSDDINDAFLGKPRNREKKASGFW 321
YV+L+T+ E+P+ + I+DAF GK R++ KKASG W
Sbjct: 61 YVLLATYAERPTPEQIDDAFSGKSRDQSKKASGIW 95
>gi|147768155|emb|CAN78322.1| hypothetical protein VITISV_001685 [Vitis vinifera]
Length = 228
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 64/84 (76%)
Query: 2 VFADIESLLSQAKREKMKKTTAVKPVHERKSRSNQVSSSQDNWKGIPTRLNNIPHSREIR 61
VF I+SLLS+ +++K K + + SN SSSQ+ W+GIPTRLNNIPHSREIR
Sbjct: 144 VFKAIDSLLSKXQQDKENMMKLEKSIRASERLSNLASSSQEXWRGIPTRLNNIPHSREIR 203
Query: 62 NYFYDDVLQATQRAVNDGRTRLKV 85
YFYDDVLQATQRAVNDGRTRLKV
Sbjct: 204 TYFYDDVLQATQRAVNDGRTRLKV 227
>gi|425442993|ref|ZP_18823225.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389715818|emb|CCH99873.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 247
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL FA
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFAS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + ++ E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS+ ND+ P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDS--SNPANKAKKSGLF 235
>gi|166367819|ref|YP_001660092.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|425464571|ref|ZP_18843881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
gi|166090192|dbj|BAG04900.1| hypothetical protein MAE_50780 [Microcystis aeruginosa NIES-843]
gi|389833386|emb|CCI22146.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
Length = 247
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGDRRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + ++ E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS+ ND+ P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDS--SNPANKAKKSGLF 235
>gi|422304988|ref|ZP_16392325.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
gi|389789763|emb|CCI14274.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
Length = 247
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 71/271 (26%), Positives = 123/271 (45%), Gaps = 52/271 (19%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL FA
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFA-- 50
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 171
S+ +G +G+ + T +L DWG+ T ++G +G+
Sbjct: 51 ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPIE 95
Query: 172 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 231
N++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S +
Sbjct: 96 NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149
Query: 232 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 289
R R + ++ E+CY+FR L A YP + + + +GYEL GE
Sbjct: 150 RQLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA--- 206
Query: 290 LSTFTEKPSSDDINDAFLGKPRNREKKASGF 320
L K SS++ D P N+ KK+ F
Sbjct: 207 LENLILKASSNNPKDG--SNPANKAKKSGLF 235
>gi|427731149|ref|YP_007077386.1| hypothetical protein Nos7524_4017 [Nostoc sp. PCC 7524]
gi|427367068|gb|AFY49789.1| protein of unknown function (DUF1995) [Nostoc sp. PCC 7524]
Length = 246
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 116/252 (46%), Gaps = 52/252 (20%)
Query: 59 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N D + QA T+ A+ DG TRL+VE+ PEL M + F
Sbjct: 3 ELPNTLEDAIAQAREATKAALADGYTRLQVELLFPELK----------FMPVAEQFLPLF 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 173
++ R+K+ G ALA R+ DW D + + G I + ++
Sbjct: 53 SEYESRLKIFF-ADAGASALA---------RR-----DWADVPFQILDIGTGRIASIQSK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 233
V +D++F+ VAP + + ++ + + G RP++++NPRL+D PG+ GI R
Sbjct: 98 VQPEDEIFLFVAPTSVE----VPQVEKICEIIGDRPLVMLNPRLED-PGTVGIGYA-ARQ 151
Query: 234 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
R + + E+CY+ R P+ A+ YP +EL+ V+ +K+ L
Sbjct: 152 TRQRFISKIESCYYLR---------PVDDETAVFRCYPGLWELW--VENSGTWQKFAELP 200
Query: 292 TFTEKPSSDDIN 303
+KPS D+I+
Sbjct: 201 ---KKPSGDEID 209
>gi|354566056|ref|ZP_08985229.1| protein of unknown function DUF1995-containing protein [Fischerella
sp. JSC-11]
gi|353546564|gb|EHC16012.1| protein of unknown function DUF1995-containing protein [Fischerella
sp. JSC-11]
Length = 250
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 109/246 (44%), Gaps = 46/246 (18%)
Query: 76 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 135
+ DG TRL+VE+ PEL LME+ + F ++KV + G G
Sbjct: 23 IADGYTRLQVELLFPELK----------LMEVAQQFLPMFEQYNDKLKVFFADAGGAG-- 70
Query: 136 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNC 193
L DW D K T + G + E V +D++F+ +AP G+
Sbjct: 71 -------------LARRDWTDVRFKITDIGTGRAASIESKVQPEDEIFLFIAP----GSP 113
Query: 194 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ ++ + G RP++++NP L+D G GI R R+ + ++ E+CY+ R
Sbjct: 114 EVPQLEKICAVIGDRPLVMLNPFLED-AGKIGIGYA-ARQTRIHFLSTIESCYYLR---- 167
Query: 254 AGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGKPRNR 313
+ YP + R YP +E+ V + GE Y ++ +KPS DD+ D L K +
Sbjct: 168 --SVYPDVALFR-CYPGLWEIL----VENNGE-YQKIAELPQKPSGDDL-DIILAKGQAA 218
Query: 314 EKKASG 319
A G
Sbjct: 219 TAIAEG 224
>gi|425435402|ref|ZP_18815857.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389680066|emb|CCH91215.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 247
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 72/271 (26%), Positives = 121/271 (44%), Gaps = 52/271 (19%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFT-- 50
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVEIGSIGA------ 171
S+ +G +G+ + T +L DWG+ T ++G +G+
Sbjct: 51 ----------SIFDGYGSGLRVIFPDTGAAMLARRDWGE-----TVFQLGDLGSRFIPVE 95
Query: 172 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 231
N++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S +
Sbjct: 96 NKIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAA 149
Query: 232 RDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVI 289
R R + + E+CY+FR L A YP M + + +GYEL GE
Sbjct: 150 RQLRERFLSILESCYYFRPLESAIVYRSYPSMWQVWLEKEDGYELISEQSTKPMGEA--- 206
Query: 290 LSTFTEKPSSDDINDAFLGKPRNREKKASGF 320
L K SS+ ND P N+ KK+ F
Sbjct: 207 LENLILKASSNTPNDG--SNPANKAKKSGLF 235
>gi|434398468|ref|YP_007132472.1| protein of unknown function DUF1995-containing protein [Stanieria
cyanosphaera PCC 7437]
gi|428269565|gb|AFZ35506.1| protein of unknown function DUF1995-containing protein [Stanieria
cyanosphaera PCC 7437]
Length = 247
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 114/260 (43%), Gaps = 59/260 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT A+ DG R++VE+ IPE+ + V L A FA+ G +KV S
Sbjct: 17 EATLAALADGYNRVQVELVIPEITLQAQV--------LALEFATIFAEYGSGLKVMFPDS 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 182
G ALA DWG D G++ T VE +V EQD +FI
Sbjct: 69 -GAAALAK--------------RDWGEVPFQVTDLGSRYTAVE------TQVSEQDQIFI 107
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
+VAP ++ ++ + + A RPV+L+ P+L+D+ GI T R R + S
Sbjct: 108 VVAP----AAIEVEKVEKLCNVAEDRPVVLLIPQLEDI-AFVGIGYT-ARQMRERFIKSL 161
Query: 243 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 302
+CY+ R L A A+ +P+ ++++ + E+Y + +P +D+
Sbjct: 162 YSCYYLRPLEEA--------AILRCHPSSWQIWLEKE-----EEYQLAMELAHQPMGEDL 208
Query: 303 NDAFLG----KPRNREKKAS 318
+ P N+ + AS
Sbjct: 209 ERLLMKLTNPNPENKNEVAS 228
>gi|425449243|ref|ZP_18829085.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|389764162|emb|CCI09454.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 247
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 121/270 (44%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D++F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEVFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|425446838|ref|ZP_18826837.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389732775|emb|CCI03345.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 247
Score = 70.9 bits (172), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 119/270 (44%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G RL+VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRLQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS++ D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPKDS--SNPANKAKKSGLF 235
>gi|425459538|ref|ZP_18839024.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389822715|emb|CCI29585.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 247
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 118/270 (43%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ S + R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV--SVVGIGYAAR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS+ ND P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDG--SNPANKAKKSGLF 235
>gi|428781510|ref|YP_007173296.1| hypothetical protein Dacsa_3447 [Dactylococcopsis salina PCC 8305]
gi|428695789|gb|AFZ51939.1| protein of unknown function (DUF1995) [Dactylococcopsis salina PCC
8305]
Length = 253
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 116/235 (49%), Gaps = 41/235 (17%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD-GKRVKVCVQG 128
Q+T A+ G TRL+VE+ +PE+ + + + + FADD G +KV
Sbjct: 17 QSTVAALEAGLTRLQVELKVPEIALKAE--------SIAKEFTDLFADDYGSGLKVLFPD 68
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQN 188
S G ALA +EF + D G++ T +E +V E+D +F++V+P +
Sbjct: 69 S-GAAALARR------NWSDVEF-NVNDLGSRNTPIE------KKVAEEDQIFLVVSP-S 113
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
AV + ++ + + AG RPVIL+ P+L+D+ + GI R R + ++ E+C++
Sbjct: 114 AVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLSTLESCFYL 168
Query: 249 RLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ L A AL YP+G++L+ + G +Y EKP DD++
Sbjct: 169 QPLDEA--------ALLKRYPSGWQLW----IEKGENQYEFFCEEVEKPVGDDLD 211
>gi|440753103|ref|ZP_20932306.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
gi|440177596|gb|ELP56869.1| hypothetical protein O53_1480 [Microcystis aeruginosa TAIHU98]
Length = 247
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|22299764|ref|NP_683011.1| hypothetical protein tlr2221 [Thermosynechococcus elongatus BP-1]
gi|22295948|dbj|BAC09773.1| tlr2221 [Thermosynechococcus elongatus BP-1]
Length = 232
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 59/255 (23%)
Query: 79 GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 138
G+ RL++EI +P+LNP M L FA+ G +KV G ALA
Sbjct: 27 GKNRLQIEILLPDLNP----------MGLAAGYLPLFAELGSDLKVFF-ADAGSAALA-- 73
Query: 139 PLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIID 196
R+ +WG+ Y +G + I A+ D ++VAP +
Sbjct: 74 -------RR-----EWGETAYSVRGVNERLTPIEAS-----DRAIVIVAPTPVE----VT 112
Query: 197 DMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGT 256
++ M AG RP IL+NPRL+D+ GI GR R + + E CY+ R L A T
Sbjct: 113 AIEQMCLTAGDRPFILLNPRLQDV-AVVGIGYA-GRQLRERFLNTLEPCYYLRPL--AET 168
Query: 257 QYPIMGALRMTYPNGYELYK-RVDVPSGGEKYVILSTFTEKPSSDDINDAF--LGKPRNR 313
L YP +++++ R P+ L+ F ++P+S+DI A LGKPR
Sbjct: 169 VI-----LWRCYPQAWQIWQYRETAPT------CLAEFEQRPTSEDIERALSALGKPR-- 215
Query: 314 EKKASGFWGFLSSVF 328
A F LS++
Sbjct: 216 ---AQSFLSRLSALL 227
>gi|425472740|ref|ZP_18851581.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389881081|emb|CCI38316.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 247
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 118/270 (43%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D++F++V P + I+ ++ + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEVFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS+ ND P N+ KK+ F
Sbjct: 208 ENLILKASSNTPNDG--SNPANKAKKSGLF 235
>gi|282899475|ref|ZP_06307441.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281195632|gb|EFA70563.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 245
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 107/247 (43%), Gaps = 50/247 (20%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
A + A++DG+TR++VE+ PEL M + F + R+KV +
Sbjct: 21 AVRSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESRLKVFFADAG 70
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 188
DWGD K + G ++E + +D++F+ +AP +
Sbjct: 71 AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
+ ++ + G RP +++NPRL+D S + R+ R + ++ E+CY+
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED--SSVVGIGYAARETRRRFISTIESCYYL 169
Query: 249 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
R PI AL +YP +E++ D GE Y ++ +KPS D+I+
Sbjct: 170 R---------PIDEQSALMRSYPGNWEIWLETD----GE-YQKIAELPQKPSGDEIDSIL 215
Query: 307 L-GKPRN 312
+ G+P
Sbjct: 216 IKGQPET 222
>gi|428311732|ref|YP_007122709.1| hypothetical protein Mic7113_3579 [Microcoleus sp. PCC 7113]
gi|428253344|gb|AFZ19303.1| protein of unknown function (DUF1995) [Microcoleus sp. PCC 7113]
Length = 249
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 115/236 (48%), Gaps = 45/236 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+NDGRTRL+VE+ PE+ + I + I L F + G +KV +
Sbjct: 17 EATQSALNDGRTRLQVELVFPEIA--LQAQSIAQ-----QFIPL-FEEYGSGLKVLFPDT 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQN 188
G ALA R+ DWG+ K + V S N++ +D F+LVAP +
Sbjct: 69 -GAAALA---------RR-----DWGEVPFKISDVGTSRSPITNKIQAEDKAFLLVAP-S 112
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
AV + ++ + + AG RP +L+NP+L+D+ S + R R ++ E CY+
Sbjct: 113 AVE---VAQVETLCNLAGDRPCVLLNPQLEDI--SIVGIGMAARKLRERLLSTIEPCYYL 167
Query: 249 RLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
R PI A+ +YP ++++ +D ++Y +++ ++P+ D +
Sbjct: 168 R---------PIDPAAILRSYPGLWQVWLEID-----DEYQLIAEEPQRPAGDALE 209
>gi|443658098|ref|ZP_21132025.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
gi|159027702|emb|CAO89569.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443333038|gb|ELS47616.1| hypothetical protein C789_2565 [Microcystis aeruginosa DIANCHI905]
Length = 247
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 120/270 (44%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARRIQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ ++D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKDEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|425455386|ref|ZP_18835106.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389803734|emb|CCI17368.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 247
Score = 68.2 bits (165), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 119/270 (44%), Gaps = 50/270 (18%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G R++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLAIESGARRVQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------LFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPVEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCYLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
R + + E+CY+FR L A YP + + + +GYEL GE L
Sbjct: 151 QLRERFLSILESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGEA---L 207
Query: 291 STFTEKPSSDDINDAFLGKPRNREKKASGF 320
K SS++ +D+ P N+ KK+ F
Sbjct: 208 ENLILKASSNNPDDS--SNPANKAKKSGLF 235
>gi|443475522|ref|ZP_21065469.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena biceps PCC 7429]
gi|443019641|gb|ELS33702.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena biceps PCC 7429]
Length = 236
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 113/268 (42%), Gaps = 60/268 (22%)
Query: 66 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 125
+ + AT +A+ G TR+ ++ PEL M + A SF
Sbjct: 19 EQAIAATHQAIASGATRVLADLRFPELKA----------MPIAYEFAASFNQQYGEAWQA 68
Query: 126 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 181
+ G ALA RK W D + + G NE + E+D F
Sbjct: 69 IFSDAGAAALA--------KRK------WVD-------LNVSVRGVNEGRRAIREEDRAF 107
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+L+ P + ++ ++ + AG RP +++NPRL++ G+ RD+ + ++
Sbjct: 108 LLIEP----TSIEVEQVEKLVQLAGDRPFVMLNPRLENSEVGLGLAARQMRDR---FLST 160
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
FE Y+ + L +GAL YP ++++ + + G +K L+ ++PSSDD
Sbjct: 161 FETAYYIKPLE--------LGALWRCYPQTWQVWNQTE--EGMQK---LAEVEQRPSSDD 207
Query: 302 INDAFLGKPRNREKKASG-FWGFLSSVF 328
I+ F R + K SG F G L F
Sbjct: 208 IDRLF----RQKTGKQSGSFLGRLQQFF 231
>gi|186682862|ref|YP_001866058.1| hypothetical protein Npun_R2561 [Nostoc punctiforme PCC 73102]
gi|186465314|gb|ACC81115.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 244
Score = 67.4 bits (163), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 111/269 (41%), Gaps = 53/269 (19%)
Query: 59 EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N D + Q A Q A+ DG TR++VE PEL M + F
Sbjct: 3 ELPNTLEDAIAQSRTAVQAALADGCTRIQVEFLFPELK----------FMPVAEQFLPLF 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 173
+ R+K+ + R+ +W D + + G + +
Sbjct: 53 TEYDSRLKIFFADAGAAAL----------ARR-----NWTDAPFQILDIGTGRTASLQTK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 233
+ +D++F+ +AP + + ++ + G RPV+ +NPRL+D G+ GI R
Sbjct: 98 IQPEDEIFLFIAPTSVE----VPQVEKLCQEIGDRPVVFLNPRLED-SGTVGIGYA-ARQ 151
Query: 234 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
RL + + E+CY+ R PI AL YP +E++ D GE Y ++
Sbjct: 152 TRLRFTNTIESCYYLR---------PIDEQSALSRCYPGQWEVWLETD----GE-YQRIA 197
Query: 292 TFTEKPSSDDINDAFL-GKPRNREKKASG 319
KPS DD++ L G+P+ A
Sbjct: 198 ELPTKPSGDDLDQILLKGQPQTTTDAAPA 226
>gi|427418343|ref|ZP_18908526.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
gi|425761056|gb|EKV01909.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 7375]
Length = 237
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/269 (26%), Positives = 123/269 (45%), Gaps = 54/269 (20%)
Query: 51 LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV 110
++++P S E + + +T+ A+ DG RL+VE+ IPEL + R TL
Sbjct: 1 MSDLPSSLE---ETIEQAIASTKAAIEDGIIRLQVEMVIPELKQQPIAERFLTL------ 51
Query: 111 IALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGS 168
F D G + KV + G ALA R+ DW D+ +G I
Sbjct: 52 ----FQDLGLQFKVYFPDA-GAAALA---------RR-----DWDNPDFTIRG----INE 88
Query: 169 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 228
I A + D+ F++V P +AV + D++ + + A RPV+L+NP+L+D+ GI
Sbjct: 89 IQAA-ITPDDEAFLIVNP-SAVE---VQDVEKLCNEAQDRPVVLLNPQLEDV-SIVGIGY 142
Query: 229 TMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYV 288
R R + ++ +CY++R + + A+ +P G+++++ V G Y
Sbjct: 143 A-ARQLRERFLSTLTSCYYYRPMTDS--------AVLRRHPQGWQVWQDV-----GNDYE 188
Query: 289 ILSTFTEKPSSDDINDAFLGKPRNREKKA 317
+ S E+PSS+ + G K A
Sbjct: 189 LKSELPERPSSEALEKILYGSEDTEGKAA 217
>gi|428225033|ref|YP_007109130.1| hypothetical protein GEI7407_1587 [Geitlerinema sp. PCC 7407]
gi|427984934|gb|AFY66078.1| protein of unknown function DUF1995 [Geitlerinema sp. PCC 7407]
Length = 244
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 124/252 (49%), Gaps = 53/252 (21%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF-ADDGKRVKVCVQGS 129
ATQ A++DG +RL+VE+ PE++ + + + + A F A G R K+ +
Sbjct: 18 ATQAALDDGHSRLQVELVFPEIDLQA--------ISIAELFAADFEARYGDRFKILFPDA 69
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGAN-EVDEQDDMFILVAP 186
GA A LA R WG+ Y +G I + A + DEQ F+++AP
Sbjct: 70 ---GAAA-----LAKHR-------WGEKPYEIRG----ISELKAQIQPDEQ--AFLMIAP 108
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+ + ++ + A RPV+++NPRL+D+ + GI GR R + ++ +CY
Sbjct: 109 ----SSVEVGPVEKFCEEASDRPVVMVNPRLEDV-ATIGIGYA-GRQLRERFLSTLLSCY 162
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
+ R + G GALR +YP +E++ ++ SG EK ++ ++KP D + D
Sbjct: 163 YLR--PFEG------GALRRSYPGPWEVW--LETESGYEK---VAEESQKPVGDAL-DQI 208
Query: 307 LGKPRNREKKAS 318
+G+ + E + S
Sbjct: 209 IGRVQGAETEGS 220
>gi|282896506|ref|ZP_06304526.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198612|gb|EFA73493.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 245
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 106/247 (42%), Gaps = 50/247 (20%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
A + A++DG+TR++VE+ PEL M + F + +KV +
Sbjct: 21 AAKSALSDGKTRIQVELLFPELQ----------FMPVAEQFLPLFTEYESGLKVFFADAG 70
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQN 188
DWGD K + G ++E + +D++F+ +AP +
Sbjct: 71 AAALAR---------------RDWGDIPFKIMDIGTGRAASSESKIQPEDEIFLFIAPTS 115
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
+ ++ + G RP +++NPRL+D S + R+ R + ++ E+CY+
Sbjct: 116 VE----VAQLEKLCQIIGERPFVMLNPRLED--SSVVGIGYAARETRRRFISTIESCYYL 169
Query: 249 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
R PI AL +YP +E++ D GE Y ++ +KPS D+I+
Sbjct: 170 R---------PIDEQSALMRSYPGNWEIWLETD----GE-YRKIAELPQKPSGDEIDSIL 215
Query: 307 L-GKPRN 312
+ G+P
Sbjct: 216 IKGQPET 222
>gi|16329818|ref|NP_440546.1| hypothetical protein slr1702 [Synechocystis sp. PCC 6803]
gi|383321560|ref|YP_005382413.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383324730|ref|YP_005385583.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383490614|ref|YP_005408290.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384435880|ref|YP_005650604.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|451813978|ref|YP_007450430.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
gi|1652303|dbj|BAA17226.1| slr1702 [Synechocystis sp. PCC 6803]
gi|339272912|dbj|BAK49399.1| hypothetical protein SYNGTS_0651 [Synechocystis sp. PCC 6803]
gi|359270879|dbj|BAL28398.1| hypothetical protein SYNGTI_0651 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359274050|dbj|BAL31568.1| hypothetical protein SYNPCCN_0651 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359277220|dbj|BAL34737.1| hypothetical protein SYNPCCP_0651 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957706|dbj|BAM50946.1| hypothetical protein BEST7613_2015 [Synechocystis sp. PCC 6803]
gi|451779947|gb|AGF50916.1| hypothetical protein MYO_16580 [Synechocystis sp. PCC 6803]
Length = 251
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 119/244 (48%), Gaps = 55/244 (22%)
Query: 67 DVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCV 126
+ + ATQRA++DG R++VE+ +PE+ + +E ++ FAD G +++
Sbjct: 15 EAIAATQRALDDGYRRIQVELAVPEIALQAQ----AIALEFSQL----FADQGLGLRIIF 66
Query: 127 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------NEVDEQDDM 180
+ G ALA DWG+ +IG +G+ ++ E D +
Sbjct: 67 PDT-GAAALAK--------------RDWGEVP-----FQIGDLGSRYTPIGRKIAESDQV 106
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F++V+P +AV + ++ + + AG RPV+L+ P+L+D+ S + R R + +
Sbjct: 107 FLVVSP-SAVE---VQSVEKLCELAGDRPVVLLIPQLEDV--SIVGIGYAARQLRQRFLS 160
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 299
+ + Y+FR P+ GA+ + ++P+ +++ GE+Y +L+ EKP+
Sbjct: 161 TLFSAYYFR---------PLDGAVVLRSHPSRWQVLL-----EKGEEYELLTELPEKPAG 206
Query: 300 DDIN 303
+ +
Sbjct: 207 EALE 210
>gi|434407903|ref|YP_007150788.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
gi|428262158|gb|AFZ28108.1| protein of unknown function (DUF1995) [Cylindrospermum stagnale PCC
7417]
Length = 244
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 113/240 (47%), Gaps = 53/240 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+ DG R++++ PEL ++ + FA+ R+KV
Sbjct: 17 EATQAALADGCKRIQIDFLFPELK----------ILPVAEQFLPVFAEYDSRLKVFF-AD 65
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA----NEVDEQDDMFILVA 185
G ALA R+ DW D A + V+IG+ +++ +D++F+ +A
Sbjct: 66 AGGAALA---------RR-----DWAD--AAFSIVDIGTGRTASLQSKIQPEDEIFLFIA 109
Query: 186 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 245
P + + ++ + + G RPV+ +NPRL+D G GI T R+ R + ++ E+C
Sbjct: 110 PTSVE----VPQLEKLCEIIGDRPVVFLNPRLED-SGVVGIGYT-ARETRRRFISTIESC 163
Query: 246 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y+ R P+ A+ +YP +E++ + ++Y ++ ++PS D+++
Sbjct: 164 YYLR---------PVDDETAVFRSYPGQWEVWVETN-----DEYQRIAELPKRPSGDELD 209
>gi|300865956|ref|ZP_07110692.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336022|emb|CBN55850.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 248
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/273 (26%), Positives = 121/273 (44%), Gaps = 60/273 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 126
+A + A+ND +TRL++E+ PE+ + I EL + + F D G
Sbjct: 17 EAVKAALNDAQTRLQIELVFPEIALQAQSITQQFITDFEELANGLKVFFPDTGS------ 70
Query: 127 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG---ANEVDEQDDMFIL 183
ALA R+ DWG K T ++GS A +++ +D +F+L
Sbjct: 71 ------AALA---------RR-----DWGTVPFKIT--DLGSSRSPVAEKIEPEDQIFLL 108
Query: 184 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 243
V NA + + ++ + +AA +RPVIL+NPRL+D + GI GR R + + +
Sbjct: 109 V---NA-SSIEVAQVEQLCNAADSRPVILLNPRLEDA-ATIGIGYA-GRQLRDRFLNTLQ 162
Query: 244 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+CY+ R L A AL YP ++++ + GE Y ++S +KP D++
Sbjct: 163 SCYYIRPLPTA--------ALFRCYPQSWQVWLE---ETEGE-YKLISETAQKPVGDELE 210
Query: 304 DAFL--------GKPRNREKKASGFWGFLSSVF 328
P++ K G L S+F
Sbjct: 211 RIIAPTVQNGDPSAPQSAPVKKKGMLAELQSLF 243
>gi|428775196|ref|YP_007166983.1| hypothetical protein PCC7418_0540 [Halothece sp. PCC 7418]
gi|428689475|gb|AFZ42769.1| protein of unknown function DUF1995 [Halothece sp. PCC 7418]
Length = 253
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 109/243 (44%), Gaps = 57/243 (23%)
Query: 70 QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 127
Q+T A+ G TRL+VE+ IPE L E L+E D G +KV
Sbjct: 17 QSTLTALEAGLTRLQVELRIPEIALQGEKIAKEFADLLE---------DDYGSGLKVLFP 67
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDM 180
+ G ALA R+ +W D G++ T +E +V E D M
Sbjct: 68 DT-GAAALA---------RR-----NWADVSFQVNDLGSRNTPIE------KKVSEDDQM 106
Query: 181 FILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAA 240
F+LV+P +AV + ++ + + AG RPVIL+ P+L+D+ + GI R R + +
Sbjct: 107 FLLVSP-SAVE---VQKVEKLCNLAGDRPVILLIPQLEDV-ATVGIGYA-ARQLRERFLS 160
Query: 241 SFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
+ E+CY+ + L A AL YP+ ++L+ + G Y EKP D
Sbjct: 161 TLESCYYLQPLEEA--------ALLKRYPSSWQLW----IEKGENNYEFFCEEPEKPVGD 208
Query: 301 DIN 303
++
Sbjct: 209 TLD 211
>gi|254422515|ref|ZP_05036233.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
gi|196190004|gb|EDX84968.1| hypothetical protein S7335_2667 [Synechococcus sp. PCC 7335]
Length = 248
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 65/237 (27%), Positives = 115/237 (48%), Gaps = 58/237 (24%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFAD--DGKRVKVCVQG 128
ATQ A++ G + L++E+ IPEL + IA F + + ++ KV
Sbjct: 17 ATQAALDAGISLLQIEMAIPELKHQP--------------IAQQFVELFNDRQFKVLFPD 62
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAP 186
+ G ALA R+ DW D+ +G +G + A VD +DD++++V P
Sbjct: 63 A-GAAALA---------RR-----DWNNPDFVIRG----LGEL-AQTVDPEDDLYLIVNP 102
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+AV +D ++A+ + A +PV+L+NP+L+D+ GI R R + + E CY
Sbjct: 103 -SAVE---VDKVEALCNEALDQPVVLLNPQLEDV-AVVGIGYA-ARQLRDRFLSQIETCY 156
Query: 247 FFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 302
+ R PI G + YP +++++ + G ++Y + + +PSS+DI
Sbjct: 157 YVR---------PIDQGVVYRAYPGPWQIWREI----GPDEYEHVQDLSNRPSSEDI 200
>gi|428317816|ref|YP_007115698.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
gi|428241496|gb|AFZ07282.1| protein of unknown function DUF1995-containing protein
[Oscillatoria nigro-viridis PCC 7112]
Length = 248
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 67/240 (27%), Positives = 108/240 (45%), Gaps = 55/240 (22%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
AT A++DG+T L+VE+ PE+ + + L E F + VKV +
Sbjct: 18 ATAAALSDGKTLLQVELVFPEIALQAQSITLQFLPE--------FEEIYPGVKVFFPDT- 68
Query: 131 GEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFIL 183
G ALA R+ DWG D G+ T VE +++ +D +F+L
Sbjct: 69 GAAALA---------RR-----DWGETPFKVTDLGSSRTPVE------DKIAPEDQLFLL 108
Query: 184 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 243
+ P + ++ + AA RPVIL+NPRL+D+ + GI GR R + E
Sbjct: 109 INP----AAVEVAQVEKIYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLNKIE 162
Query: 244 NCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+CY+ R L A AL YP ++++ + ++Y ++S +KP DD+
Sbjct: 163 SCYYIRPLDTA--------ALFRCYPQPWQVWLETN-----DEYELISETAQKPVGDDLE 209
>gi|427715914|ref|YP_007063908.1| hypothetical protein Cal7507_0584 [Calothrix sp. PCC 7507]
gi|427348350|gb|AFY31074.1| protein of unknown function DUF1995-containing protein [Calothrix
sp. PCC 7507]
Length = 243
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 108/248 (43%), Gaps = 55/248 (22%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
ATQ A+ DG TRL+VE PEL P M + AD R+KV
Sbjct: 18 ATQAALADGYTRLQVEFLFPELKP----------MPVAEQYLPLLADYESRLKVFF-ADT 66
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDE----QDDMFILVAP 186
G ALA R+ D+ T +IG+ A V + ++++F+ +AP
Sbjct: 67 GAAALA---------RR--------DWDVPFTISDIGTGRATSVSDKIQPEEEIFLFIAP 109
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+ I ++ + G RP IL+NPRL+D GI R R + + E CY
Sbjct: 110 ----SSVEISQLEKLFAEIGDRPAILLNPRLEDA-AIVGIGYA-ARQIRERFINTIETCY 163
Query: 247 FFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
+ R P+ A+ YP +E++ V + GE Y ++ ++PS D+I+
Sbjct: 164 YLR---------PVDDQTAVFRCYPGLWEVW----VETNGE-YQKIAELPKRPSGDEIDL 209
Query: 305 AFL-GKPR 311
L G+P+
Sbjct: 210 ILLKGQPQ 217
>gi|411117915|ref|ZP_11390296.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
gi|410711639|gb|EKQ69145.1| protein of unknown function (DUF1995) [Oscillatoriales
cyanobacterium JSC-12]
Length = 241
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 106/235 (45%), Gaps = 53/235 (22%)
Query: 79 GRTRLKVEINIPELNPEMDVYR-IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
G +RL+VEI IPEL P Y+ + E + + F D G ++ AG
Sbjct: 26 GYSRLQVEIAIPELKPMQPAYQYLSVFEEYGSHLKVFFTDAGA-------AALARREWAG 78
Query: 138 MP-----LQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGN 192
P + + GTR+ T VE V +D +F+ VAP +AV
Sbjct: 79 SPYNIQSIDIQGTRQT-------------TPVE------QLVLSEDRVFVFVAP-SAVEV 118
Query: 193 CIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLY 252
+++ + +AAG RPVIL NPR++D+ S + R R + + E CY+ +
Sbjct: 119 GVVEQ---IANAAGDRPVILFNPRMEDV--SVVGIGYAARKLRERFLNTIEPCYYLK--- 170
Query: 253 YAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
P+ G AL YP+ ++++ E Y +++ T+KP+ + +++ F
Sbjct: 171 ------PLEGSALIRCYPSLWQVWAETS-----EGYTLIAEETQKPTLERLDEIF 214
>gi|220907967|ref|YP_002483278.1| hypothetical protein Cyan7425_2561 [Cyanothece sp. PCC 7425]
gi|219864578|gb|ACL44917.1| conserved hypothetical protein [Cyanothece sp. PCC 7425]
Length = 233
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 102/233 (43%), Gaps = 54/233 (23%)
Query: 79 GRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM 138
G +RL+V++ IPEL P M + + F+D G +KV +
Sbjct: 26 GYSRLQVDLLIPELKP----------MPVAQQYLSLFSDYGAGLKVFFADAGAAALAR-- 73
Query: 139 PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCI 194
DWGD V G NE + +D F+LVAP
Sbjct: 74 -------------RDWGD-------VPFTIRGVNELLNPIQPEDRAFVLVAPTPVE---- 109
Query: 195 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 254
+ ++AM + AG RP IL+N +L+D+ + GI GR R + A+FE CY+ R L +
Sbjct: 110 VMQVEAMANQAGDRPFILLNAKLEDI-ATIGIGYA-GRQLRQRFLATFEPCYYLRPLDW- 166
Query: 255 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
GA+ YP+ ++++ + ++Y +++ E+P+ D ++ L
Sbjct: 167 -------GAVLRIYPSPWQVW----LEQAEDQYQLIAEEAERPAGDRLDQILL 208
>gi|434395506|ref|YP_007130453.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
gi|428267347|gb|AFZ33293.1| protein of unknown function DUF1995-containing protein [Gloeocapsa
sp. PCC 7428]
Length = 243
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 109/230 (47%), Gaps = 50/230 (21%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
DG + L++EI IPELNP M + + G ++KV + G ALA
Sbjct: 25 DGYSLLQIEILIPELNP----------MPVAEQFLPALDSFGNQLKVFFPDA-GAAALA- 72
Query: 138 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQ----DDMFILVAPQNAVGNC 193
R+ DWG K ++IG+ G VD+Q D+ F+LVAP +AV
Sbjct: 73 --------RR-----DWGSVPFK--ILDIGT-GRVPVDQQVLPEDEAFLLVAP-SAVE-- 113
Query: 194 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ ++ + A G RP IL+NPRL+D+ S + GR R + + E+CY R L
Sbjct: 114 -VAQVEKLHQAVGERPFILLNPRLEDV--SIVGIGYAGRQLRARFLNTIESCYHLRPLDG 170
Query: 254 AGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
A A+ YP+ +++++ + +Y +++ +KP D+++
Sbjct: 171 A--------AVFRCYPSPWQVWQE----NKDGEYQLIAEQPKKPMGDEVD 208
>gi|119510288|ref|ZP_01629424.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
gi|119465032|gb|EAW45933.1| hypothetical protein N9414_16062 [Nodularia spumigena CCY9414]
Length = 244
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 112/247 (45%), Gaps = 50/247 (20%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
ATQ A+ DG TRL+V+ PEL LM + F + R+K+ +
Sbjct: 18 ATQAALADGYTRLQVDFLFPELK----------LMPVAEQFLSLFTEYDSRLKIFFPDA- 66
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQN 188
G ALA DW K + G + + +++ +D++F+ +AP +
Sbjct: 67 GGAALAN--------------RDWAGTPFKILDIGTGRVASIQSKIQPEDEIFLFIAPTS 112
Query: 189 AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
+ ++ + + G RP +++NPRL+D G GI T R R + ++ E+CY+
Sbjct: 113 VE----VPQVEKLCENIGDRPFVMLNPRLED-SGVVGIGYT-ARQTRQRFISTLESCYYL 166
Query: 249 RLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
R P+ A+ YP +E++ ++ GE Y ++ ++P+ D+++
Sbjct: 167 R---------PVDDTTAVFRCYPGLWEVWVEIN----GE-YQKVAELPKRPTGDELDVIV 212
Query: 307 L-GKPRN 312
+ G+P+
Sbjct: 213 MQGQPQT 219
>gi|443309167|ref|ZP_21038918.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
gi|442780785|gb|ELR90927.1| protein of unknown function (DUF1995) [Synechocystis sp. PCC 7509]
Length = 243
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 100/239 (41%), Gaps = 44/239 (18%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
ATQ A+ DG RL+VEI +PELNP +A F D + +
Sbjct: 18 ATQNAIADGLNRLQVEILLPELNP--------------MPVAERFLSDEEGISHNFNKIF 63
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAPQNA 189
A A A R+ DWG+ + + + E + +D+ + +AP
Sbjct: 64 FPDAGAA-----ALARR-----DWGEVPFELLDISTKRVSVEEQIQPEDEAILCIAPTAQ 113
Query: 190 VGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFR 249
+ ++ + +A G RPV+L+NPRL+D+ S + GR R + + ++CY+ R
Sbjct: 114 E----VLQIEKLCNAMGDRPVVLLNPRLEDV--SIVGIGYAGRQLRDRFLKNIQSCYYLR 167
Query: 250 LLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 308
L G L YP ++++ + Y ++ KP DD++ F G
Sbjct: 168 PLEE--------GTLFRCYPGLWQVWCETT-----DGYELIKELPNKPVGDDLDLIFAG 213
>gi|334118025|ref|ZP_08492115.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
gi|333460010|gb|EGK88620.1| Domain of unknown function DUF1995-containing protein [Microcoleus
vaginatus FGP-2]
Length = 248
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/239 (27%), Positives = 109/239 (45%), Gaps = 53/239 (22%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRV---IALSFADDGKRVKVCVQ 127
AT A++DG+T L+VE+ PE+ + L EL + + + F D
Sbjct: 18 ATAAALSDGKTLLQVELVFPEIALQAQSITEQFLPELEEIYPGVKVFFPD---------- 67
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILV 184
G ALA R+ DWG+ K T ++GS + +++ +D +F+L+
Sbjct: 68 --AGAAALA---------RR-----DWGETPFKVT--DLGSSRSPVEDKIAPEDQLFLLI 109
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
P + ++ + AA RPVIL+NPRL+D+ + GI GR R + + E+
Sbjct: 110 NP----AAVEVAQVERLYIAAAGRPVILLNPRLEDV-ATIGIGYA-GRQLRDRFLSKIES 163
Query: 245 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
CY+ R L A AL YP ++++ + +Y ++S +KP DD+
Sbjct: 164 CYYVRPLDAA--------ALFRCYPQSWQVWLERN-----NQYELISETAQKPVGDDLE 209
>gi|56752173|ref|YP_172874.1| hypothetical protein syc2164_c [Synechococcus elongatus PCC 6301]
gi|81300739|ref|YP_400947.1| hypothetical protein Synpcc7942_1930 [Synechococcus elongatus PCC
7942]
gi|56687132|dbj|BAD80354.1| hypothetical protein [Synechococcus elongatus PCC 6301]
gi|81169620|gb|ABB57960.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 247
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 111/242 (45%), Gaps = 53/242 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A QRA+ G TR++V++ IPEL + ++ +F D G +++V
Sbjct: 17 EAAQRALAAGLTRIQVDLAIPELKIQPIAWQFLA----------AFLDRGAKLRVFFPDP 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 187
G ALA R+ DWG+ + +G + + A V ++++FI + P
Sbjct: 67 -GAAALA---------RR-----DWGELPFSVRG----MEELKAA-VQPEEELFIFIEP- 105
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++ + G RPVIL+NPRL+D+ + GI R R + +++ Y+
Sbjct: 106 ---SSVEVQRLEQLCQEIGDRPVILLNPRLEDV-ATIGIGYA-ARQLRERFLNQWQSAYY 160
Query: 248 FRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
P+ G A+ YP +++++ D Y +L + ++P+ +D++ F
Sbjct: 161 LS---------PLEGAAIFQAYPQRWQVWQETDT-----GYELLQEYDQRPNGEDLDRLF 206
Query: 307 LG 308
G
Sbjct: 207 AG 208
>gi|452823755|gb|EME30763.1| hypothetical protein isoform 2 [Galdieria sulphuraria]
Length = 1138
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 28 HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 84
+ R+ +NQ SQ++ G+P + ++ IP+S ++ L A A G RL
Sbjct: 50 NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104
Query: 85 VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 142
++I+ + + ++ + L EL+RVI + + + + G A A
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155
Query: 143 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 202
RK DW +K +F+ GS D D I++ P +AV D++AM
Sbjct: 156 ---RK-----DWNI--SKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196
Query: 203 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ RP+ILINP+L D+ G++G + R R ++ ++FE+ YF R+ +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254
Query: 254 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
QYP ++ + N E + P Y +L TF KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305
>gi|452823754|gb|EME30762.1| hypothetical protein isoform 1 [Galdieria sulphuraria]
Length = 1152
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 131/296 (44%), Gaps = 56/296 (18%)
Query: 28 HERKSRSNQVSSSQDNWKGIPTR---LNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLK 84
+ R+ +NQ SQ++ G+P + ++ IP+S ++ L A A G RL
Sbjct: 50 NSRQISANQTFLSQES--GLPDQGEVVSPIPNSL---TRLSENSLSALIAATEAGYKRLL 104
Query: 85 VEINIPELNPEMDVYR--IGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQL 142
++I+ + + ++ + L EL+RVI + + + + G A A
Sbjct: 105 IDIDFEQGDSTYTTFKNAVPFLRELLRVIE---SRNWQTTTTLLFPDEGSAAFA------ 155
Query: 143 AGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMT 202
RK DW +K +F+ GS D D I++ P +AV D++AM
Sbjct: 156 ---RK-----DWN--ISKCSFLSFGSYSGK--DSSQDWLIMICP-SAV------DVEAME 196
Query: 203 D---------AAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
+ RP+ILINP+L D+ G++G + R R ++ ++FE+ YF R+ +
Sbjct: 197 QIVNRESPFLSKDARPIILINPKLVDM-GATG-LGFNARQLRQQFLSTFESIYFLRVYTW 254
Query: 254 AGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
QYP ++ + N E + P Y +L TF KP+ D I + FL
Sbjct: 255 GVVVRQYPFRWSVWLDTANSDENSSSEEAP-----YRLLRTFENKPNDDTIQEIFL 305
>gi|440681570|ref|YP_007156365.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
gi|428678689|gb|AFZ57455.1| protein of unknown function DUF1995-containing protein [Anabaena
cylindrica PCC 7122]
Length = 244
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/238 (23%), Positives = 101/238 (42%), Gaps = 49/238 (20%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ DG TR++V+ PEL M + FA+ R+K+ +
Sbjct: 17 EAVKSALADGMTRIQVDFLFPELK----------FMPIAEQFIPLFAEYESRLKIFFADA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQ 187
W D K + G + E + +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WADSPFKVEDIGTGRTASLETKIQPEDEIFLFIAPT 111
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++ + + GTRP IL+NPRL+D G GI R+ R + ++ E+CY+
Sbjct: 112 SVE----VPQLEKLCEIIGTRPFILLNPRLED-SGVVGIGYA-ARETRRRFISTIESCYY 165
Query: 248 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
R P+ AL YP +E++ + ++Y ++ ++PS D+I+
Sbjct: 166 LR---------PVDDESALFRCYPGDWEVWLETN-----DEYQKIAELPKRPSGDEID 209
>gi|428306245|ref|YP_007143070.1| hypothetical protein Cri9333_2705 [Crinalium epipsammum PCC 9333]
gi|428247780|gb|AFZ13560.1| protein of unknown function DUF1995-containing protein [Crinalium
epipsammum PCC 9333]
Length = 248
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 114/242 (47%), Gaps = 51/242 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ DG TRL VE+ PEL M + + G +KV +
Sbjct: 17 EATKTAIADGHTRLMVELVFPELK----------TMPVASQFIPELEEYGSHLKVFFTDA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 186
G ALA R+ DWG+ K ++GS + N+++E+D +F+L+ P
Sbjct: 67 -GAAALA---------RR-----DWGEVPFK--ISDLGSSRSSVQNKIEEEDQIFLLIEP 109
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+AV I ++ + +AAG RPVIL+ PRL+D +G + YAA
Sbjct: 110 -SAVE---IAQVEQLCNAAGDRPVILLVPRLED-------AAVVG----IGYAARQLRDR 154
Query: 247 FFRLLYYAGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 305
F + LY + P+ G AL ++P+ ++++ + + Y +++ T+KP + ++
Sbjct: 155 FIKTLYSSYYIRPLEGAALFRSHPSPWQVWLETN-----DDYNLIAEETQKPVGETLDQI 209
Query: 306 FL 307
L
Sbjct: 210 IL 211
>gi|427705929|ref|YP_007048306.1| hypothetical protein Nos7107_0483 [Nostoc sp. PCC 7107]
gi|427358434|gb|AFY41156.1| protein of unknown function DUF1995-containing protein [Nostoc sp.
PCC 7107]
Length = 244
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 113/240 (47%), Gaps = 53/240 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ DG TR++V++ PEL M + FA+ R+KV
Sbjct: 17 EATKAALADGYTRVQVDLLFPELKQ----------MPVAEQFLPLFAEYESRLKVFF-AD 65
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA----NEVDEQDDMFILVA 185
G ALA R+ DW D A ++IG+ A +++ +D++F+ V+
Sbjct: 66 AGGAALA---------RR-----DWVD--AAFQILDIGTGRAASIQSKIKPEDEIFLFVS 109
Query: 186 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 245
P +AV I ++ + + G RP++++NPRL+D G+ GI R R + + E+C
Sbjct: 110 P-SAVE---IPQLEKVCEIIGDRPLVMLNPRLED-SGTVGIGYA-ARQIRERFLNTIESC 163
Query: 246 YFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y+ R P+ A+ YP +E+ + GE + ++ +KPS DDI+
Sbjct: 164 YYLR---------PVDENTAVFRCYPGQWEVLVQ-----KGETWEKIAELPKKPSGDDID 209
>gi|428300086|ref|YP_007138392.1| hypothetical protein Cal6303_3487 [Calothrix sp. PCC 6303]
gi|428236630|gb|AFZ02420.1| protein of unknown function DUF1995-containing protein [Calothrix
sp. PCC 6303]
Length = 250
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 110/253 (43%), Gaps = 53/253 (20%)
Query: 59 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
++ N D + QA T A+ DG TRL+VE+ PEL P M + +F
Sbjct: 3 QLPNSLEDAIFQAREATLAAIADGCTRLQVELLFPELKP----------MPVAEEFIPTF 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-- 173
A G ++KV + DW D K V G A +
Sbjct: 53 ASYGDKLKVFFADAGAAALAR---------------RDWKDVPFKIWDVGTGRASAAQPK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGIMQTMGR 232
+ +D+MF+ +AP + + +++ + G RP +++NP+L+D G+ GI R
Sbjct: 98 IQAEDEMFLFIAPTSVE----VAELERLCGEIGEQRPFVMLNPKLED-SGTVGIGYA-AR 151
Query: 233 DKRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVIL 290
+ R+ + ++ E+CY+ R P+ AL YP +E++ V GE Y +
Sbjct: 152 NIRMRFISTIESCYYLR---------PVDDETALFRCYPGMWEVW----VDKDGE-YKRI 197
Query: 291 STFTEKPSSDDIN 303
+ KPS D+++
Sbjct: 198 AELANKPSGDELD 210
>gi|443320726|ref|ZP_21049809.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
gi|442789566|gb|ELR99216.1| protein of unknown function (DUF1995) [Gloeocapsa sp. PCC 73106]
Length = 241
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 109/236 (46%), Gaps = 59/236 (25%)
Query: 71 ATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 128
A A+NDG RL+VE+ IPE L + TL F + G +KV +
Sbjct: 17 AVTNALNDGYRRLQVELVIPEIALQAQALALEFTTL----------FTEYGSGLKVFLPD 66
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 182
+ G ALA R+ DWG+ T + +G N V++ D +F+
Sbjct: 67 T-GAAALA---------RR-----DWGE-----TPFAVTDLGTNRSPIETRVNDTDQIFL 106
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
+V+P +AV + ++ + + AG RPV+L+ P+L+D+ S + R R + ++
Sbjct: 107 VVSP-SAVE---VTQVEKLCNLAGDRPVVLLIPQLEDV--SIVGIGLAARQLRERFLSTL 160
Query: 243 ENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
E+CY+F+ P+ GA + YP+ +++++ G Y +L EKP
Sbjct: 161 ESCYYFK---------PLEGAAVLKNYPSPWQVWRE-----SGNDYQLLCEQGEKP 202
>gi|428208370|ref|YP_007092723.1| hypothetical protein Chro_3395 [Chroococcidiopsis thermalis PCC
7203]
gi|428010291|gb|AFY88854.1| protein of unknown function DUF1995 [Chroococcidiopsis thermalis
PCC 7203]
Length = 272
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/234 (26%), Positives = 111/234 (47%), Gaps = 51/234 (21%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
DG TRL+VE+ IPEL P M + F G ++KV + G ALA
Sbjct: 53 DGYTRLQVELAIPELKP----------MSVAEQFLAEFTQFGSQLKVFFPDA-GAAALA- 100
Query: 138 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQNAVGNCII 195
R+ DWG+ K V+IG+ E ++ +D+ + ++P +AV +
Sbjct: 101 --------RR-----DWGEVPFK--IVDIGTRSPLEEKIEPEDEALLFISP-SAVE---V 141
Query: 196 DDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAG 255
+ ++ + + A T P +++NPRL+D+ GI GR R + + E+CY+ R
Sbjct: 142 ERVEKLCELA-TCPTVMLNPRLEDV-AIVGIGYA-GRQLRTRFLNNIESCYYLR------ 192
Query: 256 TQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLG 308
PI ++ +YP +++++ ++ E++ +++ KP D+I+ G
Sbjct: 193 ---PIENISVFRSYPGEWQIWREIE-----EEFQLITEQPTKPMGDEIDAILYG 238
>gi|307150883|ref|YP_003886267.1| hypothetical protein Cyan7822_0982 [Cyanothece sp. PCC 7822]
gi|306981111|gb|ADN12992.1| Domain of unknown function DUF1995 [Cyanothece sp. PCC 7822]
Length = 250
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 98/214 (45%), Gaps = 44/214 (20%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+ G RL+VE+ IPE+ E G +E + F G +KV +
Sbjct: 17 EATQIALEAGLGRLQVELVIPEIALEAQ----GLALEFTSL----FEQYGSGLKVLFPDT 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 182
G ALA R+ DWG D G++ VE+ +V +D+ F+
Sbjct: 69 -GAAALA---------RR-----DWGETPFKISDLGSRFIPVEM------KVTPEDEAFL 107
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
+V P + I + + + A RPV+L+ P+L+D+ GI T R R + ++
Sbjct: 108 VVCP----ASIEIQSVAKLCEIAEQRPVLLLIPQLEDV-SIVGIGYT-ARQLRDNFLSTL 161
Query: 243 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYEL 274
E+CY+FR L A YP + + + GYEL
Sbjct: 162 ESCYYFRPLEGAAVVRSYPGLWQVWLEKEQGYEL 195
>gi|390438390|ref|ZP_10226864.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389838196|emb|CCI30988.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 247
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 104/235 (44%), Gaps = 45/235 (19%)
Query: 62 NYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADD 118
N + +LQA TQ A+ G ++VE+ IPE+ + + +AL F
Sbjct: 5 NSLEETILQAKAATQLALESGARCVQVELVIPEIALQ------------AQALALDFTS- 51
Query: 119 GKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA------N 172
+ S G G P A +L DWG+ T ++G +G+ N
Sbjct: 52 -------IFDSYGSGLRVIFPDTGAA---MLARRDWGE-----TVFQLGDLGSRFIPIEN 96
Query: 173 EVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGR 232
++ +D+ F++V P + I+ ++ + + A RPV+L+ P+L+D+ GI R
Sbjct: 97 KIKPEDEAFLVVCPSSVE----INSVEKLCNLAEDRPVVLLIPQLEDV-SVVGIGYA-AR 150
Query: 233 DKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGE 285
R + ++ E+CY+FR L A YP + + + +GYEL GE
Sbjct: 151 QLRERFLSTLESCYYFRPLESAIVYRSYPSLWQVWLEKEDGYELISEQSTKPMGE 205
>gi|414076889|ref|YP_006996207.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
gi|413970305|gb|AFW94394.1| hypothetical protein ANA_C11624 [Anabaena sp. 90]
Length = 244
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 58/253 (22%), Positives = 107/253 (42%), Gaps = 49/253 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ DG TR++VE+ PEL M + F + R+KV +
Sbjct: 17 EAVKSALADGMTRIQVELLFPELK----------FMTVAEQFLPLFTEYESRLKVFFADA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE--VDEQDDMFILVAPQ 187
W D + + + G + E + +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WTDAQFQISDIGTGRAASLEAKIQPEDEIFLFIAPT 111
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++ + + G RPVI++ PRL+D GI T R+ R + ++ E+CY+
Sbjct: 112 SVE----VPQLEKLCELIGERPVIMLTPRLED-SSVVGIGYT-ARETRRRFISTIESCYY 165
Query: 248 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 305
R P+ AL YP +E++ + + GE Y ++ +KPS D+++
Sbjct: 166 IR---------PVDDESALFRCYPGLWEVW----LETAGE-YQKVAELPKKPSGDELDAI 211
Query: 306 FLGKPRNREKKAS 318
+G R A+
Sbjct: 212 LMGGQRENTTDAT 224
>gi|86604746|ref|YP_473509.1| hypothetical protein CYA_0013 [Synechococcus sp. JA-3-3Ab]
gi|86553288|gb|ABC98246.1| conserved hypothetical protein [Synechococcus sp. JA-3-3Ab]
Length = 246
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/283 (22%), Positives = 117/283 (41%), Gaps = 59/283 (20%)
Query: 50 RLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVR 109
+L PH + +QAT+ A+ G RL +EI P++ PE+ +ELV
Sbjct: 6 QLTQFPHLPGDLAQACEQAIQATRLALQAGYRRLLIEILAPDIKPEVLAR---PFLELVP 62
Query: 110 VIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSI 169
AL D G ALA ++G+ +++ S+
Sbjct: 63 PPALVLFSDA-----------GGAALAQR-----------------EWGSLPDGIQLQSL 94
Query: 170 GA--NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAG-------TRPVILINPRLKDL 220
A EQ +F++ A + + + + D +G +PV+L+NP+L+D
Sbjct: 95 TARTQPTPEQSVLFVMPAVYSLDQVERVCQLVSGGDPSGLAAKGRDPKPVVLLNPQLQD- 153
Query: 221 PGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDV 280
++ + GR R + ++FE Y+ R L + GAL YP+ + ++++ +
Sbjct: 154 -AAAVGVGLAGRRLRQRFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEE- 203
Query: 281 PSGGEKYVILSTFTEKPSSDDINDAFLGKPRNREKKASGFWGF 323
Y +L TF +PS +++ + F K+ +W +
Sbjct: 204 ---PGLYSVLKTFRARPSGEEVAELF-----RASKRGDPWWSW 238
>gi|113475888|ref|YP_721949.1| hypothetical protein Tery_2247 [Trichodesmium erythraeum IMS101]
gi|110166936|gb|ABG51476.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 253
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 118/247 (47%), Gaps = 54/247 (21%)
Query: 69 LQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQG 128
++AT+ A+ DG TR+++EI +P++ + L + + + ++KV
Sbjct: 16 MEATKAALQDGYTRVQIEIVVPDIELQAQ--------SLAKQFIPALLETSTKLKVFFPD 67
Query: 129 SMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFI 182
S G ALA R+ DW D TF +I +G + +V+ +D F+
Sbjct: 68 S-GAAALA---------RR-----DWQD----ATF-KIEDLGTSRSPVDKKVEPEDQCFL 107
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
L+AP +A+ + + +++ AG RPVI++ P+L+D+ GI R R + +
Sbjct: 108 LIAP-SAIE---VAQTEKLSNLAGDRPVIMLIPKLEDV-SIVGIGYA-ARQLRERFIKTI 161
Query: 243 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDI 302
E+CY+ R L A AL YP+ ++++ + G+ Y +++ EKP D++
Sbjct: 162 ESCYYIRSLGGA--------ALYRCYPSPWQVW----LEENGQ-YKLIAEQPEKPVGDEV 208
Query: 303 NDAFLGK 309
D L K
Sbjct: 209 -DMILAK 214
>gi|119484707|ref|ZP_01619189.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
gi|119457525|gb|EAW38649.1| hypothetical protein L8106_14580 [Lyngbya sp. PCC 8106]
Length = 249
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 66/247 (26%), Positives = 111/247 (44%), Gaps = 56/247 (22%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYR---IGTLMELVRVIALSFADDGKRVKVCV 126
QATQ A++DG ++VE+ PE+ + I + E V+ + F D
Sbjct: 17 QATQAALDDGYKLVQVELVFPEIELQAQAIAQQFIPAIEESGTVLKVFFPD--------- 67
Query: 127 QGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFIL 183
G ALA R+ DWG+ K ++GS + + V + D F++
Sbjct: 68 ---AGAAALA---------RR-----DWGEIPFK--ISDLGSSRSPIDSRVKDDDGRFLV 108
Query: 184 VAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFE 243
V+P ++ ++ ++ AG R IL+NPRL+D+ GI R R + ++ E
Sbjct: 109 VSPTPVE----VEQVEKLSQLAGDRVTILLNPRLEDI-AIIGIGYA-ARALRDRFISTIE 162
Query: 244 NCYFFRLLYYAGTQYPIMG--ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
+CY+ R P+ G AL YP+ +E+++ +D GE Y +L+ KP D
Sbjct: 163 SCYYLR---------PLEGDAALYRCYPSLWEVWQEID----GE-YTLLAQEQTKPVGDQ 208
Query: 302 INDAFLG 308
++ G
Sbjct: 209 LDQILAG 215
>gi|443327973|ref|ZP_21056578.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
gi|442792382|gb|ELS01864.1| protein of unknown function (DUF1995) [Xenococcus sp. PCC 7305]
Length = 248
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 68/266 (25%), Positives = 112/266 (42%), Gaps = 51/266 (19%)
Query: 62 NYFYDDVLQATQRA-------VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALS 114
N F + +A ++A + DG TRL+VE+ IPE+ + + +AL
Sbjct: 2 NEFPQSIAEAVEQAKKTVIDAIADGYTRLQVELVIPEIPLQ------------AQSLALE 49
Query: 115 FADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIG 167
FA+ S G G P A +L DWG D G + T +E
Sbjct: 50 FAN-------LFLESHGSGVKVMFPDTGAA---MLAKRDWGEIAFQVTDMGGRFTSIE-- 97
Query: 168 SIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIM 227
++ E D++FI+V P + ++ ++ + + G RPVIL+ P+L+D+ GI
Sbjct: 98 ----QQIAETDEIFIMVCPSSVE----VERVETLCNLVGDRPVILLIPQLEDV-AVVGI- 147
Query: 228 QTMGRDKRLEYAASFENCYFFRLLYYAGTQ--YPIMGALRMTYPNGYELYKRVDV-PSGG 284
R R + ++ +CY+ R L A +P + + GYEL + P G
Sbjct: 148 GLAARRLRDRFISTLYSCYYIRPLEGASILRIHPSSWQIWLEKEEGYELAMELSSKPMGE 207
Query: 285 EKYVILSTFTEKPSSDDINDAFLGKP 310
E +L T N++ + KP
Sbjct: 208 ELDRLLIQLTTPEGEAGKNNSTIKKP 233
>gi|332705285|ref|ZP_08425366.1| protein of unknown function, DUF1995 [Moorea producens 3L]
gi|332356028|gb|EGJ35487.1| protein of unknown function, DUF1995 [Moorea producens 3L]
Length = 251
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 109/233 (46%), Gaps = 43/233 (18%)
Query: 76 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 135
++DG+TR++VE+ PE+ + E + + F G +K+ + G AL
Sbjct: 23 LDDGQTRVQVELVFPEIALQAQ----SIAQEFIPI----FEQYGPGLKIFFPDT-GASAL 73
Query: 136 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIG-SIGANEVDEQDDMFILVAPQNAVGNCI 194
A DWG+ K T V S +++ D +F+LV P +AV
Sbjct: 74 A--------------LRDWGEVPFKITDVGTSRSPAETRIEDDDQVFLLVGP-SAVE--- 115
Query: 195 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 254
+ ++ + + AG RP +++NP+L+D+ S + R R + + E+CY+ R +
Sbjct: 116 VAQVEKICNLAGDRPCVILNPQLEDV--SIVGIGYAARQLRDRFLKTLESCYYLR--PFP 171
Query: 255 GTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
G GAL YP+ ++++ +D ++Y +++ KP+++ ++ L
Sbjct: 172 G------GALWRCYPSMWQVWLEID-----DEYQLVTEEPSKPTAEALDQIIL 213
>gi|298489954|ref|YP_003720131.1| hypothetical protein Aazo_0482 ['Nostoc azollae' 0708]
gi|298231872|gb|ADI63008.1| Domain of unknown function DUF1995 ['Nostoc azollae' 0708]
Length = 244
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 56/246 (22%), Positives = 102/246 (41%), Gaps = 50/246 (20%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ DG TR++V+ PEL M + FA+ RVKV +
Sbjct: 17 EAVKSALADGVTRIQVDFLFPELK----------FMPVAEQFVPLFAEYESRVKVFFADA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 187
W + K + G + ++ +D++F+ +AP
Sbjct: 67 GAAALARRD---------------WQNVPFKVEDIGTGRAASLQTKIQPEDEIFLFIAPT 111
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ ++ M + TRP++L+NPRL+D G GI R+ R + ++ E+CY+
Sbjct: 112 PVE----VPQLEKMCEIIDTRPIVLLNPRLED-SGVVGIGYA-ARETRRRFISTIESCYY 165
Query: 248 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 305
R P+ AL YP +E++ +Y ++ ++PS D+I+
Sbjct: 166 LR---------PVDDESALFRCYPGQWEVWL-----ESNNEYEKIAELPKRPSGDEIDMI 211
Query: 306 FL-GKP 310
+ G+P
Sbjct: 212 LMKGQP 217
>gi|67922775|ref|ZP_00516276.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|416392960|ref|ZP_11685949.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
gi|67855391|gb|EAM50649.1| hypothetical protein CwatDRAFT_3614 [Crocosphaera watsonii WH 8501]
gi|357263546|gb|EHJ12537.1| hypothetical protein CWATWH0003_2757 [Crocosphaera watsonii WH
0003]
Length = 246
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 104/244 (42%), Gaps = 46/244 (18%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ G R+ +EI IPE+ + + +AL F +
Sbjct: 17 EAAKIALESGVGRICLEIVIPEIAIQ------------AQALALEFT------------T 52
Query: 130 MGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMF 181
+ EG +G+ + T L DWG D G++ T +E ++ E D++F
Sbjct: 53 LFEGQQSGLKVIFPDTGAAALARRDWGETPFRVTDLGSRATGIE------RKISEADEIF 106
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+LV P + ++ ++ + + AG RPVIL+ P+L+D+ S + R R + ++
Sbjct: 107 LLVCPSSVE----VETVEKLCELAGDRPVILLIPQLEDV--SIVGIGYAARQLRDRFIST 160
Query: 242 FENCYFFRLL--YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSS 299
E+ Y+FR L YP + + +GYEL GE IL K
Sbjct: 161 LESVYYFRPLDDVVVLRSYPSPWLVFLEKEDGYELIAEQGQKPMGEALDILVNNALKGEE 220
Query: 300 DDIN 303
DD N
Sbjct: 221 DDSN 224
>gi|218247522|ref|YP_002372893.1| hypothetical protein PCC8801_2736 [Cyanothece sp. PCC 8801]
gi|257061142|ref|YP_003139030.1| hypothetical protein Cyan8802_3366 [Cyanothece sp. PCC 8802]
gi|218168000|gb|ACK66737.1| conserved hypothetical protein [Cyanothece sp. PCC 8801]
gi|256591308|gb|ACV02195.1| conserved hypothetical protein [Cyanothece sp. PCC 8802]
Length = 245
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 63/232 (27%), Positives = 102/232 (43%), Gaps = 45/232 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ R++VE+ +PE+ + G +E + DD +KV +
Sbjct: 17 EATKLALEANLGRIQVELVVPEIALQAQ----GLALEFTTLF-----DDLSSLKVIFPDT 67
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMFI 182
G ALA R+ DWG D G++ T VE+ +V EQD F+
Sbjct: 68 -GAAALA---------RR-----DWGETPFKVTDLGSRATSVEM------KVSEQDRAFL 106
Query: 183 LVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASF 242
LV P + ++ ++ + + AG RPV+L+ P+L+D+ GI R R + ++
Sbjct: 107 LVCP----SSVEVESVEKLCNLAGDRPVVLLIPQLEDV-SVVGIGYA-ARQLRERFISTL 160
Query: 243 ENCYFFRLLYYAGT--QYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILST 292
+ Y+FR L A YP + + GYEL GE I+ T
Sbjct: 161 NSAYYFRPLEGAAILRSYPSPWNVYLETETGYELIASEPQKPLGEALEIILT 212
>gi|254410487|ref|ZP_05024266.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196182693|gb|EDX77678.1| hypothetical protein MC7420_3002 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 245
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 112/242 (46%), Gaps = 37/242 (15%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+ATQ A+NDG TRL+VE+ PE+ + I + F G +KV +
Sbjct: 17 EATQAALNDGYTRLQVELVFPEIA--LQAQSIAWQFTPI------FEQYGSGLKVLFPDA 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 186
G ALA R+ DWG+ K T +IG+ + + + +D+ F++V+P
Sbjct: 69 -GAAALA---------RR-----DWGETPFKVT--DIGTSRSPVESRIQPEDEAFLIVSP 111
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+AV + ++ + + AG RP +L+ P+L+DL GI R R + ++ +CY
Sbjct: 112 -SAVE---VSQVEKLCNLAGDRPCVLLTPQLEDL-KVVGIGYA-ARQLRERFLSTLTSCY 165
Query: 247 FFRLLYYAGTQ--YPIMGALRMTYPNGYELY-KRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A YP + + + N Y+L + P G +I++ T S+D
Sbjct: 166 YVQPLDGAALLRVYPGLWQIWLEKENAYQLIAEEPQKPVGDALELIIAKVTGSGDSEDTP 225
Query: 304 DA 305
A
Sbjct: 226 AA 227
>gi|434386365|ref|YP_007096976.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
gi|428017355|gb|AFY93449.1| protein of unknown function (DUF1995) [Chamaesiphon minutus PCC
6605]
Length = 240
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 102/233 (43%), Gaps = 55/233 (23%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAG 137
DG R+ VEI IPEL M L R SF+D G +KV + G ALA
Sbjct: 25 DGHKRITVEIVIPELKS----------MGLARQFLESFSDRGNTLKVLFTDT-GAAALAK 73
Query: 138 MPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG------ANEVDEQDDMFILVAPQNAVG 191
DW +G +I +G +++ +D+++I + P +AV
Sbjct: 74 R--------------DW-----QGVNFKIEDLGNSRSPITDKIQAEDELYIAIDP-SAVE 113
Query: 192 NCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLL 251
++ ++ + +AAG RPVIL P+L+D GI R R + + Y+ + L
Sbjct: 114 ---VEQVEKLCNAAGDRPVILFLPKLED-AAIVGIGYA-ARQLRDRFLTTLTCAYYIKPL 168
Query: 252 YYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
AL YP +++++ D + Y++L+ +KP D++++
Sbjct: 169 E--------ASALYRCYPAQWQVWQEQD-----DDYILLAECPQKPVGDELDE 208
>gi|359457624|ref|ZP_09246187.1| hypothetical protein ACCM5_02793 [Acaryochloris sp. CCMEE 5410]
Length = 234
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 63/270 (23%), Positives = 115/270 (42%), Gaps = 64/270 (23%)
Query: 69 LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 124
++AT+ A+ D +T L+VEI IPEL P + + ++ L E V+V F+D G
Sbjct: 15 VEATKAAIADHKTLLQVEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67
Query: 125 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 184
++ + +L G +++ + V ++D F+ +
Sbjct: 68 ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
AP + ++ + AG R IL+NP+L+D+ S + GR R + + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV--SIVGIGMAGRTLRERFLNNIES 156
Query: 245 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 302
CY F L GA+ +YP+ ++++ + GE ++ IL+T +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEENRPSGERI 203
Query: 303 NDAFLGKPRNREKKASG-------FWGFLS 325
+ E + G FW L+
Sbjct: 204 SQLLASLTATEEIRKPGLLDNMQQFWKALT 233
>gi|158334629|ref|YP_001515801.1| hypothetical protein AM1_1459 [Acaryochloris marina MBIC11017]
gi|158304870|gb|ABW26487.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 234
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 114/270 (42%), Gaps = 64/270 (23%)
Query: 69 LQATQRAVNDGRTRLKVEINIPELNP----EMDVYRIGTLMELVRVIALSFADDGKRVKV 124
++AT+ A+ D +T L++EI IPEL P + + ++ L E V+V F+D G
Sbjct: 15 VEATKSAIADHKTLLQIEIAIPELKPLPVAQQYLSQLPNLGENVKVF---FSDTGA---- 67
Query: 125 CVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILV 184
++ + +L G +++ + V ++D F+ +
Sbjct: 68 ---AALARHQWKDISYELRGIEELI----------------------DPVQPENDAFVFI 102
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
AP + ++ + AG R IL+NP+L+D+ S + GR R + + E+
Sbjct: 103 APTPVE----VGKVEQICSQAGDRVCILLNPKLEDV--SIVGIGMAGRTLRERFLNNIES 156
Query: 245 CYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGE--KYVILSTFTEKPSSDDI 302
CY F L GA+ +YP+ ++++ + GE ++ IL+T +PS + I
Sbjct: 157 CYSFLPLER--------GAVIRSYPSEWQIWW-----AEGEEDEHQILATEESRPSGERI 203
Query: 303 NDAFLGKPRNREKKASG-------FWGFLS 325
+ + G FW L+
Sbjct: 204 GQLLASSTTTEDNRKPGLLDNMQQFWKALT 233
>gi|428183504|gb|EKX52362.1| hypothetical protein GUITHDRAFT_157134 [Guillardia theta CCMP2712]
Length = 325
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 71/156 (45%), Gaps = 13/156 (8%)
Query: 158 GAKGTFVE---IGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILIN 214
G KG+ +E I +EV E DDMFI++ + D++ + AAG RPVIL N
Sbjct: 123 GFKGSLIERVWINKDFVSEVQEDDDMFIIIGASAQE----LPDVRELCKAAGDRPVILFN 178
Query: 215 PRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY----AGTQYPI--MGALRMTY 268
+L+ L G G+ + ++ YF Y AG + I GAL TY
Sbjct: 179 LKLQVLRGDFGLPFFPSKSLHNDWLCEALPAYFMLPKSYTKTIAGPPFLINYSGALFRTY 238
Query: 269 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
P +++ V GG +Y + ++P+ D+ +
Sbjct: 239 PGKWQMLLEVPDEDGGGRYQRVRMLDKRPALSDVRE 274
>gi|86607776|ref|YP_476538.1| hypothetical protein CYB_0277 [Synechococcus sp. JA-2-3B'a(2-13)]
gi|86556318|gb|ABD01275.1| conserved hypothetical protein [Synechococcus sp. JA-2-3B'a(2-13)]
Length = 233
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/250 (22%), Positives = 108/250 (43%), Gaps = 54/250 (21%)
Query: 66 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 125
+ + AT+ A+ G RL +EI P+L PE+ +ELV+ AL D
Sbjct: 10 EQAIGATRLAMQAGYRRLLIEILAPDLKPEVLAR---PFLELVQPPALVLFSDA------ 60
Query: 126 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVA 185
G ALA ++G+ +E+ S+ + + +LV
Sbjct: 61 -----GGAALAQR-----------------EWGSLPEGIELQSLTSRTQPTPEQSILLVM 98
Query: 186 PQNAVGNCIIDDMKAMT--DAAG-------TRPVILINPRLKDLPGSSGIMQTMGRDKRL 236
P + + +A++ D +G +PV+L+NP+L+D ++ + GR R
Sbjct: 99 PAVYSLDQVERVCQAVSGGDPSGLASKGRDPKPVVLLNPQLQD--AATVGVGLAGRRLRQ 156
Query: 237 EYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEK 296
+ ++FE Y+ R L + GAL YP+ + ++++ + Y +L TF +
Sbjct: 157 RFLSTFETSYYLRSL--------VEGALFRAYPDPWSVWQQEE----PGLYSVLKTFIAQ 204
Query: 297 PSSDDINDAF 306
P+ +++ + F
Sbjct: 205 PTGEEVAELF 214
>gi|428203362|ref|YP_007081951.1| hypothetical protein Ple7327_3166 [Pleurocapsa sp. PCC 7327]
gi|427980794|gb|AFY78394.1| protein of unknown function (DUF1995) [Pleurocapsa sp. PCC 7327]
Length = 249
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 114/243 (46%), Gaps = 59/243 (24%)
Query: 70 QATQRAVNDGRTRLKVEINIPE--LNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 127
+AT+ A+ G R++VE+ +PE L + +L+E D G ++V
Sbjct: 17 EATKVALEAGCERIQVELVVPEIALQAQSLALEFTSLLE----------DYGSGLRVLFP 66
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMF 181
+ G ALA R+ DWG+ T ++ +G+ ++ E+D F
Sbjct: 67 DT-GAAALA---------RR-----DWGE-----TVFQVSDLGSRFIPVDMKISEEDTAF 106
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
++V+P N + + + + AG RPV+L+ P+L+D+ S + R R + ++
Sbjct: 107 LVVSPSAIEVNSV----EKLCNLAGDRPVVLLIPQLEDV--SVVGIGLAARQLRERFLST 160
Query: 242 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
E+ Y+FR P+ GA+ + +YP+ ++++ G E Y +++ +KP+ +
Sbjct: 161 LESSYYFR---------PLEGAVVLRSYPSLWQVWL-----GGEEDYQLIAEEPQKPTGE 206
Query: 301 DIN 303
++
Sbjct: 207 RLD 209
>gi|443314784|ref|ZP_21044317.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
gi|442785626|gb|ELR95433.1| protein of unknown function (DUF1995) [Leptolyngbya sp. PCC 6406]
Length = 246
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 111/240 (46%), Gaps = 52/240 (21%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QAT+ A+ G +R+ V+I IPEL +I + E + F + G + KV +
Sbjct: 19 QATRAAIEAGVSRIVVDIAIPEL-------KIQPVAEQFYPL---FEEMGLQFKVYFPDA 68
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWG--DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 187
G ALA DWG D+ +G G + A+ D+ F+++ P
Sbjct: 69 -GAAALAK--------------RDWGNPDFSIRGINEMKGRLEAD-----DEAFLIIEP- 107
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ ++D+++ + A R V+++NP+L+D+ + GI T GR R + ++ E Y+
Sbjct: 108 ---SSVEVNDVESFCNEATGRFVVMLNPKLEDI-ATIGIGYT-GRQLRERFLSTLETIYY 162
Query: 248 FRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
+ P+ GA + YP ++++ + + Y +L+ F +KPS + + F
Sbjct: 163 LQ---------PLEGATILRAYPGLWQVWGE----TTDDGYELLADFPQKPSGEALEKLF 209
>gi|304570679|ref|YP_001806112.2| hypothetical protein cce_4698 [Cyanothece sp. ATCC 51142]
gi|354552131|ref|ZP_08971439.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
gi|353555453|gb|EHC24841.1| protein of unknown function DUF1995-containing protein [Cyanothece
sp. ATCC 51472]
Length = 246
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 107/235 (45%), Gaps = 56/235 (23%)
Query: 123 KVCVQGSMGEGALAGMPLQL----------AGTRKI--------LEFMDWG-------DY 157
++C++ + E AL PL L +G + I L DWG D
Sbjct: 29 RICLEIVIPEIALQAQPLALEFTTLFKGQESGLKVIFPDTGAAALARRDWGETPFQVTDL 88
Query: 158 GAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRL 217
G++ T ++ ++ E D+ F+LV+P + ++ ++ + + AG RPVIL+ P+L
Sbjct: 89 GSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLAGHRPVILLIPQL 138
Query: 218 KDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYK 276
+D+ S + R R + ++ E+ Y+FR P+ G + + +YP+ + +Y
Sbjct: 139 EDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVVLRSYPSPWLVYL 187
Query: 277 RVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGKPRNREKKASGFWGFLSSV 327
+ E Y +++ +KP + +N+A G+ N ++ G +SV
Sbjct: 188 EKE-----EGYELIAEQGQKPMGEALDILVNNALKGEEENSDQPQPKKSGIFASV 237
>gi|17230608|ref|NP_487156.1| hypothetical protein all3116 [Nostoc sp. PCC 7120]
gi|17132211|dbj|BAB74815.1| all3116 [Nostoc sp. PCC 7120]
Length = 245
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 103/248 (41%), Gaps = 50/248 (20%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+AT+ A+ DG R++VE+ PEL M + + R+KV +
Sbjct: 17 EATKSALADGCNRIQVELLFPELK----------FMPVAEQFLPCLVEYESRLKVFFADA 66
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQ 187
W D + + G I + +++ +D++F+ ++P
Sbjct: 67 GAAALARRD---------------WKDAAFQILDIGTGRIASIQSKIQPEDEIFLFISPT 111
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++ + + G RP I +NPRL+D G+ GI T R R + ++CY+
Sbjct: 112 SVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQTRERFLNIIQSCYY 165
Query: 248 FRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDA 305
R PI AL +YP +E++ D + ++ +KPS D+I+
Sbjct: 166 LR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIANLPKKPSGDEIDLI 211
Query: 306 FL-GKPRN 312
+ G+P+
Sbjct: 212 LMKGQPQT 219
>gi|218439257|ref|YP_002377586.1| hypothetical protein PCC7424_2296 [Cyanothece sp. PCC 7424]
gi|218171985|gb|ACK70718.1| conserved hypothetical protein [Cyanothece sp. PCC 7424]
Length = 253
Score = 54.3 bits (129), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 101/228 (44%), Gaps = 47/228 (20%)
Query: 59 EIRNYFYDDVLQA---TQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N ++QA T+ A+ +G R++VE+ IPE+ + IAL F
Sbjct: 3 EVPNSLEATIIQAKEATRLALEEGLGRIQVELVIPEIALS------------AQTIALEF 50
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWG-------DYGAKGTFVEIGS 168
+ S GEG P A L DWG D G++ T VE+
Sbjct: 51 --------TSLLDSYGEGLKVLFPDSGAAA---LARRDWGETPFKISDLGSRFTPVEM-- 97
Query: 169 IGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQ 228
++ +D+ F++V P + + + + D AG RPV+++ P+L+D+ GI
Sbjct: 98 ----KITPEDEAFLVVCP----SSIEVQSVAKLCDLAGDRPVVILIPQLEDV-SIVGIGY 148
Query: 229 TMGRDKRLEYAASFENCYFFRLLYYAGT--QYPIMGALRMTYPNGYEL 274
T R R + ++ ++ Y+ R L A YP + + + +GYEL
Sbjct: 149 T-ARQLRDNFLSTLQSSYYIRPLDGAVVLRAYPELWQVWLEKEDGYEL 195
>gi|428772277|ref|YP_007164065.1| hypothetical protein Cyast_0436 [Cyanobacterium stanieri PCC 7202]
gi|428686556|gb|AFZ46416.1| protein of unknown function DUF1995 [Cyanobacterium stanieri PCC
7202]
Length = 244
Score = 53.9 bits (128), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 118/245 (48%), Gaps = 52/245 (21%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
AT A+ G RL++E+ IPE+ + EL A F + G +KV +
Sbjct: 18 ATLTALESGFNRLQIELIIPEIALKAQ--------ELAFAFAQLFGEYGAGLKVFFPDT- 68
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFV--EIGSIGA---NEVDEQDDMFILVA 185
G ALA R+ +WG+ TFV ++GS + N V + D+ +++V+
Sbjct: 69 GAAALA---------RR-----EWGE----TTFVVTDLGSSRSPIENRVSDTDEFYLVVS 110
Query: 186 PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENC 245
P +AV + ++ + + AG RPVILI P+L+D+ GI R R + ++ E+
Sbjct: 111 P-SAVE---VAQVEKLCNLAGDRPVILIIPQLEDI-SVVGIGYA-ARQLRERFLSTLESS 164
Query: 246 YFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
Y++R P+ A+ M +P +++++ ++ K+ +++ +KP + ++
Sbjct: 165 YYYR---------PLEDAVVMRVFPGLWQVWRELE----DNKFELITEVPKKPLGEVLDR 211
Query: 305 AFLGK 309
+G+
Sbjct: 212 ILMGQ 216
>gi|126656438|ref|ZP_01727699.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
gi|126622124|gb|EAZ92831.1| hypothetical protein CY0110_22082 [Cyanothece sp. CCY0110]
Length = 246
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/271 (22%), Positives = 121/271 (44%), Gaps = 63/271 (23%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
+A + A+ G R+ +EI IPE+ + + +AL F +
Sbjct: 17 EAAKLALEAGVERICLEIVIPEIALQ------------AQPLALEFT------------T 52
Query: 130 MGEGALAGMPLQLAGT-RKILEFMDWG-------DYGAKGTFVEIGSIGANEVDEQDDMF 181
+ EG +G+ + T L DWG D G++ T +E ++ E D+ F
Sbjct: 53 LFEGQESGLKVIFPDTGAAALARRDWGETPFQVTDLGSRATSIE------KKISEADEAF 106
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+LV+P + ++ ++ + + AG R VIL+ P+L+D+ S + R R + ++
Sbjct: 107 LLVSP----SSVEVEIVEKLCNLAGHRSVILLIPQLEDV--SIVGIGYAARQLRERFIST 160
Query: 242 FENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
E+ Y+FR P+ G + + +YP+ + +Y + E Y +++ +KP +
Sbjct: 161 LESVYYFR---------PLDGVVVLRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGE 206
Query: 301 D----INDAFLGKPRNREKKASGFWGFLSSV 327
+N+A G+ N ++ + G +SV
Sbjct: 207 ALDTLVNNALKGEEDNSDQPQAKKSGIFASV 237
>gi|171701065|gb|ACB54046.1| unknown [Cyanothece sp. ATCC 51142]
Length = 184
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 90/187 (48%), Gaps = 38/187 (20%)
Query: 153 DWG-------DYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA 205
DWG D G++ T ++ ++ E D+ F+LV+P + ++ ++ + + A
Sbjct: 15 DWGETPFQVTDLGSRATPID------KKISEADEAFLLVSP----SSVEVEIVEKLCNLA 64
Query: 206 GTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALR 265
G RPVIL+ P+L+D+ S + R R + ++ E+ Y+FR P+ G +
Sbjct: 65 GHRPVILLIPQLEDV--SIVGIGYAARQLRERFISTLESVYYFR---------PLDGVVV 113
Query: 266 M-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD----INDAFLGKPRNREKKASGF 320
+ +YP+ + +Y + E Y +++ +KP + +N+A G+ N ++
Sbjct: 114 LRSYPSPWLVYLEKE-----EGYELIAEQGQKPMGEALDILVNNALKGEEENSDQPQPKK 168
Query: 321 WGFLSSV 327
G +SV
Sbjct: 169 SGIFASV 175
>gi|427713531|ref|YP_007062155.1| hypothetical protein Syn6312_2513 [Synechococcus sp. PCC 6312]
gi|427377660|gb|AFY61612.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 6312]
Length = 232
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 65/261 (24%), Positives = 105/261 (40%), Gaps = 58/261 (22%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
ATQ A++ G R+++EI +P+L +M L F D G +KV +
Sbjct: 17 ATQVALDAGINRIQIEILLPDLQ----------VMPLAWSYLELFQDLGVHLKVFFADAG 66
Query: 131 GEGALAGM---PL-QLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAP 186
P+ + G ++LE V + D F+ + P
Sbjct: 67 AAALARRDWQNPIFSVRGVNELLE----------------------PVQDSDQAFVCITP 104
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+ ++ M +AAG RP IL+NP+L+D+ GI GR R + + E Y
Sbjct: 105 SPVE----VSYIEQMCNAAGNRPFILLNPKLQDV-AIVGIGYA-GRQLRERFLNTLETAY 158
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
+ R L AL YP +E+++ V+ GE Y L KPS + I+
Sbjct: 159 YIRPL-------DDTTALLRAYPGDWEIWRDVN----GE-YQRLGATPAKPSGEQIDKVL 206
Query: 307 LGKPRNREKKASGFWGFLSSV 327
G+ + + G GF +S+
Sbjct: 207 YGE----QAETPGGGGFFASM 223
>gi|75911048|ref|YP_325344.1| hypothetical protein Ava_4852 [Anabaena variabilis ATCC 29413]
gi|75704773|gb|ABA24449.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 245
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/252 (22%), Positives = 102/252 (40%), Gaps = 52/252 (20%)
Query: 59 EIRNYFYDDVLQ---ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSF 115
E+ N D + Q AT+ A+ DG R++VE+ PEL M +
Sbjct: 3 ELPNSLEDAIAQSREATKAALADGCHRIQVELLFPELK----------FMPVAEQFLPCL 52
Query: 116 ADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA--NE 173
+ R+KV + W D + + G + ++
Sbjct: 53 VEYESRLKVFFADAGAAALARRD---------------WKDAPFQILDIGTGRTASIQSK 97
Query: 174 VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRD 233
+ +D++F+ ++P + + ++ + + G RP I +NPRL+D G+ GI T R
Sbjct: 98 IQPEDEIFLFISPTSVE----VPQLEKICEIIGDRPAIFLNPRLEDA-GTVGIGYT-ARQ 151
Query: 234 KRLEYAASFENCYFFRLLYYAGTQYPI--MGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
R + ++CY+ R PI AL +YP +E++ D + ++
Sbjct: 152 TRERFLNIIQSCYYLR---------PIDDETALFRSYPGDWEIWVEND-----GNWAKIA 197
Query: 292 TFTEKPSSDDIN 303
+KPS D+I+
Sbjct: 198 NLPKKPSGDEID 209
>gi|428769639|ref|YP_007161429.1| hypothetical protein Cyan10605_1269 [Cyanobacterium aponinum PCC
10605]
gi|428683918|gb|AFZ53385.1| protein of unknown function DUF1995-containing protein
[Cyanobacterium aponinum PCC 10605]
Length = 251
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/246 (21%), Positives = 115/246 (46%), Gaps = 53/246 (21%)
Query: 71 ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSM 130
A + A+ G TR+ +++ IPE+ + L + A F++ G +K+ +
Sbjct: 20 ALKVALESGCTRITIDLVIPEIALKAQY--------LAQEFADFFSEYGAGLKLFFPDT- 70
Query: 131 GEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGAN------EVDEQDDMFILV 184
G ALA R+ DWG+ T ++ IG++ ++ + D +F+++
Sbjct: 71 GAAALA---------RR-----DWGE-----TEFQVTDIGSSRSPIERKISDTDQVFLVI 111
Query: 185 APQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFEN 244
+P N + + + + AG RPVIL+ P+L+D+ S + R R + + E+
Sbjct: 112 SPSAVEVNLV----EKLCNIAGDRPVILLIPQLEDV--SIVGIGYAARQLRERFISILES 165
Query: 245 CYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
Y+FR P+ A+ + +YP ++++++ + E Y +++ +KP ++++
Sbjct: 166 AYYFR---------PLESAVVLRSYPQLWQVWQQNE---ENEDYELIAEVPQKPLGENLD 213
Query: 304 DAFLGK 309
G+
Sbjct: 214 RILQGQ 219
>gi|427724296|ref|YP_007071573.1| hypothetical protein Lepto7376_2460 [Leptolyngbya sp. PCC 7376]
gi|427356016|gb|AFY38739.1| protein of unknown function DUF1995-containing protein
[Leptolyngbya sp. PCC 7376]
Length = 251
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 116/256 (45%), Gaps = 60/256 (23%)
Query: 51 LNNIPHSREIRNYFYDDVLQ----ATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLME 106
++N+P S + D L+ AT ++ G R+ VE+ PE
Sbjct: 1 MSNLPKS-------FPDTLEQAKSATLTSLEAGCGRVLVELCFPE--------------- 38
Query: 107 LVRVIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGT-RKILEFMDWGDYGAKGTFVE 165
+ + A S A D +V V GS G+ + T L DWG+ + + +
Sbjct: 39 -IALKAQSLAWDFAQVFVKEYGS-------GLKVLFPDTGAAALAKRDWGE--VEFSVTD 88
Query: 166 IGSIG---ANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPG 222
IGS +V++ D +F++V P + + ++ + + AG RPVI++ P+L+D+
Sbjct: 89 IGSSRNPVQYKVNDADQIFLVVCPSSVE----VAQVEKLCELAGDRPVIMLIPQLEDV-- 142
Query: 223 SSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TYPNGYELYKRVDVP 281
S + R R + ++ E+ Y+ R P GA+ ++P+G+E+Y +
Sbjct: 143 SIVGIGYAARQLRERFISTLESAYYIR---------PYDGAMVWRSFPSGWEVYLEKE-- 191
Query: 282 SGGEKYVILSTFTEKP 297
GE Y +++T T+KP
Sbjct: 192 -EGE-YELIATETQKP 205
>gi|409992140|ref|ZP_11275348.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
gi|409936997|gb|EKN78453.1| hypothetical protein APPUASWS_13731 [Arthrospira platensis str.
Paraca]
Length = 262
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A+ DG ++VE+ PE+ E+ I + + AL D +V
Sbjct: 30 QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFF---PD 80
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 186
G ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 81 AGAAALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI T R R + + E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYT-ARQLRERFLNTIESCY 178
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ +D GE Y ++ + KP D +
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 222
>gi|428218666|ref|YP_007103131.1| hypothetical protein Pse7367_2442 [Pseudanabaena sp. PCC 7367]
gi|427990448|gb|AFY70703.1| protein of unknown function DUF1995-containing protein
[Pseudanabaena sp. PCC 7367]
Length = 261
Score = 46.6 bits (109), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 35/129 (27%), Positives = 58/129 (44%), Gaps = 23/129 (17%)
Query: 153 DWGDYGAKGTFVEIGSIGANE----VDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTR 208
DW D + E+ G NE + E D F+L+AP + +D ++ + AG R
Sbjct: 76 DWPDISDQ----EVSMRGVNEGRAAIREDDQAFLLIAP----SSVEVDQVEKLLALAGDR 127
Query: 209 PVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 268
P I+ NPRL++ G+ R R + +F CY+ + L G L Y
Sbjct: 128 PFIMFNPRLENSEVGIGLAT---RKMRERFLNTFTVCYYMQPLD--------AGLLWRCY 176
Query: 269 PNGYELYKR 277
P +++Y++
Sbjct: 177 PGLWQVYRQ 185
>gi|356535083|ref|XP_003536078.1| PREDICTED: uncharacterized protein LOC100803954 [Glycine max]
Length = 344
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 58/265 (21%), Positives = 108/265 (40%), Gaps = 42/265 (15%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK--RVKVCVQ 127
+A V+ G R VE+ IP+L D G L EL R+ + + K RVK
Sbjct: 107 EAAATFVSSGGMRAIVELLIPQLQFLDDEGAQGELWELSRIFLDTLIEKTKSQRVKAIFP 166
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMFILVAP 186
AG +L+ Y K S+ + V+ D++ +++ P
Sbjct: 167 D--------------AGAAALLK------YRWKDALFRFASLSDRKPVESDDEIVVMIVP 206
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
+ + +T+ P+I+ NPRL + G+ + + +R + ++F Y
Sbjct: 207 DYQMLEYVERIASNLTNVPEPTPLIMWNPRL--ISEDVGVGFNVRKLRRF-FLSTFTTVY 263
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAF 306
F R + + GA+ YP ++++ D ++Y++ F +P +DDI F
Sbjct: 264 FMRPMPF--------GAIFRCYPGLWKVFS--DDKERPDRYLLAKEFESRPDADDIELLF 313
Query: 307 LGKPRNREK------KASGFWGFLS 325
+ + E+ KA+G + +S
Sbjct: 314 ADEDQKSEQGQGLLDKAAGIFSSIS 338
>gi|428214953|ref|YP_007088097.1| hypothetical protein Oscil6304_4664 [Oscillatoria acuminata PCC
6304]
gi|428003334|gb|AFY84177.1| protein of unknown function (DUF1995) [Oscillatoria acuminata PCC
6304]
Length = 247
Score = 45.8 bits (107), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 60/231 (25%), Positives = 103/231 (44%), Gaps = 52/231 (22%)
Query: 76 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 135
++DG ++VEI IPEL V I E + + F D KV + G AL
Sbjct: 23 LDDGYRLMQVEIGIPELK----VQPIAE--EFLPIFEDLFGD---HFKVYFPDA-GAAAL 72
Query: 136 AGMPLQLAGTRKILEFMDWGD--YGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNC 193
A R+ DWG+ Y +G S ++ +D F+ V P +AV
Sbjct: 73 A---------RR-----DWGERPYIIRGL-----SEPKGQMQPEDQGFLFVEP-SAVE-- 110
Query: 194 IIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYY 253
++ ++ M + AG RP +++ P+L+++ GI GR R + ++ E+CY+ +
Sbjct: 111 -VNTLEKMCEQAGDRPTVILMPKLENV-AIIGI-GLAGRQLRERFLSTIESCYYIQ---- 163
Query: 254 AGTQYPIMG-ALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
P G A+ YP+ ++++ G+ Y ++S KP DD+
Sbjct: 164 -----PSQGYAVFRYYPSPWQVWLET-----GDTYQLISETATKPVGDDLE 204
>gi|291567271|dbj|BAI89543.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 249
Score = 45.8 bits (107), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A+ DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 17 QAATAALEDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 186
ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 71 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI R R + + E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNTIESCY 165
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ +D GE Y ++ + KP D +
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEID----GE-YQKITEQSTKPVGDQLE 209
>gi|384249997|gb|EIE23477.1| hypothetical protein COCSUDRAFT_65935 [Coccomyxa subellipsoidea
C-169]
Length = 335
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 71/145 (48%), Gaps = 21/145 (14%)
Query: 165 EIGSIGA-NEVDEQDDMFILVA--PQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLP 221
EI +I + V E DD+ I+ A PQ +++ K + + PVIL NPRL
Sbjct: 174 EIAAISERSAVAEGDDLVIIAAVDPQG------LNEAKRVAAGSSFTPVILFNPRLAS-- 225
Query: 222 GSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVP 281
G G+ + R R E+ ++F+ Y R + GT + +P ++++K D
Sbjct: 226 GDVGLGLNV-RRMRNEFLSTFQITYSLRPVNETGTVF-------RRFPGTWKVFKE-DAS 276
Query: 282 SGGEKYVILSTFTEKPSSDDINDAF 306
S G +Y + + F ++P+ DD++ F
Sbjct: 277 SPG-RYDLAAEFRDQPTGDDLDQIF 300
>gi|170076650|ref|YP_001733288.1| hypothetical protein SYNPCC7002_A0014 [Synechococcus sp. PCC 7002]
gi|169884319|gb|ACA98032.1| conserved hypothetical protein [Synechococcus sp. PCC 7002]
Length = 251
Score = 45.1 bits (105), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 76/149 (51%), Gaps = 25/149 (16%)
Query: 153 DWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRP 209
DWG+ K +IG+ + N++ + D +FI+V P +AV + ++ + + AG RP
Sbjct: 78 DWGEIPFK--VADIGTSRSPIENKIGDDDQIFIIVCP-SAVE---VGQVEKLCNLAGDRP 131
Query: 210 VILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRM-TY 268
VI++ P+L+D+ S + R R + ++ E Y+ R P A+ +Y
Sbjct: 132 VIMLIPQLEDV--SIVGIGYAARQLRERFISTLETAYYIR---------PYESAMVWRSY 180
Query: 269 PNGYELYKRVDVPSGGEKYVILSTFTEKP 297
P+ +E+Y + ++Y ++++ T KP
Sbjct: 181 PSAWEVYLEKE----EDQYELIASETTKP 205
>gi|427739188|ref|YP_007058732.1| hypothetical protein Riv7116_5820 [Rivularia sp. PCC 7116]
gi|427374229|gb|AFY58185.1| protein of unknown function (DUF1995) [Rivularia sp. PCC 7116]
Length = 245
Score = 44.3 bits (103), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 56/233 (24%), Positives = 98/233 (42%), Gaps = 54/233 (23%)
Query: 78 DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALA- 136
DG TRL+VE+ PEL + + +F + G ++K+ G ALA
Sbjct: 25 DGYTRLQVELLFPELKA----------LPAAQQFLPAFDEYGDKLKIFF-ADAGSAALAR 73
Query: 137 ----GMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGN 192
+P QL +D G G I S +V +D++F+ V P +
Sbjct: 74 RDWNEIPFQL---------LDIG----TGRMTSIES----KVQPEDEIFLFVQPSSVE-- 114
Query: 193 CIIDDMKAMTDAAG-TRPVILINPRLKDLPGSSGI-MQTMGRDKRLEYAASFENCYFFRL 250
+ ++ + G RP ++ PRL+D SS + + R R + + E+CY+ R
Sbjct: 115 --VPQLEKLCGIIGEQRPFVMFAPRLED---SSIVGIGYAARQTRQRFINTIESCYYLRP 169
Query: 251 LYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
++ A+ YP +E++ V G+ Y ++ KPS D+++
Sbjct: 170 IFEE-------AAVFRCYPGLWEVW----VEKNGD-YEKVAELPNKPSGDELD 210
>gi|428185262|gb|EKX54115.1| hypothetical protein GUITHDRAFT_100361 [Guillardia theta CCMP2712]
Length = 302
Score = 43.9 bits (102), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 94/242 (38%), Gaps = 53/242 (21%)
Query: 29 ERKSRSNQVS--SSQDNWKGIPTRLNNIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVE 86
E + R N++ ++ K P +NN+ D V ++ A++DG RL+V+
Sbjct: 5 EHQPRRNEIKFDAAARYRKAFPLGVNNV----------LDQVRESLNLALDDGLFRLRVQ 54
Query: 87 INIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEG-ALAGMPLQLAGT 145
+NI + + + + G + ELV +I + G R + M A +P L
Sbjct: 55 LNIHQFDQRLQSFSDGKMPELVAMIGELLVERGMRPSIMFNTIMDASIARTVLPDHLREK 114
Query: 146 RKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAA 205
V I +G E E+ D +LVAP N GN + ++A+
Sbjct: 115 ------------------VSINVLGLGEFGEEYDAAVLVAPSNE-GNENLQRIEAVEKLI 155
Query: 206 GT-----------------RPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFF 248
+ RP+ILINP L+ + S T + + F+ YF
Sbjct: 156 YSGPPRGLVSRSARAKYMQRPIILINPVLEAIHSLS----TPHSKVPPMFMSDFDVVYFL 211
Query: 249 RL 250
+L
Sbjct: 212 QL 213
>gi|428220338|ref|YP_007104508.1| hypothetical protein Syn7502_00205 [Synechococcus sp. PCC 7502]
gi|427993678|gb|AFY72373.1| protein of unknown function (DUF1995) [Synechococcus sp. PCC 7502]
Length = 233
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 52/243 (21%), Positives = 96/243 (39%), Gaps = 53/243 (21%)
Query: 66 DDVLQATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVC 125
+ + AT A+ G R+++++ PEL M + A +F +
Sbjct: 13 EQAIAATMSAIAAGCKRIQIDMKFPEL----------AQMPIALAFAEAFTKQYGQAWQA 62
Query: 126 VQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE----VDEQDDMF 181
+ G ALA DW + E+ G NE + F
Sbjct: 63 LFADAGTAALAKR--------------DWAN---SDILREVSMRGVNEGRAAARPESQAF 105
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+L++P + +D ++ + G RP +++NP+L++ G+ RD+ L
Sbjct: 106 LLISPTSVE----VDGIEKLLTLIGDRPFVMLNPKLENSEVGLGLAARRMRDRFLS---- 157
Query: 242 FENCYFFRLLYYAGTQYPI-MGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSD 300
F ++YY PI GA+ YP +++++ D +G E ++ E+P D
Sbjct: 158 -----IFEIVYYI---QPIEEGAVWRCYPQLWQVWQ--DKTTGME---LIYEGEERPVGD 204
Query: 301 DIN 303
D+N
Sbjct: 205 DLN 207
>gi|317969886|ref|ZP_07971276.1| hypothetical protein SCB02_10139 [Synechococcus sp. CB0205]
Length = 232
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 66/132 (50%), Gaps = 21/132 (15%)
Query: 178 DDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLE 237
D + +LVAP A +D++A+ A V+L+N +L+D G T+ R++R
Sbjct: 102 DGLLVLVAPTPA----DYEDVEAVC-AQHRGSVVLLNGKLEDAAVGIG---TVARERRKG 153
Query: 238 YAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKP 297
+ A++++ Y L G GALR YP ++LY+R P G Y + F KP
Sbjct: 154 FLATWQSAY---ALIPTGE-----GALRRAYPLDWQLYRR--DPDG---YRFVKDFELKP 200
Query: 298 SSDDINDAFLGK 309
++ + DA G+
Sbjct: 201 DAEQMADALEGE 212
>gi|357441663|ref|XP_003591109.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
gi|355480157|gb|AES61360.1| hypothetical protein MTR_1g082900 [Medicago truncatula]
Length = 327
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 58/261 (22%), Positives = 106/261 (40%), Gaps = 35/261 (13%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QAT V+ G R VE+ IP+L D L EL R+ + +Q +
Sbjct: 90 QATASFVSSGGMRAIVELLIPQLEFLDDEGAQLELWELSRLF----------LDTLIQET 139
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNA 189
+ A P AG +L++ W D A +F + VD D+M +++ P +
Sbjct: 140 NCQKVKAVFP--DAGAAALLKYR-WKD--ALFSFASLS--DRKLVDSGDEMVVMIVPDHQ 192
Query: 190 VGNCIIDDMKAMTDAAGTRPVILINPRL--KDLPGSSGIMQTMGRDKRLEYAASFENCYF 247
+ + ++D RP+I+ NPRL KD+ + R R + ++F Y
Sbjct: 193 MLEYVEKIASTLSDDP-PRPLIMWNPRLYSKDVGVGYNV-----RQLRQYFLSTFTTVYH 246
Query: 248 FRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFL 307
R + + GA+ YP ++++ D +Y++ + +P ++DI F
Sbjct: 247 MRPMPF--------GAVFRCYPEMWKVFS--DDKDRPNRYLLAKEYANRPDTEDIEMLFG 296
Query: 308 GKPRNREKKASGFWGFLSSVF 328
+ ++ + + VF
Sbjct: 297 NNEEQKSEQGQSLFDKAAGVF 317
>gi|225450962|ref|XP_002284712.1| PREDICTED: uncharacterized protein LOC100264831 [Vitis vinifera]
gi|296088326|emb|CBI36771.3| unnamed protein product [Vitis vinifera]
Length = 353
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 62/267 (23%), Positives = 112/267 (41%), Gaps = 48/267 (17%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDG--KRVKVCVQ 127
QA+ V+ G R VE+ IP+L D L EL R+ + ++ +RVK
Sbjct: 115 QASASFVSSGGMRAIVELLIPQLQFLDDEGAQAELWELSRIFIDTLIEETGCQRVKAIFP 174
Query: 128 GSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQ 187
AG +L++ WGD A F + V+ +D++ ++V P
Sbjct: 175 D--------------AGAAALLKYR-WGD--AAFGFASLSD--RKPVEAEDELVVMVVPD 215
Query: 188 NAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG-----RDKRLEYAASF 242
+ + ++D RP+I+ NPRL I + +G R R + ++F
Sbjct: 216 YQMLAYVERIASNLSDDP-PRPLIMWNPRL--------ISEDVGVGFNVRKLRRSFLSTF 266
Query: 243 ENCYFFRLLYYAGTQYPIMGALRMTYPNGYEL-YKRVDVPSGGEKYVILSTFTEKPSSDD 301
Y R L GA+ YP +++ Y D P+ +Y++ +P +++
Sbjct: 267 TVVYSMRPL--------PAGAVFRCYPGSWKVFYDDKDRPN---RYLLAKEQISRPDAEE 315
Query: 302 INDAFLGKPRNREKKASGFWGFLSSVF 328
+ F G +++ +K F+G +S+F
Sbjct: 316 LEIIF-GDVQDKSEKGPSFFGQAASIF 341
>gi|209522945|ref|ZP_03271502.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376001796|ref|ZP_09779650.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|209496532|gb|EDZ96830.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329707|emb|CCE15403.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 249
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A++DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 17 QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 70
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 186
ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 71 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 111
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI R R + E+CY
Sbjct: 112 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 165
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ ++ GE Y ++ + KP D +
Sbjct: 166 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 209
>gi|423062349|ref|ZP_17051139.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
gi|406716257|gb|EKD11408.1| hypothetical protein SPLC1_S032380 [Arthrospira platensis C1]
Length = 262
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 61/237 (25%), Positives = 103/237 (43%), Gaps = 47/237 (19%)
Query: 70 QATQRAVNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGS 129
QA A++DG ++VE+ PE+ E+ I + + AL D +V GS
Sbjct: 30 QAATAALDDGYKLIQVELVFPEI--ELQAQSIASQF----IPALEKPDTLLKVFFPDAGS 83
Query: 130 MGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA---NEVDEQDDMFILVAP 186
ALA R+ DWG+ + T +IG+ + + D F++V+P
Sbjct: 84 ---AALA---------RR-----DWGETPFRVT--DIGTSRSPVETRLQPDDGQFLVVSP 124
Query: 187 QNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCY 246
++ ++ + AG R V+L+NPRL+D+ GI R R + E+CY
Sbjct: 125 ----SPVEVNQVENLHKLAGDRSVVLLNPRLEDV-AIIGIGYA-ARQLRERFLNIIESCY 178
Query: 247 FFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIN 303
+ + L A AL YP +E++ ++ GE Y ++ + KP D +
Sbjct: 179 YLKPLDGA--------ALFRCYPGTWEVWLEIE----GE-YQKITEQSTKPVGDQLE 222
>gi|299116127|emb|CBN76036.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 284
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 58/268 (21%), Positives = 110/268 (41%), Gaps = 48/268 (17%)
Query: 53 NIPHSREIRNYFYDDVLQATQRAVNDGRTRLKVEINIPEL-NPEMDVYRIGTLMELVRVI 111
+P S +++ + +A +A++DG+TR + +++P + E+D + G
Sbjct: 48 ELPMSNDVQ---VEQATRAVWQALSDGKTRQILRLSLPLIGKTEIDDWPGG--------- 95
Query: 112 ALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGA 171
D +R K C G M E L G P G + + +D D +G +
Sbjct: 96 ------DRQRYKAC--GPMVESVLRGNPAAKEGASIVEQILDESD--------AVGLMQL 139
Query: 172 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 231
+ +DD +++ P + + + AG+R V L+NP + G+ Q G
Sbjct: 140 QCKEAKDDASVMIFPTTDT----LSLQQQVAAGAGSRLVALVNPAWRS-SSDFGLFQA-G 193
Query: 232 RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
K E + FE Y + L G ++ YP + L+ + G K +L
Sbjct: 194 EAK--ETLSEFEVTYSLQSLVVYGYTLRLL----YCYPGPWRLFL---IGEGSGKRELLG 244
Query: 292 TFTEKPSSDDINDAFL----GKPRNREK 315
++ + P ++D ++ + GKP E+
Sbjct: 245 SWDQSPPTNDQVESAVFDKKGKPNASER 272
>gi|422294314|gb|EKU21614.1| hypothetical protein NGA_0177102 [Nannochloropsis gaditana CCMP526]
Length = 375
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 31/116 (26%), Positives = 53/116 (45%), Gaps = 10/116 (8%)
Query: 195 IDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAASFENCYFFRLLYYA 254
+ D++A+ A T+P+I N L+ G G+ +D + ++ + Y R YA
Sbjct: 221 LPDLEALHQADPTKPLIFFNLNLETHRGDLGLPAFPSKDLHYRFLSNIKPVYLLRTRQYA 280
Query: 255 GT--QYPIM----GALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDIND 304
T + P + GAL TYP GY+ + +G +Y + T E+P+ D
Sbjct: 281 QTLSRPPFILNYQGALFRTYPGGYQCM----LDTGNGRYRRVETSRERPALSGFKD 332
>gi|297812961|ref|XP_002874364.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
gi|297320201|gb|EFH50623.1| hypothetical protein ARALYDRAFT_489571 [Arabidopsis lyrata subsp.
lyrata]
Length = 340
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 60/249 (24%), Positives = 99/249 (39%), Gaps = 43/249 (17%)
Query: 76 VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGAL 135
VN G TR VE+ IP+L D L +L RV + ++ + E
Sbjct: 110 VNSGGTRAIVELLIPQLQFLDDEGAQAELWDLSRVF----------LDTLIKETGCERVK 159
Query: 136 AGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANEVDEQDDMFILVAPQNAVGNCII 195
A P AG +L++ W D TF V+++D++ ++V P + +
Sbjct: 160 AVFP--DAGAAALLKYR-WKD----ATFGFASLSDRKPVEKEDEIIVMVVPDYQMLEYVE 212
Query: 196 DDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG-----RDKRLEYAASFENCYFFRL 250
K + D RP+I+ NPRL I + +G R R + +SF Y R
Sbjct: 213 KIAKGLADDP-PRPLIMWNPRL--------ISEEVGVGFNVRKLRRYFLSSFTTVYSMRP 263
Query: 251 LYYAGTQYPIMGALRMTYPNGYEL-YKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 309
L GA+ YP +++ Y D P +Y++ +P ++D+ + G
Sbjct: 264 L--------AAGAVFRCYPGKWKVFYDNKDRPG---RYLLAKELIGRPDAEDLEIIYGGV 312
Query: 310 PRNREKKAS 318
E+ S
Sbjct: 313 EEKSEEGPS 321
>gi|123966201|ref|YP_001011282.1| hypothetical protein P9515_09681 [Prochlorococcus marinus str. MIT
9515]
gi|123200567|gb|ABM72175.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9515]
Length = 215
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 66/140 (47%), Gaps = 32/140 (22%)
Query: 170 GANE---VDEQDDMFILVAPQN---AVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGS 223
G NE +D+ D + I + PQ + + ++ K + + +NP+ +D +
Sbjct: 97 GFNESIYIDQDDSIVISILPQPYDFGLFEPMCENFKGLHYS--------LNPKFED--AN 146
Query: 224 SGIMQTMGRDKRLEYAASFENCYFFRLLYYAGTQYPIM-GALRMTYPNGYELYKRVDVPS 282
GI + R++R + +++N YF + PI GAL +YPN + L+K ++
Sbjct: 147 IGIGSVI-RERRKNFVKTWQNIYFLQ---------PINNGALMHSYPNNWLLFKEIN--- 193
Query: 283 GGEKYVILSTFTEKPSSDDI 302
KY+ F EKP ++ I
Sbjct: 194 --NKYLFKKEFEEKPDNETI 211
>gi|52079804|ref|YP_078595.1| hypothetical protein BL03758 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|319646386|ref|ZP_08000616.1| hypothetical protein HMPREF1012_01652 [Bacillus sp. BT1B_CT2]
gi|404488678|ref|YP_006712784.1| hypothetical protein BLi01406 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|52003015|gb|AAU22957.1| hypothetical protein BL03758 [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|52347676|gb|AAU40310.1| putative membrane protein [Bacillus licheniformis DSM 13 = ATCC
14580]
gi|317392136|gb|EFV72933.1| hypothetical protein HMPREF1012_01652 [Bacillus sp. BT1B_CT2]
Length = 249
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 30/50 (60%), Gaps = 5/50 (10%)
Query: 110 VIALSFADDGKRVKVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGA 159
++ALSF V VC+ + GAL+ +PL LAGT + ++M+W GA
Sbjct: 175 IMALSFL-----VIVCLSFRLKAGALSILPLLLAGTHFLFQYMNWDSAGA 219
>gi|324998739|ref|ZP_08119851.1| UDP-N-acetylmuramate--L-alanine ligase [Pseudonocardia sp. P1]
Length = 478
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 55/130 (42%), Gaps = 5/130 (3%)
Query: 85 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM-PLQLA 143
V I E NPE+ R L + R +AL+ +G+R+ V V G+ G+ + M + L
Sbjct: 78 VSTAIRETNPELAAARDQGLQVVHRAVALAALTEGRRL-VAVAGTHGKTSTTSMVTVALQ 136
Query: 144 GTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNA-VGNCIIDDMKA 200
T F GD G+ GS + E DE D F AP A V N D +
Sbjct: 137 HTGADPSFAIGGDLSVSGSGAHHGSGDVFVVEADESDGSFTAFAPSVAVVTNVEPDHLDH 196
Query: 201 MTDAAGTRPV 210
D A R V
Sbjct: 197 HGDEATYRAV 206
>gi|284991684|ref|YP_003410238.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
43160]
gi|284064929|gb|ADB75867.1| UDP-N-acetylmuramate/alanine ligase [Geodermatophilus obscurus DSM
43160]
Length = 489
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 48/111 (43%), Gaps = 8/111 (7%)
Query: 85 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM---PLQ 141
V I E NPE+ R L L R +AL+ GKR V V G+ G+ + M +Q
Sbjct: 84 VSTAIREDNPELAAARERGLRVLPRAVALAAVMAGKR-SVAVAGTHGKTSTTSMLTVAVQ 142
Query: 142 LAGTRKILEFMDWGDYGAKGTFVEIGS--IGANEVDEQDDMFILVAPQNAV 190
G F GD G+ G + E DE D F+L+AP A+
Sbjct: 143 ACGVDP--SFAIGGDLNESGSNAHAGGGDVFVAEADESDRSFLLLAPHGAI 191
>gi|255087973|ref|XP_002505909.1| predicted protein [Micromonas sp. RCC299]
gi|226521180|gb|ACO67167.1| predicted protein [Micromonas sp. RCC299]
Length = 299
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 25/101 (24%), Positives = 47/101 (46%), Gaps = 8/101 (7%)
Query: 213 INPRLKDLPGSSGIMQTMG----RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTY 268
IN +++ S ++ +G R K E+ ASFE Y+ G ++ +Y
Sbjct: 169 INNQIRTNDDGSNLISDLGIGPWRKKNEEFLASFELTYWISEQRIQGETVRLL----KSY 224
Query: 269 PNGYELYKRVDVPSGGEKYVILSTFTEKPSSDDINDAFLGK 309
P+ ++L+ D+ SG + + TF EKPS ++ + +
Sbjct: 225 PHPWQLFVLTDMASGSAEPECIKTFDEKPSYKELESLLMSR 265
>gi|356576779|ref|XP_003556507.1| PREDICTED: uncharacterized protein LOC100782973 isoform 1 [Glycine
max]
Length = 340
Score = 38.1 bits (87), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 57/270 (21%), Positives = 112/270 (41%), Gaps = 45/270 (16%)
Query: 68 VLQATQRA---VNDGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGK--RV 122
V Q+++ A V+ G R VE+ IP+L D L EL R+ + + K RV
Sbjct: 98 VRQSSEAAAIFVSSGGIRAIVELLIPQLQFLDDEGAQVELWELSRIFLDTLIEKTKFQRV 157
Query: 123 KVCVQGSMGEGALAGMPLQLAGTRKILEFMDWGDYGAKGTFVEIGSIGANE-VDEQDDMF 181
K AG +L+ Y K S+ + V+ D++
Sbjct: 158 KAIFPD--------------AGAAALLK------YRWKDALFRFASLSDRKPVESDDEVV 197
Query: 182 ILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMGRDKRLEYAAS 241
+++ P + + +++ P+I+ NPRL + G+ + + +R+ + ++
Sbjct: 198 VMIVPDYQMLEYVERIASNLSNIPEPTPLIMWNPRL--ISEDVGVGFNVRKLRRV-FLST 254
Query: 242 FENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILSTFTEKPSSDD 301
F YF R + + GA+ YP ++++ D ++Y++ F +P ++D
Sbjct: 255 FTTVYFMRPMPF--------GAIFRCYPGLWKVFS--DDKERPDRYLLAKEFEIRPDAED 304
Query: 302 INDAFLGKPRNREK------KASGFWGFLS 325
I F + + E+ KA+G + +S
Sbjct: 305 IELLFADEEQKSEQGQGLLDKAAGIFSSIS 334
>gi|300788096|ref|YP_003768387.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
U32]
gi|384151526|ref|YP_005534342.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
S699]
gi|399539979|ref|YP_006552641.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
S699]
gi|299797610|gb|ADJ47985.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
U32]
gi|340529680|gb|AEK44885.1| UDP-N-acetylmuramate--L-alanine ligase [Amycolatopsis mediterranei
S699]
gi|398320749|gb|AFO79696.1| UDP-N-acetylmuramate--alanine ligase [Amycolatopsis mediterranei
S699]
Length = 465
Score = 38.1 bits (87), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 45/109 (41%), Gaps = 4/109 (3%)
Query: 85 VEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQGSMGEGALAGM-PLQLA 143
V I E NPE+ R + L R AL+ +G RV C+ G+ G+ + M + L
Sbjct: 77 VSTAIKETNPELAAARAAGIPVLHRAQALAGLMEGHRVA-CIAGTHGKTSTTSMLTVALQ 135
Query: 144 GTRKILEFMDWGDYGAKGTFVEIGSIGA--NEVDEQDDMFILVAPQNAV 190
R F GD G G G E DE D F+ +P AV
Sbjct: 136 HCRLDPSFAIGGDLNESGANAHHGEGGVFVAEADESDGSFLTYSPSVAV 184
>gi|383830926|ref|ZP_09986015.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
xinjiangensis XJ-54]
gi|383463579|gb|EID55669.1| UDP-N-acetylmuramate--alanine ligase [Saccharomonospora
xinjiangensis XJ-54]
Length = 474
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 40/126 (31%), Positives = 53/126 (42%), Gaps = 9/126 (7%)
Query: 73 QRAVN-----DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVKVCVQ 127
QRA N D + + V I E NPE+ V R + L R AL+ +G RV C+
Sbjct: 66 QRAENLDVFDDAPSAVVVSTAIKETNPELVVARERGVPVLHRAEALAALMEGHRVA-CIA 124
Query: 128 GSMGEGALAGM-PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG--ANEVDEQDDMFILV 184
G+ G+ + M + L R F GD G G G E DE D F+
Sbjct: 125 GTHGKTSTTSMLTVALQHCRLDPSFAIGGDLNESGANAHHGEGGIFVAEADESDGSFLSY 184
Query: 185 APQNAV 190
+P AV
Sbjct: 185 SPSVAV 190
>gi|72382164|ref|YP_291519.1| hypothetical protein PMN2A_0324 [Prochlorococcus marinus str.
NATL2A]
gi|72002014|gb|AAZ57816.1| conserved hypothetical protein [Prochlorococcus marinus str.
NATL2A]
Length = 206
Score = 37.7 bits (86), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 21/131 (16%)
Query: 172 NEVDEQDDMFILVAPQNAVGNCIIDDMKAMTDAAGTRPVILINPRLKDLPGSSGIMQTMG 231
N D + + ++ +PQ I+ + + A + VI+IN +L+D G ++G
Sbjct: 93 NSTDLSNSILLVCSPQPYD----IEMFEQVCSHAKSN-VIMINGKLEDPIVGIG---SVG 144
Query: 232 RDKRLEYAASFENCYFFRLLYYAGTQYPIMGALRMTYPNGYELYKRVDVPSGGEKYVILS 291
R+ R +A ++ YF + L MGAL YPN +EL+K Y +
Sbjct: 145 REMRKRFAKKWKVLYFVQPLS--------MGALLKRYPNDWELFKL-----NNNGYTFVK 191
Query: 292 TFTEKPSSDDI 302
+F +P + I
Sbjct: 192 SFINRPDDETI 202
>gi|418461004|ref|ZP_13032086.1| UDP-N-acetylmuramate--L-alanine ligase [Saccharomonospora azurea
SZMC 14600]
gi|359738959|gb|EHK87837.1| UDP-N-acetylmuramate--L-alanine ligase [Saccharomonospora azurea
SZMC 14600]
Length = 450
Score = 37.4 bits (85), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 53/130 (40%), Gaps = 9/130 (6%)
Query: 69 LQATQRAVN-----DGRTRLKVEINIPELNPEMDVYRIGTLMELVRVIALSFADDGKRVK 123
++ QRA N G + + V I E NPE+ R + L R AL+ +G RV
Sbjct: 62 IEVGQRAENLDVFDGGPSAVIVSTAIKETNPELAAARERDVPVLHRAEALAALMEGHRVA 121
Query: 124 VCVQGSMGEGALAGM-PLQLAGTRKILEFMDWGDYGAKGTFVEIGSIG--ANEVDEQDDM 180
C+ G+ G+ + M + L R F GD G G G E DE D
Sbjct: 122 -CIAGTHGKTSTTSMLTVALQHCRLDPSFAIGGDLNESGANAHHGEGGIFVAEADESDGS 180
Query: 181 FILVAPQNAV 190
F+ P AV
Sbjct: 181 FLSYTPSVAV 190
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.136 0.398
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,046,264,803
Number of Sequences: 23463169
Number of extensions: 218477110
Number of successful extensions: 474372
Number of sequences better than 100.0: 143
Number of HSP's better than 100.0 without gapping: 38
Number of HSP's successfully gapped in prelim test: 105
Number of HSP's that attempted gapping in prelim test: 474177
Number of HSP's gapped (non-prelim): 164
length of query: 328
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 186
effective length of database: 9,027,425,369
effective search space: 1679101118634
effective search space used: 1679101118634
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)