BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 029172
         (198 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255583634|ref|XP_002532572.1| conserved hypothetical protein [Ricinus communis]
 gi|223527699|gb|EEF29806.1| conserved hypothetical protein [Ricinus communis]
          Length = 280

 Score =  207 bits (528), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 112/173 (64%), Positives = 130/173 (75%), Gaps = 2/173 (1%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA +SISPLSIKS+N   SSS+ PY L + SKP  + CQ++  TE + +  DCS  +   
Sbjct: 1   MAFTSISPLSIKSVNISPSSSRSPYHLPSQSKPFHILCQLA--TEREDRILDCSTTRYKV 58

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
            ++K KNWR  VSTALAAA   +    + A ADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 59  HHSKPKNWRTLVSTALAAAAAVNLGFGLPAAADLNKFEAELRGEFGIGSAAQFGSADLRK 118

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFT  ++   L+
Sbjct: 119 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLM 171


>gi|224071571|ref|XP_002303521.1| predicted protein [Populus trichocarpa]
 gi|222840953|gb|EEE78500.1| predicted protein [Populus trichocarpa]
          Length = 275

 Score =  202 bits (514), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 107/173 (61%), Positives = 127/173 (73%), Gaps = 7/173 (4%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MA +SIS +SIKS N  +     P+++ +LSKP  +A Q+   TE   QF DCS N    
Sbjct: 1   MAFTSISSMSIKSPNIST-----PHRILSLSKPFRIAYQLD--TERGNQFADCSKNGYEV 53

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             AK KNW   VST L AA ++  S N+ A+ADLN++EAETRGEFGIGSAAQFGSADLRK
Sbjct: 54  ETAKAKNWARVVSTTLVAAAISFSSCNLPAVADLNRFEAETRGEFGIGSAAQFGSADLRK 113

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           AVH+ ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANFT  ++   L+
Sbjct: 114 AVHLNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLM 166


>gi|449459702|ref|XP_004147585.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cucumis sativus]
 gi|449520611|ref|XP_004167327.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Cucumis sativus]
          Length = 279

 Score =  187 bits (476), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 102/173 (58%), Positives = 123/173 (71%), Gaps = 4/173 (2%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSSIS LS+K L   SS S+ P  L    K + +  QI+ + +   Q  DCS  +  G
Sbjct: 1   MALSSISSLSVKCLPLNSSKSRHPCSLQT-RKQISMVSQINPQKD---QTQDCSERKHIG 56

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
              + K W+  VSTALAAA V   SS + ++A+LNKYEA+TRGEFGIGSAAQ+GSADLRK
Sbjct: 57  KITEPKRWQKLVSTALAAAAVIGFSSGMPSVAELNKYEADTRGEFGIGSAAQYGSADLRK 116

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           AVH+ ENFRRANFTSADMRESDFSG  FNGAYLEKAVAYK NF+  ++   L+
Sbjct: 117 AVHINENFRRANFTSADMRESDFSGCTFNGAYLEKAVAYKTNFSGADLSDTLM 169


>gi|297741150|emb|CBI31881.3| unnamed protein product [Vitis vinifera]
          Length = 261

 Score =  186 bits (473), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 109/173 (63%), Positives = 122/173 (70%), Gaps = 21/173 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L + SKP  V C+I  +    G +  C  N    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 43  --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 99

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFT  ++   L+
Sbjct: 100 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFTGADLSDTLM 152


>gi|297741151|emb|CBI31882.3| unnamed protein product [Vitis vinifera]
          Length = 201

 Score =  186 bits (472), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 105/164 (64%), Positives = 115/164 (70%), Gaps = 19/164 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L +LSKP  V C+I  + E         NN    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43  ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ T
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLT 145


>gi|359474379|ref|XP_002265958.2| PREDICTED: uncharacterized protein LOC100250522 isoform 2 [Vitis
           vinifera]
          Length = 596

 Score =  185 bits (470), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 108/164 (65%), Positives = 118/164 (71%), Gaps = 21/164 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L + SKP  V C+I  +    G +  C  N    
Sbjct: 336 MALSSVSPLYI---------SKSPNHLQSPSKPFTVVCRIELQR---GNY--CRAN---- 377

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYEAETRGEFGIGSAAQFGSADLRK
Sbjct: 378 --AESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEAETRGEFGIGSAAQFGSADLRK 434

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           AVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANFT
Sbjct: 435 AVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFT 478



 Score =  185 bits (469), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 105/164 (64%), Positives = 115/164 (70%), Gaps = 19/164 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MALSS+SPL I         SK P  L +LSKP  V C+I  + E         NN    
Sbjct: 1   MALSSVSPLYI---------SKSPNHLRSLSKPFTVVCRIERQRE---------NNWRGE 42

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             A+ K W+  VSTALAAAVV + S  + A+ADLNKYE ETRGEFGIGSAAQFGSADLRK
Sbjct: 43  ANAESKKWQRLVSTALAAAVV-TLSPVMPAVADLNKYEVETRGEFGIGSAAQFGSADLRK 101

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           AVHV ENFRRANFTSADMRESDFSGS FNG YLEKAVAYKA+ T
Sbjct: 102 AVHVNENFRRANFTSADMRESDFSGSTFNGEYLEKAVAYKASLT 145


>gi|388505216|gb|AFK40674.1| unknown [Lotus japonicus]
          Length = 273

 Score =  174 bits (442), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 106/179 (59%), Positives = 124/179 (69%), Gaps = 24/179 (13%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQL------HALSKPLWVACQISSKTESDGQFPDCS 54
           MAL+S+SPLSI ++N    SS+   +L      H  S P+ V CQ++S  +     P  S
Sbjct: 2   MALNSLSPLSI-NINSLHVSSRPTSELSNSLHFHPKSSPI-VLCQMNSNRD----HPQES 55

Query: 55  NNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
                      K W   VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFG
Sbjct: 56  -----------KKWGKLVSATLAAAVIA-FSSDMSALADLNKFEAEIRGEFGIGSAAQFG 103

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           SADLRKAVHV ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 104 SADLRKAVHVNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 162


>gi|255638223|gb|ACU19425.1| unknown [Glycine max]
          Length = 199

 Score =  173 bits (438), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 105/173 (60%), Positives = 126/173 (72%), Gaps = 17/173 (9%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S+SPLSI SL+  SSS+      H+ S P+ V CQI+S  +         + Q + 
Sbjct: 2   MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPV-VVCQINSNRD---------HRQEST 51

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
            + K+      VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 52  KWGKV------VSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 104

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 105 AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 157


>gi|356540500|ref|XP_003538726.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Glycine max]
          Length = 260

 Score =  165 bits (418), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 100/173 (57%), Positives = 119/173 (68%), Gaps = 22/173 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S+SPLSI SL+  SSS+      H+ S P+ V    ++++                
Sbjct: 1   MALNSLSPLSINSLHVSSSSTSKISHSHSKSFPVVVKSVANAES---------------- 44

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                  W   VS  LAAAV+A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45  -----TKWGKVVSATLAAAVIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 98

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           AVHV ENFRRANFT+ADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 99  AVHVNENFRRANFTAADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 151


>gi|357481967|ref|XP_003611269.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512604|gb|AES94227.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 147

 Score =  163 bits (412), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 95/164 (57%), Positives = 110/164 (67%), Gaps = 20/164 (12%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S +PLSI S +            +  S  +  + Q+  K   +   P  SN     
Sbjct: 1   MALNSFTPLSINSHH---------VSCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47  -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
            VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFT
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFT 144


>gi|357481963|ref|XP_003611267.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512602|gb|AES94225.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 262

 Score =  162 bits (411), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 96/173 (55%), Positives = 114/173 (65%), Gaps = 20/173 (11%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S +PLSI S +            +  S  +  + Q+  K   +   P  SN     
Sbjct: 1   MALNSFTPLSINSHHV---------SCYPSSSKVSKSSQVICKMSLNNDHPQESN----- 46

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 47  -----KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKK 100

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
            VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFT  ++   L+
Sbjct: 101 TVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLM 153


>gi|357481965|ref|XP_003611268.1| Thylakoid lumenal protein [Medicago truncatula]
 gi|355512603|gb|AES94226.1| Thylakoid lumenal protein [Medicago truncatula]
          Length = 232

 Score =  161 bits (408), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 81/108 (75%), Positives = 91/108 (84%), Gaps = 1/108 (0%)

Query: 66  KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
           KNW   VS  LAAAV+   SS++SALADLNK+EAE RGEFGIGSAAQFGSADL+K VHV 
Sbjct: 17  KNWGKLVSATLAAAVIV-FSSDMSALADLNKFEAEVRGEFGIGSAAQFGSADLKKTVHVN 75

Query: 126 ENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           ENFRRANFTSADMRESDFSGS FNGAY+EKAVA+KANFT  ++   L+
Sbjct: 76  ENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFKANFTGADLSDTLM 123


>gi|356495617|ref|XP_003516671.1| PREDICTED: LOW QUALITY PROTEIN: thylakoid lumenal protein
           At1g12250, chloroplastic-like [Glycine max]
          Length = 222

 Score =  158 bits (399), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 97/163 (59%), Positives = 112/163 (68%), Gaps = 23/163 (14%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL+S SPLS+ SL+  S SS    +  + S P  V CQ +S  +               
Sbjct: 1   MALNSFSPLSVNSLHVSSISSSKISRSLSKSFP--VVCQTNSNRDH-------------- 44

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
                +   V VS  LAAA++A  SS++SALADLNK+EAE RGEFGIGSAAQFGSADLRK
Sbjct: 45  -----RQGNV-VSATLAAAIIA-FSSDMSALADLNKFEAEMRGEFGIGSAAQFGSADLRK 97

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           AVHV ENFR +NFT+ADMRESDFSGS FNGAYLEKAVAYKANF
Sbjct: 98  AVHVNENFRXSNFTAADMRESDFSGSTFNGAYLEKAVAYKANF 140


>gi|116785652|gb|ABK23807.1| unknown [Picea sitchensis]
          Length = 291

 Score =  152 bits (384), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 81/134 (60%), Positives = 96/134 (71%), Gaps = 6/134 (4%)

Query: 40  ISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEA 99
           I+ K  +D    D    Q A    + KNW+  ++ ALA  V+ +    ++A ADLNKYEA
Sbjct: 52  ITGKISTDQHKKDA---QPASATPESKNWQRCLAAALATIVIGT---GMNAEADLNKYEA 105

Query: 100 ETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
           ETRGEFGIGSAAQFGSA+LRK VH  ENFRRANFTSAD+RESDFSGS FNGAYLEKAVAY
Sbjct: 106 ETRGEFGIGSAAQFGSAELRKTVHANENFRRANFTSADIRESDFSGSTFNGAYLEKAVAY 165

Query: 160 KANFTVDEICLPLL 173
           K NFT  ++   L+
Sbjct: 166 KTNFTGADLSDTLM 179


>gi|18391370|ref|NP_563902.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75151954|sp|Q8H1Q1.1|TL225_ARATH RecName: Full=Thylakoid lumenal protein At1g12250, chloroplastic;
           Flags: Precursor
 gi|23297125|gb|AAN13098.1| unknown protein [Arabidopsis thaliana]
 gi|332190736|gb|AEE28857.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 280

 Score =  148 bits (374), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 98/176 (55%), Positives = 125/176 (71%), Gaps = 8/176 (4%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
           MA SS+SPL +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN +   
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58

Query: 58  CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           C+   A+   W+  +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59  CSS--AESNTWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           L K VH  ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 171


>gi|14334898|gb|AAK59627.1| unknown protein [Arabidopsis thaliana]
          Length = 280

 Score =  148 bits (374), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 98/176 (55%), Positives = 125/176 (71%), Gaps = 8/176 (4%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--- 57
           MA SS+SPL +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN +   
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTREGC 58

Query: 58  CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           C+   A+   W+  +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSAD
Sbjct: 59  CSS--AESNKWKRILSAAMAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSAD 115

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           L K VH  ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 116 LSKTVHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 171


>gi|145323868|ref|NP_001077523.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
 gi|332190737|gb|AEE28858.1| Pentapeptide repeat-containing protein [Arabidopsis thaliana]
          Length = 206

 Score =  145 bits (366), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 72/98 (73%), Positives = 85/98 (86%), Gaps = 1/98 (1%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           +AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH  ENFRRANFTS
Sbjct: 1   MAAAVIAS-SSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 59

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           ADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 60  ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 97


>gi|297844088|ref|XP_002889925.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335767|gb|EFH66184.1| hypothetical protein ARALYDRAFT_471375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 280

 Score =  145 bits (365), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 100/179 (55%), Positives = 127/179 (70%), Gaps = 14/179 (7%)

Query: 1   MALSSISPLSIKSLNFCSSSSKG---PYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ 57
           MA SS+SPL +KSL+   SSS     PY  H    PL    Q+SS++ S  +  D SN +
Sbjct: 1   MAFSSLSPLPMKSLDISRSSSSVSRSPY--HYQRYPLR-RLQLSSRSNS--EIKDSSNAR 55

Query: 58  ---CAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFG 114
              C+   ++   W+  +S A+AAAV+AS SS + A+A+LN++EA+TRGEFGIGSAAQ+G
Sbjct: 56  EGCCS--RSESNTWKRILSAAMAAAVIASSSS-VPAMAELNRFEADTRGEFGIGSAAQYG 112

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           SADL K +H  ENFRRANFTSADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 113 SADLSKTIHSNENFRRANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 171


>gi|212721536|ref|NP_001132582.1| uncharacterized protein LOC100194053 [Zea mays]
 gi|194694816|gb|ACF81492.1| unknown [Zea mays]
 gi|195647732|gb|ACG43334.1| hypothetical protein [Zea mays]
 gi|413937988|gb|AFW72539.1| hypothetical protein ZEAMMB73_749291 [Zea mays]
          Length = 268

 Score =  144 bits (363), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 70/86 (81%), Positives = 76/86 (88%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           + A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS 
Sbjct: 74  MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 133

Query: 148 FNGAYLEKAVAYKANFTVDEICLPLL 173
           FNGAYLEKAVAYKANFT  ++   L+
Sbjct: 134 FNGAYLEKAVAYKANFTGADLSDTLM 159


>gi|242066558|ref|XP_002454568.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
 gi|241934399|gb|EES07544.1| hypothetical protein SORBIDRAFT_04g033580 [Sorghum bicolor]
          Length = 270

 Score =  144 bits (362), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 70/86 (81%), Positives = 76/86 (88%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           + A ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFTSADMRESDFSGS 
Sbjct: 76  MPAYADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTSADMRESDFSGST 135

Query: 148 FNGAYLEKAVAYKANFTVDEICLPLL 173
           FNGAYLEKAVAYKANFT  ++   L+
Sbjct: 136 FNGAYLEKAVAYKANFTGADLSDTLM 161


>gi|357136761|ref|XP_003569972.1| PREDICTED: thylakoid lumenal protein At1g12250, chloroplastic-like
           [Brachypodium distachyon]
          Length = 268

 Score =  140 bits (354), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 67/86 (77%), Positives = 75/86 (87%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           + A ADLNK+EAE RGEFGIGSAAQFG+ADL+K VHV ENFRRANFTSADMRESDFSGS 
Sbjct: 74  MPAYADLNKFEAEQRGEFGIGSAAQFGNADLKKTVHVNENFRRANFTSADMRESDFSGST 133

Query: 148 FNGAYLEKAVAYKANFTVDEICLPLL 173
           FNGAY+EKAVAYKANFT  ++   L+
Sbjct: 134 FNGAYMEKAVAYKANFTGADLSDTLM 159


>gi|125540470|gb|EAY86865.1| hypothetical protein OsI_08249 [Oryza sativa Indica Group]
          Length = 276

 Score =  139 bits (349), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 92/173 (53%), Positives = 114/173 (65%), Gaps = 6/173 (3%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG 60
           MAL + SPL+  +   C+  +    +   L +   V+CQ +     DG     S +  A 
Sbjct: 1   MALPTTSPLAAAAARPCAFPTPWRCRSPPLRRLPHVSCQANRGGSRDGN--SLSTSAAAA 58

Query: 61  PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK 120
             +    WR  VS ALAAA+V++      A ADLNK+EAE RGEFGIGSAAQFGSADL+K
Sbjct: 59  AASPPPRWRAAVSAALAAAIVSA----APAYADLNKFEAEQRGEFGIGSAAQFGSADLKK 114

Query: 121 AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           AVHV ENFRRANFT+ADMRES+FSGS FNGAYLEKAVAY+ANFT  ++   L+
Sbjct: 115 AVHVNENFRRANFTAADMRESNFSGSTFNGAYLEKAVAYRANFTGADLSDTLM 167


>gi|115447561|ref|NP_001047560.1| Os02g0643500 [Oryza sativa Japonica Group]
 gi|49388647|dbj|BAD25782.1| thylakoid lumenal protein-like [Oryza sativa Japonica Group]
 gi|113537091|dbj|BAF09474.1| Os02g0643500 [Oryza sativa Japonica Group]
 gi|125583041|gb|EAZ23972.1| hypothetical protein OsJ_07699 [Oryza sativa Japonica Group]
 gi|215687060|dbj|BAG90906.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 277

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 66/82 (80%), Positives = 74/82 (90%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           ADLNK+EAE RGEFGIGSAAQFGSADL+KAVHV ENFRRANFT+ADMRES+FSGS FNGA
Sbjct: 87  ADLNKFEAEQRGEFGIGSAAQFGSADLKKAVHVNENFRRANFTAADMRESNFSGSTFNGA 146

Query: 152 YLEKAVAYKANFTVDEICLPLL 173
           YLEKAVAY+ANFT  ++   L+
Sbjct: 147 YLEKAVAYRANFTGADLSDTLM 168


>gi|10086510|gb|AAG12570.1|AC022522_3 Hypothetical protein [Arabidopsis thaliana]
          Length = 293

 Score =  137 bits (345), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 87/163 (53%), Positives = 110/163 (67%), Gaps = 8/163 (4%)

Query: 11  IKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV 70
           +KSL+   SSS      +   + L    Q+SS+  S+ +  D SN       A+   W+ 
Sbjct: 1   MKSLDISRSSSSVSRSPYHFQRYLLRRLQLSSR--SNLEIKDSSNTS-----AESNTWKR 53

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
            +S A  AA V + SS + A+A+LN++EA+TRGEFGIGSAAQ+GSADL K VH  ENFRR
Sbjct: 54  ILSAA-MAAAVIASSSGVPAMAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRR 112

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           ANFTSADMRESDFSGS FNGAYLEKAVAYKANF+  ++   L+
Sbjct: 113 ANFTSADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLM 155


>gi|326490876|dbj|BAJ90105.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 267

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 95/175 (54%), Positives = 115/175 (65%), Gaps = 19/175 (10%)

Query: 1   MALSSISPLSIKSLNFCSSSSKGPYQL-HALSKPLW-VACQISSKTESDGQFPDCSNNQC 58
           MAL+S SPL+        +  K P  L    S+ L  ++CQ ++     G   + SN   
Sbjct: 1   MALASTSPLAA-----TVARPKAPASLTRCRSRRLQRISCQATTDRSGGG---NASNTSP 52

Query: 59  AGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADL 118
           A P      WRV VS ALAAAVV +    + A ADLNKYEA+ RGEFGIGSAAQFG+ADL
Sbjct: 53  APPR-----WRVAVSAALAAAVVVA----MPAHADLNKYEADQRGEFGIGSAAQFGNADL 103

Query: 119 RKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           +  VHV ENFRRANFTSADMRESDFSGS FNGAY+EKAVA++ANFT  ++   L+
Sbjct: 104 KNTVHVNENFRRANFTSADMRESDFSGSTFNGAYMEKAVAFRANFTGADLSDTLM 158


>gi|302822738|ref|XP_002993025.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
 gi|300139117|gb|EFJ05864.1| hypothetical protein SELMODRAFT_187158 [Selaginella moellendorffii]
          Length = 196

 Score =  130 bits (328), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 61/91 (67%), Positives = 75/91 (82%)

Query: 88  ISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+  H  ENFRRANFTSADMRE+DFSGS 
Sbjct: 1   MNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFRRANFTSADMREADFSGST 60

Query: 148 FNGAYLEKAVAYKANFTVDEICLPLLVSLPM 178
           FNG YLEKAVAY+ NF+  ++   L+  + +
Sbjct: 61  FNGGYLEKAVAYRTNFSGADLSDTLMDRMVL 91


>gi|302780733|ref|XP_002972141.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
 gi|300160440|gb|EFJ27058.1| hypothetical protein SELMODRAFT_96317 [Selaginella moellendorffii]
          Length = 219

 Score =  129 bits (324), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 66/104 (63%), Positives = 82/104 (78%), Gaps = 4/104 (3%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF-RRANFT 134
           LAA V+A+    ++A A+LNK+EAE+RGEFGIGSAAQFGSADLR+  H  ENF RRANFT
Sbjct: 14  LAATVLAT---GMNAGAELNKFEAESRGEFGIGSAAQFGSADLRQTSHANENFSRRANFT 70

Query: 135 SADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPM 178
           SADMRE+DFSGS FNG YLEKAVAY+ NF+  ++   L+  + +
Sbjct: 71  SADMREADFSGSTFNGGYLEKAVAYRTNFSGADLSDTLMDRMVL 114


>gi|168028137|ref|XP_001766585.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682230|gb|EDQ68650.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 225

 Score =  122 bits (305), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 57/88 (64%), Positives = 69/88 (78%)

Query: 86  SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
           ++  +LADLN  EA TRGEFGIGSA QFGSADL+K  H  ENFRR NFTSADM+E++FS 
Sbjct: 24  TSTDSLADLNSLEANTRGEFGIGSAVQFGSADLKKTQHANENFRRGNFTSADMKEANFSN 83

Query: 146 SKFNGAYLEKAVAYKANFTVDEICLPLL 173
           S FNGAYLEKAVAY+ NF+  ++   L+
Sbjct: 84  STFNGAYLEKAVAYRTNFSGADLSDTLM 111


>gi|159478056|ref|XP_001697120.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
 gi|158274594|gb|EDP00375.1| thylakoid lumenal protein [Chlamydomonas reinhardtii]
          Length = 239

 Score = 89.0 bits (219), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 43/74 (58%), Positives = 51/74 (68%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           ALADLN YEA T GEFGIGSA Q+G AD++      ++ RR+NFTSAD R + F GS   
Sbjct: 51  ALADLNAYEAATGGEFGIGSAMQYGEADIQGRDFSNQDLRRSNFTSADCRNATFKGSNLQ 110

Query: 150 GAYLEKAVAYKANF 163
           GAY  KAV Y+ NF
Sbjct: 111 GAYFIKAVTYRTNF 124


>gi|302829835|ref|XP_002946484.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
           nagariensis]
 gi|300268230|gb|EFJ52411.1| hypothetical protein VOLCADRAFT_56064 [Volvox carteri f.
           nagariensis]
          Length = 214

 Score = 86.7 bits (213), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 42/74 (56%), Positives = 51/74 (68%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           A ADLN YEAE  GEFGIGSA Q+G AD++      ++ RR+NFTSAD R ++F GS   
Sbjct: 26  AFADLNVYEAEAGGEFGIGSAQQYGEADVQGRDFSGQDLRRSNFTSADCRNANFKGSNLQ 85

Query: 150 GAYLEKAVAYKANF 163
           GAY  KAV Y+ NF
Sbjct: 86  GAYFIKAVTYRTNF 99


>gi|384248119|gb|EIE21604.1| thylakoid lumenal protein [Coccomyxa subellipsoidea C-169]
          Length = 217

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 40/74 (54%), Positives = 49/74 (66%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           A+ADLNKYEA   GEFG G+A Q+G ADL+      E+ RR+NFT+AD R  +F  S   
Sbjct: 29  AIADLNKYEAAAGGEFGNGTAQQYGEADLKGRDFHGEDLRRSNFTAADCRNCNFKDSNLQ 88

Query: 150 GAYLEKAVAYKANF 163
           GAY  K+V  KANF
Sbjct: 89  GAYFIKSVVPKANF 102


>gi|424513452|emb|CCO66074.1| pentapeptide repeat-containing protein [Bathycoccus prasinos]
          Length = 231

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 44/88 (50%), Positives = 56/88 (63%), Gaps = 5/88 (5%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF----RRANFTSADMRESDFSG 145
           A+A+LN  EA   GEF  GSA QFG  DLR A +V E +    R +NFT A+MR+S   G
Sbjct: 39  AVAELNSREANQGGEFNRGSAQQFGGYDLR-AENVSEKYGTDLRLSNFTGAEMRDSKLVG 97

Query: 146 SKFNGAYLEKAVAYKANFTVDEICLPLL 173
           +K NGAYL KAVA  A+FT  ++   L+
Sbjct: 98  AKLNGAYLMKAVAANADFTDADLSDALM 125


>gi|307105880|gb|EFN54127.1| hypothetical protein CHLNCDRAFT_31689 [Chlorella variabilis]
          Length = 259

 Score = 68.2 bits (165), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 52/84 (61%)

Query: 90  ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFN 149
           A A+LNKYE    GEF +G+A Q+G AD++      ++ +R+NFT+AD R+++F  SK  
Sbjct: 71  ASAELNKYEFGVTGEFNVGTARQYGEADVKGQDFSNQDLQRSNFTAADCRDANFQNSKLQ 130

Query: 150 GAYLEKAVAYKANFTVDEICLPLL 173
            AY  K+V  +AN    ++   L+
Sbjct: 131 AAYFMKSVLARANLENADLSDALM 154


>gi|308811122|ref|XP_003082869.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
 gi|116054747|emb|CAL56824.1| thylakoid lumenal protein-like (ISS) [Ostreococcus tauri]
          Length = 247

 Score = 67.4 bits (163), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/111 (41%), Positives = 59/111 (53%), Gaps = 6/111 (5%)

Query: 66  KNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
           K   V  S ALA A   S +    A A+LN+ EA   GEF  GSA QFG  DL K    K
Sbjct: 34  KKGHVITSIALATAFALSGAP---AHAELNRAEANRGGEFNRGSAKQFGGYDLVKVDIAK 90

Query: 126 E---NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
           E   + R +NFT ADMR +   G+   GAY+ K VA + +FT  ++   L+
Sbjct: 91  EYGKDLRLSNFTGADMRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALM 141


>gi|303288862|ref|XP_003063719.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454787|gb|EEH52092.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 277

 Score = 64.7 bits (156), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 48/78 (61%), Gaps = 3/78 (3%)

Query: 89  SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE---NFRRANFTSADMRESDFSG 145
           +A A+LN  EA   GEF  GSA QFG  DLR    V +   + R +NFT A+MR +   G
Sbjct: 84  AAHAELNAREANRGGEFNRGSAQQFGGYDLRNEDVVGKYGADLRLSNFTGAEMRGAKLRG 143

Query: 146 SKFNGAYLEKAVAYKANF 163
           +   GAYL KAVA++A+F
Sbjct: 144 ANLTGAYLMKAVAFEADF 161


>gi|427725361|ref|YP_007072638.1| pentapeptide repeat-containing protein [Leptolyngbya sp. PCC 7376]
 gi|427357081|gb|AFY39804.1| pentapeptide repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 919

 Score = 47.8 bits (112), Expect = 0.003,   Method: Composition-based stats.
 Identities = 26/64 (40%), Positives = 40/64 (62%), Gaps = 4/64 (6%)

Query: 109 SAAQFGSADLRK----AVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A   SA+LR+    A+ ++ NF  AN T AD+ E++ +GS F+ A L+ AV   ANFT
Sbjct: 748 SDANLSSANLRRSHLRAICLEANFTGANLTQADLCEANVTGSNFSDANLQGAVLKDANFT 807

Query: 165 VDEI 168
           + ++
Sbjct: 808 MTDL 811


>gi|332707710|ref|ZP_08427737.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332353413|gb|EGJ32926.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 285

 Score = 45.1 bits (105), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 41/80 (51%), Gaps = 1/80 (1%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           S A    A+L +A+  + N R A+   AD+  +D +G+   GAY+ +A   KAN T+  +
Sbjct: 58  SQATLTGANLSQAILREANLRGADLRGADLTGADLTGADLEGAYVNRADLRKANLTMANL 117

Query: 169 C-LPLLVSLPMATPVFPAGF 187
               L V+L      +P GF
Sbjct: 118 NETNLQVALYDRETTWPEGF 137


>gi|145219796|ref|YP_001130505.1| pentapeptide repeat-containing protein [Chlorobium phaeovibrioides
           DSM 265]
 gi|145205960|gb|ABP37003.1| pentapeptide repeat protein [Chlorobium phaeovibrioides DSM 265]
          Length = 412

 Score = 43.5 bits (101), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 31/54 (57%), Gaps = 5/54 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAV 157
           S A F  ADLR+A   K  FR AN  +A  RE+     DFSG+   GAYL +A+
Sbjct: 135 SGADFSGADLRRAECSKAGFRGANLQNAHFREASLRSVDFSGADLRGAYLWRAI 188


>gi|78033474|emb|CAJ30090.1| hypothetical acidic protein, pentapeptide repeat [Magnetospirillum
           gryphiswaldense MSR-1]
 gi|144901135|emb|CAM77999.1| pentapeptide repeat containing protein [Magnetospirillum
           gryphiswaldense MSR-1]
          Length = 503

 Score = 43.5 bits (101), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 29/49 (59%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A+LRKAV    N R +N   A + ++D SG+K  GA L  A   +ANF+
Sbjct: 28  ANLRKAVLSGANLRDSNLPRASLEDADLSGAKLQGANLAGATLLRANFS 76


>gi|167907368|ref|ZP_02494573.1| pentapeptide repeat protein [Burkholderia pseudomallei NCTC 13177]
          Length = 269

 Score = 43.5 bits (101), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 10/81 (12%)

Query: 84  CSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
           C +N+S  ADL+  +A+ RG       A    ADLR A     N   AN + AD+ ++D 
Sbjct: 67  CGANLSG-ADLS--DADLRG-------ADLSDADLRGADLSVANLSGANLSGADLSDADL 116

Query: 144 SGSKFNGAYLEKAVAYKANFT 164
           SG+  +GAYL  A    AN +
Sbjct: 117 SGANLSGAYLSYANLSGANLS 137


>gi|407005745|gb|EKE21794.1| pentapeptide repeat protein [uncultured bacterium]
          Length = 189

 Score = 43.1 bits (100), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 29/123 (23%), Positives = 51/123 (41%), Gaps = 9/123 (7%)

Query: 44  TESD---GQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAE 100
           TE+D    +F DC  N+C    +K+ N       +    +   C  +  +  ++NK+   
Sbjct: 36  TETDFVGTKFIDCVFNECNFSNSKILN------CSFCNVIFKECKMSGVSFNEINKFLLV 89

Query: 101 TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
              +  +     F   D++K+  ++      +F  AD+ ESDFS S   G   +     K
Sbjct: 90  WEFDNCVIKLCNFSKLDIKKSKFIQCVIHETDFVDADLSESDFSNSDLRGCKFQNTNLSK 149

Query: 161 ANF 163
            NF
Sbjct: 150 VNF 152


>gi|428215909|ref|YP_007089053.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428004290|gb|AFY85133.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 447

 Score = 43.1 bits (100), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 39/77 (50%), Gaps = 3/77 (3%)

Query: 91  LADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           LAD N   ++ RG   IG++        ADLR+A   + + R AN   AD+RE+D +G+ 
Sbjct: 327 LADANMKGSDLRGADLIGASLNKVNLTQADLREADLTRADLRGANLRLADLREADLTGAS 386

Query: 148 FNGAYLEKAVAYKANFT 164
            N   L +A     + T
Sbjct: 387 LNQVNLAEADLRGVDLT 403


>gi|338740277|ref|YP_004677239.1| hypothetical protein HYPMC_3462 [Hyphomicrobium sp. MC1]
 gi|337760840|emb|CCB66673.1| protein of unknown function [Hyphomicrobium sp. MC1]
          Length = 1588

 Score = 42.7 bits (99), Expect = 0.070,   Method: Composition-based stats.
 Identities = 29/89 (32%), Positives = 42/89 (47%), Gaps = 7/89 (7%)

Query: 83  SCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
           +CSS   A A +  +       F + S   F  ADL+ A   +E  R A F++AD+R+ D
Sbjct: 876 NCSSGDCANAKMKGWN------FSVISQTDFSGADLKGAEFPRET-RGAKFSNADLRDVD 928

Query: 143 FSGSKFNGAYLEKAVAYKANFTVDEICLP 171
            SG +F       A   +ANF   E+  P
Sbjct: 929 ISGKQFQSCSFIGANLREANFGSSEVAGP 957



 Score = 41.6 bits (96), Expect = 0.18,   Method: Composition-based stats.
 Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 27/109 (24%)

Query: 52  DCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLN--KYEAETRGEFGIGS 109
           +CS+  CA   AK+K W   V +           ++ S  ADL   ++  ETRG      
Sbjct: 876 NCSSGDCAN--AKMKGWNFSVIS----------QTDFSG-ADLKGAEFPRETRG------ 916

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDF-----SGSKFNGAYL 153
            A+F +ADLR      + F+  +F  A++RE++F     +G  F+G++L
Sbjct: 917 -AKFSNADLRDVDISGKQFQSCSFIGANLREANFGSSEVAGPNFSGSFL 964


>gi|168705224|ref|ZP_02737501.1| pentapeptide repeat [Gemmata obscuriglobus UQM 2246]
          Length = 831

 Score = 42.7 bits (99), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 60/146 (41%), Gaps = 23/146 (15%)

Query: 52  DCSNNQCAGPYAKLKNWRV----FVSTALAAAVVASCSSNISALADLNKYEAE---TRGE 104
           D SN + AG  A+L N  +    F    L+ A  +      ++ AD+   +A     R  
Sbjct: 522 DLSNEKLAG--ARLNNLDLRGAKFDGAMLSEASFSGSQIQGASFADVPARKANFASARAA 579

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
             +   A   +A+LR A  ++ NF+  + T AD   SD  G+ F GA L+ A   +A F 
Sbjct: 580 DAVFRGAILANANLRAATFLRTNFQNVDLTGADFAFSDLRGADFTGATLKNASFSQAKFD 639

Query: 165 VDEICLPLLVSLPMATPVFPAGFCAP 190
            D                FP G  AP
Sbjct: 640 AD--------------TKFPKGLTAP 651



 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           F +A+L  A  V  + R  NFT+AD+R+++F G+   GA L  A    A+FT
Sbjct: 266 FTAANLAGATCVDADLRGTNFTNADLRKANFRGANLAGADLTGANVAGADFT 317


>gi|431802241|ref|YP_007229144.1| pentapeptide repeat-containing protein [Pseudomonas putida HB3267]
 gi|430793006|gb|AGA73201.1| pentapeptide repeat-containing protein [Pseudomonas putida HB3267]
          Length = 219

 Score = 42.7 bits (99), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93


>gi|78211810|ref|YP_380589.1| hypothetical protein Syncc9605_0258 [Synechococcus sp. CC9605]
 gi|78196269|gb|ABB34034.1| conserved hypothetical protein [Synechococcus sp. CC9605]
          Length = 147

 Score = 42.7 bits (99), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 30/54 (55%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
            AD R+A  +  +FR ++   AD+RE++  G+   GA LE A    AN T  E+
Sbjct: 48  QADFRQAHLIGADFRGSDLRGADLREANLEGADLTGALLEGADLRGANLTNAEL 101


>gi|339487133|ref|YP_004701661.1| pentapeptide repeat-containing protein [Pseudomonas putida S16]
 gi|338837976|gb|AEJ12781.1| pentapeptide repeat-containing protein [Pseudomonas putida S16]
          Length = 219

 Score = 42.7 bits (99), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93


>gi|421082377|ref|ZP_15543263.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
 gi|401702907|gb|EJS93144.1| Pentapeptide repeat protein [Pectobacterium wasabiae CFBP 3304]
          Length = 846

 Score = 42.7 bits (99), Expect = 0.087,   Method: Composition-based stats.
 Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 8/116 (6%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQ 112
           + C+    +    R   +T L +AV +  S N +        ++  R    IG+    A+
Sbjct: 691 DSCSWVETQANEARFVGATWLTSAVASGSSMNGADFTQATLRQSNLRQASLIGAVFARAK 750

Query: 113 FGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             ++DL +A   + NF+RAN     F   D RE++F+ +   GA L+K+    ANF
Sbjct: 751 LENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLMGALLQKSQLSGANF 806


>gi|157372424|ref|YP_001480413.1| pentapeptide repeat-containing protein [Serratia proteamaculans
           568]
 gi|157324188|gb|ABV43285.1| pentapeptide repeat protein [Serratia proteamaculans 568]
          Length = 844

 Score = 42.7 bits (99), Expect = 0.087,   Method: Composition-based stats.
 Identities = 33/141 (23%), Positives = 66/141 (46%), Gaps = 9/141 (6%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
           L K ++  C++ +   +      C+  +   PYA+ K   +    A+  + ++    + +
Sbjct: 668 LRKTVFQQCELQAAVFNGAWLESCNWVESKLPYAQFKAASLLTCAAVMESDLSGADFSEA 727

Query: 90  ALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDF 143
            L + N  +A  T+  F +   A+  ++DL +A   + NF RAN   +     D R+ +F
Sbjct: 728 TLKESNLRQALLTQANFTL---AKVENSDLSEADCQRANFTRANLVGSLLIRTDFRQVNF 784

Query: 144 SGSKFNGAYLEKAVAYKANFT 164
           +G+   GA ++K     A+FT
Sbjct: 785 TGANLMGALMQKTQLGGADFT 805


>gi|22299142|ref|NP_682389.1| hypothetical protein tlr1599 [Thermosynechococcus elongatus BP-1]
 gi|22295324|dbj|BAC09151.1| tlr1599 [Thermosynechococcus elongatus BP-1]
          Length = 309

 Score = 42.7 bits (99), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 43/96 (44%), Gaps = 13/96 (13%)

Query: 89  SALADLNKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFR-----RANFTSADMRE 140
           +AL   N   A+ RG    G   S A    ADLR  + V  + R     +AN T AD+  
Sbjct: 45  AALQSTNLQRADLRGAILTGANLSQADLRGADLRGVILVSADLRWVSLRKANLTGADLTR 104

Query: 141 -----SDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
                +D S +   GA L +A+   AN T+ ++ L 
Sbjct: 105 ANLANADLSEANLTGAQLSEAIVRDANLTLTDLTLA 140


>gi|374583660|ref|ZP_09656754.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
           17734]
 gi|374419742|gb|EHQ92177.1| putative low-complexity protein [Desulfosporosinus youngiae DSM
           17734]
          Length = 367

 Score = 42.7 bits (99), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 36/67 (53%), Gaps = 3/67 (4%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT---V 165
           S A    ADL +A     N RRA+ + A++R +D SG+    A L +A   +AN +   +
Sbjct: 233 SGANLSEADLSRADLSGANLRRADLSGANLRRADLSGANLRRADLSEANLSEANLSGADL 292

Query: 166 DEICLPL 172
           D  CLPL
Sbjct: 293 DFSCLPL 299



 Score = 41.6 bits (96), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 32/56 (57%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL +A     N RRAN + A++ E+D SG+  +GA L +A   +A+ +
Sbjct: 153 SGANLSEADLSRADLSGANLRRANLSGANLSEADLSGANLSGANLSEADLSRADLS 208



 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL +A     N  RA+ + A++ E+D SG+  +GA L +A   +A+ +
Sbjct: 193 SGANLSEADLSRADLSGANLSRADLSGANLSEADLSGANLSGANLSEADLSRADLS 248


>gi|421528695|ref|ZP_15975254.1| pentapeptide repeat-containing protein [Pseudomonas putida S11]
 gi|402213838|gb|EJT85176.1| pentapeptide repeat-containing protein [Pseudomonas putida S11]
          Length = 200

 Score = 42.4 bits (98), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHARLDLANLEKANLQGANLT 93


>gi|390441101|ref|ZP_10229280.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
 gi|389835591|emb|CCI33406.1| Genome sequencing data, contig C319 [Microcystis sp. T1-4]
          Length = 436

 Score = 42.4 bits (98), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 30/53 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           S A    A+LR+A  +K N RRAN   A + E+D SG+    A L KA+  +A
Sbjct: 324 SGANLIDANLRRANLIKANLRRANLIEAILSEADLSGANLRRANLIKAILIEA 376


>gi|443475317|ref|ZP_21065270.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443019839|gb|ELS33873.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 377

 Score = 42.4 bits (98), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 34/60 (56%), Gaps = 5/60 (8%)

Query: 111 AQFGSADLRKAVHVKENFR-----RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
           A    A+L  A+ VK + +     RAN T AD+RE+D SG++   A L KA   KAN ++
Sbjct: 140 ADLTQANLSAAILVKASLKQVILNRANLTEADLREADLSGAQLYLAVLSKANLAKANLSL 199


>gi|209528100|ref|ZP_03276576.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|209491459|gb|EDZ91838.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
          Length = 351

 Score = 42.4 bits (98), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 31/56 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L +A   +AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTRANLTRANLT 245


>gi|406706438|ref|YP_006756791.1| pentapeptide repeat-containing protein [alpha proteobacterium
           HIMB5]
 gi|406652214|gb|AFS47614.1| pentapeptide repeat protein [alpha proteobacterium HIMB5]
          Length = 174

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 35/64 (54%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           FG    + F  A+L ++V +  NF + NFT A++ ++DF GS    A  + A   +ANFT
Sbjct: 80  FGTFPESTFYRANLYESVMIGANFEKTNFTGANLTKADFMGSTLIEANFQNANLMEANFT 139

Query: 165 VDEI 168
              I
Sbjct: 140 SANI 143


>gi|325272495|ref|ZP_08138874.1| pentapeptide repeat-containing protein [Pseudomonas sp. TJI-51]
 gi|324102372|gb|EGB99839.1| pentapeptide repeat-containing protein [Pseudomonas sp. TJI-51]
          Length = 219

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   ADLR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGADLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|119485597|ref|ZP_01619872.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
 gi|119456922|gb|EAW38049.1| hypothetical protein L8106_24480 [Lyngbya sp. PCC 8106]
          Length = 253

 Score = 42.0 bits (97), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 11/88 (12%)

Query: 92  ADLNKYEAETRGEFGIG------SAAQFGSADLRKAVHVKENFRRANFTSA-----DMRE 140
           ADL K + +    F +       S A F +ADLR+    K N   ANFT A     D+R 
Sbjct: 126 ADLRKADLQDANLFKVNFSEAYLSEANFENADLRQVTFFKANLADANFTDANLFGSDLRL 185

Query: 141 SDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           ++  G+ F+ A L+ A+    N    E+
Sbjct: 186 ANLKGADFSNANLQAAILVNTNIAQAEL 213



 Score = 35.8 bits (81), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 35/82 (42%), Gaps = 8/82 (9%)

Query: 91  LADLNKYEAETRGEFGIG--------SAAQFGSADLRKAVHVKENFRRANFTSADMRESD 142
           LAD N YEA  R     G        S A    ADLRKA     N  + NF+ A + E++
Sbjct: 93  LADANLYEANLRYANLQGADLRQADLSRASLTRADLRKADLQDANLFKVNFSEAYLSEAN 152

Query: 143 FSGSKFNGAYLEKAVAYKANFT 164
           F  +        KA    ANFT
Sbjct: 153 FENADLRQVTFFKANLADANFT 174


>gi|354567474|ref|ZP_08986643.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353542746|gb|EHC12207.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 164

 Score = 42.0 bits (97), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 45/100 (45%), Gaps = 10/100 (10%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           +K+WRVF    LA  V+      +  L+      + +R         Q  +AD      +
Sbjct: 1   MKSWRVFAVLILAMVVL------LFPLSAEAAKSSSSR----FAGYKQMSNADFSGQTLI 50

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           +E F +     A+   +D  G+ FN AYLEKA  + A+FT
Sbjct: 51  REEFTKVKLDKANFSNADLRGAVFNNAYLEKANLHGADFT 90


>gi|390438037|ref|ZP_10226537.1| hypothetical protein MICAI_1320003 [Microcystis sp. T1-4]
 gi|389838536|emb|CCI30661.1| hypothetical protein MICAI_1320003 [Microcystis sp. T1-4]
          Length = 260

 Score = 42.0 bits (97), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 40/86 (46%), Gaps = 10/86 (11%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF------- 163
           A    ADL +A     N R AN  SA + E++   S   GA L+ A  Y+AN        
Sbjct: 165 ANLAGADLFRANLRGANLRGANLHSAGLVEANLQSSDLAGAKLQMATLYRANLQDAKYTD 224

Query: 164 --TVDEICLPLLVSLPMATPVFPAGF 187
             T  ++C  L +S P +T  FP GF
Sbjct: 225 ASTSPKLCESLSLSYPCSTG-FPEGF 249


>gi|162450958|ref|YP_001613325.1| WD repeat-containing protein [Sorangium cellulosum So ce56]
 gi|161161540|emb|CAN92845.1| Hypothetical WD-repeat protein [Sorangium cellulosum So ce56]
          Length = 2305

 Score = 42.0 bits (97), Expect = 0.15,   Method: Composition-based stats.
 Identities = 27/84 (32%), Positives = 39/84 (46%), Gaps = 13/84 (15%)

Query: 97   YEAETRGEFGIGS---AAQFGSADLRKAVHVKENFRRANFTSADMRESDF---------- 143
            +  ET G    G+     Q    DLR A     N R AN + AD+  +D           
Sbjct: 1111 WAEETAGWISEGADLHGVQLAGEDLRGAPLAGANLRDANLSGADLSGADLTDAALSGAML 1170

Query: 144  SGSKFNGAYLEKAVAYKANFTVDE 167
            SG+K +G  L +A+A++A+FT  E
Sbjct: 1171 SGAKLHGTILRRAIAHRADFTQAE 1194


>gi|260436217|ref|ZP_05790187.1| pentapeptide repeat protein [Synechococcus sp. WH 8109]
 gi|260414091|gb|EEX07387.1| pentapeptide repeat protein [Synechococcus sp. WH 8109]
          Length = 147

 Score = 41.6 bits (96), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
            AD R+A  +  +FR  +   AD+RE++  G+   GA LE A    AN T  E+
Sbjct: 48  QADFRQAHLIGADFRGTDLRGADLREANLEGADLTGALLEGADLRGANLTNAEL 101


>gi|261821705|ref|YP_003259811.1| hypothetical protein Pecwa_2443 [Pectobacterium wasabiae WPP163]
 gi|261605718|gb|ACX88204.1| Protein of unknown function DUF2169 [Pectobacterium wasabiae
           WPP163]
          Length = 846

 Score = 41.6 bits (96), Expect = 0.16,   Method: Composition-based stats.
 Identities = 32/116 (27%), Positives = 54/116 (46%), Gaps = 8/116 (6%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQ 112
           + C+    +    R   +T L +AV +  S N +        ++  R    IG+    A+
Sbjct: 691 DSCSWVETQANEARFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAVFALAK 750

Query: 113 FGSADLRKAVHVKENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             ++DL +A   + NF+RAN     F   D RE++F+ +   GA L+K+    ANF
Sbjct: 751 LENSDLSEADCQQTNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQLGGANF 806


>gi|257061367|ref|YP_003139255.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8802]
 gi|256591533|gb|ACV02420.1| pentapeptide repeat protein [Cyanothece sp. PCC 8802]
          Length = 371

 Score = 41.6 bits (96), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 5/89 (5%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
           + A+ + N++ L  L  +   T    G   AA+  + +L  A   + NFR AN T A++ 
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277

Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANF 163
           E+      FSG+  +GAYL  A   KA+F
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADF 306


>gi|113476913|ref|YP_722974.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
 gi|110167961|gb|ABG52501.1| serine/threonine protein kinase [Trichodesmium erythraeum IMS101]
          Length = 567

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 30/53 (56%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A    A+L KAV V  N RR N + A++  ++   + F+GAYL +A   +AN 
Sbjct: 418 ASLEGANLTKAVLVSANLRRVNLSGANLNSTNLRAANFSGAYLREAKLSRANL 470


>gi|218247298|ref|YP_002372669.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 8801]
 gi|218167776|gb|ACK66513.1| pentapeptide repeat protein [Cyanothece sp. PCC 8801]
          Length = 371

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 45/89 (50%), Gaps = 5/89 (5%)

Query: 80  VVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR 139
           + A+ + N++ L  L  +   T    G   AA+  + +L  A   + NFR AN T A++ 
Sbjct: 218 LYAANTHNLAELIKLAHFNPLTDLAGGNFLAAELSAVELSGANLTQTNFRGANLTDAELS 277

Query: 140 ES-----DFSGSKFNGAYLEKAVAYKANF 163
           E+      FSG+  +GAYL  A   KA+F
Sbjct: 278 EAILNYCKFSGADLSGAYLGNAQLVKADF 306


>gi|334137987|ref|ZP_08511411.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
 gi|333604520|gb|EGL15910.1| pentapeptide repeat protein [Paenibacillus sp. HGF7]
          Length = 242

 Score = 41.6 bits (96), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 4/119 (3%)

Query: 49  QFPDCSNNQCAGPYAKLKNWRVFVSTALAAAV--VASCSSNISALADLNKYEAETRGEFG 106
              DC  ++     A++K+  + +ST +      V  C+ N+S  + L K         G
Sbjct: 89  DIADCVLSEATLRNAQMKDAEIKISTCIETCFDEVELCNGNLSG-STLIKATFRQANLHG 147

Query: 107 I-GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  S A F  +DLR A  V  +F  ++F SA++ E D S + F G  L  A+    NFT
Sbjct: 148 ISASKAYFDESDLRGANLVNGDFEESDFISANLSEVDASYANFTGGNLTGAILCNGNFT 206


>gi|119486371|ref|ZP_01620430.1| hypothetical protein L8106_16994 [Lyngbya sp. PCC 8106]
 gi|119456584|gb|EAW37714.1| hypothetical protein L8106_16994 [Lyngbya sp. PCC 8106]
          Length = 772

 Score = 41.6 bits (96), Expect = 0.20,   Method: Composition-based stats.
 Identities = 32/103 (31%), Positives = 52/103 (50%), Gaps = 3/103 (2%)

Query: 62  YAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
           +A LKN  +  +  +AA++ ++  S+ + L+  N   A  +G    G  A    A+LR A
Sbjct: 532 HANLKNANLSTANLMAASLNSANLSD-ANLSHANLECANLKGANLTG--ANLSYANLRGA 588

Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
                N R AN + AD+R  + S +  + AYL  A  Y+AN +
Sbjct: 589 NLSGVNLRDANLSYADLRRVNLSQANLDSAYLRGANLYRANIS 631


>gi|427415571|ref|ZP_18905754.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
 gi|425758284|gb|EKU99136.1| putative low-complexity protein [Leptolyngbya sp. PCC 7375]
          Length = 184

 Score = 41.2 bits (95), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 33/53 (62%), Gaps = 5/53 (9%)

Query: 109 SAAQFGSADLRKA-VHVKE----NFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           S A  G ADLRKA +H  +    + R A+ T A+++E+D S +  +GAYL +A
Sbjct: 103 SGANLGGADLRKADLHKADLSDSDLRCADLTGANLQETDLSDANLDGAYLGEA 155


>gi|434398536|ref|YP_007132540.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428269633|gb|AFZ35574.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 284

 Score = 41.2 bits (95), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 42/92 (45%), Gaps = 15/92 (16%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDF----------SGSKFNGAYLEKAVA 158
           S     +A+L +A     N   AN T A++  +D           +G+   GAYL ++  
Sbjct: 49  SGVDLTNANLSQATLTNANLSGANLTGANLTGTDLRGINLTGANLTGANLEGAYLNRSDL 108

Query: 159 YKANFT---VDEICLPLLVSLPMATPVFPAGF 187
            +ANFT   +D + L   VSL     +FP GF
Sbjct: 109 RQANFTDAKLDNVKLQ--VSLYDQATIFPEGF 138


>gi|300864976|ref|ZP_07109808.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
 gi|300337032|emb|CBN54958.1| conserved hypothetical protein [Oscillatoria sp. PCC 6506]
          Length = 279

 Score = 41.2 bits (95), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 33/66 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           + A    ADLR+A+ +  N   AN T A++R ++ S S   GA L  A  Y+A      +
Sbjct: 183 NGANLSGADLRQAIAIGSNLSDANLTQANLRVANVSWSTLRGANLTGANLYRAKLNWSNL 242

Query: 169 CLPLLV 174
              +LV
Sbjct: 243 SGAILV 248


>gi|345872411|ref|ZP_08824346.1| pentapeptide repeat protein [Thiorhodococcus drewsii AZ1]
 gi|343918959|gb|EGV29716.1| pentapeptide repeat protein [Thiorhodococcus drewsii AZ1]
          Length = 284

 Score = 41.2 bits (95), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 32/61 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           S AQ   ADLR A     N + AN + AD+R +DF GS  +     KA+  +AN    ++
Sbjct: 103 SDAQLTGADLRCAEVRYANLKHANLSHADLRGTDFHGSDLSHMVAIKALLIRANLRETDL 162

Query: 169 C 169
           C
Sbjct: 163 C 163


>gi|37521689|ref|NP_925066.1| hypothetical protein glr2120 [Gloeobacter violaceus PCC 7421]
 gi|35212687|dbj|BAC90061.1| glr2120 [Gloeobacter violaceus PCC 7421]
          Length = 278

 Score = 41.2 bits (95), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 28/53 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A    ADLR+A  V  N RRAN   AD RESD   +    A L +A  +KAN 
Sbjct: 179 ANLEGADLREASFVSANLRRANLRRADCRESDLFDANLCEADLREAKLHKANL 231



 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 27/45 (60%), Gaps = 5/45 (11%)

Query: 115 SADLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLE 154
            ADLR+A   K N R+A   S     AD+RE+D SG+   GA+LE
Sbjct: 218 EADLREAKLHKANLRQALLVSADLRGADLREADLSGANLQGAHLE 262


>gi|434394476|ref|YP_007129423.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
 gi|428266317|gb|AFZ32263.1| pentapeptide repeat protein [Gloeocapsa sp. PCC 7428]
          Length = 183

 Score = 41.2 bits (95), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A   SADL +A     N ++AN   AD+ E+D  G+  +GA L+ A   +AN 
Sbjct: 89  ANLQSADLDQANLRDANLQQANLRDADLEEADLQGANLSGANLQSADLEEANL 141



 Score = 39.3 bits (90), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 29/55 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A   SADL +A     NF+ AN  +AD+ ++   G+ F+GA L+ A     N 
Sbjct: 127 SGANLQSADLEEANLQNANFQNANLQNADLEDARVQGANFDGANLQGADLEGTNL 181


>gi|298250074|ref|ZP_06973878.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297548078|gb|EFH81945.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 471

 Score = 41.2 bits (95), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 26/43 (60%), Gaps = 5/43 (11%)

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFT 164
           + R+AN + A M  +D SG+   GA LE      AVA+KANFT
Sbjct: 135 DLRKANLSMARMHHTDLSGANLTGAILEGIDLKDAVAHKANFT 177


>gi|119488860|ref|ZP_01621822.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
 gi|119455021|gb|EAW36163.1| Pentapeptide repeat protein [Lyngbya sp. PCC 8106]
          Length = 1011

 Score = 40.8 bits (94), Expect = 0.26,   Method: Composition-based stats.
 Identities = 21/55 (38%), Positives = 30/55 (54%), Gaps = 5/55 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A   +ADLR A     N  RAN + A++R ++ SG+  +G YL  A   +AN 
Sbjct: 850 SGADLRTADLRSA-----NLIRANLSDANLRSANLSGANLSGVYLNSADLRRANL 899


>gi|434399306|ref|YP_007133310.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428270403|gb|AFZ36344.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 298

 Score = 40.8 bits (94), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 35/71 (49%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVS 175
           A+LR A  +  N R AN + AD++ ++  G+ F GA L KA    ANF    +   +  +
Sbjct: 224 ANLRDANLIGANLRGANLSQADLKGANLEGANFKGANLTKADLRGANFKGANLQDAIFKN 283

Query: 176 LPMATPVFPAG 186
             +   + P G
Sbjct: 284 TKLQGTIMPDG 294


>gi|21674877|ref|NP_662942.1| pentapeptide repeat-containing protein [Chlorobium tepidum TLS]
 gi|21648101|gb|AAM73284.1| pentapeptide repeat family protein [Chlorobium tepidum TLS]
          Length = 439

 Score = 40.8 bits (94), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 33/69 (47%), Gaps = 15/69 (21%)

Query: 111 AQFGSADLRKAVHVKENFRRA---------------NFTSADMRESDFSGSKFNGAYLEK 155
           A+ G  DLRKA   K +F RA               NF  ADM+E++  G+   GA L++
Sbjct: 285 AELGGVDLRKASLSKSDFERANLDKANLAGANLAGVNFQRADMKEANLKGANLEGANLDR 344

Query: 156 AVAYKANFT 164
           A    A+ +
Sbjct: 345 AFLKGADLS 353


>gi|254416875|ref|ZP_05030623.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196176239|gb|EDX71255.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 332

 Score = 40.8 bits (94), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           Y A+ RG   I        ADLR A  +K N R AN    ++RE+D  G+  +GA L  A
Sbjct: 144 YTAKLRG--AILQNVDLQGADLRGADLLKVNLRGANLRETNLREADLRGANLSGANLSSA 201

Query: 157 VAYKANF 163
              + N 
Sbjct: 202 FLTEVNL 208


>gi|354564859|ref|ZP_08984035.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353549985|gb|EHC19424.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 166

 Score = 40.8 bits (94), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 30/56 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A   +ADL KA     N   AN T+AD+ E++ +G+   GA  ++A    AN T
Sbjct: 89  SNANLTNADLEKANLSNANLSGANLTNADLEEANLTGANLRGANFQRADLEDANLT 144


>gi|150014700|gb|ABR57221.1| PedD [Pseudomonas putida]
          Length = 219

 Score = 40.8 bits (94), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGAKLANQDLRKMNLAGADLRDADLRHARLDLANLEKARLQGANLT 93


>gi|423066634|ref|ZP_17055424.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|406711942|gb|EKD07140.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 351

 Score = 40.8 bits (94), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 30/56 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A   +AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTRANLT 245


>gi|425440351|ref|ZP_18820656.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389719234|emb|CCH96913.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 333

 Score = 40.8 bits (94), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|220909896|ref|YP_002485207.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219866507|gb|ACL46846.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 184

 Score = 40.8 bits (94), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 30/53 (56%), Gaps = 5/53 (9%)

Query: 109 SAAQFGSADLRKAVHVKENF-----RRANFTSADMRESDFSGSKFNGAYLEKA 156
           S A  G ADLRKA   K +      R A+ + A++RE+D S +  +GAYL  A
Sbjct: 103 SGANLGGADLRKADLSKADLSGADLRGADLSGANLRETDLSDADLDGAYLGHA 155


>gi|330809494|ref|YP_004353956.1| hypothetical protein PSEBR_a2659 [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
 gi|423697147|ref|ZP_17671637.1| pentapeptide repeat protein PedD [Pseudomonas fluorescens Q8r1-96]
 gi|327377602|gb|AEA68952.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp.
           brassicacearum NFM421]
 gi|388004031|gb|EIK65358.1| pentapeptide repeat protein PedD [Pseudomonas fluorescens Q8r1-96]
          Length = 214

 Score = 40.8 bits (94), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 33/58 (56%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  ++Q   A+LR A    ++ R+ N + AD+R++D   ++ + A LEKA    AN T
Sbjct: 31  IAESSQCPGANLRGAKLANQDLRKMNLSGADLRDADLRHARLDLANLEKAQLQGANLT 88


>gi|409994014|ref|ZP_11277136.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291569676|dbj|BAI91948.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409935088|gb|EKN76630.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 331

 Score = 40.8 bits (94), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 14/101 (13%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
           F  T L AA +   +  ++ L D N  +A+ RG       A    ADLR A     N R 
Sbjct: 87  FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139

Query: 130 ------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
                   N   AD+R +D  G    GA L +A    AN T
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLMGANLT 180


>gi|166364712|ref|YP_001656985.1| hypothetical protein MAE_19710 [Microcystis aeruginosa NIES-843]
 gi|425466893|ref|ZP_18846187.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
 gi|166087085|dbj|BAG01793.1| hypothetical protein MAE_19710 [Microcystis aeruginosa NIES-843]
 gi|389830484|emb|CCI27530.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9809]
          Length = 333

 Score = 40.8 bits (94), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|90019736|ref|YP_525563.1| hypothetical protein Sde_0087 [Saccharophagus degradans 2-40]
 gi|89949336|gb|ABD79351.1| pentapeptide repeat [Saccharophagus degradans 2-40]
          Length = 600

 Score = 40.8 bits (94), Expect = 0.33,   Method: Composition-based stats.
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
           S A    ADLR A     NF+ A     D+R++D SG +F+GA L
Sbjct: 197 SGANLRRADLRDAKFCSTNFKNAELNGVDLRKADLSGLEFDGADL 241


>gi|403357343|gb|EJY78297.1| hypothetical protein OXYTRI_24550 [Oxytricha trifallax]
          Length = 290

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 24/53 (45%), Positives = 28/53 (52%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
            A F   D  KAV    NF +A  ++ADMRE DF  S FN A L  A   +AN
Sbjct: 199 GANFMHVDFVKAVGKDCNFLKAKLSNADMREGDFENSNFNEASLHGANLERAN 251


>gi|425435715|ref|ZP_18816162.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|425462172|ref|ZP_18841646.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|440755045|ref|ZP_20934247.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
 gi|389679721|emb|CCH91528.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389824858|emb|CCI25881.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|440175251|gb|ELP54620.1| pentapeptide repeats family protein [Microcystis aeruginosa
           TAIHU98]
          Length = 333

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|46202237|ref|ZP_00053526.2| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
           magnetotacticum MS-1]
          Length = 542

 Score = 40.8 bits (94), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A+LRKAV    N R  N   A + ++D SG+K  GA L  A   +ANF+
Sbjct: 54  ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFS 102


>gi|425444319|ref|ZP_18824373.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|425455654|ref|ZP_18835369.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389730303|emb|CCI05384.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389803421|emb|CCI17652.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|390441606|ref|ZP_10229649.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|389835072|emb|CCI33775.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|254414225|ref|ZP_05027992.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196178900|gb|EDX73897.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 963

 Score = 40.4 bits (93), Expect = 0.35,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 30/54 (55%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            A    A L+ A   + N +RAN   A++ E++F G+ F GA LE A  ++AN 
Sbjct: 890 GANLEGAHLKGANLKRANLKRANLKRANLFEANFEGANFEGATLEWANLFEANL 943


>gi|428220816|ref|YP_007104986.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427994156|gb|AFY72851.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 418

 Score = 40.4 bits (93), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 35/60 (58%), Gaps = 5/60 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANF 163
           S A F  +DL  A+ ++ + RRAN + A++ E     +D SG  F+G+ L +A   +ANF
Sbjct: 143 SMANFTGSDLSGAIMIRADLRRANISRANLNEADISRADLSGVDFSGSNLSQANFEEANF 202



 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 95  NKYEAETRGEFGIG---SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           N  E +  G   IG   S A F  ADLR+A     N   ANF +A+++E+D SG+   GA
Sbjct: 221 NFREVDLSGSDLIGADLSNANFAEADLRRA-----NLVGANFNNANLKEADLSGAYLIGA 275

Query: 152 YLEKAVAYKANF 163
            L  A   +A+F
Sbjct: 276 TLVNANIVRADF 287


>gi|443315235|ref|ZP_21044737.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442785176|gb|ELR95014.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 402

 Score = 40.4 bits (93), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 12/67 (17%)

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFN 149
           N + A+ RG       A   S DLR+A+  + N R+ANF  A+MR     E+D  G+   
Sbjct: 318 NLHRADLRG-------ANLESTDLREAILRQANLRQANFRYANMRMAHLAEADLRGADLR 370

Query: 150 GAYLEKA 156
           GA L  A
Sbjct: 371 GADLTHA 377


>gi|452962545|gb|EME67671.1| hypothetical protein H261_22313 [Magnetospirillum sp. SO-1]
          Length = 542

 Score = 40.4 bits (93), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A+LRKAV    N R  N   A + ++D SG+K  GA L  A   +ANF+
Sbjct: 54  ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFS 102


>gi|443662162|ref|ZP_21132897.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159030702|emb|CAO88375.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443332138|gb|ELS46762.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|422302321|ref|ZP_16389684.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
 gi|389788496|emb|CCI15816.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9806]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|378950893|ref|YP_005208381.1| protein PedD [Pseudomonas fluorescens F113]
 gi|359760907|gb|AEV62986.1| PedD [Pseudomonas fluorescens F113]
          Length = 214

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 33/58 (56%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  ++Q   A+LR A    ++ R+ N + AD+R++D   ++ + A LEKA    AN T
Sbjct: 31  IAESSQCPGANLRGAKLANQDLRKMNLSGADLRDADLRHARLDLANLEKAQLQGANLT 88


>gi|254413444|ref|ZP_05027214.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196179551|gb|EDX74545.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 768

 Score = 40.4 bits (93), Expect = 0.37,   Method: Composition-based stats.
 Identities = 23/80 (28%), Positives = 41/80 (51%)

Query: 89  SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
           SA+++L  Y    + +      +    AD+R +     + R AN +SAD+ E++ S +K 
Sbjct: 414 SAVSELEAYRIFCQLKGANLRGSDLKGADIRNSDLSAADLREANLSSADLSEANLSLAKL 473

Query: 149 NGAYLEKAVAYKANFTVDEI 168
            GA L  A+   A+ TV ++
Sbjct: 474 GGANLSSAILLGADLTVTDL 493


>gi|83310097|ref|YP_420361.1| hypothetical protein amb0998 [Magnetospirillum magneticum AMB-1]
 gi|82944938|dbj|BAE49802.1| Uncharacterized low-complexity protein [Magnetospirillum magneticum
           AMB-1]
          Length = 542

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 21/49 (42%), Positives = 28/49 (57%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A+LRKAV    N R  N   A + ++D SG+K  GA L  A   +ANF+
Sbjct: 54  ANLRKAVLSGANLRDCNLPRACLEDADLSGAKLQGANLAGATLLRANFS 102


>gi|332711030|ref|ZP_08430965.1| hypothetical cyclic nucleotide-binding domain protein [Moorea
           producens 3L]
 gi|332350156|gb|EGJ29761.1| hypothetical cyclic nucleotide-binding domain protein [Moorea
           producens 3L]
          Length = 328

 Score = 40.4 bits (93), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 7/73 (9%)

Query: 86  SNISALADLNKYEAETRGEFG--IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDF 143
           S ++ +A+LN YE   R +            +ADLR       NFR AN T AD R ++ 
Sbjct: 34  SQLAEIAELNLYEDLARVDLSGVNLENVNLNNADLRGT-----NFRNANLTGADFRNANL 88

Query: 144 SGSKFNGAYLEKA 156
           +G+ FN A L+ A
Sbjct: 89  TGADFNDAILDNA 101


>gi|376002767|ref|ZP_09780589.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375328823|emb|CCE16342.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 517

 Score = 40.4 bits (93), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ LA ++  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RE+    + FNGA L  A
Sbjct: 139 ADLRETKLQQTNFNGANLSGA 159



 Score = 35.8 bits (81), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 34/64 (53%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
           A F +A+LR+A     N   A+F+ A+MR  D  G+  +GA L +A    AN +   +  
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248

Query: 171 PLLV 174
            +LV
Sbjct: 249 AVLV 252


>gi|359460720|ref|ZP_09249283.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 294

 Score = 40.4 bits (93), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 37/65 (56%), Gaps = 12/65 (18%)

Query: 111 AQFGSADLRKAVHVKE------NFRRANF-----TSADMRESDFSGSKFNGAYLEKAVAY 159
           A+F  ADLR+ V++++      NF RAN      T AD+RE+DF+ +    A L +A   
Sbjct: 173 ARFQDADLRR-VNLQQAFVKSANFARANLVGADLTKADLRETDFTRANLTQAVLTQAKLR 231

Query: 160 KANFT 164
            ANF+
Sbjct: 232 DANFS 236


>gi|425453004|ref|ZP_18832819.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|389764929|emb|CCI09042.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
          Length = 333

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRYANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|354564871|ref|ZP_08984047.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
 gi|353549997|gb|EHC19436.1| pentapeptide repeat protein [Fischerella sp. JSC-11]
          Length = 105

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 29/56 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL KA     N   AN T+AD+ E++ +G+   GA  ++A    AN T
Sbjct: 28  SNANLTGADLEKANLSNANLSGANLTNADLEEANLTGANLKGANFQRADLEDANLT 83


>gi|209526071|ref|ZP_03274603.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423067542|ref|ZP_17056332.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209493459|gb|EDZ93782.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406711116|gb|EKD06318.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 517

 Score = 40.4 bits (93), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ LA ++  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLARVDLTEAQLINSLLI-------RAELIRAKLTKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RE+    + FNGA L  A
Sbjct: 139 ADLRETKLQQTNFNGANLSGA 159



 Score = 35.8 bits (81), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 22/64 (34%), Positives = 34/64 (53%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
           A F +A+LR+A     N   A+F+ A+MR  D  G+  +GA L +A    AN +   +  
Sbjct: 189 ADFSNAELRQANLTYANLSNADFSGANMRWIDLQGADLSGANLTEANLSGANLSGANLSS 248

Query: 171 PLLV 174
            +LV
Sbjct: 249 AVLV 252


>gi|427702634|ref|YP_007045856.1| low-complexity protein [Cyanobium gracile PCC 6307]
 gi|427345802|gb|AFY28515.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
          Length = 182

 Score = 40.4 bits (93), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 5/63 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKA 161
            G  A+F  ADL  A+  +  F  A+F  AD     M + D SG+   GA L  A+A  +
Sbjct: 76  TGRQARFRDADLHGAILTQAAFPEADFHGADLSDALMDKVDMSGTDLTGAVLRGAIASGS 135

Query: 162 NFT 164
           NFT
Sbjct: 136 NFT 138


>gi|209526959|ref|ZP_03275476.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|376005813|ref|ZP_09783205.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|423064919|ref|ZP_17053709.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209492561|gb|EDZ92899.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|375325803|emb|CCE18958.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|406714162|gb|EKD09330.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 331

 Score = 40.4 bits (93), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 41/101 (40%), Gaps = 14/101 (13%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR- 129
           F  T L AA +   +  ++ L D N  +A+ RG       A    ADLR A     N R 
Sbjct: 87  FHGTILQAADLRKANLTLATLVDANLIQADLRG-------ANLQGADLRGACLRGANMRY 139

Query: 130 ------RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
                   N   AD+R +D  G    GA L +A    AN T
Sbjct: 140 ERRIYESVNLRGADLRGTDLQGVNLTGADLTRANLTGANLT 180


>gi|407781463|ref|ZP_11128681.1| pentapeptide repeat-containing protein [Oceanibaculum indicum P24]
 gi|407207680|gb|EKE77611.1| pentapeptide repeat-containing protein [Oceanibaculum indicum P24]
          Length = 443

 Score = 40.0 bits (92), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 38/75 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           S A   +ADLR A     NFR A  T  ++   + +G+ F+GA L  A  + ANF   ++
Sbjct: 171 SEANLSNADLRNADLRMSNFRNAIMTGVNLIGVNAAGADFHGAVLTNARIHDANFDGVDL 230

Query: 169 CLPLLVSLPMATPVF 183
              +L    + +PVF
Sbjct: 231 TGAILDLTHLTSPVF 245


>gi|344339023|ref|ZP_08769953.1| pentapeptide repeat protein [Thiocapsa marina 5811]
 gi|343800943|gb|EGV18887.1| pentapeptide repeat protein [Thiocapsa marina 5811]
          Length = 284

 Score = 40.0 bits (92), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 30/61 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           S A    ADLR A   + + R AN   ADMR++DF GS        KA+  +AN     +
Sbjct: 103 SKANLERADLRHADVRRADLRGANLAHADMRDTDFQGSDLCHVVAPKALFIRANLREANL 162

Query: 169 C 169
           C
Sbjct: 163 C 163



 Score = 36.2 bits (82), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 19/126 (15%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETR----GEFGIGSAAQFGSADLR----KAVHVKE- 126
           L  A +  C  N + LA  + +EA+      G F + + A F  ADLR    ++V  +E 
Sbjct: 162 LCGADLRDCHLNDANLAGASMHEADLTSALPGGFTVINLANFEGADLRGSKLRSVSAQET 221

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAG 186
           NFR AN T  D+     + +   GA L +A    A+F+  E     L S+ M    F   
Sbjct: 222 NFRNANLTDVDL-----TNAVLGGAILRRADVTNADFSGVE-----LASVTMEFANFSKA 271

Query: 187 FCAPFP 192
             A +P
Sbjct: 272 RNAVYP 277


>gi|220907627|ref|YP_002482938.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219864238|gb|ACL44577.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 267

 Score = 40.0 bits (92), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 37/77 (48%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           S A   +A+ +KA  +      +N T AD+ ++D +G   + A L +A   + NFT  ++
Sbjct: 132 SQANMSAANFQKATLISAYLHNSNLTQADLSDADLTGINLSDANLSQATLIRTNFTGGDL 191

Query: 169 CLPLLVSLPMATPVFPA 185
              +LV   +A     A
Sbjct: 192 SRVMLVGANLAETNLTA 208


>gi|428202965|ref|YP_007081554.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427980397|gb|AFY77997.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 179

 Score = 40.0 bits (92), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 28/56 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           + A    ADL K      N + AN  +AD+ E++   +   GA L++A   KAN T
Sbjct: 97  AGANLQGADLEKGNLAGANLQTANLINADLEEANLQNANLQGASLQRADLEKANLT 152


>gi|313204014|ref|YP_004042671.1| pentapeptide repeat-containing protein [Paludibacter
           propionicigenes WB4]
 gi|312443330|gb|ADQ79686.1| pentapeptide repeat protein [Paludibacter propionicigenes WB4]
          Length = 186

 Score = 40.0 bits (92), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 45/129 (34%), Gaps = 6/129 (4%)

Query: 35  WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
           ++ C   S    D  F DC+   C    A LKN      TAL+      C        + 
Sbjct: 27  FLNCNFYSSNLVDVSFRDCTFESCDFSLASLKN------TALSDIQFIGCKLVGVQFDEC 80

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
           N +    R E  +   A F    L+K   +  N    +FT ADM          N A   
Sbjct: 81  NPFLFSVRFENCVLKLAVFQKVKLKKTRFINCNLEETDFTEADMSSGVLDNCNLNRAIFH 140

Query: 155 KAVAYKANF 163
           K    KA+F
Sbjct: 141 KTNLEKADF 149


>gi|163795566|ref|ZP_02189532.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
 gi|159179165|gb|EDP63698.1| hypothetical protein BAL199_26237 [alpha proteobacterium BAL199]
          Length = 427

 Score = 40.0 bits (92), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 42/94 (44%), Gaps = 2/94 (2%)

Query: 94  LNKYEAETRGEF--GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           LN Y    R +   G  + AQ    DLR+A+    +FR A F  A++ E+  +GS+   A
Sbjct: 23  LNNYPGGQRADMRGGRHNGAQLNGVDLRRAMMSAADFRGAQFVGANLSEATLAGSQLRVA 82

Query: 152 YLEKAVAYKANFTVDEICLPLLVSLPMATPVFPA 185
            L  A   K +F   ++    L S  +    F A
Sbjct: 83  DLSGAKLVKTDFRGADLEQAKLTSSDITDADFRA 116


>gi|348176753|ref|ZP_08883647.1| pentapeptide repeat-containing protein [Saccharopolyspora spinosa
           NRRL 18395]
          Length = 198

 Score = 40.0 bits (92), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 47/101 (46%), Gaps = 7/101 (6%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F  T L  + +  CS   S+ AD  +  A T  E  + +    G ADLR        FR 
Sbjct: 71  FERTVLGKSTLDGCSLLGSSFADC-RLRAWTLRETDL-TLVGMGKADLRGLDLRGIRFRE 128

Query: 131 ANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTVD 166
           AN T  D+R     E+DF+G++  GA LE+A   ++    D
Sbjct: 129 ANLTECDLRRCDLREADFTGARLLGARLEEADLRESRIDAD 169


>gi|271962831|ref|YP_003337027.1| hypothetical protein [Streptosporangium roseum DSM 43021]
 gi|270506006|gb|ACZ84284.1| Uncharacterized low-complexity protein-like protein
           [Streptosporangium roseum DSM 43021]
          Length = 412

 Score = 40.0 bits (92), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 27/87 (31%), Positives = 41/87 (47%), Gaps = 7/87 (8%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT-VDEIC 169
           A F  A LR+A   + N R A F  AD+ E++ + ++ +GA    A+   A+F   D+  
Sbjct: 177 ADFTRAKLREAKLGQANLRNATFKDADLSEAELTQAELDGAVFTGALVEGASFVQADDAD 236

Query: 170 L------PLLVSLPMATPVFPAGFCAP 190
           L      P  +SLP    + P G   P
Sbjct: 237 LAGAKGTPKGLSLPTTDLLIPDGIFTP 263


>gi|397695427|ref|YP_006533310.1| pentapeptide repeat-containing protein [Pseudomonas putida DOT-T1E]
 gi|421520705|ref|ZP_15967367.1| pentapeptide repeat-containing protein [Pseudomonas putida LS46]
 gi|298682200|gb|ADI95267.1| PedD [Pseudomonas putida DOT-T1E]
 gi|397332157|gb|AFO48516.1| pentapeptide repeat-containing protein [Pseudomonas putida DOT-T1E]
 gi|402755315|gb|EJX15787.1| pentapeptide repeat-containing protein [Pseudomonas putida LS46]
          Length = 219

 Score = 40.0 bits (92), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|425470595|ref|ZP_18849461.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389883733|emb|CCI35905.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 333

 Score = 40.0 bits (92), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 25/55 (45%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADLR A     N   AN T A++  SDF G+   GA L K  A KANF
Sbjct: 199 SYADLRGADLRGADLRCANLEGANLTGANLNCSDFEGANLTGADLSKTDANKANF 253


>gi|148548300|ref|YP_001268402.1| pentapeptide repeat-containing protein [Pseudomonas putida F1]
 gi|395448857|ref|YP_006389110.1| pentapeptide repeat-containing protein [Pseudomonas putida ND6]
 gi|148512358|gb|ABQ79218.1| pentapeptide repeat protein [Pseudomonas putida F1]
 gi|388562854|gb|AFK71995.1| pentapeptide repeat-containing protein [Pseudomonas putida ND6]
          Length = 219

 Score = 39.7 bits (91), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|332705327|ref|ZP_08425405.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
 gi|332355687|gb|EGJ35149.1| hypothetical protein LYNGBM3L_08020 [Moorea producens 3L]
          Length = 221

 Score = 39.7 bits (91), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 27/53 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A    ADLR  +    + R AN T AD+R +D  G+   GA L +A   +AN 
Sbjct: 111 AILTRADLRLTILQDTDLRGANLTRADLRYADLRGANLTGACLHQADLTRANL 163


>gi|434396750|ref|YP_007130754.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428267847|gb|AFZ33788.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 331

 Score = 39.7 bits (91), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 5/56 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ++L KA  ++ NF RAN T A + ++D S     GA L  A+  K N T
Sbjct: 65  SGADLSQSNLEKAQLIETNFSRANLTEASLIQADLS-----GAILSSAIGTKTNLT 115


>gi|126661305|ref|ZP_01732374.1| hypothetical protein CY0110_08576 [Cyanothece sp. CCY0110]
 gi|126617401|gb|EAZ88201.1| hypothetical protein CY0110_08576 [Cyanothece sp. CCY0110]
          Length = 368

 Score = 39.7 bits (91), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 34/70 (48%), Gaps = 5/70 (7%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
             +    +L  A   + NFR AN T AD+ E     + FSG+  +GAYL  A   KA+F 
Sbjct: 246 GTELSGVELNGANLTQSNFRGANLTDADLSEAILSYTRFSGADLSGAYLGNANLQKADFY 305

Query: 165 VDEICLPLLV 174
              + L  L+
Sbjct: 306 RSSLALANLI 315


>gi|86608820|ref|YP_477582.1| pentapeptide repeat-containing protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
 gi|86557362|gb|ABD02319.1| pentapeptide repeat family protein [Synechococcus sp.
           JA-2-3B'a(2-13)]
          Length = 328

 Score = 39.7 bits (91), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 30/53 (56%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
            G A L+KA  V  N   AN + AD+ E+D   ++ +GA L+ A  + AN T+
Sbjct: 52  LGRAKLQKANLVGANLGGANLSQADLSEADLRDAQLHGATLQGADLHGANLTL 104


>gi|116073351|ref|ZP_01470613.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
 gi|116068656|gb|EAU74408.1| hypothetical protein RS9916_32912 [Synechococcus sp. RS9916]
          Length = 167

 Score = 39.7 bits (91), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 5/62 (8%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRES-----DFSGSKFNGAYLEKAVAYKA 161
           +G  A F  ADL  A+  +  F  A+F+ AD+ +S     DFSG+    A L   +A  +
Sbjct: 62  VGRGADFSGADLHGAIFTQGAFAEADFSDADLSDSLMDRADFSGTNLTNALLNGVIASGS 121

Query: 162 NF 163
           +F
Sbjct: 122 SF 123


>gi|347735787|ref|ZP_08868588.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
 gi|346920906|gb|EGY01818.1| pentapeptide repeat family protein [Azospirillum amazonense Y2]
          Length = 451

 Score = 39.7 bits (91), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 7/70 (10%)

Query: 117 DLRKAVHVKENFRRANFTS-----ADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
           DLR A+ VK +   ++ T      AD+ E++ SG+K +GA L +A+   AN +   +   
Sbjct: 178 DLRGAIFVKADLSGSDLTGCNLEGADLSEANLSGTKLDGAVLTRALLRSANLSKASLLGA 237

Query: 172 LL--VSLPMA 179
           LL  V L MA
Sbjct: 238 LLDDVDLSMA 247


>gi|320156222|ref|YP_004188601.1| hypothetical protein VVMO6_01376 [Vibrio vulnificus MO6-24/O]
 gi|319931534|gb|ADV86398.1| hypothetical protein VVMO6_01376 [Vibrio vulnificus MO6-24/O]
          Length = 689

 Score = 39.7 bits (91), Expect = 0.67,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 28/60 (46%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           S A   SAD + ++ V  NF +A+ T AD    DF+ +   GA L      +A  T   I
Sbjct: 607 SKASLDSADFKSSIFVNANFEKADLTQADFGGCDFTNANLQGAELSGCDLTQARLTSSNI 666


>gi|434388230|ref|YP_007098841.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
 gi|428019220|gb|AFY95314.1| putative low-complexity protein [Chamaesiphon minutus PCC 6605]
          Length = 193

 Score = 39.7 bits (91), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 21/57 (36%), Positives = 26/57 (45%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           G  A    ADLR A     N  + N   AD+R +D +G    GA L +A    AN T
Sbjct: 97  GDRASLHKADLRLASLQGANLSQVNLVGADLRYADLTGVNLTGANLSRANLTGANLT 153


>gi|26989392|ref|NP_744817.1| pentapeptide repeat-containing protein [Pseudomonas putida KT2440]
 gi|24984254|gb|AAN68281.1|AE016462_7 pentapeptide repeat family protein [Pseudomonas putida KT2440]
          Length = 219

 Score = 39.7 bits (91), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|386012542|ref|YP_005930819.1| Pentapeptide repeat-containing protein [Pseudomonas putida BIRD-1]
 gi|313499248|gb|ADR60614.1| Pentapeptide repeat-containing protein [Pseudomonas putida BIRD-1]
          Length = 219

 Score = 39.7 bits (91), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LEKA    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLEKARLQGANLT 93


>gi|156081718|ref|XP_001608352.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148800923|gb|EDL42328.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 1301

 Score = 39.7 bits (91), Expect = 0.69,   Method: Composition-based stats.
 Identities = 19/66 (28%), Positives = 34/66 (51%)

Query: 97  YEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           Y     G   +G+ A    AD+ +A  ++ +F RA+F  AD   +D + + FN A + +A
Sbjct: 33  YREALPGRAALGTEADLSRADVSRADAIRADFNRADFNRADFNRADVNRADFNRADVSRA 92

Query: 157 VAYKAN 162
              +A+
Sbjct: 93  NFNRAD 98


>gi|434398137|ref|YP_007132141.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428269234|gb|AFZ35175.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 223

 Score = 39.7 bits (91), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 4/72 (5%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           ADL + + +     G    A    ADL K      N  RAN   AD+ +++ +G+   GA
Sbjct: 129 ADLERADLQKTNLIG----ANLQGADLGKTNIAGANLERANLFDADLEKANLAGTNLAGA 184

Query: 152 YLEKAVAYKANF 163
            L+KA   K N 
Sbjct: 185 NLQKADLEKTNL 196


>gi|298489886|ref|YP_003720063.1| pentapeptide repeat-containing protein ['Nostoc azollae' 0708]
 gi|298231804|gb|ADI62940.1| pentapeptide repeat protein ['Nostoc azollae' 0708]
          Length = 256

 Score = 39.7 bits (91), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 28/54 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A    ADLR A     N  RAN T AD+R ++ +G+   G  L +A   +AN T
Sbjct: 51  ADLSGADLRGANLEGANLSRANLTGADLRSANLAGASLFGVNLSRAKLNEANLT 104


>gi|113477694|ref|YP_723755.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110168742|gb|ABG53282.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 204

 Score = 39.7 bits (91), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 1/52 (1%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            F  A+L+KA  ++ N R A+FT AD+R +DF  +   GA L  A   +A+F
Sbjct: 53  NFAGANLQKA-KLRANLRGADFTGADLRGADFRNADLRGAILIDAQLREASF 103


>gi|443328868|ref|ZP_21057461.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442791604|gb|ELS01098.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 266

 Score = 39.7 bits (91), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 28/54 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A    ADLR+A  ++ N    + + AD+R ++  G    GA L KA   +AN +
Sbjct: 153 ADLNDADLREAQLIRANLSEVDLSGADLRAANLKGVNLRGADLNKADLSRANLS 206


>gi|242277903|ref|YP_002990032.1| pentapeptide repeat-containing protein [Desulfovibrio salexigens DSM
            2638]
 gi|242120797|gb|ACS78493.1| pentapeptide repeat protein [Desulfovibrio salexigens DSM 2638]
          Length = 1277

 Score = 39.7 bits (91), Expect = 0.73,   Method: Composition-based stats.
 Identities = 21/58 (36%), Positives = 31/58 (53%)

Query: 106  GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             IG +A F  A LR+A   +  F +A F  +D+ E++ + + F GA   KAV    NF
Sbjct: 1004 AIGMSADFSKASLRRADLSRGLFNKALFVESDLSEANGAQAIFKGAQFPKAVLRDTNF 1061


>gi|425452313|ref|ZP_18832131.1| Genome sequencing data, contig C306 [Microcystis aeruginosa PCC
           7941]
 gi|389765978|emb|CCI08285.1| Genome sequencing data, contig C306 [Microcystis aeruginosa PCC
           7941]
          Length = 188

 Score = 39.7 bits (91), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 27/42 (64%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY 152
           +Q    +LR A  +  N RRANFT AD+ +++F+G++ +  Y
Sbjct: 128 SQLMDVNLRGASLINANIRRANFTGADVTDTNFTGAQCSDGY 169


>gi|418939008|ref|ZP_13492446.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
 gi|375054283|gb|EHS50653.1| pentapeptide repeat protein, partial [Rhizobium sp. PDO1-076]
          Length = 229

 Score = 39.7 bits (91), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 30/56 (53%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           + A    A+LR A     NF RA+  SAD+R +D  G+ F GA LE AV    + T
Sbjct: 63  TEANLKGANLRGADCDGANFTRADLKSADLRWADCDGANFTGANLESAVLQHTDLT 118


>gi|144900552|emb|CAM77416.1| low-complexity proteins [Magnetospirillum gryphiswaldense MSR-1]
          Length = 433

 Score = 39.7 bits (91), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 3/84 (3%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRAN 132
           L+ A++A+ S   + L+D    E+   G    + +  AAQ G A+L  A     + R AN
Sbjct: 300 LSGAILANASFREADLSDAFMAESRLDGADFRYAVLGAAQLGGANLGVAQLRHADMRLAN 359

Query: 133 FTSADMRESDFSGSKFNGAYLEKA 156
              A +R +D SG++ +GA L  A
Sbjct: 360 LEGAQLRGADLSGARLSGAKLSGA 383


>gi|443326309|ref|ZP_21054967.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
 gi|442794049|gb|ELS03478.1| putative low-complexity protein [Xenococcus sp. PCC 7305]
          Length = 366

 Score = 39.7 bits (91), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 42/93 (45%), Gaps = 5/93 (5%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-----R 130
           L   +  + +  IS LA+L K + +T    G   AA     DL  A   K N R      
Sbjct: 211 LIEQIYIAKTEQISELAELAKLDLKTDLAGGNLLAANLAGIDLNGANLQKTNLRGVILND 270

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A+ +  ++R ++  G+  +GAYLE A    AN 
Sbjct: 271 ADLSETNLRHANLGGADLSGAYLENADLTHANL 303


>gi|428777412|ref|YP_007169199.1| pentapeptide repeat-containing protein [Halothece sp. PCC 7418]
 gi|428691691|gb|AFZ44985.1| pentapeptide repeat protein [Halothece sp. PCC 7418]
          Length = 333

 Score = 39.7 bits (91), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 34/63 (53%), Gaps = 2/63 (3%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           S A   +ADL KA     N R AN   A++  +D SG+   GAYL +A  ++A  ++D +
Sbjct: 198 SEANLFNADLSKANLKGANLRGANLIRANLERADLSGADLRGAYLNEAKMFEA--SLDNV 255

Query: 169 CLP 171
            L 
Sbjct: 256 NLS 258


>gi|376001358|ref|ZP_09779228.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375330187|emb|CCE14981.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 351

 Score = 39.3 bits (90), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 29/56 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A    AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLTGANLTGANLNGANLTGANLTGANLT 245


>gi|297569025|ref|YP_003690369.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
 gi|296924940|gb|ADH85750.1| pentapeptide repeat protein [Desulfurivibrio alkaliphilus AHT2]
          Length = 830

 Score = 39.3 bits (90), Expect = 0.78,   Method: Composition-based stats.
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 7/79 (8%)

Query: 90  ALADLNKYEAE----TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
           ALADL   +      +R  F   S A+   ADLR+ +  + +FR A+   AD RE+    
Sbjct: 225 ALADLGGADLRRADLSRANF---SQARLRQADLRQVLFSESDFRHADARRADFREATLRQ 281

Query: 146 SKFNGAYLEKAVAYKANFT 164
           + F+GA L +A+    + T
Sbjct: 282 ANFSGADLSRAIFSGTDLT 300


>gi|16331545|ref|NP_442273.1| hypothetical protein slr0719 [Synechocystis sp. PCC 6803]
 gi|383323287|ref|YP_005384141.1| hypothetical protein SYNGTI_2379 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383326456|ref|YP_005387310.1| hypothetical protein SYNPCCP_2378 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383492340|ref|YP_005410017.1| hypothetical protein SYNPCCN_2378 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437608|ref|YP_005652333.1| hypothetical protein SYNGTS_2380 [Synechocystis sp. PCC 6803]
 gi|451815697|ref|YP_007452149.1| hypothetical protein MYO_124040 [Synechocystis sp. PCC 6803]
 gi|1001199|dbj|BAA10343.1| slr0719 [Synechocystis sp. PCC 6803]
 gi|339274641|dbj|BAK51128.1| hypothetical protein SYNGTS_2380 [Synechocystis sp. PCC 6803]
 gi|359272607|dbj|BAL30126.1| hypothetical protein SYNGTI_2379 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275777|dbj|BAL33295.1| hypothetical protein SYNPCCN_2378 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278947|dbj|BAL36464.1| hypothetical protein SYNPCCP_2378 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|407961067|dbj|BAM54307.1| hypothetical protein BEST7613_5376 [Bacillus subtilis BEST7613]
 gi|451781666|gb|AGF52635.1| hypothetical protein MYO_124040 [Synechocystis sp. PCC 6803]
          Length = 388

 Score = 39.3 bits (90), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 5/70 (7%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSA-----DMRESDFSGSKFNGAYLEKAVAYKANFT 164
            A   S DL     +  NFR A  T +     D+R S+F G+  +GAYLE A   + +F 
Sbjct: 259 GANLNSIDLSNGQLMDSNFRGAILTDSDLSNTDLRRSNFRGADLSGAYLEGANLSQVDFR 318

Query: 165 VDEICLPLLV 174
              + L  L+
Sbjct: 319 KSSLALATLI 328


>gi|162455067|ref|YP_001617434.1| hypothetical protein sce6785 [Sorangium cellulosum So ce56]
 gi|161165649|emb|CAN96954.1| hypothetical protein sce6785 [Sorangium cellulosum So ce56]
          Length = 973

 Score = 39.3 bits (90), Expect = 0.82,   Method: Composition-based stats.
 Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEF---GIGSAAQFGSADLRKAVHVKEN 127
           F     + A +A  +   ++LA  +  +A+ RG        + A+   A+L +A+  + N
Sbjct: 854 FAGADFSGATLAGANLMGTSLAGTDLSDADLRGALLNEADLTEARLDRANLAEAMLTRAN 913

Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
             RA+  +AD+R+S  + ++  GA  EKA  + A
Sbjct: 914 LTRASLYAADLRQSILNSARVEGASFEKASLFSA 947


>gi|116753519|ref|YP_842637.1| pentapeptide repeat-containing protein [Methanosaeta thermophila
           PT]
 gi|116664970|gb|ABK13997.1| pentapeptide repeat protein [Methanosaeta thermophila PT]
          Length = 862

 Score = 39.3 bits (90), Expect = 0.83,   Method: Composition-based stats.
 Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 2/65 (3%)

Query: 101 TRGE-FGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           TR E FG   S      ADL KA  ++ N   A+ T A + ++DFSG+   GA + + V 
Sbjct: 669 TRAELFGADLSGTDLSGADLVKAYALRANLSGADLTDAKLDDADFSGAILRGAKMPELVI 728

Query: 159 YKANF 163
              NF
Sbjct: 729 RSVNF 733



 Score = 38.5 bits (88), Expect = 1.5,   Method: Composition-based stats.
 Identities = 20/52 (38%), Positives = 26/52 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
           A F  A LR A   +   R  NF  AD+ ++D SG +F   Y+  AV   AN
Sbjct: 711 ADFSGAILRGAKMPELVIRSVNFGQADLSDADMSGCRFEALYVSNAVMRSAN 762


>gi|320353524|ref|YP_004194863.1| pentapeptide repeat-containing protein [Desulfobulbus propionicus
           DSM 2032]
 gi|320122026|gb|ADW17572.1| pentapeptide repeat protein [Desulfobulbus propionicus DSM 2032]
          Length = 342

 Score = 39.3 bits (90), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 34/100 (34%), Positives = 44/100 (44%), Gaps = 3/100 (3%)

Query: 92  ADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           ADL + + E     G   + AQ   ADL  A   K N R AN   AD+  +D  G+   G
Sbjct: 67  ADLRQSKLENANLEGANLTGAQLSLADLSGANLKKANLRNANLHGADLAYADLEGANLTG 126

Query: 151 AYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAGFCAP 190
           A LE A+ +KA      I   LL +     P  PA   +P
Sbjct: 127 ASLEGAI-FKATKMKGRIVNRLLHA-DQVRPETPAAPVSP 164


>gi|167034127|ref|YP_001669358.1| pentapeptide repeat-containing protein [Pseudomonas putida GB-1]
 gi|166860615|gb|ABY99022.1| pentapeptide repeat protein [Pseudomonas putida GB-1]
          Length = 219

 Score = 39.3 bits (90), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 32/58 (55%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           I  A+Q   A+LR A    ++ R+ N   AD+R++D   ++ + A LE+A    AN T
Sbjct: 36  IAEASQCPGANLRGANLANQDLRKMNLAGADLRDADLRHAQLDLANLERARLQGANLT 93


>gi|118592119|ref|ZP_01549513.1| hypothetical protein SIAM614_25622 [Stappia aggregata IAM 12614]
 gi|118435415|gb|EAV42062.1| hypothetical protein SIAM614_25622 [Labrenzia aggregata IAM 12614]
          Length = 275

 Score = 39.3 bits (90), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 38/73 (52%), Gaps = 8/73 (10%)

Query: 99  AETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAY--- 152
           AE RG   E G  +       DL++A+    NF+ ++F   +   +DFSGS F+GA    
Sbjct: 50  AELRGLVLENGDFAGTNLREVDLKEAMLPNANFKNSDFRRTEAERADFSGSDFSGANMRS 109

Query: 153 --LEKAVAYKANF 163
             LEKA   KANF
Sbjct: 110 VDLEKANLNKANF 122



 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 39/82 (47%), Gaps = 14/82 (17%)

Query: 92  ADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESD--------- 142
           +D  + EAE R +F   S + F  A++R     K N  +ANF  AD+R+ D         
Sbjct: 85  SDFRRTEAE-RADF---SGSDFSGANMRSVDLEKANLNKANFQDADLRDGDLNTVEANEA 140

Query: 143 -FSGSKFNGAYLEKAVAYKANF 163
            F G+        ++VA KA+F
Sbjct: 141 IFDGADMRNVLFTRSVANKASF 162


>gi|334118424|ref|ZP_08492513.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333459431|gb|EGK88044.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 479

 Score = 39.3 bits (90), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 31/55 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADL ++     N  RA+ T A +RE++  G++F GA L++A   KAN 
Sbjct: 60  SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGAEFTGANLKQASLIKANL 114



 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 31/53 (58%), Gaps = 5/53 (9%)

Query: 110 AAQFGSADLRKAVHVK-----ENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
            A+F  A+L++A  +K      N   AN T A++  +D  GS+ +GA L+KAV
Sbjct: 96  GAEFTGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAV 148


>gi|302556667|ref|ZP_07309009.1| pentapeptide repeats-containing protein [Streptomyces griseoflavus
           Tu4000]
 gi|302474285|gb|EFL37378.1| pentapeptide repeats-containing protein [Streptomyces griseoflavus
           Tu4000]
          Length = 355

 Score = 39.3 bits (90), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 40/93 (43%), Gaps = 16/93 (17%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF------- 163
           A   +A LR+A  V  + R A     D+R++DF+G+    A L KA A+ A F       
Sbjct: 226 ADLTTAVLRRARCVLADLRAAKLVETDLRDADFTGTDLREANLRKAGAHGAVFQRADLRM 285

Query: 164 ---------TVDEICLPLLVSLPMATPVFPAGF 187
                    T D +   L  +L      +PAGF
Sbjct: 286 ADLRGTDLSTADLVAARLTGALASERTRWPAGF 318


>gi|158337660|ref|YP_001518836.1| pentapeptide repeat-containing serine/threonine kinase
           [Acaryochloris marina MBIC11017]
 gi|158307901|gb|ABW29518.1| serine/threonine kinase with pentapeptide repeats [Acaryochloris
           marina MBIC11017]
          Length = 532

 Score = 39.3 bits (90), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 10/69 (14%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
            +F + DLR A+ +  NF RANFT A++R           +D + +   GA L  A    
Sbjct: 428 GKFQNTDLRDAILINANFGRANFTGANLRNANLMQAYMSHADLANADLRGANLSDAYLSH 487

Query: 161 ANFTVDEIC 169
           AN     +C
Sbjct: 488 ANLRGANLC 496


>gi|193213002|ref|YP_001998955.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
           8327]
 gi|193086479|gb|ACF11755.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 193

 Score = 39.3 bits (90), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 31/128 (24%), Positives = 49/128 (38%), Gaps = 11/128 (8%)

Query: 35  WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
           +V C ++    S   F +CS  QC    AKL      + T         C       +D 
Sbjct: 34  FVQCNLAQADLSGFMFRECSFEQCDMGLAKL------IDTGFQEVKFIDCKLLGVQFSDC 87

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
            K   E   +  I   + F + DL+  V     F   +   AD  E++ +GS+F+   L 
Sbjct: 88  RKLLLEINFKRCILKLSVFTNLDLKNTV-----FDDCDMQEADFTEANLTGSRFDNCDLR 142

Query: 155 KAVAYKAN 162
            A+ +  N
Sbjct: 143 LAIFFHTN 150


>gi|354556796|ref|ZP_08976083.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|353551246|gb|EHC20655.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 253

 Score = 39.3 bits (90), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 8/98 (8%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKE 126
           + ++    + V    + N + L D N  +A+  G    +   S A   SA+LR A     
Sbjct: 57  ILLNLRFTSKVTKKANLNYADLKDHNLSKADLSGADLNYANLSGANLTSANLRYA----- 111

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           N R A+ + AD+ E++F+ +  +GA L  A   +AN T
Sbjct: 112 NLRGADLSGADLSETNFTYANLSGASLRYANLSRANLT 149


>gi|172055186|ref|YP_001806513.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|171701467|gb|ACB54447.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
          Length = 280

 Score = 39.3 bits (90), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 28/98 (28%), Positives = 47/98 (47%), Gaps = 8/98 (8%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRG---EFGIGSAAQFGSADLRKAVHVKE 126
           + ++    + V    + N + L D N  +A+  G    +   S A   SA+LR A     
Sbjct: 84  ILLNLRFTSKVTKKANLNYADLKDHNLSKADLSGADLNYANLSGANLTSANLRYA----- 138

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           N R A+ + AD+ E++F+ +  +GA L  A   +AN T
Sbjct: 139 NLRGADLSGADLSETNFTYANLSGASLRYANLSRANLT 176


>gi|427714529|ref|YP_007063153.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
 gi|427378658|gb|AFY62610.1| putative low-complexity protein [Synechococcus sp. PCC 6312]
          Length = 333

 Score = 39.3 bits (90), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 32/59 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
           A  G A+ R+A     + R AN T AD+ ES    +  +GA LEKA+   A+ T+ ++ 
Sbjct: 53  ALLGRANFRRANLAGADLRGANLTQADLTESLLQEANLHGASLEKAILVGADITLADLT 111


>gi|443314265|ref|ZP_21043839.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
 gi|442786137|gb|ELR95903.1| putative low-complexity protein [Leptolyngbya sp. PCC 6406]
          Length = 887

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 32/56 (57%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S ++F  A L  A  ++ +  R N    ++ E++F  ++F+GA L  +VA KANF 
Sbjct: 233 SESEFRGAKLAHAKFIRADLSRTNLIRTNLAEANFERARFHGANLNNSVAKKANFN 288


>gi|359459150|ref|ZP_09247713.1| pentapeptide repeat-containing serine/threonine kinase
           [Acaryochloris sp. CCMEE 5410]
          Length = 514

 Score = 38.9 bits (89), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 10/69 (14%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR----------ESDFSGSKFNGAYLEKAVAYK 160
            +F + DLR A+ +  NF RANFT A++R           +D + +   GA L  A    
Sbjct: 410 GKFQNTDLRDAILINANFGRANFTGANLRNANLMQAYMSHADLANADLRGANLSDAYLSH 469

Query: 161 ANFTVDEIC 169
           AN     +C
Sbjct: 470 ANLRGANLC 478


>gi|427718922|ref|YP_007066916.1| peptidase C14 caspase catalytic subunit p20 [Calothrix sp. PCC
           7507]
 gi|427351358|gb|AFY34082.1| peptidase C14 caspase catalytic subunit p20 [Calothrix sp. PCC
           7507]
          Length = 1102

 Score = 38.9 bits (89), Expect = 1.1,   Method: Composition-based stats.
 Identities = 22/56 (39%), Positives = 28/56 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S+A  G ADLR A       R AN + AD+R +D  G+   GA L  A    AN +
Sbjct: 839 SSANLGGADLRGADLSSAYLRGANLSYADLRGADLRGADLRGADLRGANLSSANLS 894



 Score = 38.1 bits (87), Expect = 1.8,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 85   SSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS 144
            S+N+S  ADL+  +  +  + G   +A  GSA+L +A   + N  RAN +SAD+ +++ S
Sbjct: 971  SANLSG-ADLSDADLSS-ADLG---SADLGSANLSRANLSRANLSRANLSSADLSDANLS 1025

Query: 145  GSKFNGAYLEKAVAYKANFT 164
             +  +   L  A   +AN +
Sbjct: 1026 SANLSSTDLSSADLRRANLS 1045


>gi|118593941|ref|ZP_01551297.1| PipB-like protein [Stappia aggregata IAM 12614]
 gi|118433481|gb|EAV40152.1| PipB-like protein [Stappia aggregata IAM 12614]
          Length = 162

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            AA    A  + ++ V   F  AN TS +M  SDF+G+ F  A +   +A + NF
Sbjct: 7   DAANLTGASFKNSIGVNATFIEANLTSVEMNNSDFTGADFTKADMRHVIASETNF 61



 Score = 35.8 bits (81), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 30/62 (48%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVD 166
           IG  A F  A+L        +F  A+FT ADMR    S + F  A  + AVA  ANF   
Sbjct: 20  IGVNATFIEANLTSVEMNNSDFTGADFTKADMRHVIASETNFQEATFKDAVAINANFVAA 79

Query: 167 EI 168
           +I
Sbjct: 80  DI 81


>gi|392384479|ref|YP_005033675.1| putative Pentapeptide repeat family protein [Azospirillum
           brasilense Sp245]
 gi|356881194|emb|CCD02176.1| putative Pentapeptide repeat family protein [Azospirillum
           brasilense Sp245]
          Length = 428

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 29/50 (58%)

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMAT 180
           AN + AD+R +DFS +K  GA L  AV   A F   ++    L ++PMAT
Sbjct: 180 ANLSGADLRGADFSMAKLKGAILNNAVVAGATFQGADLRDAELRNVPMAT 229


>gi|389694674|ref|ZP_10182768.1| putative low-complexity protein [Microvirga sp. WSM3557]
 gi|388588060|gb|EIM28353.1| putative low-complexity protein [Microvirga sp. WSM3557]
          Length = 251

 Score = 38.9 bits (89), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 61/146 (41%), Gaps = 29/146 (19%)

Query: 33  PLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRV--------------FVSTALAA 78
           P W  CQ       DG  P    + C+     L N  +              F S+ +A 
Sbjct: 22  PAWAKCQ-------DGPGPGVDWSGCSKARLMLTNEDLTGTNFQRSLLTLSDFASSKMAG 74

Query: 79  AVVASCSSNISAL--ADLNKYEAET----RGEFGIG--SAAQFGSADLRKAVHVKENFRR 130
           A ++    + +    ADL+K         R  FG    + A FGSAD+ ++   +     
Sbjct: 75  ANLSETEVSRTRFEGADLSKANFTKALGWRANFGQANLTGADFGSADMNRSNFAQVKAAG 134

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKA 156
           ANF+ +++  SDFSG+  +GA + KA
Sbjct: 135 ANFSKSELNRSDFSGADLSGANISKA 160


>gi|167645176|ref|YP_001682839.1| pentapeptide repeat-containing protein [Caulobacter sp. K31]
 gi|167347606|gb|ABZ70341.1| pentapeptide repeat protein [Caulobacter sp. K31]
          Length = 419

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 29/51 (56%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           I + A F  A L+ A  V+ N ++ANF  A++  +D SG+   GA L  AV
Sbjct: 166 IATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLTGADLRNAV 216


>gi|288957355|ref|YP_003447696.1| hypothetical protein AZL_005140 [Azospirillum sp. B510]
 gi|288909663|dbj|BAI71152.1| hypothetical protein AZL_005140 [Azospirillum sp. B510]
          Length = 450

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 24/41 (58%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           ADLRKA  V  N   A+ T AD+ E+D +G+   GA L  A
Sbjct: 395 ADLRKANLVGANLAGADLTGADLSEADLTGADLTGAMLTGA 435


>gi|218439290|ref|YP_002377619.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218172018|gb|ACK70751.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 231

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 27/49 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           S A F  AD R +   K NF  A F  AD+ E+   G+ F GA LEKA+
Sbjct: 33  SGADFSKADFRSSRLGKTNFAYACFFGADLSEAILWGTDFTGANLEKAI 81


>gi|166363932|ref|YP_001656205.1| pentapeptide repeat-containing protein [Microcystis aeruginosa
           NIES-843]
 gi|166086305|dbj|BAG01013.1| pentapeptide repeat family protein [Microcystis aeruginosa
           NIES-843]
          Length = 164

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF    +
Sbjct: 52  NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 111

Query: 169 CL 170
           CL
Sbjct: 112 CL 113


>gi|418019711|ref|ZP_12659144.1| putative low-complexity protein [Candidatus Regiella insecticola
           R5.15]
 gi|347604938|gb|EGY29471.1| putative low-complexity protein [Candidatus Regiella insecticola
           R5.15]
          Length = 381

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 26/45 (57%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           DL K    + N  +AN T A++RE D +G+   GA LE+A   +A
Sbjct: 84  DLSKMDLSRVNLEKANLTGANLREMDLTGANLTGANLERARLVRA 128


>gi|209964001|ref|YP_002296916.1| pentapeptide repeat-containing protein [Rhodospirillum centenum SW]
 gi|209957467|gb|ACI98103.1| pentapeptide repeat family protein [Rhodospirillum centenum SW]
          Length = 433

 Score = 38.9 bits (89), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 31/61 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
           A    A L KA  V+ N R AN + AD+R +D +G+    A L  A+  +A  T   + L
Sbjct: 367 ANLSGAKLVKASLVRANLRNANLSGADLRGADLTGANLIDANLRGALLDEAVLTGAALPL 426

Query: 171 P 171
           P
Sbjct: 427 P 427


>gi|399075150|ref|ZP_10751398.1| putative low-complexity protein [Caulobacter sp. AP07]
 gi|398039446|gb|EJL32581.1| putative low-complexity protein [Caulobacter sp. AP07]
          Length = 380

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/51 (39%), Positives = 29/51 (56%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           I + A F  A L+ A  V+ N ++ANF  A++  +D SG+   GA L  AV
Sbjct: 127 IATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLTGADLRNAV 177


>gi|337286774|ref|YP_004626247.1| Ion transport 2 domain-containing protein [Thermodesulfatator
           indicus DSM 15286]
 gi|335359602|gb|AEH45283.1| Ion transport 2 domain protein [Thermodesulfatator indicus DSM
           15286]
          Length = 304

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 32/65 (49%), Gaps = 20/65 (30%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFT--------------------SADMRESDFSGSKF 148
            AA FG A+L+KA      FR A+FT                     AD+RE+DFSG+KF
Sbjct: 68  EAAGFGMANLKKARLFNAKFRHASFTKATLKGADAKCADFSLARLREADLREADFSGAKF 127

Query: 149 NGAYL 153
             A+L
Sbjct: 128 KEAHL 132



 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 28/52 (53%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            F   DL  A   + N +RA FT A+++ +DF+G+   GA LE   A  A F
Sbjct: 21  DFSGEDLAGAKFFRANLKRALFTGANLKGADFTGADLEGANLEGVDAEAAGF 72


>gi|332711043|ref|ZP_08430978.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332350169|gb|EGJ29774.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 343

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 9/100 (9%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFR 129
            +   LA A++   S N + L   N   A+ T+      + A   +A L KA+ ++ N  
Sbjct: 170 LIDIDLANAILHQASLNDAELTGANLTGADLTKANL---ARANLNTAKLSKALLIRANLS 226

Query: 130 RANFT-----SADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           + N +     +AD+R +D SG+ F GA L  A    AN T
Sbjct: 227 KTNLSITELRNADLRNADLSGANFMGADLTGADLTSANLT 266


>gi|291571459|dbj|BAI93731.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 351

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 29/56 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    ADL ++V    NF  AN T A++  ++ +G+  NGA L  A    AN T
Sbjct: 190 SGANLTGADLSESVIQNSNFCIANLTGANLAGANLAGANLNGANLTGANLTGANLT 245


>gi|256397701|ref|YP_003119265.1| pentapeptide repeat-containing protein [Catenulispora acidiphila
           DSM 44928]
 gi|256363927|gb|ACU77424.1| pentapeptide repeat protein [Catenulispora acidiphila DSM 44928]
          Length = 354

 Score = 38.5 bits (88), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 9/94 (9%)

Query: 70  VFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKEN-- 127
           V VS   A   +A  +     LADL       R    + + A F  ADLR+AV  K    
Sbjct: 218 VSVSLQHAEMRLAKLTEARCVLADLRG----ARMAEAVLNGADFTRADLREAVLRKTQAQ 273

Query: 128 ---FRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
              F  A+  +AD+R +D S ++ +GA  E AVA
Sbjct: 274 NTVFHHADLRNADLRGADLSSAELDGARFEGAVA 307


>gi|407684714|ref|YP_006799888.1| pentapeptide repeat-containing protein [Alteromonas macleodii str.
           'English Channel 673']
 gi|407246325|gb|AFT75511.1| pentapeptide repeat-containing protein [Alteromonas macleodii str.
           'English Channel 673']
          Length = 451

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 31/58 (53%), Gaps = 5/58 (8%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           GIG  + F SADLRKA     N RRA+   A M +S+ + ++ +    ++A    A F
Sbjct: 267 GIGQLSLFDSADLRKA-----NLRRADIRQAQMNQSNLNDAELDYTIFDRAQLQSAQF 319


>gi|425467653|ref|ZP_18846932.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9809]
 gi|389829528|emb|CCI29082.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9809]
          Length = 220

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF    +
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 167

Query: 169 CL 170
           CL
Sbjct: 168 CL 169


>gi|126655992|ref|ZP_01727376.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
 gi|126622272|gb|EAZ92978.1| hypothetical protein CY0110_02879 [Cyanothece sp. CCY0110]
          Length = 319

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           Q   ADLR       +FR  +F+ A++RE DF+G+    AYL +A     N T
Sbjct: 25  QLRRADLRGLNLSNTDFRGVDFSYANLREVDFTGADLRDAYLNEADLTGVNLT 77


>gi|87302980|ref|ZP_01085784.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
 gi|87282476|gb|EAQ74435.1| hypothetical protein WH5701_07396 [Synechococcus sp. WH 5701]
          Length = 203

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 5/59 (8%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKANFT 164
           A F  ADL  ++  +  F R++F+ AD     M  +DFSG+  +GA L   +A  ++F+
Sbjct: 101 ADFSGADLHGSILTQAAFLRSDFSGADLSDALMDRADFSGTDLSGALLRGVIAAGSSFS 159


>gi|254417642|ref|ZP_05031376.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196175560|gb|EDX70590.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 436

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +A    ADLR+A     + R AN + AD+RE++ SG+    A L  A   +A F
Sbjct: 348 SADLSDADLREANLSGADLREANLSGADLREANLSGADLREANLSGANVKQAKF 401


>gi|378719423|ref|YP_005284312.1| pentapeptide repeat-containing protein [Gordonia polyisoprenivorans
           VH2]
 gi|375754126|gb|AFA74946.1| pentapeptide repeat family protein [Gordonia polyisoprenivorans
           VH2]
          Length = 481

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 27/54 (50%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            A F  AD R A     + R AN T A++ +  F+G+   GA L  A   +ANF
Sbjct: 394 GASFVGADGRLASFTGADLRGANLTGANLSQGSFTGANLTGANLSGANLTEANF 447


>gi|254425612|ref|ZP_05039329.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196188035|gb|EDX83000.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 215

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 31/56 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A   SADL +A   + N R A+ +SAD+R +D  G+K  GA L  A    AN T
Sbjct: 68  SGADLRSADLFRADLSEANLRSADLSSADLRGADLPGAKLIGANLIGANLSIANVT 123


>gi|425471163|ref|ZP_18850023.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9701]
 gi|389882952|emb|CCI36586.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9701]
          Length = 220

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 35/62 (56%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF    +
Sbjct: 108 NGVNLNSANLQQAVLIDADFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 167

Query: 169 CL 170
           CL
Sbjct: 168 CL 169


>gi|384246084|gb|EIE19575.1| hypothetical protein COCSUDRAFT_31020 [Coccomyxa subellipsoidea
           C-169]
          Length = 203

 Score = 38.5 bits (88), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 3/64 (4%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV---AYKANFTVD 166
            A F  AD+  AV  + +FR+AN ++     +  +G+ F+GA L+ A+   A   N  V 
Sbjct: 123 GANFSGADMTNAVIDRVDFRKANLSNVKFINAVITGTAFDGANLDGAIFEDALIGNEDVK 182

Query: 167 EICL 170
            +CL
Sbjct: 183 RLCL 186


>gi|307152500|ref|YP_003887884.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306982728|gb|ADN14609.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 305

 Score = 38.5 bits (88), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 40/75 (53%), Gaps = 5/75 (6%)

Query: 90  ALADL-NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
           A+ DL NKY+A  R      S  +    DLR     + NF+ A+F+ A++RE DFSG+  
Sbjct: 6   AVIDLKNKYDAGERN----FSKIELRRVDLRGFNLSQANFKGADFSYANLREVDFSGADL 61

Query: 149 NGAYLEKAVAYKANF 163
           + A+  +A    AN 
Sbjct: 62  SEAFFNEADLTGANL 76


>gi|451981277|ref|ZP_21929641.1| putative Pentapeptide repeat protein [Nitrospina gracilis 3/211]
 gi|451761500|emb|CCQ90895.1| putative Pentapeptide repeat protein [Nitrospina gracilis 3/211]
          Length = 484

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 2/89 (2%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  AV+   S   SALA  +  +A+ +G   +   A    A LR A  VK + + A+   
Sbjct: 377 LKEAVLGKASLKNSALAGADLRKAKLKG--AVLEGADLAGARLRHASLVKAHLKGADLHR 434

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFT 164
            ++ E+DFS +   GA L  A  ++AN T
Sbjct: 435 TELDEADFSNADLQGANLTGAKLWEANLT 463


>gi|317970566|ref|ZP_07971956.1| pentapeptide repeat-containing protein [Synechococcus sp. CB0205]
          Length = 175

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKA 161
           +G AA F  ADL  A+  +  F  ANF  AD+ +     +D SG+    A L   +A  +
Sbjct: 70  VGKAANFSGADLHGAILTQGAFPDANFNGADLSDVLLDRTDMSGTDLRNAVLVGVIASGS 129

Query: 162 NFT 164
            FT
Sbjct: 130 TFT 132


>gi|159029340|emb|CAO90206.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 405

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 7/65 (10%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           A+ RG F   S A    ADLR+A         AN + AD+ E++ SG+   GA L  A+ 
Sbjct: 245 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 297

Query: 159 YKANF 163
           + AN 
Sbjct: 298 WGANL 302


>gi|443668754|ref|ZP_21134246.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|443330716|gb|ELS45411.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 403

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 33/65 (50%), Gaps = 7/65 (10%)

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVA 158
           A+ RG F   S A    ADLR+A         AN + AD+ E++ SG+   GA L  A+ 
Sbjct: 243 ADLRGAFL--SEANLKGADLRRAF-----LSEANLSGADLSEANLSGADLRGAILSGAIL 295

Query: 159 YKANF 163
           + AN 
Sbjct: 296 WGANL 300


>gi|209967175|ref|YP_002300090.1| pentapeptide repeat-containing protein [Rhodospirillum centenum SW]
 gi|209960641|gb|ACJ01278.1| pentapeptide repeat family protein [Rhodospirillum centenum SW]
          Length = 429

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 18/50 (36%), Positives = 27/50 (54%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           I + A F    +  AV ++ +F  AN    D+R++D  G+ F GA LE A
Sbjct: 152 IAAKADFSEVRMNGAVVLRADFTDANLARVDLRDADLRGANFRGANLEGA 201


>gi|254411535|ref|ZP_05025312.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196182036|gb|EDX77023.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 125

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 28/54 (51%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            A    ADLR+A     N   AN   AD+RE++ +G+   GA++  A   +AN 
Sbjct: 22  GAHLIGADLREANLQGANLSHANLEGADLREANLAGANLTGAFVTNADMKEANL 75


>gi|428320418|ref|YP_007118300.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428244098|gb|AFZ09884.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 479

 Score = 38.1 bits (87), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 30/55 (54%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADL ++     N  RA+ T A +RE++  G +F GA L++A   KAN 
Sbjct: 60  SGANLSGADLAESFLNLANLTRADLTGAVLREANLVGVEFTGANLKQASLIKANL 114



 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 27/49 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           + A    A L KA  V  N   AN T A++  +D  GS+ +GA L+KAV
Sbjct: 100 TGANLKQASLIKANLVGANLHEANLTRANLSGADLRGSQLSGAILDKAV 148


>gi|428217541|ref|YP_007102006.1| pentapeptide repeat-containing protein [Pseudanabaena sp. PCC 7367]
 gi|427989323|gb|AFY69578.1| pentapeptide repeat protein [Pseudanabaena sp. PCC 7367]
          Length = 353

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A FGSA+L  A   + N  +AN   AD+ ++D  G+K  G  L +A   +AN 
Sbjct: 54  ANFGSANLLGANLSEANLTKANLREADLYKADLGGAKLIGTSLIRAYLREANL 106


>gi|193213578|ref|YP_001999531.1| pentapeptide repeat-containing protein [Chlorobaculum parvum NCIB
           8327]
 gi|193087055|gb|ACF12331.1| pentapeptide repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 439

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 30/54 (55%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           + F SADL KA     N    NF+ ADM +++  G+   GA L++A   +A+ +
Sbjct: 301 SDFESADLDKANLAGANLAGGNFSRADMEKANLKGANLEGAVLDRAFMKQADLS 354


>gi|254430459|ref|ZP_05044162.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
 gi|197624912|gb|EDY37471.1| pentapeptide repeat family protein [Cyanobium sp. PCC 7001]
          Length = 180

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 5/62 (8%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKAN 162
           G  A F  A+L  A+  +  F  A+F  AD     M + DFSG+ F GA L   +A  +N
Sbjct: 76  GRHADFSGANLHGAILTQAAFPEASFAGADLSGVLMDKVDFSGADFTGADLSDVIASGSN 135

Query: 163 FT 164
           F+
Sbjct: 136 FS 137


>gi|392382619|ref|YP_005031816.1| conserved protein of unknown function; Pentapeptide repeat
           [Azospirillum brasilense Sp245]
 gi|356877584|emb|CCC98426.1| conserved protein of unknown function; Pentapeptide repeat
           [Azospirillum brasilense Sp245]
          Length = 439

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 26/47 (55%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           AA    ADLR+A+       +AN T AD+  +D  G+   GA L++A
Sbjct: 383 AANLMGADLRQAMLTDSRMVQANLTDADLESADLDGADLAGAKLQRA 429


>gi|417147800|ref|ZP_11988300.1| pentapeptide repeat protein [Escherichia coli 1.2264]
 gi|432414449|ref|ZP_19657095.1| hypothetical protein WG9_04959 [Escherichia coli KTE39]
 gi|432449036|ref|ZP_19691321.1| hypothetical protein A13S_05120 [Escherichia coli KTE191]
 gi|432639030|ref|ZP_19874892.1| hypothetical protein A1UY_04405 [Escherichia coli KTE81]
 gi|433026911|ref|ZP_20214794.1| hypothetical protein WI9_05012 [Escherichia coli KTE106]
 gi|433186914|ref|ZP_20371055.1| hypothetical protein WGO_05288 [Escherichia coli KTE85]
 gi|215272912|emb|CAT00693.1| protein mcbG [Escherichia coli]
 gi|386162365|gb|EIH24165.1| pentapeptide repeat protein [Escherichia coli 1.2264]
 gi|430931206|gb|ELC51659.1| hypothetical protein WG9_04959 [Escherichia coli KTE39]
 gi|430969334|gb|ELC86475.1| hypothetical protein A13S_05120 [Escherichia coli KTE191]
 gi|431167788|gb|ELE68043.1| hypothetical protein A1UY_04405 [Escherichia coli KTE81]
 gi|431524910|gb|ELI01731.1| hypothetical protein WI9_05012 [Escherichia coli KTE106]
 gi|431695578|gb|ELJ60883.1| hypothetical protein WGO_05288 [Escherichia coli KTE85]
          Length = 187

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 30/57 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDE 167
             F S  L+K++ +   FR   F   D+R+SDF+GS+FN      +     +F++ E
Sbjct: 97  VDFISLRLQKSIFLSSRFRDCLFEETDLRKSDFTGSEFNNTEFRHSDLSHCDFSMTE 153


>gi|300867252|ref|ZP_07111912.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300334729|emb|CBN57078.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 508

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 12/98 (12%)

Query: 73  STALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRAN 132
           S+ L  A++   + N++ L   +  EA+  G       A     +L +A   K NF +AN
Sbjct: 75  SSHLVRAILQGATLNVANLVRADLSEAQLMG-------AALIRGELIRAELSKANFSKAN 127

Query: 133 FTSADMRES-----DFSGSKFNGAYLEKAVAYKANFTV 165
            T AD+RE+     +FS +  +GA L  A    ANF +
Sbjct: 128 LTGADLREAKLTEVNFSEANLSGANLRGASGTAANFEL 165



 Score = 36.2 bits (82), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 29/55 (52%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + A F   DLR+A   + N   AN + A++R +D SG+   GA L +A    AN 
Sbjct: 179 NGADFSGTDLRQANLCQVNLSGANLSGANLRWADLSGANLRGADLNEAKLSGANL 233


>gi|288957041|ref|YP_003447382.1| hypothetical protein AZL_002000 [Azospirillum sp. B510]
 gi|288909349|dbj|BAI70838.1| hypothetical protein AZL_002000 [Azospirillum sp. B510]
          Length = 424

 Score = 38.1 bits (87), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 20/47 (42%), Positives = 27/47 (57%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           AA F +  L  A   + + R ANF+ AD+R +D +GS   GA LE A
Sbjct: 166 AADFTNTRLAGARLDRTDLRDANFSGADLRGADLNGSDLRGAILEGA 212


>gi|158335471|ref|YP_001516643.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158305712|gb|ABW27329.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 502

 Score = 38.1 bits (87), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 13/97 (13%)

Query: 91  LADLNKYEAE-TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           L D+N   A  +R    +   S A    +DL  A   + N  R NF+ AD+ ++D S ++
Sbjct: 33  LKDINLINANLSRANLSLANLSGAFLAGSDLSDAFLSEANLSRVNFSRADLTKADLSFAR 92

Query: 148 FNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFP 184
             GA L +A  Y+A          +L+   M   +FP
Sbjct: 93  LQGATLIEATLYQA----------ILIEACMVQVIFP 119


>gi|149922858|ref|ZP_01911281.1| serine/threonine kinase [Plesiocystis pacifica SIR-1]
 gi|149816325|gb|EDM75829.1| serine/threonine kinase [Plesiocystis pacifica SIR-1]
          Length = 655

 Score = 38.1 bits (87), Expect = 1.9,   Method: Composition-based stats.
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A+ G   L KA  ++ +  RA+   AD+R + F+ +  +GA L +A+ + A+F
Sbjct: 556 SGARLGGLRLDKAEFIQASMARAHLRGADLRRARFNHADLSGADLREAIVWNADF 610


>gi|427734924|ref|YP_007054468.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369965|gb|AFY53921.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 213

 Score = 38.1 bits (87), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 29/56 (51%)

Query: 108 GSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           G   Q   A+L      + +  RAN   A++  ++F+GSKF GA+LE A    AN 
Sbjct: 9   GELKQLAGANLEDENLSQTDLSRANLAGANLVGTNFAGSKFEGAHLEGANLMGANL 64


>gi|23014351|ref|ZP_00054172.1| COG1357: Uncharacterized low-complexity proteins [Magnetospirillum
           magnetotacticum MS-1]
          Length = 164

 Score = 38.1 bits (87), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 34/63 (53%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           R + G+ S A F  ADL  A  V+ + RRA F  A +R +D +G+K  GA L  A    A
Sbjct: 87  RLDDGLFSDADFTKADLGGASLVRADLRRARFFHASLRGADLTGAKTLGAELLNADLSGA 146

Query: 162 NFT 164
            +T
Sbjct: 147 RWT 149


>gi|16331083|ref|NP_441811.1| hypothetical protein sll0274 [Synechocystis sp. PCC 6803]
 gi|383322826|ref|YP_005383679.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|383325995|ref|YP_005386848.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|383491879|ref|YP_005409555.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|384437147|ref|YP_005651871.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
 gi|451815240|ref|YP_007451692.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
 gi|1653576|dbj|BAA18489.1| sll0274 [Synechocystis sp. PCC 6803]
 gi|339274179|dbj|BAK50666.1| hypothetical protein SYNGTS_1918 [Synechocystis sp. PCC 6803]
 gi|359272145|dbj|BAL29664.1| hypothetical protein SYNGTI_1917 [Synechocystis sp. PCC 6803
           substr. GT-I]
 gi|359275315|dbj|BAL32833.1| hypothetical protein SYNPCCN_1916 [Synechocystis sp. PCC 6803
           substr. PCC-N]
 gi|359278485|dbj|BAL36002.1| hypothetical protein SYNPCCP_1916 [Synechocystis sp. PCC 6803
           substr. PCC-P]
 gi|451781209|gb|AGF52178.1| hypothetical protein MYO_119360 [Synechocystis sp. PCC 6803]
          Length = 196

 Score = 38.1 bits (87), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 6/99 (6%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           L  W+  V T +   +VA+    + +LA  +      RG       A F   DLR ++  
Sbjct: 34  LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 87

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             N R A+FT A+++ + F  +  +GA LE A A   +F
Sbjct: 88  HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDF 126


>gi|218440259|ref|YP_002378588.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218172987|gb|ACK71720.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 340

 Score = 38.1 bits (87), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 37/81 (45%), Gaps = 10/81 (12%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA----------YLEKAVAYKANFTV 165
           A+LR+A+    N R  N  SAD+ E+DF G+  +GA           L  A   +AN   
Sbjct: 246 ANLRQAILTYANLRGCNLLSADLAEADFEGANLSGAGLLLTYMRATNLRHANLDQANLIG 305

Query: 166 DEICLPLLVSLPMATPVFPAG 186
             +    L++  +A  + P G
Sbjct: 306 ASLVQTNLMAASLAQTILPNG 326


>gi|376003692|ref|ZP_09781500.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375327990|emb|CCE17253.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 740

 Score = 38.1 bits (87), Expect = 2.1,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 27/54 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A     +LR A     N   A+   AD+R +D  G+ F GA L +A  Y+AN T
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANIT 633


>gi|209526910|ref|ZP_03275429.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423063829|ref|ZP_17052619.1| pentapeptide repeat protein [Arthrospira platensis C1]
 gi|209492689|gb|EDZ93025.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406714678|gb|EKD09839.1| pentapeptide repeat protein [Arthrospira platensis C1]
          Length = 740

 Score = 38.1 bits (87), Expect = 2.1,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 27/54 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A     +LR A     N   A+   AD+R +D  G+ F GA L +A  Y+AN T
Sbjct: 580 ANLRGVNLRNANLRGGNLEGAHLEGADLRGADLQGANFKGANLHRANFYQANIT 633


>gi|407961546|dbj|BAM54786.1| hypothetical protein BEST7613_5855 [Synechocystis sp. PCC 6803]
          Length = 194

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 6/99 (6%)

Query: 65  LKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHV 124
           L  W+  V T +   +VA+    + +LA  +      RG       A F   DLR ++  
Sbjct: 32  LGRWQFVVRTGI---LVATFILALGSLASPSLALDYNRGNL---VGADFSHQDLRGSIFD 85

Query: 125 KENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
             N R A+FT A+++ + F  +  +GA LE A A   +F
Sbjct: 86  HANLRGADFTGANLQGARFFSANMDGAILEGADARGVDF 124


>gi|254411218|ref|ZP_05024995.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
 gi|196181719|gb|EDX76706.1| Pentapeptide repeat protein [Coleofasciculus chthonoplastes PCC
           7420]
          Length = 293

 Score = 38.1 bits (87), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 10/70 (14%)

Query: 110 AAQFGSADLRKAVHVKENFRRAN----------FTSADMRESDFSGSKFNGAYLEKAVAY 159
           +A    A+L  A+ ++ N ++AN          FT AD+ E D S ++ NG  L +A+  
Sbjct: 163 SANLEKANLTNAILLETNLKQANLNKALLHGANFTQADLTEVDLSQARLNGVNLTRAILV 222

Query: 160 KANFTVDEIC 169
            A      IC
Sbjct: 223 GAKLRGVSIC 232


>gi|448684742|ref|ZP_21692829.1| pentapeptide repeat-containing protein [Haloarcula japonica DSM
           6131]
 gi|445782673|gb|EMA33514.1| pentapeptide repeat-containing protein [Haloarcula japonica DSM
           6131]
          Length = 710

 Score = 37.7 bits (86), Expect = 2.2,   Method: Composition-based stats.
 Identities = 19/47 (40%), Positives = 26/47 (55%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           AQFG+ D   A   + +FR A F +A    + FSG  FNG   ++AV
Sbjct: 230 AQFGTGDFYHATFDEADFRWAEFGTARFYGATFSGGYFNGTSYDEAV 276


>gi|258612055|ref|ZP_05243959.2| phage protein [Listeria monocytogenes FSL R2-503]
 gi|258608006|gb|EEW20614.1| phage protein [Listeria monocytogenes FSL R2-503]
          Length = 187

 Score = 37.7 bits (86), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 31/61 (50%)

Query: 96  KYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEK 155
           K+  +  GE      A    ADLR A     N RRA+ + AD+  +D +G+  NGA L +
Sbjct: 15  KWLRDGYGERANLRGANLRGADLRGADLSYANLRRADLSRADLNGADLNGADLNGADLSR 74

Query: 156 A 156
           A
Sbjct: 75  A 75


>gi|409994208|ref|ZP_11277326.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409934956|gb|EKN76502.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 517

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ L   +  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RES    + FNGA L  A
Sbjct: 139 ADLRESKLQQTNFNGANLSGA 159


>gi|186683437|ref|YP_001866633.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186465889|gb|ACC81690.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 176

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 30/119 (25%), Positives = 54/119 (45%), Gaps = 4/119 (3%)

Query: 41  SSKTESDGQFPDCSNNQCAGPY--AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYE 98
           SS  E+D    D +N   +G      + +  +    A+  A ++     ++ L + N  E
Sbjct: 55  SSLIEADLNGADLTNANLSGSNLSGAILDGAILDGAAMEGANLSQADLTVAKLIETNLSE 114

Query: 99  AETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           A+ +    I  AA    ADL  A     +  +AN T AD+ +++ SG+  +GA +E  +
Sbjct: 115 ADLQEASLI--AANLDGADLSGADLTVADLSQANLTQADLNQTNLSGANLDGANIEGTI 171


>gi|428213860|ref|YP_007087004.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428002241|gb|AFY83084.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 331

 Score = 37.7 bits (86), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 47/122 (38%), Gaps = 14/122 (11%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR----- 130
           L  A +   + N++ L D N  +A+ RG              LR A     NFR      
Sbjct: 92  LQGADLRKANLNLANLLDANLSDADLRG-------TTLSGVCLRGACLRGANFREERRIY 144

Query: 131 --ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAGFC 188
             AN   AD+R +D  G   +GA L KA    AN T   +    L    MA  +   GF 
Sbjct: 145 SAANLRGADLRGADLRGVNLSGADLTKADLSGANLTETNLRGANLERAKMALAIVNGGFL 204

Query: 189 AP 190
           + 
Sbjct: 205 SD 206


>gi|381207646|ref|ZP_09914717.1| hypothetical protein SclubJA_18738 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 219

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 27/55 (49%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            +A    ADL +A   K + R AN   AD+RES+  G    GA L+ A    AN 
Sbjct: 126 QSADLSEADLYRADLEKSDLRDANLYKADLRESNLQGVNLQGANLQGADLEGANL 180


>gi|291570912|dbj|BAI93184.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
          Length = 517

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 7/81 (8%)

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A++   + N++ L   +  EA+      I        A+L +A   K NF +AN   
Sbjct: 86  LTKAILNQATINVANLVRADLTEAQLINTLLI-------RAELVRAKLSKANFTQANLNG 138

Query: 136 ADMRESDFSGSKFNGAYLEKA 156
           AD+RES    + FNGA L  A
Sbjct: 139 ADLRESKLQQTNFNGANLSGA 159


>gi|430900982|ref|ZP_19484783.1| LPXTG-domain-containing protein cell wall anchor domain
           [Enterococcus faecium E1575]
 gi|430554860|gb|ELA94429.1| LPXTG-domain-containing protein cell wall anchor domain
           [Enterococcus faecium E1575]
          Length = 1074

 Score = 37.7 bits (86), Expect = 2.4,   Method: Composition-based stats.
 Identities = 20/51 (39%), Positives = 31/51 (60%), Gaps = 1/51 (1%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQF 113
           +KLKNWRV V   +   ++AS  S+I   AD+N  E +   E+G+G+  +F
Sbjct: 4   SKLKNWRVAVVVVMIIQLLASFVSSIIVHADINHPE-QVSIEYGVGTGYRF 53


>gi|428774386|ref|YP_007166174.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanobacterium stanieri PCC 7202]
 gi|428688665|gb|AFZ48525.1| serine/threonine protein kinase with pentapeptide repeats
           [Cyanobacterium stanieri PCC 7202]
          Length = 506

 Score = 37.7 bits (86), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A F  AD  +A  V+ N  +A+   A+++ +DF  +   GA LE A  YKAN 
Sbjct: 420 ANFYHADFSRARLVRANLTKAHLFKAELQYADFRNANLTGANLEGANLYKANL 472


>gi|284929723|ref|YP_003422245.1| hypothetical protein UCYN_11960 [cyanobacterium UCYN-A]
 gi|284810167|gb|ADB95864.1| uncharacterized low-complexity protein [cyanobacterium UCYN-A]
          Length = 243

 Score = 37.7 bits (86), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 9/63 (14%)

Query: 94  LNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYL 153
           LNKY+   R          F S  LR+    + N  + NF SAD+R+S    S FNGA L
Sbjct: 7   LNKYDLGER---------NFQSICLREVDLTEVNLPKINFESADIRQSRLGKSNFNGAIL 57

Query: 154 EKA 156
           ++A
Sbjct: 58  KQA 60


>gi|385679319|ref|ZP_10053247.1| pentapeptide repeat-containing protein [Amycolatopsis sp. ATCC
           39116]
          Length = 194

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 53/119 (44%), Gaps = 18/119 (15%)

Query: 53  CSNNQCAGPYAKLKNWR---------VFVSTALAAAVVASCSSNISALADLNKYEAETRG 103
           C+ ++C    A L   R         VF  T LA +  ++CS   S+  D        R 
Sbjct: 40  CTFDECDFSGADLGESRHQASAFRSCVFDRTVLADSTWSACSLLGSSFVDGGLRGMSVRD 99

Query: 104 -EFGIGSAAQFGSADLRKAVHVKENFRRANF-----TSADMRESDFSGSKFNGAYLEKA 156
            +F   S A F  A+LR+       FR A+F     T AD+R+SDF G++  GA L  A
Sbjct: 100 SDF---SLANFSRANLRRRSLSGLRFREASFVDANLTEADLRDSDFRGARLGGADLTGA 155


>gi|288960397|ref|YP_003450737.1| pentapeptide repeat protein [Azospirillum sp. B510]
 gi|288912705|dbj|BAI74193.1| pentapeptide repeat protein [Azospirillum sp. B510]
          Length = 431

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 29/57 (50%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
            F  A +R    V+   R ANFT ++M  +D SG+   GA L  AV   A  T+ ++
Sbjct: 167 DFSDAVMRGCKLVRATMRGANFTGSNMEGADLSGADLRGACLRGAVLTGATMTMTDL 223


>gi|172037842|ref|YP_001804343.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354556328|ref|ZP_08975624.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171699296|gb|ACB52277.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353551765|gb|EHC21165.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 319

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 28/53 (52%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           Q   ADLR       +FR  + + A++RE DF+G+    AYL +A     NFT
Sbjct: 25  QLRRADLRGLNLSHTDFRGVDLSYANLREVDFTGADLRDAYLNEADLTAVNFT 77


>gi|389874428|ref|YP_006373784.1| pentapeptide repeat-containing protein [Tistrella mobilis
           KA081020-065]
 gi|388531608|gb|AFK56802.1| pentapeptide repeat-containing protein [Tistrella mobilis
           KA081020-065]
          Length = 178

 Score = 37.7 bits (86), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 31/63 (49%), Gaps = 5/63 (7%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYK 160
           G   AA F  ADL+ A   +    RA+FT AD+R      +D  G+ F GA L  A  Y 
Sbjct: 95  GKAEAAIFAEADLQSADFTRSKAARADFTGADLRRARFYRADLRGADFTGANLTGADLYD 154

Query: 161 ANF 163
           A+ 
Sbjct: 155 ADL 157


>gi|428299369|ref|YP_007137675.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428235913|gb|AFZ01703.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 255

 Score = 37.7 bits (86), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 40/86 (46%), Gaps = 10/86 (11%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN-----FTV 165
           A    ADL +A     N R AN  +A + E++   S   GA L+KA    AN     +T 
Sbjct: 160 ANLAEADLFRANLRSANLRGANLQNAGLVEANLQSSNLAGAKLQKATLNGANLKDAKYTS 219

Query: 166 D----EICLPLLVSLPMATPVFPAGF 187
           +    E+C  L VS P  T VF  GF
Sbjct: 220 ENASPELCKSLSVSYPCPT-VFLEGF 244


>gi|385871982|gb|AFI90502.1| Pentapeptide repeat protein [Pectobacterium sp. SCC3193]
          Length = 273

 Score = 37.7 bits (86), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 8/103 (7%)

Query: 69  RVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSA---AQFGSADLRKAVHVK 125
           R   +T L +AV +  S N +        ++  R    IG+    A+  ++DL +A   +
Sbjct: 131 RFTGATWLTSAVASGSSMNSADFTQATLRQSNLRQASLIGAVFALAKLENSDLSEADCQQ 190

Query: 126 ENFRRAN-----FTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            NF+RAN     F   D RE++F+ +   GA L+K+    ANF
Sbjct: 191 TNFQRANLAGSLFVRTDFREANFTDANLIGALLQKSQLGGANF 233


>gi|334118008|ref|ZP_08492098.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
 gi|333459993|gb|EGK88603.1| pentapeptide repeat protein [Microcoleus vaginatus FGP-2]
          Length = 171

 Score = 37.7 bits (86), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 29/52 (55%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           F  A+LR +     + R  +F +A+M E++  G+ F GA L+ A   KAN T
Sbjct: 60  FNKANLRNSNFTNADLRGVSFFAANMEEANLEGANFTGATLDLARMMKANLT 111


>gi|336250332|ref|YP_004594042.1| hypothetical protein EAE_19280 [Enterobacter aerogenes KCTC 2190]
 gi|334736388|gb|AEG98763.1| hypothetical protein EAE_19280 [Enterobacter aerogenes KCTC 2190]
          Length = 846

 Score = 37.7 bits (86), Expect = 2.9,   Method: Composition-based stats.
 Identities = 18/53 (33%), Positives = 27/53 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +    AD R A  ++ N   + F   D RE+DF+ +   GA L+K+    ANF
Sbjct: 754 SDLSEADCRDASFIRANLVGSLFVRTDFREADFTDANLMGALLQKSQLAGANF 806


>gi|444351422|ref|YP_007387566.1| pentapeptide repeat [Enterobacter aerogenes EA1509E]
 gi|443902252|emb|CCG30026.1| pentapeptide repeat [Enterobacter aerogenes EA1509E]
          Length = 846

 Score = 37.4 bits (85), Expect = 2.9,   Method: Composition-based stats.
 Identities = 18/53 (33%), Positives = 27/53 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +    AD R A  ++ N   + F   D RE+DF+ +   GA L+K+    ANF
Sbjct: 754 SDLSEADCRDASFIRANLVGSLFVRTDFREADFTDANLMGALLQKSQLAGANF 806


>gi|300865105|ref|ZP_07109930.1| serine/threonine protein kinase [Oscillatoria sp. PCC 6506]
 gi|300336876|emb|CBN55080.1| serine/threonine protein kinase [Oscillatoria sp. PCC 6506]
          Length = 540

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 37/79 (46%), Gaps = 9/79 (11%)

Query: 91  LADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVKENFRRANFT-----SADMRESDFS 144
           LA  N YEA  TR        A    A+L  A  V+ N R AN T     +A+++ +D  
Sbjct: 430 LAGANFYEARLTRANL---QGADLSEANLGHARLVEANLRDANLTQAYCSTANLQSADLR 486

Query: 145 GSKFNGAYLEKAVAYKANF 163
           G+   GAYL KA    AN 
Sbjct: 487 GANLAGAYLSKANLRGANL 505


>gi|383763560|ref|YP_005442542.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381383828|dbj|BAM00645.1| hypothetical protein CLDAP_26050 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 189

 Score = 37.4 bits (85), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A    A+L++A     N  RAN + AD+  +D SG+   GA L  A   +AN T
Sbjct: 40  ADLSFANLQRANLAGANLERANLSGADLEGADLSGANLVGANLTGARLMRANLT 93


>gi|409991580|ref|ZP_11274829.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|291567915|dbj|BAI90187.1| pentapeptide repeat-containing protein [Arthrospira platensis
           NIES-39]
 gi|409937560|gb|EKN78975.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 390

 Score = 37.4 bits (85), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 33/54 (61%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           +A    ADL +A+ +K NF +A+ +SA++ +S+   + F  AYL KA   +A+ 
Sbjct: 111 SAHLNWADLTEAIFIKTNFHKADLSSANLTKSNLQSANFVRAYLIKANLSEADL 164


>gi|428317848|ref|YP_007115730.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428241528|gb|AFZ07314.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 171

 Score = 37.4 bits (85), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 29/52 (55%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           F  A+LR +     + R  +F +A+M E++F G+   GA L+ A   KAN T
Sbjct: 60  FNKANLRNSNFTNADLRGVSFFAANMEEANFEGANLTGATLDLARMMKANLT 111


>gi|428201834|ref|YP_007080423.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
 gi|427979266|gb|AFY76866.1| putative low-complexity protein [Pleurocapsa sp. PCC 7327]
          Length = 143

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           SAA    ADLR+A     N   AN T A++  +D +G+   GA L KA    A F
Sbjct: 49  SAAHLIGADLREA-----NLSGANLTEANLEGADLTGANLQGANLTKAFVTNATF 98


>gi|425439840|ref|ZP_18820154.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9717]
 gi|389719844|emb|CCH96379.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9717]
          Length = 225

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF    +
Sbjct: 113 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFADCRL 172

Query: 169 C 169
           C
Sbjct: 173 C 173


>gi|268325885|emb|CBH39473.1| conserved hypothetical protein [uncultured archaeon]
          Length = 358

 Score = 37.4 bits (85), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 33/77 (42%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           + A      L KA   K + R A    AD+R +D S +K NGA L  A  Y A+ +  ++
Sbjct: 258 TGASLNGGKLYKAKLRKADLRGAKMYKADLRWADLSSTKLNGADLTDADLYGADLSGAKL 317

Query: 169 CLPLLVSLPMATPVFPA 185
           C   L    +    F  
Sbjct: 318 CEADLRKTDLRGTTFDG 334


>gi|448736468|ref|ZP_21718581.1| Ion transport 2 domain-containing protein [Halococcus thailandensis
           JCM 13552]
 gi|445806103|gb|EMA56272.1| Ion transport 2 domain-containing protein [Halococcus thailandensis
           JCM 13552]
          Length = 345

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 27/55 (49%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
            A    ADLR+A   + + RRA F  AD+  + F  +    A L +A  Y+  FT
Sbjct: 90  GADLSGADLRRATFDRVDARRARFDGADVEGATFENADLRDASLNRAKLYRTGFT 144


>gi|297170923|gb|ADI21940.1| uncharacterized low-complexity proteins [uncultured nuHF2 cluster
           bacterium HF0130_29D04]
          Length = 695

 Score = 37.4 bits (85), Expect = 3.4,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
            A F  ADLR+A  V  + R ANF  A+++ +    +   GA LE+A  Y A+ 
Sbjct: 139 GANFRGADLREAKLVGADLREANFRGANLQTAYLIKADLKGANLEEASLYGADL 192


>gi|440681678|ref|YP_007156473.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
 gi|428678797|gb|AFZ57563.1| pentapeptide repeat protein [Anabaena cylindrica PCC 7122]
          Length = 402

 Score = 37.4 bits (85), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 35/117 (29%), Positives = 52/117 (44%), Gaps = 26/117 (22%)

Query: 71  FVSTALAAAVV--ASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENF 128
           F    L  A++  A+    I A ADL K +A   G       A F  A L +A+ +  NF
Sbjct: 263 FTRAILTEAILIGANFEEAILAGADLTKAKANFTG-------ANFTGAILTEAILIGANF 315

Query: 129 RRA---------------NFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
            +A               N T AD+ E+D +G+    AYL KA+  +A   ++E+ L
Sbjct: 316 EKAYLIRADLTGANLTGTNLTRADLTEADLTGANLTRAYLIKAILEEA--ILEEVIL 370



 Score = 35.8 bits (81), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMR--ESDFSGSKFNGAYLEKAVAYKANF 163
           A F  A L +A+ +  NF  A    AD+   +++F+G+ F GA L +A+   ANF
Sbjct: 261 ANFTRAILTEAILIGANFEEAILAGADLTKAKANFTGANFTGAILTEAILIGANF 315


>gi|254421888|ref|ZP_05035606.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
 gi|196189377|gb|EDX84341.1| Pentapeptide repeat protein [Synechococcus sp. PCC 7335]
          Length = 194

 Score = 37.4 bits (85), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 59/149 (39%), Gaps = 12/149 (8%)

Query: 21  SKGPYQLHALSKPLWVACQIS-SKTESDGQFPDCSNNQCAGPYAKLK--NWRV--FVSTA 75
           S G   L   + P W   Q       +D + P C  N+     A+L   N +V       
Sbjct: 18  SIGLIGLLGFAAPSWAYLQEDVDMLMNDNECPVCILNEADLVGAQLNHANLKVASLTGAN 77

Query: 76  LAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTS 135
           L  A ++  +  +S L   N   A   G       AQ   A L+ AV    +   AN T 
Sbjct: 78  LTGADLSETNLMLSELIGTNLTNASLAG-------AQMNGAQLKDAVLKGADLSGANLTQ 130

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A++ +++F G+K     +  AV   ANFT
Sbjct: 131 ANLEDANFVGAKLINTEMTAAVVGVANFT 159


>gi|390440421|ref|ZP_10228750.1| membrane hypothetical protein [Microcystis sp. T1-4]
 gi|389836163|emb|CCI32876.1| membrane hypothetical protein [Microcystis sp. T1-4]
          Length = 904

 Score = 37.4 bits (85), Expect = 3.5,   Method: Composition-based stats.
 Identities = 20/67 (29%), Positives = 36/67 (53%), Gaps = 7/67 (10%)

Query: 91  LADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNG 150
           L D+N  E++  G       A   +A+L+KA   +    R +FT+AD+ ++D +G+   G
Sbjct: 534 LKDINFTESDLSG-------ALLRNANLKKANLTRTILNRVDFTNADLSDADLTGASVKG 586

Query: 151 AYLEKAV 157
           A  + A+
Sbjct: 587 AKFDNAI 593


>gi|428317459|ref|YP_007115341.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
 gi|428241139|gb|AFZ06925.1| pentapeptide repeat protein [Oscillatoria nigro-viridis PCC 7112]
          Length = 197

 Score = 37.4 bits (85), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 8/75 (10%)

Query: 89  SALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF 148
           S LAD N  +A   G       A    A+L++AV ++ N R A+ + AD+R +DF  +  
Sbjct: 29  SDLADANLSQANLSG-------ANLVGANLQRAV-LRANLRGADLSGADLRGADFRNADL 80

Query: 149 NGAYLEKAVAYKANF 163
            GA    A+   A+F
Sbjct: 81  RGASFANALVRDASF 95


>gi|307154970|ref|YP_003890354.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306985198|gb|ADN17079.1| pentapeptide repeat protein [Cyanothece sp. PCC 7822]
          Length = 231

 Score = 37.4 bits (85), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 18/47 (38%), Positives = 28/47 (59%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           DLR+A   + N   AN   AD+ +++ SG+  + A+LEKA+   AN 
Sbjct: 55  DLREANLTQANLNWANLHKADLTQANLSGANLSQAFLEKAILIAANL 101


>gi|359457318|ref|ZP_09245881.1| pentapeptide repeat-containing protein [Acaryochloris sp. CCMEE
           5410]
          Length = 510

 Score = 37.4 bits (85), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 28/97 (28%), Positives = 44/97 (45%), Gaps = 13/97 (13%)

Query: 91  LADLNKYEAE-TRGEFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK 147
           L D+N   A  +R    +   S A    +DL  A   + N  R NF+ AD+ ++D S ++
Sbjct: 41  LQDINLINANLSRANLSLANLSGAFLAGSDLSNAFLSEANLSRVNFSRADLTKADLSFAR 100

Query: 148 FNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFP 184
             GA L +A  Y+A          +L+   M   +FP
Sbjct: 101 LQGATLIEANLYQA----------ILIEACMVQVIFP 127


>gi|402773132|ref|YP_006592669.1| pentapeptide repeat protein [Methylocystis sp. SC2]
 gi|401775152|emb|CCJ08018.1| Pentapeptide repeat protein [Methylocystis sp. SC2]
          Length = 261

 Score = 37.4 bits (85), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 28/53 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A F S  L  A   K +    NFT AD++ +DFSG++ N A L  A+   A F
Sbjct: 115 ADFFSTKLAGAKLAKADLSATNFTRADLQNADFSGARMNAATLYAALLDGATF 167


>gi|381204293|ref|ZP_09911364.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 156

 Score = 37.4 bits (85), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 47/102 (46%), Gaps = 9/102 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADL-NKYEAETRG----EFGIGSAAQFGS----ADLRKA 121
            V+T L A   A    ++  L D  N  + + RG    EF +     + S    ADLRKA
Sbjct: 20  IVATLLTADASAYKQEDLDKLQDTYNCVKCDLRGAILREFNLTGTNLYKSDLRKADLRKA 79

Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
                N    N T A +RE++ +G+  +GA+L +A    AN 
Sbjct: 80  DLRDTNLGDTNLTGAVLREANLTGANMSGAHLWEANLTGANL 121


>gi|428318454|ref|YP_007116336.1| serine/threonine protein kinase with pentapeptide repeats
           [Oscillatoria nigro-viridis PCC 7112]
 gi|428242134|gb|AFZ07920.1| serine/threonine protein kinase with pentapeptide repeats
           [Oscillatoria nigro-viridis PCC 7112]
          Length = 543

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 29/60 (48%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
           AA    A+L  A  ++ N R AN T A    ++F G+ F GA L  A   KAN     +C
Sbjct: 450 AADLSGANLGHARLIQANLRDANLTEAYCSTANFEGADFRGADLTGAYLTKANLRGANLC 509


>gi|428224166|ref|YP_007108263.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984067|gb|AFY65211.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 583

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 17/46 (36%), Positives = 25/46 (54%)

Query: 128 FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLL 173
             RAN T A +  ++   +  N A L++AV   A+ T  E+CL LL
Sbjct: 52  LNRANLTEASLHHANLRNASLNSALLDRAVLSGADLTKAELCLALL 97


>gi|217977179|ref|YP_002361326.1| pentapeptide repeat-containing protein [Methylocella silvestris
           BL2]
 gi|217502555|gb|ACK49964.1| pentapeptide repeat protein [Methylocella silvestris BL2]
          Length = 260

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 132 NFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           NF +A M  ++FSG K +GA L++A A KANF
Sbjct: 79  NFRAARMNNTNFSGGKLDGAVLDQAWALKANF 110


>gi|158340181|ref|YP_001521351.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158310422|gb|ABW32037.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 535

 Score = 37.0 bits (84), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 12/106 (11%)

Query: 71  FVSTALAAAVVASCSSNIS-----ALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVK 125
            V+  L+ A++ S   N S      L + N   A+    + I ++     ADLR A  +K
Sbjct: 387 LVNADLSKAILKSAELNKSYLTFAKLQEANLTNAQLTEAYLISTS--LREADLRSANLLK 444

Query: 126 ENFRRANFTSADMR-----ESDFSGSKFNGAYLEKAVAYKANFTVD 166
            + R A+  ++D+R     E+  SG++  GA L+ A   KAN + D
Sbjct: 445 ADLRWADLINSDLRGANLRETKLSGARLYGANLKDADLSKANLSAD 490


>gi|16126499|ref|NP_421063.1| pentapeptide repeat-containing protein [Caulobacter crescentus
           CB15]
 gi|221235279|ref|YP_002517716.1| hypothetical protein CCNA_02343 [Caulobacter crescentus NA1000]
 gi|13423771|gb|AAK24231.1| pentapeptide repeat family protein [Caulobacter crescentus CB15]
 gi|220964452|gb|ACL95808.1| hypothetical protein with pentapeptide repeats [Caulobacter
           crescentus NA1000]
          Length = 419

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 29/51 (56%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           + + A F  A L+ A  V+ N ++ANF  A++  +D SG+   GA L  AV
Sbjct: 166 VATKADFSDAILKDAKLVRANLKQANFNGANLAGADLSGANLAGADLRNAV 216


>gi|301029833|ref|ZP_07192875.1| pentapeptide repeat protein [Escherichia coli MS 196-1]
 gi|126812|sp|P05530.1|MCBG_ECOLX RecName: Full=Protein McbG
 gi|41983|emb|CAA30724.1| unnamed protein product [Escherichia coli]
 gi|299877321|gb|EFI85532.1| pentapeptide repeat protein [Escherichia coli MS 196-1]
          Length = 187

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 30/57 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDE 167
             F S  L+K++ +   FR   F   D+R+SDF+GS+FN      +     +F++ E
Sbjct: 97  VDFISLRLQKSIFLSCRFRDCLFEETDLRKSDFTGSEFNNTEFRHSDLSHCDFSMTE 153


>gi|119512769|ref|ZP_01631839.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
 gi|119462587|gb|EAW43554.1| Pentapeptide repeat protein [Nodularia spumigena CCY9414]
          Length = 268

 Score = 37.0 bits (84), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 23/37 (62%)

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           N   AN  +AD+ E++   ++ NGAYL KA  YKAN 
Sbjct: 160 NLIEANLINADLSEANLYEAQLNGAYLYKANFYKANL 196


>gi|409990095|ref|ZP_11273525.1| pentapeptide repeat-containing protein, partial [Arthrospira
           platensis str. Paraca]
 gi|409939047|gb|EKN80281.1| pentapeptide repeat-containing protein, partial [Arthrospira
           platensis str. Paraca]
          Length = 220

 Score = 37.0 bits (84), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 48/112 (42%), Gaps = 5/112 (4%)

Query: 56  NQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIG---SAAQ 112
           N+    YA+    R F   +L AA+    + N   L+  N  EA       IG   S +Q
Sbjct: 10  NKLLTRYAQ--GERNFSDISLVAAIFNEVTLNRINLSGANLAEALMVHTRLIGANLSRSQ 67

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
              ADL  AV +  N   A+ T   + ++D SG+  +GA L +      N T
Sbjct: 68  LSYADLSMAVLIDANLTGASMTETVLHQADLSGASLSGAILSQVNLTGVNLT 119


>gi|409989360|ref|ZP_11272974.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
 gi|409939778|gb|EKN80828.1| pentapeptide repeat-containing protein [Arthrospira platensis str.
           Paraca]
          Length = 333

 Score = 37.0 bits (84), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 39/87 (44%), Gaps = 7/87 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F+   L  A +  C    + L++ N   A  RG       A    A+LR A     N   
Sbjct: 242 FIKANLMKADLEECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 294

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
           AN  +AD+R++ F  +  NGA L+ A+
Sbjct: 295 ANLENADLRDASFRHATLNGAMLQDAI 321


>gi|443478905|ref|ZP_21068593.1| serine/threonine protein kinase with pentapeptide repeats
           [Pseudanabaena biceps PCC 7429]
 gi|443015732|gb|ELS30565.1| serine/threonine protein kinase with pentapeptide repeats
           [Pseudanabaena biceps PCC 7429]
          Length = 545

 Score = 37.0 bits (84), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
           ADL  A  +  + R AN  SA M ++D SG+  +GA L+ A   +AN     +C
Sbjct: 459 ADLGSASMILADMREANLQSAYMSKADLSGANLSGANLKGAYLSQANLNGTNLC 512


>gi|425458309|ref|ZP_18837797.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9808]
 gi|389827863|emb|CCI20729.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 9808]
          Length = 220

 Score = 37.0 bits (84), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF    +
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFRGADLNYANLSGSLLYRANFADCRL 167

Query: 169 C 169
           C
Sbjct: 168 C 168


>gi|39997499|ref|NP_953450.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
           PCA]
 gi|39984390|gb|AAR35777.1| pentapeptide repeat domain protein [Geobacter sulfurreducens PCA]
          Length = 254

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 37/73 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
           A+F  A+L  A   K N  + NF+ A++  ++FSG+K   A L  AV    NF+  ++  
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176

Query: 171 PLLVSLPMATPVF 183
             L SL +    F
Sbjct: 177 TDLGSLDLEGANF 189


>gi|409912856|ref|YP_006891321.1| pentapeptide repeat-containing protein [Geobacter sulfurreducens
           KN400]
 gi|298506440|gb|ADI85163.1| pentapeptide repeat domain protein [Geobacter sulfurreducens KN400]
          Length = 259

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 37/73 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
           A+F  A+L  A   K N  + NF+ A++  ++FSG+K   A L  AV    NF+  ++  
Sbjct: 117 AKFVGANLSGADMRKVNVEKGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSA 176

Query: 171 PLLVSLPMATPVF 183
             L SL +    F
Sbjct: 177 TDLGSLDLEGANF 189


>gi|67924929|ref|ZP_00518320.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
 gi|67853235|gb|EAM48603.1| Pentapeptide repeat [Crocosphaera watsonii WH 8501]
          Length = 366

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 5/70 (7%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
           A +    +L  A     NFR AN T  D+ E     S FSG+  +GAYL  A   +A+F 
Sbjct: 246 ATELSGIELSGANLTHSNFRGANLTDVDLSEAILSYSRFSGADLSGAYLGNANLQQADFY 305

Query: 165 VDEICLPLLV 174
              + L  L+
Sbjct: 306 RSSLALANLI 315


>gi|113476307|ref|YP_722368.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110167355|gb|ABG51895.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 225

 Score = 37.0 bits (84), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 27/53 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           A F   +L+ A     N    NF  AD+  ++ SG+   GA LEKA  Y+A+ 
Sbjct: 52  ANFHDINLKNANMSGANLTGVNFQGADLNGANLSGANLTGANLEKANLYRADI 104


>gi|428314067|ref|YP_007125044.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428255679|gb|AFZ21638.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 745

 Score = 37.0 bits (84), Expect = 4.3,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 33/56 (58%), Gaps = 5/56 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S+A+  SADLR+ V        A+ T AD+ E+ F+ +  +GA L K  A +++FT
Sbjct: 564 SSAKLISADLRQGV-----LENASLTGADLGEAKFARANLHGARLGKVKAVRSDFT 614


>gi|428307960|ref|YP_007144785.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428249495|gb|AFZ15275.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 201

 Score = 37.0 bits (84), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 11/84 (13%)

Query: 84  CSSNISALADL---NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRE 140
           C +N+   ADL   + +EA   G   IG  A+   ADL  A     NFR AN   AD+ E
Sbjct: 93  CEANLGG-ADLIEADLFEANLTGANLIG--AKLIGADLTGA-----NFREANLMGADLFE 144

Query: 141 SDFSGSKFNGAYLEKAVAYKANFT 164
           ++ SG+  +GA L  A    AN +
Sbjct: 145 ANLSGANLSGANLSGANLTLANLS 168


>gi|416406325|ref|ZP_11688097.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
 gi|357261078|gb|EHJ10386.1| Pentapeptide repeat [Crocosphaera watsonii WH 0003]
          Length = 366

 Score = 37.0 bits (84), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 5/70 (7%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
           A +    +L  A     NFR AN T  D+ E     S FSG+  +GAYL  A   +A+F 
Sbjct: 246 ATELSGIELSGANLTHSNFRGANLTDVDLSEAILSYSRFSGADLSGAYLGNANLQQADFY 305

Query: 165 VDEICLPLLV 174
              + L  L+
Sbjct: 306 RSSLALANLI 315


>gi|218440380|ref|YP_002378709.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7424]
 gi|218173108|gb|ACK71841.1| pentapeptide repeat protein [Cyanothece sp. PCC 7424]
          Length = 206

 Score = 37.0 bits (84), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 1/54 (1%)

Query: 115 SADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +ADLR A+    +   AN   AD+R   F G+   GA L  A+  K N  +DEI
Sbjct: 127 NADLRGAIFEGTSLVNANLCFADLRRCQFDGANLEGATLTNAI-LKDNQKIDEI 179


>gi|381204405|ref|ZP_09911476.1| pentapeptide repeat-containing protein [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 135

 Score = 37.0 bits (84), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 31/61 (50%), Gaps = 3/61 (4%)

Query: 103 GEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKAN 162
           G+F     A     DLR+A       ++AN ++ADM E++ S +   GA LE A   +AN
Sbjct: 18  GDF---EGANLSGMDLRRANLSGAALKKANLSNADMTEANLSVADLTGAKLENAKLRQAN 74

Query: 163 F 163
            
Sbjct: 75  L 75


>gi|334119964|ref|ZP_08494048.1| serine/threonine protein kinase with pentapeptide repeats
           [Microcoleus vaginatus FGP-2]
 gi|333457605|gb|EGK86228.1| serine/threonine protein kinase with pentapeptide repeats
           [Microcoleus vaginatus FGP-2]
          Length = 543

 Score = 37.0 bits (84), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 29/60 (48%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
           AA    A+L  A  ++ N R AN T A    ++F G+ F GA L  A   KAN     +C
Sbjct: 450 AADLSGANLGHARLIQANLRDANLTEAYCSTANFEGADFRGADLTGAYLTKANLRGANLC 509


>gi|113477518|ref|YP_723579.1| pentapeptide repeat-containing protein [Trichodesmium erythraeum
           IMS101]
 gi|110168566|gb|ABG53106.1| pentapeptide repeat [Trichodesmium erythraeum IMS101]
          Length = 710

 Score = 37.0 bits (84), Expect = 4.4,   Method: Composition-based stats.
 Identities = 28/105 (26%), Positives = 52/105 (49%), Gaps = 10/105 (9%)

Query: 68  WRVFVSTA-LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSADLRKAVHVK 125
           +R  +S A +  + +   + + + L + N ++A  T   F   + A  GSADL KA   +
Sbjct: 510 FRATLSKAIMPGSTITQANFSSAKLIETNLHQANLTEATF---TGADLGSADLSKANLYR 566

Query: 126 ENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
            N  +       F  +D+RES++ G+  +GA   +A   KA+ ++
Sbjct: 567 ANLSKVKAEGTTFQLSDLRESNWQGANLSGANFSRANLKKADLSL 611


>gi|16330983|ref|NP_441711.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
 gi|383322725|ref|YP_005383578.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr. GT-I]
 gi|383325894|ref|YP_005386747.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-P]
 gi|383491778|ref|YP_005409454.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-N]
 gi|384437045|ref|YP_005651769.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803]
 gi|451815141|ref|YP_007451593.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
 gi|15214308|sp|P74297.1|SPKB_SYNY3 RecName: Full=Serine/threonine-protein kinase B
 gi|1653478|dbj|BAA18391.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
 gi|11022717|dbj|BAB17034.1| Ser/Thr protein kinase SpkB [Synechocystis sp. PCC 6803]
 gi|339274077|dbj|BAK50564.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803]
 gi|359272044|dbj|BAL29563.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr. GT-I]
 gi|359275214|dbj|BAL32732.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-N]
 gi|359278384|dbj|BAL35901.1| eukaryotic protein kinase [Synechocystis sp. PCC 6803 substr.
           PCC-P]
 gi|407961651|dbj|BAM54891.1| eukariotic protein kinase [Bacillus subtilis BEST7613]
 gi|451781110|gb|AGF52079.1| eukariotic protein kinase [Synechocystis sp. PCC 6803]
          Length = 574

 Score = 37.0 bits (84), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 40/94 (42%), Gaps = 7/94 (7%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
            V   LA A V   +   + L + N  +AE        + A FG A L+  +    N   
Sbjct: 456 LVGIVLAKAFVPGINCYQANLTNANFEQAEL-------TRADFGKARLKNVIFKGANLSD 508

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A F  AD+R +D  G+  NG   + A    ANF+
Sbjct: 509 AYFGYADLRGADLRGANLNGVNFKYANLQGANFS 542


>gi|376007502|ref|ZP_09784697.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
 gi|375324138|emb|CCE20450.1| Pentapeptide repeat protein [Arthrospira sp. PCC 8005]
          Length = 179

 Score = 37.0 bits (84), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 29/62 (46%)

Query: 102 RGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           RGE+        G AD+          R+AN T+A+M + DF+G+ F  + L  +     
Sbjct: 40  RGEYSSCQGCNLGGADMSNQSRRNAQLRQANLTNANMSDGDFTGAFFTCSNLSNSNLSGG 99

Query: 162 NF 163
           NF
Sbjct: 100 NF 101


>gi|332710048|ref|ZP_08430003.1| uncharacterized low-complexity protein [Moorea producens 3L]
 gi|332351191|gb|EGJ30776.1| uncharacterized low-complexity protein [Moorea producens 3L]
          Length = 739

 Score = 37.0 bits (84), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 29/53 (54%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + F SADL ++     N  RAN ++A+++  DF+ ++  GA L  A  Y A  
Sbjct: 608 SDFSSADLSQSSWQGANLSRANLSNANLKNVDFNSTQLVGANLRNAKLYNAKL 660


>gi|194336259|ref|YP_002018053.1| pentapeptide repeat-containing protein [Pelodictyon
           phaeoclathratiforme BU-1]
 gi|194308736|gb|ACF43436.1| pentapeptide repeat protein [Pelodictyon phaeoclathratiforme BU-1]
          Length = 180

 Score = 37.0 bits (84), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 29/129 (22%), Positives = 55/129 (42%), Gaps = 11/129 (8%)

Query: 35  WVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNISALADL 94
           ++ C  +S   S  +  DC    C    AKLKN      ++L      +C       +D 
Sbjct: 21  FIHCNFNSADLSGVRMIDCRFEGCDLSLAKLKN------SSLQKVKFVNCKLLGVLFSDC 74

Query: 95  NKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE 154
            K+  +   +  I   + F    L+    +  + + A+F+ AD+     SG+KF+G+ L 
Sbjct: 75  RKFMLDLDFDRCILKLSLFAGLKLKNTRFINCDLQEADFSEADL-----SGAKFDGSDLL 129

Query: 155 KAVAYKANF 163
           + + + +N 
Sbjct: 130 QTIFFHSNL 138


>gi|428226949|ref|YP_007111046.1| hypothetical protein GEI7407_3527 [Geitlerinema sp. PCC 7407]
 gi|427986850|gb|AFY67994.1| Tetratricopeptide TPR_1 repeat-containing protein [Geitlerinema sp.
           PCC 7407]
          Length = 575

 Score = 37.0 bits (84), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 50/123 (40%), Gaps = 21/123 (17%)

Query: 68  WRVFVSTALAAAVVASCSSNISALADLNKYEAETR---------GEFGIGS--------- 109
           WR   + AL  A +    + ++   +  K   ETR         G++G  +         
Sbjct: 15  WRSLAALALVVAPMVGTDAALAEKPEHRKQLLETRRCISCDLSNGDYGRANLSGFDLSNS 74

Query: 110 ---AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVD 166
               A F SADL++      N RRA+   AD+  +DF  +  NGA L  +    AN +  
Sbjct: 75  NLENADFESADLQRTDFSSANLRRADLERADLERADFQSAILNGADLSNSDLSYANLSNS 134

Query: 167 EIC 169
           ++ 
Sbjct: 135 DLS 137


>gi|425452623|ref|ZP_18832440.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 7941]
 gi|389765493|emb|CCI08619.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           aeruginosa PCC 7941]
          Length = 220

 Score = 37.0 bits (84), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +     SA+L++AV +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF    +
Sbjct: 108 NGVNLNSANLQQAVLIDTDFRSTSDQRTDLGKTNFCGADLNYANLSGSLLYRANFANCRL 167

Query: 169 C 169
           C
Sbjct: 168 C 168


>gi|172039549|ref|YP_001806050.1| rfrA pentapeptide repeat-containing protein [Cyanothece sp. ATCC
           51142]
 gi|354552189|ref|ZP_08971497.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
 gi|171701003|gb|ACB53984.1| rfrA family pentapeptide repeat [Cyanothece sp. ATCC 51142]
 gi|353555511|gb|EHC24899.1| pentapeptide repeat protein [Cyanothece sp. ATCC 51472]
          Length = 367

 Score = 37.0 bits (84), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 33/70 (47%), Gaps = 5/70 (7%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 164
             +    +L  A     NFR AN T AD+ E     + FSG+  +GAYL  A   +A+F 
Sbjct: 245 GTELSGIELSGANLTHSNFRGANLTDADLSEAILSYTRFSGADLSGAYLGNANLQQADFY 304

Query: 165 VDEICLPLLV 174
              + L  L+
Sbjct: 305 RSSLALANLI 314


>gi|427736321|ref|YP_007055865.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427371362|gb|AFY55318.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 642

 Score = 37.0 bits (84), Expect = 4.9,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 33/68 (48%), Gaps = 9/68 (13%)

Query: 110 AAQFGSADLRKAVHVKENFRRA---------NFTSADMRESDFSGSKFNGAYLEKAVAYK 160
            A F  A+L+K V +  N   A         NF  AD+ ++DFS +K   A L+KA   K
Sbjct: 421 GANFCKANLKKTVFIAANLTEAIESEEVIVTNFEEADLEKADFSCAKLIRANLQKANLVK 480

Query: 161 ANFTVDEI 168
           AN    ++
Sbjct: 481 ANLKAADL 488


>gi|300868113|ref|ZP_07112748.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
 gi|300333887|emb|CBN57928.1| pentapeptide repeat-containing protein [Oscillatoria sp. PCC 6506]
          Length = 169

 Score = 37.0 bits (84), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 30/54 (55%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           AQF  A+LR +     N +  +F +A+M +++F G+   GA L+ A   K N T
Sbjct: 57  AQFTKANLRNSNFSNANLQGVSFFAANMEDANFEGANLRGATLDLARMIKVNLT 110


>gi|220910319|ref|YP_002485630.1| pentapeptide repeat-containing protein [Cyanothece sp. PCC 7425]
 gi|219866930|gb|ACL47269.1| pentapeptide repeat protein [Cyanothece sp. PCC 7425]
          Length = 165

 Score = 37.0 bits (84), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 35/66 (53%), Gaps = 3/66 (4%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF---TV 165
           + AQ   ADLRKAV    N   AN   A +R ++F+G+  +GA L  A A ++      +
Sbjct: 93  TGAQLPKADLRKAVLSGANLAGANLRDAKLRGANFAGADLHGADLFGAEALRSELLEGIL 152

Query: 166 DEICLP 171
           ++  +P
Sbjct: 153 NQTIMP 158


>gi|33866170|ref|NP_897729.1| hypothetical protein SYNW1636 [Synechococcus sp. WH 8102]
 gi|33639145|emb|CAE08151.1| conserved hypothetical protein [Synechococcus sp. WH 8102]
          Length = 171

 Score = 36.6 bits (83), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 5/68 (7%)

Query: 107 IGSAAQFGSADLRKAVHVKENFRRANFTSAD-----MRESDFSGSKFNGAYLEKAVAYKA 161
           +G  A F  ADL  A+  +  F  A+F+ AD     M  +DF+G+    A L   +A  +
Sbjct: 66  VGRGANFSGADLHGAIFTQGAFAEADFSGADLSDALMDRADFAGTNLRDAVLTGIIASGS 125

Query: 162 NFTVDEIC 169
           +F+  +I 
Sbjct: 126 SFSDAQIA 133


>gi|145356542|ref|XP_001422487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144582730|gb|ABP00804.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 114

 Score = 36.6 bits (83), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 29/56 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S+  F  ADLR A     N R A        E DF+G+  + A +++AV  KANFT
Sbjct: 1   SSQNFTGADLRFAKLRGANLRGAYMMKMVAPEVDFTGADMSDALMDRAVLVKANFT 56


>gi|406987204|gb|EKE07615.1| hypothetical protein ACD_18C00027G0001, partial [uncultured
           bacterium]
          Length = 406

 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 28/63 (44%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +   F ++DLR       N    NFT++D+R +DF G+ F G   E      A F     
Sbjct: 337 TLTNFTNSDLRNVNFRDANLTWTNFTNSDLRNADFRGASFTGTIFENTNLEGAKFDKKNK 396

Query: 169 CLP 171
            LP
Sbjct: 397 NLP 399


>gi|186684326|ref|YP_001867522.1| pentapeptide repeat-containing protein [Nostoc punctiforme PCC
           73102]
 gi|186466778|gb|ACC82579.1| pentapeptide repeat protein [Nostoc punctiforme PCC 73102]
          Length = 413

 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 31/60 (51%), Gaps = 5/60 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFS-----GSKFNGAYLEKAVAYKANF 163
           S A    A+L KA+ V  N +  NFT A++ E+D S     GS F  A L KA   +AN 
Sbjct: 216 SNADLTEANLSKAIFVGANLQWVNFTQANLSEADLSITNLCGSVFYEANLSKATLPEANL 275


>gi|189499620|ref|YP_001959090.1| pentapeptide repeat-containing protein [Chlorobium phaeobacteroides
           BS1]
 gi|189495061|gb|ACE03609.1| pentapeptide repeat protein [Chlorobium phaeobacteroides BS1]
          Length = 300

 Score = 36.6 bits (83), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 30/55 (54%), Gaps = 1/55 (1%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSK-FNGAYLEKAVAYKANF 163
           AA F  ADLR A   + N R A+ T AD+R +  S S+   G+ L  A+ + AN 
Sbjct: 200 AANFSGADLRDADLSEVNLRNADLTGADLRGARLSFSQNMTGSTLNNAILHSANL 254


>gi|427734465|ref|YP_007054009.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427369506|gb|AFY53462.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 269

 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 28/56 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A  G ADLR+A     N   AN T A +  ++ SGS  +GA L  A    AN T
Sbjct: 68  SEADLGEADLREANLKGANLTGANLTGATLMNANLSGSNLSGACLSGAKLSGANLT 123


>gi|329850490|ref|ZP_08265335.1| pentapeptide repeat 8 copies family protein [Asticcacaulis
           biprosthecum C19]
 gi|328840805|gb|EGF90376.1| pentapeptide repeat 8 copies family protein [Asticcacaulis
           biprosthecum C19]
          Length = 163

 Score = 36.6 bits (83), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 30/63 (47%), Gaps = 7/63 (11%)

Query: 104 EFGIG--SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
           + GIG  S   F  ADLR        F RA+F  ADM  ++F      GAYLE A    A
Sbjct: 64  DLGIGIFSGTNFSGADLRDVNGSAALFGRASFAGADMTNANFV-----GAYLEHANFRGA 118

Query: 162 NFT 164
           N T
Sbjct: 119 NLT 121


>gi|428224795|ref|YP_007108892.1| pentapeptide repeat-containing protein [Geitlerinema sp. PCC 7407]
 gi|427984696|gb|AFY65840.1| pentapeptide repeat protein [Geitlerinema sp. PCC 7407]
          Length = 284

 Score = 36.6 bits (83), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 26/52 (50%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKA 161
            A    ADL +A     N RRANFT+A MR +    S   GA + +   Y+A
Sbjct: 184 GANLSDADLTRANLGSTNLRRANFTNAKMRGASLIWSSLRGAKMIRVNLYRA 235


>gi|443475902|ref|ZP_21065833.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
 gi|443019187|gb|ELS33316.1| pentapeptide repeat protein [Pseudanabaena biceps PCC 7429]
          Length = 133

 Score = 36.6 bits (83), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 8/111 (7%)

Query: 79  AVVASCSSNISALADLNKYEA--ETRGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTS 135
           A + S S+  SALAD N+     +TR   G   S  +   A+LR A     N R AN  S
Sbjct: 16  AAITSISAIESALADPNQIRQVLQTRECAGCNLSREKLSFANLRGA-----NLRNANLFS 70

Query: 136 ADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLPLLVSLPMATPVFPAG 186
           AD++ +D   +   GA L+KA    A+ T  ++    +    +   + P G
Sbjct: 71  ADLKLADLREANLIGAILDKADLRGADLTGADLTGAYMSETNLCGAIMPDG 121


>gi|428309499|ref|YP_007120476.1| low-complexity protein [Microcoleus sp. PCC 7113]
 gi|428251111|gb|AFZ17070.1| putative low-complexity protein [Microcoleus sp. PCC 7113]
          Length = 166

 Score = 36.6 bits (83), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 52/125 (41%), Gaps = 22/125 (17%)

Query: 72  VSTALAAAVVASCSSNISALADLNKY--------EAETRGEFGIGSAAQFGSADLRKAVH 123
           ++T L A +V  C   + ALA   KY         AE +G+        F    LR A  
Sbjct: 6   LATFLLALIVWCCP--LPALAQATKYYPPPLSYSNAELKGK-------DFSGQTLRSAEF 56

Query: 124 VKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDEICLPLLVSLPM 178
              N  R NFT AD+R + FS S       +GA L  A+  + +FT  ++   +L    M
Sbjct: 57  SNANLERTNFTDADLRGTIFSASVMTHANLHGADLSNAMIDQVSFTNADLSDAVLTESIM 116

Query: 179 ATPVF 183
               F
Sbjct: 117 LRSTF 121


>gi|428222472|ref|YP_007106642.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
 gi|427995812|gb|AFY74507.1| putative low-complexity protein [Synechococcus sp. PCC 7502]
          Length = 340

 Score = 36.6 bits (83), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 12/109 (11%)

Query: 66  KNWR--VFVSTA------LAAAVVASCSSNISALADLNKYEAE-TRGEFGIGSAAQFGSA 116
            NWR  VF S        L+AA ++S + +++ L  +N   A  ++      S A  G A
Sbjct: 18  NNWRSEVFRSKIDLSYADLSAATLSSINLSLANLRSINLSRANLSKANL---SGAILGKA 74

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
           +L +A  +  N   ANF  AD+  +  S S  + A L  AVA  ANF +
Sbjct: 75  NLTEASLINANLSMANFIMADLSGAYLSESNLSRANLGNAVAIAANFIM 123


>gi|158341584|ref|YP_001522748.1| pentapeptide repeat-containing protein [Acaryochloris marina
           MBIC11017]
 gi|158311825|gb|ABW33434.1| pentapeptide repeat protein [Acaryochloris marina MBIC11017]
          Length = 521

 Score = 36.6 bits (83), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           A  G ADL  A     N  RANF  A ++E+D + +  +GA+L  A    AN +
Sbjct: 88  AYLGGADLYSANLRGANLIRANFNDAHLKEADLTNANLSGAHLRGANLLNANLS 141


>gi|427719675|ref|YP_007067669.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 7507]
 gi|427352111|gb|AFY34835.1| pentapeptide repeat protein [Calothrix sp. PCC 7507]
          Length = 291

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 49/102 (48%), Gaps = 8/102 (7%)

Query: 63  AKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAV 122
           AKL    +  +  +AA ++A+       L++ N YEAE  G +     A    A+L KA 
Sbjct: 172 AKLMRANLSFANLIAANLIATD------LSEANLYEAEVMGAYLY--QADLYKANLSKAH 223

Query: 123 HVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
                  RAN T AD+R ++ S S  +GA L  A   +AN T
Sbjct: 224 LSSAYLFRANLTKADLRGANLSWSNLSGANLAGADLCRANLT 265


>gi|83309857|ref|YP_420121.1| hypothetical protein amb0758 [Magnetospirillum magneticum AMB-1]
 gi|82944698|dbj|BAE49562.1| Uncharacterized low-complexity protein [Magnetospirillum magneticum
           AMB-1]
          Length = 164

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 45/103 (43%), Gaps = 23/103 (22%)

Query: 63  AKLKNWRV-FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKA 121
           AK+  +RV F+S AL  A                      R + G+ S A F  ADL  A
Sbjct: 69  AKVDGYRVRFISAALVGA----------------------RLDDGVFSEADFTKADLGGA 106

Query: 122 VHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
              + + RRA F  A +R +D +G++  GA L  A    A +T
Sbjct: 107 SLARADLRRARFYHASLRGADLTGARTLGAELLNADLSGARWT 149


>gi|374293141|ref|YP_005040176.1| hypothetical protein AZOLI_2775 [Azospirillum lipoferum 4B]
 gi|357425080|emb|CBS87961.1| Conserved protein of unknown function; pentapeptide repeat domains
           [Azospirillum lipoferum 4B]
          Length = 425

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           AA F +  L  A   + + R ANF+ AD+R +D +GS   GA L+ A
Sbjct: 166 AADFTNTRLAGARLDRTDLRDANFSGADLRGADLNGSDLRGAILDGA 212


>gi|218711080|ref|YP_002418700.1| Microcin immunity mcbG [Escherichia coli ED1a]
 gi|218349863|emb|CAQ87265.1| Microcin immunity mcbG [Escherichia coli ED1a]
          Length = 187

 Score = 36.6 bits (83), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 32/142 (22%), Positives = 54/142 (38%), Gaps = 10/142 (7%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWRVFVSTALAAAVVASCSSNIS 89
           LS   +  C        + +F DC   +C      +KN +      L    +  C     
Sbjct: 18  LSGVNYYNCIFERIQLDNFKFRDCEFEKCRFVNCSIKNLK------LNFFKLIDCEFKDC 71

Query: 90  ALADLNKYEAETRGEFGIGSA----AQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
            L  +N  +      F + S       F    L+K++ +  +FR   F   D+R+SDF+G
Sbjct: 72  LLQGVNAADIMFPCTFSLVSCDLRFVDFIGLRLQKSIFLSSHFRDCLFEETDLRKSDFTG 131

Query: 146 SKFNGAYLEKAVAYKANFTVDE 167
           S FN      +     +F++ E
Sbjct: 132 SAFNNTEFRHSDLSHCDFSMTE 153


>gi|319793574|ref|YP_004155214.1| pentapeptide repeat-containing protein [Variovorax paradoxus EPS]
 gi|315596037|gb|ADU37103.1| pentapeptide repeat protein [Variovorax paradoxus EPS]
          Length = 372

 Score = 36.6 bits (83), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 35/72 (48%)

Query: 112 QFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
            F   DL   + ++ +F  AN + A+++ S F+    +GA LE+AV  + NF   + C  
Sbjct: 25  DFRGLDLSGGMFIESDFTGANMSGANLKGSIFATCGLSGATLERAVLDRCNFHRVDACAA 84

Query: 172 LLVSLPMATPVF 183
           ++    M    F
Sbjct: 85  MMAGATMHGTSF 96


>gi|298241513|ref|ZP_06965320.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
 gi|297554567|gb|EFH88431.1| pentapeptide repeat protein [Ktedonobacter racemifer DSM 44963]
          Length = 413

 Score = 36.6 bits (83), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 31/59 (52%), Gaps = 2/59 (3%)

Query: 98  EAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           E E +G    GS  Q   ADLRKA     +F  AN   AD+ +++  G+ F GA LE A
Sbjct: 256 EVEAKGANFTGS--QLAGADLRKANLQGASFLGANLRGADLSQANLEGAVFVGAQLEGA 312


>gi|153871558|ref|ZP_02000700.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
 gi|152071976|gb|EDN69300.1| pentapeptide repeat family protein [Beggiatoa sp. PS]
          Length = 179

 Score = 36.6 bits (83), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    ADL + +    N   AN  SAD+ E+D SG+  +GA L +    +AN 
Sbjct: 104 SGADLRWADLYRTILNDANLSYANLCSADLSEADLSGANLSGANLSRVDLSEANL 158


>gi|418068212|ref|ZP_12705516.1| pentapeptide repeat protein, partial [Geobacter metallireducens
           RCH3]
 gi|373557348|gb|EHP83782.1| pentapeptide repeat protein, partial [Geobacter metallireducens
           RCH3]
          Length = 153

 Score = 36.6 bits (83), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 37/75 (49%), Gaps = 5/75 (6%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           + A F  AD++K      N  + NFT A++  + F+G+K   A  + AV   A+F+  ++
Sbjct: 12  NGANFTGADMKKV-----NIEKGNFTDANLTNASFTGAKLRYATFKGAVLKGADFSFADL 66

Query: 169 CLPLLVSLPMATPVF 183
               L SL +    F
Sbjct: 67  SYTDLSSLDLGGANF 81


>gi|434397472|ref|YP_007131476.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
 gi|428268569|gb|AFZ34510.1| pentapeptide repeat protein [Stanieria cyanosphaera PCC 7437]
          Length = 455

 Score = 36.6 bits (83), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           ADL +A   K NF+RAN T AD  E++   + F GA L +A
Sbjct: 189 ADLSEANLNKANFQRANLTEADFVEANLVQTNFKGANLSRA 229


>gi|108762763|ref|YP_635370.1| pentapeptide repeat-containing protein [Myxococcus xanthus DK 1622]
 gi|108466643|gb|ABF91828.1| pentapeptide repeat domain protein [Myxococcus xanthus DK 1622]
          Length = 203

 Score = 36.6 bits (83), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 21/55 (38%), Positives = 28/55 (50%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           S A    A LR A+  + N  RA+F  AD++ +D  G+   GAYL  A    AN 
Sbjct: 87  SKANLDYALLRGAILTQVNALRASFGEADLQGADLQGADLQGAYLVSANLASANL 141


>gi|428214178|ref|YP_007087322.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
 gi|428002559|gb|AFY83402.1| putative low-complexity protein [Oscillatoria acuminata PCC 6304]
          Length = 346

 Score = 36.6 bits (83), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 27/98 (27%), Positives = 47/98 (47%), Gaps = 2/98 (2%)

Query: 67  NWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKE 126
           NW       L+ A +A+   + + L+  N   A+    + IG+     S DLR+A     
Sbjct: 95  NWADLSGANLSGANLANADVSGANLSGANLSGAKLNQTYLIGT--NLKSVDLREANLSLA 152

Query: 127 NFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           +  +A+ T A++R++D +G+K   + L  A    AN T
Sbjct: 153 SLNKADLTKANLRQADLTGAKLKQSNLNLADLTHANLT 190


>gi|254456441|ref|ZP_05069870.1| Pentapeptide repeat protein [Candidatus Pelagibacter sp. HTCC7211]
 gi|207083443|gb|EDZ60869.1| Pentapeptide repeat protein [Candidatus Pelagibacter sp. HTCC7211]
          Length = 169

 Score = 36.6 bits (83), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 32/64 (50%)

Query: 105 FGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           FG    + F  A+L + V +  NF + NF+ +++   DF G+    A  + +   +ANFT
Sbjct: 74  FGTFPESTFVRANLYETVSIGANFEKTNFSGSNLTRVDFMGATLIEANFQNSNLMEANFT 133

Query: 165 VDEI 168
              I
Sbjct: 134 SSNI 137


>gi|87311950|ref|ZP_01094060.1| hypothetical protein DSM3645_13340 [Blastopirellula marina DSM
           3645]
 gi|87285312|gb|EAQ77236.1| hypothetical protein DSM3645_13340 [Blastopirellula marina DSM
           3645]
          Length = 586

 Score = 36.6 bits (83), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 28/55 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTV 165
           AQ    DLR+    + NF+  NF  AD+  SDF+G++   A L    A  A + +
Sbjct: 128 AQLPGCDLREVSGKQANFQDVNFARADLSRSDFTGAQLAEADLSGVTAVAAQWKL 182


>gi|417305110|ref|ZP_12092092.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica WH47]
 gi|327538543|gb|EGF25205.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica WH47]
          Length = 349

 Score = 36.6 bits (83), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           A F S +LR+A     NFR AN +++ ++ SD  G+ F GA LE A
Sbjct: 244 ADFRSCNLRQA-----NFRDANLSNSKLQRSDLQGANFTGADLEGA 284


>gi|428307622|ref|YP_007144447.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
 gi|428249157|gb|AFZ14937.1| endoribonuclease L-PSP [Crinalium epipsammum PCC 9333]
          Length = 378

 Score = 36.2 bits (82), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 20/56 (35%), Positives = 29/56 (51%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFT 164
           S A    A L+ A  ++ N   A+ + AD+R +D SG+    A L KA   +AN T
Sbjct: 43  SNADLSRASLKDAKLIRVNLSNADLSWADLRGADLSGANLENANLSKASLDQANLT 98


>gi|428303610|ref|YP_007113059.1| pentapeptide repeat-containing protein [Calothrix sp. PCC 6303]
 gi|428238815|gb|AFZ04603.1| pentapeptide repeat protein [Calothrix sp. PCC 6303]
          Length = 490

 Score = 36.2 bits (82), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 19/54 (35%), Positives = 29/54 (53%)

Query: 116 ADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEIC 169
           ADL +A   K + R A    AD+RE++  G+   GA L  A  + A+ T  ++C
Sbjct: 182 ADLERASFKKADLRNAILEGADLREANLEGADLRGADLRGANLWGADLTGVDLC 235


>gi|440713213|ref|ZP_20893815.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica SWK14]
 gi|436442020|gb|ELP35204.1| oxidoreductase molybdopterin binding protein [Rhodopirellula
           baltica SWK14]
          Length = 365

 Score = 36.2 bits (82), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 20/46 (43%), Positives = 28/46 (60%), Gaps = 5/46 (10%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           A F S +LR+A     NFR AN +++ ++ SD  G+ F GA LE A
Sbjct: 260 ADFRSCNLRQA-----NFRDANLSNSKLQRSDLQGANFTGADLEGA 300


>gi|428305676|ref|YP_007142501.1| pentapeptide repeat-containing protein [Crinalium epipsammum PCC
           9333]
 gi|428247211|gb|AFZ12991.1| pentapeptide repeat protein [Crinalium epipsammum PCC 9333]
          Length = 330

 Score = 36.2 bits (82), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 24/46 (52%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           A+   ADLR A     N   AN   AD+R++D SG+   GA L  A
Sbjct: 90  ARLQGADLRGADITLANLLDANLMEADLRDADLSGANLTGACLRGA 135


>gi|428181173|gb|EKX50038.1| hypothetical protein GUITHDRAFT_135709 [Guillardia theta CCMP2712]
          Length = 1263

 Score = 36.2 bits (82), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 24/50 (48%)

Query: 110 AAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAY 159
            A F   DL  A+    N R ANFT A +  +DFSGS   GA +     Y
Sbjct: 577 GADFSHCDLSFAMLQNCNLRGANFTGAKLTGTDFSGSDLEGAIMPDMEGY 626


>gi|427736744|ref|YP_007056288.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427371785|gb|AFY55741.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 443

 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 33/61 (54%), Gaps = 5/61 (8%)

Query: 109 SAAQFGSADLRKAVHVKEN-----FRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           ++ +F  ADLR+A  V  N     F  AN +  ++  +D SG+  +GAYL  A  Y A+ 
Sbjct: 319 TSTKFIGADLREANFVGANLDNVDFSNANLSGTNLSGADLSGADLSGAYLSGAYFYDADL 378

Query: 164 T 164
           +
Sbjct: 379 S 379


>gi|17230748|ref|NP_487296.1| hypothetical protein all3256 [Nostoc sp. PCC 7120]
 gi|17132351|dbj|BAB74955.1| all3256 [Nostoc sp. PCC 7120]
          Length = 268

 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 25/83 (30%), Positives = 35/83 (42%), Gaps = 30/83 (36%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKF---------------------- 148
           A F  A+LR A+  + N    +F+SAD+R++D +G+K                       
Sbjct: 114 ADFSGANLRGAIVTEANLIGTDFSSADLRDADLAGAKLIRSNLCFANLIAANFIAVDFSE 173

Query: 149 --------NGAYLEKAVAYKANF 163
                    GAYL KA  YKAN 
Sbjct: 174 ANLYQAEVMGAYLYKANFYKANL 196


>gi|166362955|ref|YP_001655228.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
 gi|166085328|dbj|BAG00036.1| hypothetical protein MAE_02140 [Microcystis aeruginosa NIES-843]
          Length = 186

 Score = 36.2 bits (82), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 21/82 (25%), Positives = 38/82 (46%), Gaps = 5/82 (6%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANF 163
           +   F   +L+ A     N + +NF+SAD+R + F+G+      F+GA L   +AY + F
Sbjct: 61  TGKDFSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTF 120

Query: 164 TVDEICLPLLVSLPMATPVFPA 185
              ++   +     M   +F  
Sbjct: 121 KNSDLSDAIFAEAIMLRTIFEG 142


>gi|422301609|ref|ZP_16388976.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9806]
 gi|389789327|emb|CCI14609.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9806]
          Length = 169

 Score = 36.2 bits (82), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
           F   +L+ A     N + +NF+SAD+R + F+G+      F+GA L   +AY + F   +
Sbjct: 48  FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107

Query: 168 ICLPLLVSLPMATPVFPA 185
           +   +     M   +F  
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125


>gi|254501374|ref|ZP_05113525.1| Pentapeptide repeat protein [Labrenzia alexandrii DFL-11]
 gi|222437445|gb|EEE44124.1| Pentapeptide repeat protein [Labrenzia alexandrii DFL-11]
          Length = 296

 Score = 36.2 bits (82), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 26/47 (55%)

Query: 117 DLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           DL++AV  + NF R++F   +   +DFS S F GA +      K+N 
Sbjct: 92  DLKEAVMPRSNFERSDFRRTEAERADFSASDFAGASMRAVDLEKSNL 138


>gi|209525619|ref|ZP_03274157.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|423065193|ref|ZP_17053983.1| hypothetical protein SPLC1_S230580 [Arthrospira platensis C1]
 gi|209493952|gb|EDZ94269.1| pentapeptide repeat protein [Arthrospira maxima CS-328]
 gi|406713325|gb|EKD08496.1| hypothetical protein SPLC1_S230580 [Arthrospira platensis C1]
          Length = 333

 Score = 36.2 bits (82), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 7/87 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F+   L  A +  C    + L++ N   A  RG       A    A+LR A     N   
Sbjct: 242 FIKANLMKADLQECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 294

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
           AN  +AD+R++ F  +  NGA L  A+
Sbjct: 295 ANLENADLRDASFRDATLNGAILNGAI 321


>gi|443666115|ref|ZP_21133744.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159030126|emb|CAO91018.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443331286|gb|ELS45952.1| pentapeptide repeats family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 169

 Score = 36.2 bits (82), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
           F   +L+ A     N + +NF+SAD+R + F+G+      F+GA L   +AY + F   +
Sbjct: 48  FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107

Query: 168 ICLPLLVSLPMATPVFPA 185
           +   +     M   +F  
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125


>gi|68171987|ref|ZP_00545289.1| Pentapeptide repeat [Ehrlichia chaffeensis str. Sapulpa]
 gi|67998589|gb|EAM85340.1| Pentapeptide repeat [Ehrlichia chaffeensis str. Sapulpa]
          Length = 435

 Score = 36.2 bits (82), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 102 RGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
           + EFG   S A F   DLR +V    N   ANFT A++  S F  S   GA    A   K
Sbjct: 83  KKEFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSK 142

Query: 161 AN 162
           +N
Sbjct: 143 SN 144


>gi|390438199|ref|ZP_10226689.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
 gi|425441109|ref|ZP_18821396.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9717]
 gi|425454770|ref|ZP_18834496.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9807]
 gi|425466166|ref|ZP_18845469.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9809]
 gi|425468563|ref|ZP_18847571.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9701]
 gi|389718271|emb|CCH97753.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9717]
 gi|389804467|emb|CCI16499.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9807]
 gi|389831470|emb|CCI25816.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9809]
 gi|389838386|emb|CCI30813.1| conserved exported hypothetical protein [Microcystis sp. T1-4]
 gi|389884775|emb|CCI34954.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9701]
          Length = 169

 Score = 36.2 bits (82), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
           F   +L+ A     N + +NF+SAD+R + F+G+      F+GA L   +AY + F   +
Sbjct: 48  FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107

Query: 168 ICLPLLVSLPMATPVFPA 185
           +   +     M   +F  
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125


>gi|427739890|ref|YP_007059434.1| putative low-complexity protein [Rivularia sp. PCC 7116]
 gi|427374931|gb|AFY58887.1| putative low-complexity protein [Rivularia sp. PCC 7116]
          Length = 447

 Score = 35.8 bits (81), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 6/79 (7%)

Query: 92  ADLNKYEAETRGEFGIG-SAAQFGSADLRKAVHVKENFRRA-----NFTSADMRESDFSG 145
           A L+K      G  G     A    A+LR+A       ++A     NF  A +RE+D SG
Sbjct: 327 AKLDKARMHETGLIGANLQQANLNGANLRQANLNAARLQQAEVFFANFAEASLREADLSG 386

Query: 146 SKFNGAYLEKAVAYKANFT 164
           +   G   +KAV Y+ N +
Sbjct: 387 ANLMGTDFQKAVLYETNLS 405


>gi|170076886|ref|YP_001733524.1| pentapeptide repeat-containing protein [Synechococcus sp. PCC 7002]
 gi|169884555|gb|ACA98268.1| pentapeptide repeats protein [Synechococcus sp. PCC 7002]
          Length = 324

 Score = 35.8 bits (81), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 30/65 (46%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICL 170
           A+  S D R A   + +FR AN   A+   +D  G+KF GA L +     A F  D    
Sbjct: 260 AELVSTDFRHAQLQRADFRGANLWGANFARADLRGAKFQGAKLNQTNFQGAVFEFDPRTQ 319

Query: 171 PLLVS 175
            LL S
Sbjct: 320 TLLAS 324


>gi|75911046|ref|YP_325342.1| pentapeptide repeat-containing protein [Anabaena variabilis ATCC
           29413]
 gi|75704771|gb|ABA24447.1| Pentapeptide repeat protein [Anabaena variabilis ATCC 29413]
          Length = 576

 Score = 35.8 bits (81), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 28/51 (54%)

Query: 106 GIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKA 156
           GI S A    ADL  AV +  +F  AN  SA++  S+ SG+  NGA L  A
Sbjct: 475 GILSEADLTGADLSDAVLLGTDFSFANLNSANLSGSNLSGAILNGADLSSA 525


>gi|284008627|emb|CBA75237.1| conserved hypothetical protein [Arsenophonus nasoniae]
          Length = 823

 Score = 35.8 bits (81), Expect = 8.8,   Method: Composition-based stats.
 Identities = 19/58 (32%), Positives = 29/58 (50%)

Query: 111 AQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           A    ADL KA   K  F +AN T  D  ES+    KF+  ++++    + N  VD++
Sbjct: 748 ADLTDADLTKANCQKAKFSKANLTRTDFTESNLQDVKFSKHHIKREFLQEINVAVDKL 805


>gi|376004329|ref|ZP_09782046.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375327291|emb|CCE17799.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 340

 Score = 35.8 bits (81), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 7/87 (8%)

Query: 71  FVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRR 130
           F+   L  A +  C    + L++ N   A  RG       A    A+LR A     N   
Sbjct: 249 FIKANLMKADLQECDLRNADLSNTNLNLANLRG-------ADLTGANLRGAYLWGANLDG 301

Query: 131 ANFTSADMRESDFSGSKFNGAYLEKAV 157
           AN  +AD+R++ F  +  NGA L  A+
Sbjct: 302 ANLENADLRDASFRDATLNGAILNGAI 328


>gi|427702733|ref|YP_007045955.1| low-complexity protein [Cyanobium gracile PCC 6307]
 gi|427345901|gb|AFY28614.1| putative low-complexity protein [Cyanobium gracile PCC 6307]
          Length = 247

 Score = 35.8 bits (81), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 5/49 (10%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAV 157
           +AA F  ADLR A     NF  A+ T AD+R +   G++F+GA L + +
Sbjct: 187 TAADFRGADLRGA-----NFSGADLTQADLRGALLDGARFHGAVLSRTL 230


>gi|409437276|ref|ZP_11264395.1| putative pentapeptide repeat protein [Rhizobium mesoamericanum
           STM3625]
 gi|408751000|emb|CCM75551.1| putative pentapeptide repeat protein [Rhizobium mesoamericanum
           STM3625]
          Length = 234

 Score = 35.8 bits (81), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 5/61 (8%)

Query: 109 SAAQFGSADLRKAVHVKENFRR-----ANFTSADMRESDFSGSKFNGAYLEKAVAYKANF 163
           + AQ G+A+  K    +  F       A+F  A+++ ++F+G+K  GA  EKA   +ANF
Sbjct: 92  TGAQAGNANFSKIEAYRSGFESVFAEGASFAGAELQRANFNGAKLTGANFEKAELGRANF 151

Query: 164 T 164
           +
Sbjct: 152 S 152


>gi|425445790|ref|ZP_18825810.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
 gi|389734131|emb|CCI02174.1| conserved exported hypothetical protein [Microcystis aeruginosa PCC
           9443]
          Length = 169

 Score = 35.8 bits (81), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 113 FGSADLRKAVHVKENFRRANFTSADMRESDFSGS-----KFNGAYLEKAVAYKANFTVDE 167
           F   +L+ A     N + +NF+SAD+R + F+G+      F+GA L   +AY + F   +
Sbjct: 48  FSGQNLQSAQFTNVNLQDSNFSSADLRGAVFNGASIIEGNFHGADLTNGLAYLSTFKNSD 107

Query: 168 ICLPLLVSLPMATPVFPA 185
           +   +     M   +F  
Sbjct: 108 LSDAIFAEAIMLRTIFEG 125


>gi|334145352|ref|YP_004538562.1| pentapeptide repeat-containing protein [Novosphingobium sp. PP1Y]
 gi|333937236|emb|CCA90595.1| pentapeptide repeat-containing protein [Novosphingobium sp. PP1Y]
          Length = 228

 Score = 35.8 bits (81), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 53/131 (40%), Gaps = 26/131 (19%)

Query: 30  LSKPLWVACQISSKTESDGQFPDCSNNQCAGPYAKLKNWR----VFVSTALAAAVVASCS 85
           L    +VAC  ++ T            +C    A+L   R     F  T L  A++A  S
Sbjct: 85  LGDARFVACDFNNATFKRANLQSARFERCKLTGAELSELRGIDIAFEETLLVNAILAGHS 144

Query: 86  SNISALADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSG 145
                 A+L + +              F  ADLRK      +FR+A+FT   +RE+   G
Sbjct: 145 FR---RANLKRTD--------------FSQADLRKC-----DFRQAHFTECSLREASMEG 182

Query: 146 SKFNGAYLEKA 156
           ++F GA L  A
Sbjct: 183 ARFEGADLRGA 193


>gi|225631183|ref|ZP_03787884.1| pentapeptide repeat domain protein [Wolbachia endosymbiont of
           Muscidifurax uniraptor]
 gi|225591121|gb|EEH12302.1| pentapeptide repeat domain protein [Wolbachia endosymbiont of
           Muscidifurax uniraptor]
          Length = 601

 Score = 35.8 bits (81), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 17/43 (39%), Positives = 26/43 (60%), Gaps = 2/43 (4%)

Query: 129 RRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEICLP 171
           + AN   A+++ESDF+GS  + AYL  ++   +NF  DE  L 
Sbjct: 231 KNANIYGAELKESDFTGSNLSAAYLNSSIIINSNF--DETNLS 271


>gi|227498348|ref|ZP_03928498.1| pentapeptide repeat protein [Acidaminococcus sp. D21]
 gi|352685375|ref|YP_004897360.1| pentapeptide repeat-containing protein [Acidaminococcus intestini
           RyC-MR95]
 gi|226903810|gb|EEH89728.1| pentapeptide repeat protein [Acidaminococcus sp. D21]
 gi|350280030|gb|AEQ23220.1| pentapeptide repeat protein [Acidaminococcus intestini RyC-MR95]
          Length = 250

 Score = 35.8 bits (81), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 8/73 (10%)

Query: 95  NKYEAETRG---EFGIGSAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGA 151
           N Y A+ R    ++  G+ A F SA+L ++   + NF  ANFTSA++ +     ++F  A
Sbjct: 138 NLYTADLRESNFDYASGAMANFYSANLARSWFFRSNFMSANFTSANLYD-----ARFRRA 192

Query: 152 YLEKAVAYKANFT 164
            L +A+   AN T
Sbjct: 193 NLSEALLRSANLT 205


>gi|17232102|ref|NP_488650.1| hypothetical protein alr4610 [Nostoc sp. PCC 7120]
 gi|17133747|dbj|BAB76309.1| alr4610 [Nostoc sp. PCC 7120]
          Length = 164

 Score = 35.8 bits (81), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 62/132 (46%), Gaps = 24/132 (18%)

Query: 65  LKNWRVFVSTALAAAV-------VASCSSNISALADLNKYEAETRGEFGIGSAAQFGSAD 117
           +K+WRV VS  LA  +        A+ SS+I+  A       +  G+  IGS  +F + D
Sbjct: 1   MKDWRVVVSFVLAMVLFLFPGSAQAASSSSITRSAGDELKAKDFSGQSLIGS--EFTNVD 58

Query: 118 LRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLE-----KAVAYKANFTVDEICLPL 172
           L       EN   ANF++AD+R   F+G+   G  L      + +AY ANF   ++   +
Sbjct: 59  L-------EN---ANFSNADLRGGVFNGTVLEGVNLHGVDFSEGIAYLANFKNADLSDAI 108

Query: 173 LVSLPMATPVFP 184
           L +  M   +F 
Sbjct: 109 LTNAMMLRSIFD 120


>gi|88658408|ref|YP_507868.1| pentapeptide repeat-containing protein [Ehrlichia chaffeensis str.
           Arkansas]
 gi|88599865|gb|ABD45334.1| pentapeptide repeat protein [Ehrlichia chaffeensis str. Arkansas]
          Length = 607

 Score = 35.8 bits (81), Expect = 9.5,   Method: Composition-based stats.
 Identities = 24/62 (38%), Positives = 29/62 (46%), Gaps = 1/62 (1%)

Query: 102 RGEFGIG-SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYK 160
           + EFG   S A F   DLR +V    N   ANFT A++  S F  S   GA    A   K
Sbjct: 64  KKEFGNNLSGADFSDLDLRGSVFDNVNLLHANFTRANLSNSTFIDSNMQGASFINANLSK 123

Query: 161 AN 162
           +N
Sbjct: 124 SN 125


>gi|390442100|ref|ZP_10230118.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           sp. T1-4]
 gi|389834544|emb|CCI34244.1| Similar to tr|Q3MCS8|Q3MCS8_ANAVT Pentapeptide repeat [Microcystis
           sp. T1-4]
          Length = 220

 Score = 35.8 bits (81), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 17/61 (27%), Positives = 34/61 (55%)

Query: 109 SAAQFGSADLRKAVHVKENFRRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTVDEI 168
           +     SA+L++A+ +  +FR  +    D+ +++F G+  N A L  ++ Y+ANF    +
Sbjct: 108 NGVNLNSANLQQALLIDADFRSTSDQRTDLGKTNFRGADLNYANLSGSLLYRANFADCRL 167

Query: 169 C 169
           C
Sbjct: 168 C 168


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.130    0.394 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,018,727,694
Number of Sequences: 23463169
Number of extensions: 109804791
Number of successful extensions: 264004
Number of sequences better than 100.0: 437
Number of HSP's better than 100.0 without gapping: 254
Number of HSP's successfully gapped in prelim test: 183
Number of HSP's that attempted gapping in prelim test: 260489
Number of HSP's gapped (non-prelim): 3224
length of query: 198
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 63
effective length of database: 9,191,667,552
effective search space: 579075055776
effective search space used: 579075055776
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 73 (32.7 bits)